Results for "Transformation"
6 / 23 posts
Filter by Category
Spark Transformations, Actions and Lazy Evaluation and DAG.
Apache Spark RDD supports two types of Operations: Transformations Actions A Transformation is a function that produces new RDD from the existing RDDs but …
sklearn.preprocessing Complete Guide: Data Scaling, Encoding & Transformation in Python
Master sklearn.preprocessing from scratch to advanced level. Learn StandardScaler, MinMaxScaler, LabelEncoder, OneHotEncoder, and 30+ transformers with real-...
What is Resilient Distributed Datasets (RDDs)
Resilient Distributed Datasets (RDDs) are a data structure of Apache Spark. They represent an immutable, distributed collection of objects that can be proc…
Application,Job,Stage,Task in Spark
In Apache Spark, the execution of a program is broken down into multiple levels of granularity: applications, jobs, stages, and tasks. Understanding these co...
What is the KDD (Knowledge Discovery in Databases)
In today's world, data is everywhere. Businesses, healthcare, banking, and even social media generate vast amounts of data every second. But raw data alone…
Categorical Data Handling in Machine Learning (Pandas + Sklearn) – Complete Practical Guide
Learn categorical data encoding end-to-end — Label, Ordinal, One-Hot, Target, Binary, Frequency encoding with Pandas & Sklearn. Beginner to advanced.