Results for "Transformation"

6 / 23 posts

Search: Transformation

Filter by Category

PySpark

Spark Transformations, Actions and Lazy Evaluation and DAG.

Apache Spark RDD supports two types of Operations: Transformations Actions A Transformation is a function that produces new RDD from the existing RDDs but …

Apache Spark azure cloud
Match in titleMatch in contentMatch in code
Mar 19, 2026 4 min read
Machine learning

sklearn.preprocessing Complete Guide: Data Scaling, Encoding & Transformation in Python

Master sklearn.preprocessing from scratch to advanced level. Learn StandardScaler, MinMaxScaler, LabelEncoder, OneHotEncoder, and 30+ transformers with real-...

sklearn preprocessing scikit-learn preprocessing StandardScaler
Match in titleMatch in tagsMatch in content
Apr 25, 2026 15 min read
PySpark

What is Resilient Distributed Datasets (RDDs)

Resilient Distributed Datasets (RDDs) are a data structure of Apache Spark. They represent an immutable, distributed collection of objects that can be proc…

ai artificial-intelligence data-engineering
Match in contentMatch in code
Mar 19, 2026 3 min read
PySpark

Application,Job,Stage,Task in Spark

In Apache Spark, the execution of a program is broken down into multiple levels of granularity: applications, jobs, stages, and tasks. Understanding these co...

PySpark
Match in contentMatch in code
Mar 19, 2026 3 min read
Data Warehouse

What is the KDD (Knowledge Discovery in Databases)

In today's world, data is everywhere. Businesses, healthcare, banking, and even social media generate vast amounts of data every second. But raw data alone…

Cleaning the Data Data Integration data mining
Match in tagsMatch in content
Mar 19, 2026 3 min read
Machine learning

Categorical Data Handling in Machine Learning (Pandas + Sklearn) – Complete Practical Guide

Learn categorical data encoding end-to-end — Label, Ordinal, One-Hot, Target, Binary, Frequency encoding with Pandas & Sklearn. Beginner to advanced.

categorical encoding machine learning pandas
Match in contentMatch in code
Apr 24, 2026 28 min read