Data Scaling 101: Standardization and Min-Max Scaling Explained | by Haden Pelletier | Aug, 2024

When to use MinMaxScaler vs StandardScaler vs something else

What is scaling?

When you first load a dataset into a Python script or notebook and look at its numerical features, you’ll likely notice that they are on different scales.

That is, each column (feature) covers a different range of values. For example, one feature may range from 0 to 1 while another ranges from 1000 to 10000.

Take the Wine Quality dataset from the UCI Machine Learning Repository (CC BY 4.0 license), for example.

A few features from the UCI Wine Quality dataset. Image by author
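The difference in ranges described above can be checked directly by computing each column's minimum and maximum. The snippet below is a minimal sketch using NumPy; the array values and the names `feature_a` and `feature_b` are made up for illustration and are not taken from the Wine Quality dataset.

```python
import numpy as np

# Made-up illustrative values (NOT rows from the Wine Quality dataset).
# Column 0 lies between 0 and 1; column 1 lies between 1000 and 10000.
X = np.array([
    [0.10, 1000.0],
    [0.50, 4000.0],
    [0.90, 10000.0],
])

# Per-column minimum, maximum and range (axis=0 reduces over rows).
col_min = X.min(axis=0)
col_max = X.max(axis=0)
col_range = col_max - col_min

# Hypothetical feature names, used only for printing.
for name, lo, hi in zip(["feature_a", "feature_b"], col_min, col_max):
    print(f"{name}: min={lo}, max={hi}, range={hi - lo}")
```

If the data lives in a pandas DataFrame instead, `df.describe()` reports the same per-column minimum and maximum.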

Scaling is the process of bringing all features onto a similar or identical range, for example by transforming them so that every value lies between 0 and 1.
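As a rough illustration of the two techniques named in the title, the sketch below applies min-max scaling and standardization to a small made-up array using plain NumPy. The helper names `min_max_scale` and `standardize` are hypothetical. The formulas are the standard ones, and to my understanding they match what scikit-learn's `MinMaxScaler` and `StandardScaler` compute with their default settings. The sketch assumes no column is constant, since a constant column would cause a division by zero.

```python
import numpy as np

# Made-up illustrative values (NOT rows from the Wine Quality dataset).
X = np.array([
    [0.10, 1000.0],
    [0.50, 4000.0],
    [0.90, 10000.0],
])

def min_max_scale(X):
    """Rescale each column to [0, 1]: (x - min) / (max - min).

    Assumes every column has max > min (no constant columns).
    """
    x_min = X.min(axis=0)
    x_max = X.max(axis=0)
    return (X - x_min) / (x_max - x_min)

def standardize(X):
    """Rescale each column to mean 0 and standard deviation 1: (x - mean) / std.

    Uses the population standard deviation (NumPy's default, ddof=0).
    Assumes every column has a non-zero standard deviation.
    """
    return (X - X.mean(axis=0)) / X.std(axis=0)

X_minmax = min_max_scale(X)
X_std = standardize(X)

print(X_minmax)
print(X_std)
```

After min-max scaling, both columns span exactly 0 to 1. After standardization, both columns have mean 0 and standard deviation 1, so neither dominates purely because of its original units.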

When (and why) you need to scale

There are a few reasons why scaling features before fitting/training a machine learning model is important:

Ensures that all features contribute equally to the model. When one feature has a large and…
