- The process in Data Science Analysis
- Acquire Data
- Prepare Data
- Exploring Data
- correlation, outliers
- Preprocess (Data Cleansing + Data Transformation)
- Exploring Data
- Data Analysis
- Data Report
- Action/Decision
Data Cleaning- how to deal with:
Inconsistent values -
Duplicate records - merge/ remove
Missing values - remove/estimate (eg. interplotation)
Invalid data
Outliers
Data Wrangling/ Data Munging
Dimensionality Reduction
Data Manipulation
Transformation
Feature
Yeo-Johnson Transformation
Lejung-Box Transformatioin