CT119-3-2-DMPM Data Pre-Processing
Data Pre Processing
1. Create a New Project ‘Exploratory Data Analysis’
2. Create a New Library ‘AAEM’
3. Create a New Data Source ‘PVA97NK’ from the SAS Table
4. Create a New Diagram ‘Pre-processing practice’
Perform Data exploration and identify the data quality issues such as Missing Values,
Outliers, Noisy data etc. [USE STATEXPLORE and GRAPHEXPLORE]
Use suitable Pre-processing methods from the Sample, Explore and Modify Tabs.
[USE IMPUTE, FILTER, TRANSFORM, FEATURE SELECTION]
Fill in the table below with your findings:
Data exploration - Findings Pre-processing Techniques Used
Missing value Impute
1) DemAge
2) GiftAvgcard36
High skewness Transform
1) GiftAvgAll
2) GiftAvgcard36
Noisy data Graph explore
1) TargetGiftAmount- has nosiy data
Data reduction Correlation plot : Pearson
GiftTimeFirst Stat explore
Share the process flow for this Lab activity.
Level X Asia Pacific University of Technology & Innovation Page # of #