Zero - Variance Features
Instruction
Please ensure you update all the details:
Name: _________________________
Batch Id: _______________________
Topic: Data Pre-Processing
Variance measures how far a set of data is spread out. A variance of zero indicates that
all the data values are identical. There are various techniques to remove this for
transforming the data into the suitable one for prediction.
Problem statement:
Find which columns of the given dataset with zero variance, explore various techniques used to
remove the zero variance from the dataset to perform certain analysis.
© 2013 - 2021 360DigiTMG. All Rights Reserved.
Hints:
For each assignment, the solution should be submitted in the below format
1. Work on each feature of the dataset to create a data dictionary as displayed in the
below image:
2. Consider the Z_dataset.csv dataset
3. Research and perform all possible steps for obtaining solution
4. All the codes (executable programs) should execute without errors
5. Code modularization should be followed
6. Each line of code should have comments explaining the logic and why you are using that
function
Grading Guidelines:
Note: 1. An Assignment submission is considered complete only when successful executable code(s),
and documentation explaining the applied solution and results are provided. Failing to submit either
of them will be considered an invalid submission and will not be considered for evaluation.
2. Assignments submitted after the deadline date will affect your grades.
Grading:
Ans Date Ans Date
Correct On time A 100
80% & above On time B 85 Correct Late
50% & above On time C 75 80% & above Late
50% & below On time D 65 50% & above Late
E 55 50% & below
Copied/No Submission F 45
Grade A: (>= 90): When all assignments are submitted on or before the given deadline date
© 2013 - 2021 360DigiTMG. All Rights Reserved.
Grade B: (>= 80 and < 90):
o When assignments are submitted on time but less than 80% of questions asked in
assignments are completed. (or)
o All assignments were submitted, however, after the given deadline
Grade C: (>= 70 and < 80):
o When assignments are submitted on time but less than 50% of questions asked in
assignments are completed. (or)
o Less than 80% of questions asked in assignments are submitted after the deadline
Grade D: (>= 60 and < 70): Assignments submitted after the Deadline and with 50% or less of
questions
Grade E: (>= 50 and < 60):
o Less than 30% of questions asked in the assignments are submitted after the deadline
(OR)
o Less than 30% of questions asked in the assignments are submitted before deadline
Grade F: (< 50): Copied submission or No submission
© 2013 - 2021 360DigiTMG. All Rights Reserved.