Machine Learning Assignment
Sreetam Ganguly
B.Tech Seventh Semester
September 7, 2021
The house prices dataset is downloaded and nan values are removed. The dataset is then split into
training and testing sets. The categorical data was encoded.
The training and test sets have the following sizes.
The Pearson Correlation heatmap is given below.
Then linear regression is performed. The scores are given below.
The resulting MSE is plotted below.
Using polynomial regression of degree two and three, we estimate the value of the
weights/coefficients for predicting 'SalePrice' based on 'LotArea' and have a graph output.