Homework
Consider the Boston data set file. Based on the data set, answer the following using R:
1. How many variables are there in the data set? Is there any qualitative variable in the
data set?
2. The main variable of interest in this data set is “medv”, the median value of
owner-occupied homes in $1000s. Provide a brief summary of this variable.
3. How many towns are there in the data set with median value of the owner-occupied
homes over $40,000?
4. Provide a brief summary for the average number of rooms per dwelling (i.e., rm) for
all these towns (i.e., towns with median value of owner-occupied homes over
$40,000).
5. How many river-side towns are there in the data set?
6. Is the median value of owner-occupied homes significantly higher in river-side
towns?
7. Does per-capita crime rate have any influence on the median value of the owner-
occupied homes?
8. Is the per-capita crime rate higher in the river-side towns?
9. Comment on the relationship between the per-capita crime rate and the median value
of owner-occupied homes in river-side towns.
10. Prepare a brief summary report based on the data set.
(Note: Try to create appropriate plots wherever possible.)
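
A possible starting point in R is sketched below. It assumes the data set is the Boston data frame that ships with the MASS package; if you were given a separate file, load it with read.csv() instead. The object name high_value is only illustrative, and the sketch shows the general pattern (inspect, subset, tabulate, plot) rather than a complete answer to any question.

```r
# Assumption: the data set is the Boston data frame from the MASS package.
# If you were given a file instead, read it with read.csv() and adapt the names.
library(MASS)
data(Boston)

# Size and structure: number of rows/columns and the type of each variable
dim(Boston)
str(Boston)

# Five-number summary plus mean for the main variable of interest
summary(Boston$medv)

# medv is recorded in $1000s, so "over $40,000" corresponds to medv > 40
high_value <- subset(Boston, medv > 40)   # illustrative object name
nrow(high_value)
summary(high_value$rm)

# chas is a 0/1 dummy variable: 1 means the area bounds the Charles River
table(Boston$chas)

# Plots: medv by river-side status, and medv against per-capita crime rate
boxplot(medv ~ chas, data = Boston,
        names = c("Other", "River-side"), ylab = "medv ($1000s)")
plot(Boston$crim, Boston$medv,
     xlab = "Per-capita crime rate (crim)", ylab = "medv ($1000s)")
```

For the comparison questions, a two-sample test such as t.test(medv ~ chas, data = Boston) or its rank-based counterpart wilcox.test() is one option; which is appropriate depends on what the plots show about the shape of the distributions.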