TO ANSWER THE FOLLOWING QUESTIONS USE THE FILE wage.RData.
The final report for this assignment should contain the answers to the questions. Each question should be addressed
in one slide (two at most), reporting the final answer and the elements (tables, graphs, numbers) that
support it. The report should include screenshots of the output obtained as well as a comprehensive
interpretation of the results.
In evaluating the assignments, we will take into account: correctness of statistical results, interpretation of
outcomes, originality of presentation, synthesis and layout.
The dataset provides information on a random sample of n = 150 workers, with a position as either laborer or
employee.
Variables:
• Wage: annual gross wage (thousand euros)
• Education: number of years of education
• Experience: number of years of work experience
• Tenure: number of years spent working in the current company
• Non_ita: (factor) 0 = Italian; 1 = not Italian
• Female: (factor) 0 = male; 1 = female
• Married: (factor) 0 = unmarried; 1 = married
• Region: (factor) region of the job (north, center, south)
QUESTION 1
Starting from a preliminary analysis of the dataset, what are the main findings and recommendations we can
draw?
QUESTION 2
Estimate the multiple linear regression model of Wage against Experience and Tenure. What conclusions can we
draw?
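As an illustration only, the fit requested here can be sketched in Python with NumPy. The helper name fit_ols and all numbers below are assumptions: the data are synthetic placeholders, not the contents of wage.RData, so the printed estimates say nothing about the real sample.

```python
import numpy as np

def fit_ols(X, y):
    """Least-squares fit of y on X (an intercept column is added here).

    Returns the coefficient vector (intercept first) and R^2.
    """
    X1 = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X1, y, rcond=None)
    resid = y - X1 @ beta
    r2 = 1.0 - (resid @ resid) / np.sum((y - y.mean()) ** 2)
    return beta, r2

# Placeholder data only: NOT the values stored in wage.RData.
rng = np.random.default_rng(0)
n = 150
experience = rng.uniform(0, 30, n)
tenure = rng.uniform(0, 1, n) * experience  # tenure never exceeds experience
wage = 20 + 0.5 * experience + 0.3 * tenure + rng.normal(0, 2, n)

beta, r2 = fit_ols(np.column_stack([experience, tenure]), wage)
print("intercept, Experience, Tenure:", beta)
print("R^2:", r2)
```

With the real data, the same two-regressor design matrix would be built from the Experience and Tenure columns after loading the file in the software used for the course.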
QUESTION 3
We are interested in studying the joint effect of gender (Female) and of Experience on Wage. What can
we say? (Hint: estimate three suitable multiple linear regression models)
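One possible reading of the hint, stated here as an assumption rather than as the intended solution, is to compare a model where gender shifts only the intercept, one where it changes only the slope of Experience, and one where it does both. The labels (A), (B), (C) and the coefficient names are introduced for illustration only:

```latex
\begin{align*}
\text{(A)}\quad \text{Wage}_i &= \beta_0 + \beta_1\,\text{Experience}_i + \beta_2\,\text{Female}_i + \varepsilon_i \\
\text{(B)}\quad \text{Wage}_i &= \beta_0 + \beta_1\,\text{Experience}_i + \beta_3\,(\text{Female}_i \times \text{Experience}_i) + \varepsilon_i \\
\text{(C)}\quad \text{Wage}_i &= \beta_0 + \beta_1\,\text{Experience}_i + \beta_2\,\text{Female}_i + \beta_3\,(\text{Female}_i \times \text{Experience}_i) + \varepsilon_i
\end{align*}
```

In (A) the two regression lines are parallel and differ by $\beta_2$; in (B) they share the intercept and differ in slope by $\beta_3$; in (C) both intercept and slope may differ.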
QUESTION 4
We are still interested in studying the joint effect of gender (Female) and of Experience on Wage, but using
only those workers who have recently changed company (that is, those for which Tenure == 0). How do our conclusions
change?
QUESTION 5
Estimate a suitable multiple linear regression model for Wage against all other variables except Region (that is,
Education, Female, Married, Experience, Tenure and Non_ita). With a confidence level equal to 99%, what is the
smallest difference in average wage between a woman and a man, all other characteristics being equal?
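A rough sketch of how a confidence interval for a single coefficient could be computed is given below. Several things here are assumptions: the helper name coef_conf_int, the synthetic placeholder data (not wage.RData), the reduced set of two regressors instead of six, and the use of a normal quantile as a large-sample stand-in for the t quantile with n - p degrees of freedom. Reading the "smallest difference" as the interval endpoint closest to zero is also only one interpretation of the question.

```python
import numpy as np
from statistics import NormalDist

def coef_conf_int(X, y, j, level=0.99):
    """Approximate two-sided confidence interval for coefficient j (0 = intercept)."""
    X1 = np.column_stack([np.ones(len(y)), X])
    n, p = X1.shape
    beta, *_ = np.linalg.lstsq(X1, y, rcond=None)
    resid = y - X1 @ beta
    sigma2 = (resid @ resid) / (n - p)        # unbiased residual variance
    cov = sigma2 * np.linalg.inv(X1.T @ X1)   # estimated covariance of beta
    se = np.sqrt(cov[j, j])
    # Normal quantile as an approximation: the exact t quantile with
    # n - p degrees of freedom is slightly larger.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return beta[j] - z * se, beta[j] + z * se

# Placeholder data only: NOT the values stored in wage.RData.
rng = np.random.default_rng(1)
n = 150
female = rng.integers(0, 2, n).astype(float)
education = rng.uniform(8, 18, n)
wage = 10 + 1.5 * education - 3.0 * female + rng.normal(0, 2, n)

# Columns after the intercept: 1 = Education, 2 = Female.
low, high = coef_conf_int(np.column_stack([education, female]), wage, j=2)
print(f"99% interval for the Female coefficient: [{low:.2f}, {high:.2f}]")
```

Statistical software normally reports the exact t-based interval directly, which is preferable to the approximation used in this sketch.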
QUESTION 6
Estimate a multiple linear regression model for Wage against all the other variables in the dataset. Compare this
model with a model that excludes Experience and Non_ita. What managerial implications can we draw?
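One common way to compare two nested linear models is a partial F-test on their residual sums of squares. The sketch below assumes that approach; the helper names rss and partial_f are hypothetical, the data are synthetic placeholders (not wage.RData), and the "full" model is simplified to four numeric regressors, without the remaining factors such as Region.

```python
import numpy as np

def rss(X, y):
    """Residual sum of squares and number of parameters (intercept included)."""
    X1 = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X1, y, rcond=None)
    resid = y - X1 @ beta
    return resid @ resid, X1.shape[1]

def partial_f(X_full, X_reduced, y):
    """F statistic for H0: the coefficients dropped from the full model are all zero."""
    rss_f, p_f = rss(X_full, y)
    rss_r, p_r = rss(X_reduced, y)
    q = p_f - p_r                 # number of restrictions
    df_resid = len(y) - p_f       # residual degrees of freedom of the full model
    f_stat = ((rss_r - rss_f) / q) / (rss_f / df_resid)
    return f_stat, q, df_resid

# Placeholder data only: NOT the values stored in wage.RData.
rng = np.random.default_rng(2)
n = 150
education = rng.uniform(8, 18, n)
tenure = rng.uniform(0, 20, n)
experience = tenure + rng.uniform(0, 15, n)
non_ita = rng.integers(0, 2, n).astype(float)
wage = 5 + 1.5 * education + 0.4 * tenure + rng.normal(0, 2, n)

X_full = np.column_stack([education, tenure, experience, non_ita])
X_reduced = np.column_stack([education, tenure])
f_stat, q, df_resid = partial_f(X_full, X_reduced, wage)
print(f"F = {f_stat:.3f} on ({q}, {df_resid}) degrees of freedom")
```

The statistic is then compared with the F distribution with (q, df_resid) degrees of freedom; adjusted R^2 or information criteria are alternative comparison tools.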
QUESTION 7
We are interested in studying the relationship between Wage and Region only. Assuming α = 0.05, what does the
evidence show?
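With a single categorical regressor, the overall F-test of the regression on the region dummies coincides with a one-way ANOVA. The sketch below computes that F statistic by hand; the helper name one_way_f is hypothetical and the data are synthetic placeholders, not the contents of wage.RData.

```python
import numpy as np

def one_way_f(values, groups):
    """One-way ANOVA F statistic with its between/within degrees of freedom."""
    labels = np.unique(groups)
    grand_mean = values.mean()
    ss_between = 0.0
    ss_within = 0.0
    for g in labels:
        v = values[groups == g]
        ss_between += v.size * (v.mean() - grand_mean) ** 2
        ss_within += ((v - v.mean()) ** 2).sum()
    df_between = labels.size - 1
    df_within = values.size - labels.size
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Placeholder data only: NOT the values stored in wage.RData.
rng = np.random.default_rng(3)
n = 150
region = rng.choice(["north", "center", "south"], size=n)
wage = rng.normal(30, 5, n)

f_stat, df_b, df_w = one_way_f(wage, region)
print(f"F = {f_stat:.3f} on ({df_b}, {df_w}) degrees of freedom")
```

The null hypothesis of equal mean wage across regions is rejected at α = 0.05 when the statistic exceeds the 0.95 quantile of the F distribution with (df_b, df_w) degrees of freedom.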