0% found this document useful (0 votes)
44 views4 pages

Data Visualisation

An assignment on data visualization

Uploaded by

onlinemunene
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
44 views4 pages

Data Visualisation

An assignment on data visualization

Uploaded by

onlinemunene
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

Data Visualization

Student's Name

Institutional Affiliation

Course Details

Instructor's Name

Date
1. Describe the differences in lottery sales between counties. What are the key insights
that emerge from the data?

A study of the counties’ lottery sales shows that the sales of the lotteries fluctuate greatly. The
average of the counties’ lottery sales is roughly 14.3 million with the median of approximately 3.76
million, thus there are some exceptional high sales counties that strongly influence the mean. Also a
great variability in sales is observed across counties as standard deviation = 83 million suggests, and
the range from minimum to maximum sales = 217,000 to 973 million. The histogram of sales shows
positive skewness with skewness of 11.43 and kurtosis of 132.38, which means that just a few
counties generate high proportions of total lotter sales while most of the counties’ sales are much
lower (see figure below).

totalsales
70000000

60000000

50000000

40000000

30000000

20000000

10000000

0
Highland Middlesex Nelson Essex Smyth FredericksburgSuffolkVirginia Beach

The bar chart which illustrates total sales per county reveal that most of the counties have low
lottery sales figures; three-cheap counties and they are Henrico, Arlington and Hampton. Such
distribution pattern corresponds to the skewed distribution as exemplified by the uneven nature of
the sales where some counties are more important players in the sales of lotteries.
Through the regression analysis above, the question was analysed using a model of R-square = 1,
suggesting that the model entirely explain the variability in total sales. If one were to accept this
outcome as representative of the X or Y = 1 scenario, the findings suggest a perfect fit, which has to
be qualified by the high values of multicollinearity between the game-specific sales variables. The
coefficients for games such as Scratcher, Lotto, and Kicker are about 1, which means that the
investment in each game type equals to sales. Concerning the correlations between game variables
and total sales the p-values are 0: this indicates that every type of game has made a statistically
significant contribution to the total sales. These results indicate that individual games comprise a
significant proportion of total lottery sales across counties and indicate that each game type is crucial
for overall total sales.

2. Potentially, what could be the important county factors that contribute to more or
less lottery sales?
The determining factor that contributes to more or less lottery sale was evaluated first using
correlation model. The finding was that Totalsales had a high positive correlation = 0.921 with the
retail. This indicated that counties with more retail experience higher lottery sales. Secondly,
Population size with 0.733 to totalsales reveal that larger counties generated more sales due to its
population advantage. Also with a correlation of 0.808 with total sales, retailpercap reinforced that
both the quantity and per capita distribution of retails contributed to a more lottery sales. Contrary,
demographic factors like %hsgrad and %coll had minute correlation with totalsales indicating a
smaller association with lottery sales (See figure below).
A graph of influencing factors
60000

50000

40000

30000

20000

10000

0
population Retailpercap Retail

Lottery sales depend with the fact that 16 counties are border counties, while the rest,145 counties
are non-border counties and 62 are metro counties while the rest 73 are non-metro counties. This
indicates that counties near state borders through which such motoring policies could be
implemented and the counties in metropolitan regions could posses’ different characteristics with
regard to lottery sales as influenced by higher population density, easier access, and presence of
cross border customers from other counties.

This suggests that counties near state borders and those within metropolitan areas could have
distinct patterns in lottery sales, due to increased population density, accessibility, and the attraction
of non-residents from nearby areas. Therefore, the status of the county as border or metro area is an
essential element which has to do with higher sales since people in such places might spend more on
lottery products passing through the shops.

You might also like