DA Script

The document is a mid-term examination answer script for a student named Khush Keyush Patel, enrolled in the Computer Applications course, specifically for the Data Analytics subject. It includes scores for various questions, detailing the student's performance on topics such as ANOVA, data transformation, outlier analysis, and statistical tests related to heart rates and delivery times. The script outlines specific questions and the corresponding scores achieved by the student, reflecting their understanding of data analytics concepts.

Uploaded by

khushpatel1222

Student Answer Script View

MIT MPL-BTech-M Sc - MCA - 1st-3rd-5th and 7th Semester - Mid Term Examination - Sep 2024
Answer Sheet

Student Name: KHUSH KEYUSH PATEL
Roll Number: 230968018
Course: Computer applications
Year/Sem: Semester 3
Total Score: 16.00 / 30.00
Subject Name: DATA ANALYTICS
Exam Date: 23-Sep-2024
Question : 1) Score : 0.50 / 0.50
Question : 2) Score : 0.00 / 0.50
Question : 3) Score : 0.50 / 0.50
Question : 4) Score : 0.50 / 0.50
Question : 5) Score : 0.00 / 0.50
Question : 6) Score : 0.50 / 0.50
Question : 7) Score : 0.00 / 0.50
Question : 8) Score : 0.00 / 0.50
Question : 9) Score : 0.00 / 0.50
Question : 10) Score : 0.00 / 0.50
Question : 11) Score : 4.00 / 4.00
A company tests the efficiency of three different advertising strategies (Ad A, Ad B, Ad C) by measuring the number of sales (in thousands) generated by each strategy over 4 days:

Day   Ad A   Ad B   Ad C
1     30     22     25
2     35     28     30
3     40     32     35
4     42     30     40

Use one-way ANOVA to determine if there is a significant difference in the mean number of sales across the three advertising strategies at a 5% significance level.
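
One way the one-way ANOVA for this question could be worked through is sketched below. It is not the student's submitted answer. It uses only the Python standard library, and the critical value `F_CRITICAL = 4.26` is an assumption taken from a standard F table for (2, 9) degrees of freedom at the 5% level.

```python
from statistics import mean

# Sales (in thousands) per advertising strategy over the 4 days.
groups = {
    "Ad A": [30, 35, 40, 42],
    "Ad B": [22, 28, 32, 30],
    "Ad C": [25, 30, 35, 40],
}

all_values = [x for g in groups.values() for x in g]
grand_mean = mean(all_values)
k = len(groups)            # number of groups
n_total = len(all_values)  # total number of observations

# Between-group and within-group sums of squares.
ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups.values())
ss_within = sum((x - mean(g)) ** 2 for g in groups.values() for x in g)

df_between = k - 1
df_within = n_total - k
f_stat = (ss_between / df_between) / (ss_within / df_within)

# Assumption: tabulated critical value of F(2, 9) at alpha = 0.05.
F_CRITICAL = 4.26
reject_h0 = f_stat > F_CRITICAL

print(f"F({df_between}, {df_within}) = {f_stat:.3f}, reject H0: {reject_h0}")
```

Under these assumptions F is about 2.574, which is below 4.26, so the null hypothesis of equal mean sales would not be rejected at the 5% level.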
Question : 12) Score : 2.50 / 3.00
Given the dataset with the following values:

Values=[5,15,25,35,45,55,65,75,85,95]

a. Explain the need for data transformation in data analytics and how Min-Max Normalization and Decimal Scaling help in preparing data for analysis.

b. Apply Min-Max Normalization to the dataset to transform the values to a range of [0, 1]. Show your calculations and results.

c. Apply Decimal Scaling to the dataset using a scaling factor of 100. Show your calculations and results.
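
Parts b and c can be checked with a short sketch like the following. The helper names `min_max_normalize` and `decimal_scale` are illustrative, not from any library. Min-Max Normalization is taken as (v - min) / (max - min), and Decimal Scaling as division by the stated factor of 100.

```python
values = [5, 15, 25, 35, 45, 55, 65, 75, 85, 95]


def min_max_normalize(data, new_min=0.0, new_max=1.0):
    """Rescale data linearly so min(data) -> new_min and max(data) -> new_max."""
    lo, hi = min(data), max(data)
    return [(x - lo) / (hi - lo) * (new_max - new_min) + new_min for x in data]


def decimal_scale(data, factor):
    """Divide every value by a power-of-ten scaling factor."""
    return [x / factor for x in data]


min_max = min_max_normalize(values)   # (v - 5) / 90 for this dataset
scaled = decimal_scale(values, 100)   # v / 100

print([round(v, 3) for v in min_max])
print(scaled)
```

For this dataset the minimum is 5 and the maximum is 95, so 5 maps to 0, 95 maps to 1, and each step of 10 in the input becomes a step of 1/9 in the normalized output.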
Question : 13) Score : 1.50 / 3.00
You are analyzing a dataset of customer transaction amounts for a retail company. The dataset contains the following transaction values:

Values=[10,12,14,18,22,24,25,28,30,50]

a. As part of your analysis, you need to evaluate the role of the Interquartile Range (IQR) in identifying outliers. Calculate the IQR for this dataset and determine if there are any outliers.

b. Analyze how identifying and addressing outliers using the IQR can affect the overall quality of your analysis and the insights you can derive about customer spending behavior.
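
A minimal sketch of the IQR calculation in part a follows. It assumes quartiles are defined as the medians of the lower and upper halves of the sorted data, and it uses the common 1.5 × IQR fences. Other quartile conventions give slightly different values.

```python
from statistics import median

values = sorted([10, 12, 14, 18, 22, 24, 25, 28, 30, 50])

# Assumption: Q1 and Q3 are the medians of the lower and upper halves.
half = len(values) // 2
lower_half = values[:half]
upper_half = values[-half:]

q1 = median(lower_half)
q3 = median(upper_half)
iqr = q3 - q1

# Conventional 1.5 * IQR fences for flagging outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in values if x < lower_fence or x > upper_fence]

print(f"Q1={q1}, Q3={q3}, IQR={iqr}, fences=({lower_fence}, {upper_fence})")
print("Outliers:", outliers)
```

With this convention Q1 = 14, Q3 = 28 and IQR = 14, so the fences are -7 and 49 and the value 50 is flagged. The result is sensitive to the quartile definition: the default "exclusive" method of Python's `statistics.quantiles` gives Q1 = 13.5 and Q3 = 28.5, which moves the upper fence to 51 and leaves 50 unflagged.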
Question : 14) Score : 0.00 / 3.00
Consider a dataset containing the following information about a set of customers:

Age

Annual Income

Spending Score (a measure of customer behaviour)

Using this dataset, perform an Exploratory Data Analysis to answer the following:

a. Identify the basic summary statistics (mean, median, and mode) for the Age and Annual Income columns.

b. Identify any outliers in the Annual Income column using the Interquartile Range (IQR) method.

c. Interpret the relationship between Age and Spending Score using a scatter plot or correlation analysis.
Question : 15) Score : 2.00 / 3.00
A company is analyzing the distribution of delivery times for their products. After collecting data, they notice that most deliveries happen around 2-3 days, but a few deliveries take much longer due to unexpected delays.

a. Explain how skewness affects the distribution of delivery times and its impact on the mean, median, and mode of the dataset.
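
The effect described in this question can be illustrated with a small sketch. The question gives no data, so the `delivery_days` list below is hypothetical and chosen only to mimic "mostly 2-3 days with a few long delays".

```python
from statistics import mean, median, mode

# Hypothetical delivery times in days (not from the exam): mostly 2-3 days,
# with two long delays forming a right (positive) tail.
delivery_days = [2, 2, 2, 2, 3, 3, 3, 4, 9, 14]

avg = mean(delivery_days)
mid = median(delivery_days)
most_common = mode(delivery_days)

print(f"mean={avg}, median={mid}, mode={most_common}")
```

In this illustrative sample the long delays pull the mean (4.4) above the median (3), which in turn sits above the mode (2). That ordering is the typical pattern for a right-skewed distribution.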
Question : 16) Score : 2.00 / 3.00
A research study aimed to assess the effectiveness of a six-month high-intensity interval training (HIIT) program in lowering heart rates. For adults in China, the average heart rate is typically 72 beats per minute. After participating in HIIT, a sample of 25 individuals recorded an average heart rate of 69 beats per minute, with a standard deviation of 6.5 beats per minute. Using a statistical test at the 5% significance level, determine if there is significant evidence to suggest that the HIIT program successfully reduced heart rates.
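
One plausible approach here is a one-sample, lower-tailed t-test, since the population standard deviation is unknown and n = 25. The sketch below assumes H0: μ = 72 against H1: μ < 72. The critical value `T_CRITICAL = -1.711` is an assumption taken from a standard t table for 24 degrees of freedom.

```python
import math

mu0 = 72      # hypothesized population mean (beats per minute)
x_bar = 69    # sample mean
s = 6.5       # sample standard deviation
n = 25        # sample size

standard_error = s / math.sqrt(n)
t_stat = (x_bar - mu0) / standard_error
df = n - 1

# Assumption: tabulated lower-tail critical value of t(24) at alpha = 0.05.
T_CRITICAL = -1.711
reject_h0 = t_stat < T_CRITICAL

print(f"t({df}) = {t_stat:.3f}, reject H0: {reject_h0}")
```

Under these assumptions t is about -2.308, which is below -1.711, so the sample would count as significant evidence of a reduction at the 5% level.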
Question : 17) Score : 0.00 / 2.00
A wildlife biologist is studying the alertness levels (arousal) of a population of "chill penguins" living in a tropical zoo. The arousal levels in this population are normally distributed, with a known standard deviation of 6. The biologist collects a sample of 49 "chill penguins" and measures their arousal, finding a sample mean arousal level of 46.44 and a sample standard deviation of 5.6968. Under normal conditions, the expected arousal level of these penguins is 47. Using a significance level of α = 0.01, test whether the observed sample mean of 46.44 is significantly less than the expected population mean of 47.
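
Because the population standard deviation is stated as known (6), a lower-tailed one-sample z-test is a natural fit. The sketch below assumes H0: μ = 47 against H1: μ < 47 and uses the known σ rather than the sample standard deviation.

```python
import math
from statistics import NormalDist

mu0 = 47        # expected population mean
x_bar = 46.44   # sample mean
sigma = 6       # known population standard deviation
n = 49          # sample size
alpha = 0.01

z_stat = (x_bar - mu0) / (sigma / math.sqrt(n))

# Lower-tail critical value and p-value from the standard normal distribution.
z_critical = NormalDist().inv_cdf(alpha)
p_value = NormalDist().cdf(z_stat)
reject_h0 = z_stat < z_critical

print(f"z = {z_stat:.3f}, critical = {z_critical:.3f}, reject H0: {reject_h0}")
```

Under these assumptions z is about -0.653, well above the critical value of about -2.326, so the sample mean would not be judged significantly lower than 47 at α = 0.01.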
Question : 18) Score : 2.00 / 2.00
The following data represents hemoglobin values in gm/dl for 10 patients:

10.5 9 6.5 8 11 7 7.5 8.5 9.5 12

Does the mean value for patients differ significantly from the mean value of the general population (12 gm/dl)? Evaluate the role of chance. (α = 0.05)
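
A two-tailed one-sample t-test is one way to approach this question. The sketch below assumes H0: μ = 12 against H1: μ ≠ 12. The critical value `T_CRITICAL = 2.262` is an assumption taken from a standard t table for 9 degrees of freedom.

```python
import math
from statistics import mean, stdev

hemoglobin = [10.5, 9, 6.5, 8, 11, 7, 7.5, 8.5, 9.5, 12]  # gm/dl
mu0 = 12  # general population mean (gm/dl)

n = len(hemoglobin)
x_bar = mean(hemoglobin)
s = stdev(hemoglobin)  # sample standard deviation (n - 1 denominator)

t_stat = (x_bar - mu0) / (s / math.sqrt(n))
df = n - 1

# Assumption: tabulated two-tailed critical value of t(9) at alpha = 0.05.
T_CRITICAL = 2.262
reject_h0 = abs(t_stat) > T_CRITICAL

print(f"mean = {x_bar:.2f}, s = {s:.3f}, t({df}) = {t_stat:.3f}, reject H0: {reject_h0}")
```

Under these assumptions the sample mean is 8.95 and t is about -5.352. Its magnitude exceeds 2.262, so the difference from 12 gm/dl would be unlikely to arise from chance alone at the 5% level.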
Question : 19) Score : 0.00 / 2.00
A company is evaluating the impact of two distinct advertising strategies (Strategy A and Strategy B) across three regions (Region 1, Region 2, and Region 3) to understand how they influence sales. The marketing team gathers sales performance data after applying both strategies in all three regions.

1. Based on the given scenario, what is the appropriate statistical technique the company should use to determine if there is a significant effect of the advertising strategy, region, or their interaction on sales?

2. After selecting the appropriate statistical technique, what kinds of conclusions could the company expect from the analysis of the data?