Student Answer Script View
MIT MPL-BTech-M Sc - MCA - 1st-3rd-5th and 7th Semester - Mid Term Examination - Sep 2024
Answer Sheet
Student Name: KHUSH KEYUSH PATEL
Roll Number: 230968018
Course: Computer applications
Year/Sem: Semester 3
Total Score: 16.00 / 30.00
Subject Name: DATA ANALYTICS
Exam Date: 23-Sep-2024
[Link] : 1) Score : 0.50 / 0.50
[Link] : 2) Score : 0.00 / 0.50
[Link] : 3) Score : 0.50 / 0.50
[Link] : 4) Score : 0.50 / 0.50
[Link] : 5) Score : 0.00 / 0.50
[Link] : 6) Score : 0.50 / 0.50
[Link] : 7) Score : 0.00 / 0.50
[Link] : 8) Score : 0.00 / 0.50
[Link] : 9) Score : 0.00 / 0.50
[Link] : 10) Score : 0.00 / 0.50
[Link] : 11) Score : 4.00 / 4.00
A company tests the efficiency of three different advertising strategies (Ad A, Ad B, Ad C) by measuring the number of sales (in thousands) generated by each strategy over 4 days:
Day   Ad A   Ad B   Ad C
1     30     22     25
2     35     28     30
3     40     32     35
4     42     30     40
Use one-way ANOVA to determine if there is a significant difference in the mean number of sales across the three advertising strategies at a 5% significance level.
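The one-way ANOVA arithmetic for this table can be sketched as below. This is a minimal illustration using only the Python standard library, not a model answer; the critical value `F_CRITICAL` is an assumption taken from a standard F table for (2, 9) degrees of freedom at the 5% level, and all variable names are illustrative.

```python
# One-way ANOVA computed step by step with the standard library only.
from statistics import mean

# Sales (in thousands) per advertising strategy over the 4 days.
sales = {
    "Ad A": [30, 35, 40, 42],
    "Ad B": [22, 28, 32, 30],
    "Ad C": [25, 30, 35, 40],
}

all_values = [v for group in sales.values() for v in group]
grand_mean = mean(all_values)
k = len(sales)             # number of groups
n_total = len(all_values)  # total number of observations

# Between-group and within-group sums of squares.
ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in sales.values())
ss_within = sum((v - mean(g)) ** 2 for g in sales.values() for v in g)

df_between = k - 1
df_within = n_total - k
ms_between = ss_between / df_between
ms_within = ss_within / df_within
f_stat = ms_between / ms_within

# Assumption: table value for F(2, 9) at alpha = 0.05.
F_CRITICAL = 4.26
reject_h0 = f_stat > F_CRITICAL

print(f"SSB={ss_between:.3f}, SSW={ss_within:.3f}, F={f_stat:.3f}")
print("Reject H0" if reject_h0 else "Fail to reject H0")
```

With these figures the F statistic comes out at roughly 2.57, below the assumed critical value, so the sketch does not reject the hypothesis of equal mean sales.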
[Link] : 12) Score : 2.50 / 3.00
Given the dataset with following values
Values=[5,15,25,35,45,55,65,75,85,95]
a. Explain the need for data transformation in data analytics and how Min-Max Normalization and Decimal Scaling help in preparing data for analysis.
b. Apply Min-Max Normalization to the dataset to transform the values to a range of [0, 1]. Show your calculations and results.
c. Apply Decimal Scaling to the dataset using a scaling factor of 100. Show your calculations and results.
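Parts b and c can be checked with a short sketch like the one below. The function names `min_max_normalize` and `decimal_scale` are illustrative helpers, not part of any library. The scaling factor of 100 is the one given in the question; it also matches the usual decimal-scaling rule of dividing by the smallest power of ten that brings every absolute value below 1.

```python
values = [5, 15, 25, 35, 45, 55, 65, 75, 85, 95]


def min_max_normalize(data, new_min=0.0, new_max=1.0):
    """Map data linearly so min(data) -> new_min and max(data) -> new_max."""
    lo, hi = min(data), max(data)
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in data]


def decimal_scale(data, factor):
    """Divide every value by the given power-of-ten factor."""
    return [v / factor for v in data]


min_max = min_max_normalize(values)
scaled = decimal_scale(values, 100)

print([round(v, 4) for v in min_max])
print(scaled)
```

Min-Max Normalization here reduces to (v - 5) / 90, so the values land at evenly spaced points from 0 to 1, while Decimal Scaling simply shifts the decimal point two places (5 becomes 0.05, 95 becomes 0.95).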
[Link] : 13) Score : 1.50 / 3.00
You are analyzing a dataset of customer transaction amounts for a retail company. The dataset contains the following transaction values:
Values=[10,12,14,18,22,24,25,28,30,50]
a. As part of your analysis, you need to evaluate the role of the Interquartile Range (IQR) in identifying outliers. Calculate the IQR for this dataset and determine if there are any outliers.
b. Analyze how identifying and addressing outliers using the IQR can affect the overall quality of your analysis and the insights you can derive about customer spending behavior.
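The IQR calculation in part a can be sketched as follows. This assumes the "median of halves" convention for quartiles and the usual 1.5 x IQR fences; other quartile conventions (for example, interpolated percentiles) can shift the fences, so the set of flagged values may differ. The helper name `quartiles_median_of_halves` is illustrative.

```python
from statistics import median

transactions = [10, 12, 14, 18, 22, 24, 25, 28, 30, 50]


def quartiles_median_of_halves(data):
    """Return (Q1, Q3) as the medians of the lower and upper halves."""
    ordered = sorted(data)
    half = len(ordered) // 2
    lower = ordered[:half]
    # Skip the middle value when the number of observations is odd.
    upper = ordered[half + len(ordered) % 2:]
    return median(lower), median(upper)


q1, q3 = quartiles_median_of_halves(transactions)
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [v for v in transactions if v < lower_fence or v > upper_fence]

print(f"Q1={q1}, Q3={q3}, IQR={iqr}")
print(f"Fences: [{lower_fence}, {upper_fence}], outliers: {outliers}")
```

Under this convention Q1 = 14, Q3 = 28 and IQR = 14, which puts the fences at -7 and 49 and flags the transaction of 50 as an outlier.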
[Link] : 14) Score : 0.00 / 3.00
Consider a dataset containing the following information about a set of customers:
Age
Annual Income
Spending Score (a measure of customer behaviour)
Using this dataset, perform an Exploratory Data Analysis to answer the following:
a. Identify the basic summary statistics (mean, median, and mode) for the Age and Annual Income columns.
b. Identify any outliers in the Annual Income column using the Interquartile Range (IQR) method.
c. Interpret the relationship between Age and Spending Score using a scatter plot or correlation analysis.
[Link] : 15) Score : 2.00 / 3.00
A company is analyzing the distribution of delivery times for their products. After collecting data, they notice that most deliveries happen around 2-3 days, but a few deliveries take much longer due to unexpected delays.
a. Explain how skewness affects the distribution of delivery times and its impact on the mean, median, and mode of the dataset.
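The effect described in this scenario can be illustrated with a small sketch. The delivery times below are hypothetical values invented for illustration (the question supplies no data); they are chosen so that most deliveries take 2-3 days with a few long delays.

```python
from statistics import mean, median, mode

# Hypothetical delivery times in days: mostly 2-3, with two long delays.
delivery_days = [2, 2, 3, 3, 3, 2, 3, 3, 4, 9, 14]

avg = mean(delivery_days)
mid = median(delivery_days)
most_common = mode(delivery_days)

# Moment-based skewness: third central moment over variance^(3/2).
n = len(delivery_days)
m2 = sum((x - avg) ** 2 for x in delivery_days) / n
m3 = sum((x - avg) ** 3 for x in delivery_days) / n
skewness = m3 / m2 ** 1.5

print(f"mean={avg:.2f}, median={mid}, mode={most_common}, skewness={skewness:.2f}")
```

In this made-up sample the long delays pull the mean above the median and mode, and the skewness coefficient is positive, which is the pattern expected for a right-skewed distribution.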
[Link] : 16) Score : 2.00 / 3.00
A research study aimed to assess the effectiveness of a six-month high-intensity interval training (HIIT) program in lowering heart rates. For adults in China, the average heart rate is typically 72 beats per minute. After participating in HIIT, a sample of 25 individuals recorded an average heart rate of 69 beats per minute, with a standard deviation of 6.5 beats per minute. Using a statistical test at the 5% significance level, determine if there is significant evidence to suggest that the HIIT program successfully reduced heart rates.
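One way to set up this problem is a left-tailed one-sample t-test, since the population standard deviation is not given and the sample is small. The sketch below is an illustration under that assumption; `T_CRITICAL` is an assumed value taken from a standard t table (one-tailed, 24 degrees of freedom, 5% level).

```python
import math

mu0 = 72          # hypothesised population mean (beats per minute)
sample_mean = 69  # sample mean after the HIIT program
s = 6.5           # sample standard deviation
n = 25            # sample size

# H0: mu >= 72, H1: mu < 72 (left-tailed test).
standard_error = s / math.sqrt(n)
t_stat = (sample_mean - mu0) / standard_error
df = n - 1

# Assumption: one-tailed t-table value for df = 24 at alpha = 0.05.
T_CRITICAL = -1.711
reject_h0 = t_stat < T_CRITICAL

print(f"t={t_stat:.4f}, df={df}")
print("Reject H0" if reject_h0 else "Fail to reject H0")
```

The statistic is about -2.31, which falls below the assumed critical value, so under these assumptions the sketch rejects the null hypothesis in favour of a reduction in heart rate.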
[Link] : 17) Score : 0.00 / 2.00
A wildlife biologist is studying the alertness levels (arousal) of a population of "chill penguins" living in a tropical zoo. The arousal levels in this population are normally distributed, with a known standard deviation of 6. The biologist collects a sample of 49 "chill penguins" and measures their arousal, finding a sample mean arousal level of 46.44 and a sample standard deviation of 5.6968. Under normal conditions, the expected arousal level of these penguins is 47. Using a significance level of α = 0.01, test whether the observed sample mean of 46.44 is significantly less than the expected population mean of 47.
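Because the population standard deviation is stated as known, a left-tailed one-sample z-test is a natural fit; the sample standard deviation of 5.6968 is then not needed. The sketch below is an illustration under that assumption and uses `statistics.NormalDist` from the Python standard library for the critical value and p-value.

```python
import math
from statistics import NormalDist

mu0 = 47             # expected population mean
sample_mean = 46.44  # observed sample mean
sigma = 6            # known population standard deviation
n = 49               # sample size
alpha = 0.01

# H0: mu >= 47, H1: mu < 47 (left-tailed test).
standard_error = sigma / math.sqrt(n)
z_stat = (sample_mean - mu0) / standard_error

standard_normal = NormalDist()              # mean 0, standard deviation 1
z_critical = standard_normal.inv_cdf(alpha) # left-tail cutoff
p_value = standard_normal.cdf(z_stat)       # P(Z <= z_stat)
reject_h0 = z_stat < z_critical

print(f"z={z_stat:.4f}, critical={z_critical:.4f}, p={p_value:.4f}")
print("Reject H0" if reject_h0 else "Fail to reject H0")
```

The statistic is about -0.65, well above the cutoff of roughly -2.33, so this sketch does not reject the null hypothesis at α = 0.01.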
[Link] : 18) Score : 2.00 / 2.00
The following data represents hemoglobin values in gm/dl for 10 patients:
10.5 9 6.5 8 11 7 7.5 8.5 9.5 12
Does the mean value for the patients differ significantly from the mean value of the general population (12 gm/dl)? Evaluate the role of chance. (α = 0.05)
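A two-tailed one-sample t-test on the raw values can be sketched as below. This is an illustrative setup rather than a model answer; `T_CRITICAL` is an assumed value from a standard t table (two-tailed, 9 degrees of freedom, 5% level).

```python
import math
from statistics import mean, stdev

hemoglobin = [10.5, 9, 6.5, 8, 11, 7, 7.5, 8.5, 9.5, 12]
mu0 = 12  # general population mean (gm/dl)

n = len(hemoglobin)
sample_mean = mean(hemoglobin)
s = stdev(hemoglobin)  # sample standard deviation (n - 1 in the denominator)

# H0: mu = 12, H1: mu != 12 (two-tailed test).
standard_error = s / math.sqrt(n)
t_stat = (sample_mean - mu0) / standard_error
df = n - 1

# Assumption: two-tailed t-table value for df = 9 at alpha = 0.05.
T_CRITICAL = 2.262
reject_h0 = abs(t_stat) > T_CRITICAL

print(f"mean={sample_mean:.2f}, s={s:.4f}, t={t_stat:.4f}, df={df}")
print("Reject H0" if reject_h0 else "Fail to reject H0")
```

The sample mean is 8.95 gm/dl and the statistic is about -5.35, so under these assumptions the difference from 12 gm/dl is unlikely to be due to chance alone.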
[Link] : 19) Score : 0.00 / 2.00
A company is evaluating the impact of two distinct advertising strategies (Strategy A and Strategy B) across three regions (Region 1, Region 2, and Region 3) to understand how they influence sales. The marketing team gathers sales performance data after applying both strategies in all three regions.
1. Based on the given scenario, what is the appropriate statistical technique the company should use to determine if there is a significant effect of the advertising strategy, the region, or their interaction on sales?
2. After selecting the appropriate statistical technique, what kinds of conclusions could the company expect from the analysis of the data?