Ch.4.Data Science X-1

Uploaded by

manojthaware1972

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views3 pages

Ch.4.Data Science X-1

Uploaded by

manojthaware1972

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

STD-X AI PART-B

UNIT. 4 – DATA SCIENCE

Q1. Distinguish between data acquisition and data exploration.
Data Acquisition deals with collecting or obtaining data from different sources. This is based on the
need on the project. For example, traffic data for traffic lights management system. Data Exploration
deals on the other hand deals with finding patterns/trends from the acquired data. The AI system
will use these findings for solving the given problems.

Q2. How are NumPy arrays better than Python lists?

NumPy arrays have number of advantages over the Python lists. They are more convenient to use,
they are much faster than Python lists and the memory consumed by them is less because their data
structure takes less space

Q3. Explain some errors that can occur during data collection.
Some of the errors possible in data collection are:
1. Collection of incorrect values
2. Collecting invalid or null values
3. Data missing from dataset cells
4. Out of range data in the datasets

Q4. Explain the KNN model.

KNN or K-Nearest Neighbours is an algorithm used for supervised machine learning. KNN can be
used with both regression and classification tasks. This method evaluates the labels of a selected
number of data points around a target data point to predict the class on which the data point falls.

Q5. Write a short note on the Five-Factor Model.

The Five Factor Model is used for measuring the following five personality traits:
Openness: The trait of an individual to accept new things
Conscientiousness: The trait of being particular, organised, watchful etc.
Extroversion: The trait related to being social and outgoing
Agreeableness: The trait measuring friendliness of people with each other
Neuroticism: The trait related to controlling ones’ own emotions

Q6. In what ways can an AI capture data?

AI systems can collect or capture data from multiple sources. For example, eye-tracking sensors can
be used for capturing the movement of eye and body language, smartphones can be used for
capturing the usage data of the user, smart cameras can be used for capturing video data, data
streams can be used for capturing live data etc.

Q7. What is data visualisation? How can we use Python for data visualisation?
Data visualisation is a technique used for understanding and getting insights from the data. The two
most basic forms of data visualisation are graphs and charts. Python provides different plots for the
data visualisation, like Line plot, Histogram Plot, Scatter Plot, Bar Chart, Box and Whisker Plot etc.
Q8. What are Pandas and SciPy? Why are they used?
Pandas and SciPy are python libraries for assisting in scientific computations and data analysis.
Pandas deals with structured data operations and manipulations. It provides facility for data cleaning
and preparation. SciPy is one of the commonly used libraries for advanced-level science and
engineering functions. It provides access to Linear Algebra, Optimisation, Sparse Matrices etc.

Q9. Explain at least three different uses of data science in the financial sector.
The use of data science in the financial sector include:
Risk Analytics: this deals with analysing the risk associated with financial transactions like providing
loans or insuring something.
Fraud Detection: analysis of big data allows the financial sector to reduce frauds and scams
Personalised Services: financial sector use data science for providing personalised services to the
clients at the best possible rate.

Q10. What is problem scoping?

Problem scoping is part of the planning stage of the AI projects. It starts once the problem is
identified and includes establishing of goals, identification of stakeholders, finding out what is being
currently done for dealing with the problem and the ethical concerns related to the problem.
Problem Scoping establishes the limit of the problem.

Worksheet -1
Find out at least three uses of Data Science in the following sectors:

1. Education: Providing adaptive learning opportunities to students. Assisting in improving teacher

assessments.
2. Hospitality: Assisting hotels in predicting demand and customer behaviour. Automated dynamic
pricing systems for ensuring maximum revenue generation.
3. Defence: Assisting in smart targeting of enemy installations. Assisting in threat risk assessments.
4. Real Estate: Providing help with property valuations. Risk mitigation based on predictive models.
5. Law Enforcement: Smart surveillance systems. Assisting with criminal identification.

Worksheet -2
Recently, several unauthorised persons were seen roaming around in your colony. For dealing
with this situation, you have been asked to design a smart security system, which will allow only
authorised people to enter the colony.

Q1. What data will you need for designing and implementing the system?

1. The images of all the individuals authorised to enter the colony

2. The details of all the individuals authorised to enter the colony

Q2. From where will you collect this data?

1. Colony records or asking the authorised individuals to provide information.
Worksheet 3
Mean: sum of all the terms/number of terms
Median: {(n+1)/2)} th value. N= number of values in the data set
Mode: The most frequently occurring value/observation
Standard Deviation: � ∑|𝑥𝑥−𝜇𝜇|2
𝑁𝑁 Where ∑ = sum of, x = value in the data set, 𝜇𝜇 = mean of the data set
and N = number of data points
Variance: 𝑆𝑆 2 = ∑(𝑥𝑥 𝑖𝑖 −𝑥𝑥)2
𝑛𝑛−1 where 𝑆𝑆 2 = sample variance, 𝑥𝑥𝑖𝑖 = the value of one observation, 𝑥𝑥 = the mean
value of all observations and 𝑛𝑛 = the number of observations.

Q.Browse the internet and find out the names of five personality structure models like the Big Five
Model. Also, find out the name of the people who introduced the model.

1. Sixteen Personality Factor Questionnaire (Raymond Cattell)

2. Myers–Briggs Type Indicator (Katharine Cook Briggs and Isabel Briggs Myers)
3. Keirsey Temperament Sorter (Keirsey)
4. Enneagram of Personality (Oscar Ichazo is generally recognised for making this model known)
5. Type A and Type B personality theory (Meyer Friedman and Ray Rosenman)

Worksheet: 4

Q.Find out the five uses of the K-Nearest Neighbour model.

1. Used for smart surveillance. For example, detecting hidden packages at the bottom of shopping
carts.
2. For creating recommendation systems based on what the customer buys/watches/listens etc.
3. For finding documents which are semantically identical.
4. Predicting illnesses, for example breast cancer cases
5. Stock market predictions for trading

Big Data (Imp-Questions)
No ratings yet
Big Data (Imp-Questions)
17 pages
DS
No ratings yet
DS
7 pages
Foundation of Data Science Previous Year Question Paper
100% (1)
Foundation of Data Science Previous Year Question Paper
40 pages
01.ad3491 Fdsa QB
No ratings yet
01.ad3491 Fdsa QB
16 pages
Essentials of Data Science Exploration
No ratings yet
Essentials of Data Science Exploration
15 pages
FDS - 1 Solved
No ratings yet
FDS - 1 Solved
17 pages
Question Bank With Answers
No ratings yet
Question Bank With Answers
103 pages
Ixs8h l8mgc
No ratings yet
Ixs8h l8mgc
40 pages
Revision
No ratings yet
Revision
19 pages
Class 9 (Chap #4)
No ratings yet
Class 9 (Chap #4)
9 pages
Data Science
No ratings yet
Data Science
10 pages
Data Science MCQs Sample Mid2xlsx 2024 11-29-23!19!54
No ratings yet
Data Science MCQs Sample Mid2xlsx 2024 11-29-23!19!54
8 pages
Chapter No.4 Exercise Solution (Computer)
No ratings yet
Chapter No.4 Exercise Solution (Computer)
8 pages
Exercise PDF
No ratings yet
Exercise PDF
9 pages
Q - ClassX - AI - Ch5 and 6 - DS and CV
No ratings yet
Q - ClassX - AI - Ch5 and 6 - DS and CV
12 pages
DAta Sciencefull
No ratings yet
DAta Sciencefull
38 pages
Q1. Explain Data Science Process Along With Detailed Diagram
No ratings yet
Q1. Explain Data Science Process Along With Detailed Diagram
7 pages
Set. No - 2 P18pecs021-Data Science QP - Ph.d.
No ratings yet
Set. No - 2 P18pecs021-Data Science QP - Ph.d.
20 pages
Unit 1 - 5 FDS 2marks
No ratings yet
Unit 1 - 5 FDS 2marks
14 pages
Big Data: An Overview
No ratings yet
Big Data: An Overview
9 pages
Cls10datascience 24082024 113123
No ratings yet
Cls10datascience 24082024 113123
4 pages
Unit 4
No ratings yet
Unit 4
6 pages
UNIT 1 Material
No ratings yet
UNIT 1 Material
28 pages
Cs3352 - Foundation of Data Science
No ratings yet
Cs3352 - Foundation of Data Science
56 pages
AI - Book 10 - Part B - Answer Key (New Version)
No ratings yet
AI - Book 10 - Part B - Answer Key (New Version)
16 pages
DS Assignment No 2
No ratings yet
DS Assignment No 2
21 pages
FDS Imp Docs
No ratings yet
FDS Imp Docs
22 pages
Unit 4 & 5-Data Science and Computer Vision
No ratings yet
Unit 4 & 5-Data Science and Computer Vision
18 pages
Data Science Model 1 Ques
No ratings yet
Data Science Model 1 Ques
2 pages
Chapter 6 - Data Science and K Nearest Neighbour Model (PART B)
No ratings yet
Chapter 6 - Data Science and K Nearest Neighbour Model (PART B)
5 pages
Ds Revision 1
No ratings yet
Ds Revision 1
5 pages
Data Science Mcqs - Hamza Zahoor
No ratings yet
Data Science Mcqs - Hamza Zahoor
9 pages
Data Science
No ratings yet
Data Science
14 pages
File 2
No ratings yet
File 2
43 pages
Data Science Notes
No ratings yet
Data Science Notes
2 pages
Data Science Answer Key Overview
No ratings yet
Data Science Answer Key Overview
17 pages
Ch-4 Solved Exercise Class Ix
No ratings yet
Ch-4 Solved Exercise Class Ix
9 pages
FDS 1
No ratings yet
FDS 1
5 pages
Data Science Unit 1 Notes
No ratings yet
Data Science Unit 1 Notes
30 pages
Data Science Comprehension Worksheets
No ratings yet
Data Science Comprehension Worksheets
32 pages
2023 Dec18CSE396T
No ratings yet
2023 Dec18CSE396T
4 pages
Data Science Assignment
No ratings yet
Data Science Assignment
9 pages
Data Science and Analytics Reviewer
No ratings yet
Data Science and Analytics Reviewer
5 pages
FDS Unit 1 QB
No ratings yet
FDS Unit 1 QB
7 pages
Fds Question Bank With Answer
No ratings yet
Fds Question Bank With Answer
35 pages
DS Final 3 Marks
No ratings yet
DS Final 3 Marks
10 pages
Complete WorksheetAIClassX
No ratings yet
Complete WorksheetAIClassX
27 pages
Data Science Unit 01
No ratings yet
Data Science Unit 01
19 pages
AD3491 - Unit 1 - Introduction To Data Science Important Questions 2 Marks With Answer - 3-8
No ratings yet
AD3491 - Unit 1 - Introduction To Data Science Important Questions 2 Marks With Answer - 3-8
6 pages
Data Science Set - B
No ratings yet
Data Science Set - B
5 pages
X Ai SS CH4 Notes
No ratings yet
X Ai SS CH4 Notes
5 pages
Data Science Notes and Questions - 250605 - 112515
No ratings yet
Data Science Notes and Questions - 250605 - 112515
5 pages
Data Science - Notes - X
No ratings yet
Data Science - Notes - X
3 pages
Key Concepts in Data Science and Analysis
No ratings yet
Key Concepts in Data Science and Analysis
21 pages
DS 3-Marks Semeseter Suggestion
No ratings yet
DS 3-Marks Semeseter Suggestion
54 pages
Chapter - 2 - Arranging - and - Collecting - Data Class9
100% (1)
Chapter - 2 - Arranging - and - Collecting - Data Class9
10 pages
Data Science
No ratings yet
Data Science
10 pages
Grade 10 Ch-4 Data Science
No ratings yet
Grade 10 Ch-4 Data Science
34 pages
Common Phrasal Verbs
No ratings yet
Common Phrasal Verbs
6 pages
MBA Operations and Supply Chain Management Lecture Notes 2
No ratings yet
MBA Operations and Supply Chain Management Lecture Notes 2
7 pages
Research Methodes and Stats Introduction
No ratings yet
Research Methodes and Stats Introduction
3 pages
Day 5 & 6
No ratings yet
Day 5 & 6
8 pages
Online Shopping
50% (6)
Online Shopping
54 pages
Compilation of Written Works
No ratings yet
Compilation of Written Works
3 pages
Art II Course Outline
No ratings yet
Art II Course Outline
2 pages
Childhood and Adolescent Disorders Chapter Exam Questions
No ratings yet
Childhood and Adolescent Disorders Chapter Exam Questions
19 pages
Sintayehu - Chekolu Updated CV
No ratings yet
Sintayehu - Chekolu Updated CV
5 pages
Exploring Elements of Poetry
50% (2)
Exploring Elements of Poetry
4 pages
Prof Ed 06 Chapter 3 Importance of Educational Technology
100% (1)
Prof Ed 06 Chapter 3 Importance of Educational Technology
15 pages
Supporting Needy Individuals
No ratings yet
Supporting Needy Individuals
3 pages
COT Q2 MATH11 Truth Values of Propositions
No ratings yet
COT Q2 MATH11 Truth Values of Propositions
5 pages
Mile 13 Oct 2024 11th JEE Advanced M 2 PHASE 4 KPM MODEL Test 3
No ratings yet
Mile 13 Oct 2024 11th JEE Advanced M 2 PHASE 4 KPM MODEL Test 3
9 pages
Course Objectives
No ratings yet
Course Objectives
5 pages
Semi-Detailed Lesson Plan Mapeh 8 2nd Quarter
No ratings yet
Semi-Detailed Lesson Plan Mapeh 8 2nd Quarter
4 pages
This Article Is About The Academic Discipline
No ratings yet
This Article Is About The Academic Discipline
8 pages
Key Steps in HR Planning Process
No ratings yet
Key Steps in HR Planning Process
3 pages
GADoT GAN-based Adversarial Training For Robust DDoS Attack Detection
No ratings yet
GADoT GAN-based Adversarial Training For Robust DDoS Attack Detection
9 pages
Gate Theory Asu Switched
No ratings yet
Gate Theory Asu Switched
1 page
ICAI Convocation 2025 Contact Details
No ratings yet
ICAI Convocation 2025 Contact Details
1 page
Test Bank Advanced Practice Nursing Essential Knowledge For The Profession 4th Edition DeNisco
0% (1)
Test Bank Advanced Practice Nursing Essential Knowledge For The Profession 4th Edition DeNisco
3 pages
01a. Questionnaire Hf. Recurrent Rev. 01, Jan. 04, 2023-Lgtc-tt-Am-f004
No ratings yet
01a. Questionnaire Hf. Recurrent Rev. 01, Jan. 04, 2023-Lgtc-tt-Am-f004
4 pages
Experiments For B. Tech. 1 Year Physics Laboratory
No ratings yet
Experiments For B. Tech. 1 Year Physics Laboratory
6 pages
Mon56 p2 Designreport
No ratings yet
Mon56 p2 Designreport
56 pages
Digitizing Classical Rhetorics
No ratings yet
Digitizing Classical Rhetorics
18 pages
Syllabus CC DRM For Final
100% (1)
Syllabus CC DRM For Final
9 pages
Big Data Enabled Nursing Education, Research and Practice Complete Ebook Edition
100% (19)
Big Data Enabled Nursing Education, Research and Practice Complete Ebook Edition
16 pages
Index
No ratings yet
Index
164 pages
Methods of Acquiring Knowledge
No ratings yet
Methods of Acquiring Knowledge
37 pages