Chapter 2 - Arranging and Collecting Data (Class 9)

Chapter 2 discusses data collection and arrangement, highlighting the differences between qualitative and quantitative data, as well as various data collection methods. It also covers the characteristics of Big Data, including its volume, variety, and velocity, and provides examples of algorithms used for data analysis. Additionally, the chapter includes activities and questions to reinforce understanding of data types and analysis techniques.

Chapter 2 – Arranging and Collecting Data

I. Objective Type Questions

1. A school named ABC has recorded the total marks of every student in the
class. This is an example of:
(a) Qualitative data
✅ (b) Quantitative data
(c) Both qualitative and quantitative data
(d) None of the above
2. A food delivery app has asked for your feedback on the quality of the food.
You have written two paragraphs to describe the food. This is an example of:
✅ (a) Qualitative data
(b) Quantitative data
(c) Both qualitative and quantitative data
(d) None of the above
3. You need to predict what the temperature will be for next Friday. Which
algorithm will you use?
(a) Clustering
✅ (b) Regression
(c) Anomaly detection
(d) Binary classification
4. You need to predict if your car tyre will last for the next 1000 km. Which
algorithm will you use?
(a) Clustering
(b) Regression
(c) Anomaly detection
✅ (d) Binary classification
5. Which of the following are benefits of Big Data processing?
(a) Businesses can utilize outside intelligence while making decisions
(b) Improved customer service
(c) Better operational efficiency
✅ (d) All of the above
6. The analysis of large amounts of data to see what patterns or other useful
information can be found is known as:
(a) Data Analysis
(b) Information Analytics
✅ (c) Big Data Analytics
(d) Data Analytics
7. Big data analysis does the following except:
(a) Collects data
✅ (b) Spreads data
(c) Organizes data
(d) Analyzes data
8. Primary data for the research process can be collected through:
(a) Experiment
(b) Survey
✅ (c) Both a and b
(d) None of the above
9. The advantages of secondary data are low cost, speed, availability, and
flexibility.
✅ (a) True
(b) False
10. The method of getting primary data by watching people is called:
(a) Survey
(b) Informative
✅ (c) Observational
(d) Experimental

II. Standard Questions

1. What is the difference between multivariate and univariate data? Give some examples.

Comparison Table: Univariate vs. Multivariate Data

S. No. | Feature | Univariate Data | Multivariate Data
1 | Number of Variables | One variable | Two or more variables
2 | Nature of Analysis | Simple analysis | Complex analysis with relationships
3 | Purpose | Describes a single property | Compares and relates multiple properties
4 | Example | Height of students | Sales of umbrellas vs. rainfall amount
5 | Graphical Representation | Bar chart, histogram, pie chart | Scatter plot, line graph, multivariate plots
6 | Use Case | To observe patterns or distribution | To study influence, correlation, trends, or predictions
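
The contrast in the table can be sketched in a few lines of Python. This is only an illustration: the heights, rainfall figures, and umbrella sales are invented sample values, and the helper names `describe_univariate` and `moves_together` are hypothetical, not part of the chapter.

```python
from statistics import mean

# Hypothetical sample values, invented for illustration only.
# Univariate: one variable (height in cm) recorded per student.
heights_cm = [150, 152, 148, 155, 160]

# Multivariate: two variables recorded together for each day.
# Each tuple is (rainfall in mm, umbrellas sold).
rain_and_sales = [(0, 2), (5, 8), (10, 15), (20, 31), (30, 44)]

def describe_univariate(values):
    """Summarise a single variable: smallest, largest, and average value."""
    return min(values), max(values), mean(values)

def moves_together(pairs):
    """Return True if the second variable never decreases as the first increases."""
    ordered = sorted(pairs)
    return all(a[1] <= b[1] for a, b in zip(ordered, ordered[1:]))

print(describe_univariate(heights_cm))   # describes one property
print(moves_together(rain_and_sales))    # relates two properties
```

The univariate summary only describes the heights, while the multivariate check asks how one variable behaves as the other changes, which is the kind of relationship a scatter plot would show.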

2. What are the common sources of data collection?

✅ Answer:
• Primary Sources – Data collected firsthand for a specific purpose:
👉 Surveys, Interviews, Questionnaires, Feedback forms, Observations.

• Secondary Sources – Data already collected for other purposes:
👉 Web traffic logs, Social media tracking, Satellite data, Transaction databases.

3. What are the primary characteristics of Big Data?

✅ Answer:
• Volume: Large size of data (terabytes or more)
• Variety: Data in multiple formats (text, audio, images, video)
• Velocity: Speed at which data is generated and processed

4. What are categorical variables? Give some examples.

✅ Answer:
Categorical variables are those that represent labels or categories, not numbers.

✅ Examples:
• Gender (Male/Female)
• Nationality (Indian, American)
• Vehicle color (Red, Blue, Black)
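
Because categorical values are labels, they are usually counted rather than averaged. A minimal sketch, assuming a small made-up list of vehicle colors:

```python
from collections import Counter

# Hypothetical observations, invented for illustration only.
vehicle_colors = ["Red", "Blue", "Black", "Red", "Blue", "Red"]

# Counter tallies how many times each label appears.
color_counts = Counter(vehicle_colors)

for color, count in color_counts.most_common():
    print(color, count)
```

An average of "Red" and "Blue" has no meaning, but the tally shows at a glance which category is most frequent.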

5. How is Big Data used in social media?

✅ Answer:
Big Data is used to:
• Track user behavior and preferences.
• Personalize ads and content.
• Monitor trending topics.
• Analyze posts, likes, shares, and user interactions in real time.
• Improve user experience with targeted recommendations.

III. Higher Order Thinking Skills (HOTS) – With Steps

Activity: Collect data on 50 motor vehicles passing by your house. Record the number of
wheels and the color of each vehicle, and present the data as a dot plot. Compare your plots
with your classmates'. Find which vehicle color is most common, and how many wheels the
least common type of vehicle has.

✅ Steps to perform the activity:

1. Observe and record 50 motor vehicles.
2. Attributes to note:
   - Number of wheels (2, 3, 4, or more)
   - Color of the vehicle
3. Create a dot plot to visualize data.
4. Compare your dot plot with classmates.
5. Analyze:
   - Which color appears most frequently?
   - Which vehicle type (based on number of wheels) appears least frequently?
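
The steps above can be tried out on a computer before drawing the plot by hand. The sketch below is one possible approach, not a required method: the ten observations are invented (a real survey would have 50 rows), and the function names are hypothetical.

```python
from collections import Counter

# Hypothetical observations, invented for illustration: (number of wheels, color).
# A real survey for this activity would contain 50 rows.
observations = [
    (2, "Black"), (4, "White"), (4, "Red"), (2, "Red"), (3, "Yellow"),
    (4, "White"), (2, "Black"), (4, "White"), (3, "Yellow"), (6, "Blue"),
]

def dot_plot_lines(values):
    """Return one text line per category, with one '*' per observation."""
    counts = Counter(values)
    return [f"{category} | {' '.join('*' * counts[category])}"
            for category in sorted(counts)]

def most_common(values):
    """Category that appears most often (ties go to the one seen first)."""
    return Counter(values).most_common(1)[0][0]

def least_common(values):
    """Category that appears least often (ties go to the one seen first)."""
    counts = Counter(values)
    return min(counts, key=counts.get)

wheels = [w for w, _ in observations]
colors = [c for _, c in observations]

print("\n".join(dot_plot_lines(wheels)))
print("\n".join(dot_plot_lines(colors)))
print("Most common color:", most_common(colors))
print("Wheels on the least common vehicle type:", least_common(wheels))
```

Each `*` stands for one vehicle, so the longest row in the color plot is the most common color and the shortest row in the wheels plot is the least common vehicle type.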

IV. Extra MCQs

1. Which of the following is an example of categorical data?


(a) Height of students
(b) Weight of fruits
(c) Color of vehicles
(d) Number of pens
✅ Answer: (c) Color of vehicles

2. Which data type is best suited for understanding customer satisfaction from
reviews?
(a) Numerical
(b) Binary
(c) Qualitative
(d) Quantitative
✅ Answer: (c) Qualitative

3. Which algorithm is used to classify students into performance groups based on scores?
(a) Regression
(b) Clustering
(c) Binary classification
(d) Anomaly detection
✅ Answer: (b) Clustering

4. What is the main goal of using regression algorithms?


(a) Group data
(b) Predict categories
(c) Detect anomalies
(d) Predict numeric values
✅ Answer: (d) Predict numeric values
5. Which of the following is an example of multivariate data?
(a) Student name
(b) Temperature readings
(c) Student grades and attendance
(d) Book title
✅ Answer: (c) Student grades and attendance

6. Which of the following is NOT a method for collecting secondary data?


(a) Social media tracking
(b) Satellite data
(c) Online feedback form
(d) Web traffic analysis
✅ Answer: (c) Online feedback form

7. Which component of Big Data refers to the variety of formats like text, images,
and video?
(a) Velocity
(b) Volume
(c) Variety
(d) Visualization
✅ Answer: (c) Variety

8. Which type of data can be easily measured and compared?


• a) Qualitative
• b) Quantitative
• c) Categorical
• d) Subjective
✅ Answer: b) Quantitative

9. What is a common example of primary data collection?


• a) Government census report
• b) Online news article
• c) Social media feed
• d) Face-to-face interview
✅ Answer: d) Face-to-face interview

10. Which algorithm helps in identifying unusual or inconsistent data entries?


• a) Regression
• b) Clustering
• c) Anomaly detection
• d) Classification
✅ Answer: c) Anomaly detection
11. Which of the following is NOT a characteristic of Big Data?
• a) Velocity
• b) Variety
• c) Volume
• d) Visibility
✅ Answer: d) Visibility

12. Which is an example of a numerical variable?


• a) Student name
• b) Nationality
• c) Height
• d) Favorite color
✅ Answer: c) Height

13. Which algorithm is used when the answer is in Yes/No format?


• a) Regression
• b) Binary Classification
• c) Clustering
• d) Anomaly Detection
✅ Answer: b) Binary Classification

14. Which question type involves identifying unexpected records in data?


• a) Is this A or B?
• b) What should I do now?
• c) Is this odd?
• d) How much or how many?
✅ Answer: c) Is this odd?

15. What type of algorithm is used to group similar types of data?


• a) Binary Classification
• b) Clustering
• c) Regression
• d) Reinforcement Learning
✅ Answer: b) Clustering

16. “How many goals will a team score?” is an example of which type of question?
• a) Classification
• b) Regression
• c) Anomaly Detection
• d) Grouping
✅ Answer: b) Regression

17. Which question type is typically answered using reinforcement learning?


• a) What should I do now?
• b) Is this A or B?
• c) Is this odd?
• d) How much or how many?
✅ Answer: a) What should I do now?

V. Extra Important Two-Mark Questions

1. Differentiate between qualitative and quantitative data with examples.

S. No. | Qualitative Data | Quantitative Data
1 | Describes non-numeric characteristics (e.g., color, opinion) | Describes numeric and measurable values (e.g., height, weight)
2 | Used to categorize or label data | Used to calculate and measure data
3 | Cannot be measured numerically | Can be measured numerically
4 | Example: Hair color, vehicle type | Example: Age, salary, number of students

2. Explain the terms “primary source” and “secondary source” of data.

✅ Primary source involves direct collection (e.g., surveys), while secondary source refers to
data previously collected for other purposes (e.g., government reports).

3. What is anomaly detection? Give an example where it is used.

✅ Anomaly detection identifies unusual patterns. Example: Detecting fraud in banking
transactions.
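
To make the idea concrete, here is a minimal sketch of one simple way to flag unusual values: compare each value with the median and flag those that lie much further away than is typical. The withdrawal amounts, the function name `find_anomalies`, and the `tolerance` of 3 are all assumptions chosen for illustration; real fraud detection systems use far richer methods.

```python
from statistics import median

# Hypothetical withdrawal amounts, invented for illustration only.
withdrawals = [500, 700, 650, 600, 550, 20000]

def find_anomalies(values, tolerance=3):
    """Flag values unusually far from the median.

    'Unusually far' means more than `tolerance` times the typical
    distance from the median (the median absolute deviation).
    """
    centre = median(values)
    typical_distance = median([abs(v - centre) for v in values])
    if typical_distance == 0:
        # Most values are identical, so anything different stands out.
        return [v for v in values if v != centre]
    return [v for v in values if abs(v - centre) > tolerance * typical_distance]

print(find_anomalies(withdrawals))
```

With these sample amounts, only the very large withdrawal is far enough from the others to be flagged.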

4. List any two methods used to collect primary data.

✅ Two methods: 1) Online surveys, 2) Face-to-face interviews.

5. Define regression and give one real-life example where it is used.

✅ Regression is a technique to predict numerical values. Example: Predicting next week's
temperature.
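
A minimal sketch of the simplest kind of regression, a straight line fitted by least squares, is shown below. The temperature readings are invented and deliberately tidy, and the names `fit_line` and `predict` are hypothetical; real weather forecasts rely on much more data and more capable models.

```python
# Hypothetical readings, invented for illustration: day number and temperature in °C.
days = [1, 2, 3, 4, 5]
temperatures = [30.0, 30.5, 31.0, 31.5, 32.0]

def fit_line(xs, ys):
    """Least-squares straight line: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # How x and y vary together, divided by how much x varies on its own.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def predict(x, slope, intercept):
    """Numeric prediction for a new x value using the fitted line."""
    return slope * x + intercept

slope, intercept = fit_line(days, temperatures)
print(predict(8, slope, intercept))  # estimated temperature on day 8
```

The output is a number rather than a category, which is what separates regression from classification.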

6. Explain how clustering is used to group data.

✅ Clustering groups similar data points together. Example: Grouping customers based on
purchasing behavior.
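
As an illustration, the sketch below groups customers by monthly spending using a very small version of k-means, one common clustering method. The chapter does not name a specific algorithm, so this choice is an assumption, as are the spending figures and the function name `k_means_1d`.

```python
# Hypothetical monthly spending per customer, invented for illustration only.
monthly_spend = [200, 250, 220, 900, 950, 1000]

def k_means_1d(values, k, iterations=10):
    """Group numbers into k clusters (k must be at least 2).

    Repeats two steps: assign each value to its nearest centre,
    then move each centre to the average of its group.
    """
    ordered = sorted(values)
    # Start with centres spread evenly across the sorted values.
    centres = [ordered[i * (len(ordered) - 1) // (k - 1)] for i in range(k)]
    groups = [[] for _ in range(k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centres[i]))
            groups[nearest].append(v)
        # An empty group keeps its old centre.
        centres = [sum(g) / len(g) if g else centres[i]
                   for i, g in enumerate(groups)]
    return groups

print(k_means_1d(monthly_spend, 2))
```

No labels such as "low spender" or "high spender" are given in advance; the groups emerge from how close the values are to one another.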

7. What are the three main characteristics (3Vs) of Big Data? Explain any two.

✅ 1) Volume – large amount of data, 2) Variety – different types of data, 3) Velocity – speed
of data generation. Example: Social media data.

8. Give an example where binary classification is used in everyday life.


✅ Example: Email spam detection (Spam/Not Spam).

9. How does the variety of Big Data make it more complex to handle?

✅ Big Data includes text, audio, video, etc., requiring advanced tools for processing.

10. Why is velocity an important aspect of Big Data? Give a suitable example.

✅ Velocity shows the speed of data creation. Example: Twitter generates thousands of
tweets per second.

VI. Extra Important Four-Mark Questions

1. Explain the 3Vs of Big Data (Volume, Variety, Velocity) with suitable real-life
examples.

Big Data is defined by three main characteristics known as the 3Vs:

1. Volume: It refers to the massive amount of data generated every second. For example,
social media platforms like Facebook and Instagram store petabytes of data in the form of
posts, messages, and user activity.

2. Variety: Big Data comes in multiple formats including text, audio, video, images, and
sensor data. For instance, YouTube handles various video formats, and WhatsApp deals
with text, images, voice notes, and documents.

3. Velocity: This means the speed at which data is created and processed. Twitter, for
example, sees thousands of tweets every second. Companies use real-time processing to
analyze this data instantly for trends and feedback.

2. Describe different types of data with examples. Include qualitative, quantitative,
categorical, and numerical data.

There are different types of data used in data science:

1. Qualitative Data: This type describes characteristics that cannot be measured
numerically. Example: A person’s hair color or customer opinions.

2. Quantitative Data: These are numeric and measurable data points. Example: A student’s
test scores or a person's weight.

3. Categorical Data: These represent data grouped into categories. Example: Types of fruits
(apple, banana, orange).

4. Numerical Data: This data represents actual numbers and is used for calculations.
Example: Number of students in a class or number of pages in a book.
3. How are different types of questions used in data analysis? Explain with examples.

In data analysis, we ask specific types of questions that determine which algorithm to use:

1. Binary Classification – Questions with two possible outcomes. Example: Will a customer
buy this product? (Yes/No)

2. Anomaly Detection – Identifies outliers in data. Example: A sudden large withdrawal
from a bank account can be flagged as fraud.

3. Regression – Predicts numerical values. Example: Estimating the price of a house based
on location and size.

4. Clustering – Groups similar data points. Example: Categorizing customers based on
shopping behavior.

5. Reinforcement Learning – Based on decision-making and feedback. Example: A self-driving
car learns to stop at red lights based on programmed rules and trial-and-error learning.
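
The link between question type and algorithm family can be written down as a simple lookup. This is only a study aid, not an analysis tool: the question wording follows the MCQs earlier in this chapter, except the clustering entry, whose wording is an assumption because the chapter does not give one. The names `ALGORITHM_FOR_QUESTION` and `suggest_algorithm` are hypothetical.

```python
# Question types mapped to the algorithm family that answers them.
ALGORITHM_FOR_QUESTION = {
    "Is this A or B?": "Binary classification",
    "Is this odd?": "Anomaly detection",
    "How much or how many?": "Regression",
    "Which items are similar?": "Clustering",  # assumed wording, not from the chapter
    "What should I do now?": "Reinforcement learning",
}

def suggest_algorithm(question_type):
    """Look up the algorithm family for a question type."""
    return ALGORITHM_FOR_QUESTION.get(question_type, "Unknown question type")

print(suggest_algorithm("How much or how many?"))
print(suggest_algorithm("Is this odd?"))
```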

4. Discuss primary and secondary sources of data collection. Give two examples for
each.

Data can be collected through primary or secondary sources:

1. Primary Data: This is data collected directly from the source for the first time. Examples:
- Conducting surveys to gather opinions on school uniforms.
- Holding face-to-face interviews to collect health-related information from patients.

2. Secondary Data: This is data that was already collected by someone else, often for another purpose, and is reused for analysis. Examples:
- Reading government census reports to understand population trends.
- Using social media analytics to study customer engagement trends.

5. What is Big Data? Mention at least four sectors where Big Data is used and explain
how it benefits each.

Big Data refers to extremely large datasets that are difficult to process using traditional
tools. It requires advanced tools and algorithms. Some key sectors where Big Data is used:

1. Retail – Companies like Amazon use Big Data to recommend products based on customer
browsing and purchase history.

2. Healthcare – Big Data helps monitor patient conditions and track disease spread, such as
during the COVID-19 pandemic.

3. Social Media – Platforms like Facebook and Instagram analyze user activity to
personalize content and advertisements.

4. Sports – Teams use sensor data during training and matches to monitor athlete
performance and plan strategies for improvement.
