0% found this document useful (0 votes)

56 views6 pages

Class - X AI (Part-B Unit-4)

Uploaded by

krishnachitkara88

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

56 views6 pages

Class - X AI (Part-B Unit-4)

Uploaded by

krishnachitkara88

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Holy Ganges Public School

Class:- X

Subject:- Al
Part-B

Unit-4 Statistical Data Concepts & Its Applications

Notes:
Data Science:- Data Science is a deep study of the Massive amount of data, which
involves extracting meaningful insights from raw, structured and unstructured
data.

Data science is a concept to unify statistics, data analysis, machine learning and
their related methods in order to understand and analyse actual phenomena with
data.

Data science deals with all types of Data i.e., structure data, unstructured data,
and semi-structured data.
Types of Data:
1. Structured Data:- Structured Data is a well defined set of data values.
Structured Data is typically stored in a relational database.
Examples:- Records in Database, spreadsheets, etc.
2. Unstructured Data:- Unstructured data is information that either does not have
a predefined data model or is not organised in a predefined manner.
Examples:- Online search results returned by the search engines, etc.
3. Semi-structured Data:- Semi- Structured Data refers to what would normally be
considered unstructured data, but that also has metadata that identifies certain
characteristics.

Need for Data Science:- Some main reasons for using Data Science technology
are:

1. We can convert the massive amount of raw data and unstructured data into
meaningful insights.
2. It handle the huge amount of data are using Data Science algorithms for better
customer experience.
3. Data Science is working for automating transportation such as creating a self
driving car.
4. Data Science can help in different prediction such as various surveys, elections,
etc

Tools for Data Science:

1. Data Analysis Tools:- R, Python, Jupyter, Excel,etc.
2. Data Warehousing:- ETL, SQL, AWS Redshift, etc.
3. Data Visualisation:- R, Jupyter, Tableau, Cognos, etc.
4. Machine Learning Tools:- Spark, Mahout, Azure ML studio.
Application of data science

1. Internet Search - Nowadays search engines are using data science to know
what users want to search on the internet and also search engines want to know
that the searching information is useful for users or not, some of the search
engines using data science are Google, Yahoo, Bing and DuckDuckGo etc.
2. Targeted Advertising - Nowadays people are using the internet; digital
advertising can be shown to only specific people based on their interests. This is
the reason why digital ads have been able to get a much higher CTR (Click
Through Rate) than traditional advertisements. They can be targeted based on a
user's past behavior.
3. Website Recommendations - Website recommendations help the users to find
relevant products from billions of products available on e-commerce websites. A
lot of companies promote their products on e-commerce websites based on the
interests and relevance of the users. Internet giants like Amazon, Twitter, Google
Play, Netflix, Linkedln, IMDb, and many more use this system to improve the user
experience. The recommendations are made based on previous search results for
a user.

4. Genetics & Genomics - Data science techniques allow integration of different

kinds of data with genomic data in disease research, which provides a deeper
understanding of genetic issues in reactions to particular drugs and diseases. As
soon as we acquire reliable personal genome data, we will achieve a deeper
understanding of human DNA.
5. Finance - Data science also plays a crucial role in the finance sector. Data
science can help banks to identify the fraud and risk of losses. Nowadays, the
finance sector wants to identify and analyze the risk of loss automatically, here
data science can play a crucial role in identifying the risk factor of losses in the
banking sector. Data science can also examine the past behavior of the stock
market and make predictions for future outcomes.
6. Health Care- Nowadays many health industries use data science for identifying
tumors, medical related image analysis, Patient health record maintenance,
pharmaceutical development, predictive diagnosis etc. Data science also help the
hospital to make more accurate predictions which reduce the rate of treatment
failure.

7. Airline Route Planning:- With the help of Data Science Airline companies can
1. Predict delay in flights.
2. Decide which class of airplanes to buy.
3. Effectively drive customer loyalty programs.
4. It decide for directly land into the destination.
8. Image Recognition and Speech Recognition:- When we upload an image on
Facebook and start getting the suggestions to tag to our friends, this automatic
tagging suggestion uses image recognition algorithm, which is a part of Data
Science.

When we say something using "OK Google, Siri, Cortana", etc. these devices
responsed as per voice control and this has become possible through speech
recognition algorithm.
Define High-Code, Low-Code and No-Code Al Tool :
High-Code - High code development refers to traditional software development
where programmers write code manually using programming languages like Java,
Python, C# etc. High-code is also known as custom-code.
Low-Code Al - The person has some coding knowledge to create Al applications
with minimum coding. Low-code users have some programming skills, and they
can build their own applications. Low-code Al users can also use a drag-and-drop
interface to build the components of Al.
Features of Low-Code Al:
1. Pre-built Components.
2. Code customisation.

3. Simplified Al pipelines.
4. Integration with code.

5. Visual Development Environment.

No-Code Al - It is a tool and platform where the users can build Al applications
without writing any code. No-Code Al uses a drag-and-drop interface to build the
components of Al and make it easy for the people who do not have a technical
background.
Features of No-Code Al:
1. User friendly Interface.
2. Pre-built Models.

3. Automated Workflow.

4. Integration Capabilities.
5. No Programing knowledge Required.
Disadvantages of No-Code Tools:
1. Lack of flexibility.
2. Automation Bias.

3. Security Issues.
Some No-Code Tools:
Azure Machine Learning:- It is a Cloud based service provided by Microsoft
released in July 2014. It aims to simplify ML processes.
Google Cloud AutoML:- It is a suit of machine learning tools and services provided
by Google Cloud released in January 2018.
Apple CreatML:- developed by Apple Inc. It specially designed for MacOS and ioS
platforms.
Microsoft Lobe:- Developed by Microsoft in 2015, Love helps with no data science
experience import images and easily label them to create a machine learning
dataset.
Google Teachable Machine:- It is a Web based tool developed by Google released
in November 2017 that allows users to create machine learning models without
need of coding.

Orange Data Mining:- It is open source data visualisation, machine learning and
data mining toolkit released in Otober 1996.
What is the Orange data mining tool?
Orange is an open-source software of machine learning that helps to design based
on a no-code or lowcode framework. With the help of Orange software, you can
design the data visualization, predictive modeling, and analysis of the data. The
orange tool is easy to use and has a drag-and-drop interface, basically used in
education, research, business, etc.
Statistics in Al:

Statistics play an important role in analysis and dealing with data in data science.
Statistics is used for collecting, exploring, and analyzing the data. It also helps in
drawing conclusions from data.Learning Resources for Students
Important concepts in statistics
Statistical sampling
The entire set of raw data that you may have available for a test or experiment is
known as the population.
You cannot necessarily measure the patterns and trends across the entire
population.
Take a sample, or portion of the population, perform some computations.
Descriptive statistics:-Descriptive statistics refers to a set of methods used to
summarize and describe the main features of a dataset. Helps us to describe the
data and enables us to understand the underlying characteristics.
Mean - The central value, commonly called the average.
Median - The middle value if we ordered the data from low to high and divided it
exactly in half.
Mode - The value which occurs most often.

Standard Deviations:- This function is calculated on a given sample which is

available in the form of the list. It is the measure of dispersion of dataset from its
means.

Variance:- Variance is the squared deviation ofa variable from its means.
Data Visualisation:- It is the graphical representations of information and data by
using visual elements like charts, graphs, and maps.
Types of Problem during collection of data:
1.Erroneous Data:

Incorrect values
Invalid or Null Values

2. Missing Data
3. Outliners.!

Statistical Data
No ratings yet
Statistical Data
5 pages
Business Intelligence Unit 2 Engineering Notes
No ratings yet
Business Intelligence Unit 2 Engineering Notes
50 pages
Data Science Overview & Applications
No ratings yet
Data Science Overview & Applications
10 pages
UNIT - I Intro To DS
No ratings yet
UNIT - I Intro To DS
18 pages
DS R Unit-1
No ratings yet
DS R Unit-1
41 pages
Fdsa Unit 1
No ratings yet
Fdsa Unit 1
19 pages
BCA Lecture I
No ratings yet
BCA Lecture I
20 pages
Question Bank Syllbuswise
No ratings yet
Question Bank Syllbuswise
16 pages
PDF Data Science
No ratings yet
PDF Data Science
7 pages
Data Science - FYBCA-Sem-II
No ratings yet
Data Science - FYBCA-Sem-II
13 pages
UNIT IV Data Science
No ratings yet
UNIT IV Data Science
7 pages
Unit I TYCS DS
No ratings yet
Unit I TYCS DS
73 pages
Lecture 1 and 2 Powerpoints
No ratings yet
Lecture 1 and 2 Powerpoints
32 pages
Unit 1
No ratings yet
Unit 1
28 pages
Foundations of Data Science Course
No ratings yet
Foundations of Data Science Course
25 pages
Getting Started With Data Science: Grade VIII
No ratings yet
Getting Started With Data Science: Grade VIII
32 pages
Fundamentals of Data Science
100% (1)
Fundamentals of Data Science
53 pages
Notes Unit1 Unit2
No ratings yet
Notes Unit1 Unit2
83 pages
FODS Full Notes
No ratings yet
FODS Full Notes
217 pages
Introduction to Data Science Overview
No ratings yet
Introduction to Data Science Overview
17 pages
Data Science
No ratings yet
Data Science
244 pages
Ai Project 1
No ratings yet
Ai Project 1
21 pages
Data Science
No ratings yet
Data Science
10 pages
Unit I
No ratings yet
Unit I
29 pages
DS Unit 1 Chapter 1
No ratings yet
DS Unit 1 Chapter 1
40 pages
CH1 1
No ratings yet
CH1 1
41 pages
Data Science & Machine Learning Insights
No ratings yet
Data Science & Machine Learning Insights
29 pages
FDS CH1
No ratings yet
FDS CH1
4 pages
Fundamentals of Data Science Course
75% (4)
Fundamentals of Data Science Course
62 pages
The Field of Data Science
No ratings yet
The Field of Data Science
4 pages
Data Science Essentials for Learners
No ratings yet
Data Science Essentials for Learners
3 pages
3-Business Intelligence and Data Science-08!01!2024
No ratings yet
3-Business Intelligence and Data Science-08!01!2024
16 pages
Fundamentals of Data Science Course Overview
No ratings yet
Fundamentals of Data Science Course Overview
65 pages
Unit 2 Data Science
No ratings yet
Unit 2 Data Science
53 pages
Unit 1 - DS - 1st Year
No ratings yet
Unit 1 - DS - 1st Year
13 pages
Class X AI Unit 4: Data Science
No ratings yet
Class X AI Unit 4: Data Science
57 pages
Kadir
No ratings yet
Kadir
84 pages
Seminar On Data Science
100% (7)
Seminar On Data Science
25 pages
3961502-Class10 Ai Part B Unit3 Unit3 Data Science
No ratings yet
3961502-Class10 Ai Part B Unit3 Unit3 Data Science
15 pages
Introduction To Data Science Practical Approach With R and Python (B. Uma Maheswari, R. Sujatha) (Z-Library) - 8-28
No ratings yet
Introduction To Data Science Practical Approach With R and Python (B. Uma Maheswari, R. Sujatha) (Z-Library) - 8-28
21 pages
Introduction To Data Science and Big Data
No ratings yet
Introduction To Data Science and Big Data
124 pages
Mod 3
No ratings yet
Mod 3
96 pages
Introduction To Datasciecne
No ratings yet
Introduction To Datasciecne
50 pages
21css303t Datascience Unit 1 Notes
No ratings yet
21css303t Datascience Unit 1 Notes
246 pages
Unit I Introduction To Data Science and Big Data
No ratings yet
Unit I Introduction To Data Science and Big Data
121 pages
Ch7-Overview of Data Science-Part 1
No ratings yet
Ch7-Overview of Data Science-Part 1
37 pages
Ai Unit - 3
No ratings yet
Ai Unit - 3
26 pages
Data Final
No ratings yet
Data Final
4 pages
Introduction To Data Science
No ratings yet
Introduction To Data Science
15 pages
Applied - Data - Science MODULE 1 SEM8
No ratings yet
Applied - Data - Science MODULE 1 SEM8
16 pages
Introduction to Data Science Concepts
No ratings yet
Introduction to Data Science Concepts
161 pages
Unit 1-3
No ratings yet
Unit 1-3
39 pages
AD3491 UNIT 1 NOTES EduEngg
100% (1)
AD3491 UNIT 1 NOTES EduEngg
35 pages
Unit 1 Notes
No ratings yet
Unit 1 Notes
36 pages
Data Science
No ratings yet
Data Science
9 pages
DSBDA Unit 1
No ratings yet
DSBDA Unit 1
16 pages
Data Science Unit 1
No ratings yet
Data Science Unit 1
30 pages
Introduction To Data Science UNIT 1
No ratings yet
Introduction To Data Science UNIT 1
44 pages
Filipino Hierarchy
No ratings yet
Filipino Hierarchy
10 pages
Design Process and Perspectives M-1
No ratings yet
Design Process and Perspectives M-1
35 pages
Euthanasia: Ethical, Legal, and Societal Implications
No ratings yet
Euthanasia: Ethical, Legal, and Societal Implications
4 pages
Flowserve Pump
0% (1)
Flowserve Pump
5 pages
K.P.R. Sugar Mill Limited
No ratings yet
K.P.R. Sugar Mill Limited
7 pages
Tiny OSR v0.92
100% (1)
Tiny OSR v0.92
49 pages
Design and Construction Standards Volume
No ratings yet
Design and Construction Standards Volume
162 pages
Sylobloc 45 Tds
100% (1)
Sylobloc 45 Tds
3 pages
Philip Schultz Resume Final
No ratings yet
Philip Schultz Resume Final
1 page
13 Hospitality Business Ideas
No ratings yet
13 Hospitality Business Ideas
1 page
Letters
No ratings yet
Letters
3 pages
Soal PAT Bahasa Inggris Kelas X 2021/2022
No ratings yet
Soal PAT Bahasa Inggris Kelas X 2021/2022
6 pages
Fatima Raheel - Explanatory EssayTopic - Heist of The Century
No ratings yet
Fatima Raheel - Explanatory EssayTopic - Heist of The Century
2 pages
WCA Audit Document Checklist
No ratings yet
WCA Audit Document Checklist
2 pages
7 Standards of Textuality
No ratings yet
7 Standards of Textuality
43 pages
Faraday
No ratings yet
Faraday
4 pages
Garnishing Guide for Chefs
No ratings yet
Garnishing Guide for Chefs
160 pages
Trauma and Haemorrhage
No ratings yet
Trauma and Haemorrhage
42 pages
IB140063EN
No ratings yet
IB140063EN
45 pages
Week 5
No ratings yet
Week 5
8 pages
SSC LDC General Intelligence Paper 2023
No ratings yet
SSC LDC General Intelligence Paper 2023
6 pages
Company List by Din
No ratings yet
Company List by Din
216 pages
Notification
100% (1)
Notification
2 pages
How To Install Matchbox-Keyboard (En)
No ratings yet
How To Install Matchbox-Keyboard (En)
4 pages
DISTURBANCES IN ABSORPTION AND ELIMINATION Notes
100% (1)
DISTURBANCES IN ABSORPTION AND ELIMINATION Notes
7 pages
Literature Review - Exploring The Accessibility of Websites For Differently Abled Individuals in Sri La
No ratings yet
Literature Review - Exploring The Accessibility of Websites For Differently Abled Individuals in Sri La
4 pages
Other Sample of Evaluation Report
No ratings yet
Other Sample of Evaluation Report
28 pages
Weekly Log
No ratings yet
Weekly Log
4 pages
Honeywell International Stock Analysis Overview
No ratings yet
Honeywell International Stock Analysis Overview
8 pages
Quality Assuarance - Sop For Cleaning of Sparkler Filter
No ratings yet
Quality Assuarance - Sop For Cleaning of Sparkler Filter
5 pages

Class - X AI (Part-B Unit-4)

Uploaded by

Class - X AI (Part-B Unit-4)

Uploaded by

Holy Ganges Public School

Unit-4 Statistical Data Concepts & Its Applications

Tools for Data Science:

4. Genetics & Genomics - Data science techniques allow integration of different

5. Visual Development Environment.

Standard Deviations:- This function is calculated on a given sample which is

You might also like