INTRODUCTION OF
DATA SCIENCE
Presented by:
1. Md. Fahimur Rahman (221-15-5953)
2. Md. Rakibul Islam (221-15-5829)
3. Mahfuzur Rahman (221-15-5270)
4. Md. Naimul Hasan (221-15-5601)
Content
● Introduction of Data Science
● Need for Data Science
● Data Science Components
● Tools for Data Science
● Applications of Data Science
● Conclusion
Introduction
● The simplest definition of data science is the extraction of actionable
insights from raw data.
● Data science is a deep study of the massive amount of data, which
involves extracting meaningful insights from raw, structured, and
unstructured data that is processed using the scientific method, different
technologies, and algorithms.
● Data science uses the most powerful hardware, programming systems,
and most efficient algorithms to solve the data related problems. It is the
future of artificial intelligence
Introduction
It uses techniques and theories drawn from
many fields within the context of mathematics,
statistics, computer science, domain
knowledge and information science.
An area that manages, manipulates, extracts,
and interprets knowledge from tremendous
amount of data.
Data science (DS) is a multidisciplinary field
of study with goal to address the challenges in
big data.
Data science principles apply to all data i.e. big
and small.
USES OF DATA SCIENCE
We can say that data science is all about:
• Asking the correct questions and analyzing the raw data.
• Modeling the data using various complex and efficient algorithms.
• Visualizing the data to get a better perspective.
• Understanding the data to make better decisions and finding the final
result.
NEED FOR DATA SCEINCE
DATA SCIENCE COMPONENT
The main components of Data Science are given below:
1. Statistics: Statistics is one of the most important components of data science.
Statistics is a way to collect and analyze the numerical data in a large amount and
finding meaningful insights from it.
2. Domain Expertise: In data science, domain expertise binds data science
together. Domain expertise means specialized knowledge or skills of a particular
area. In data science, there are various areas for which we need domain experts.
3. Data engineering: Data engineering is a part of data science, which involves
acquiring, storing, retrieving, and transforming the data. Data engineering also
includes metadata (data about data) to the data.
4. Visualization: Data visualization is meant by representing data in a visual
context so that people can easily understand the significance of data. Data
visualization makes it easy to access the huge amount of data in visuals.
5. Advanced computing: Heavy lifting of data science is advanced
computing. Advanced computing involves designing, writing, debugging,
and maintaining the source code of computer programs.
TOOLS OF DATA SCIENCE
Following are some tools required for data science:
Data Analysis tools: R, Python, Statistics, SAS, Jupyter, R Studio,
MATLAB, Excel, RapidMiner.
Data Warehousing: ETL, SQL, Hadoop, Informatica/Talend, AWS
Redshift
Data Visualization tools: R, Jupyter, Tableau, Cognos.
Machine learning tools: Spark, Mahout, Azure ML studio.
APPLICATION OF DATA SCIENCE
There are some common application areas of Data Science-
Fraud and Risk Detection
Healthcare
Internet Search
Targeted Advertising
Website Recommendations
Advanced Image Recognition
Speech Recognition
Airline Route Planning
Gaming
Augmented Reality
CONCLUSION
Data science has emerged as a powerful field that unlocks the secrets
hidden within data. By combining statistical analysis, computer science,
and domain expertise, data scientists transform raw information into
actionable insights. This newfound knowledge empowers businesses to
make data-driven decisions, solve complex problems, and predict future
trends. As the world continues to generate ever-increasing amounts of
data, the importance of data science will only continue to grow.
THANK YOU