Data Sciences
(AC5072)
Objective: The objective of this course is to impart necessary knowledge of the
nathematical foundations needed for data science and develop programming skills
'equired to build data science applications.
Unit Topic
I lntroduction to Data Science: Concept of Data Science, Traits of Big data,
Web Scraping, Analysis vs Reporting.
II lntroduction to Programming Tools for Data Science : Toolkits usinp
)ython: Matplotlib, NumPy, Scikit-learn, NLTK, Visualizing Data: Bar Charts
-ine Charts, Scatterplots, Working with data: Reading Files, Scraping the Web
Jsing APIs (Example: Using the Twitter APIs), Cleaning and Munging
Vlanipulatin g Data, Rescaling, Dimensionality Reduction
III Mathematical Foundations: Linear Algebra: Vectors, Matrices, Statistics
Describing a Single Set of Data, Correlation, Simpson's Paradox, Correlatior
rnd Causation, Probability: Dependence and Independence, Conditiona
?robability, Bayes's Theorem, Random Variables, Continuous Distributions
fhe Normal Distribution, the Central Limit Theorem,Hypothesis and Inference:
Itatistical Hypothesis Testing, Confidence Intervals, Phacking, Bayesiar
.nference
IV Machine Learning: Overview of Machine learning concepts - Over fitting anc
.rain/test splits, Types of Machine learning * Supervised, Unsupervised,
Reinforced learning, Introduction to Bayes Theorem, Linear Regression- model
lssumptions, regularization (lasso, ridge, elastic net), Classification anc
Regression algorithms- Naive Bayes, K-Nearest Neighbors, logistic regression,
;upport vector machines (SVM), decision trees, and random forest,
llassification Errors, Analysis of Time Series- Linear Systems Analysis,
tlonlinear Dynamics, Rule Induction, Neural Networks- Learning And
Seneralization, Overview of Deep Learning.
v lase Studies of Data Science Application: Weather forecasting, Stock marker
rrediction, Object recognition, Real Time Sentiment Analysis.
References:
1. Joel Grus, "Data Science from Scratch: First Principles with Python", O'Reilly Media
2. Aurdlien G6ron, "Hands-On Machine Leaming with Scikit-Learn and Tensor Flow:
Concepts, Tools, and Techniques to Build Intelligent Systems", lst Edition, O'Reilly Media
3. Jain V.K., "Data Sciences", Khanna Publishing House, Delhi.