List of models to learn in Machine Learning
1. Linear Regression
2. Ridge Regression
3. K-Neighbors
4. SVM
5. Logistic Regression
6. Lasso
7. Random Forest
8. XGBoost
Topics to cover
1. ROC AUC curve (ROC vs. F1 -> give more priority to ROC)
2. All the metrics
3. Davies–Bouldin – clustering in sklearn – davies_bouldin_score
4. Clustering metrics
a. Davies–Bouldin (DBI compares cluster centroids, which is cheap pointwise work, whereas silhouette compares all pairs of points and takes more time. DBI favours convex clusters and tends to come out higher (worse) for density-based clusters, such as those from DBSCAN. A lower DBI is desirable.)
i. DBI formula (see the sketch after this list)
b. Silhouette
c. Elbow
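A sketch of the DBI formula referenced above (the standard definition, stated here since the notes leave it blank): for k clusters, let s_i be the average distance of points in cluster i to its centroid, and d_ij the distance between centroids i and j. Then

DBI = \frac{1}{k} \sum_{i=1}^{k} \max_{j \neq i} \frac{s_i + s_j}{d_{ij}}

A lower value means tighter, better-separated clusters.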
5. LIME and SHAP?
6. Hyperopt
7. Agglomerative
8. K-Means
9. NLP
a. LLMs do text generation
b. WSD – Word Sense Disambiguation (e.g. river bank vs. corporate bank)
c. Basic Structure
i. Featurising (feature extraction)
1. Each word becomes 1 token
2. Retain whatever is going to make the most sense
3. Stop word removal
4. Keep the important words which make the most sense
5. Stemming and lemmatization
a. Stemming (crude suffix stripping)
i. Studying -> "studi"
ii. Studies -> "studi"
b. Lemmatization
i. Lemma – the root/dictionary form of a word
ii. As a process it is more complicated
iii. Where stemming fails, lemmatization is the better approach: it follows grammar rules and maps words to dictionary lemmas instead of just stripping suffixes (see the sketch below).
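A minimal sketch of stemming vs. lemmatization on the examples above, assuming NLTK is installed and its wordnet data has been downloaded (the code is illustrative, not from these notes):

from nltk.stem import PorterStemmer, WordNetLemmatizer  # assumes: pip install nltk, nltk.download('wordnet')

stemmer = PorterStemmer()
lemmatizer = WordNetLemmatizer()
for word in ["studying", "studies"]:
    print(word, "->", stemmer.stem(word))                   # crude suffix stripping: both give "studi"
    print(word, "->", lemmatizer.lemmatize(word, pos="v"))  # dictionary lemma: both give "study"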
ii. Vectorising
1. It means mapping text to numeric vectors
2. Stop word removal
3. Word count frequency -> hits
4. The vocabulary is stored in hash maps, which makes lookups very fast
5. Lowercasing => normalise case (compare ASCII values)
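A minimal vectorising sketch with scikit-learn's CountVectorizer; the two-document corpus below is made up for illustration:

from sklearn.feature_extraction.text import CountVectorizer

corpus = ["the bank of the river", "the bank approved the loan"]  # toy corpus
vectorizer = CountVectorizer(lowercase=True, stop_words="english")
X = vectorizer.fit_transform(corpus)  # sparse matrix of word-count frequencies ("hits")
print(vectorizer.vocabulary_)         # word -> column index, stored in a hash map (dict)
print(X.toarray())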
iii. Evaluation metrics
1. Confusion matrix
2. Accuracy
3. KL divergence – for generation tasks
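A quick sketch of these metrics in scikit-learn/SciPy; the labels and distributions below are made up:

from sklearn.metrics import confusion_matrix, accuracy_score
from scipy.stats import entropy

y_true = [0, 1, 1, 0, 1]  # toy ground-truth labels
y_pred = [0, 1, 0, 0, 1]  # toy predictions
print(confusion_matrix(y_true, y_pred))  # rows = true class, columns = predicted class
print(accuracy_score(y_true, y_pred))    # fraction of correct predictions: 0.8 here
print(entropy([0.5, 0.5], [0.9, 0.1]))   # KL divergence between two toy distributions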
iv. 3 major task areas
1. Syntax Based (Tokenization, Stemming)
2. Deterministic Task – POS Tagging/Simple text classification
3. Generation Based
v. TOC for NLP (not sure if it is important)
vi. Deterministic involves no randomness
vii. Fitting
1. The fitted output is a sparse matrix
2. Type 1 and Type 2 errors (Tiwan)
3. TF-IDF score (the higher the score, the more important the word is)
viii. TF (Term Frequency)
1. TF("Samyukta", article 1) = (count of "Samyukta" in article 1) / (number of words in article 1)
ix. IDF (Inverse Document Frequency)
1. IDF("Samyukta") = log( (number of documents in the corpus) / (number of documents where the word "Samyukta" appears) )
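A plain-Python sketch of the two formulas above; "Samyukta" is the notes' example word and all counts are assumed for illustration:

import math

count_in_article = 3     # assumed: "Samyukta" appears 3 times in article 1
words_in_article = 100   # assumed: article 1 has 100 words in total
tf = count_in_article / words_in_article         # TF = 0.03

docs_in_corpus = 1000    # assumed corpus size
docs_with_word = 10      # assumed: "Samyukta" appears in 10 documents
idf = math.log(docs_in_corpus / docs_with_word)  # IDF = log(100) ~ 4.6

print(tf * idf)  # TF-IDF: the higher the score, the more important the word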
x. Transformers – 3 model families (encoder-only, decoder-only, encoder–decoder)
10. Study the official Parameter Grid
11. DBSCAN
Learn how to use GridSearchCV
Learn how to use RandomizedSearchCV
Learn all CV methods.
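A minimal GridSearchCV sketch on a toy ridge regression; RandomizedSearchCV takes the same arguments but samples parameter combinations at random (values below are illustrative):

from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV

X, y = make_regression(n_samples=200, n_features=5, random_state=0)  # toy data
param_grid = {"alpha": [0.1, 1.0, 10.0]}          # hyperparameter grid to search
search = GridSearchCV(Ridge(), param_grid, cv=5)  # 5-fold cross-validation per combination
search.fit(X, y)
print(search.best_params_, search.best_score_)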
Copy notes from all the notebooks
DBSCAN
DBSCAN is a density-based clustering algorithm used for unsupervised learning problems. It was designed to eliminate the problems K-Means has with nested (non-convex) data and high-dimensional data.
It has 3 important terms and 2 important hyperparameters (see the sketch after this list).
1. Terms
a. Core point:
A point that has at least minPts datapoints within its epsilon-neighbourhood; the points in its area can extend the cluster.
b. Non-core (border) point:
A point that does not have minPts datapoints within its epsilon-neighbourhood; it can belong to a cluster but cannot extend it.
c. Outliers/noise:
Datapoints that are not part of any cluster (not reachable from any core point).
2. Hyperparameters
a. minPts:
The minimum number of datapoints that must lie within a point's epsilon-neighbourhood for it to be considered a core point.
b. Epsilon:
The radius of the neighbourhood around a point.
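A minimal scikit-learn DBSCAN sketch tying the hyperparameters above to code; the eps and min_samples values are illustrative:

from sklearn.cluster import DBSCAN
from sklearn.datasets import make_moons

X, _ = make_moons(n_samples=200, noise=0.05, random_state=0)  # nested, non-convex toy data
model = DBSCAN(eps=0.3, min_samples=5).fit(X)  # eps = epsilon radius, min_samples = minPts
print(model.labels_)  # cluster index per point; -1 marks outliers/noise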