BASIC MACHINE LEARNING TERMS
Here are some basic terms that every machine learning engineer should be
familiar with:
1. **Algorithm**: A set of rules or procedures that a machine follows in order to
learn patterns from data.
2. **Training Data**: The dataset used to train the machine learning model.
3. **Test Data**: The dataset used to evaluate the accuracy and performance of the
trained model.
4. **Feature**: An individual measurable property or characteristic used as input
in a model. For example, height and weight can be features in a health-related
model.
5. **Label**: The output variable in supervised learning, which the model is
trained to predict.
6. **Supervised Learning**: A type of machine learning where the model is trained
on labeled data.
7. **Unsupervised Learning**: A type of machine learning where the model is trained
on unlabeled data and tries to find hidden patterns.
8. **Reinforcement Learning**: A type of learning where an agent learns to make
decisions by taking actions in an environment to maximize cumulative reward.
9. **Overfitting**: When a model performs well on training data but poorly on test
data, indicating it has learned the noise in the training data instead of the
underlying pattern.
10. **Underfitting**: When a model performs poorly on both training and test data,
indicating it has not captured the underlying pattern in the data.
11. **Regularization**: Techniques used to prevent overfitting by adding a penalty
for more complex models.
12. **Gradient Descent**: An optimization algorithm used to minimize the loss
function by iteratively adjusting the model parameters in the direction of the
negative gradient (see the sketch after this list).
13. **Hyperparameters**: Parameters whose values are set before the learning
process begins, such as learning rate and number of trees in a random forest.
14. **Confusion Matrix**: A table used to evaluate the performance of a
classification algorithm, showing the true positives, false positives, true
negatives, and false negatives (a short example computing these metrics follows
this list).
15. **ROC Curve**: Receiver Operating Characteristic curve, a plot of a
classifier's true positive rate against its false positive rate across
classification thresholds.
16. **Precision**: The ratio of true positive predictions to the total predicted
positives.
17. **Recall**: The ratio of true positive predictions to the total actual
positives.
18. **F1 Score**: The harmonic mean of precision and recall, used as a single
metric to balance them.
19. **AUC (Area Under the Curve)**: A performance metric for classification
problems, representing the area under the ROC curve.
20. **Neural Network**: A model built from layers of interconnected nodes
(neurons) that learns to recognize relationships in data, loosely inspired by the
structure of the human brain.
21. **Activation Function**: A function used in neural networks to introduce non-
linearity, such as ReLU, sigmoid, or tanh.
22. **Epoch**: One complete pass through the entire training dataset.
23. **Batch Size**: The number of training examples utilized in one iteration.
24. **Learning Rate**: A hyperparameter that controls how much to change the model
in response to the estimated error each time the model weights are updated.
25. **Cross-Validation**: A technique to evaluate the performance of a machine
learning model by repeatedly partitioning the data into training and validation
subsets (e.g., k folds) and averaging the results (see the example after this
list).
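To make gradient descent (item 12) concrete, here is a minimal sketch of batch
gradient descent for simple linear regression. The NumPy implementation, the
learning rate, and the synthetic data are illustrative assumptions rather than
anything prescribed by the list.

```python
import numpy as np

# Minimal batch gradient descent for simple linear regression (illustrative sketch).
# Model: y_hat = w * x + b; loss: mean squared error.
def gradient_descent(x, y, learning_rate=0.1, epochs=2000):
    w, b = 0.0, 0.0
    n = len(x)
    for _ in range(epochs):
        error = (w * x + b) - y
        grad_w = (2.0 / n) * np.dot(error, x)   # dLoss/dw
        grad_b = (2.0 / n) * error.sum()        # dLoss/db
        w -= learning_rate * grad_w             # step against the gradient
        b -= learning_rate * grad_b
    return w, b

# Synthetic data: y is roughly 3x + 1 plus a little noise.
rng = np.random.default_rng(0)
x = rng.uniform(0, 1, 200)
y = 3 * x + 1 + rng.normal(0, 0.1, 200)
print(gradient_descent(x, y))  # should be close to (3.0, 1.0)
```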
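The classification metrics above (items 14-19) are easiest to remember side by
side. Below is a small scikit-learn snippet that computes them on made-up labels;
the numbers have no significance beyond illustration.

```python
from sklearn.metrics import (confusion_matrix, precision_score, recall_score,
                             f1_score, roc_auc_score)

# Toy ground-truth labels, hard predictions, and predicted probabilities.
y_true  = [0, 0, 1, 1, 1, 0, 1, 0]
y_pred  = [0, 1, 1, 1, 0, 0, 1, 0]
y_score = [0.1, 0.6, 0.8, 0.9, 0.4, 0.2, 0.7, 0.3]

print(confusion_matrix(y_true, y_pred))                # rows = actual, cols = predicted
print("precision:", precision_score(y_true, y_pred))   # TP / (TP + FP)
print("recall:   ", recall_score(y_true, y_pred))      # TP / (TP + FN)
print("f1:       ", f1_score(y_true, y_pred))          # harmonic mean of the two
print("roc auc:  ", roc_auc_score(y_true, y_score))    # area under the ROC curve
```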
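Cross-validation (item 25) is usually a one-liner in practice. The sketch below
assumes scikit-learn and its bundled iris dataset purely for illustration.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# 5-fold cross-validation: train on 4 folds, evaluate on the held-out fold,
# and repeat so every fold serves as the validation set exactly once.
X, y = load_iris(return_X_y=True)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5)
print(scores, scores.mean())
```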
Here are more essential machine learning terms every engineer should know:
26. **Feature Engineering**: The process of selecting, modifying, and creating
features from raw data to improve the performance of a machine learning model.
27. **Feature Scaling**: The process of normalizing or standardizing features so
that those with large numeric ranges do not dominate the model. Common methods
include min-max scaling and z-score normalization (a scaling-plus-PCA sketch
follows this list).
28. **Dimensionality Reduction**: Techniques used to reduce the number of features
in a dataset while retaining important information. Principal Component Analysis
(PCA) and t-Distributed Stochastic Neighbor Embedding (t-SNE) are popular methods.
29. **Bias-Variance Tradeoff**: The balance between bias (error from overly
simplistic assumptions) and variance (error from sensitivity to fluctuations in
the training data). Achieving the right tradeoff is crucial for model
performance.
30. **Ensemble Learning**: Combining multiple models to improve performance. Common
ensemble methods include bagging, boosting, and stacking.
31. **Bagging (Bootstrap Aggregating)**: An ensemble method that trains multiple
models on different subsets of the data and averages their predictions to reduce
variance.
32. **Boosting**: An ensemble method that trains models sequentially, with each new
model focusing on the errors made by previous models. Examples include AdaBoost,
Gradient Boosting, and XGBoost.
33. **Stacking**: An ensemble method that combines the predictions of multiple
models using a meta-model, which learns to make the final prediction.
34. **Decision Tree**: A model that uses a tree-like structure to make decisions
based on feature values. Each internal node represents a feature, each branch
represents a decision rule, and each leaf node represents an outcome.
35. **Random Forest**: An ensemble method that combines multiple decision trees to
improve performance and reduce overfitting.
36. **Support Vector Machine (SVM)**: A supervised learning algorithm used for
classification and regression tasks. It finds the hyperplane that best separates
the classes in the feature space.
37. **K-Nearest Neighbors (KNN)**: A simple, instance-based learning algorithm that
classifies new data points based on the majority class of their k-nearest
neighbors.
38. **Naive Bayes**: A probabilistic classification algorithm based on Bayes'
theorem, assuming independence between features.
39. **Clustering**: An unsupervised learning technique that groups similar data
points together. K-means, hierarchical clustering, and DBSCAN are common clustering
algorithms.
40. **Principal Component Analysis (PCA)**: A dimensionality reduction technique
that transforms features into a set of linearly uncorrelated components, ordered by
the amount of variance they explain.
41. **t-SNE (t-Distributed Stochastic Neighbor Embedding)**: A dimensionality
reduction technique used for visualizing high-dimensional data by projecting it
into lower-dimensional space.
42. **Recurrent Neural Network (RNN)**: A type of neural network designed for
sequential data, where connections between nodes form a directed graph along a
sequence. Commonly used in natural language processing and time series analysis.
43. **Convolutional Neural Network (CNN)**: A type of neural network designed for
processing grid-like data, such as images. It uses convolutional layers to
automatically and adaptively learn spatial hierarchies of features.
44. **Transfer Learning**: A technique where a pre-trained model is adapted for a
new, but related task, allowing for faster training and improved performance with
less data.
45. **AutoML**: Automated Machine Learning, which aims to automate the end-to-end
process of applying machine learning to real-world problems.
46. **Generative Adversarial Network (GAN)**: A type of neural network consisting
of two models (generator and discriminator) that are trained simultaneously, with
the generator creating realistic data and the discriminator distinguishing between
real and generated data.
47. **Long Short-Term Memory (LSTM)**: A type of RNN designed to capture long-term
dependencies in sequential data by using memory cells and gating mechanisms to
control the flow of information.
48. **Gradient Boosting Machine (GBM)**: An ensemble learning method that builds
additive models in a forward stage-wise manner, optimizing for a differentiable
loss function. Examples include XGBoost and LightGBM.
49. **Bayesian Optimization**: A method for optimizing hyperparameters by building
a probabilistic model of the objective function and iteratively selecting the most
promising hyperparameters to evaluate.
50. **Hyperparameter Tuning**: The process of searching for the hyperparameter
values that give the best model performance. Common techniques include grid
search, random search, and Bayesian optimization (a grid-search sketch follows
this list).
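To see feature scaling (item 27) and PCA (item 40) working together, here is a
minimal scikit-learn pipeline sketch; the wine dataset and the choice of two
components are illustrative assumptions.

```python
from sklearn.datasets import load_wine
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Standardize each feature (z-score), then project onto the two principal
# components that explain the most variance.
X, _ = load_wine(return_X_y=True)
pipeline = make_pipeline(StandardScaler(), PCA(n_components=2))
X_reduced = pipeline.fit_transform(X)
print(X_reduced.shape)                                         # (178, 2)
print(pipeline.named_steps["pca"].explained_variance_ratio_)   # variance per component
```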
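Finally, hyperparameter tuning (item 50) often boils down to a search over
candidate values, scored with cross-validation. A minimal grid-search sketch for
a random forest, assuming scikit-learn and its bundled breast-cancer dataset:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Try every combination of the candidate hyperparameters below and keep the
# combination with the best 5-fold cross-validation score.
X, y = load_breast_cancer(return_X_y=True)
param_grid = {"n_estimators": [100, 300], "max_depth": [None, 5, 10]}
search = GridSearchCV(RandomForestClassifier(random_state=0), param_grid, cv=5)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```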