Algortihms
Supervised Machine Learning Unsupervised machine learning Reinforcemnet learning
Linear Regression k Means
Ridge &Lasco Hierarchial means
Elastic Net DB Scan
Logistic regression
Decision tree
Random Forest
Adaboost
xgboost
Simple Liner Regression
Machine learning is the field of building algorithms that can learn patterns by themselves without being programmed explicitly
In basic terms, ML is the process of training a piece of software, called a model, to make useful predictions or generate content from data.
ML systems fall into one or more of the following categories based on how they learn to make predictions or generate conten
Supervised learning
Unsupervised learning
Reinforcement learning
Generative AI
Supervised learning models can make predictions after seeing lots of data with the correct answers and then discovering the c
Unsupervised learning models make predictions by being given data that does not contain any correct answers.
Supervised machine learning is based on the following core concepts:
Data
Model
Training
Evaluating
Inference
Datasets are made up of individual examples that contain features and a label
Features are the values that a supervised model uses to predict the label/input variable
The label is the "answer," or the value we want the model to predict.
A dataset is characterized by its size and diversity. Size indicates the number of examples. Diversity indicates the range those e
Training
Before a supervised model can make predictions, it must be trained. To train a model, we give
the model a dataset with labeled examples. The model's goal is to work out the best solution for
predicting the labels from the features. The model finds the best solution by comparing its
predicted value to the label's actual value. Based on the difference between the predicted and
actual values—defined as the loss—the model gradually updates its solution. In other words,
the model learns the mathematical relationship between the features and the label so that it
can make the best predictions on unseen data.
ut being programmed explicitly
nerate content from data.
redictions or generate content:
ers and then discovering the connections between the elements in the data that produce the correct answers.
orrect answers.
A labeled example consists of one or more
features and a label. Labeled examples
are used during training.
An unlabeled example consists of one or
more features but no label. Unlabeled
examples are used during inference.
sity indicates the range those examples cover. Good datasets are both large and highly diverse
Loss
During the training of a supervised model,
a measure of how far a model's prediction
is from its label.
The two main data structures defined in the pandas package are DataFrame and Series
A DataFrame is a tabular data structure that is represented as a two-dimensional table. It is composed of rows, columns, index
also called as variables
observatio
n
posed of rows, columns, indexes, and cells. It is very similar to a sheet in Excel or a table in a database
DataFrame is a structured representation
observatio of some data organized by rows and
n columns. A row represents an observation
and each column contains the value of its
attributes.
Series: One-dimensional, like a single
column.
DataFrame: Two-dimensional, like a table
with rows and columns, composed of
multiple Series.