Introduction
Kelyn Botina Trujillo, MSc
Outline
Overview
Machine Learning Definition
Machine Learning Algorithms
Cocktail Party Problem
Overview
SPAM
Overview
Machine Learning
Grew out of work in AI
New capability for computers
Examples:
Database mining
Large datasets from growth of automation/web.
E.g., Web click data, medical records, biology, engineering
Applications can’t program by hand.
E.g., Autonomous helicopter, handwriting recognition, most of Natural
Language Processing (NLP), Computer Vision.
Self-customizing programs
E.g., Amazon, Netflix product recommendations
Understanding human learning (brain, real AI)
Machine Learning definition
Arthur Samuel (1959). Machine Learning: Field of study
that gives computers the ability to learn without being
explicitly programmed.
Machine Learning definition
Arthur Samuel (1959). Machine Learning: Field
of study that gives computers the ability to
learn without being explicitly programmed.
Tom Mitchell (1998) Well-posed Learning
Problem: A computer program is said to learn
from experience E with respect to some task T
and some performance measure P, if its
performance on T, as measured by P, improves
with experience E.
Machine Learning definition
“A computer program is said to learn from experience E with
respect to some task T and some performance measure P, if its
performance on T, as measured by P, improves with experience
E.”
Example: Suppose your email program watches which
emails you do or do not mark as spam, and based on that
learns how to better filter spam. What is the task T in this
setting?
ML Algorithms
Machine learning algorithms:
Supervisedlearning
Unsupervised learning
Others: Reinforcement learning,
recommender systems.
ML Algorithms: Supervised Learning
Housing Price Prediction
400
300
Price ($)
200
in 1000’s
100
0
0 500 1000 1500 2000 2500
Size in feet2
Supervised Learning Regression: Predict continuous
“right answers” given valued output (price)
ML Algorithms: Supervised Learning
Breast Cancer (Malignant, Benign)
1(Y)
Malignant?
0(N)
Tumor Size
Tumor Size
Classification
Discrete valued
output (0 or 1)
ML Algorithms: Supervised Learning
- Clump Thickness
- Uniformity of Cell Size
Age
- Uniformity of Cell Shape
…
Tumor Size
¿Cómo manejar un número infinito de características?
¿Cómo almacenar un número infinito de datos en el PC cuando se sabe que se va a
llenar la memoria?
ML Algorithms: Supervised Learning
Example
You’re running a company, and you want to develop learning algorithms
to address each of two problems.
Problem 1: You have a large inventory of identical items. You want
to predict how many of these items will sell over the next 3
months.
Problem 2: You’d like software to examine individual customer
accounts, and for each account decide if it has been
hacked/compromised.
Should you treat these as classification or as regression problems?
a) Treat both as classification problems.
b) Treat problem 1 as a classification problem, problem 2 as a regression problem.
c) Treat problem 1 as a regression problem, problem 2 as a classification problem.
d) Treat both as regression problems.
ML Algorithms: Unsupervised Learning
ML Algorithms: Unsupervised Learning
Ejemplo:
[Link]
ML Algorithms: Unsupervised Learning
ML Algorithms: Unsupervised Learning
ML Algorithms: Unsupervised Learning
Genes
Individuals
[Source: Daphne Koller]
ML Algorithms: Unsupervised Learning
Of the following examples, which would you address
using an unsupervised learning algorithm? (Check all
that apply.)
More Applications
Cocktail Party Problem
Speaker #1 Microphone #1
Speaker #2 Microphone #2
Cocktail Party Problem
Microphone #1: Output #1:
Microphone #2: Output #2:
Microphone #1: Output #1:
Microphone #2: Output #2:
Cocktail Party Problem
Microphone #1: Output #1:
Microphone #2: Output #2:
[W,s,v] = svd((repmat(sum(x.*x,1),size(x,1),1).*x)*x');
“Cocktail Party Problem is not clustering, the method
employed is the main components analysis”