Types of Learning in ML

The document explores two significant machine learning paradigms: Representation Learning, which automates feature discovery from raw data, and Active Learning, which selectively queries human experts for labeling data. Together, these techniques enhance AI's understanding and efficiency in data utilization, leading to improved accuracy and reduced annotation burdens. Additionally, the document discusses Association Rule Learning, Ensemble Learning, and Regularization, emphasizing their roles in uncovering patterns and enhancing model interpretability and performance.


Types of Learning: Representation Learning & Active Learning
Explore two pivotal machine learning paradigms that are revolutionising
how artificial intelligence understands and interacts with data. Discover
their individual strengths and how their combined power propels the
future of intelligent systems.
Understanding Representation Learning
Representation learning is a cutting-edge machine learning technique where the system autonomously
discovers the most effective features or representations from raw data. This shifts the paradigm from manual
feature engineering to automated insight discovery.

Automated Feature Discovery
Focuses on learning optimal data encodings that significantly enhance task performance, moving beyond traditional, laborious manual feature engineering processes.

Unlocking Complex Data
Enables models to derive deeper insights from intricate data forms, such as high-dimensional images, diverse text corpora, or complex speech patterns, by identifying underlying structures.

Enhanced Model Comprehension
By learning optimal representations, models gain a more profound understanding of the data, leading to improved accuracy, efficiency, and robustness across various applications.
Key Paradigms of Representation Learning
Autoassociative Learning (Autoencoders)
These models learn to reconstruct their input, effectively compressing and
decompressing data. This process allows them to capture essential features,
such as spell checkers identifying common word patterns or images being
efficiently encoded for storage and retrieval.
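The compress-then-reconstruct idea can be sketched with a minimal linear autoencoder in NumPy. The 2-D toy data, single latent dimension, and training settings below are invented for illustration, not taken from the document:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: 2-D points lying near the line x2 = 2*x1, so a single
# latent feature can represent each point almost perfectly.
x1 = rng.normal(size=(200, 1))
X = np.hstack([x1, 2 * x1 + 0.05 * rng.normal(size=(200, 1))])

# Linear autoencoder: encode 2-D input to 1 latent value, decode back.
W_enc = rng.normal(scale=0.1, size=(2, 1))
W_dec = rng.normal(scale=0.1, size=(1, 2))

def reconstruction_loss(X, W_enc, W_dec):
    X_hat = (X @ W_enc) @ W_dec   # compress, then decompress
    return np.mean((X - X_hat) ** 2)

initial = reconstruction_loss(X, W_enc, W_dec)
lr = 0.01
for _ in range(2000):
    Z = X @ W_enc
    err = Z @ W_dec - X                          # reconstruction error
    grad_dec = (Z.T @ err) / len(X)              # gradient w.r.t. decoder
    grad_enc = (X.T @ (err @ W_dec.T)) / len(X)  # gradient w.r.t. encoder
    W_dec -= lr * grad_dec
    W_enc -= lr * grad_enc
final = reconstruction_loss(X, W_enc, W_dec)
```

Because the data is nearly one-dimensional, the single learned latent feature suffices to reconstruct both coordinates, which is exactly the "capture essential features" behaviour described above.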

Heteroassociative Learning
In contrast, heteroassociative learning maps input data to distinct target
labels or categories. A prime example is an image classification system
learning to associate specific visual features with the label 'dog' or 'cat'.
Neural networks are particularly adept at extracting these layered features.
Visualising the Representation Learning Process
The journey from raw data to actionable insights involves multiple layers of transformation, where each stage refines the representation, making it more meaningful for the final
task.

Raw Data
Unprocessed inputs from sensors and sources

Feature Extraction
Layered transformations producing abstract features

Prediction
Final output based on learned representations


Introducing Active Learning
Active learning is a sophisticated machine learning strategy where the algorithm intelligently
queries a human expert to label only the most informative data points. This selective approach
is crucial for optimising data acquisition and reducing annotation burdens.

Selective Data Querying
The core principle is to minimise the extensive cost and effort typically associated with labeling vast datasets by focusing solely on the most uncertain or inherently valuable samples.

Iterative Refinement Cycle
This process operates iteratively: the model undergoes initial training, then identifies and selects specific data points requiring labels. Human experts provide these labels, and the model subsequently retrains, progressively refining its understanding.
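The train-query-label-retrain cycle can be sketched as a toy uncertainty-sampling loop, where a scripted oracle stands in for the human expert. The 1-D threshold task, the 0.6 boundary, and the 15-query budget are all invented for illustration:

```python
import random

random.seed(0)

# Hidden concept known only to the "human expert": class 1 iff x >= 0.6.
def oracle(x):
    return 1 if x >= 0.6 else 0

pool = [random.random() for _ in range(200)]   # unlabeled pool
labeled = [(0.0, 0), (1.0, 1)]                 # tiny seed set

def fit_threshold(labeled):
    # Simplest consistent model: midpoint between the largest known
    # negative example and the smallest known positive example.
    zeros = [x for x, y in labeled if y == 0]
    ones = [x for x, y in labeled if y == 1]
    return (max(zeros) + min(ones)) / 2

for _ in range(15):                            # label budget: 15 queries
    t = fit_threshold(labeled)
    # Uncertainty sampling: query the pool point nearest the boundary.
    x = min(pool, key=lambda p: abs(p - t))
    pool.remove(x)
    labeled.append((x, oracle(x)))             # expert supplies the label

t = fit_threshold(labeled)
```

Each query lands near the current decision boundary, so the loop behaves like a binary search: a handful of targeted labels locates the boundary far faster than labelling the pool at random.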
Diverse Strategies in Active Learning
Active learning employs several distinct strategies to identify and select the most impactful data points for human annotation, each
tailored to different scenarios and data characteristics.

Pool-based Sampling
The model assesses an entire pool of unlabeled data, identifying and selecting samples with the highest uncertainty or diversity for expert labeling. This method is effective when a large, static pool of unlabeled data is available.

Stream-based Selective Sampling
Here, data points are evaluated sequentially in a continuous stream. The model makes real-time decisions on whether to query a label for each new incoming data point, suitable for high-throughput and dynamic environments.

Query by Committee
This advanced strategy involves multiple models (a 'committee') collectively voting on uncertain samples. The points where the committee exhibits the most disagreement are prioritised for human labeling, leveraging ensemble uncertainty for efficient data selection.
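Query by Committee can be illustrated with vote entropy over a hand-built committee. The four threshold models and the pool values below are invented; in practice each member would be trained on its own bootstrap sample or feature subset:

```python
from collections import Counter
from math import log

# Four 1-D threshold "models" (t=t pins each threshold in its lambda).
committee = [lambda x, t=t: int(x >= t) for t in (0.45, 0.55, 0.65, 0.75)]

def vote_entropy(x):
    # Higher entropy = the committee is more evenly split on x.
    votes = Counter(m(x) for m in committee)
    n = sum(votes.values())
    return -sum(c / n * log(c / n) for c in votes.values())

pool = [0.1, 0.5, 0.6, 0.7, 0.9]
query = max(pool, key=vote_entropy)   # most-disputed point goes to the expert
```

Points far from every threshold get unanimous votes (entropy 0), while the point splitting the committee 2-2 is selected for labeling.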
The Strategic Advantages of Active Learning
Significant Cost Reduction
By meticulously selecting only the most informative samples, active learning dramatically cuts down on the expensive and time-consuming process of manual data labeling, optimising resource allocation and accelerating project timelines.

Accelerated Accuracy & Generalisation
Models trained with active learning often achieve higher accuracy and improved generalisation capabilities much faster compared to passive learning approaches, as they learn from the most impactful and challenging data.

Crucial for Specialized Domains
This approach is indispensable in fields where labeled data is scarce, difficult to obtain, or prohibitively expensive, such as in advanced medical imaging, highly nuanced natural language processing, or complex autonomous driving scenarios.
Visualising the Active Learning Cycle
The iterative loop of active learning demonstrates a continuous feedback mechanism, where the model's performance is constantly refined through
targeted human input.

Machine Learning Model → Select Uncertain Samples → Human Expert Labeling → Retrain & Improve → (back to the model)

This intelligent human-in-the-loop approach ensures that every expert annotation contributes maximally to the model's learning and performance
enhancement.
Real-World Applications

Representation Learning in Action
Active Learning in Practice


A Symbiotic Future: Harnessing Both for Smarter AI

The synergy between Representation Learning and Active Learning heralds a new era for artificial intelligence, creating systems that
are both powerful and exceptionally efficient in their data utilisation.

Empowering Data Understanding
Representation learning fundamentally builds powerful, automated data understanding, extracting deep, meaningful insights from raw inputs without explicit human intervention, transforming complex data into usable features.

Optimising Human-in-the-Loop
Active learning critically optimises the precious human effort required for data labeling, ensuring that every expert annotation contributes maximally to model performance and accelerates the learning curve.

Accelerating AI Development
Together, these two paradigms significantly accelerate AI development, allowing for the creation of more intelligent systems with reduced data requirements and demonstrably higher accuracy. This collaborative approach defines the cutting edge of modern AI.
Unlocking Data Insights: Association Rule Learning, Ensemble Learning & Regularization-Based Learning
Explore how combining these powerful machine learning techniques
uncovers hidden patterns, enhances predictive accuracy, and delivers
interpretable models for real-world applications.
Understanding Association Rule Learning
Association Rule Learning is a rule-based machine learning technique designed to
uncover fascinating, often hidden relationships between variables within vast
datasets. It identifies strong rules discovered in data using various measures of
'interestingness'.

This technique is a cornerstone in applications such as market basket analysis, where it suggests cross-selling opportunities, powers recommendation systems by understanding user preferences, and aids in fraud detection by highlighting unusual transaction patterns.

Key metrics like Support (how often items appear), Confidence (how often a rule is true), and Lift (how much more likely items appear together than by
chance) are critical for quantifying a rule's strength and practical relevance.
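All three metrics are straightforward to compute by hand; a small sketch over invented transactions (the baskets and items below are illustrative):

```python
# Five invented transactions; each is the set of items in one basket.
transactions = [
    {"bread", "milk"},
    {"bread", "milk", "eggs"},
    {"butter"},
    {"milk", "eggs"},
    {"bread", "milk", "butter"},
]
n = len(transactions)

def support(items):
    # Fraction of transactions containing every item in the set.
    return sum(items <= t for t in transactions) / n

# Rule: {bread} -> {milk}
sup_bread = support({"bread"})          # 3/5
sup_milk = support({"milk"})            # 4/5
sup_both = support({"bread", "milk"})   # 3/5
confidence = sup_both / sup_bread       # 1.0: every bread basket has milk
lift = confidence / sup_milk            # 1.25 > 1: stronger than chance
```

A lift above 1 means bread and milk co-occur more often than independence would predict, which is the "interestingness" signal the text describes.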
Key Algorithms for Association Rule Discovery
1. Apriori: A foundational algorithm that generates frequent itemsets using a support-confidence framework. It's best suited for small to medium-sized datasets due to its iterative candidate generation.

2. FP-Growth: Utilises a tree-based structure (Frequent Pattern Tree) for more efficient mining on large datasets. It avoids candidate generation, making it faster than Apriori.

3. Eclat: Employs a depth-first search approach to find frequent itemsets. While often faster than Apriori, it can be memory-intensive due to its vertical data format.

Consider the example: "80% of customers who buy Bread also buy Milk." If the 'Lift' metric for
this rule is greater than 1, it indicates a strong, non-random correlation, suggesting a valuable
insight for marketing strategies.
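Apriori's level-wise procedure — count candidates, keep those meeting minimum support, join the survivors into one-item-larger candidates — can be sketched as follows. This simplified version omits the subset-pruning step of the full algorithm, and the transactions are invented:

```python
from itertools import combinations

def apriori(transactions, min_support):
    # Level-wise search: count candidates, keep those meeting min_support,
    # join survivors into one-item-larger candidates, repeat.
    # (Full Apriori also prunes candidates with an infrequent subset.)
    n = len(transactions)
    frequent = {}
    level = list({frozenset([i]) for t in transactions for i in t})
    k = 1
    while level:
        counts = {c: sum(c <= t for t in transactions) for c in level}
        kept = {c: v / n for c, v in counts.items() if v / n >= min_support}
        frequent.update(kept)
        level = list({a | b for a, b in combinations(kept, 2)
                      if len(a | b) == k + 1})
        k += 1
    return frequent   # maps each frequent itemset to its support

transactions = [{"bread", "milk"}, {"bread", "milk", "eggs"},
                {"bread", "butter"}, {"milk", "eggs"}]
freq = apriori(transactions, min_support=0.5)
```

The key efficiency idea is visible in the loop: any itemset whose support falls below the threshold is never joined into a larger candidate, so whole branches of the search space are skipped.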
Introducing the Power of Ensemble Learning
Improved Accuracy
By combining predictions from multiple models, ensemble methods often achieve significantly higher accuracy than any single base learner.

Robustness
Ensembles are more resilient to noise and outliers in data, leading to more stable and reliable predictions.

Generalisation
They enhance the model's ability to perform well on unseen data, effectively reducing the risk of overfitting.

Ensemble Learning is a powerful meta-algorithm that ingeniously combines multiple simple models, known as 'base learners', to collectively improve predictive accuracy and enhance the overall robustness of a system.

Popular methods include Bagging (e.g., Random Forests), Boosting (e.g., AdaBoost, Gradient
Boosting), and Rule Ensembles, each with distinct approaches to model combination.
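As a sketch of the bagging idea behind Random Forests, here is majority voting over decision stumps trained on bootstrap resamples. The 1-D task, the 15% label noise, and the ensemble size are invented for illustration:

```python
import random

random.seed(1)

# Noisy 1-D data: true class is 1 for x >= 0.5, with 15% label noise.
def make_data(n):
    data = []
    for _ in range(n):
        x = random.random()
        y = int(x >= 0.5)
        if random.random() < 0.15:
            y = 1 - y
        data.append((x, y))
    return data

train = make_data(100)

def fit_stump(sample):
    # Base learner: the threshold (taken from the sample) with fewest
    # misclassifications under the rule "predict 1 iff x >= t".
    best_t, best_err = 0.5, float("inf")
    for t, _ in sample:
        err = sum((x >= t) != y for x, y in sample)
        if err < best_err:
            best_t, best_err = t, err
    return best_t

# Bagging: each stump sees its own bootstrap resample; the ensemble
# predicts by majority vote.
stumps = [fit_stump([random.choice(train) for _ in train]) for _ in range(25)]

def predict(x):
    votes = sum(x >= t for t in stumps)
    return int(votes > len(stumps) / 2)
```

Individual stumps wobble with the noise in their bootstrap samples, but the vote averages that variance away, which is the robustness benefit described above.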
Rule Ensemble Learning: The Best of Both Worlds
Rule Ensemble Learning represents a sophisticated approach that constructs
models by combining a set of ranked rules, typically derived from decision
trees. This methodology cleverly bridges the gap between the transparency of
traditional rule-based models and the formidable predictive power of ensemble
techniques.

A notable application demonstrated how this method facilitated reducing features from 39 to a more manageable 21 in supernova image classification, all whilst maintaining a high level of accuracy. This highlights its efficiency in complex scientific datasets.

Pioneering research by Friedman & Popescu introduced a rule ensemble method that integrates penalised regression for creating sparse, highly interpretable models, setting a benchmark for practical implementation.
Regularization in Ensemble Learning

Prevents Overfitting
Regularization techniques, such as L1 (Lasso) and L2 (Ridge), impose penalties on model complexity, effectively preventing models from becoming overly tailored to the training data.

Promotes Sparsity
Specifically, L1 regularization encourages sparsity by driving less important feature weights to zero, selecting only the most relevant rules or features for the final model.

Enhances Interpretability
By simplifying the model, regularization significantly improves its interpretability, making it easier for humans to understand how predictions are made.

Boosts Efficiency
Reduced complexity also translates into improved computational efficiency, especially crucial for large-scale datasets and real-time applications.

In rule ensemble methods, penalised regression is commonly employed to judiciously weigh and select rules, ensuring that only the most effective
and non-redundant rules contribute to the final prediction.
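The mechanism that drives weights exactly to zero is the L1 proximal ("soft-thresholding") step. Below is a sketch using ISTA, a standard lasso solver, as a stand-in for the penalised regression the text mentions; the rule-output matrix and coefficients are invented, with only two of six hypothetical rules actually driving the target:

```python
import numpy as np

rng = np.random.default_rng(0)

# Rule-output matrix: 6 hypothetical rules scored on 60 samples, built
# block-orthogonal so the effect of the L1 penalty is easy to see.
R = np.vstack([np.eye(6)] * 10)
y = 2.0 * R[:, 0] - 1.5 * R[:, 3] + 0.05 * rng.normal(size=60)

def soft_threshold(v, lam):
    # Proximal step of the L1 penalty: shrink every weight toward zero
    # and set small ones exactly to zero -- the source of sparsity.
    return np.sign(v) * np.maximum(np.abs(v) - lam, 0.0)

# ISTA (proximal gradient) for: 0.5*||y - R w||^2 + lam*||w||_1
w = np.zeros(6)
step = 1.0 / np.linalg.norm(R.T @ R, 2)   # 1 / largest eigenvalue
lam = 2.0
for _ in range(500):
    grad = R.T @ (R @ w - y)              # gradient of the squared error
    w = soft_threshold(w - step * grad, step * lam)

selected = np.nonzero(np.abs(w) > 1e-8)[0]   # rules kept in the final model
```

The four irrelevant rule weights end up exactly zero rather than merely small, which is what makes the resulting rule ensemble sparse and interpretable.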
Practical Impact & Use Cases

Market Basket Analysis
Revealing customer purchasing patterns to identify cross-selling opportunities and optimise store layouts.

Fraud Detection
Uncovering anomalous transaction sequences and patterns indicative of fraudulent activities, bolstering security systems.

Image Classification
Achieving efficient feature reduction and highly accurate predictions in complex scientific and medical image datasets.

Recommendation Systems
Providing more personalised and interpretable product or content suggestions to users, improving engagement.

These advanced learning techniques translate directly into tangible benefits, offering robust solutions across diverse industries.
Challenges & Alternatives
Challenges
Association rules can generate an overwhelming number of redundant or trivial rules without carefully set thresholds.
Ensemble models, if not carefully constructed, can become overly complex and lose their interpretability.
Selecting the optimal set of rules and regularization parameters can be a computationally intensive task.

Alternatives
Collaborative Filtering: Widely used in recommendation systems, focusing on user-item interactions.
Clustering: Grouping similar data points to discover underlying structures without predefined rules.
Deep Learning: Powerful neural network architectures, particularly effective for complex patterns in unstructured data, though often less interpretable.
Visualising the Integrated Learning Pipeline

Raw Data → Frequent Itemsets → Association Rules → Ensemble → Sparse Model

This diagram illustrates the journey from raw information to a refined, interpretable predictive model, highlighting the synergistic role of each learning phase.

Compared to Apriori's iterative candidate generation, FP-Growth demonstrates superior efficiency, especially with larger datasets, by avoiding redundant computations through its tree-based approach. Incorporating ensemble methods alongside regularization consistently leads to a marked improvement in model accuracy, while simultaneously preventing the pitfalls of overfitting.
Conclusion: Harnessing Rules, Ensembles & Regularization for Smarter Learning
By meticulously combining the pattern-finding capabilities of association
rules with the robust predictive power of ensemble learning and the
precision of regularization techniques, we unlock a new era of powerful
and interpretable models.

This integrated approach enables the discovery of deeply meaningful patterns within data, all whilst maintaining exemplary accuracy and computational simplicity. The future promises more scalable and inherently interpretable AI models, effectively bridging the gap between raw data insights and actionable, strategic decisions.
