0% found this document useful (0 votes)

34 views52 pages

Week 8 - MMML - Introduction

Uploaded by

Nemesis Ccc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

34 views52 pages

Week 8 - MMML - Introduction

Uploaded by

Nemesis Ccc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Cognitive Computing

Lecture 8
Introduction to Multimodal Machine Learning

Dr. Hany Hanafy Mahmoud

Table of Contents
• What is Multimodal cognitive system?
• Multimodal History
• Multimodal learning
• Core Technical Challenges
• Multimodal Research Task
What is Multimodal cognitive system?

• Science related to data with more sensory modalities

• This approach is rooted in the theoretical assumption that

cognitive performance can be influenced by other modes
of psychological processing

• E.g., perceptual, emotional, social, and responses to the

physical environment.

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
What is Multimodal cognitive system?

Multimodal Communicative Behaviors

• Verbal: What you see?

• Vocal: How you say it?

• Visual: How visual behavior looks like?

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
What is Multimodal cognitive system?

• Verbal:
• Lexicon (words), Syntax (POS), …
• Lexical analyzer: divides the text into words, phrases, and
paragraphs. It identifies the structure of words in sentences
• Semantic Analysis: it determines if the text has any meaning and
attempts to discover its true meaning.

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
What is Multimodal cognitive system?

• Vocal:
• Voice quality
• Intonation
• Vocal expressions (laugher,…)

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
What is Multimodal cognitive system?

• Visual:
• Gestures: head gestures, eye gestures
• Body language: arm movements, body posture, proxemics
• Eye contact and head gaze
• Facial expressions: smile, …

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
What is Multimodal cognitive system?
• Modality: is a certain type if information & data representation
format.
• Sensory Modality: primary forms of sensation as vision,
hearing, touch, ...
• Medium: is instrumentation for storing & communicating
information.

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
What is Multimodal cognitive system?
Multiple Communicates

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
What is Multimodal cognitive system?
Examples of Modalities
• NLP (text & speech)
• Visual (images or videos)
• Auditory (voice, sound or music)
• Smell, taste, touch
• Physiological Signals; Electrocardiogram, ECG, skin conductance
• Other Modalities: infrared images, depth images, fMRI

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
What is Multimodal cognitive system?
Different modalities: show diverse qualities, structures and
representations.

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
What is Multimodal cognitive system?

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
What is Multimodal cognitive system?
Connection types:
• Correlation: there is a statistical association / relationship bet. variables. It reflects
things that appear to behave in a “similar” way.
• Causation: a change in one variable causes a change in another variable. It is when
you say something causes something else to happen.
• Co-occurrence: refers to the frequency with which two / more entities (such as
words, phrases, or concepts) appear together within a given context, such as a
document. It is a measure of how often entities are found in proximity to each other,
indicating potential relationships or associations between them.
• Associations: refers to any relationship between two variables, including linear,
curvilinear, or non-linear relationships. Therefore, all correlations are associations, but
not all associations are correlations

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
What is Multimodal cognitive system?

Multi-Modal Machine Learning (MMML): is the study of

computer algorithms that learn and improve through the use
and experience of data from multiple modalities.

Artificial Intelligence for Multimodal data: are able to

demonstrate intelligence capabilities such as understanding,
reasoning, planning, …

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
What is Multimodal cognitive system?

New Modality
Representation

Prediction

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal Challenges
Core Multimodal Challenges

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal Challenges
1. Representation:
It reflects cross-modal interactions between individual elements
across different modalities

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal Challenges
1. Representation:

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal Challenges
2. Alignment:
Identifying cross-modal connections between all elements of
multiple modalities, building from the data structure.
Most modalities have internal structure with multiple elements

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal Challenges
2. Alignment:

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal Challenges
3. Reasoning:
Combine knowledge through multiple inferential steps,
exploiting multimodal alignment and problem structure

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal Challenges
4. Generation:
Learn a generative process to produce raw modalities that
reflects cross-modal interactions, structure and coherence.

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal Challenges
5. Transference:
Transfer knowledge between modalities to help target modality
which may be noisy or with limited resources.

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal Challenges
5. Transference:

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal Challenges
6. Quantification:
Theoretical study to better understand heterogeneity, cross-
modal interactions and the multimodal learning process.

https://www.youtube.com/watch?v=DPkwjgaRvyI&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW
Multimodal History
• Behavioral: 1970 till late 1980s
• Computational: late 1980s sill late 2000
• Interaction: 2000 to 2010
• Deep learning: 2010s until now

• Next era: ?

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
‫‪Multimodal History‬‬
‫‪• Behavioral: 1970 till late 1980s‬‬

‫اإليماءات هي في الواقع تفكير المتحدث في العمل ومكونات متكاملة للكالم‪ ،‬وليس مجرد مرافقات أو إضافات‬

‫‪https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s‬‬
Multimodal History
• Computational: late 1980s sill late 2000
The goal of affective computing is to create a computing
system capable of perceiving, recognizing, and
understanding human emotions and responding
intelligently, sensitively, and naturally, thus making human–
computer interaction more natural

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
Multimodal History
• Interaction: 2000 to 2010

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
Multimodal History
• Deep learning: 2010s until now

https://www.youtube.com/watch?v=VIq5r7mCAyw&t=4131s
Multimodal History
• 1990 to 202X Timeline:

https://www.youtube.com/watch?v=607EcmU9mFs&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW&index=3
Multimodal Research Task
Real-world tasks for MMML:
A. Affected recognition: recognize emotions, sentiment
B. Media description: image and video captioning
C. Multimodal QA: image and video QA, visual reasoning
D. Multimodal navigation: language guided navigation, autonomous
driving
E. Multimodal Dialog: ground dialog
F. Event recognition: action recognition and segmentation
G. Multimedia information retrieval: content based, cross media

https://www.youtube.com/watch?v=607EcmU9mFs&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW&index=3
Multimodal Research Task
• Dataset

Datasets: https://www.youtube.com/watch?v=607EcmU9mFs&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW&index=2

On GitHub: https://github.com/topics/multimodal-datasets

https://www.youtube.com/watch?v=607EcmU9mFs&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW&index=3
Multimodal Research Task
• Dataset

Datasets:
https://www.youtube.com/watch?v=607EcmU9mFs&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW&index=2
On GitHub:
https://github.com/topics/multimodal-datasets
https://www.youtube.com/watch?v=607EcmU9mFs&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW&index=3
Datasets Affect Recognition

https://www.youtube.com/watch?v=fBYu8I52nVM&list=UULFqlHIJTGYhiwQpNuPU5e2gg&index=54
Datasets Affect Recognition

https://www.youtube.com/watch?v=fBYu8I52nVM&list=UULFqlHIJTGYhiwQpNuPU5e2gg&index=54
Datasets Affect Recognition
Cross media Retrieval

Confounding variable is an unmeasured third variable that

influences both the supposed cause and the supposed effect.

https://www.youtube.com/watch?v=fBYu8I52nVM&list=UULFqlHIJTGYhiwQpNuPU5e2gg&index=54
Datasets Media Description

https://www.youtube.com/watch?v=fBYu8I52nVM&list=UULFqlHIJTGYhiwQpNuPU5e2gg&index=54
Datasets Multimedia QA

https://www.youtube.com/watch?v=fBYu8I52nVM&list=UULFqlHIJTGYhiwQpNuPU5e2gg&index=54
Datasets Media Description

https://www.youtube.com/watch?v=fBYu8I52nVM&list=UULFqlHIJTGYhiwQpNuPU5e2gg&index=54
Example 1: Select-Additive Learning
Sentiment classification task for verbal, acoustic, visual. It improves the
generalizability of trained neural networks for multimodal sentiment analysis

https://arxiv.org/abs/1609.05244
Confounding variables are factors that can influence both the independent and dependent variables in a study, leading to
biased or incorrect conclusions about the relationship between them. In machine learning, addressing confounding variables is
crucial for accurate causal inference and prediction.
https://www.youtube.com/watch?v=fBYu8I52nVM&list=UULFqlHIJTGYhiwQpNuPU5e2gg&index=54
Example 1: Select-Additive Learning

https://www.youtube.com/watch?v=fBYu8I52nVM&list=UULFqlHIJTGYhiwQpNuPU5e2gg&index=54
Example 2: World-level gated Fusion

Multimodal Sentiment Analysis: Gated Multimodal Embedding LSTM with Temporal Attention (GME-LSTM(A)) model
https://arxiv.org/abs/1802.00924

https://www.youtube.com/watch?v=fBYu8I52nVM&list=UULFqlHIJTGYhiwQpNuPU5e2gg&index=54
Example 2: World-level gated Fusion

GME: Gated Multimodal Embedding

https://www.youtube.com/watch?v=fBYu8I52nVM&list=UULFqlHIJTGYhiwQpNuPU5e2gg&index=54
Multimodal Research Task
Datasets Requirements for the project
• Dataset should have at least two modalities
• Teams of 2 or 3 students
• Stages:
• Pre-proposal: define dataset and research task
• Study related work to your selected research topic
• Experiment with Unimodal representations
• Implement & evaluate state-of-the-art model(s)
• Create GitHub repository & it is accessible by course staff
• Each report should include a description of the task from each team member.
• Make a video that present the robot in action
• Write a paper.
https://www.youtube.com/watch?v=607EcmU9mFs&list=PL-Fhd_vrvisMYs8A5j7sj8YW1wHhoJSmW&index=3

Multimodel Deep Learning
No ratings yet
Multimodel Deep Learning
92 pages
Multi Model
No ratings yet
Multi Model
36 pages
Multimodal Perception Basics
No ratings yet
Multimodal Perception Basics
23 pages
ICML2023 - Tutorial多模态机器学习Multimodal Machine Learning
No ratings yet
ICML2023 - Tutorial多模态机器学习Multimodal Machine Learning
120 pages
Lecture1 1-Introduction
No ratings yet
Lecture1 1-Introduction
52 pages
Perception, Reason, Think, and Plan
No ratings yet
Perception, Reason, Think, and Plan
75 pages
Multimodal Machine Learning: A Survey and Taxonomy: Tadas Baltru Saitis, Chaitanya Ahuja, and Louis-Philippe Morency
No ratings yet
Multimodal Machine Learning: A Survey and Taxonomy: Tadas Baltru Saitis, Chaitanya Ahuja, and Louis-Philippe Morency
20 pages
BHV 020
No ratings yet
BHV 020
17 pages
Perception Reason Think Plan
No ratings yet
Perception Reason Think Plan
91 pages
MBL Final
No ratings yet
MBL Final
6 pages
Author NameAffiliationauthor@Email
No ratings yet
Author NameAffiliationauthor@Email
8 pages
Multimodal Learning
No ratings yet
Multimodal Learning
29 pages
Baltrusaitis MMML Survey
No ratings yet
Baltrusaitis MMML Survey
20 pages
Multisensory Interaction Models
No ratings yet
Multisensory Interaction Models
10 pages
Document390961606seminar Report On Multimodal Deep Learni PDF
No ratings yet
Document390961606seminar Report On Multimodal Deep Learni PDF
16 pages
Multimodal Machine Learning Survey
No ratings yet
Multimodal Machine Learning Survey
21 pages
Lec4 - Multimodal
No ratings yet
Lec4 - Multimodal
53 pages
Multimodal AI Benchmark: WorldSense
No ratings yet
Multimodal AI Benchmark: WorldSense
15 pages
Session 15-1 Multimodal
No ratings yet
Session 15-1 Multimodal
82 pages
MMML Tutorial ACL2017
No ratings yet
MMML Tutorial ACL2017
221 pages
Multimodal Affective Computing Trends
No ratings yet
Multimodal Affective Computing Trends
26 pages
Multimodal Machine Learning A Survey and Taxonomy
No ratings yet
Multimodal Machine Learning A Survey and Taxonomy
21 pages
Multimodal Fusion For Cross-Platform Content Generation: Science and Research, Coimbatore
No ratings yet
Multimodal Fusion For Cross-Platform Content Generation: Science and Research, Coimbatore
9 pages
2023 Multimodal Large Language Models - A Survey
No ratings yet
2023 Multimodal Large Language Models - A Survey
10 pages
Multimodality in VR: A Survey: Daniel Martin, Sandra Malpica, Diego Gutierrez, Belen Masia, Ana Serrano
No ratings yet
Multimodality in VR: A Survey: Daniel Martin, Sandra Malpica, Diego Gutierrez, Belen Masia, Ana Serrano
35 pages
Deep Multimodal Representation Learning A Survey
No ratings yet
Deep Multimodal Representation Learning A Survey
22 pages
ConFEDE: Multimodal Sentiment Analysis
No ratings yet
ConFEDE: Multimodal Sentiment Analysis
14 pages
Multimodal Interaction - A Review
No ratings yet
Multimodal Interaction - A Review
7 pages
Lec6 - Crossmodal
No ratings yet
Lec6 - Crossmodal
51 pages
Multimodal Brain-Computer Interfaces: AI-powered Decoding Methodologies
No ratings yet
Multimodal Brain-Computer Interfaces: AI-powered Decoding Methodologies
26 pages
Survey of Multimodal Large Language Models
No ratings yet
Survey of Multimodal Large Language Models
15 pages
Recent Advances and Trends in Multimodal Deep Learning A Review
No ratings yet
Recent Advances and Trends in Multimodal Deep Learning A Review
35 pages
Recent Trends of Multimodal Affective Computing: A Survey From NLP Perspective
No ratings yet
Recent Trends of Multimodal Affective Computing: A Survey From NLP Perspective
26 pages
Understanding Multimodal LLMs
No ratings yet
Understanding Multimodal LLMs
6 pages
M Z & M B: A Standardized Toolkit For Multimodal Deep Learning
No ratings yet
M Z & M B: A Standardized Toolkit For Multimodal Deep Learning
7 pages
Second Presentation
No ratings yet
Second Presentation
26 pages
Tun Et Al. - 2024 - Resource-Efficient Federated Multimodal Learning Via Layer-Wise and Progressive Training
No ratings yet
Tun Et Al. - 2024 - Resource-Efficient Federated Multimodal Learning Via Layer-Wise and Progressive Training
14 pages
Principles of Multimodal System Design
No ratings yet
Principles of Multimodal System Design
5 pages
A Survey On Multi-Modal Summarization
No ratings yet
A Survey On Multi-Modal Summarization
37 pages
Artigo - The - Development - of - Multisensory - Processes
No ratings yet
Artigo - The - Development - of - Multisensory - Processes
16 pages
AI Beyond Text: Integrating Vision, Audio, and Language For Multimodal Learning
No ratings yet
AI Beyond Text: Integrating Vision, Audio, and Language For Multimodal Learning
7 pages
From The Lab To The Real World Affect Recognition Using Multiple Cues and Modalities
No ratings yet
From The Lab To The Real World Affect Recognition Using Multiple Cues and Modalities
37 pages
Lecture1 2-MultimodalResearchTasks
No ratings yet
Lecture1 2-MultimodalResearchTasks
46 pages
Multimodal Analysis of Social Signals
No ratings yet
Multimodal Analysis of Social Signals
16 pages
EEG-Enhanced Multimodal Emotion Recognition
No ratings yet
EEG-Enhanced Multimodal Emotion Recognition
4 pages
Visual Influences On Auditory Behavioral, Neural, and Perceptual Processes: A Review
No ratings yet
Visual Influences On Auditory Behavioral, Neural, and Perceptual Processes: A Review
22 pages
Lec8 - Large Multimodal Models
No ratings yet
Lec8 - Large Multimodal Models
45 pages
2023 M LLM
No ratings yet
2023 M LLM
11 pages
(1999) Ten Myths of Multimodal Interaction
No ratings yet
(1999) Ten Myths of Multimodal Interaction
8 pages
Multimodal Interaction - Lecture 5 - Next Generation User Interfaces (4018166FNR)
No ratings yet
Multimodal Interaction - Lecture 5 - Next Generation User Interfaces (4018166FNR)
46 pages
Embodied Cognition Explored
No ratings yet
Embodied Cognition Explored
58 pages
Presentation 4
No ratings yet
Presentation 4
71 pages
Sentiment Analysis for Medical Image Classification
No ratings yet
Sentiment Analysis for Medical Image Classification
20 pages
Universal Network
No ratings yet
Universal Network
18 pages
Fnbot 17 1084000
No ratings yet
Fnbot 17 1084000
21 pages
Multimodal Learning With Transformers - A Survey
No ratings yet
Multimodal Learning With Transformers - A Survey
23 pages
Multimodal Interfaces Ayta Naquila
No ratings yet
Multimodal Interfaces Ayta Naquila
28 pages
Filipino Psych Syllabus Template AY 2025 2026 1
No ratings yet
Filipino Psych Syllabus Template AY 2025 2026 1
8 pages
Case Study
No ratings yet
Case Study
8 pages
Final Research Paper
No ratings yet
Final Research Paper
22 pages
Cataract Visual Function Tests
No ratings yet
Cataract Visual Function Tests
3 pages
The Importance of Mental and Emotional Wellness at Workplace
No ratings yet
The Importance of Mental and Emotional Wellness at Workplace
44 pages
Social Stratification
No ratings yet
Social Stratification
5 pages
Anatomy Welcome Back One Pager
100% (1)
Anatomy Welcome Back One Pager
2 pages
Abstract
No ratings yet
Abstract
2 pages
(LN) The Gal Is Sitting Behind Me, and Loves Me - Volume 02 (JNCodex)
No ratings yet
(LN) The Gal Is Sitting Behind Me, and Loves Me - Volume 02 (JNCodex)
251 pages
30 Day Respect Husband
No ratings yet
30 Day Respect Husband
20 pages
Barbra 2
No ratings yet
Barbra 2
18 pages
Lesson Plan Draft
No ratings yet
Lesson Plan Draft
5 pages
The Stranger Initial Essay
No ratings yet
The Stranger Initial Essay
16 pages
Grade 6 Catch-Up Friday Activities
No ratings yet
Grade 6 Catch-Up Friday Activities
9 pages
10 Tips To Fight Depression
100% (1)
10 Tips To Fight Depression
2 pages
Organizational Culture Adaptability at CBE
No ratings yet
Organizational Culture Adaptability at CBE
34 pages
Effort Heuristic's Impact on Quality Judgment
No ratings yet
Effort Heuristic's Impact on Quality Judgment
20 pages
6 1 Unit
No ratings yet
6 1 Unit
15 pages
Agenda: Butler County Commissioners' Meeting
No ratings yet
Agenda: Butler County Commissioners' Meeting
5 pages
DLL - Science 4 - Q2 - W7
No ratings yet
DLL - Science 4 - Q2 - W7
4 pages
Final Reflection 2020
No ratings yet
Final Reflection 2020
3 pages
Diagnosis and Management of Idiopathic Normal-Pressure Hydrocephalus
100% (1)
Diagnosis and Management of Idiopathic Normal-Pressure Hydrocephalus
11 pages
Neal 1997 Reconsidering The Phases of Disaster - Ocred
No ratings yet
Neal 1997 Reconsidering The Phases of Disaster - Ocred
26 pages
Brand As Resource
No ratings yet
Brand As Resource
12 pages
FA2 PROJECT WORKS 2025-26 Classes 6-10
No ratings yet
FA2 PROJECT WORKS 2025-26 Classes 6-10
6 pages
LGBT Discrimination in Philippine Schools
No ratings yet
LGBT Discrimination in Philippine Schools
1 page
NCM 117J (Maladaptive Pattern of Behavior) : Overview of Psychiatric Mental Health Nursing Module 1/7
100% (3)
NCM 117J (Maladaptive Pattern of Behavior) : Overview of Psychiatric Mental Health Nursing Module 1/7
41 pages
Facebook Moderators Are Dying at Their Desks
No ratings yet
Facebook Moderators Are Dying at Their Desks
2 pages
Lacan Avec Peirce: A Semeiotic Approach To Lacanian Thought
No ratings yet
Lacan Avec Peirce: A Semeiotic Approach To Lacanian Thought
117 pages
Family & Community Health Theories
No ratings yet
Family & Community Health Theories
4 pages

Week 8 - MMML - Introduction

Uploaded by

Week 8 - MMML - Introduction

Uploaded by

Cognitive Computing

Dr. Hany Hanafy Mahmoud

• Science related to data with more sensory modalities

• This approach is rooted in the theoretical assumption that

• E.g., perceptual, emotional, social, and responses to the

Multimodal Communicative Behaviors

• Verbal: What you see?

• Vocal: How you say it?

• Visual: How visual behavior looks like?

Multi-Modal Machine Learning (MMML): is the study of

Artificial Intelligence for Multimodal data: are able to

Confounding variable is an unmeasured third variable that

GME: Gated Multimodal Embedding

You might also like