0% found this document useful (0 votes)

12 views13 pages

Course Structure

The document outlines an elective course on Speech and Natural Language Processing, covering foundational concepts, techniques, and applications in the field. Key topics include text processing, language models, automatic speech recognition (ASR), and text-to-speech synthesis (TTS). The course aims to equip students with essential skills in speech and text processing, with a structured syllabus and grading policy.

Uploaded by

kghosh.cs

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views13 pages

Course Structure

Uploaded by

kghosh.cs

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

Introduction to Speech &

Natural Language Processing

Lecture 0
Introducing Course
Krishnendu Ghosh & S R Mahadeva Prasanna
Course Details
Credit: 1 (1-0-0-0-1)

Course: Introduction to Speech and Natural Language Processing

Course Type: Elective

Course Overview
This course provides a foundational understanding of Speech
Processing and Natural Language Processing (NLP). It focuses on the
core principles, techniques, and applications. The course covers text
processing, language models, speech signal processing, automatic
speech recognition (ASR), and text-to-speech synthesis (TTS).
Course Objectives
By the end of this course, students will:

Understand basic text and speech processing techniques.

Learn about language models, and embeddings.

Explore speech feature extraction, ASR, and TTS models.

Syllabus
Unit 1: Lexical Processing in NLP (2 Hours)

What is NLP & Speech Processing

Text Normalization: Tokenization, Stemming, Lemmatization.

Language Modeling: N-grams, Word2Vec, GloVe.

Syllabus
Unit 2: Syntactic Processing in NLP (2 Hours)

Sequence Labeling for Parts-of-Speech (POS) Tagging and Named

Entity Recognition (NER).

Context-Free Grammars and Constituency Parsing. Dependency

Parsing.
Syllabus
Unit 3: Semantic Processing in NLP (2 Hours)

Word Sense Disambiguation (WSD) – Understanding word meanings in

context.

Semantic Role Labeling (SRL) – Assigning roles like agent, object, etc.
Coreference Resolution – Identifying references to the same entity.
Syllabus
Unit 4: Phonetics & Speech Signal Processing (3 Hours)

Basics of Speech Production & Phonetics.

Feature Extraction: MFCC, Spectrograms, PLP Features.

Deep Learning for Speech: Introduction to Wave2Vec, WavLM.

Syllabus
Unit 5: Automatic Speech Recognition (ASR) & Text-to-Speech (TTS) (3
Hours)

ASR Pipeline: Feature extraction → Acoustic modeling → Decoding.

HMM vs. DNN-based ASR systems.

End-to-End ASR Models: Wave2Vec, Whisper API.

TTS Pipeline: Text preprocessing → Prosody → Synthesis.

Deep Learning-based TTS Models: Tacotron, FastSpeech, WaveNet.

Challenges in Speech Synthesis (Low-Resource Languages, Prosody Control).
Text books
Speech and Language Processing – Daniel Jurafsky & James H. Martin
(3rd Edition, Draft Available Online)

Springer Handbook of Speech Processing - Jacob Benesty, M. Mohan

Sondhi, Yiteng Arden Huang
Reference books / Material
Natural Language Processing with Python (NLTK Book) – Steven Bird,
Ewan Klein, Edward Loper

Spoken Language Processing: A Guide to Theory, Algorithm, and

System Development – Xuedong Huang, Alex Acero, Hsiao-Wuen Hon

Fundamentals of Speech Recognition – Lawrence Rabiner, Biing-

Hwang Juang
Grading Policy
Theoretical Assignments (2) 14%
Quizzes (12) 36%
End-Term Exam 30%

Classroom Notes 5%
Activeness in Classes 5%
Attendance 5%
X-Factor (Originality, Creativity, or Initiative) 5%

224s 22 Lec1
No ratings yet
224s 22 Lec1
31 pages
Speech and Language Processing Course
No ratings yet
Speech and Language Processing Course
3 pages
CCS369 TEXT AND SPEECH ANALYSIS - Syllabus
No ratings yet
CCS369 TEXT AND SPEECH ANALYSIS - Syllabus
4 pages
Course Code: Course Title Credit CSDO7011 Atural Language Processing 3
No ratings yet
Course Code: Course Title Credit CSDO7011 Atural Language Processing 3
4 pages
NLP IT-7th Sem
No ratings yet
NLP IT-7th Sem
2 pages
Bcse409l Natural-Language-Processing TH 1.1 0 Bcse409l
No ratings yet
Bcse409l Natural-Language-Processing TH 1.1 0 Bcse409l
2 pages
NLP (1) (1) - Merged
No ratings yet
NLP (1) (1) - Merged
239 pages
Advanced Topics in Speech Processing (IT60116) : K Sreenivasa Rao School of Information Technology IIT Kharagpur
No ratings yet
Advanced Topics in Speech Processing (IT60116) : K Sreenivasa Rao School of Information Technology IIT Kharagpur
17 pages
Cse4022 Natural-Language-Processing Eth 1.0 37 Cse4022
No ratings yet
Cse4022 Natural-Language-Processing Eth 1.0 37 Cse4022
2 pages
Natural Language Processing Course Content
No ratings yet
Natural Language Processing Course Content
2 pages
Standfordsd Speech Recognition
No ratings yet
Standfordsd Speech Recognition
4 pages
NLP A
No ratings yet
NLP A
6 pages
Csa4006 Natural-Language-Processing LT 1.0 6 Csa4006
No ratings yet
Csa4006 Natural-Language-Processing LT 1.0 6 Csa4006
2 pages
Ccs369-Text and Speech Analysis
No ratings yet
Ccs369-Text and Speech Analysis
3 pages
Ai in Natural Language Processing
No ratings yet
Ai in Natural Language Processing
4 pages
Natural Language Processing Course Overview
No ratings yet
Natural Language Processing Course Overview
31 pages
Data Science: Text & Speech Analysis Course
No ratings yet
Data Science: Text & Speech Analysis Course
2 pages
CS-416 Natural Language Processing
No ratings yet
CS-416 Natural Language Processing
1 page
Introduction To Speech Processing
No ratings yet
Introduction To Speech Processing
38 pages
Natural Language Processing - Session 1 - Introduction
100% (1)
Natural Language Processing - Session 1 - Introduction
55 pages
TSA Book
No ratings yet
TSA Book
154 pages
Speech Processing
No ratings yet
Speech Processing
5 pages
Natural Language Processing With Python
No ratings yet
Natural Language Processing With Python
3 pages
Al3501 NLP
100% (1)
Al3501 NLP
2 pages
GBHRFTHRDF
No ratings yet
GBHRFTHRDF
3 pages
NLP Syallabus Elective
No ratings yet
NLP Syallabus Elective
3 pages
Syllabus NLP (UE19CS334)
No ratings yet
Syllabus NLP (UE19CS334)
2 pages
01CE0713 - Natural Language Processing 2022
No ratings yet
01CE0713 - Natural Language Processing 2022
4 pages
NLP Course Overview for MS Students
No ratings yet
NLP Course Overview for MS Students
4 pages
15CS421E - Natural Language Processing
No ratings yet
15CS421E - Natural Language Processing
2 pages
CS 388 NLP Course Syllabus
No ratings yet
CS 388 NLP Course Syllabus
1 page
NLP Course for CS & Linguistics Students
No ratings yet
NLP Course for CS & Linguistics Students
6 pages
Intro 2025
No ratings yet
Intro 2025
15 pages
Natural Language Processing Course Syllabus
No ratings yet
Natural Language Processing Course Syllabus
6 pages
17B1NCI731 - ML&NLP - CD - Odd - 25-26
No ratings yet
17B1NCI731 - ML&NLP - CD - Odd - 25-26
2 pages
Natural Language Processing Course
No ratings yet
Natural Language Processing Course
2 pages
MScIT Sem4
No ratings yet
MScIT Sem4
8 pages
NLP BAD613B FullNotes
No ratings yet
NLP BAD613B FullNotes
158 pages
ChatGPT-NLP Course Summary
No ratings yet
ChatGPT-NLP Course Summary
34 pages
COMP 473 Speech Language Processing1643697957
No ratings yet
COMP 473 Speech Language Processing1643697957
3 pages
Swe1017 NLP Syllabus
No ratings yet
Swe1017 NLP Syllabus
2 pages
Natural Language Processing Nanodegree
No ratings yet
Natural Language Processing Nanodegree
11 pages
NLP Semester 7
100% (1)
NLP Semester 7
1,072 pages
ME02023011
No ratings yet
ME02023011
3 pages
Lec-1 Introduction
No ratings yet
Lec-1 Introduction
68 pages
NLP
No ratings yet
NLP
2 pages
Natural Language Processing (Pe-2)
No ratings yet
Natural Language Processing (Pe-2)
2 pages
Lecture 1 Introduction
No ratings yet
Lecture 1 Introduction
57 pages
NLP & Text Analytics Course
No ratings yet
NLP & Text Analytics Course
4 pages
Natural Language Processing
No ratings yet
Natural Language Processing
77 pages
NLP Course for Engineering Students
No ratings yet
NLP Course for Engineering Students
30 pages
CMU NLP Online Course Overview
No ratings yet
CMU NLP Online Course Overview
13 pages
INT344
50% (2)
INT344
2 pages
ccs369 Ts A Syllabus
No ratings yet
ccs369 Ts A Syllabus
3 pages
Natural Language Processing Course Overview
No ratings yet
Natural Language Processing Course Overview
6 pages
NLP Course for AI & Data Science PG
No ratings yet
NLP Course for AI & Data Science PG
4 pages
SYLLABUS
No ratings yet
SYLLABUS
2 pages
Data Manipulation On-Line Documentation
No ratings yet
Data Manipulation On-Line Documentation
17 pages
QR Code Attendance System Study
No ratings yet
QR Code Attendance System Study
15 pages
Digital Lab Manual New PDF
No ratings yet
Digital Lab Manual New PDF
76 pages
23fmath1500 14
No ratings yet
23fmath1500 14
11 pages
XII 1st PRE BOARD QP Withsolution 2023
No ratings yet
XII 1st PRE BOARD QP Withsolution 2023
13 pages
Management Information System LIC - To Be Printed
No ratings yet
Management Information System LIC - To Be Printed
6 pages
Nintendo DSi Security Analysis
No ratings yet
Nintendo DSi Security Analysis
6 pages
Logcat
No ratings yet
Logcat
1,051 pages
Computer Engineering As A Discipline
100% (1)
Computer Engineering As A Discipline
16 pages
Computer Operation Cat 1
No ratings yet
Computer Operation Cat 1
2 pages
MAE Assignment
No ratings yet
MAE Assignment
7 pages
Waves Complete KeyGen Guide
No ratings yet
Waves Complete KeyGen Guide
2 pages
VI Ch-2 Computer Virus-4-07-2020
No ratings yet
VI Ch-2 Computer Virus-4-07-2020
14 pages
Logix Hot Backup Code Generator Tool
No ratings yet
Logix Hot Backup Code Generator Tool
3 pages
VISION 2022 Training Courses
No ratings yet
VISION 2022 Training Courses
2 pages
CS-208 Stack and Related Instructions in 8085 Microprocessor by B R MALI GPC JODHPUR
No ratings yet
CS-208 Stack and Related Instructions in 8085 Microprocessor by B R MALI GPC JODHPUR
7 pages
Samsung Galaxy Tab S8 Series Overview
No ratings yet
Samsung Galaxy Tab S8 Series Overview
4 pages
PKL PPC 1040H Operation Manual 03 - 2018
No ratings yet
PKL PPC 1040H Operation Manual 03 - 2018
151 pages
Roaming Network Engineer Resume
No ratings yet
Roaming Network Engineer Resume
1 page
Operator Station
No ratings yet
Operator Station
174 pages
P&W Apqp - Asqr01 Rev 14 2024-02-07 Final3
No ratings yet
P&W Apqp - Asqr01 Rev 14 2024-02-07 Final3
14 pages
The Language of New Media - Lev Manovich (2001)
No ratings yet
The Language of New Media - Lev Manovich (2001)
202 pages
App Unit 1 Notes
No ratings yet
App Unit 1 Notes
20 pages
RVITM SelectionList 2024-Batch 19th Sep2024 181 Candidates-2
No ratings yet
RVITM SelectionList 2024-Batch 19th Sep2024 181 Candidates-2
9 pages
Programming WCF Services 4th Edition Juval Löwy Instant Download
No ratings yet
Programming WCF Services 4th Edition Juval Löwy Instant Download
134 pages
XML B2B Setup HTTP Server v02
No ratings yet
XML B2B Setup HTTP Server v02
8 pages
COA-chapter 3 - Assembly Language
100% (1)
COA-chapter 3 - Assembly Language
49 pages
SE1592 Presentation 1592 Se1592 20project
No ratings yet
SE1592 Presentation 1592 Se1592 20project
16 pages
Programming Paradigms Overview
No ratings yet
Programming Paradigms Overview
7 pages
Python Student Marks Management
No ratings yet
Python Student Marks Management
6 pages

Course Structure

Uploaded by

Course Structure

Uploaded by

Introduction to Speech &

Natural Language Processing

Course: Introduction to Speech and Natural Language Processing

Course Type: Elective

Understand basic text and speech processing techniques.

Learn about language models, and embeddings.

Explore speech feature extraction, ASR, and TTS models.

What is NLP & Speech Processing

Text Normalization: Tokenization, Stemming, Lemmatization.

Language Modeling: N-grams, Word2Vec, GloVe.

Sequence Labeling for Parts-of-Speech (POS) Tagging and Named

Context-Free Grammars and Constituency Parsing. Dependency

Word Sense Disambiguation (WSD) – Understanding word meanings in

Basics of Speech Production & Phonetics.

Feature Extraction: MFCC, Spectrograms, PLP Features.

Deep Learning for Speech: Introduction to Wave2Vec, WavLM.

ASR Pipeline: Feature extraction → Acoustic modeling → Decoding.

End-to-End ASR Models: Wave2Vec, Whisper API.

TTS Pipeline: Text preprocessing → Prosody → Synthesis.

Deep Learning-based TTS Models: Tacotron, FastSpeech, WaveNet.

Springer Handbook of Speech Processing - Jacob Benesty, M. Mohan

Spoken Language Processing: A Guide to Theory, Algorithm, and

Fundamentals of Speech Recognition – Lawrence Rabiner, Biing-

You might also like