LLM Training and Inference

Introduction to Large Language Models (LLMs)


● LLMs are distinguished by their neural network architecture and the vast scale of their training data, making them significantly larger than previous machine learning models.
● The term "large" refers to both the size of the neural network architecture and the scale of the text dataset used for training: LLMs are trained on internet-scale datasets, surpassing previous models in both parameter count and dataset volume.
● Model size and training-dataset size need to grow roughly in proportion; scaling one without the other gives diminishing returns, so the two are kept correlated for strong performance.

Understanding Language Modeling


● Language modeling is the core mechanism that underpins how modern-day LLMs work.
● The simplest implementation of language modeling is predicting the next word, given the sequence of words that appeared before it (see the sketch after this list).
● To predict the next word, LLMs need to understand rules of grammar, sentence construction,
and the way language is generally written.
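
A minimal sketch of this next-word-prediction idea in plain Python (the candidate words and probabilities below are invented for illustration; they do not come from a real model):

# Given the words seen so far, a language model scores candidate next
# words and picks the most likely one. These probabilities are made up.
context = "To predict the next"

candidate_probs = {"word": 0.71, "sentence": 0.12, "token": 0.09, "apple": 0.001}

next_word = max(candidate_probs, key=candidate_probs.get)
print(context, "...", next_word)   # -> To predict the next ... word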

Implementing Language Modeling: Predicting the Next Word


● LLMs are very good at predicting the next word given a set of words that precede it.
● The example of a masked sample is used to illustrate the language modeling objective, where
a word is masked and the LLM is asked to predict that missing word.
● LLMs learn correlations between words that frequently co-occur; they assign probabilities to the words in their vocabulary and then select the word that best fills the missing blank.

Example: The movie was awesome overall the experience was positive

The word "positive" is the ground-truth label: it is masked, and the LLM is asked to predict it (see the sketch below).
The LLM is trained to predict the next word in the sequence in an autoregressive fashion.
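
A small sketch of how this masked training sample can be formed from the example sentence above (plain Python; for illustration only):

# The final word is held out as the ground-truth label; the words before
# it become the input context shown to the model.
sentence = "The movie was awesome overall the experience was positive"
words = sentence.split()

context = " ".join(words[:-1])   # "The movie was awesome overall the experience was"
label = words[-1]                # "positive"

print(context)
print(label)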


Implementing Language Modeling: Probability and Word Selection

● LLMs assign probabilities to every word in their vocabulary and use those probabilities to select a word that is highly likely to fill the missing blank.
● The transformer neural network behind the LLM is constructed precisely to produce this probability distribution over the vocabulary.

Example: The movie is a visually stunning, action-packed, and emotionally resonant thrill ride that will leave you on the edge of your seat from beginning to end; overall, the experience was ____

The LLM is asked to predict the missing word (see the sketch below).
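
A rough sketch of this probability step (the vocabulary and raw scores below are invented; a real transformer produces scores for every word in a vocabulary of tens of thousands of tokens):

import math

# Hypothetical raw scores (logits) the model assigns to a few candidate words.
vocab_scores = {"positive": 4.1, "negative": 1.3, "thrilling": 2.2, "table": -3.0}

# Softmax: exponentiate each score and normalize so the values sum to 1.
total = sum(math.exp(s) for s in vocab_scores.values())
probs = {w: math.exp(s) / total for w, s in vocab_scores.items()}

# The word with the highest probability is chosen to fill the blank.
best_word = max(probs, key=probs.get)
print(probs)
print(best_word)   # -> positive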

Constructing a Training Corpus for LLMs

● To train an LLM, a training dataset is constructed by masking every word in the corpus, one word at a time, yielding billions of examples for the model to learn from (see the sketch below).
● The LLM is trained to predict the next word in the sequence in an autoregressive fashion.
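
A simple sketch of this corpus construction: every position in the text yields one (context, target) pair, so an internet-scale corpus yields billions of examples. The one-sentence "corpus" below is purely illustrative:

corpus = "The movie was awesome overall the experience was positive"
tokens = corpus.split()

training_pairs = []
for i in range(1, len(tokens)):
    context = " ".join(tokens[:i])   # all words seen so far
    target = tokens[i]               # the next word the model must predict
    training_pairs.append((context, target))

for context, target in training_pairs[:3]:
    print(context, "->", target)
# The -> movie
# The movie -> was
# The movie was -> awesome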

Inference Stage of LLMs
● The inference stage differs from the training stage: at inference, the LLM constructs sentences by predicting one word at a time in an autoregressive fashion (see the sketch below).
● The best LLMs on the market, such as the GPT series from OpenAI, the Claude series from Anthropic, the Gemini series from Google, and the Llama series from Meta, are all trained on the core objective of predicting the next word in the sequence in an autoregressive fashion.
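
A toy sketch of the autoregressive loop at inference time. Here predict_next_word is a hypothetical stand-in for a trained LLM, which would return the highest-probability (or sampled) next word given the context:

def predict_next_word(context_words):
    # Hypothetical stand-in for an LLM: returns canned continuations.
    canned = {
        "The movie was": "awesome",
        "The movie was awesome": "overall",
    }
    return canned.get(" ".join(context_words), "<end>")

context = ["The", "movie", "was"]
while True:
    word = predict_next_word(context)
    if word == "<end>":          # stop when the model signals completion
        break
    context.append(word)         # feed the prediction back in and repeat

print(" ".join(context))         # -> The movie was awesome overall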

