AI WORKPLACE FOUNDATIONS
Understanding LLMs
DAY 2, MODULE 2
Agenda
01 What is an LLM?
02 Key Timelines
03 LLM – Under the Hood
04 LLM Capabilities
05 Getting More Out of LLMs
06 Building LLM-Powered Assistants
What is a Large Language Model?
• A large language model is a trained
deep-learning model that contextually
understands human language and can
generate text in a human-like fashion.
• LLMs are trained on vast amounts of text
data to develop a deep understanding of
language structures and meanings.
What is a Large Language Model?
• The “large” in large language models
refers to three things: the huge volume of
training data used, the massive scale of the
model's architecture, and the costly
computational resources required for training.
• They are typically based on transformer
architectures, which rely on self-attention
mechanisms that allow the model to
capture long-range dependencies
between words in a sentence (a toy sketch
of self-attention follows below).
[Diagram: a large language model is a neural network designed for NLP tasks, with lots of parameters.]
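To make the self-attention idea concrete, here is a minimal NumPy sketch of scaled dot-product attention over a handful of tokens. The dimensions, random weights, and single-head form are illustrative assumptions, not the configuration of any particular model.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """X: (seq_len, d_model) token embeddings; Wq/Wk/Wv: learned projections."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv                  # queries, keys, values
    scores = Q @ K.T / np.sqrt(K.shape[-1])           # similarity of every token pair
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # softmax over positions
    return weights @ V                                 # each token mixes in context from all others

# Toy example: 4 tokens, embedding size 8 (purely illustrative)
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)            # -> (4, 8)
```

Because every token attends to every other token in one step, distant words can influence each other directly rather than through a long chain of recurrent states.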
Key Timelines
• In the Beginning (’50s – ’80s): Symbolic AI & rule-based systems (ELIZA); statistical methods and the start of connectionism.
• Statistical Revolution (’90s – 2000s): Rise of probabilistic models (n-grams); neural networks (feedforward, RNN).
• Deep Learning Age (2010s): Word embeddings; sequence modelling (RNN, LSTM); the attention mechanism.
• Age of Transformers (2017 & beyond): Rise of the attention mechanism; pretraining with masked modelling; parameter scaling.
LLMs – Under the Hood
LLMs follow a two-step training process:
• Pre-training: The models learn from massive amounts of
unlabeled text data. Using self-supervised learning, the
model learns to predict masked or corrupted words in the
input, allowing it to capture rich contextual representations.
• Fine-tuning: The models are further trained on specific tasks
using labeled data to specialize their language
understanding for various applications. This is known as
transfer learning, which allows a model to generalize its
capabilities to various downstream NLP tasks.
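As a concrete illustration of the masked-word objective described under pre-training above, the sketch below asks a small pre-trained model to fill in a corrupted token. It assumes the Hugging Face transformers library is installed; the model choice (bert-base-uncased) and the example sentence are illustrative only.

```python
from transformers import pipeline

# Load a small pre-trained masked language model (illustrative choice).
fill_mask = pipeline("fill-mask", model="bert-base-uncased")

# The model predicts the masked word purely from the surrounding context.
for candidate in fill_mask("The contract must be signed by the [MASK]."):
    print(f"{candidate['token_str']:>12}  score={candidate['score']:.3f}")
```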
LLM Capabilities
• Conversation and dialogue, mimicking different writing
styles, adapting to various genres, and producing
contextually appropriate responses.
• Language translation, capturing nuances and idiomatic
expressions.
• Document summarization and knowledge extraction from a
wide range of sources.
• Intelligent text suggestion and completion based on partial
input.
• Sentiment analysis, distinguishing positive, negative or
neutral tones.
• Creative content generation, including fictional stories,
poetry, or script dialogues.
More LLM Capabilities
1 Coding Copilot
Assist developers in completing code snippets, suggesting functions, and debugging. Generate technical documentation from code or explain code in simple language.
Code completion tools: Codex (OpenAI), GitHub Copilot, AlphaCode (DeepMind), TabNine, IntelliCode (MS).
2 Data Analysis & Interpretation
Automatic generation of reports from raw data to provide insights and summaries.
Generative analytics enables running analysis using prompts. Conversational analytics uses NLQ (natural-language querying) to query databases and fetch data for non-technical users (a sketch follows below).
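As a sketch of the conversational-analytics idea, the snippet below prompts an LLM to translate a natural-language question into SQL. The OpenAI Python client is used for illustration; the model name, table schema, and prompt wording are assumptions, not a specific product's interface.

```python
from openai import OpenAI

client = OpenAI()  # assumes an OPENAI_API_KEY environment variable is set

schema = "sales(region TEXT, product TEXT, amount REAL, sold_on DATE)"  # hypothetical table
question = "What were total sales per region last quarter?"

response = client.chat.completions.create(
    model="gpt-4o",  # illustrative model choice
    messages=[
        {"role": "system",
         "content": ("Translate the user's question into a single SQL query "
                     f"against this table: {schema}. Return only the SQL.")},
        {"role": "user", "content": question},
    ],
)
print(response.choices[0].message.content)  # SQL the non-technical user never has to write
```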
Leading LLMs
1 Open Models
Llama 3, Llama 3.1, Mixtral 8x22b, Mixtral 8x7b, Mistral Large, Qwen2, Command-R
2 Closed Models
Claude 3.5, Gemini 1.5, Gemini Ultra, GPT-4, GPT-4 Turbo, GPT-4o
Getting More out of LLMs
The ambiguity of natural language affects how LLMs perform in
different tasks. This can be addressed in two ways –
prompt engineering and fine-tuning:
Prompt engineering
• This involves designing and refining input queries, known as
prompts, to achieve desired responses from LLMs.
• The phrasing, structure, and context of a prompt directly
influence the quality and relevance of the model's output.
• Understanding how to tune prompts effectively will help you
obtain more accurate and nuanced responses that are useful
and relevant.
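For illustration, here is the same request phrased first as a vague prompt and then as an engineered prompt that adds a role, audience, structure, and constraints. The wording is a sketch of the technique, not a prescribed template.

```python
# Vague prompt: leaves audience, length, and format entirely to the model.
vague_prompt = "Summarise this report."

# Engineered prompt: role, audience, structure, and constraints are explicit.
engineered_prompt = """You are a financial analyst.
Summarise the report below for a non-technical executive audience.
- Use at most 3 bullet points.
- List any risks separately under a 'Risks' heading.
- Keep the total under 120 words.

Report:
{report_text}
"""
```

The engineered version typically produces output that is easier to reuse, because the format and audience are fixed up front rather than guessed by the model.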
Getting More out of LLMs
The ambiguity of natural language affects how LLMs perform in
different tasks. This can be addressed in two ways –
prompt engineering and fine-tuning:
Fine-tuning
• Fine-tuning involves taking a pre-trained LLM, such as GPT-3,
and further training it on a domain-specific task.
• Fine-tuning a model on a more focused dataset enables the
model to adapt to the specific requirements of the target
task, resulting in improved performance and tailored
responses.
• When you fine-tune an LLM, you teach it how to respond, so
you don’t necessarily have to do any prompt engineering
subsequently (a training sketch follows below).
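The sketch below shows the general shape of a fine-tuning run using the Hugging Face transformers and datasets libraries. The small encoder model, the public IMDB dataset, and the hyperparameters are stand-ins for a domain-specific model and labeled data; treat it as an outline under those assumptions rather than a recommended recipe.

```python
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

model_name = "distilbert-base-uncased"   # small pre-trained model as a stand-in
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=2)

# Labeled, task-specific data (public sentiment data as a stand-in for your domain).
dataset = load_dataset("imdb", split="train[:2000]").train_test_split(test_size=0.1)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=256)

dataset = dataset.map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="finetuned-model",
                           num_train_epochs=1,
                           per_device_train_batch_size=8),
    train_dataset=dataset["train"],
    eval_dataset=dataset["test"],
)
trainer.train()   # transfer learning: pre-trained weights adapt to the new task
```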
Building LLM-Powered Assistants
Key must-haves of LLM assistants:
• Contextual understanding: Should be able to comprehend and
interpret user input, going beyond syntax to understand
nuanced contextual cues.
• Information retrieval: Should be accurate at retrieving and
presenting information, responding to general knowledge
queries, providing up-to-date weather forecasts, fetching
relevant news articles, and offering personalized
recommendations.
• Task management: Must have an exceptional task
management system tailored to user preferences and
priorities, including seamless organisation of to-do lists,
appointment scheduling & reminder setting.
Building LLM-Powered Assistants
Key must-haves of LLM assistants:
• Personalisation: Should be adaptive, incorporating user
preferences to provide tailored responses and
recommendations.
• Conversational interface: A simplified and intuitive user
interface that provides the user with a chat experience.
• Voice interaction (optional): Enables users to effortlessly
communicate with it via speech. This integration of speech-
to-text and text-to-speech technologies fosters an intuitive
user experience to enhance convenience and accessibility.
• Security and Privacy: Should be able to safeguard user
data, prioritising trust and protection of sensitive information
at all times.
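Putting the conversational interface, contextual understanding, and personalisation points together, here is a minimal sketch of an assistant's chat loop. The OpenAI client, model name, and preference string are illustrative assumptions; a production assistant would add retrieval, task management, and proper safeguards for user data.

```python
from openai import OpenAI

client = OpenAI()  # assumes an OPENAI_API_KEY environment variable is set

# Personalisation: user preferences are injected into the system prompt.
user_preferences = "The user prefers concise answers and uses the metric system."
messages = [{"role": "system",
             "content": "You are a helpful workplace assistant. " + user_preferences}]

while True:
    user_input = input("You: ")
    if user_input.lower() in {"quit", "exit"}:
        break
    messages.append({"role": "user", "content": user_input})

    # Contextual understanding: the full message history is sent on every turn.
    reply = client.chat.completions.create(model="gpt-4o", messages=messages)
    answer = reply.choices[0].message.content

    messages.append({"role": "assistant", "content": answer})  # keep the dialogue context
    print("Assistant:", answer)
```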