0% found this document useful (0 votes)

294 views4 pages

Assignment 5 Solution

This document is an assignment on Large Language Models consisting of 8 questions, with a total mark of 10. The questions cover topics such as the disadvantages of RNNs, the purpose of LSTM cell states, and the time complexity of RNNs. Each question includes the correct answer and a brief solution or explanation.

Uploaded by

Harsh Vardhan Choudhary

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

294 views4 pages

Assignment 5 Solution

Uploaded by

Harsh Vardhan Choudhary

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Introduction to Large Language Models

Assignment- 5

Number of questions: 8 Total mark: 6 X 1 + 2 X 2 = 10

_________________________________________________________________________

QUESTION 1: [1 mark]
Which of the following is a disadvantage of Recurrent Neural Networks (RNNs)?

a. Can only process fixed-length inputs.

b. Symmetry in how inputs are processed.
c. Difficulty accessing information from many steps back.
d. Weights are not reused across timesteps.

Correct Answer: c

Solution: Please refer to the lecture slides.

_______________________________________________________________________

QUESTION 2: [1 mark]

Why are RNNs preferred over fixed-window neural models?

a. They have a smaller parameter size.
b. They can process sequences of arbitrary length.
c. They eliminate the need for embedding layers.
d. None of the above.

Correct Answer: b
Solution: Please refer to lecture slides.
_________________________________________________________________________

QUESTION 3: [1 mark]

What is the primary purpose of the cell state in an LSTM?

a. Store short-term information.
b. Control the gradient flow across timesteps.
c. Store long-term information.
d. Perform the activation function.

Correct Answer: c
Solution: The cell stores long-term information in LSTM.
_________________________________________________________________________

QUESTION 4: [1 mark]

In training an RNN, what technique is used to calculate gradients over multiple timesteps?
a. Backpropagation through Time (BPTT)
b. Stochastic Gradient Descent (SGD)
c. Dropout Regularization
d. Layer Normalization

Correct Answer: a
Solution: Please refer to lecture slides.
_________________________________________________________________________

QUESTION 5: [2 mark]

Consider a simple RNN:

● Input vector size: 3

● Hidden state size: 4
● Output vector size: 2
● Number of timesteps: 5

How many parameters are there in total?

a. 210
b. 190
c. 90
d. 42

Correct Answer: d
Solution:
Input to hidden weights: 3×4=12
Hidden to hidden weights: 4×4=16
Hidden to output weights: 4×2=8
Bias terms: 4(hidden) + 2(output) = 6
Total: 12+16+8+6=42
_________________________________________________________________________

QUESTION 6: [1 mark]

What is the time complexity for processing a sequence of length 'N' by an RNN, if the input
embedding dimension, hidden state dimension, and output vector dimension are all 'd'?

a. O(N)
b. O(N²d)
c. O(Nd)
d. O(Nd²)

Correct answer: d
Solution: The time complexity of processing a sequence of length N by an RNN depends on
the computational cost of updating the hidden state at each time step.
At each time step, the RNN updates its hidden state ht using the previous hidden state ht-1
and the current input xt. This update typically involves matrix multiplications:

I. Input-to-hidden transformation: Wx * xt, where Wx is a d × d matrix, leading to a

complexity of O(d²).
II. Hidden-to-hidden transformation: Wh * ht-1, where Wh is also a d × d matrix, leading
to a complexity of O(d²).
III. Activation function application: This is typically O(d) and negligible compared to
matrix multiplications.

Since these computations occur at every time step, the total complexity for a sequence of
length N is: O(N * d²)
_________________________________________________________________________

QUESTION 7: [1 mark]

Which of the following is true about Seq2Seq models?

(i) Seq2Seq models are always conditioned on the source sentence.
(ii) The encoder compresses the input sequence into a fixed-size vector representation.
(iii) Seq2Seq models cannot handle variable-length sequences.
a. (i) and (ii)
b. (ii) only
c. (iii) only
d. (i), (ii), and (iii)

Correct Answer: a
Solution: Seq2Seq models are designed to encode variable-length sequences but
compress them into fixed-size vector representations.

_________________________________________________________________________

QUESTION 8: [2 marks]

Given the following encoder and decoder hidden states, compute the attention scores. (Use
dot product as the scoring function)

Encoder hidden states: h1=[1,2], h2=[3,4], h3=[5,6]

Decoder hidden state: s=[0.5,1]

a. 0.00235,0.04731,0.9503
b. 0.0737,0.287,0.6393
c. 0.9503,0.0137,0.036
d. 0.6393,0.0737,0.287

Correct Answer: a
Solution:
e1 = 1*0.5+2*1 =0.5+2 = 2.5
e2 = 3*0.5+4*1 =1.5+4 = 5.5
e3 = 5*0.5+6*1 =2.5+6 = 8.5

α1 = e2.5/(e2.5 + e5.5 + e8.5) = 0.00235

α2 = e5.5/(e2.5 + e5.5 + e8.5) = 0.04731

α3 = e8.5/(e2.5 + e5.5 + e8.5) = 0.9503

_________________________________________________________________________

LLM A5
No ratings yet
LLM A5
3 pages
Week 11
No ratings yet
Week 11
3 pages
C6 Sample
No ratings yet
C6 Sample
4 pages
Week 11 Nptel Deep Learning
No ratings yet
Week 11 Nptel Deep Learning
6 pages
Week 11
No ratings yet
Week 11
3 pages
Assignment 6 Solution
No ratings yet
Assignment 6 Solution
3 pages
Assignment 3 Solution
No ratings yet
Assignment 3 Solution
3 pages
Exam Long Questions
No ratings yet
Exam Long Questions
8 pages
ML Endsem 2022
No ratings yet
ML Endsem 2022
7 pages
Deep Learning Exam With Answers
No ratings yet
Deep Learning Exam With Answers
4 pages
Second Exam 2021-22 Solution
No ratings yet
Second Exam 2021-22 Solution
9 pages
ADL Midterm Mock Exam 2021
No ratings yet
ADL Midterm Mock Exam 2021
5 pages
DL Unit-3 Question Bank
No ratings yet
DL Unit-3 Question Bank
39 pages
Deep Learning Solutions Q1-29
No ratings yet
Deep Learning Solutions Q1-29
3 pages
NLP Quiz
No ratings yet
NLP Quiz
1 page
Introduction To Large Language Models (LLMS) - Unit 7 - Week 5
No ratings yet
Introduction To Large Language Models (LLMS) - Unit 7 - Week 5
4 pages
DL
No ratings yet
DL
1 page
Nervous System Assessment Guide
No ratings yet
Nervous System Assessment Guide
2 pages
Unit IV
No ratings yet
Unit IV
31 pages
QuestionBank DL
No ratings yet
QuestionBank DL
7 pages
RNN Model Selection and Analysis
No ratings yet
RNN Model Selection and Analysis
4 pages
DL QB 2marks
No ratings yet
DL QB 2marks
4 pages
NLP: Transformer vs. N-gram Models
No ratings yet
NLP: Transformer vs. N-gram Models
6 pages
Question Bank
No ratings yet
Question Bank
14 pages
Ad3501 DL Unit-3
No ratings yet
Ad3501 DL Unit-3
6 pages
245008-23CS2902 - Deep Learning
No ratings yet
245008-23CS2902 - Deep Learning
4 pages
UNIT 4 (MCQS)
No ratings yet
UNIT 4 (MCQS)
13 pages
2022 Resit Solution
No ratings yet
2022 Resit Solution
12 pages
4.2 Sequence2Sequence (RNN)
No ratings yet
4.2 Sequence2Sequence (RNN)
46 pages
Recurrent Neural Networks LSTMS, Transformers, Graph Neural Networks
No ratings yet
Recurrent Neural Networks LSTMS, Transformers, Graph Neural Networks
97 pages
Deep Learning Question Bank
No ratings yet
Deep Learning Question Bank
8 pages
Were Rnns All We Needed?: Leo - Feng@Mila - Quebec
No ratings yet
Were Rnns All We Needed?: Leo - Feng@Mila - Quebec
27 pages
Unit 3
No ratings yet
Unit 3
4 pages
Exercise #2 28 - 4 - 2025
No ratings yet
Exercise #2 28 - 4 - 2025
7 pages
Dis6 Sol
No ratings yet
Dis6 Sol
6 pages
MT1SP19
No ratings yet
MT1SP19
13 pages
G L: F D - C L R - S M: ATE OOP Ully ATA Ontrolled Inear E Currence For Equence Odeling
No ratings yet
G L: F D - C L R - S M: ATE OOP Ully ATA Ontrolled Inear E Currence For Equence Odeling
14 pages
RNN LSTM
No ratings yet
RNN LSTM
71 pages
DL Exam 2023-2
No ratings yet
DL Exam 2023-2
5 pages
Exam - Deep Learning - From Theory To Practice (201800177) - Jan 22 2019
No ratings yet
Exam - Deep Learning - From Theory To Practice (201800177) - Jan 22 2019
3 pages
AI Exam Prep: Neural Networks
No ratings yet
AI Exam Prep: Neural Networks
115 pages
Second Exam 2021-22
No ratings yet
Second Exam 2021-22
14 pages
Neural Networks & Deep Learning MCQs
100% (1)
Neural Networks & Deep Learning MCQs
6 pages
41 RNN Notes
No ratings yet
41 RNN Notes
14 pages
11 RNN
No ratings yet
11 RNN
32 pages
Multiplication Operations in Deep Learning
No ratings yet
Multiplication Operations in Deep Learning
7 pages
Unit 5 Updated
No ratings yet
Unit 5 Updated
125 pages
Deep Learning: RNN & LSTM Quiz
No ratings yet
Deep Learning: RNN & LSTM Quiz
10 pages
General Notes: Heruntergeladen Durch Petre Weinberger (Extern - Weinberger@tum - De)
No ratings yet
General Notes: Heruntergeladen Durch Petre Weinberger (Extern - Weinberger@tum - De)
6 pages
E9 205 - Machine Learning For Signal Processing
No ratings yet
E9 205 - Machine Learning For Signal Processing
2 pages
AN2DL 04 2324 RecurrentNeuralNetworks
No ratings yet
AN2DL 04 2324 RecurrentNeuralNetworks
34 pages
COMP 9444 Neural Networks Exam Solutions
No ratings yet
COMP 9444 Neural Networks Exam Solutions
7 pages
Unit4 Deep Learning
No ratings yet
Unit4 Deep Learning
6 pages
Week11 Discussion - Deep Learning
No ratings yet
Week11 Discussion - Deep Learning
23 pages
Build RNN with Numpy: Step-by-Step Guide
No ratings yet
Build RNN with Numpy: Step-by-Step Guide
36 pages
F16midterm Sols v2
No ratings yet
F16midterm Sols v2
14 pages
MT1 SP19 Solutions
No ratings yet
MT1 SP19 Solutions
14 pages
MCQ PDF 6 7
No ratings yet
MCQ PDF 6 7
33 pages
Assignment 7 Solution
No ratings yet
Assignment 7 Solution
3 pages
LLM 1-11
No ratings yet
LLM 1-11
51 pages
Assignment 11 Solution
No ratings yet
Assignment 11 Solution
7 pages
Assignment 9 Solution
No ratings yet
Assignment 9 Solution
7 pages
Assignment 12 Solution
No ratings yet
Assignment 12 Solution
6 pages
Assignment 8 Solution
No ratings yet
Assignment 8 Solution
7 pages
Assignment 10 Solution
No ratings yet
Assignment 10 Solution
6 pages
Assignment 4 Solution
No ratings yet
Assignment 4 Solution
3 pages
Assignment 1 Solution
No ratings yet
Assignment 1 Solution
4 pages
Assignment 2 Solution
No ratings yet
Assignment 2 Solution
4 pages
Deep Learning
No ratings yet
Deep Learning
10 pages
Experiment 10 1
No ratings yet
Experiment 10 1
3 pages
Understanding Neural Network Learning
100% (2)
Understanding Neural Network Learning
19 pages
Unit II
No ratings yet
Unit II
35 pages
IIT Kharagpur AI4ICPS Certificate Program
No ratings yet
IIT Kharagpur AI4ICPS Certificate Program
2 pages
Neural Network Learning Techniques
No ratings yet
Neural Network Learning Techniques
23 pages
LSTM Networks Explained: Key Concepts
No ratings yet
LSTM Networks Explained: Key Concepts
15 pages
Deep Learning - AD3501 - Important Question and 2 Marks With Answers - Unit 2
No ratings yet
Deep Learning - AD3501 - Important Question and 2 Marks With Answers - Unit 2
7 pages
Deep Learning
No ratings yet
Deep Learning
39 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
43 pages
Arificial Neural Networks Deep Learning Lab CSF
No ratings yet
Arificial Neural Networks Deep Learning Lab CSF
2 pages
DL Co4 - PPT 2
No ratings yet
DL Co4 - PPT 2
23 pages
DL Imp Questions
No ratings yet
DL Imp Questions
4 pages
Deep Learning Engineer Roadmap
No ratings yet
Deep Learning Engineer Roadmap
3 pages
Unit III
No ratings yet
Unit III
43 pages
Neural Networks for Researchers
No ratings yet
Neural Networks for Researchers
9 pages
Understanding Deep Learning DNN RNN LSTM CNN and R-CNN
No ratings yet
Understanding Deep Learning DNN RNN LSTM CNN and R-CNN
6 pages
Ad3501 Deep Learning Assesment II QP
No ratings yet
Ad3501 Deep Learning Assesment II QP
2 pages
LDP Blow Up Syllabus End
No ratings yet
LDP Blow Up Syllabus End
2 pages
2-Weeks Deep Learning, Computer Vision, & NLP
No ratings yet
2-Weeks Deep Learning, Computer Vision, & NLP
7 pages
2022ADeepLearning BasedModelforDateFruitClassification
No ratings yet
2022ADeepLearning BasedModelforDateFruitClassification
17 pages
Unit - 4 ANN
No ratings yet
Unit - 4 ANN
46 pages
Vggnet
No ratings yet
Vggnet
8 pages
Module 4 RNN LSTM GRU
No ratings yet
Module 4 RNN LSTM GRU
59 pages
ML Visuals for AI Enthusiasts
No ratings yet
ML Visuals for AI Enthusiasts
62 pages
Generative AI Course Outline
No ratings yet
Generative AI Course Outline
1 page
Matlab Iris RBF
No ratings yet
Matlab Iris RBF
21 pages
Counterpropagation Networks
No ratings yet
Counterpropagation Networks
6 pages
Flower Image Classification with Transfer Learning
No ratings yet
Flower Image Classification with Transfer Learning
6 pages
Neural Network
No ratings yet
Neural Network
12 pages

Assignment 5 Solution

Uploaded by

Assignment 5 Solution

Uploaded by

Introduction to Large Language Models

Number of questions: 8 Total mark: 6 X 1 + 2 X 2 = 10

a. Can only process fixed-length inputs.

Solution: Please refer to the lecture slides.

Why are RNNs preferred over fixed-window neural models?

What is the primary purpose of the cell state in an LSTM?

Consider a simple RNN:

● Input vector size: 3

How many parameters are there in total?

I. Input-to-hidden transformation: Wx * xt, where Wx is a d × d matrix, leading to a

Which of the following is true about Seq2Seq models?

Encoder hidden states: h1=[1,2], h2=[3,4], h3=[5,6]

Decoder hidden state: s=[0.5,1]

α1 = e2.5/(e2.5 + e5.5 + e8.5) = 0.00235

α2 = e5.5/(e2.5 + e5.5 + e8.5) = 0.04731

α3 = e8.5/(e2.5 + e5.5 + e8.5) = 0.9503

You might also like