Reg No.:_______________ Name:__________________________
1000CST497122203
APJ ABDUL KALAM TECHNOLOGICAL UNIVERSITY
Seventh Semester B.Tech Degree (Honours) Examination December 2023 (2020 Admission)
Course Code: CST497
Course Name: REINFORCEMENT LEARNING
Max. Marks: 100 Duration: 3 Hours
PART A
Answer all questions, each carries 3 marks.
1 Two persons, A and B, each toss three fair coins. What is the probability that (3)
both get the same number of heads?
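A minimal Python sketch (illustrative, not part of the paper) that enumerates the 64 equally likely outcome pairs and confirms the probability is 20/64 = 5/16:

    from itertools import product
    from fractions import Fraction

    outcomes = list(product([0, 1], repeat=3))        # 0 = tail, 1 = head
    same = sum(sum(a) == sum(b)
               for a in outcomes for b in outcomes)   # equal head counts
    print(Fraction(same, len(outcomes) ** 2))         # 5/16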
2 Two components of a laptop computer have the following joint probability density (3)
function for their useful lifetimes X and Y (in years):
Find the marginal probability density function of X, fX(x).
3 What is a Markov Decision Process? Give a suitable example. (3)
4 What is the difference between the state value function and the state-action (3)
value function?
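For reference, the standard definitions (in Sutton and Barto's notation), where G_t is the discounted return:

\[
v_\pi(s) = \mathbb{E}_\pi\left[\, G_t \mid S_t = s \,\right],
\qquad
q_\pi(s,a) = \mathbb{E}_\pi\left[\, G_t \mid S_t = s,\, A_t = a \,\right],
\qquad
G_t = \sum_{k=0}^{\infty} \gamma^k R_{t+k+1}.
\]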
5 List any three advantages of Monte Carlo methods over dynamic programming (3)
techniques.
6 Briefly explain the concept of Monte Carlo estimation of action values. (3)
7 Why is Q-learning considered an off-policy control method? (3)
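An illustrative Python sketch of the Q-learning update (the helper names and the dict-based Q table are assumptions for illustration): actions are selected by an epsilon-greedy behaviour policy, but the bootstrapped target maximises over next actions, i.e. it evaluates the greedy target policy, which is what makes the method off-policy:

    import random

    def epsilon_greedy(Q, s, actions, epsilon):
        # Behaviour policy: explores with probability epsilon.
        if random.random() < epsilon:
            return random.choice(actions)
        return max(actions, key=lambda a: Q[(s, a)])

    def q_learning_update(Q, s, a, r, s_next, actions, alpha, gamma):
        # The target uses the greedy (max) action, regardless of the
        # action the behaviour policy will actually take next.
        target = r + gamma * max(Q[(s_next, b)] for b in actions)
        Q[(s, a)] += alpha * (target - Q[(s, a)])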
8 Differentiate between n-step bootstrapping and Q-learning. (3)
9 Explain Feature Construction for Linear Methods. (3)
10 Differentiate between stochastic-gradient and semi-gradient methods. (3)
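A sketch of a semi-gradient TD(0) update with linear function approximation (the feature vectors x_s, x_next and step size alpha are assumed inputs): unlike a true stochastic-gradient method, the bootstrapped target is treated as a constant, so the gradient is taken only through the current estimate:

    import numpy as np

    def semi_gradient_td0_update(w, x_s, x_next, r, alpha, gamma, done):
        # Linear value estimates: v(s, w) = w . x(s)
        v_s = w @ x_s
        v_next = 0.0 if done else w @ x_next
        # The target (r + gamma * v_next) also depends on w, but we do
        # NOT differentiate through it -- hence "semi-gradient".
        w = w + alpha * (r + gamma * v_next - v_s) * x_s
        return w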
PART B
Answer any one full question from each module, each carries 14 marks.
Module I
11 a) Dick and Jane have agreed to meet for lunch between noon (12:00 p.m.) and (7)
1:00 p.m. Denote Jane's arrival time by X, Dick's by Y, and suppose X and Y
are independent with probability density functions
Find the probability that Jane arrives before Dick. That is, find P(X < Y).
b) Let A and B be two independent events such that P(A) = 0.2 and P(B) = 0.8. (7)
Find P(A and B), P(A or B), P(B and not A), and P(neither A nor B).
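One way to work these out, using independence throughout:

\[
P(A \cap B) = P(A)\,P(B) = 0.16, \qquad
P(A \cup B) = P(A) + P(B) - P(A \cap B) = 0.84,
\]
\[
P(B \cap A^{c}) = P(B)\,\bigl(1 - P(A)\bigr) = 0.64, \qquad
P(A^{c} \cap B^{c}) = \bigl(1 - P(A)\bigr)\bigl(1 - P(B)\bigr) = 0.16.
\]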
OR
12 a) 10% of the bulbs produced in a factory are red in colour and 2% are red and (5)
defective. If one bulb is picked at random, determine the probability of it being
defective, given that it is red.
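This is a direct application of the definition of conditional probability:

\[
P(\text{defective} \mid \text{red})
= \frac{P(\text{red} \cap \text{defective})}{P(\text{red})}
= \frac{0.02}{0.10} = 0.2.
\]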
b) A discrete random variable X has the following probability distribution: (9)
Find the value of C. Also find the mean of the distribution.
Module II
13 a) What are the limitations and the scope of reinforcement learning? (5)
b) Explain the agent-environment interaction in a Markov Decision Process with a (9)
diagrammatic representation.
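A minimal Python sketch of that interaction loop (the two-state toy environment and its reset/step methods are invented for illustration): at each step the agent observes state S_t, chooses action A_t, and the environment replies with reward R_{t+1} and next state S_{t+1}:

    import random

    class ToyEnv:
        # Two-state toy MDP, used only to illustrate the loop.
        def reset(self):
            self.state = 0
            return self.state

        def step(self, action):
            reward = 1.0 if action == self.state else 0.0
            self.state = random.choice([0, 1])
            done = random.random() < 0.1      # episode ends at random
            return self.state, reward, done

    env = ToyEnv()
    state = env.reset()
    done = False
    while not done:
        action = random.choice([0, 1])        # the agent's (random) policy
        state, reward, done = env.step(action)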
OR
14 a) How do you justify the involvement of policies and value functions in a (7)
reinforcement learning algorithm?
b) Reinforcement learning algorithms involve estimating value functions. Justify. (7)
Module III
15 a) With respect to the Expected SARSA algorithm, is exploration required as it is (7)
in the normal SARSA and Q-learning algorithms? Justify.
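An illustrative sketch of the Expected SARSA target (the probs helper, returning the current policy's action probabilities, is an assumption): the update averages over all next actions instead of sampling one, which removes the sampling variance of the next action, although the behaviour policy itself must still explore:

    def expected_sarsa_target(Q, s_next, actions, r, gamma, probs):
        # probs(s) -> {action: probability} under the current policy.
        expected_q = sum(p * Q[(s_next, a)]
                         for a, p in probs(s_next).items())
        return r + gamma * expected_q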
b) Suppose you are given a finite set of transition data. Assuming that the Markov (7)
model that can be formed with the given data is the actual MDP from which the
data is generated, will the value functions calculated by the MC and TD methods
necessarily agree? Justify.
OR
16 a) Briefly explain the concept of Monte Carlo control. How can we avoid the (9)
unlikely assumption of exploring starts?
b) For a specific MDP, suppose we have a policy that we want to evaluate using (5)
only actual experience in the environment and Monte Carlo methods. We decide
to use the first-visit approach along with the technique of always picking the
start state at random from the available set of states. Will this approach ensure
complete evaluation of the action-value function corresponding to the policy?
Module IV
17 a) What is the difference between Monte Carlo simulations and Markov Chain (7)
Monte Carlo (MCMC)?
b) What are the advantages and disadvantages of temporal-difference learning and (7)
Monte Carlo methods?
OR
18 a) Why do we use Monte Carlo simulations? Justify your answer. (7)
b) What is the difference between Q-learning and SARSA? (7)
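The contrast shows up directly in the standard update targets (Sutton and Barto's notation):

\[
\text{SARSA (on-policy):}\quad
Q(S_t, A_t) \leftarrow Q(S_t, A_t)
+ \alpha \bigl[ R_{t+1} + \gamma\, Q(S_{t+1}, A_{t+1}) - Q(S_t, A_t) \bigr]
\]
\[
\text{Q-learning (off-policy):}\quad
Q(S_t, A_t) \leftarrow Q(S_t, A_t)
+ \alpha \bigl[ R_{t+1} + \gamma \max_{a} Q(S_{t+1}, a) - Q(S_t, A_t) \bigr]
\]

SARSA bootstraps from the action the current policy actually takes next; Q-learning bootstraps from the greedy action.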
Module V
19 a) What is the difference between a state value function V(s) and a state-action (7)
value function Q(s, a)?
b) Justify the use of Monte Carlo methods in reinforcement learning. (7)
OR
20 a) What is an intuitive explanation of tile coding function approximation in (9)
reinforcement learning?
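A minimal 1-D tile coding sketch (the parameter names and the offset scheme are illustrative choices): each of several overlapping tilings maps the continuous input to exactly one active binary feature, so the approximate value is the sum of a handful of learned weights, giving coarse generalisation with finer resolution where tilings overlap:

    import numpy as np

    def active_tiles(x, n_tilings=4, tiles_per_tiling=10, low=0.0, high=1.0):
        # One active tile index per tiling for an input x in [low, high).
        width = (high - low) / tiles_per_tiling
        indices = []
        for t in range(n_tilings):
            offset = t * width / n_tilings          # each tiling is shifted
            i = int((x - low + offset) / width)
            i = min(i, tiles_per_tiling)            # extra tile absorbs the shift
            indices.append(t * (tiles_per_tiling + 1) + i)
        return indices

    w = np.zeros(4 * 11)                            # one weight per tile

    def value(x):
        # Only n_tilings weights are active for any given x.
        return sum(w[i] for i in active_tiles(x))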
b) Why are function approximators required? (5)
****