Paper / Subject Code: 53173 / Reinforcement Learning (DLOC - V)
Time: 3 Hours Marks: 80
Note: 1. Question 1 is compulsory
2. Answer any three out of the remaining five questions.
3. Assume any suitable data wherever required and justify the same.
Q1 a) Describe generalized policy iteration. [5]
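As a point of reference for a), generalized policy iteration can be sketched on a made-up two-state MDP; the states, actions, rewards, and γ below are illustrative assumptions, not part of the question:

```python
# Hedged sketch: alternate policy evaluation and greedy improvement on a
# tiny deterministic MDP (states 0, 1; actions "stay"/"go"; made-up dynamics).
gamma = 0.9
P = {0: {"stay": (0, 0.0), "go": (1, 1.0)},   # P[s][a] = (next state, reward)
     1: {"stay": (1, 2.0), "go": (0, 0.0)}}
policy = {0: "stay", 1: "stay"}
V = {0: 0.0, 1: 0.0}
stable = False
while not stable:
    # policy evaluation: sweep until values settle under the current policy
    for _ in range(200):
        for s in P:
            s2, r = P[s][policy[s]]
            V[s] = r + gamma * V[s2]
    # policy improvement: act greedily with respect to the current V
    stable = True
    for s in P:
        best = max(P[s], key=lambda a: P[s][a][1] + gamma * V[P[s][a][0]])
        if best != policy[s]:
            policy[s] = best
            stable = False
print(policy)   # converges to {0: 'go', 1: 'stay'}
```

The point of the sketch is the interleaving: evaluation makes V consistent with the policy, improvement makes the policy greedy with respect to V, and the process stops when neither changes the other.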
b) Suppose γ = 0.9 and the reward sequence is R1 = 2 followed by an infinite sequence of 7s. What are G1 and G0? [5]
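A quick numerical check for b), assuming the standard discounted-return definition G_t = R_{t+1} + γ·G_{t+1} with R1 denoting the first reward:

```python
# Hedged check: gamma = 0.9, first reward 2, then an infinite tail of 7s.
gamma = 0.9
G1 = 7 / (1 - gamma)    # geometric series 7 + 0.9*7 + 0.9^2*7 + ... = 70.0
G0 = 2 + gamma * G1     # R1 + gamma * G1 = 2 + 63 = 65.0
print(G1, G0)
```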
c) What are the key features of reinforcement learning? [5]
d) Describe temporal-difference (TD) prediction with an example. [5]
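For d), the tabular TD(0) update can be sketched as follows; the state names, step size α, and γ are illustrative assumptions:

```python
# Hedged sketch of tabular TD(0) prediction: after observing a transition
# (s, r, s'), move V(s) a fraction alpha toward the bootstrapped target
# r + gamma * V(s').
def td0_update(V, s, r, s_next, alpha=0.1, gamma=0.9):
    target = r + gamma * V.get(s_next, 0.0)
    V[s] = V.get(s, 0.0) + alpha * (target - V.get(s, 0.0))
    return V

V = {"A": 0.0, "B": 0.0}
td0_update(V, "A", 1.0, "B")   # V["A"] moves 10% of the way toward 1 + 0.9*0
```

Unlike Monte Carlo prediction, the update uses the current estimate V(s') instead of waiting for the full return, so learning happens after every step.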
Q2 a) Consider a k-armed bandit problem with k = 4 actions, denoted 1, 2, 3, and 4. [10]
Consider applying to this problem a bandit algorithm using ε-greedy action selection, sample-average action-value estimates, and initial estimates of Q1(a) = 0, for all a. Suppose the initial sequence of actions and rewards is A1 = 1, R1 = 1, A2 = 2, R2 = 1, A3 = 2, R3 = 2, A4 = 2, R4 = 2, A5 = 3, R5 = 0. On some of these time steps the ε case may have occurred, causing an action to be selected at random. On which time steps did this definitely occur? On which time steps could this possibly have occurred?
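One way to check the reasoning in Q2 a) is to replay the given sequence with sample-average estimates and flag the steps where the chosen action was not greedy; on those steps the ε case must have occurred, while it could have occurred on any step (this is a hedged sketch, not part of the question):

```python
# Replay the given action/reward sequence with sample-average Q estimates
# (Q1(a) = 0 for all a) and record steps whose action was not greedy.
actions = [1, 2, 2, 2, 3]
rewards = [1, 1, 2, 2, 0]
Q = {a: 0.0 for a in (1, 2, 3, 4)}
N = {a: 0 for a in (1, 2, 3, 4)}
definitely_random = []
for t, (a, r) in enumerate(zip(actions, rewards), start=1):
    greedy = {x for x in Q if Q[x] == max(Q.values())}
    if a not in greedy:
        definitely_random.append(t)   # non-greedy choice: epsilon case for sure
    N[a] += 1
    Q[a] += (r - Q[a]) / N[a]         # incremental sample-average update
print(definitely_random)
```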
Q3 a) Illustrate through an example the use of Monte Carlo Methods. [10]
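As one possible illustration for Q3 a), first-visit Monte Carlo prediction averages sampled returns per state; the toy two-state chain below is an assumption, not from the paper:

```python
# Hedged illustration: first-visit Monte Carlo prediction on a toy episodic
# chain A --(+1)--> B --(+1)--> terminal, gamma = 1, fixed policy.
gamma = 1.0
episodes = [[("A", 1.0), ("B", 1.0)]] * 3   # (state, reward) pairs, 3 episodes
returns = {"A": [], "B": []}
for ep in episodes:
    G = 0.0
    first_visit_return = {}
    for state, reward in reversed(ep):      # accumulate the return backwards
        G = reward + gamma * G
        first_visit_return[state] = G       # later overwrites keep the FIRST visit
    for state, g in first_visit_return.items():
        returns[state].append(g)
V = {s: sum(gs) / len(gs) for s, gs in returns.items()}
print(V)   # {'A': 2.0, 'B': 1.0}
```

No model of the environment is used: the value estimates come purely from averaging complete sampled returns, which is the defining feature of Monte Carlo methods.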
b) Describe the application of reinforcement learning to the real-world problem of Job-Shop Scheduling. [10]
b) Explain how upper confidence bound (UCB) action selection generally performs [10]
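For reference, a minimal sketch of UCB1-style selection, choosing the action maximizing Q(a) + c·sqrt(ln t / N(a)); the helper name `ucb_select` and the example numbers are illustrative assumptions:

```python
import math

# Hedged sketch of UCB action selection: prefer the action whose value
# estimate plus exploration bonus c * sqrt(ln t / N(a)) is largest;
# untried actions are taken first (their bonus is effectively infinite).
def ucb_select(Q, N, t, c=2.0):
    for a in Q:
        if N[a] == 0:
            return a
    return max(Q, key=lambda a: Q[a] + c * math.sqrt(math.log(t) / N[a]))

# example (made-up numbers): the rarely tried arm wins on its bonus
choice = ucb_select({1: 0.5, 2: 0.6}, {1: 10, 2: 1}, t=11)
```

The bonus shrinks as N(a) grows, so exploration is directed at actions whose estimates are still uncertain rather than chosen uniformly at random as in ε-greedy.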
Q6 a) What are Goals and Rewards? Explain with a suitable example. [10]
__________________________
42752 Page 1 of 1