

IP505245 Applied AI and Control


Assignment 3, Windy gridworld
Out: 26.10.2022
In: 16.11.2022

Consider a 7×10 gridworld with a goal state G, as shown in the figure below. The agent starts in a random
initial state within the gridworld and can move in one of four directions per step. A crosswind runs
upward through the grid cells. The numbers under the gridworld (in red) indicate the strength of the wind,
which pushes the agent the corresponding number of cells upward. For example, if the agent is in the cell
to the right of the goal, the action “move left” takes it to the cell just above the goal.
Regarding the reward:
• Actions that take the agent off the grid will receive a reward of -100.
• Actions that take the agent to the goal state will receive a reward of 100.
• Otherwise, the agent will receive a constant reward of -1 per time step.
Episode termination:
• Actions that take the agent off the grid will terminate the episode.
• Actions that take the agent to the goal state will terminate the episode.
• The episode also terminates when the time step exceeds 20.

[Figure: 7×10 gridworld with the goal state G and the four movement actions; the wind strength of each column is printed in red below the grid.]

Column i:       0 1 2 3 4 5 6 7 8 9
Wind strength:  0 0 0 1 2 0 1 1 1 0

1) Implement the windy gridworld scenario as a Gym environment
• Implement the step, reset, and render functions.
• Create a script to test the developed environment, in which the agent performs a random walk in the
gridworld until episode termination (a minimal sketch follows below).
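A minimal sketch of such an environment, written against the classic Gym step API (four-tuple return from step). The goal cell GOAL, the coordinate convention ([i, j] = column, row with row 0 at the bottom), and the rule that the wind of the agent's current column is applied after each move are assumptions made here for illustration; set them to match the figure and the lecture's convention.

import numpy as np
import gym
from gym import spaces


class WindyGridworldEnv(gym.Env):
    """Windy gridworld: 10 columns (i) x 7 rows (j), wind pushes the agent upward."""

    WIND = [0, 0, 0, 1, 2, 0, 1, 1, 1, 0]       # wind strength per column i (from the figure)
    GOAL = np.array([7, 3])                      # ASSUMED goal cell [i, j]; read off the figure
    MOVES = {0: (0, 1), 1: (0, -1), 2: (-1, 0), 3: (1, 0)}  # up, down, left, right

    def __init__(self):
        self.observation_space = spaces.MultiDiscrete([10, 7])
        self.action_space = spaces.Discrete(4)
        self.state = None
        self.t = 0

    def reset(self, initial_state=None):
        # Random initial state unless a specific one is given (used in question 2).
        if initial_state is None:
            initial_state = [np.random.randint(10), np.random.randint(7)]
        self.state = np.array(initial_state)
        self.t = 0
        return self.state.copy()

    def step(self, action):
        di, dj = self.MOVES[action]
        wind = self.WIND[self.state[0]]          # assumed: wind of the current column applies
        new_state = self.state + np.array([di, dj + wind])
        self.t += 1

        if not (0 <= new_state[0] < 10 and 0 <= new_state[1] < 7):
            return new_state, -100, True, {}     # off the grid: reward -100, terminate
        self.state = new_state
        if np.array_equal(self.state, self.GOAL):
            return self.state.copy(), 100, True, {}   # goal reached: reward 100, terminate
        return self.state.copy(), -1, self.t >= 20, {}  # otherwise -1 per step, stop after 20 steps

    def render(self, mode="human"):
        grid = [["." for _ in range(10)] for _ in range(7)]
        grid[self.GOAL[1]][self.GOAL[0]] = "G"
        grid[self.state[1]][self.state[0]] = "A"
        print("\n".join(" ".join(row) for row in reversed(grid)))  # row 0 printed at the bottom


if __name__ == "__main__":
    # Random-walk test script: sample actions until the episode terminates.
    env = WindyGridworldEnv()
    obs, done = env.reset(), False
    env.render()
    while not done:
        obs, reward, done, _ = env.step(env.action_space.sample())
        print("reward:", reward)

The random-walk loop above doubles as the test script asked for in the second bullet; any specific start cell can be passed to reset for debugging.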
2) Use dynamic programming (p. 42 in Lecture 8) or the Q-learning method to
• Generate an optimal policy, so that the agent can reach the goal state (a Q-learning sketch follows
after this question).
• Test two initial cases using the generated policy:
i. Initial state [i, j] = [1, 1]
ii. Initial state [i, j] = [2, 5]
• (Optional) Assume the cells directly above and below the goal state are obstacles. Actions that take
the agent into these obstacle cells terminate the episode with a reward of -100. Redo question 2 to
find the optimal policy.
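The handout allows either dynamic programming or Q-learning; the following is a minimal tabular Q-learning sketch built on the environment class sketched under question 1. The hyperparameters (episodes, alpha, gamma, epsilon) are assumed values, not given in the handout, and terminations at the 20-step limit are treated as terminal for simplicity. The rollout at the end tests the greedy policy from the two given initial states.

import numpy as np


def q_learning(env, episodes=5000, alpha=0.5, gamma=0.95, epsilon=0.1):
    """Tabular Q-learning with epsilon-greedy exploration."""
    Q = np.zeros((10, 7, 4))                      # Q[i, j, action]
    for _ in range(episodes):
        state, done = env.reset(), False
        while not done:
            i, j = state
            if np.random.rand() < epsilon:
                action = env.action_space.sample()
            else:
                action = int(np.argmax(Q[i, j]))
            next_state, reward, done, _ = env.step(action)
            # Bootstrap only from non-terminal (in-grid) successor states.
            target = reward
            if not done:
                target += gamma * np.max(Q[next_state[0], next_state[1]])
            Q[i, j, action] += alpha * (target - Q[i, j, action])
            state = next_state
    return np.argmax(Q, axis=-1)                  # greedy action per cell


if __name__ == "__main__":
    env = WindyGridworldEnv()                     # environment sketch from question 1
    policy = q_learning(env)
    # Roll out the greedy policy from the two given initial states.
    for start in ([1, 1], [2, 5]):
        state, done, ret = env.reset(start), False, 0
        while not done:
            state, reward, done, _ = env.step(int(policy[state[0], state[1]]))
            ret += reward
        print("start", start, "return", ret)

For the optional part, the same sketch applies once the environment's step function additionally terminates with a reward of -100 when the agent enters one of the two obstacle cells.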
