Welcome to Scribd!

HW4 Slide

Uploaded by

0% found this document useful (0 votes)

13 views8 pages

This document provides instructions for Homework 4 on reinforcement learning (RL) due on May 12th. Students will implement Q-learning and its variants in OpenAI Gym environments like Taxi-v3 and CartPole-v0. The homework consists of implementing parts of the code, running experiments, and writing a report following a template. Students must submit a zip file with their code, results, and report by the due date to receive credit.

Original Description:

Original Title

HW4 slide

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

13 views8 pages

HW4 Slide

Uploaded by

鄭博仁

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 8

Search inside document

Homework 4 - RL

Due Date: 5/12 (Friday) 23:59

Human-centered Intelligent Systems Lab

Introduction
In this assignment, you will implement basic RL algorithm, Q learning and its variants in OpenAI
Gym environments i.e.,

● Taxi-v3

● CartPole-v0

Human-centered Intelligent Systems Lab

Setup
We recommend you to use python 3.7 and all the packages you need are listed in the
requirements.txt. Please run the command to install the packages:

Human-centered Intelligent Systems Lab

Implementation (50%)
The sections you need to implement are specified with # Begin your code and # End your code.
Please read all the comments to comprehend the source code before implementation. Do not
modify the rest of the code

● Part 1: Q learning in Taxi-v3 (10%)

● Part 2: Q learning in CartPole-v0 (15%)

● Part 3: DQN in CartPole-v0 (25%)

Human-centered Intelligent Systems Lab

Experiment

You can use plot.py to plot the learning curves, this will help you verify if you train the model
correctly.

Human-centered Intelligent Systems Lab

Report (50%)
● You should write your report following the report template
● The report should be written in English.

● Please save the report as a .pdf file. (font size: 12)

● Answer the questions in the report template in detail.

Human-centered Intelligent Systems Lab

Submission
Due Date: 2023/5/12 23:59

Please compress your source code, results and report (.pdf) into STUDENTID_hw4.zip.

The file structure should look like:

Wrong submission format leads to -10 point.

Late Submission Policy

20% off per late day

Human-centered Intelligent Systems Lab

Please check out the spec
for more details!

8
Human-centered Intelligent Systems Lab

Python Project Report
Document44 pages
Python Project Report
Varsha Timori
100% (1)
Chatbor Report (Covid19)
Document19 pages
Chatbor Report (Covid19)
Ragland
No ratings yet
SPCC Assignment
Document6 pages
SPCC Assignment
Kenneth
No ratings yet
Python Project Captcha Making Using Python Gui
Document18 pages
Python Project Captcha Making Using Python Gui
Kamlesh Parihar
No ratings yet
Assignment: Systems Programming and Computer Control (50%) Date Assigned: Week 3 Date Due: Week 15 Lecturer: Submission: Softcopy & Hard
Document6 pages
Assignment: Systems Programming and Computer Control (50%) Date Assigned: Week 3 Date Due: Week 15 Lecturer: Submission: Softcopy & Hard
Mikeyy Nzuzi
No ratings yet
ITNPAI1 Assignment - 1553606962
Document3 pages
ITNPAI1 Assignment - 1553606962
h58991888
No ratings yet
Apu SPCC Assignment
Document5 pages
Apu SPCC Assignment
Elaine LaLa
No ratings yet
Python Report
Document17 pages
Python Report
schweser Books
No ratings yet
70-532 New
Document14 pages
70-532 New
Tanveer Kumbari
No ratings yet
Data Science With Python - Final Projects 24JAN2019
Document6 pages
Data Science With Python - Final Projects 24JAN2019
Amiya Kumar Biswal
No ratings yet
SYNOPSIS
Document20 pages
SYNOPSIS
ankita galgalikar
No ratings yet
All Projects F 22
Document154 pages
All Projects F 22
Iqra Iqra
No ratings yet
APU SPCC Assignment Smart Home
Document5 pages
APU SPCC Assignment Smart Home
Bigyan
No ratings yet
Python For Scientific and High Performance Com
Document125 pages
Python For Scientific and High Performance Com
spygen123
100% (1)
1.0 Individual Assignment Description
Document6 pages
1.0 Individual Assignment Description
Jayant Babu Bastola
No ratings yet
Presentation - of - Project - Report - Vikas
Document19 pages
Presentation - of - Project - Report - Vikas
Vikash sharma
No ratings yet
How To Write A Big PLC Program Contact and Coil
Document5 pages
How To Write A Big PLC Program Contact and Coil
Juozas Ko
No ratings yet
2.2.4.acceptance Testing
Document3 pages
2.2.4.acceptance Testing
Thành Công
No ratings yet
Problems
Document14 pages
Problems
Ajmal Ismail
No ratings yet
Proposalwithsdlc 151025121134 Lva1 App6891
Document7 pages
Proposalwithsdlc 151025121134 Lva1 App6891
smin61903
No ratings yet
You Are Asked To Enter Student Names Below: 1. 2. 3. 4
Document5 pages
You Are Asked To Enter Student Names Below: 1. 2. 3. 4
Mer Utsav
No ratings yet
FHCT1022 Assignment 2014 Oct
Document7 pages
FHCT1022 Assignment 2014 Oct
ChongZY
100% (1)
Practice For Lesson 5-3
Document6 pages
Practice For Lesson 5-3
ade
No ratings yet
The Indian Public School - : Unit-1-Problem Solving and Technique
Document11 pages
The Indian Public School - : Unit-1-Problem Solving and Technique
Ramya
No ratings yet
Testing Experience Resume
Document3 pages
Testing Experience Resume
Parthiban
No ratings yet
Resit Deferral Individual Project Assignment Brief 4037cem
Document8 pages
Resit Deferral Individual Project Assignment Brief 4037cem
Atif khan
No ratings yet
Abinash Resume
Document3 pages
Abinash Resume
engineeringwatch
No ratings yet
Condes-Dognidon-Franco-Pascual - Assignment 2
Document10 pages
Condes-Dognidon-Franco-Pascual - Assignment 2
Alex Andersen
No ratings yet
How-To Leverage ChatGPT For Test Automation
Document24 pages
How-To Leverage ChatGPT For Test Automation
suraj satav
No ratings yet
Main PART PDF
Document46 pages
Main PART PDF
Ishan Patwal
No ratings yet
Software Engineering CH. 5: Get It Now: Ends in 14d 23h 39m 34s
Document2 pages
Software Engineering CH. 5: Get It Now: Ends in 14d 23h 39m 34s
حنين عبدالكريم شاكر احمد
No ratings yet
Harshit Satya: Work Experience Skills
Document1 page
Harshit Satya: Work Experience Skills
harshit
No ratings yet
2009 Stanford Local ACM Programming Contest: Saturday, October 3rd, 2009
Document14 pages
2009 Stanford Local ACM Programming Contest: Saturday, October 3rd, 2009
Ajmal Ismail
No ratings yet
CT108-3-1-PYP - Assignment - Question
Document6 pages
CT108-3-1-PYP - Assignment - Question
Usue
No ratings yet
PHD Thesis in Operating System
Document8 pages
PHD Thesis in Operating System
SomeoneToWriteMyPaperGlendale
100% (2)
Walk-In Drive For Feat Systems-Organized by VibrantMinds Technologies
Document3 pages
Walk-In Drive For Feat Systems-Organized by VibrantMinds Technologies
Tanay Jaybhaye
No ratings yet
Ganesh
Document28 pages
Ganesh
Srikanth Reddy
No ratings yet
8 Machine Learning Algorithms in Python
Document16 pages
8 Machine Learning Algorithms in Python
uzair12345
100% (2)
Research Paper On Embedded Systems PDF
Document5 pages
Research Paper On Embedded Systems PDF
jrcrefvhf
100% (1)
7ENT1116 - NN - Assignment Briefing Sheet 2019-20 - v2
Document5 pages
7ENT1116 - NN - Assignment Briefing Sheet 2019-20 - v2
amin
No ratings yet
Twelve Ways
Document4 pages
Twelve Ways
de7yT3iz
No ratings yet
HW4 Spec
Document5 pages
HW4 Spec
鄭博仁
No ratings yet
IT401 Embedded Systems
Document2 pages
IT401 Embedded Systems
Vidya A
No ratings yet
Robot Manipulator Control Using PLC With Position Based and Image Based Algorithm 2090 4908 1000154
Document8 pages
Robot Manipulator Control Using PLC With Position Based and Image Based Algorithm 2090 4908 1000154
Chandru Christuraj
No ratings yet
Common Mistakes in Using Codevita Code Evaluation Platform
Document5 pages
Common Mistakes in Using Codevita Code Evaluation Platform
srinivasan18121992
67% (6)
Assignment - 13M Payment Behaviour Prediction - BAA
Document1 page
Assignment - 13M Payment Behaviour Prediction - BAA
Aditya
No ratings yet
INFO0027-2: Project 3 (10% INDIVIDUAL) Pirallel Computing: 1 Implementation
Document2 pages
INFO0027-2: Project 3 (10% INDIVIDUAL) Pirallel Computing: 1 Implementation
Pierre Vyncke
No ratings yet
CN5004 2023 24 Assignment
Document9 pages
CN5004 2023 24 Assignment
wanjalaaustine819
No ratings yet
5 Assignment Concurrent Programming Depot OBE Apr 2018
Document6 pages
5 Assignment Concurrent Programming Depot OBE Apr 2018
Jelly Holmes
No ratings yet
Topic For Lab1
Document4 pages
Topic For Lab1
Dương Huy Hoàng
No ratings yet
ML Course Prospectus
Document7 pages
ML Course Prospectus
Mustafa Ali
No ratings yet
Long Question & Answer
Document11 pages
Long Question & Answer
animesh_anush
No ratings yet
Test Automation Research Papers
Document8 pages
Test Automation Research Papers
fvffv0x7
100% (1)
DCOMS Assignment Question JUNE2013
Document4 pages
DCOMS Assignment Question JUNE2013
Armin Amiri
No ratings yet
Hua Chi Quan
Document1 page
Hua Chi Quan
Hứa Chí Quân
No ratings yet
Project Hotel Management
Document29 pages
Project Hotel Management
faggskn
No ratings yet
Pet Shop
Document119 pages
Pet Shop
suraj
75% (8)
Bringing Images to Life: Exploring DALL-E with ChatGPT
From Everand
Bringing Images to Life: Exploring DALL-E with ChatGPT
Aura-Elena Turcu
No ratings yet
How to Build Self-Driving Cars From Scratch, Part 1: A Step-by-Step Guide to Creating Autonomous Vehicles With Python
From Everand
How to Build Self-Driving Cars From Scratch, Part 1: A Step-by-Step Guide to Creating Autonomous Vehicles With Python
Bolakale Aremu
No ratings yet
Mastering Python Scientific Computing
From Everand
Mastering Python Scientific Computing
Mehta Hemant Kumar
Rating: 4 out of 5 stars
4/5 (1)
HW4 Spec
Document5 pages
HW4 Spec
鄭博仁
No ratings yet
Lecture 9 - RL
Document82 pages
Lecture 9 - RL
鄭博仁
No ratings yet
Homework Assignments 2 PDF
Document10 pages
Homework Assignments 2 PDF
鄭博仁
No ratings yet
Report PDF
Document5 pages
Report PDF
鄭博仁
100% (1)
Report PDF
Document12 pages
Report PDF
鄭博仁
No ratings yet
hw4 PDF
Document6 pages
hw4 PDF
鄭博仁
No ratings yet