Welcome to Scribd!

Ass 6

Uploaded by

0% found this document useful (0 votes)

7 views2 pages

This document shows the implementation of Q-learning to solve a reinforcement learning problem. It imports gym and defines an environment using FrozenLake-v1. It initializes a Q-table with all values set to 0. It defines an epsilon-greedy action selection method. It runs Q-learning over 50,000 episodes with a learning rate of 0.85, discount factor of 0.9, and starting epsilon of 0.8 to populate the Q-table.

Original Description:

Original Title

ass6

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

7 views2 pages

Ass 6

Uploaded by

Akash Sahu

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

ASSIGNMENT - 6

In [11]:

import gym
import numpy as np
import random

In [12]:

env= gym.make('FrozenLake-v1') #, render_mode='human')

In [13]:

Q = {}
for s in range(env.observation_space.n):
for a in range(env.action_space.n):
Q[(s,a)] = 0.0

In [14]:
def epsilon_greedy (state, epsilon):
if random.uniform(0,1) < epsilon:
return env.action_space.sample()
else:
return max(list(range(env.action_space.n)), key= lambda x:
Q[(state,x)])

In [15]:
alpha=0.85
gamma= 0.90
epsilon = 0.8

In [16]:
num_episodes = 50000
num_timesteps= 1000

In [17]:
for i in range(num_episodes):
s = env.reset()[0]
for t in range(num_timesteps):
a = epsilon_greedy(s, epsilon)
s_,r, done, _, trash = env.step(a)
a_ = np.argmax([Q[(s_, a)] for a in range(env.action_space.n)])
Q[(s,a)] += alpha * (r + gamma * Q[(s_,a_)]-Q[(s,a)])
s = s_
if done:
break

In [18]:
Q

Out[18]:
{(0, 0): 0.23477961696373423,
(0, 1): 0.22480181183703787,
(0, 2): 0.23961716957752016,
(0, 3): 0.24066398243905854,
(1, 0): 0.2204815896999076,
(1, 1): 0.04017125915710931,
(1, 2): 0.2822227428738474,
(1, 3): 0.22490808477046206,
(2, 0): 0.29961284509447655,
(2, 0): 0.29961284509447655,
(2, 1): 0.32990657866523887,
(2, 2): 0.37292229711147334,
(2, 3): 0.2790710024900863,
(3, 0): 0.25000597793284357,
(3, 1): 0.2575759230383145,
(3, 2): 0.037377204692152305,
(3, 3): 0.32551898596551954,
(4, 0): 0.3804023551933965,
(4, 1): 0.00856676265665978,
(4, 2): 0.5076563484150082,
(4, 3): 0.050394379122136346,
(5, 0): 0.0,
(5, 1): 0.0,
(5, 2): 0.0,
(5, 3): 0.0,
(6, 0): 0.4285911909119954,
(6, 1): 0.0002831967627810342,
(6, 2): 0.692932809233417,
(6, 3): 0.006210297861473632,
(7, 0): 0.0,
(7, 1): 0.0,
(7, 2): 0.0,
(7, 3): 0.0,
(8, 0): 0.4724947380728043,
(8, 1): 0.5292926568861616,
(8, 2): 0.0854144618498413,
(8, 3): 0.4739574281045383,
(9, 0): 0.07482359865928406,
(9, 1): 0.7499983936496128,
(9, 2): 0.5420577123103719,
(9, 3): 0.07627448541464799,
(10, 0): 0.5934266484891275,
(10, 1): 0.8240260740178592,
(10, 2): 0.774672222464751,
(10, 3): 0.09258352148159432,
(11, 0): 0.0,
(11, 1): 0.0,
(11, 2): 0.0,
(11, 3): 0.0,
(12, 0): 0.0,
(12, 1): 0.0,
(12, 2): 0.0,
(12, 3): 0.0,
(13, 0): 0.5646633544813056,
(13, 1): 0.11021736826449313,
(13, 2): 0.6234923802366498,
(13, 3): 0.7062633423537164,
(14, 0): 0.807111703090516,
(14, 1): 0.6835467756183892,
(14, 2): 0.9109203529502328,
(14, 3): 0.7967422343382821,
(15, 0): 0.0,
(15, 1): 0.0,
(15, 2): 0.0,
(15, 3): 0.0}

Research 9 Modules 1 and 2
Document8 pages
Research 9 Modules 1 and 2
Reinopeter Koykoy Dagpin Lagasca
No ratings yet
LLT PCB Model List A
Document2 pages
LLT PCB Model List A
Álvaro Satué Crespo
100% (1)
Polarisation Data
Document10 pages
Polarisation Data
Nobe Felix
No ratings yet
Record Book Write Up - Jupyter Notebook
Document11 pages
Record Book Write Up - Jupyter Notebook
WASHIPONG LONGKUMER 2147327
No ratings yet
ROC and AUC Practical Implementation PDF
Document6 pages
ROC and AUC Practical Implementation PDF
Nermine Limeme
No ratings yet
Granger Causality and VAR Models
Document1 page
Granger Causality and VAR Models
Manoj M
No ratings yet
Lab Record 2018-19 Mathematical Models Using Python Programming MAT451 Name: Yamuna.A Reg No:1740370
Document32 pages
Lab Record 2018-19 Mathematical Models Using Python Programming MAT451 Name: Yamuna.A Reg No:1740370
yamuna
No ratings yet
Import As
Document27 pages
Import As
Fozia Dawood
100% (1)
Laboratorio #4 1.: Es Posible Calcular Los Errores de La Pendiente
Document3 pages
Laboratorio #4 1.: Es Posible Calcular Los Errores de La Pendiente
Lander Marquez
No ratings yet
R19C076 - Chanukya Gowda K - Mlda - Assignment-2
Document19 pages
R19C076 - Chanukya Gowda K - Mlda - Assignment-2
Chanukya Gowda k
No ratings yet
Frozen Lake
Document6 pages
Frozen Lake
Akash Sahu
No ratings yet
Ass1 Merged Merged
Document19 pages
Ass1 Merged Merged
Akash Sahu
No ratings yet
Problem 4.1 A)
Document11 pages
Problem 4.1 A)
Renxiang Lu
No ratings yet
DC LAB Report#01
Document11 pages
DC LAB Report#01
rajasafeel
No ratings yet
Ejercicios 1.-Dados Los Datos Del Modelo Y
Document4 pages
Ejercicios 1.-Dados Los Datos Del Modelo Y
Williams S Sernaqué H
No ratings yet
Uas Algoritma - Novera Safitri
Document3 pages
Uas Algoritma - Novera Safitri
NOVERA SAFITRI
No ratings yet
Uas Algoritma - Novera Safitri
Document3 pages
Uas Algoritma - Novera Safitri
NOVERA SAFITRI
No ratings yet
Import As: Numpy NP
Document3 pages
Import As: Numpy NP
19C089 SHAAMBHAVI S
No ratings yet
A) Vinegere Chiper: Enkripsi
Document7 pages
A) Vinegere Chiper: Enkripsi
dennialdi
No ratings yet
Machine Learning Lab Assignment-8: Name: Kailasa Sandeep Kumar Reg No: 15BCE0480 Slot: L15+L16 Faculty: Vijaysherley.V
Document3 pages
Machine Learning Lab Assignment-8: Name: Kailasa Sandeep Kumar Reg No: 15BCE0480 Slot: L15+L16 Faculty: Vijaysherley.V
Ashish kumar Neela
No ratings yet
Cryptography Fundamentals Lab Assignment - 6: ECC and Digital Signature Verification
Document13 pages
Cryptography Fundamentals Lab Assignment - 6: ECC and Digital Signature Verification
Surya Vjkumar
No ratings yet
TP - La Récursivité - Correction
Document4 pages
TP - La Récursivité - Correction
elbiyatimanal
No ratings yet
Ejercicio 6 - M.Especificos - V.A.Continuas - Semana6
Document6 pages
Ejercicio 6 - M.Especificos - V.A.Continuas - Semana6
Daniel García
No ratings yet
Answer PDF Lab
Document34 pages
Answer PDF Lab
Al Kafi
No ratings yet
Control Assingment
Document14 pages
Control Assingment
Nipuna Thushara Wijesekara
No ratings yet
NA LAB (Matlab Assignment 01)
Document19 pages
NA LAB (Matlab Assignment 01)
Muhammad Irfan Malik
No ratings yet
Matlab Assignment 01
Document19 pages
Matlab Assignment 01
Muhammad Irfan Malik
No ratings yet
Lecture 2 Python 常用library
Document41 pages
Lecture 2 Python 常用library
Yuanxing
No ratings yet
Python Qazaqsha Sabak 3
Document12 pages
Python Qazaqsha Sabak 3
Damir Muratbaev
No ratings yet
Import As Import As From Import: "Mean Squared Errors: "
Document1 page
Import As Import As From Import: "Mean Squared Errors: "
ul
No ratings yet
Varma Garch
Document55 pages
Varma Garch
Josue Kouakou
No ratings yet
S&P Handson
Document2 pages
S&P Handson
TECHer YT
No ratings yet
Arithmetic Series
Document4 pages
Arithmetic Series
Chris Tea
No ratings yet
ENME503 Assignments Solutions 02
Document10 pages
ENME503 Assignments Solutions 02
Seifeldin T. Abdelghany
No ratings yet
Quantile Regression Explained
Document4 pages
Quantile Regression Explained
ramesh158
No ratings yet
1 Question No. 1 Synthetic Data Generation and Simple Curve Fitting
Document14 pages
1 Question No. 1 Synthetic Data Generation and Simple Curve Fitting
Surya Chala Praveen
No ratings yet
Numpy Tutorial
Document1 page
Numpy Tutorial
Dr. Sanjay Gupta
No ratings yet
QSTN 4
Document2 pages
QSTN 4
surujJD
No ratings yet
Import As: in (1) : in (2) : in (3) : in (4) : in
Document2 pages
Import As: in (1) : in (2) : in (3) : in (4) : in
surujJD
No ratings yet
Ass 4
Document2 pages
Ass 4
Akash Sahu
No ratings yet
b) Ptrinh đường tải tĩnh:: GS GG
Document4 pages
b) Ptrinh đường tải tĩnh:: GS GG
Phi Hùng
No ratings yet
Psa File
Document32 pages
Psa File
Nitin Bhardwaj
No ratings yet
Probability Theory and Mathematical Statistics: Homework 5, Vitaliy Pozdnyakov
Document12 pages
Probability Theory and Mathematical Statistics: Homework 5, Vitaliy Pozdnyakov
Garakhan Talibov
No ratings yet
Nonnewtonian Problem 3-C - Jupyter Notebook
Document8 pages
Nonnewtonian Problem 3-C - Jupyter Notebook
Patel Anjali
No ratings yet
Analisis Dinamico Eje X
Document24 pages
Analisis Dinamico Eje X
VICTOR MANUEL PAITAN MENDEZ
No ratings yet
223 Lec Not RLang
Document28 pages
223 Lec Not RLang
dogan20021907
No ratings yet
Lab 6 VN Diagram
Document13 pages
Lab 6 VN Diagram
Chan Teng Yan
No ratings yet
Name: Shreyash Kharat Homework: Hw3 (Problem 1)
Document7 pages
Name: Shreyash Kharat Homework: Hw3 (Problem 1)
Mainak Samanta
No ratings yet
Sujet Fuite BBA2 QT
Document15 pages
Sujet Fuite BBA2 QT
Junior Benze
No ratings yet
Practica 1
Document13 pages
Practica 1
Adriana Valadez
No ratings yet
Stat 245 Problem Set #6
Document9 pages
Stat 245 Problem Set #6
yoachallenge
No ratings yet
Ml-A3 Code
Document6 pages
Ml-A3 Code
BECOC362Atharva Utekar
No ratings yet
Ujian Mid Semester
Document11 pages
Ujian Mid Semester
Akhmad Fauzul Albab
No ratings yet
Trab CC
Document5 pages
Trab CC
Stiamat
No ratings yet
Kerr - Solve Ivp
Document8 pages
Kerr - Solve Ivp
yulieth andrea ramirez romero
No ratings yet
Manajemen Keuangan
Document4 pages
Manajemen Keuangan
Aisyah
No ratings yet
C B 01 Ta3 Wingconfiguration
Document30 pages
C B 01 Ta3 Wingconfiguration
Krizelle Lao
No ratings yet
Form Laporan Akhi1
Document17 pages
Form Laporan Akhi1
mhd isa lie
No ratings yet
A.) Input: Integer Real Dimension
Document28 pages
A.) Input: Integer Real Dimension
Samar Pratap
No ratings yet
Steepest Descent
Document1 page
Steepest Descent
Yasaman Asiaee
No ratings yet
Instructor's Manual to Accompany CALCULUS WITH ANALYTIC GEOMETRY
From Everand
Instructor's Manual to Accompany CALCULUS WITH ANALYTIC GEOMETRY
Sam Stuart
No ratings yet
Engineers Precision Data Pocket Reference
From Everand
Engineers Precision Data Pocket Reference
Steve Heather
Rating: 3 out of 5 stars
3/5 (1)
R Packages: How To Install, Include and Remove The Packages in R
Document10 pages
R Packages: How To Install, Include and Remove The Packages in R
Akash Sahu
No ratings yet
Ass1 Merged Merged
Document19 pages
Ass1 Merged Merged
Akash Sahu
No ratings yet
Ass1 Merged Merged
Document15 pages
Ass1 Merged Merged
Akash Sahu
No ratings yet
Ass 4
Document2 pages
Ass 4
Akash Sahu
No ratings yet
Frozen Lake
Document6 pages
Frozen Lake
Akash Sahu
No ratings yet
Ass 1
Document2 pages
Ass 1
Akash Sahu
No ratings yet
Ass 3
Document3 pages
Ass 3
Akash Sahu
No ratings yet
Untitled
Document1 page
Untitled
Akash Sahu
No ratings yet
Ass 2
Document4 pages
Ass 2
Akash Sahu
No ratings yet
4ce7: Concrete Technology Lab: Experiment No.:-6
Document6 pages
4ce7: Concrete Technology Lab: Experiment No.:-6
sita ram Jat
No ratings yet
Continue
Document2 pages
Continue
Dag Der
No ratings yet
Where Are You From?: Write About Yourself
Document6 pages
Where Are You From?: Write About Yourself
Fikri
No ratings yet
Orientation: Ged 102: The Life and Works of Jose Rizal
Document32 pages
Orientation: Ged 102: The Life and Works of Jose Rizal
Maecy S. Paglinawan
No ratings yet
IRM2400 Recommended Types of Linings
Document13 pages
IRM2400 Recommended Types of Linings
mika cabello
No ratings yet
Water in Soils
Document25 pages
Water in Soils
Nicholas Viney
No ratings yet
Keli D30-2 Calibrate Manual
Document11 pages
Keli D30-2 Calibrate Manual
TakaSensei
No ratings yet
20 Self Exploration Exercises
Document12 pages
20 Self Exploration Exercises
michael bailey
No ratings yet
Pranayam
Document118 pages
Pranayam
Dst
No ratings yet
Collingwood - On The So-Called Idea of Causation
Document11 pages
Collingwood - On The So-Called Idea of Causation
moliner
No ratings yet
Super-Resolution in Confocal Imaging Sheppard 1988 Optik
Document3 pages
Super-Resolution in Confocal Imaging Sheppard 1988 Optik
Dmitry Cherny
No ratings yet
1L02 - Quality Plan
Document19 pages
1L02 - Quality Plan
Muhammad Rizal
No ratings yet
BE Final Year Project Log Book - 2020-21
Document24 pages
BE Final Year Project Log Book - 2020-21
Amit Sheoran
100% (1)
Topics in Contemporary Mathematics 10th Edition Bello Test Bank 1
Document35 pages
Topics in Contemporary Mathematics 10th Edition Bello Test Bank 1
Howard Goforth
100% (41)
Week 3 Pendulum Lab-2
Document7 pages
Week 3 Pendulum Lab-2
Faiz Jillani
No ratings yet
Sidiq Subandriyanto Setyawan: Statement of Participation
Document2 pages
Sidiq Subandriyanto Setyawan: Statement of Participation
setyawan punk
No ratings yet
DIP Notes Unit-2
Document159 pages
DIP Notes Unit-2
gfgfdgf
No ratings yet
Code nhập key Windows - Office.0
Document3 pages
Code nhập key Windows - Office.0
ank1805
No ratings yet
Nerc'S New Control Performance Stand: Nasser Jaleelit and Louis
Document8 pages
Nerc'S New Control Performance Stand: Nasser Jaleelit and Louis
Woody Tzen Tham
No ratings yet
Narayanan and Darwish-Use of Steel Fibre As Shear Reinforcement
Document12 pages
Narayanan and Darwish-Use of Steel Fibre As Shear Reinforcement
Smith Abutu Simon John
No ratings yet
Chemical Oxygen Demand THEO
Document1 page
Chemical Oxygen Demand THEO
Nill Patrick Ulat Dulce
No ratings yet
Online Test One, Grade 10
Document5 pages
Online Test One, Grade 10
olgashanty
No ratings yet
Zodiac Names - Choices For A Sagittarius Baby - Baby Name Blog - Nameberry PDF
Document16 pages
Zodiac Names - Choices For A Sagittarius Baby - Baby Name Blog - Nameberry PDF
justbase
No ratings yet
What Is Organizational Behavior.. HRM ELEC CHAPTER 1
Document7 pages
What Is Organizational Behavior.. HRM ELEC CHAPTER 1
Michelle Rible
No ratings yet
Safety Data Sheet: Section 1: Identification of The Substance/mixture and of The Company/undertaking
Document7 pages
Safety Data Sheet: Section 1: Identification of The Substance/mixture and of The Company/undertaking
BB
No ratings yet
NSSEMA 2023 Poster
Document1 page
NSSEMA 2023 Poster
sumitravashisht1954
No ratings yet
AF SDK Online Course - GetValue Method
Document2 pages
AF SDK Online Course - GetValue Method
Cleber Pereira
No ratings yet
Niaaa: Understanding Alcohol's Impact On Health
Document2 pages
Niaaa: Understanding Alcohol's Impact On Health
PROMISE OGBONNAYA
No ratings yet