Welcome to Scribd!

Lec 3

Uploaded by

0% found this document useful (0 votes)

9 views7 pages

This document provides an overview and introduction to PyTorch for a deep reinforcement learning course. It explains that the goal of the course is to train agents to perform useful tasks. It then introduces PyTorch as a Python library for defining neural networks, computing gradients automatically, and other functions like working with datasets and optimizers. The document explains that PyTorch allows you to define a neural network and it will automatically compute the gradients to train the model, focusing on the key aspects of defining and training neural networks. It concludes by providing a link to a PyTorch tutorial on Colab.

Original Description:

Original Title

lec-3

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

9 views7 pages

Lec 3

Uploaded by

张竞择

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 7

Search inside document

PyTorch and Neural Nets

CS285 Deep RL
Instructor: Marwa Abdulhai

[Adapted from Kevin Li’s CS285 Fa21 Slides]

Goal of this course
Train an agent to perform useful tasks

𝜋! (𝑎|𝑠) )"#$ = 𝑓! (𝑠" , 𝑎" )

∆
𝑄! (𝑠, 𝑎)
train the model

data agent

collect data
Goal of this course
Train an agent to perform useful tasks

𝜋! (𝑎|𝑠) )"#$ = 𝑓! (𝑠" , 𝑎" )

∆
𝑄! (𝑠, 𝑎)
train the
train model
the model

focus for today’s lecture!

data agent

collect data
How do train a model?
𝜃 ∗ = arg min" ∑ #,% ∈' ℒ( 𝑓" 𝑥 , 𝑦)

dataset neural network

gradient
descent loss

PyTorch does all of these!

What is PyTorch?

Python library for:

• Defining neural networks
• Automating computing gradients
• And more! (datasets, optimizers,
GPUs, etc.)
How does PyTorch work?

You define: ℎ! = 𝜎 𝑊! 𝑥 ℎ" = 𝜎 𝑊" ℎ! 𝑦 = 𝜎 𝑊# ℎ"

$% $% $'" $'! $% $% $'" 𝜕𝑦
PyTorch computes: = =
$&! $'" $'! $&! $&" $'" $&! 𝜕𝑊#

[picture from Stanford’s CS231n]

PyTorch Tutorial (Colab)
https://colab.research.google.com/drive/1XQu1mUbGtvkQY-D7_YCOZlRzSnjp4u9f?usp=sharing

https://bit.ly/3CM6lcf

Content
Document12 pages
Content
Prashanth Shetty
No ratings yet
Data Analytics Using Python Lab Manual
Document8 pages
Data Analytics Using Python Lab Manual
Nayana Gowda
0% (1)
Lecture 5
Document31 pages
Lecture 5
ChatGeePeeTee
No ratings yet
HTML Code
Document1 page
HTML Code
Navkaran Singh
No ratings yet
MCS 011 Solved Assignments 2016
Document10 pages
MCS 011 Solved Assignments 2016
Sourav Purkait
No ratings yet
What Is PCA: When Should You Use PCA?
Document21 pages
What Is PCA: When Should You Use PCA?
sailusha
No ratings yet
ML Shristi File
Document49 pages
ML Shristi File
RAVI PARKASH
No ratings yet
Shashidhar-18csl76 Final
Document19 pages
Shashidhar-18csl76 Final
Mohammad Athiq kamran
No ratings yet
Pertemuan 13
Document3 pages
Pertemuan 13
gilang
No ratings yet
Linear and Logistic Regression
Document6 pages
Linear and Logistic Regression
Mahevish Fatima
No ratings yet
Principal Component Analysis Notes : Info
Document22 pages
Principal Component Analysis Notes : Info
VALMICK GUHA
No ratings yet
School of Engineering: Lab Manual On Machine Learning Lab
Document23 pages
School of Engineering: Lab Manual On Machine Learning Lab
Ravi Kumawat
No ratings yet
Cs3381-Oops Lab Manual
Document40 pages
Cs3381-Oops Lab Manual
passion job
No ratings yet
CSPF
Document28 pages
CSPF
shivanimishra0093
No ratings yet
Data Structures Lab Record Format
Document37 pages
Data Structures Lab Record Format
t3137303
No ratings yet
Final Practical File 2022-23
Document87 pages
Final Practical File 2022-23
Kuldeep Singh
No ratings yet
03 A Polynomial Linear Regression
Document6 pages
03 A Polynomial Linear Regression
Gabriel Gheorghe
No ratings yet
DSA Lab
Document90 pages
DSA Lab
Mahaan nidhi
No ratings yet
Ex - No. 13 22.03.2011 Eigen Value Extraction by Power Method
Document4 pages
Ex - No. 13 22.03.2011 Eigen Value Extraction by Power Method
Joyson Silva
No ratings yet
Gradient Descent Vizcs229 PDF
Document7 pages
Gradient Descent Vizcs229 PDF
Alankar Gupta
No ratings yet
LIVE 6 DP Inbuilt DS
Document12 pages
LIVE 6 DP Inbuilt DS
harrysmani
No ratings yet
DSA Labmanual PDF
Document72 pages
DSA Labmanual PDF
ANIMESH ANAND
No ratings yet
Rendering Worlds With Two Triangles With Raytracing On The GPU in 4096 Bytes
Document57 pages
Rendering Worlds With Two Triangles With Raytracing On The GPU in 4096 Bytes
Alma Körte
No ratings yet
5 Ejercicio - Experimentación Con Los Modelos de Regresión Más Eficaces - Training - Microsoft Learn Ingles
Document9 pages
5 Ejercicio - Experimentación Con Los Modelos de Regresión Más Eficaces - Training - Microsoft Learn Ingles
acxel david castillo casas
No ratings yet
Practical Manual ON Neural Networks: Dr. Dharm Singh Dr. Naveen Choudhary
Document43 pages
Practical Manual ON Neural Networks: Dr. Dharm Singh Dr. Naveen Choudhary
1balamanian
No ratings yet
Welcome To Colaboratory - Colaboratory
Document5 pages
Welcome To Colaboratory - Colaboratory
chairahmaulida
No ratings yet
Face Detection & Emotion Recognition
Document26 pages
Face Detection & Emotion Recognition
Nishu Tiwari
No ratings yet
361 Project Code
Document10 pages
361 Project Code
skdlf
No ratings yet
01 Introduction A
Document50 pages
01 Introduction A
Duy Hùng Đào
No ratings yet
Twittermining: 1 Twitter Text Mining - Required Libraries
Document4 pages
Twittermining: 1 Twitter Text Mining - Required Libraries
Samuel Peoples
No ratings yet
Posit Cloud 2
Document1 page
Posit Cloud 2
ayushiitesh423
No ratings yet
Explain Machine Learning Model Using SHAP
Document28 pages
Explain Machine Learning Model Using SHAP
sediha5178
No ratings yet
AD3301 - Numpy - and - Pandas - Ipynb - Colaboratory
Document18 pages
AD3301 - Numpy - and - Pandas - Ipynb - Colaboratory
palaniappan.cse
No ratings yet
Python Myssql Programs For Practical File Class 12 Ip
Document26 pages
Python Myssql Programs For Practical File Class 12 Ip
Pragyanand Singh
No ratings yet
Untitled
Document51 pages
Untitled
Abhishek Patel
No ratings yet
Autoencoder
Document24 pages
Autoencoder
Akash Savaliya
No ratings yet
3PA07 - 11519645 - Demita Riassekar Cututri 8-Puzzles - Ipynb
Document6 pages
3PA07 - 11519645 - Demita Riassekar Cututri 8-Puzzles - Ipynb
3PA07 Demita Riassekar Cututri
No ratings yet
Matlab Talk
Document43 pages
Matlab Talk
hrishikeshanand
No ratings yet
1 An Introduction To Machine Learning With Scikit Learn
Document2 pages
1 An Introduction To Machine Learning With Scikit Learn
kehinde.ogunleye007
No ratings yet
Machine Learning Practice
Document31 pages
Machine Learning Practice
Manish Kumar
No ratings yet
EJ2SEMANA3
Document14 pages
EJ2SEMANA3
fuck off we need limits
No ratings yet
ML Record
Document18 pages
ML Record
harshitsr1234
No ratings yet
Practical File (Xii - Ip Final)
Document35 pages
Practical File (Xii - Ip Final)
torin shah
No ratings yet
Mark Goldstein mg3479 A2 Code
Document4 pages
Mark Goldstein mg3479 A2 Code
joziah43
No ratings yet
Python Note 3
Document11 pages
Python Note 3
Coding Knowledge
No ratings yet
Lect2 Slides
Document24 pages
Lect2 Slides
tqbrowne
No ratings yet
Python Assignment 4
Document5 pages
Python Assignment 4
Jai Sai
No ratings yet
Py Set-5 To 7
Document14 pages
Py Set-5 To 7
Drive User
No ratings yet
High Performance Computing in Web Browsers: CE Seminar WT14/15 Henning Lohse
Document33 pages
High Performance Computing in Web Browsers: CE Seminar WT14/15 Henning Lohse
Rudolf Schreiber
No ratings yet
Matlabtalk 2
Document43 pages
Matlabtalk 2
Anand Balaji
No ratings yet
DataStructurePMR 1
Document80 pages
DataStructurePMR 1
Sandeep K. V
No ratings yet
Index: S.O. Name of Experiment Date No. Sign Remarks
Document27 pages
Index: S.O. Name of Experiment Date No. Sign Remarks
Aditya Upadhyay
No ratings yet
Ai Lab
Document48 pages
Ai Lab
vignesheai22
No ratings yet
ML - LAB Record
Document36 pages
ML - LAB Record
Bruhathi.S
No ratings yet
COMPUTER SCIENCE New
Document32 pages
COMPUTER SCIENCE New
Soham Chaudhuri
No ratings yet
CSE 3024: Web Mining: Lab Assessment - 3
Document13 pages
CSE 3024: Web Mining: Lab Assessment - 3
Nikitha Reddy
No ratings yet
DA0101EN-2-Review-Data-Wrangling - Jupyter Notebook
Document14 pages
DA0101EN-2-Review-Data-Wrangling - Jupyter Notebook
Sohail Doulah
No ratings yet
Data Structure & Analysis Maual
Document63 pages
Data Structure & Analysis Maual
Nayab Syed
No ratings yet
Lec 10
Document36 pages
Lec 10
kiuclairdelune
No ratings yet
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet