
Deep Reinforcement Learning for Flappy Bird

Kevin Chen
Stanford University

Abstract

Reinforcement learning is essential for training an agent to make smart decisions under uncertainty and to take small actions in order to achieve a higher overarching goal. In this project, we combined reinforcement learning and deep learning techniques to train an agent to play the game Flappy Bird. The challenge is that the agent only sees the pixels and the rewards, similar to a human player. Using just this information, it is able to successfully play the game at a human or sometimes super-human level.

Pipeline

[Figure: training pipeline - initialize replay memory and DQN; start a new episode; update the state from the next frame and replay memory; choose an action based on an epsilon-greedy policy; sample a minibatch from replay memory and update the DQN; if the bird crashes, start a new episode, otherwise continue to the next frame.]
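Below is a minimal sketch of the training loop the pipeline diagram describes. The environment interface (env.reset, env.step), the network wrapper (dqn.best_action, dqn.update), and the hyperparameter values are illustrative assumptions, not the poster's actual code.

```python
import random
from collections import deque

def train(env, dqn, num_episodes=10000, batch_size=32, epsilon=0.1, memory_size=50000):
    """DQN training loop with experience replay and an epsilon-greedy policy."""
    replay_memory = deque(maxlen=memory_size)      # initialize replay memory

    for _ in range(num_episodes):                  # start a new episode
        state = env.reset()
        done = False
        while not done:
            # choose action based on an epsilon-greedy policy (0 = do nothing, 1 = flap)
            if random.random() < epsilon:
                action = random.randint(0, 1)
            else:
                action = dqn.best_action(state)

            # advance the game one frame and observe the reward
            next_state, reward, done = env.step(action)

            # store the transition, then update the state from the next frame
            replay_memory.append((state, action, reward, next_state, done))
            state = next_state

            # sample a minibatch from replay memory and update the DQN
            if len(replay_memory) >= batch_size:
                minibatch = random.sample(replay_memory, batch_size)
                dqn.update(minibatch)
```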

Feature extractor and Deep Q-Network (DQN)

[Figure: feature extractor - convert the images in the state to grayscale, downsample each image to 84x84, and stack the n most recent frames into an n-channel input; the Deep Q-Network (DQN) then outputs a Q-value for each action.]
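A possible implementation of that feature extractor, assuming the game frames arrive as RGB NumPy arrays and using OpenCV for the conversion and resize; the function names and library choice are illustrative, not taken from the poster.

```python
import numpy as np
import cv2

def preprocess_frame(frame):
    """Convert an RGB game frame to grayscale and downsample it to 84x84."""
    gray = cv2.cvtColor(frame, cv2.COLOR_RGB2GRAY)
    return cv2.resize(gray, (84, 84))

def make_state(recent_frames, n=4):
    """Stack the n most recent preprocessed frames into an n-channel DQN input."""
    frames = [preprocess_frame(f) for f in recent_frames[-n:]]
    return np.stack(frames, axis=-1)   # shape: (84, 84, n)
```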


Related Work

[1] V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, D. Hassabis. Human-level control through deep reinforcement learning. Nature 518, 529-533 (2015).

[2] V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, M. Riedmiller. Playing Atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602, 2013.

Reinforcement Learning

State: the sequence of frames and actions seen so far,
$s_t = x_1, a_1, x_2, a_2, \ldots, x_{t-1}, a_{t-1}, x_t$

Action: Flap ($a = 1$) or Do nothing ($a = 0$)

Rewards: rewardAlive = +0.1, rewardPipe = +1.0, rewardDead = -1.0

Q-learning:
$Q^*(s, a) = \mathbb{E}_{s' \sim \mathcal{E}}\left[ r + \gamma \max_{a'} Q^*(s', a') \mid s, a \right]$
$Q_{i+1}(s, a) = \mathbb{E}_{s' \sim \mathcal{E}}\left[ r + \gamma \max_{a'} Q_i(s', a') \mid s, a \right]$

Loss:
$L_i(\theta_i) = \mathbb{E}_{s, a \sim \rho(\cdot)}\left[ \left( y_i - Q(s, a; \theta_i) \right)^2 \right]$
$y_i = \mathbb{E}_{s' \sim \mathcal{E}}\left[ r + \gamma \max_{a'} Q_{\text{target}}(s', a'; \theta_{\text{target}}) \mid s, a \right]$
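This loss is the squared difference between the DQN's prediction and a bootstrapped target computed with a separate target network. A sketch of how it could be computed in PyTorch follows; the tensor shapes, the q_net/target_net modules, and the discount value are assumptions for illustration.

```python
import torch
import torch.nn.functional as F

def dqn_loss(q_net, target_net, states, actions, rewards, next_states, dones, gamma=0.99):
    """Mean squared TD error (y_i - Q(s, a; theta_i))^2 over a minibatch."""
    # Q(s, a; theta_i) for the actions that were actually taken
    q_sa = q_net(states).gather(1, actions.unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        # y_i = r + gamma * max_a' Q_target(s', a'; theta_target); no bootstrapping past a crash
        max_next_q = target_net(next_states).max(dim=1).values
        y = rewards + gamma * max_next_q * (1.0 - dones)
    return F.mse_loss(q_sa, y)
```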
Experimental Results

[Figure: game screens at Easy, Medium, and Hard difficulty]

Average Score

Game difficulty | Human | Baseline (flap every n) | DQN (easy) | DQN (medium) | DQN (hard)
Easy            | Inf   | Inf                     | Inf        | Inf          | Inf
Medium          | Inf   | Inf                     | 0.7        | Inf          | Inf
Hard            | 21    | 0.5                     | 0.1        | 0.6          | 82.2

Highest Score Achieved

Game difficulty | Human | Baseline (flap every n) | DQN (easy) | DQN (medium) | DQN (hard)
Easy            | Inf   | Inf                     | Inf        | Inf          | Inf
Medium          | Inf   | 11                      | 2          | Inf          | Inf
Hard            | 65    | 1                       | 1          | 1            | 215
