
The game of Go has long been viewed as the most challenging of classic games for artificial intelligence due to its enormous search space and the difficulty of evaluating board positions and moves. We introduce a new approach to computer Go that uses value networks to evaluate board positions and policy networks to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte-Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte-Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.

We train the neural networks using a pipeline consisting of several stages of machine learning (Figure 1). We begin by training a supervised learning (SL) policy network pσ directly from expert human moves. This provides fast, efficient learning updates with immediate feedback and high-quality gradients. Similar to prior work [13, 15], we also train a fast policy pπ that can rapidly sample actions during rollouts. Next, we train a reinforcement learning (RL) policy network pρ that improves the SL policy network by optimising the final outcome of games of self-play. This adjusts the policy towards the correct goal of winning games, rather than maximising predictive accuracy. Finally, we train a value network vθ that predicts the winner of games played by the RL policy network against itself. Our program AlphaGo efficiently combines the policy and value networks with MCTS.
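
As a concrete illustration of this pipeline, the sketch below expresses the three training stages in PyTorch on random stand-in data. The flat board encoding, network sizes, optimisers, learning rates and batch sizes are illustrative assumptions, not AlphaGo's architecture or training setup, and the fast rollout policy pπ is omitted.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

BOARD = 19 * 19  # a flattened 19x19 board as a toy input representation

def make_policy() -> nn.Module:
    # Stand-in for the convolutional policy networks: a small MLP over the flat board.
    return nn.Sequential(nn.Linear(BOARD, 256), nn.ReLU(), nn.Linear(256, BOARD))

# Stage 1: supervised learning (SL) policy p_sigma -- predict expert moves.
p_sigma = make_policy()
opt = torch.optim.SGD(p_sigma.parameters(), lr=1e-2)
expert_states = torch.randn(64, BOARD)           # stand-in expert positions
expert_moves = torch.randint(0, BOARD, (64,))    # stand-in expert move labels
loss = F.cross_entropy(p_sigma(expert_states), expert_moves)
opt.zero_grad(); loss.backward(); opt.step()

# Stage 2: reinforcement learning (RL) policy p_rho -- initialised from p_sigma,
# then improved with a REINFORCE-style policy gradient weighted by the game
# outcome z (+1 win, -1 loss) of self-play games.
p_rho = make_policy()
p_rho.load_state_dict(p_sigma.state_dict())
opt = torch.optim.SGD(p_rho.parameters(), lr=1e-3)
selfplay_states = torch.randn(32, BOARD)         # stand-in self-play positions
dist = torch.distributions.Categorical(logits=p_rho(selfplay_states))
moves = dist.sample()
z = torch.randint(0, 2, (32,)).float() * 2 - 1   # stand-in outcomes in {-1, +1}
loss = -(dist.log_prob(moves) * z).mean()
opt.zero_grad(); loss.backward(); opt.step()

# Stage 3: value network v_theta -- regress self-play positions onto the outcome.
v_theta = nn.Sequential(nn.Linear(BOARD, 256), nn.ReLU(), nn.Linear(256, 1))
opt = torch.optim.SGD(v_theta.parameters(), lr=1e-2)
loss = F.mse_loss(v_theta(selfplay_states).squeeze(-1), z)
opt.zero_grad(); loss.backward(); opt.step()
```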

AlphaGo combines the policy and value networks in an MCTS algorithm (Figure 3) that selects actions by lookahead search. Each edge (s, a) of the search tree stores an action value Q(s, a), visit count N(s, a), and prior probability P(s, a). The tree is traversed by simulation (i.e. descending the tree in complete games without backup), starting from the root state. At each time-step t of each simulation, an action a_t is selected from state s_t so as to maximise action value plus a bonus,

    a_t = argmax_a ( Q(s_t, a) + u(s_t, a) ),

where the bonus u(s, a) ∝ P(s, a) / (1 + N(s, a)) is proportional to the prior probability but decays with repeated visits to encourage exploration.
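
A small Python sketch of this selection rule is given below; it is not AlphaGo's implementation. The dictionary-based edge storage and the constant c_puct scaling the exploration bonus are illustrative assumptions.

```python
from dataclasses import dataclass

@dataclass
class Edge:
    Q: float = 0.0   # mean action value
    N: int = 0       # visit count
    P: float = 0.0   # prior probability from the policy network

def select_action(edges: dict[int, Edge], c_puct: float = 5.0) -> int:
    """Return the action a maximising Q(s, a) + u(s, a) among a node's edges."""
    def score(a: int) -> float:
        e = edges[a]
        u = c_puct * e.P / (1 + e.N)   # exploration bonus, decays with visits
        return e.Q + u
    return max(edges, key=score)

# Example: three candidate moves with different priors and statistics.
edges = {0: Edge(Q=0.2, N=10, P=0.5), 1: Edge(Q=0.1, N=1, P=0.4), 2: Edge(Q=0.0, N=0, P=0.1)}
print(select_action(edges))
```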

At the end of simulation n, the action values and visit counts of all traversed edges are updated. Each edge accumulates the visit count and mean evaluation of all simulations passing through that edge:

    N(s, a) = Σ_{i=1..n} 1(s, a, i),
    Q(s, a) = (1 / N(s, a)) Σ_{i=1..n} 1(s, a, i) V(s_L^i),

where s_L^i is the leaf node of the ith simulation and 1(s, a, i) indicates whether edge (s, a) was traversed during that simulation.
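
The backup step amounts to incrementing each traversed edge's visit count and folding the leaf evaluation into a running mean. A minimal sketch, reusing the Edge class from the selection sketch above (names and structure are assumptions, not AlphaGo's code):

```python
def backup(path: list[Edge], leaf_value: float) -> None:
    """Update N and Q along the traversed path with the simulation's evaluation."""
    for edge in path:
        edge.N += 1
        # incremental mean: Q <- Q + (V - Q) / N
        edge.Q += (leaf_value - edge.Q) / edge.N
```
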
Evaluating policy and value networks requires several orders of magnitude more computation than traditional search heuristics. To efficiently combine MCTS with deep neural networks, AlphaGo uses an asynchronous multi-threaded search that executes simulations on CPUs, and computes policy and value networks in parallel on GPUs. The final version of AlphaGo used 40 search threads, 48 CPUs, and 8 GPUs. We also implemented a distributed version of AlphaGo that exploited multiple machines, 40 search threads, 1202 CPUs and 176 GPUs. The Methods section provides full details of asynchronous and distributed MCTS.
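
The producer/consumer pattern this implies can be sketched as follows, under simplifying assumptions: CPU search threads queue leaf positions, and a single evaluator thread batches them for the network call that would run on a GPU. evaluate_batch, the batching policy and the thread counts here are hypothetical stand-ins, not AlphaGo's actual interfaces.

```python
import queue
import random
import threading

def evaluate_batch(positions):
    # Hypothetical stand-in for a batched policy/value network call on a GPU.
    return [random.uniform(-1.0, 1.0) for _ in positions]

eval_queue: queue.Queue = queue.Queue()

def evaluator(batch_size: int = 8) -> None:
    # Collect leaf positions from the search threads and evaluate them in batches.
    while True:
        batch = [eval_queue.get()]
        while len(batch) < batch_size:
            try:
                batch.append(eval_queue.get_nowait())
            except queue.Empty:
                break
        values = evaluate_batch([pos for pos, _ in batch])
        for (_, reply), value in zip(batch, values):
            reply.put(value)

def search_thread(n_simulations: int = 100) -> None:
    # Each simulation descends the tree on the CPU, then blocks on the batched
    # network evaluation before backing the value up the traversed path.
    for _ in range(n_simulations):
        leaf = object()                     # stand-in for a leaf position
        reply: queue.Queue = queue.Queue(maxsize=1)
        eval_queue.put((leaf, reply))
        value = reply.get()                 # wait for the evaluator thread
        # ... back `value` up the traversed edges here ...

threading.Thread(target=evaluator, daemon=True).start()
workers = [threading.Thread(target=search_thread) for _ in range(4)]
for w in workers:
    w.start()
for w in workers:
    w.join()
```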

In this work we have developed a Go program, based on a combination of deep neural networks and tree search, that plays at the level of the strongest human players, thereby achieving one of artificial intelligence's grand challenges [32–34]. We have developed, for the first time, effective move selection and position evaluation functions for Go, based on deep neural networks that are trained by a novel combination of supervised and reinforcement learning. We have introduced a new search algorithm that successfully combines neural network evaluations with Monte-Carlo rollouts. Our program AlphaGo integrates these components together, at scale, in a high-performance tree search engine.
