
Bayesian Interpolation with

Deep Linear Networks

Boris Hanin (Princeton ORFE) and Alexander Zlokapa (MIT Physics)

PNAS 2023 (arXiv:2212.14457)


Motivating Questions

Given: a Model, Data, and a training Alg(orithm).

Q1. How do the model, the data, and the algorithm jointly affect learning?

● For linear models, generalization depends on the ratio of dataset size to input dimension (Marchenko-Pastur).

● The deviation from a linear model depends on the network's depth-to-width ratio.

Q2. How does one analyze learning in non-linear models?

Q3. How do models make predictions on test data?
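The Marchenko-Pastur point in Q1 can be illustrated numerically. This is my own sketch, not from the talk: the eigenvalue spectrum of a sample covariance matrix built from `P` Gaussian samples in `N0` dimensions depends only on the ratio `N0 / P`, with support edges `(1 ± sqrt(N0/P))^2`.

```python
import numpy as np

# Empirical spectrum of a sample covariance matrix vs. the
# Marchenko-Pastur law. P samples in N0 dimensions; the spectrum
# of (1/P) X^T X depends only on the ratio gamma = N0 / P.
rng = np.random.default_rng(0)
P, N0 = 4000, 1000          # dataset size and input dimension (illustrative)
gamma = N0 / P

X = rng.standard_normal((P, N0))
eigs = np.linalg.eigvalsh(X.T @ X / P)

# Marchenko-Pastur support edges for unit-variance entries.
lam_minus = (1 - np.sqrt(gamma)) ** 2
lam_plus = (1 + np.sqrt(gamma)) ** 2

print(f"empirical range: [{eigs.min():.3f}, {eigs.max():.3f}]")
print(f"MP support:      [{lam_minus:.3f}, {lam_plus:.3f}]")
```

At this size the extreme empirical eigenvalues land within a few percent of the predicted edges.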


This Work: Q1 - Q3 for Linear Networks

Model. A fully connected network with identity non-linearity, so the network computes a product of weight matrices (a deep linear network).

Data. A training set of input-output pairs that the network is asked to fit exactly.

Alg. Bayesian inference at zero temperature: the posterior is the prior over weights conditioned on exact interpolation of the training data.
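Zero-temperature inference can be sketched in the simplest (depth-1) special case, which is not the paper's deep setting but makes the conditioning concrete: with an isotropic Gaussian prior on the weights and the constraint of exact interpolation, the posterior mean is the minimum-norm interpolant.

```python
import numpy as np

# Zero-temperature Bayesian inference in a depth-1 linear model
# (illustrative special case): with an isotropic Gaussian prior on w
# and the constraint Xw = y enforced exactly (zero noise), the
# posterior is Gaussian on the solution subspace and its mean is the
# minimum-norm interpolant X^+ y.
rng = np.random.default_rng(1)
P, N0 = 20, 50                      # fewer data points than parameters
X = rng.standard_normal((P, N0))
y = rng.standard_normal(P)

w_mean = np.linalg.pinv(X) @ y      # posterior mean = min-norm interpolant

print("residual:", np.linalg.norm(X @ w_mean - y))
```

The residual is zero to machine precision: the posterior mean interpolates the training data exactly, as the zero-temperature constraint demands.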


Key Theorems, Informally

T1. Bayesian inference is exactly solvable.

T2. The prior and the posterior each have an effective depth, determined jointly by the network's depth, width, and the number of training points.

T3. Deep networks with universal priors learn the same posteriors as shallow networks with optimal data-dependent priors.
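A small Monte Carlo sketch (my own illustration, not from the paper) of why depth relative to width matters for the prior: the entries of a product of L independent N x N Gaussian matrices become increasingly heavy-tailed as L/N grows, so a deep linear network's prior over its end-to-end map is far from Gaussian at large depth-to-width ratio.

```python
import numpy as np

# Illustration (not from the paper): the prior over the end-to-end map
# of a deep linear network is a product of Gaussian weight matrices.
# Its entries develop heavy tails as the depth-to-width ratio L/N grows,
# one way an "effective depth" of the prior shows up.
rng = np.random.default_rng(2)

def end_to_end_entry(L, N, trials=2000):
    """Sample the (0,0) entry of a product of L iid N(0, 1/N) matrices."""
    out = np.empty(trials)
    for t in range(trials):
        M = np.eye(N)
        for _ in range(L):
            M = M @ rng.standard_normal((N, N)) / np.sqrt(N)
        out[t] = M[0, 0]
    return out

def excess_kurtosis(x):
    x = x - x.mean()
    return (x**4).mean() / (x**2).mean() ** 2 - 3.0

shallow = end_to_end_entry(L=1, N=32)   # L/N small: near-Gaussian
deep = end_to_end_entry(L=16, N=32)     # L/N = 1/2: heavy-tailed
print("excess kurtosis, shallow:", excess_kurtosis(shallow))
print("excess kurtosis, deep:   ", excess_kurtosis(deep))
```

The shallow sample's excess kurtosis is near zero (Gaussian), while the deep sample's is well above it.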
Structure of Prior/Posterior

● A learnable, data-dependent scale governs predictions in new directions.
Evidence and Posterior Via G-Functions

Thm. The Bayesian evidence and the predictive posterior admit exact closed-form expressions in terms of Meijer G-functions.
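Meijer G-functions are available in standard libraries. As a minimal sketch (not the paper's actual formula), `mpmath.meijerg` evaluates them numerically; the classical identity G^{1,0}_{0,1}(x | -; 0) = e^{-x} gives an easy correctness check.

```python
import mpmath as mp

# Minimal sketch (not the paper's formula): evaluating a Meijer
# G-function numerically with mpmath. The identity
# G^{1,0}_{0,1}(x | -; 0) = exp(-x) serves as a sanity check.
x = 1.7
val = mp.meijerg([[], []], [[0], []], x)   # a_s = [[],[]], b_s = [[0],[]]
print(val, mp.e**(-x))
```

The two printed values agree to working precision, confirming the argument grouping (`a_s` then `b_s`, each split into "numerator" and "denominator" parameter lists).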
Phase Diagram for Deep Linear Posteriors

[Figure: phase diagram of deep linear posteriors]
Deep Linear Networks Learn Optimal Features

Thm. A deep linear network with a universal (data-agnostic) prior has the same posterior as a shallow network equipped with an optimal, data-dependent prior; in this sense, depth lets the network learn optimal features.