Welcome to Scribd!

0 1 Intro Notations

Uploaded by

0% found this document useful (0 votes)

6 views6 pages

The document discusses notations used in supervised discriminative learning. It notes that labeled training data contains inputs x and corresponding classifications or regressions y. A neural network is parameterized by weights θ and represented as a function fθ that maps inputs to outputs. The network is trained discriminatively by minimizing a loss function on the training set or per example, plus a regularization term. For regression networks specifically, mean squared error is commonly used as the loss function with L2 regularization.

Original Description:

Original Title

0_1_intro_Notations

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

6 views6 pages

0 1 Intro Notations

Uploaded by

mahdi nematshahi

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 6

Search inside document

Notations

(supervised discriminative learning)
Notation

Supervised Learning

Labelled training data: 𝒟 𝐱 ,𝑦 :

• input 𝐱∈𝒳 ℝ
• classification: 𝑦∈𝒴 0,1, … , 𝑘 1
• regression: 𝑦∈𝒴 ℝ

We say, a neural network is parametrized by 𝜽 ∈ Θ ℝ (its weights)
and is represented by 𝑓𝜽 : 𝒳 → 𝒴

9/4/2022 KTH ‐ DD2412 2

Notation

A neural network function

y 𝑓𝜽 𝐱

𝐱 𝑦

9/4/2022 KTH ‐ DD2412 3

Notation

Training a deep network discriminatively

𝜽∗ argmin𝜽 ℒ 𝒟 loss function on the whole training set

𝜽∗ argmin𝜽 𝑙 𝑓𝜽 𝐱 , 𝑦 loss function per training example

𝜽∗ argmin𝜽 𝑙 𝑓𝜽 𝐱 , 𝑦 Ω 𝜽 a regularization term

9/4/2022 KTH ‐ DD2412 4

Notation

Example: Deep Regression Networks

𝜽∗ argmin𝜽 𝑙 𝑓𝜽 𝐱 , 𝑦 Ω 𝜽

mean squared error (MSE)

𝜽∗ argmin𝜽 𝑓𝜽 𝐱 𝑦 𝜽

L2 regularization (similar to weight decay)

9/4/2022 KTH ‐ DD2412 5

Reflection/Discussion point

What do the following two
statements mean?
• Standard deep (regression) networks give a
point estimate prediction
i.e., 𝑦 𝑎𝑟𝑔𝑚𝑎𝑥 𝑃 𝑦|𝐱

• Standard deep network weights are a point
3 minutes
estimate of the parameter distribution
i.e., 𝜽∗ 𝑎𝑟𝑔𝑚𝑎𝑥𝜽 𝑃 𝜽 𝒟

• What is the alternative?
6

Neural Networks
Document40 pages
Neural Networks
salemamr1010
No ratings yet
Global Sparse Momentum SGD For Pruning Very Deep Neural Networks
Document13 pages
Global Sparse Momentum SGD For Pruning Very Deep Neural Networks
chuliang guo
No ratings yet
Thanks To XYZ Agency For Funding
Document5 pages
Thanks To XYZ Agency For Funding
Serkalem Negusse
No ratings yet
Slide07 Haykin Chapter 7: Committee Machines
Document8 pages
Slide07 Haykin Chapter 7: Committee Machines
hossein_kho
No ratings yet
Lec9 CNN 25jan18
Document111 pages
Lec9 CNN 25jan18
Trần Văn Duy
No ratings yet
CIS537 Tool NeuralNetworkCheatSheet
Document2 pages
CIS537 Tool NeuralNetworkCheatSheet
yohhong
No ratings yet
Sensors: Self-Supervised Point Set Local Descriptors For Point Cloud Registration
Document18 pages
Sensors: Self-Supervised Point Set Local Descriptors For Point Cloud Registration
SouhailKel
No ratings yet
Pin The Memory: Learning To Generalize Semantic Segmentation
Document19 pages
Pin The Memory: Learning To Generalize Semantic Segmentation
fakloda khan
No ratings yet
Use of Physics-Informed Neural Networks For Ageing Prediction and Lifetime Extension of Wind Farm Components
Document12 pages
Use of Physics-Informed Neural Networks For Ageing Prediction and Lifetime Extension of Wind Farm Components
malekpour_ahmad
No ratings yet
Done
Document6 pages
Done
rayanemouzali3230
No ratings yet
Dic Image Segmentation of Dense Cell Populations
Document5 pages
Dic Image Segmentation of Dense Cell Populations
saujanya rao
No ratings yet
DLCV Ch2 Neural Network
Document68 pages
DLCV Ch2 Neural Network
Mario Parot
No ratings yet
Machine Learning Techniques For Classifying Network Anomalies and Intrusions Revised
Document5 pages
Machine Learning Techniques For Classifying Network Anomalies and Intrusions Revised
Aditi Biswas
No ratings yet
Fast-Track Semester 2022: Technical Answers To Real-World Problems
Document11 pages
Fast-Track Semester 2022: Technical Answers To Real-World Problems
STYX
No ratings yet
Clustering Techniques - Utkarsh Kulshrestha
Document25 pages
Clustering Techniques - Utkarsh Kulshrestha
N Mahesh
No ratings yet
Deep Learning QP
Document4 pages
Deep Learning QP
Gowri Ilayaraja
No ratings yet
Image Based Classification
Document8 pages
Image Based Classification
Muhammad Sami Ullah
No ratings yet
On Loss Functions For Deep Neural Networks in Classification Katarzyna Janocha, Wojciech Marian Czarnecki
Document10 pages
On Loss Functions For Deep Neural Networks in Classification Katarzyna Janocha, Wojciech Marian Czarnecki
ingaleharshal
No ratings yet
4 NN Regularization
Document13 pages
4 NN Regularization
virgulatiit21a1068
No ratings yet
Lec5-Adts 6up
Document4 pages
Lec5-Adts 6up
Kapil Kumar
No ratings yet
Aggregated Residual Transformations For Deep Neural Networks
Document10 pages
Aggregated Residual Transformations For Deep Neural Networks
blackicebattle
No ratings yet
Part PDF
Document43 pages
Part PDF
Srinu Sehwag
No ratings yet
Rainbow - Combining Improvements in Deep Reinforcement Learning (1710.02298)
Document14 pages
Rainbow - Combining Improvements in Deep Reinforcement Learning (1710.02298)
koveje
No ratings yet
Hierarchical Action Classification With Network
Document14 pages
Hierarchical Action Classification With Network
huicheng chen
No ratings yet
Persistency of Excitation For Robustness of Neural Networks: Kamil Nar S. Shankar Sastry
Document38 pages
Persistency of Excitation For Robustness of Neural Networks: Kamil Nar S. Shankar Sastry
Shad
No ratings yet
Deep Learning Handson
Document65 pages
Deep Learning Handson
Alan
No ratings yet
21-Data Clustering (K-Means Clustering Algorithm), Predictive Analytics-11!04!2023
Document41 pages
21-Data Clustering (K-Means Clustering Algorithm), Predictive Analytics-11!04!2023
Shubham Kodilkar
No ratings yet
Clustering With Deep Learning: Taxonomy and New Methods
Document12 pages
Clustering With Deep Learning: Taxonomy and New Methods
rather Aarif
No ratings yet
Simple, Distributed, and Accelerated Probabilistic Programming
Document16 pages
Simple, Distributed, and Accelerated Probabilistic Programming
M Feldy Riza
No ratings yet
Binaryconnect: Training Deep Neural Networks With Binary Weights During Propagations
Document9 pages
Binaryconnect: Training Deep Neural Networks With Binary Weights During Propagations
Chanon Tonmai
No ratings yet
AIML-Module-3-part 2
Document122 pages
AIML-Module-3-part 2
srujanmoily
No ratings yet
Machine Review: University, Greater Noida, Uttar Pradesh, India
Document6 pages
Machine Review: University, Greater Noida, Uttar Pradesh, India
Vaibhav
No ratings yet
Dr. Meenakshi Sood Associate Professor, NITTTR Chandigarh: Meenkashi@nitttrchd - Ac.in
Document39 pages
Dr. Meenakshi Sood Associate Professor, NITTTR Chandigarh: Meenkashi@nitttrchd - Ac.in
Ravikumaar Rayala
No ratings yet
A Neural Network Approach To Ordinal Regression
Document6 pages
A Neural Network Approach To Ordinal Regression
张拓
No ratings yet
The Decision Tree Classifier: Design and Potential: Abstmct-Tiús Paper Presents The Basic Concepts of A Multistage
Document6 pages
The Decision Tree Classifier: Design and Potential: Abstmct-Tiús Paper Presents The Basic Concepts of A Multistage
Klissman Morales Olabarrera
No ratings yet
Activation Function
Document44 pages
Activation Function
SANJIDA AKTER
No ratings yet
Ternary Weight Networks: 30th Conference On Neural Information Processing Systems (NIPS 2016), Barcelona, Spain
Document5 pages
Ternary Weight Networks: 30th Conference On Neural Information Processing Systems (NIPS 2016), Barcelona, Spain
tayyabmujahid
No ratings yet
Cambricon Q
Document14 pages
Cambricon Q
Yu Qian
No ratings yet
Knime Project Report
Document12 pages
Knime Project Report
Ansh Rohatgi
No ratings yet
Machine Learning: Chapter 4. Artificial Neural Networks
Document34 pages
Machine Learning: Chapter 4. Artificial Neural Networks
fareenfarzanawahed
No ratings yet
Neural Networks
Document39 pages
Neural Networks
Melkamu Gebeyehu
No ratings yet
Xu Weakly Supervised Semantic Point Cloud Segmentation Towards 10x Fewer Labels CVPR 2020 Paper
Document10 pages
Xu Weakly Supervised Semantic Point Cloud Segmentation Towards 10x Fewer Labels CVPR 2020 Paper
Youness ABOUQORA
No ratings yet
Sampling Techniques in Bayesian Target Encoding: June 2020
Document13 pages
Sampling Techniques in Bayesian Target Encoding: June 2020
marboe
No ratings yet
Domain Generalization On Constrained
Document12 pages
Domain Generalization On Constrained
Randa
No ratings yet
6-DeepVisualLearning L6
Document82 pages
6-DeepVisualLearning L6
hisisthesongoficeandfire
No ratings yet
(IJCST-V10I4P14) :manish Chava, Aman Agarwal, DR Radha K
Document5 pages
(IJCST-V10I4P14) :manish Chava, Aman Agarwal, DR Radha K
EighthSenseGroup
No ratings yet
CS60010: Deep Learning CNN - Part 1: Sudeshna Sarkar
Document64 pages
CS60010: Deep Learning CNN - Part 1: Sudeshna Sarkar
DEEP ROY
No ratings yet
sc09 Fluid Sim Cohen
Document33 pages
sc09 Fluid Sim Cohen
Valentino
No ratings yet
Tablada Et Al (2015) - DCT Approximations Based On Feig-Winograd Algorithm
Document14 pages
Tablada Et Al (2015) - DCT Approximations Based On Feig-Winograd Algorithm
Claudio Javier Tablada
No ratings yet
Tutorial Pres 1
Document28 pages
Tutorial Pres 1
Jonas Jixiao Wang
No ratings yet
Scalpel: Customizing DNN Pruning To The Underlying Hardware Parallelism
Document13 pages
Scalpel: Customizing DNN Pruning To The Underlying Hardware Parallelism
ali shaarawy
No ratings yet
Guest-Lecture - NN Architectures
Document64 pages
Guest-Lecture - NN Architectures
sk1029
No ratings yet
Hybrid Discriminative-Generative Training Via COnstrrastive Leanring
Document14 pages
Hybrid Discriminative-Generative Training Via COnstrrastive Leanring
bobabeari989
No ratings yet
Neural Execution Engines: Learning To Execute Subroutines: Work Completed During An Internship at Google
Document21 pages
Neural Execution Engines: Learning To Execute Subroutines: Work Completed During An Internship at Google
walter hu
No ratings yet
Integer Quantization For Deep Learning Inference
Document20 pages
Integer Quantization For Deep Learning Inference
jifik60153
No ratings yet
Pplication of Deep Reinforcement Learning For Ndian Stock Trading Automation
Document9 pages
Pplication of Deep Reinforcement Learning For Ndian Stock Trading Automation
ekene
No ratings yet
Pam-Crash Tutorial 1-2 ElasticCantilever
Document32 pages
Pam-Crash Tutorial 1-2 ElasticCantilever
Udham
No ratings yet
Evaluating Hedge Fund and CTA Performance: Data Envelopment Analysis Approach
From Everand
Evaluating Hedge Fund and CTA Performance: Data Envelopment Analysis Approach
Greg N. Gregoriou
No ratings yet
A Computational Framework for Segmentation and Grouping
From Everand
A Computational Framework for Segmentation and Grouping
G. Medioni
No ratings yet
Object-Oriented Information Engineering: Analysis, Design, and Implementation
From Everand
Object-Oriented Information Engineering: Analysis, Design, and Implementation
Stephen Montgomery
No ratings yet