Welcome to Scribd!

Self-Supervised Learning

Uploaded by

0% found this document useful (0 votes)

5 views11 pages

This document discusses self-supervised learning. [1] Self-supervised learning allows neural networks to learn useful representations from unlabeled data by defining pretext tasks where the model predicts structural aspects of the inputs. [2] Examples of self-supervised tasks discussed are image colorization, where the model predicts colors from grayscale images, image inpainting where the model predicts missing regions of images, and image super-resolution where the model predicts higher resolution images. [3] The document explains that these proxy tasks force networks to learn meaningful image semantics that can then be used for downstream tasks like object detection or segmentation, without requiring human annotation of large datasets.

Original Description:

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

5 views11 pages

Self-Supervised Learning

Uploaded by

Janvi

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 11

Search inside document

Self-Supervised Learning

Tutorial 19-08-2021
Supervised Learning
• The initial boost in the machine learning world came via the paradigm of
supervised learning.

• In this setting, a model is trained for a specialized task for which the data is
carefully labelled.
• Bounding boxes for localization
• Semantic maps for semantic segmentation, etc.

• Practically speaking, it’s impossible to label everything in the world.

• Unfortunately, this a limits how far the field of AI can go with supervised
learning alone.
Basics of Self-Supervised
Learning
• Data provides the supervision directly.

• In general, perturb the data and task a network to predict it back.

• We often may solve a proxy task using the network forcing it to learn meaningful
semantics that can be used in downstream tasks.

• The proxy tasks are also often of great research importance in standalone form.

• Let us check some image level self-supervised learning tasks.

An Example: Image Colorization

• Train a neural network to predict colours from a grayscale image.

• The network needs to inherently learn semantic boundaries present in the image, for
example the shape of the foreground (dog), the background type etc.

• This semantic knowledge can be exploited in downstream tasks like semantic segmentation, a
task which now needs a human to annotate every pixel present in an image.
An Example: Image Inpainting

• Remove a particular area of an image randomly and ask a NN to predict it back.

• The network requires to understand the structure of objects present in the image to inpaint
the required region
An Example: Image Super-resolution

● Predicting a higher resolution image from a lower input.

● We will be showing a code walk through for this particular topic in today’s session.
How are the networks trained?

Perturbed Inputs Ground Truths

Convolutional Layers

Corpus of unlabelled images

Similar approach in other ﬁelds:
• The BERT language model is also trained
on a similar concept.

• Instead of image patch, we mask a word

from a random sentence.

• A transformer based network is tasked

to predict the masked word back
learning rich semantics present in
language.

• Wav2Vec also follows a similar principal

for speech.

Image credit: https://towardsdatascience.com/bert-explained-state-of-the-art-language-model-for-nlp-f8b21a9b6270

Audio-Visual Self Supervised Learning

y=1
Chosen pair
is in Sync

Input Frames Cosine

Binary
(only the lower half) Similarity cross-entropy loss

y=0
Mel-spectrogram Chosen pair
is out of Sync

(In Sync) (Out of Sync)

Let us now go through a SR code:
• Please go to this repository: https://github.com/Rudrabha/SS2021-19-08-2021

• Open this notebook for the code walk through:

https://github.com/Rudrabha/SS2021-19-08-2021/blob/main/Image_Super_Resolve_Tutorial.ipynb

• There are other two notebooks containing codes for Image inpainting and Image Colorization.

• Please note that these codes are for basic introduction and not meant for State-of-the-art uses in
any of these problems. However, the building blocks of the network can be used to train much
more complex networks.

• Please check Prof. Andrew Zisserman’s slides:

https://project.inria.fr/paiss/files/2018/07/zisserman-self-supervised.pdf for more insights.
Thank You!

Data2vec: A General Framework For Self-Supervised Learning in Speech, Vision & Language
Document20 pages
Data2vec: A General Framework For Self-Supervised Learning in Speech, Vision & Language
Soumyadipto Banerjee
No ratings yet
Electronic Service Tool
Document199 pages
Electronic Service Tool
Ezequiel Zeta
No ratings yet
nn4nlp 6.9.2022
Document63 pages
nn4nlp 6.9.2022
muhammad usman Haider
No ratings yet
Pretraining Part1 16 Mar 23 PDF
Document32 pages
Pretraining Part1 16 Mar 23 PDF
arpan singh
No ratings yet
Pretraining Part2 17 Mar 23 PDF
Document38 pages
Pretraining Part2 17 Mar 23 PDF
arpan singh
No ratings yet
Lec 02
Document33 pages
Lec 02
Ghhanali Singh
No ratings yet
Shirani MehrH PDF
Document8 pages
Shirani MehrH PDF
chirag801
No ratings yet
Deep Learning: Hoàng Huy Minh Hoàng Thảo Lan Chi Phạm Huy Thiên Phúc Trương Huỳnh Đăng Khoa
Document25 pages
Deep Learning: Hoàng Huy Minh Hoàng Thảo Lan Chi Phạm Huy Thiên Phúc Trương Huỳnh Đăng Khoa
sandy milk min
No ratings yet
Building A Neural Machine Translation System From Scratch: Deep Learning World 2019, Munich
Document62 pages
Building A Neural Machine Translation System From Scratch: Deep Learning World 2019, Munich
Tới Trần
No ratings yet
Session 8
Document24 pages
Session 8
arash.hasanpour
No ratings yet
Chap 7.1 Sequence Analysis Using FFN
Document47 pages
Chap 7.1 Sequence Analysis Using FFN
HRITWIK GHOSH
No ratings yet
Experiment 9: Aim: Theory
Document4 pages
Experiment 9: Aim: Theory
Varun Vora
No ratings yet
CH 5
Document16 pages
CH 5
21dce106
No ratings yet
The Art of Programming
Document21 pages
The Art of Programming
Noor Muhammad
No ratings yet
Poster
Document1 page
Poster
Muhammad Waris Hargan
No ratings yet
XAI (v7)
Document40 pages
XAI (v7)
applead
No ratings yet
Deep Learning in C# - Understanding Neural Network Architecture - CodeProject
Document4 pages
Deep Learning in C# - Understanding Neural Network Architecture - CodeProject
Fahd Ahmed
No ratings yet
Natural Language Processing (NLP) - Module 3
Document76 pages
Natural Language Processing (NLP) - Module 3
ENG19CS0357 Vedha Murthy N L
No ratings yet
Transformer Part3 16 Mar 23 PDF
Document59 pages
Transformer Part3 16 Mar 23 PDF
arpan singh
No ratings yet
495 Lecture 11 BERT
Document31 pages
495 Lecture 11 BERT
Mohibur Nabil
No ratings yet
Pos Tagging TBL 4up
Document5 pages
Pos Tagging TBL 4up
girish reddy
No ratings yet
Transfer (v3)
Document38 pages
Transfer (v3)
daredevilcho
No ratings yet
2019 CVPR Paper Overview: Sualab Ho Seong Lee
Document30 pages
2019 CVPR Paper Overview: Sualab Ho Seong Lee
Ramanarayan
No ratings yet
Graph+Embeddings 3
Document35 pages
Graph+Embeddings 3
Kartikey Verma
No ratings yet
Presentation1 (Autosaved) (Autosaved)
Document20 pages
Presentation1 (Autosaved) (Autosaved)
Rakshith
No ratings yet
NLP Nanodegree Syllabus
Document11 pages
NLP Nanodegree Syllabus
balthazar shrestha
No ratings yet
01 Introduction Annotated - PPTX 1
Document63 pages
01 Introduction Annotated - PPTX 1
rikewon446
No ratings yet
Word Prediction Using Trie Data Structure: J Component Project Report
Document81 pages
Word Prediction Using Trie Data Structure: J Component Project Report
SPANDAN KUMAR
No ratings yet
Does The Brain Do Inverse Graphics?
Document49 pages
Does The Brain Do Inverse Graphics?
Made Artha
No ratings yet
Convolutional Neural Networks For Hand-Written Digit Recognition
Document30 pages
Convolutional Neural Networks For Hand-Written Digit Recognition
Xuân Xuânn
No ratings yet
Deep Learning Lecture 0 Introduction Alexander Tkachenko
Document31 pages
Deep Learning Lecture 0 Introduction Alexander Tkachenko
Mahmood Kohansal
No ratings yet
Deep Learning
Document58 pages
Deep Learning
Sirish Venkatagiri
100% (5)
Part4 22
Document65 pages
Part4 22
kylefan15
No ratings yet
Natural Language Processing: Chatbot Sequence To Sequence Learning
Document44 pages
Natural Language Processing: Chatbot Sequence To Sequence Learning
Nitish
No ratings yet
Neural Networks For Machine Learning: Lecture 16a Learning A Joint Model of Images and Captions
Document19 pages
Neural Networks For Machine Learning: Lecture 16a Learning A Joint Model of Images and Captions
Reshma Khemchandani
No ratings yet
495 Lecture 13 Trans Decoder
Document21 pages
495 Lecture 13 Trans Decoder
Mohibur Nabil
No ratings yet
Deep Belief Nets: 2007 NIPS Tutorial On
Document100 pages
Deep Belief Nets: 2007 NIPS Tutorial On
Firomsa Tesfaye
No ratings yet
Understanding: Chapter 14: Rich & Knight Dr. Suthikshn Kumar
Document17 pages
Understanding: Chapter 14: Rich & Knight Dr. Suthikshn Kumar
Shubham Saundarya
No ratings yet
Compiled Lesson 1 - Introduction To JAVA Programming
Document12 pages
Compiled Lesson 1 - Introduction To JAVA Programming
Ian Jansen Fernandez Tibayan
No ratings yet
Deep Learning and Applications: Pham The Bao Ptbao@sgu - Edu.vn
Document43 pages
Deep Learning and Applications: Pham The Bao Ptbao@sgu - Edu.vn
Xuân Xuânn
No ratings yet
(CS3244) Final Poster Group 19
Document1 page
(CS3244) Final Poster Group 19
Anonymous qNZVXg
No ratings yet
Desh Raj: Center For Language and Speech Processing Draj@cs - Jhu.edu
Document1 page
Desh Raj: Center For Language and Speech Processing Draj@cs - Jhu.edu
santoso elektro
No ratings yet
Gupta Et Al 2019 - Character-Based NMT With Transformer
Document16 pages
Gupta Et Al 2019 - Character-Based NMT With Transformer
mcccccc
No ratings yet
Contact Us: Spoken Tutorial
Document2 pages
Contact Us: Spoken Tutorial
Datta Yallapu
No ratings yet
Deep Learning Nanodegree Syllabus 8-15
Document11 pages
Deep Learning Nanodegree Syllabus 8-15
SANTIAGO PUJOL ZABALA
No ratings yet
Geoffrey Hinton With Nitish Srivastava Kevin Swersky: Neural Networks For Machine Learning
Document32 pages
Geoffrey Hinton With Nitish Srivastava Kevin Swersky: Neural Networks For Machine Learning
Mohamed Mostafa
No ratings yet
He Masked Autoencoders Are Scalable Vision Learners CVPR 2022 Paper
Document10 pages
He Masked Autoencoders Are Scalable Vision Learners CVPR 2022 Paper
Johnathan Xie
No ratings yet
Deep Visual-Semantic Alignments For Generating Image Descriptions
Document14 pages
Deep Visual-Semantic Alignments For Generating Image Descriptions
RAGURAM SHANMUGAM
No ratings yet
cs224n 2023 Lecture9 Pretraining
Document54 pages
cs224n 2023 Lecture9 Pretraining
Cassin Thangam
No ratings yet
Generatives Neural Networks
Document31 pages
Generatives Neural Networks
Devyansh Gupta
No ratings yet
Between Words and Characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Document27 pages
Between Words and Characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Vipul Chauhan
No ratings yet
Paper SimAN
Document10 pages
Paper SimAN
Kazi Istiak Uddin Toriqe
No ratings yet
Texture Synthesis by Non-Parametric Sampling
Document32 pages
Texture Synthesis by Non-Parametric Sampling
Alessio
No ratings yet
Switchprompt: Learning Domain-Specific Gated Soft Prompts For Classification in Low-Resource Domains
Document7 pages
Switchprompt: Learning Domain-Specific Gated Soft Prompts For Classification in Low-Resource Domains
nhat.nhat.v.pham
No ratings yet
Concurrent Programming in Java Design Principles A
Document216 pages
Concurrent Programming in Java Design Principles A
jyowan
No ratings yet
Java 8: Selected Updates
Document31 pages
Java 8: Selected Updates
idkwhii
No ratings yet
Capstone Meeting #2 - Literature Review Notes
Document4 pages
Capstone Meeting #2 - Literature Review Notes
adityaemmanuel1313
No ratings yet
Geoffrey Hinton With Nitish Srivastava Kevin Swersky: Neural Networks For Machine Learning
Document30 pages
Geoffrey Hinton With Nitish Srivastava Kevin Swersky: Neural Networks For Machine Learning
Mohammad Alzyoud
No ratings yet
Using A Hierarchy of Domain Specific Languages in Complex Software Systems Design
Document8 pages
Using A Hierarchy of Domain Specific Languages in Complex Software Systems Design
Daniel Ajoy
No ratings yet
Transfer Learning for Natural Language Processing
From Everand
Transfer Learning for Natural Language Processing
Paul Azunre
No ratings yet
Convolutional Neural Networks with Swift for Tensorflow: Image Recognition and Dataset Categorization
From Everand
Convolutional Neural Networks with Swift for Tensorflow: Image Recognition and Dataset Categorization
Brett Koonce
No ratings yet
Microprocessor and Assembly Language Programming 402 PDF
Document38 pages
Microprocessor and Assembly Language Programming 402 PDF
Gurram Sunitha
83% (6)
Oracle Applications Cloud: Environment Resizing
Document6 pages
Oracle Applications Cloud: Environment Resizing
juliovh71
No ratings yet
Expense Tracker Project
Document15 pages
Expense Tracker Project
hardik patil
No ratings yet
Smart Home
Document7 pages
Smart Home
Wentika Putri
100% (1)
Cleo
Document7 pages
Cleo
lolbilibili233lmao
No ratings yet
V600 Serial Interface+Operation Manual
Document124 pages
V600 Serial Interface+Operation Manual
Phuoc Pham
No ratings yet
PDU Geist
Document66 pages
PDU Geist
james gonzales
No ratings yet
Agreement For Web Site Development and Ecommerce Services
Document14 pages
Agreement For Web Site Development and Ecommerce Services
Shahnur Alam
No ratings yet
Final Report SQL Injection
Document24 pages
Final Report SQL Injection
KhalilGhenimi
No ratings yet
Abdul Ghani Shahzaib CV 2023 v1.1
Document4 pages
Abdul Ghani Shahzaib CV 2023 v1.1
San Lizas Airen
No ratings yet
Smart Robotic Arm Using: Page 1 of 9
Document9 pages
Smart Robotic Arm Using: Page 1 of 9
Akhilesh Kumar
No ratings yet
Sotero 06activity MIS
Document2 pages
Sotero 06activity MIS
bernadette sotero
No ratings yet
0RECVALSTCK
Document3 pages
0RECVALSTCK
NITISCHO000
No ratings yet
Instructions To Candidates
Document17 pages
Instructions To Candidates
sivakumar67538
No ratings yet
Project Review - Final B187
Document15 pages
Project Review - Final B187
Indigo Virat
No ratings yet
Fortigate 1800f Series
Document7 pages
Fortigate 1800f Series
Azza Amara
No ratings yet
Azmat Updated Resume 0ct-2019
Document3 pages
Azmat Updated Resume 0ct-2019
Syed Azmat Bukhari
No ratings yet
Am51 Webseal Guide
Document540 pages
Am51 Webseal Guide
im.aliraza
No ratings yet
Ecommerce Website
Document14 pages
Ecommerce Website
Vipin Kumar
100% (1)
E-Book Research Paper
Document6 pages
E-Book Research Paper
Aks
No ratings yet
Info Tool MATSHITA DVD-RAM UJ8C2
Document135 pages
Info Tool MATSHITA DVD-RAM UJ8C2
Alfredo Tamayo
No ratings yet
Log Cat 1697818094709
Document11 pages
Log Cat 1697818094709
alfhansyahisa28
No ratings yet
Vlsi Design Slides
Document77 pages
Vlsi Design Slides
migad
No ratings yet
Simatic Simatic S7 App V1.0
Document32 pages
Simatic Simatic S7 App V1.0
franojeda
No ratings yet
DBMS Notes
Document43 pages
DBMS Notes
jitendrabudholiya620
No ratings yet
Matlab Is A Programming Language Developed by Mathworks. It Started Out As A Matrix Programming Language Where Linear Algebra Programming Was Simple
Document29 pages
Matlab Is A Programming Language Developed by Mathworks. It Started Out As A Matrix Programming Language Where Linear Algebra Programming Was Simple
Lakshmi malladi
No ratings yet
Checkpoint Firewall OT ICS Solution For CI and Industrial
Document46 pages
Checkpoint Firewall OT ICS Solution For CI and Industrial
Dea Josh Farro
No ratings yet
Create Append Structure in SAP Table
Document3 pages
Create Append Structure in SAP Table
Kabil Rocky
100% (2)
Mapa Modbus PLC DSE
Document132 pages
Mapa Modbus PLC DSE
Luigi Euan
No ratings yet