0% found this document useful (0 votes)

71 views6 pages

Deep Learning Important Studies

The document lists significant architecture papers related to various aspects of deep learning, including image classification, object detection, image captioning, and image generation. It provides references to key papers along with their authors, publication details, and links to code where applicable. Additionally, it includes resources for online courses and books on deep learning.

Uploaded by

vmahajanbe22

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

71 views6 pages

Deep Learning Important Studies

Uploaded by

vmahajanbe22

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Famous Architecture papers with Code

Image Classification:
Network in Network [Paper] [Note] [Torch Code]

 Lin, Min, Qiang Chen, and Shuicheng Yan. "Network in network." arXiv preprint
arXiv:1312.4400 (2013).

VGG [Paper] [Note] [Torch Code]

 Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for
large-scale image recognition." arXiv preprint arXiv:1409.1556 (2014).

GoogleNet [Paper] [Note] [Torch Code]

 Szegedy, Christian, et al. "Going deeper with convolutions." Proceedings of the

IEEE Conference on Computer Vision and Pattern Recognition. 2015.

ResNet [Paper] [Note] [Torch Code]

 He, Kaiming, et al. "Deep residual learning for image recognition." Proceedings of
the IEEE Conference on Computer Vision and Pattern Recognition. 2016.

Popular Module
Dropout [Paper] [Note]

 Srivastava, Nitish, et al. "Dropout: a simple way to prevent neural networks from
overfitting." Journal of Machine Learning Research 15.1 (2014): 1929-1958.

Batch Normalization [Paper] [Note]

 Ioffe S, Szegedy C. Batch normalization: Accelerating deep network training by

reducing internal covariate shift[J]. arXiv preprint arXiv:1502.03167, 2015.

Object Detection in Image:

RCNN [Paper] [Note] [Code]
 Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik, Rich feature hierarchies
for accurate object detection and semantic segmentation

Spatial pyramid pooling in deep convolutional networks for visual recognition [[Paper]]
([Link] [Note] [Code]

 He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional

networks for visual recognition[J]. Pattern Analysis and Machine Intelligence, IEEE
Transactions on, 2015, 37(9): 1904-1916.

Fast R-CNN [[Paper]] ([Link] [Note] [Code]

 Ross Girshick, Fast R-CNN, arXiv:1504.08083.

Faster R-CNN, Microsoft Research [[Paper]]

([Link] [Note] [Code] [Python Code]

 Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun, Faster R-CNN: Towards Real-
Time Object Detection with Region Proposal Networks, arXiv:1506.01497.

End-to-end people detection in crowded scenes [[Paper]]

([Link] [Note] [Code]

 Russell Stewart, Mykhaylo Andriluka, End-to-end people detection in crowded

scenes, arXiv:1506.04878.

You Only Look Once: Unified, Real-Time Object Detection [[Paper]]

([Link] [Note] [Code]

 Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi, You Only Look Once:
Unified, Real-Time Object Detection, arXiv:1506.02640

Adaptive Object Detection Using Adjacency and Zoom Prediction [[Paper]]

([Link] [Note]

 Lu Y, Javidi T, Lazebnik S. Adaptive Object Detection Using Adjacency and Zoom

Prediction[J]. arXiv:1512.07711, 2015.

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent
Neural Networks [Paper] [Note]

 Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick. arXiv:1512.04143, 2015.
G-CNN: an Iterative Grid Based Object Detector [Paper]

 Mahyar Najibi, Mohammad Rastegari, Larry S. Davis. arXiv:1512.07729, 2015.

Object Detection in Video:

Seq-NMS for Video Object Detection [Paper] [Note]

 Wei Han, Pooya Khorrami, Tom Le Paine, Prajit Ramachandran, Mohammad

Babaeizadeh, Honghui Shi, Jianan Li, Shuicheng Yan, Thomas S. Huang. Seq-NMS
for Video Object Detection. arXiv preprint arXiv:1602.08465, 2016

Image Caption:
Exploring Nearest Neighbor Approaches for Image Captioning [Paper]

 Devlin J, Gupta S, Girshick R, et al. Exploring Nearest Neighbor Approaches for

Image Captioning[J]. arXiv preprint arXiv:1505.04467, 2015.

Show and Tell: A Neural Image Caption Generator [Paper] [Note]

 Vinyals, Oriol, et al. "Show and tell: A neural image caption generator."
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
2015.

Image Generations:
Pixel Recurrent Neural Networks [Paper] [Note]

 van den Oord A, Kalchbrenner N, Kavukcuoglu K. Pixel Recurrent Neural

Networks[J]. arXiv preprint arXiv:1601.06759, 2016.

Variational Autoencoder [Paper] [Note]

 Kingma D P, Welling M. Auto-encoding variational bayes[J]. arXiv preprint

arXiv:1312.6114, 2013.

DRAW: A recurrent neural network for image generation [Paper] [Torch

Code] [Tensorflow Code] [Note]
 Gregor K, Danihelka I, Graves A, et al. DRAW: A recurrent neural network for
image generation[J]. arXiv preprint arXiv:1502.04623, 2015.

Scribbler: Controlling Deep Image Synthesis with Sketch and Color [Paper] [Note]

 Patsorn Sangkloy, Jingwan Lu, et al. Scribbler: Controlling Deep Image Synthesis
with Sketch and Color. arXiv preprint arXiv:1612.00835, 2016.

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial

Networks [Paper]

 Radford A, Metz L, Chintala S. Unsupervised representation learning with deep

convolutional generative adversarial networks[J]. arXiv preprint arXiv:1511.06434,
2015.

Improved Techniques for Training GANs [Paper]

 Salimans T, Goodfellow I, Zaremba W, et al. Improved Techniques for Training

GANs[J]. arXiv preprint arXiv:1606.03498, 2016.

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative

Adversarial Nets[Paper]

 Chen X, Duan Y, Houthooft R, et al. InfoGAN: Interpretable Representation

Learning by Information Maximizing Generative Adversarial Nets[J]. arXiv preprint
arXiv:1606.03657, 2016.

Image-to-Image Translation with Conditional Adversarial Networks [Paper] [Note] [Torch

Code] [Tensorflow Code]

 Isola P, Zhu J Y, Zhou T, et al. Image-to-Image Translation with Conditional

Adversarial Networks[J]. arXiv preprint arXiv:1611.07004, 2016.

Learning to Generate Images of Outdoor Scenes from Attributes and Semantic

Layouts [Paper] [Note]

 Levent Karacan, Zeynep Akata, Aykut Erdem, Erkut Erdem. Learning to Generate
Images of Outdoor Scenes from Attributes and Semantic Layouts [J]. arXiv
preprint arXiv:1612.00215, 2016.

Learning to Discover Cross-Domain Relations with Generative Adversarial

Networks [Paper] [Note]
 Kim, Taeksoo, et al. "Learning to Discover Cross-Domain Relations with
Generative Adversarial Networks." arXiv preprint arXiv:1703.05192 (2017).

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial

Networks [Paper] [Note]

 Zhu J Y, Park T, Isola P, et al. Unpaired Image-to-Image Translation using Cycle-

Consistent Adversarial Networks[J]. arXiv preprint arXiv:1703.10593, 2017.

BEGAN: Boundary Equilibrium Generative Adversarial Networks [Paper] [Note]

 Berthelot, David, Tom Schumm, and Luke Metz. "BEGAN: Boundary Equilibrium
Generative Adversarial Networks." arXiv preprint arXiv:1703.10717 (2017).

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial

Networks [Paper] [Note] [Tensorflow Code]

 Zhang, Han, et al. "StackGAN: Text to Photo-realistic Image Synthesis with

Stacked Generative Adversarial Networks." arXiv preprint arXiv:1612.03242 (2016).

Image & Language

Learning Deep Representations of Fine-Grained Visual Descriptions [Paper] [Note]

 Reed, Scott, et al. "Learning deep representations of fine-grained visual

descriptions." Proceedings of the IEEE Conference on Computer Vision and
Pattern Recognition. 2016.

Activation Maximization
Synthesizing the preferred inputs for neurons in neural networks via deep generator
networks [Paper] [Note]

 Nguyen A, Dosovitskiy A, Yosinski J, et al. Synthesizing the preferred inputs for

neurons in neural networks via deep generator networks[J]. arXiv preprint
arXiv:1605.09304, 2016.

Style Transfer
A neural algorithm of artistic style [Paper] [Note]
 Gatys L A, Ecker A S, Bethge M. A neural algorithm of artistic style[J]. arXiv
preprint arXiv:1508.06576, 2015.

Perceptual losses for real-time style transfer and super-resolution [Paper] [Note]

 Johnson J, Alahi A, Fei-Fei L. Perceptual losses for real-time style transfer and
super-resolution[J]. arXiv preprint arXiv:1603.08155, 2016.

Super Resolution
Texture Enhancement via High-Resolution Style Transfer for Single-Image Super-
Resolution [Paper] [Note]

 Il Jun Ahn, Woo Hyun Nam. Texture Enhancement via High-Resolution Style
Transfer for Single-Image Super-Resolution [J]. arXiv preprint arXiv:1612.00085,
2016.

Others
Fully convolutional networks for semantic segmentation [Paper] [Note]

 Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic

segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and
Pattern Recognition. 2015: 3431-3440.

Open Courses

 CS231n: Convolutional Neural Networks for Visual Recognition [Course Page]

 CS224d: Deep Learning for Natural Language Processing [Course Page]

Online Books

 Deep Learning by Ian Goodfellow, Yoshua Bengio and Aaron Courville

By : Dr. Mazhar Javed Awan

Deep Learning Resources Guide
No ratings yet
Deep Learning Resources Guide
5 pages
Harsha Thesis
No ratings yet
Harsha Thesis
62 pages
978 0 7503 6244 3.preview
No ratings yet
978 0 7503 6244 3.preview
56 pages
Lec25 Architectures
No ratings yet
Lec25 Architectures
52 pages
Deep Learning in Image Processing Review
No ratings yet
Deep Learning in Image Processing Review
23 pages
Deep Learning Overview and History
No ratings yet
Deep Learning Overview and History
54 pages
GH Deep Learning Awesome
No ratings yet
GH Deep Learning Awesome
8 pages
DL Tutorial NIPS2015 PDF
No ratings yet
DL Tutorial NIPS2015 PDF
133 pages
Pretrained Computer Vision Models Guide
No ratings yet
Pretrained Computer Vision Models Guide
10 pages
Google Research: 3D Vision & Robotics
No ratings yet
Google Research: 3D Vision & Robotics
35 pages
Final Report PDF
No ratings yet
Final Report PDF
69 pages
Advances in Generative Adversarial Networks
No ratings yet
Advances in Generative Adversarial Networks
2 pages
Thuy T. Pham: By, U. of Technology Sydney
No ratings yet
Thuy T. Pham: By, U. of Technology Sydney
5 pages
Research On Learning Representations in Computer Vision
No ratings yet
Research On Learning Representations in Computer Vision
52 pages
Listofpapers1 0
No ratings yet
Listofpapers1 0
8 pages
1 s2.0 S0031320317304120 Main
No ratings yet
1 s2.0 S0031320317304120 Main
24 pages
Neural Scene Representation and Rendering - Related Work
No ratings yet
Neural Scene Representation and Rendering - Related Work
14 pages
Universal Vision-Language Model for Charts
No ratings yet
Universal Vision-Language Model for Charts
10 pages
Bibliography
No ratings yet
Bibliography
37 pages
Urn CH SLSP ZBZ 9781098134181 Ihv PDF
No ratings yet
Urn CH SLSP ZBZ 9781098134181 Ihv PDF
7 pages
BMM 2018 - Deep Learning Tutorial
No ratings yet
BMM 2018 - Deep Learning Tutorial
47 pages
Deep Learning for Visual Experts
No ratings yet
Deep Learning for Visual Experts
58 pages
Computational Intelligence and Neuroscience - 2018 - Voulodimos - Deep Learning For Computer Vision A Brief Review
No ratings yet
Computational Intelligence and Neuroscience - 2018 - Voulodimos - Deep Learning For Computer Vision A Brief Review
13 pages
DL U3 Applications of Deep Learning To Computer Vision: Image Classification Object Detection
No ratings yet
DL U3 Applications of Deep Learning To Computer Vision: Image Classification Object Detection
15 pages
Bascis of AI - Module 2 - Complementary Study Material - 4
No ratings yet
Bascis of AI - Module 2 - Complementary Study Material - 4
4 pages
Recent Advances in Deep Learning For Object Detection
No ratings yet
Recent Advances in Deep Learning For Object Detection
26 pages
3D Generative Models A Survey
No ratings yet
3D Generative Models A Survey
21 pages
30 Amazing Machine Learning Projects For The Past Year (v.2018)
No ratings yet
30 Amazing Machine Learning Projects For The Past Year (v.2018)
22 pages
Deep Residual Learning References
No ratings yet
Deep Residual Learning References
9 pages
Deep Learning in Computer Vision Overview
No ratings yet
Deep Learning in Computer Vision Overview
26 pages
CSE Deep Learning Seminar Report
No ratings yet
CSE Deep Learning Seminar Report
4 pages
Deep Learning Models Overview
No ratings yet
Deep Learning Models Overview
66 pages
CVPR 2019 Paper Summaries and Stats
No ratings yet
CVPR 2019 Paper Summaries and Stats
30 pages
Lecture 19
No ratings yet
Lecture 19
19 pages
Image Classification and Detection Challenges
No ratings yet
Image Classification and Detection Challenges
86 pages
L10-DL Intro
No ratings yet
L10-DL Intro
25 pages
8 Modern Convolutional Neural Networks: Et Al. Et Al. Et Al
No ratings yet
8 Modern Convolutional Neural Networks: Et Al. Et Al. Et Al
57 pages
Advances in Image Recognition Models
No ratings yet
Advances in Image Recognition Models
5 pages
Image Captioning with CNN and RNN
No ratings yet
Image Captioning with CNN and RNN
4 pages
Sensors: HEMIGEN: Human Embryo Image Generator Based On Generative Adversarial Networks
No ratings yet
Sensors: HEMIGEN: Human Embryo Image Generator Based On Generative Adversarial Networks
16 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
202 pages
GANs for Image Synthesis and Editing Survey
No ratings yet
GANs for Image Synthesis and Editing Survey
15 pages
Advances in Image Colorization Techniques
No ratings yet
Advances in Image Colorization Techniques
2 pages
SSRN Id3354412
No ratings yet
SSRN Id3354412
8 pages
Kim2019 Article LatentTransformationsNeuralNet
No ratings yet
Kim2019 Article LatentTransformationsNeuralNet
15 pages
Generating Caption From Images Using Flickr Image Dataset
No ratings yet
Generating Caption From Images Using Flickr Image Dataset
7 pages
Recent Advances in Convolutional Neural Networks-2018
100% (1)
Recent Advances in Convolutional Neural Networks-2018
42 pages
3D Object Learning from Images
No ratings yet
3D Object Learning from Images
169 pages
Generative Adversarial Networks and Deep Learning Theory and Applications 9781032068107 - 20230320 - 112232 PDF
No ratings yet
Generative Adversarial Networks and Deep Learning Theory and Applications 9781032068107 - 20230320 - 112232 PDF
223 pages
Insect Detection Using CNN and Drones
No ratings yet
Insect Detection Using CNN and Drones
8 pages
CNN Architectures in Computer Vision
No ratings yet
CNN Architectures in Computer Vision
8 pages
Repurposing Gans For One-Shot Semantic Part Segmentation
No ratings yet
Repurposing Gans For One-Shot Semantic Part Segmentation
14 pages
YOLOv4 vs Detectron2 Overview
No ratings yet
YOLOv4 vs Detectron2 Overview
16 pages
Unsupervised Learning with DCGANs
No ratings yet
Unsupervised Learning with DCGANs
15 pages
ECCV 2020 Paper Digests
No ratings yet
ECCV 2020 Paper Digests
138 pages
Explainable GANs for Time Series Generation
No ratings yet
Explainable GANs for Time Series Generation
85 pages
Trabajos - Articulos - Total
No ratings yet
Trabajos - Articulos - Total
1 page
Perceptron and Backpropagation
No ratings yet
Perceptron and Backpropagation
17 pages
Bagging vs Pasting in Machine Learning
100% (1)
Bagging vs Pasting in Machine Learning
21 pages
9e Aditya Pandey 2234
No ratings yet
9e Aditya Pandey 2234
12 pages
Machine Learning
No ratings yet
Machine Learning
10 pages
Kmean Clustering
No ratings yet
Kmean Clustering
10 pages
ML Quiz Answers for Learners
No ratings yet
ML Quiz Answers for Learners
34 pages
LLM Architectures Explained - RNNS, LSTMs & GRUs (Part 3) - by Vipra Singh - Sep, 2024 - Medium
No ratings yet
LLM Architectures Explained - RNNS, LSTMs & GRUs (Part 3) - by Vipra Singh - Sep, 2024 - Medium
115 pages
Learn Machine Learning in 20 Days
No ratings yet
Learn Machine Learning in 20 Days
23 pages
AI & Machine Learning Lab Course Guide
No ratings yet
AI & Machine Learning Lab Course Guide
6 pages
Deep Learning Exam Questions December 2022
No ratings yet
Deep Learning Exam Questions December 2022
2 pages
ML Group 4 Assignment
No ratings yet
ML Group 4 Assignment
26 pages
KNN VS Kmeans
No ratings yet
KNN VS Kmeans
3 pages
Architecture: Simple Neural Nets For Pattern Classification
No ratings yet
Architecture: Simple Neural Nets For Pattern Classification
15 pages
Programa Ciencia de Datos y Machine Learning Con Python - Feb23
No ratings yet
Programa Ciencia de Datos y Machine Learning Con Python - Feb23
13 pages
An Intrusion Detection System For Imbalanced Dataset Based On Deep Learning
No ratings yet
An Intrusion Detection System For Imbalanced Dataset Based On Deep Learning
10 pages
Machine Learning & Deep Learning Models For Time Series Forecasting
No ratings yet
Machine Learning & Deep Learning Models For Time Series Forecasting
13 pages
Deep Learning - IIT Ropar - Unit 6 - Week 3
No ratings yet
Deep Learning - IIT Ropar - Unit 6 - Week 3
8 pages
30 Day Data Science Tracker
No ratings yet
30 Day Data Science Tracker
1 page
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
132 pages
Neural Network Training in R Package
No ratings yet
Neural Network Training in R Package
15 pages
RNNs and GANs for Researchers
No ratings yet
RNNs and GANs for Researchers
74 pages
DM Practical File
No ratings yet
DM Practical File
21 pages
Understanding Neural Networks and Perceptrons
No ratings yet
Understanding Neural Networks and Perceptrons
6 pages
Neural Network Training Basics
No ratings yet
Neural Network Training Basics
34 pages
Deep Learning for Developers
100% (1)
Deep Learning for Developers
1,029 pages
Implementing ANN in Python
No ratings yet
Implementing ANN in Python
13 pages
10fold Split70
No ratings yet
10fold Split70
5 pages
Neural Networks & Deep Learning 2025
No ratings yet
Neural Networks & Deep Learning 2025
73 pages
Google Net
No ratings yet
Google Net
40 pages
CNN Architectures Workshop
No ratings yet
CNN Architectures Workshop
104 pages

Deep Learning Important Studies

Uploaded by

Deep Learning Important Studies

Uploaded by

Famous Architecture papers with Code

VGG [Paper] [Note] [Torch Code]

GoogleNet [Paper] [Note] [Torch Code]

 Szegedy, Christian, et al. "Going deeper with convolutions." Proceedings of the

ResNet [Paper] [Note] [Torch Code]

Batch Normalization [Paper] [Note]

 Ioffe S, Szegedy C. Batch normalization: Accelerating deep network training by

Object Detection in Image:

 He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional

Fast R-CNN [[Paper]] ([Link] [Note] [Code]

 Ross Girshick, Fast R-CNN, arXiv:1504.08083.

Faster R-CNN, Microsoft Research [[Paper]]

End-to-end people detection in crowded scenes [[Paper]]

 Russell Stewart, Mykhaylo Andriluka, End-to-end people detection in crowded

You Only Look Once: Unified, Real-Time Object Detection [[Paper]]

Adaptive Object Detection Using Adjacency and Zoom Prediction [[Paper]]

 Lu Y, Javidi T, Lazebnik S. Adaptive Object Detection Using Adjacency and Zoom

 Mahyar Najibi, Mohammad Rastegari, Larry S. Davis. arXiv:1512.07729, 2015.

Object Detection in Video:

 Wei Han, Pooya Khorrami, Tom Le Paine, Prajit Ramachandran, Mohammad

 Devlin J, Gupta S, Girshick R, et al. Exploring Nearest Neighbor Approaches for

Show and Tell: A Neural Image Caption Generator [Paper] [Note]

 van den Oord A, Kalchbrenner N, Kavukcuoglu K. Pixel Recurrent Neural

Variational Autoencoder [Paper] [Note]

 Kingma D P, Welling M. Auto-encoding variational bayes[J]. arXiv preprint

DRAW: A recurrent neural network for image generation [Paper] [Torch

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial

 Radford A, Metz L, Chintala S. Unsupervised representation learning with deep

Improved Techniques for Training GANs [Paper]

 Salimans T, Goodfellow I, Zaremba W, et al. Improved Techniques for Training

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative

 Chen X, Duan Y, Houthooft R, et al. InfoGAN: Interpretable Representation

Image-to-Image Translation with Conditional Adversarial Networks [Paper] [Note] [Torch

 Isola P, Zhu J Y, Zhou T, et al. Image-to-Image Translation with Conditional

Learning to Generate Images of Outdoor Scenes from Attributes and Semantic

Learning to Discover Cross-Domain Relations with Generative Adversarial

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial

 Zhu J Y, Park T, Isola P, et al. Unpaired Image-to-Image Translation using Cycle-

BEGAN: Boundary Equilibrium Generative Adversarial Networks [Paper] [Note]

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial

 Zhang, Han, et al. "StackGAN: Text to Photo-realistic Image Synthesis with

Image & Language

 Reed, Scott, et al. "Learning deep representations of fine-grained visual

 Nguyen A, Dosovitskiy A, Yosinski J, et al. Synthesizing the preferred inputs for

 Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic

 CS231n: Convolutional Neural Networks for Visual Recognition [Course Page]

 Deep Learning by Ian Goodfellow, Yoshua Bengio and Aaron Courville

By : Dr. Mazhar Javed Awan

You might also like