Short Summary
The goal of this paper is to reason about temporal dependencies in video frames at multiple time scales.
The authors propose the Temporal Relation Network (TRN) as a means to improve on historical baselines that
excel at action recognition but cannot reason about transformations between frames. The module is
designed to be a plug-and-play component that can be inserted into any CNN architecture.
The architecture involves using different frame relation modules that analyze n-frame relations by
sampling from the video. For simplicity, Inception pre-trained on ImageNet is used as the base model
and tests are performed with and without the TRN module. The model can capture the essence of an
activity given only a few frames from an action. Using a larger number of frames generally improves
accuracy, and ordered frames perform significantly better than shuffled ones, suggesting that
temporal order plays an important role.
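The n-frame relation modules described above can be sketched as follows. This is a minimal illustrative version, not the authors' implementation: it assumes per-frame features have already been extracted by the CNN backbone, and it replaces the paper's small MLPs (g_theta and h_phi) with single random linear maps. All names and dimensions here are made up for the example.

```python
# Hedged sketch of a 2-frame temporal relation module.
# Assumes per-frame CNN features are given; g_theta/h_phi are stand-in linear maps.
import itertools
import numpy as np

FEAT_DIM = 8      # per-frame feature size (tiny, for illustration only)
N_CLASSES = 4     # number of action classes (illustrative)
rng = np.random.default_rng(0)

W_g = rng.standard_normal((2 * FEAT_DIM, 16))   # pairwise relation embedding
W_h = rng.standard_normal((16, N_CLASSES))      # pooled relation -> class scores

def two_frame_relation(features):
    """Sum a learned function g over ordered frame pairs (i < j),
    then map the pooled relation vector to class scores with h."""
    pooled = np.zeros(16)
    for i, j in itertools.combinations(range(len(features)), 2):
        pair = np.concatenate([features[i], features[j]])  # order preserved
        pooled += np.maximum(pair @ W_g, 0.0)              # ReLU(g_theta)
    return pooled @ W_h

frames = rng.standard_normal((5, FEAT_DIM))  # features of 5 sampled frames
scores = two_frame_relation(frames)
print(scores.shape)  # (N_CLASSES,)
```

Because each pair is concatenated in temporal order, reversing the frame sequence changes the output, which mirrors the paper's observation that shuffled frames hurt accuracy. The full multi-scale TRN sums such modules over several values of n.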
The model achieves state-of-the-art results on the Something-Something dataset for human-interaction
recognition (33.60% top-1 accuracy) and the Jester dataset for hand gesture recognition (94.78% top-1
accuracy). The model also achieves 25.2 mAP on the Charades dataset for daily activity recognition,
exceeding the previous state of the art.
previous state of the art. A t-SNE visualization shows that the model is learning to separate actions into
distinct regions in space; furthermore, similar actions are closer together suggesting that the model is
learning the meaning behind these actions to some degree.
Main Contributions
1. Propose a Temporal Relation Network (TRN) to learn temporal dependencies between video
frames at multiple time scales.
2. Evaluate on Something-Something, Jester, and Charades, where previous works perform poorly
due to a lack of temporal reasoning.
3. Visualize what the model is learning to show that it is in fact learning temporal relations.
The result tables give a clear sense that the proposed model achieves state-of-the-art performance
and show by what margin it exceeds the baselines. The charts in particular are also helpful for
understanding the impact of the number of frames and their ordering on recognition accuracy.