
ML4NLU Page 1 Matr.Nr.

Machine Learning for Natural Language Understanding


Exercise WS 2023/2024
Mandatory Exercise, Due: 04.02.2024
(Upload solutions via Moodle test)
Good luck!

Multiple Choice 10 Points

Are the following statements true or false? (Mark each statement True or False.)

1. The sigmoid function hθ(x) is smooth and symmetric at x = 0.
2. BERT is a language model based on RNNs.
3. A Multilayer Perceptron encodes a simple linear discriminant function.
4. Every continuous function can be approximated arbitrarily closely by a multi-layer Artificial Neural Network.
5. GPT-4 is known for its language generation abilities.
6. RNNs capture long-term dependencies.
7. Hyperparameters should be tuned on the validation set.
8. An overfitted model performs well on unknown data.
9. The classification of unbalanced data is measured best with error and accuracy.
10. The harmonic mean of precision and recall is called the F-measure.

Aspects of Machine Learning Models 10 Points

1) Use pseudocode to fill in the steps (1 to 4) in such a way that the model goes through the
training process. Stopping criteria can be ignored. Assign and reuse variables if needed. (5 Points)
Algorithm 1: Generic machine learning model training
input : batches = {samples, targets}, learning rate = λ, parameters = Θ, loss = MSE,
model = NN
init model(parameters);
for batch in batches do
1
2
3
4
end
return model;
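Steps 1 to 4 typically correspond to forward pass, loss computation, gradient computation, and parameter update. A minimal runnable sketch of that loop, assuming a tiny linear model with hand-derived MSE gradients (the model, learning rate, and epoch count are illustrative choices, not the exercise's expected answer):

```python
# Generic training loop: forward pass, loss, gradients, parameter update.
# The linear model y = w*x + b and its MSE gradients are illustrative assumptions.

def train(batches, lr=0.05, epochs=500):
    w, b = 0.0, 0.0                                       # init model(parameters)
    for _ in range(epochs):
        for samples, targets in batches:                  # for batch in batches
            preds = [w * x + b for x in samples]          # 1: forward pass
            errs = [p - t for p, t in zip(preds, targets)]
            mse = sum(e * e for e in errs) / len(errs)    # 2: compute loss (MSE)
            grad_w = 2 * sum(e * x for e, x in zip(errs, samples)) / len(errs)  # 3: gradients
            grad_b = 2 * sum(errs) / len(errs)
            w -= lr * grad_w                              # 4: update parameters
            b -= lr * grad_b
    return w, b

# Usage: recover y = 2x + 1 from a single batch of four samples.
w, b = train([([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])])
```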

2) The following table contains predicted values from a simplified linear model (Ŷi = β₀ + xi )
and their true (i.e. expected) counterparts. Show the calculation of the MSE for this model! How
should β₀ be updated to minimize the MSE? (5 Points)

Predicted Expected
2 1
3 2
5 4
8 7
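The MSE definition can be evaluated directly on the table's values; this sketch only mechanizes the arithmetic (the intercept shift in the last line is a numerical check, not the graded derivation):

```python
# Evaluate MSE = (1/n) * sum((predicted - expected)^2) on the table above.
preds = [2, 3, 5, 8]
expected = [1, 2, 4, 7]
mse = sum((p, ) and (p - t) ** 2 for p, t in zip(preds, expected)) / len(preds)

# Every prediction is exactly 1 too high, so shifting the intercept down by 1
# should drive the MSE to zero:
mse_shifted = sum((p - 1 - t) ** 2 for p, t in zip(preds, expected)) / len(preds)
```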

Neural Networks I 10 Points

1) Which types of neural networks do you know and for which tasks are they typically used? (2
Points)

2) Explain what distinguishes a Long Short-Term Memory model (LSTM) from a conventional
Recurrent Neural Network (RNN). (3 Points)

3) Name at least three NLP tasks for which an LSTM is suitable! (3 Points)

4) Describe how Transformers handle sequential information. (2 Points)



Neural Networks II 10 Points

1) Explain the terms overfitting and underfitting! When can they each occur? (2 Points)

2) Explain the differences between parameters and hyperparameters in a machine learning model.
(3 Points)

3) Take a look at the following Neural Network:

[Figure: a feed-forward network with an input layer (x0, x1), a hidden layer of two neurons
(biases b = −2 and b = 0.5), and an output neuron (bias b = −3); edge weights as drawn in the
original figure.]
Show that the network correctly classifies the following data. Assume sgn as the activation function. (5 Points)

sgn(x) := +1, if x > 0; −1, if x ≤ 0

x0   x1   Class
 2    1    1
-1    2    1
-3    2   -1
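The check can be mechanized by implementing sgn and the forward pass. The helper names below are ours, and the weights passed in are placeholders that must be replaced by the values read off the figure:

```python
def sgn(x):
    # +1 if x > 0, -1 if x <= 0, as defined in the exercise.
    return 1 if x > 0 else -1

def forward(x0, x1, w_hidden, b_hidden, w_out, b_out):
    # One hidden layer with sgn activations, then a single sgn output neuron.
    # w_hidden: list of (w0, w1) per hidden neuron; b_hidden: matching biases.
    hidden = [sgn(w0 * x0 + w1 * x1 + b)
              for (w0, w1), b in zip(w_hidden, b_hidden)]
    return sgn(sum(w * h for w, h in zip(w_out, hidden)) + b_out)

# Usage with placeholder weights (replace with the figure's values), checking
# one row of the table:
pred = forward(2, 1, [(1, 0), (0, 1)], [0, 0], [1, 1], -1)
```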

Language Models 10 Points

1) Name and describe a task usually used for the pretraining of language models (e.g. BERT). (2
Points)

2) What are positional embeddings and why are they used in the context of Transformer models? (2 Points)
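One common instantiation is the fixed sinusoidal encoding from the original Transformer architecture; a minimal sketch, using only the standard library (the function name and arguments are our own):

```python
import math

def positional_encoding(seq_len, d_model):
    # Sinusoidal positional encodings:
    #   PE[pos, 2i]   = sin(pos / 10000^(2i / d_model))
    #   PE[pos, 2i+1] = cos(pos / 10000^(2i / d_model))
    # Each position gets a unique pattern the model can use to infer order.
    pe = [[0.0] * d_model for _ in range(seq_len)]
    for pos in range(seq_len):
        for i in range(0, d_model, 2):
            angle = pos / (10000 ** (i / d_model))
            pe[pos][i] = math.sin(angle)
            if i + 1 < d_model:
                pe[pos][i + 1] = math.cos(angle)
    return pe
```

These vectors are added to the token embeddings before the first attention layer, which is why self-attention, itself order-invariant, can still distinguish "dog bites man" from "man bites dog".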

3) Name at least four downstream tasks at token or text level and briefly explain them. (4 Points)

4) Discuss where even the largest language models reach their limits! (2 Points)

Benchmarks 10 Points

G(y, n) := [(y1 , . . . , yn ), (y2 , . . . , yn+1 ), . . . , (y|y|−n+1 , . . . , y|y| )]   (1)

C(g, y, n) := |[ g′ | g′ ∈ G(y, n), g′ = g ]|   (2)

P (ŷ, y, n) := ( Σ_{g ∈ G(ŷ,n)} min(C(g, ŷ, n), C(g, y, n)) ) / ( Σ_{g ∈ G(ŷ,n)} C(g, ŷ, n) )   (3)

ŷ = [a cat is on the mat]


y = [a dog is on the couch]

1) Calculate the uni-/bi-grams for G(ŷ, 1), G(y, 1), G(ŷ, 2), G(y, 2). (4 Points)

2) Calculate the uni-/bi-gram-precision for P (ŷ, y, 1) and P (ŷ, y, 2). (6 Points)
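The definitions in (1) to (3) translate directly into code; this sketch (function names are ours) computes the n-gram lists and the clipped n-gram precision so the hand calculation can be cross-checked:

```python
from collections import Counter

def G(y, n):
    # Eq. (1): all n-grams of y, in order.
    return [tuple(y[i:i + n]) for i in range(len(y) - n + 1)]

def precision(y_hat, y, n):
    # Eq. (3): clipped n-gram precision. Each candidate n-gram counts at most
    # as often as it occurs in the reference (Eq. (2) gives the counts).
    c_hat, c_ref = Counter(G(y_hat, n)), Counter(G(y, n))
    overlap = sum(min(c, c_ref[g]) for g, c in c_hat.items())
    return overlap / sum(c_hat.values())

y_hat = "a cat is on the mat".split()
y = "a dog is on the couch".split()
```

Applying `G` and `precision` to ŷ and y above reproduces the uni- and bi-gram lists asked for in 1) and the two precision values asked for in 2).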
