
Artificial Intelligence EC6442E Semester 2

Instructor: Anup Aprem. Year: 2023–2024


Assignment 1

Problem 1: Classification
Here are two reviews of Perfect Blue, from Rotten Tomatoes:

Rotten Tomatoes has classified these reviews as “positive” and “negative”, respectively, as indicated by the intact
tomato on the top and the splatter on the bottom. In this problem, you will create a simple text classification system
that can perform this task automatically. We’ll warm up with the following set of four mini-reviews, each labeled
positive (+1) or negative (-1):

1. (-1) not good
2. (-1) pretty bad
3. (+1) good plot
4. (+1) pretty scenery

Each review x is represented by a feature vector ϕ(x), which maps each word to the number of occurrences of that
word in the review. For example, the second review maps to the (sparse) feature vector ϕ(x) = {pretty : 1, bad : 1}.
In this problem, we will use the hinge loss, given by

ℓ(y, y_i) = max(0, 1 − y y_i),    (1)

where y_i is the actual label and y is the predicted value. Assuming a linear hypothesis space, and using the feature
vector ϕ(x_i) as input,

Loss_hinge(x_i, y_i, w) = max{0, 1 − (w · ϕ(x_i)) y_i},

where x_i is the review text, y_i is the correct label, and w is the weight vector.

1. Suppose we run stochastic gradient descent once for each of the 4 samples, in the order given above, updating
the weights according to

w ← w − η ∇_w Loss_hinge(x, y, w).

After the updates, what are the weights of the six words (“pretty”, “good”, “bad”, “plot”, “not”, “scenery”)
that appear in the above reviews? (A sketch of this update loop appears after this list.)
• Use η = 0.1 as the step size.
• Initialize w = [0, 0, 0, 0, 0, 0].

2. Consider the following dataset of reviews, which parts 3 and 4 refer to:

(a) (-1) bad
(b) (+1) good
(c) (+1) not bad
(d) (-1) not good
3. Prove that no linear classifier using word-count features can get zero error on this dataset. Remember
that this is a question about classifiers, not optimization algorithms; your proof should hold for any linear
classifier of the form f_w(x) = sign(w · ϕ(x)), regardless of how the weights are learned. (A numerical
feasibility check is sketched after this list.)
4. Propose a single additional feature with which we could augment the feature vector to fix this problem.
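For part 1, here is a minimal sketch of the single SGD pass described above, assuming a sparse dictionary representation of w and ϕ(x); the phi helper and the variable names are illustrative, not part of the problem statement.

```python
from collections import defaultdict

# The four mini-reviews from the warm-up, with labels +1/-1.
data = [
    ("not good", -1),
    ("pretty bad", -1),
    ("good plot", +1),
    ("pretty scenery", +1),
]

eta = 0.1                    # step size given in the problem
w = defaultdict(float)       # sparse weight vector, initialized to all zeros

def phi(x):
    """Map a review to its sparse word-count feature vector."""
    counts = defaultdict(float)
    for word in x.split():
        counts[word] += 1.0
    return counts

for x, y in data:            # one SGD step per sample, in the order given
    f = phi(x)
    margin = y * sum(w[k] * v for k, v in f.items())
    if margin < 1:           # hinge loss active: gradient is -y * phi(x)
        for k, v in f.items():
            w[k] += eta * y * v   # w <- w - eta * grad = w + eta * y * phi(x)

print(dict(w))               # final weights of the six words
```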
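For part 3, one way to sanity-check the claim before writing the proof is a linear-programming feasibility test: f_w classifies every point correctly iff y (w · ϕ(x)) > 0 for all four reviews, which by rescaling w is equivalent to y (w · ϕ(x)) ≥ 1. A minimal sketch, assuming NumPy and SciPy are available (the vocab list and phi helper are illustrative):

```python
import numpy as np
from scipy.optimize import linprog

# Dataset from part 2; word-count features over the vocabulary {bad, good, not}.
vocab = ["bad", "good", "not"]
data = [("bad", -1), ("good", +1), ("not bad", +1), ("not good", -1)]

def phi(x):
    return np.array([x.split().count(word) for word in vocab], dtype=float)

# Zero training error <=> there exists w with y * (w . phi(x)) >= 1 for all
# points. Encode as linprog constraints: -y * (w . phi(x)) <= -1.
A_ub = np.array([-y * phi(x) for x, y in data])
b_ub = -np.ones(len(data))
res = linprog(c=np.zeros(len(vocab)), A_ub=A_ub, b_ub=b_ub,
              bounds=[(None, None)] * len(vocab))

print("separating w exists:", res.success)   # prints False: infeasible
```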

Problem 2: Predicting Movie Ratings


Suppose that we are now interested in predicting a numeric rating for movie reviews. We will use a non-linear
predictor that takes a movie review x and returns σ(w · ϕ(x)), where σ(z) = (1 + e^{−z})^{−1} is the logistic function,
which squashes a real number into the range (0, 1). For this problem, assume that the movie rating y is a real-valued
variable in the range [0, 1].

1. Suppose that we wish to use the squared loss. Write out the expression for Loss(x, y, w) for a single datapoint
(x, y).
2. Given Loss(x, y, w) from the previous part, compute the gradient of the loss with respect to w, ∇_w Loss(x, y, w).
Write the answer in terms of the predicted value p = σ(w · ϕ(x)). (A finite-difference gradient check is sketched
below.)
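A finite-difference check is a convenient way to verify a hand-derived gradient before trusting it. Below is a minimal sketch, assuming NumPy; the function names and the random test point are illustrative.

```python
import numpy as np

def sigma(z):
    """Logistic function from the problem statement."""
    return 1.0 / (1.0 + np.exp(-z))

def loss(w, phi_x, y):
    """Squared loss of the logistic predictor on one datapoint."""
    p = sigma(w @ phi_x)
    return (p - y) ** 2

def numeric_grad(w, phi_x, y, eps=1e-6):
    """Central finite-difference approximation of grad_w loss."""
    g = np.zeros_like(w)
    for i in range(len(w)):
        e = np.zeros_like(w)
        e[i] = eps
        g[i] = (loss(w + e, phi_x, y) - loss(w - e, phi_x, y)) / (2 * eps)
    return g

# Compare your analytic gradient against this on a random test point.
rng = np.random.default_rng(0)
w, phi_x, y = rng.normal(size=3), rng.normal(size=3), 0.7
print(numeric_grad(w, phi_x, y))
```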
