Welcome to Scribd!

CSE 518A Course Project Milestone 2

Uploaded by

0% found this document useful (0 votes)

8 views2 pages

This document summarizes a course project on the effects of adversarial behavior in crowdsourcing systems that use expectation-maximization (EM) algorithms. It describes the project's data generation strategy, including generating full and sub-sampled datasets with different answer rates. It also discusses modeling worker skill levels, question difficulty, and different worker categories. The document outlines plans to implement and test basic EM, majority voting, and weighted majority voting with adversarial users. It identifies two papers on GLAD and MBEM algorithms that will be explored further.

Original Description:

Original Title

CSE_518A_Course_Project_Milestone_2

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

8 views2 pages

CSE 518A Course Project Milestone 2

Uploaded by

Sayantan Kumar

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

Hostile Takeovers in EM based

Crowdsourcing Systems

Ashwin Kumar, Bradford Orr and Sayantan Kumar

CSE 518A Course Project

In this report we will talk about the progress we have made so far in the course
project. The following 2 sections describes our data generation strategy and the
the variants of EM algorithms found in our literature review that we plan to
implement to illustrate the effects of adversarial behaviour on those algorithms.

Data Generation
For our work, we mainly generated 2 types of data sets. One is the full matrix
where all workers have answered all questions, and the other a sub sampled
version of the full matrix. For the sub sampled data set, we have considered 4
cases : (1) Fixed number of answers per question (2) Each user answers same
number of questions. (3) Variable answer rate per user. (4) Different number of
answers for each question. While data generation, we have assumed the fraction
of adversarial users as a user input and we plan to model that in the later stages.
Contrary to the usual case where a worker making a random guess has a skill of
0.5, we have sampled the initial worker skills from a Gaussian distribution with
mean 0.5 and standard deviation 0.25.

Apart from worker skills, we plan to model the difficulty level of the question.
Since we assuming random values between +1 and −1 for the ground truth and
the answers, it would be more realistic if workers answer only those questions
whose difficulty level is less than their skills. We have considered tags or labels
of each question. The tags will not be implicitly known to the workers. It would
be an interesting experiment analyze how the event of adversarial workers tar-
geting some specific tags overall affect the system.

For our data generation, we have assumed mainly 2 categories of workers,

the adversarial workers whose skill level is 1 and the other normal workers
who have a chance of 0.25 − 0.75 to know the correct answer to the questions,
and guess randomly otherwise. While this assumption is a more constrained
type, one interesting insight of this is how will the overall model be changed
if we assume 3 categories of workers : skilled adversarial, skill normal and
noisy workers.

1
Probable EM approaches
To test the effect of adversarial workers on the system, we have planned to try
some popular algorithms where the EM framework is used. While our first aim
is to test the adversarial effect on the basic EM, we want to analyze if other
more complicated frameworks are susceptible to the hostile takeover. Apart
from the basic EM, we also plan to introduce adversarial users while Majority
Voting and Weighted Majority Voting as part of the baseline approaches.

After a brief literature review, we have selected 2 papers. First one is the
paper which implemented GLAD [Whose Vote Should Count More: Optimal
Integration of Labels from Labelers of Unknown Expertise]. The 2nd one is
Learning From Noisy Singly-labeled Data which developed an algorithm called
Model Bootstrapped EM algorithm (MBEM). We are in the process of reading
more papers to find EM based algorithms that suits our purpose.

Top 100 Machine Learning Questions With Answers For Interview PDF
Document48 pages
Top 100 Machine Learning Questions With Answers For Interview PDF
Piyush Saraf
100% (2)
AS1 Digital Marketing Report
Document5 pages
AS1 Digital Marketing Report
Nimra Tahir
No ratings yet
Social Determinants of Health Report
Document24 pages
Social Determinants of Health Report
Ganis Ry
No ratings yet
CSE 518A Project Report
Document6 pages
CSE 518A Project Report
Sayantan Kumar
No ratings yet
Reputation-Based Worker Filtering in Crowdsourcing
Document9 pages
Reputation-Based Worker Filtering in Crowdsourcing
morteza hosseini
No ratings yet
Machine Learning Algorithms
Document245 pages
Machine Learning Algorithms
bradburywills
No ratings yet
Project Lit Final1
Document15 pages
Project Lit Final1
poornima
No ratings yet
Seke07 PDF
Document6 pages
Seke07 PDF
Deepak dimri
No ratings yet
Approximate Statistical Tests For Comparing Supervised Classification Learning Algorithms
Document30 pages
Approximate Statistical Tests For Comparing Supervised Classification Learning Algorithms
smunoz443x9066
No ratings yet
Data Analytics Lab
Document46 pages
Data Analytics Lab
Anupriya Jain
No ratings yet
Iterative Learning For Reliable Crowdsourcing Systems
Document10 pages
Iterative Learning For Reliable Crowdsourcing Systems
morteza hosseini
No ratings yet
Machine Learning
Document19 pages
Machine Learning
Daksh
No ratings yet
ML Questions
Document31 pages
ML Questions
Shubham Bakshi
No ratings yet
Document
Document9 pages
Document
Ayush Patel
No ratings yet
What Are The Types of Machine Learning?
Document24 pages
What Are The Types of Machine Learning?
sahil kumar
100% (1)
BadPrompt Backdoor Attacks On Continuous
Document1 page
BadPrompt Backdoor Attacks On Continuous
shifa fatima
No ratings yet
AI Session 3 Machine Learning Slides
Document35 pages
AI Session 3 Machine Learning Slides
Philani Mangezi
No ratings yet
Fundamentals of Machine Learning II
Document13 pages
Fundamentals of Machine Learning II
ssakhare2001
No ratings yet
ML Exam
Document5 pages
ML Exam
Aravind Kumar Reddy
No ratings yet
Proposed Methods
Document2 pages
Proposed Methods
ankit sahu
No ratings yet
Proceedings of The 1997 Winter Simulation Conference Ed. S. Andradóttir, K. J. Healy, D. H. Withers, and B. L. Nelson
Document7 pages
Proceedings of The 1997 Winter Simulation Conference Ed. S. Andradóttir, K. J. Healy, D. H. Withers, and B. L. Nelson
Eric Reyes
No ratings yet
Machinelearning Concepts
Document29 pages
Machinelearning Concepts
raqibapp
No ratings yet
TusharGoel Seminar PPT
Document23 pages
TusharGoel Seminar PPT
tushar goel
No ratings yet
Introduction To Machine Learning For Beginners
Document5 pages
Introduction To Machine Learning For Beginners
Nandkumar Khachane
No ratings yet
Project - Machine Learning-Business Report: By: K Ravi Kumar PGP-Data Science and Business Analytics (PGPDSBA.O.MAR23.A)
Document38 pages
Project - Machine Learning-Business Report: By: K Ravi Kumar PGP-Data Science and Business Analytics (PGPDSBA.O.MAR23.A)
Ravi Kotharu
No ratings yet
Social Media Mining
Document10 pages
Social Media Mining
Hari Atharsh
No ratings yet
Finalized Review Report 3 (Gradient, Confusion Matrix)
Document5 pages
Finalized Review Report 3 (Gradient, Confusion Matrix)
Sameer Younas
No ratings yet
Chapter11 Slides
Document20 pages
Chapter11 Slides
Parth Rajesh Sheth
No ratings yet
MFDS - Test 1 Problems
Document9 pages
MFDS - Test 1 Problems
Larry Lautor
No ratings yet
Expectation Maximization Homework Solution
Document8 pages
Expectation Maximization Homework Solution
cffge1tw
100% (1)
Lecture1423722821 56 69
Document14 pages
Lecture1423722821 56 69
Pavan Gutta
No ratings yet
Ds Module 4
Document73 pages
Ds Module 4
Prathik Srinivas
No ratings yet
Infomais Example Exam
Document12 pages
Infomais Example Exam
summerorvector
No ratings yet
170 Machine Learning Interview Questios - Greatlearning
Document57 pages
170 Machine Learning Interview Questios - Greatlearning
Lily Lauren
No ratings yet
COS 511: Foundations of Machine Learning
Document7 pages
COS 511: Foundations of Machine Learning
Pooja Sinha
No ratings yet
Machine Learning
Document16 pages
Machine Learning
Zhivko Zhelyazkov
No ratings yet
Predicting Credit Card Approvals
Document14 pages
Predicting Credit Card Approvals
as
100% (1)
Estimation of Software Defects Fix Effort Using Neural Networks
Document2 pages
Estimation of Software Defects Fix Effort Using Neural Networks
SpeedSrl
No ratings yet
What Is Machine Learning?: Example 1
Document5 pages
What Is Machine Learning?: Example 1
samuel rivera enriquez
No ratings yet
Assignment CSE 411
Document6 pages
Assignment CSE 411
asrar tamim
No ratings yet
Ranking Features Based On Predictive Power - Importance of The Class Labels
Document11 pages
Ranking Features Based On Predictive Power - Importance of The Class Labels
Juan
No ratings yet
Operations Research Lecture Notes 2-Introduction To Operations Research-2
Document27 pages
Operations Research Lecture Notes 2-Introduction To Operations Research-2
Ülvi Mamedov
No ratings yet
02 Ai Project Cycle Revision Notes
Document4 pages
02 Ai Project Cycle Revision Notes
devendrabanac
No ratings yet
TEAM DS Final Report
Document14 pages
TEAM DS Final Report
Gurucharan Reddy
No ratings yet
ML Unit 1
Document74 pages
ML Unit 1
mr. potter
No ratings yet
Explaining Network Intrusion Detection System Using Explainable AI Framework
Document10 pages
Explaining Network Intrusion Detection System Using Explainable AI Framework
Khalid Syfullah
No ratings yet
Types of ML
Document4 pages
Types of ML
chandana kiran
No ratings yet
246 AI-900 New Sets
Document20 pages
246 AI-900 New Sets
robin jaiswal
No ratings yet
Lets Verify Step by Step
Document29 pages
Lets Verify Step by Step
Alex Bravo
No ratings yet
Annu Maria-Introduction To Modelling and Simulation
Document7 pages
Annu Maria-Introduction To Modelling and Simulation
Miguel Dominguez de García
0% (1)
Whats Your ML Test Score A Rubric For ML Production Systems
Document5 pages
Whats Your ML Test Score A Rubric For ML Production Systems
정현도
No ratings yet
Programming Building Blocks: Birkbeck College, University of London School of Computer Science and Information Systems
Document23 pages
Programming Building Blocks: Birkbeck College, University of London School of Computer Science and Information Systems
Syed Shabir
No ratings yet
Introduction To Modeling and Simulation
Document7 pages
Introduction To Modeling and Simulation
Abiodun Gbenga
100% (2)
Lecture - 2 Classification (Machine Learning Basic and KNN)
Document94 pages
Lecture - 2 Classification (Machine Learning Basic and KNN)
Dawit Woldemichael
No ratings yet
Machine Learning Models: by Mayuri Bhandari
Document48 pages
Machine Learning Models: by Mayuri Bhandari
mayuri
No ratings yet
7 محاضرات
Document36 pages
7 محاضرات
nnnn403010
No ratings yet
Untitled
Document7 pages
Untitled
Lillian Lin
No ratings yet
A Systematic Literature Review On Fault Prediction Performance in Software Engineering PDF
Document4 pages
A Systematic Literature Review On Fault Prediction Performance in Software Engineering PDF
ea4gaa0g
No ratings yet
PCX - RepoHHHHHHHHHrt
Document13 pages
PCX - RepoHHHHHHHHHrt
Said Rahman
No ratings yet
2-Machine Learning For Data Streams
Document5 pages
2-Machine Learning For Data Streams
Nagendra Kumar
No ratings yet
Mastering Machine Learning Basics: A Beginner's Companion
From Everand
Mastering Machine Learning Basics: A Beginner's Companion
Moss Adelle Louise
No ratings yet
Artificial Intelligence Diagnosis: Fundamentals and Applications
From Everand
Artificial Intelligence Diagnosis: Fundamentals and Applications
Fouad Sabry
No ratings yet
Jack-In Pile On Weathered Granite
Document4 pages
Jack-In Pile On Weathered Granite
Shamsul Bahrin Sulaiman
No ratings yet
Emotional Inteligence Scale
Document16 pages
Emotional Inteligence Scale
Kriti Shetty
No ratings yet
Quantity Surveyor - : Rider Levett Bucknall Philippines, Inc
Document3 pages
Quantity Surveyor - : Rider Levett Bucknall Philippines, Inc
Carlito Pantalunan
No ratings yet
Performance Management at National Institute of Management
Document8 pages
Performance Management at National Institute of Management
Jeet Dash
No ratings yet
2023 - MP02 Reliability Management
Document88 pages
2023 - MP02 Reliability Management
OPHAR SEKTOR TAMBORA
No ratings yet
Initial Professional Development Chartered Member
Document27 pages
Initial Professional Development Chartered Member
Vijay
No ratings yet
Objectives and Outcomes: Textile Design Concentration-BS
Document2 pages
Objectives and Outcomes: Textile Design Concentration-BS
Md Masum
No ratings yet
PPT
Document15 pages
PPT
maritthe morales
No ratings yet
Vibration
Document468 pages
Vibration
aklamos
No ratings yet
Week 9-12 Forms and Types of Creative Nonfiction
Document10 pages
Week 9-12 Forms and Types of Creative Nonfiction
MARY NELLE JEAN COSPADA
No ratings yet
Action Research
Document12 pages
Action Research
yasminkhalid
No ratings yet
Factors of Undecidability in Career Choices of Grade 11 General Academic Track Students. Basis For Career Decision-Making Program
Document8 pages
Factors of Undecidability in Career Choices of Grade 11 General Academic Track Students. Basis For Career Decision-Making Program
Sabrina Duff Ocampo
No ratings yet
Understanding Culture, Society and Politics
Document24 pages
Understanding Culture, Society and Politics
Cherry Chanel
No ratings yet
1 Factorial Experiments PDF
Document35 pages
1 Factorial Experiments PDF
whmimbs
No ratings yet
Ug030050 International Gcse in Geography Master Booklet Spec Sams For Web 220212
Document140 pages
Ug030050 International Gcse in Geography Master Booklet Spec Sams For Web 220212
api-298584351
100% (1)
Flows of Ideas - Tamara - Research Method
Document6 pages
Flows of Ideas - Tamara - Research Method
Tamara
No ratings yet
Environmental Life Cycle Cost Analysis of Products
Document24 pages
Environmental Life Cycle Cost Analysis of Products
Gowthaman Maruthamuthu
No ratings yet
General Properties of Strongly Magic Squares: Neeradha. C. K. Dr. V. Madhukar Mallayya
Document8 pages
General Properties of Strongly Magic Squares: Neeradha. C. K. Dr. V. Madhukar Mallayya
CuriosityShop
No ratings yet
Statistics - How To Draw Probability Density Function in MatLab
Document3 pages
Statistics - How To Draw Probability Density Function in MatLab
yousnail
No ratings yet
Research Methodology: By: Dr. Najma Kabir University of Management & Technology Lahore
Document12 pages
Research Methodology: By: Dr. Najma Kabir University of Management & Technology Lahore
Atique Mughal
No ratings yet
Adv 1403
Document20 pages
Adv 1403
MarcTim
No ratings yet
Image Analysis and Machine Learning For Detecting Malaria
Document21 pages
Image Analysis and Machine Learning For Detecting Malaria
William Jonathan
No ratings yet
Irc Pahs Proposal Submission Form
Document4 pages
Irc Pahs Proposal Submission Form
Sulabh Shrestha
No ratings yet
Edtpa Lesson Plan
Document3 pages
Edtpa Lesson Plan
api-253147349
No ratings yet
Major Construction Risk Factors Considered by General Contractors in Qatar
Document30 pages
Major Construction Risk Factors Considered by General Contractors in Qatar
efe
No ratings yet
Designing A Survey Questionnaire
Document4 pages
Designing A Survey Questionnaire
Riala Teresa Galang
No ratings yet
Renewable and Sustainable Energy Reviews: Nelson Fumo
Document8 pages
Renewable and Sustainable Energy Reviews: Nelson Fumo
Yousif Kirkuke
No ratings yet
Naturalistic and Islamic Approaches To Psychology, Psychotherapy, and Religion - JF Psy of Religion
Document16 pages
Naturalistic and Islamic Approaches To Psychology, Psychotherapy, and Religion - JF Psy of Religion
Dimas Wichaksono
No ratings yet