Professional Documents
Culture Documents
Existing System:
The first type of solutions is the pure inference methods. The input and output of these pure
inference methods are the crowd sourced labels and the inferred labels respectively. They have
not exploited the budget allocation issue in the label acquisition phase, and just conduct uniform
budget distribution. The second kind of solutions, considers both label inference and label
acquisition. With parameters estimated in the inference phase, they allocate budget to tough
items in the label acquisition phase. Benefited from a more effective usage of budget, these
adaptive methods generally achieve a higher accuracy than pure inference methods.
Disadvantage:
Their drawback lies in the assumption of managing workers. In this assumption, the requesters
can target a specific worker to answer questions. In other words, the requesters are able to assign
tasks to specific workers. This is not possible on typical crowd sourcing markets like AMT or
Crowd Flower, On common crowd sourcing platforms AMT and Crowd- Flower, requesters
decide which tasks to ask, and the posted tasks are self-selected by the workers.
Proposed system:
In pursuit of a higher classification accuracy using common platforms, we build a classification
crowd sourcing framework in this paper, called Dynamic Label Acquisition and Answer
Aggregation (DLTA, T means triple) framework. The DLTA framework follows an adaptive
design for human machine- crowd interactions. In this framework, the machine serves as the
medium between the user and the crowd. It proceeds in a sequence of rounds, and each round
consists of two steps, the label inference (answer aggregation) step and the label acquisition
step. In the label inference step, after the previous issued query is completed by the crowd, it
conducts inference with new answers and then outputs its estimation to help the user decide the
amount of budget for next round. The budget is given dynamically in a round manner in real-
time by the user. In the label acquisition step, given the budget for the next round by the
requester, the machine allocates the budget among items, and then issues the corresponding
query to the crowd through platforms like AMT. Note that in the label acquisition step, the
requester can alternatively choose to stop when he/she is satisfied with the present output.
Advantages:
DLTA proceeds in a sequence of rounds. In each round, based on the intermediate results of
previous rounds,
1) The user determines the round budget in a real-time manner.
2) DLTA performs proper budget allocation among items with the given budget. In this way, it
dynamically issues crowd queries and acquires labels round-by-round. In contrast, those pure
label inference algorithms, whose label acquisition follows a one-shot strategy, do not have this
flexibility.
Allocation measurement:
In this subsection, we describe our method on measuringthe benefit of allocating some budget to
an item in (t+1)th round, with respect to the inferred results of t-th round. As an user of labor
markets like AMT, LIPE helps us to infer the reliabilities of the participating workers from the
market. Based on these reliability samples, we can further estimate a reliability density
distribution in the underlying market. In the following, we first illustrate our worker reliability
distribution model. This distribution would be used in the post-estimation of Pik’s, which is a
prediction of Pik’s for item i if a certain amount of budget is allocated to collect its new labels.
The benefit of that budget allocation is then measured by comparing the current estimation and
the post-estimation.
Software Requirements: