Fundamental Limits of Budget-Fidelity Trade-off in Label Crowdsourcing

Creators: Lahouti, Farshad; Hassibi, Babak

Abstract

Digital crowdsourcing (CS) is a modern approach to performing certain large projects using small contributions from a large crowd. In CS, a taskmaster typically breaks down the project into small batches of tasks and assigns them to so-called workers with imperfect skill levels. The crowdsourcer then collects and analyzes the results for inference and to serve the purpose of the project. In this work, the CS problem, as a human-in-the-loop computation problem, is modeled and analyzed in an information-theoretic rate-distortion framework. The purpose is to identify the ultimate fidelity that one can achieve with a given budget, using any form of query to the crowd and any decoding (inference) algorithm. The results are established by a joint source-channel (de)coding scheme, which represents the query scheme and inference, over parallel noisy channels, which model workers with imperfect skill levels. We also present and analyze a query scheme dubbed k-ary incidence coding and study optimized query pricing in this setting.
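
As a rough illustration of the kind of budget-fidelity limit the abstract refers to, the sketch below assumes the simplest possible setting: uniform binary labels, Hamming distortion (fraction of wrong labels), and identical workers, each modeled as a binary symmetric channel with error probability eps. These assumptions are ours and are not taken from the paper, whose model is more general. Under them, the classical source-channel separation argument gives a lower bound of (1 - h(D)) / (1 - h(eps)) queries per label to reach an average error of D, where h is the binary entropy function. The function names are hypothetical.

```python
import math


def binary_entropy(p: float) -> float:
    """Binary entropy h(p) in bits, with h(0) = h(1) = 0 by convention."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)


def bsc_capacity(eps: float) -> float:
    """Capacity (bits per query) of a worker modeled as a binary symmetric
    channel that flips the true answer with probability eps (assumption)."""
    return 1.0 - binary_entropy(eps)


def min_queries_per_item(eps: float, distortion: float) -> float:
    """Lower bound on queries per label for uniform binary labels.

    Rate-distortion function of a uniform binary source under Hamming
    distortion is R(D) = 1 - h(D) for 0 <= D <= 0.5; dividing by the
    per-query capacity gives the minimum number of queries per label.
    """
    if not 0.0 <= distortion <= 0.5:
        raise ValueError("distortion must lie in [0, 0.5]")
    if not 0.0 <= eps <= 1.0:
        raise ValueError("eps must lie in [0, 1]")
    rate = 1.0 - binary_entropy(distortion)
    if rate == 0.0:
        return 0.0  # random guessing already achieves D = 0.5
    capacity = bsc_capacity(eps)
    if capacity == 0.0:
        return math.inf  # workers who answer at random carry no information
    return rate / capacity


if __name__ == "__main__":
    for eps in (0.05, 0.1, 0.2, 0.3):
        bound = min_queries_per_item(eps, distortion=0.01)
        print(f"eps={eps:.2f}: at least {bound:.2f} queries per label")
```

The bound only says what no query scheme or inference algorithm can beat in this toy setting; it does not by itself describe a scheme that attains it.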

Attached Files

Published - NIPS-2016-fundamental-limits-of-budget-fidelity-trade-off-in-label-crowdsourcing-Paper.pdf

Submitted - 1608.07328.pdf

Supplemental Material - NIPS-2016-fundamental-limits-of-budget-fidelity-trade-off-in-label-crowdsourcing-Supplemental.zip

Files (1.0 MB)

Name	Size	MD5
1608.07328.pdf	418.5 kB	441aaa8ed76292017b33ea25642fc7e4
NIPS-2016-fundamental-limits-of-budget-fidelity-trade-off-in-label-crowdsourcing-Paper.pdf	178.3 kB	2a1e8d6c56747984fbae914309610950
NIPS-2016-fundamental-limits-of-budget-fidelity-trade-off-in-label-crowdsourcing-Supplemental.zip	428.7 kB	7df328d215d521bbb0201ce28f00e926
