Sociology, Department of

Department of Sociology: Faculty Publications

Humans in the Loop: Incorporating Expert and Crowd-sourced Knowledge for Predictions Using Survey Data

Anna Filippova, Carnegie Mellon University
Connor Gilroy, University of Washington
Ridhi Kashyap, University of Oxford
Antje Kirchner, RTI International
Allison C. Morgan, University of Colorado, Boulder
Kivan Polimis, Università Bocconi, Bocconi Institute for Data Science and Analytics
Adaner Usmani, Brown University
Tong Wang, University of Iowa

ORCID IDs

Kivan Polimis

Document Type

Article

Date of this Version

2019

Comments

CC-BY-NC

Abstract

Survey data sets are often wider than they are long. This high ratio of variables to observations raises concerns about overfitting during prediction, making informed variable selection important. Recent applications in computer science have sought to incorporate human knowledge into machine-learning methods to address these problems. The authors implement such a “human-in-the-loop” approach in the Fragile Families Challenge. The authors use surveys to elicit knowledge from experts and laypeople about the importance of different variables to different outcomes. This strategy offers the option to subset the data before prediction or to incorporate human knowledge as scores in prediction models, or both together. The authors find that human intervention is not obviously helpful. Human-informed subsetting reduces predictive performance, and considered alone, approaches incorporating scores perform marginally worse than approaches that do not. However, incorporating human knowledge may still improve predictive performance, and future research should consider new ways of doing so.

Included in

Family, Life Course, and Society Commons, Social Psychology and Interaction Commons
