Machine Learning: Applications and Opportunities in the Social Sciences


Instructor: Christopher Hare, University of California at Davis

Machine learning is most commonly associated with "big data": the use of massive datasets to improve predictions in tasks such as credit card fraud detection and Netflix recommendations. Though machine learning has been most influential in commercial and medical applications, a growing number of social scientists are using these methods to (1) uncover patterns and structure embedded in data, (2) test and improve model specification and predictions, and (3) perform data reduction. This course covers the mechanics underlying machine learning methods and discusses how social scientists can use these techniques to gain new insight from their data. Specifically, the course will cover decision trees, random forests, boosting, k-means clustering and nearest neighbors, support vector machines, kernels, neural networks, and ensemble learning. We will also discuss best practices, including error rates, cross-validation, and the use of bootstrapping methods to develop uncertainty estimates.

Fee: Members = $1700; Non-members = $3200

Tags: machine learning

Course Sections

Section 1

Location: ICPSR -- Ann Arbor, MI

Date(s): August 7 - August 11

Time: 9:00 AM - 5:00 PM


Instructor: Christopher Hare, University of California at Davis