Automated Macro-scale Causal Hypothesis Formation Based on Micro-scale Observation
Author: Chalupka, Krzysztof
Year: 2017
Degree: Dissertation (Ph.D.)
Advisors: Perona, Pietro; Eberhardt, Frederick
Committee Members: Perona, Pietro; Eberhardt, Frederick D.; Yue, Yisong; Tsao, Doris Y.
Option: Computation and Neural Systems
DOI: 10.7907/Z9MW2F4P
Abstract
This book introduces new concepts at the intersection of machine learning, causal inference and philosophy of science: the macrovariable cause and effect. Methods for learning such from microvariable data are introduced. The learning process proposes a minimal number of guided experiments that recover the macrovariable cause from observational data.
Mathematical definitions of a micro- and macro- scale manipulation, an observational and causal partition, and a subsidiary variable are given. These concepts provide a link to previous work in causal inference and machine learning.
The main theoretical result is the Causal Coarsening Theorem, a new insight into the measure-theoretic structure of probability spaces and structural equation models. The theorem provides grounds for automatic causal hypothesis formation from data. Other results concern the minimality and sufficiency of representations created in accordance with the theorem.
Finally, this book proposes the first algorithms for supervised and unsupervised causal macrovariable discovery. These algorithms bridge large-scale, multidimensional machine learning and causal inference. In an application to climate science, the algorithms re-discover a known causal mechanism as a viable causal hypothesis. In a psychophysical experiment, the algorithms learn to minimally change visual stimuli to achieve a desired effect on human perception.
Files
- chalupka-thesis.pdf (application/pdf)