Causal Representation Learning in Temporal Data via Single-Parent Decoding


Journal article


Philippe Brouillard, Sébastien Lachapelle, Julia Kaltenborn, Yaniv Gurwicz, Dhanya Sridhar, Alexandre Drouin, Peer Nowack, Jakob Runge, David Rolnick
2024

Semantic Scholar ArXiv
Cite

Cite

APA   Click to copy
Brouillard, P., Lachapelle, S., Kaltenborn, J., Gurwicz, Y., Sridhar, D., Drouin, A., … Rolnick, D. (2024). Causal Representation Learning in Temporal Data via Single-Parent Decoding.


Chicago/Turabian   Click to copy
Brouillard, Philippe, Sébastien Lachapelle, Julia Kaltenborn, Yaniv Gurwicz, Dhanya Sridhar, Alexandre Drouin, Peer Nowack, Jakob Runge, and David Rolnick. “Causal Representation Learning in Temporal Data via Single-Parent Decoding” (2024).


MLA   Click to copy
Brouillard, Philippe, et al. Causal Representation Learning in Temporal Data via Single-Parent Decoding. 2024.


BibTeX   Click to copy

@article{philippe2024a,
  title = {Causal Representation Learning in Temporal Data via Single-Parent Decoding},
  year = {2024},
  author = {Brouillard, Philippe and Lachapelle, Sébastien and Kaltenborn, Julia and Gurwicz, Yaniv and Sridhar, Dhanya and Drouin, Alexandre and Nowack, Peer and Runge, Jakob and Rolnick, David}
}

Abstract

Scientific research often seeks to understand the causal structure underlying high-level variables in a system. For example, climate scientists study how phenomena, such as El Ni~no, affect other climate processes at remote locations across the globe. However, scientists typically collect low-level measurements, such as geographically distributed temperature readings. From these, one needs to learn both a mapping to causally-relevant latent variables, such as a high-level representation of the El Ni~no phenomenon and other processes, as well as the causal model over them. The challenge is that this task, called causal representation learning, is highly underdetermined from observational data alone, requiring other constraints during learning to resolve the indeterminacies. In this work, we consider a temporal model with a sparsity assumption, namely single-parent decoding: each observed low-level variable is only affected by a single latent variable. Such an assumption is reasonable in many scientific applications that require finding groups of low-level variables, such as extracting regions from geographically gridded measurement data in climate research or capturing brain regions from neural activity data. We demonstrate the identifiability of the resulting model and propose a differentiable method, Causal Discovery with Single-parent Decoding (CDSD), that simultaneously learns the underlying latents and a causal graph over them. We assess the validity of our theoretical results using simulated data and showcase the practical validity of our method in an application to real-world data from the climate science field.