We present a novel approach to constraint-based causal discovery that takes the form of straightforward logical inference, applied to a list of simple logical statements about causal relations that are derived directly from observed (in)dependencies. It is both sound and complete, in the sense that all invariant features of the corresponding partial ancestral graph (PAG) are identified, even in the presence of latent variables and selection bias. The approach shows that every identifiable causal relation corresponds to one of just two fundamental forms. More importantly, as the basic building blocks of the method do not rely on the detailed (graphical) structure of the corresponding PAG, it opens up a range of new opportunities, including more robust inference, detailed accountability, and application to large models.
The Maximal Ancestral Graph (MAG) formalism is an important generalization of Bayesian Networks for representing causal processes that admit the possibility of latent confounding variables. Thus, when learning MAGs from data for Causal Discovery, the often unrealistic assumption of Causal Sufficiency can be dropped. However, the causal interpretation of edges in a MAG is not trivial and can mislead unfamiliar practitioners. An edge X → Y may denote either (a) X causes Y and no latent confounding variable is present (pure-causal edge) or (b) X causes Y with the potential presence of a latent common cause. In addition, an edge X → Y may denote (I) X causes Y directly (direct-causal edge), i.e., without any modeled variables mediating the causation, or (II) X causes Y possibly indirectly. In this paper, we present polynomial-time algorithms and tools that can distinguish among the above cases and facilitate the causal interpretation of MAGs. In addition, we run simulated experiments to quantify the percentage of edges that can be labeled as pure-causal or direct-causal. Our results show that the percentage of edges that can be labeled as pure-causal reaches a minimum for sparse or dense networks and a maximum for in-between values of edge density. In contrast, the percentage of edges that can be labeled as direct-causal decreases as the edge density of the MAG increases.
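To make the pure-causal case concrete, the sketch below checks one sufficient condition for it. It implements only the first clause of the standard visibility criterion for MAGs (an edge X → Y is visible, i.e., free of latent confounding, if some vertex V not adjacent to Y has an edge into X); the second clause, involving collider paths into X whose vertices are all parents of Y, is omitted, so a False result means "undetermined", not "confounded". The edge encoding below is an assumption of this sketch, not the paper's representation.

```python
# Partial check for "pure-causal" (visible) directed edges in a MAG.
# A MAG edge is stored as (a, b, mark_at_a, mark_at_b); marks: ">" arrowhead,
# "-" tail.  So X -> Y is ("X", "Y", "-", ">"), and X <-> Y is ("X", "Y", ">", ">").

def is_adjacent(mag, a, b):
    return any({a, b} == {u, v} for u, v, _, _ in mag)

def edge_into(mag, v, x):
    """True if there is an edge between v and x with an arrowhead at x."""
    for u, w, mu, mw in mag:
        if (u, w) == (v, x) and mw == ">":
            return True
        if (u, w) == (x, v) and mu == ">":
            return True
    return False

def visible_first_clause(mag, x, y):
    """True: X -> Y is definitely visible. False: undetermined by this sketch."""
    nodes = {n for u, v, _, _ in mag for n in (u, v)}
    return any(edge_into(mag, v, x)
               for v in nodes - {x, y} if not is_adjacent(mag, v, y))

# Example: in V <-> X -> Y with V not adjacent to Y, the edge X -> Y is visible.
mag = [("V", "X", ">", ">"), ("X", "Y", "-", ">")]
print(visible_first_clause(mag, "X", "Y"))  # True
```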
We consider the incorporation of causal knowledge about the presence or absence of (possibly indirect) causal relations into a causal model. Such causal relations correspond to directed paths in a causal model. This type of knowledge naturally arises from experimental data, among other sources. Specifically, we consider the formalisms of Causal Bayesian Networks and Maximal Ancestral Graphs and their Markov equivalence classes: Partially Directed Acyclic Graphs and Partially Oriented Ancestral Graphs. We introduce sound and complete procedures that can incorporate causal prior knowledge in such models. In simulated experiments, we show that often considering even a few causal facts leads to a significant number of new inferences. In a case study, we also show how to use real experimental data to infer causal knowledge and incorporate it into a real biological causal network. The code is available at mensxmachina.org.
2011
We present two inference rules, based on so-called minimal conditional independencies, that are sufficient to find all invariant arrowheads in a single causal DAG, even when selection bias may be present. It turns out that the seven graphical orientation rules that are usually employed to identify these arrowheads are, in fact, just different instances of these two rules. The resulting algorithm to obtain the definite causal information is elegant and fast, once the (often surprisingly small) set of minimal independencies is found. * This research was funded by NWO Vici grant 639.023.604.
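The abstract does not spell the two rules out, but a plausible schematic reading is: a *minimal* conditional independence X ⟂ Y given Z ∪ {W} (one destroyed by removing W) implies W is an ancestor of X, Y, or a member of Z; a *minimal* conditional dependence (one created by adding W) implies W is an ancestor of none of them. The sketch below merely shows how such statements could be represented and dispatched; the statement format and rule details are assumptions of this sketch, not the paper's formulation.

```python
# Schematic rendering of the two rules over a list of logical statements.
#   ("min_indep", X, Y, Z, W): X _||_ Y | Z u {W}, but not X _||_ Y | Z
#                              -> W is an ancestor of X, Y, or some node in Z
#   ("min_dep",   X, Y, Z, W): not X _||_ Y | Z u {W}, but X _||_ Y | Z
#                              -> W is an ancestor of none of X, Y, Z
def apply_rules(statements):
    pos, neg = [], []   # (w, targets): w IS / IS NOT an ancestor of a target
    for kind, x, y, z, w in statements:
        targets = {x, y} | set(z)
        (pos if kind == "min_indep" else neg).append((w, targets))
    return pos, neg

facts = [("min_indep", "X", "Y", (), "W"),   # X _||_ Y | {W}, minimal
         ("min_dep", "A", "B", (), "C")]     # C unshields a dependence
pos, neg = apply_rules(facts)
print(pos)  # [('W', {'X', 'Y'})]  W is an ancestor of X or Y (or of selection)
print(neg)  # [('C', {'A', 'B'})]  C is an ancestor of neither A nor B
```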
2011
In this paper we address the problem of incorporating prior knowledge, in the form of causal relations, in causal models. Prior approaches mostly consider knowledge about the presence or absence of edges in the model. We use the formalism of Maximal Ancestral Graphs (MAGs) and adapt cSAT+ to solve this problem, an algorithm for reasoning with datasets defined over different variable sets.
This paper considers a method that combines ideas from Bayesian learning, Bayesian network inference, and classical hypothesis testing to produce a more reliable and robust test of independence for constraint-based (CB) learning of causal structure. Our method produces a smoothed contingency table N_XYZ that can be used with any test of independence that relies on contingency table statistics. N_XYZ can be calculated in the same asymptotic time and space required to calculate a standard contingency table, allows the specification of a prior distribution over parameters, and can be calculated when the database is incomplete. We provide theoretical justification for the procedure, and with synthetic data we demonstrate its benefits empirically over both a CB algorithm using the standard contingency table and a greedy Bayesian algorithm. We show that, even when used with noninformative priors, it results in better recovery of structural features and produces networks with smaller KL-divergence, especially as the number of nodes increases or the number of records decreases. Another benefit is the dramatic reduction in the probability that a CB algorithm will stall during the search, providing a remedy for an annoying problem that plagues CB learning when the database is small.
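The paper's exact construction of N_XYZ is not reproduced here, but the generic idea of a smoothed contingency table can be sketched as adding Dirichlet pseudo-counts before applying a standard test such as G². The alpha parameter and the G² test below are illustrative stand-ins, not the paper's procedure.

```python
import numpy as np
from scipy.stats import chi2

def smoothed_table(counts, alpha=1.0):
    """Add Dirichlet pseudo-counts alpha to every cell of an X x Y x Z table;
    a generic stand-in for the paper's smoothed table N_XYZ."""
    return counts + alpha

def g2_ci_test(n_xyz):
    """G^2 test of X _||_ Y | Z on a three-way contingency table."""
    n_xz = n_xyz.sum(axis=1, keepdims=True)
    n_yz = n_xyz.sum(axis=0, keepdims=True)
    n_z = n_xyz.sum(axis=(0, 1), keepdims=True)
    expected = n_xz * n_yz / n_z          # expected counts under independence
    g2 = 2.0 * np.sum(n_xyz * np.log(n_xyz / expected))
    dof = (n_xyz.shape[0] - 1) * (n_xyz.shape[1] - 1) * n_xyz.shape[2]
    return g2, chi2.sf(g2, dof)

rng = np.random.default_rng(0)
raw = rng.poisson(3, size=(2, 2, 2)).astype(float)  # sparse raw counts
g2, p = g2_ci_test(smoothed_table(raw, alpha=0.5))  # smoothing avoids zero cells
print(f"G2 = {g2:.2f}, p = {p:.3f}")
```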
Causality discovery without manipulation is considered a crucial problem in a variety of applications, such as genetic therapy. State-of-the-art solutions, e.g., LiNGAM, return accurate results when the number of labeled samples is larger than the number of variables. These approaches are thus applicable only when large numbers of samples are available or the problem domain is sufficiently small. Motivated by the local sparsity properties of causal structures, we propose a general Split-and-Merge strategy, named SADA, to enhance the scalability of a wide class of causality discovery algorithms. SADA is able to accurately identify the causal variables even when the sample size is significantly smaller than the number of variables. In SADA, the variables are partitioned into subsets by finding cuts on the sparse probabilistic graphical model over the variables. By running mainstream causal discovery algorithms, e.g., LiNGAM, on the subproblems, the complete causal structure can be reconstructed by combining all the partial results. SADA benefits from the recursive division technique, since each small subproblem generates a more accurate result under the same number of samples. We theoretically prove that SADA always reduces the scale of problems without significant sacrifice of result accuracy, depending only on the local sparsity condition over the variables. Experiments on real-world datasets verify the improvements in scalability and accuracy achieved by applying SADA on top of existing causal discovery algorithms.
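A structural sketch of such a split-and-merge recursion is given below. The helpers find_causal_cut and base_algorithm are hypothetical placeholders for, respectively, the search for an (approximately) separating cut (V1, V2, C) and the underlying discovery method such as LiNGAM; the details are assumptions, not SADA's actual implementation.

```python
# Structural sketch of a split-and-merge strategy in the spirit of SADA:
# partition the variables by a (near-)separating cut, solve small subproblems
# with any base causal discovery algorithm, and union the recovered edges.

def split_and_merge(variables, data, base_algorithm, find_causal_cut, max_size=10):
    """variables: a set of variable names; base_algorithm returns a set of edges."""
    if len(variables) <= max_size:
        return base_algorithm(variables, data)      # small enough: solve directly
    cut = find_causal_cut(variables, data)          # (v1, v2, c) with v1 _||_ v2 | c
    if cut is None:                                 # no usable cut: stop splitting
        return base_algorithm(variables, data)
    v1, v2, c = cut
    edges = set()
    for part in (v1 | c, v2 | c):                   # recurse on overlapping parts
        edges |= split_and_merge(part, data, base_algorithm,
                                 find_causal_cut, max_size)
    return edges                                    # merged partial results
```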
2009
This paper (which is mainly expository) sets up graphical models for causation, having a bit less than the usual complement of hypothetical counterfactuals. Assuming the invariance of error distributions may be essential for causal inference, but the errors themselves need not be invariant. Graphs can be interpreted using conditional distributions, so that we can better address connections between the mathematical framework and causality in the world. The identification problem is posed in terms of conditionals. As will be seen, causal relationships cannot be inferred from a data set by running regressions unless there is substantial prior knowledge about the mechanisms that generated the data. There are few successful applications of graphical models, mainly because few causal pathways can be excluded on a priori grounds. The invariance conditions themselves remain to be assessed.
Proceedings of the 22nd Conference on …, 2006
Most causal discovery algorithms in the literature exploit an assumption usually referred to as the Causal Faithfulness or Stability Condition. In this paper, we highlight two components of the condition used in constraint-based algorithms, which we call ...
arXiv (Cornell University), 2022
Instrumental variable (IV) is a powerful approach to inferring the causal effect of a treatment on an outcome of interest from observational data even when there exist latent confounders between the treatment and the outcome. However, existing IV methods require that an IV is selected and justified with domain knowledge. An invalid IV may lead to biased estimates. Hence, discovering a valid IV is critical to the applications of IV methods. In this paper, we study and design a data-driven algorithm to discover valid IVs from data under mild assumptions. We develop the theory based on partial ancestral graphs (PAGs) to support the search for a set of candidate Ancestral IVs (AIVs) and, for each possible AIV, the identification of its conditioning set. Based on the theory, we propose a data-driven algorithm to discover a pair of IVs from data. Experiments on synthetic and real-world datasets show that the developed IV discovery algorithm produces accurate estimates of causal effects in comparison with state-of-the-art IV-based causal effect estimators.
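Once a valid IV and its conditioning set have been discovered, the causal effect is typically estimated with a standard IV estimator. The sketch below is generic two-stage least squares with the conditioning set included in both stages; it is not necessarily the estimator used in the paper.

```python
import numpy as np

def two_stage_least_squares(z, w, x, y):
    """Estimate the effect of treatment x on outcome y using instrument z,
    conditioning on covariates w (e.g., a discovered conditioning set).
    Generic 2SLS; the paper's estimator may differ in details."""
    n = len(y)
    ones = np.ones((n, 1))
    # Stage 1: regress the treatment on the instrument plus covariates.
    a = np.column_stack([ones, z, w])
    x_hat = a @ np.linalg.lstsq(a, x, rcond=None)[0]
    # Stage 2: regress the outcome on the fitted treatment plus covariates.
    b = np.column_stack([ones, x_hat, w])
    beta = np.linalg.lstsq(b, y, rcond=None)[0]
    return beta[1]  # coefficient on the (instrumented) treatment

# Toy example with a latent confounder u affecting both x and y.
rng = np.random.default_rng(1)
n, true_effect = 5000, 2.0
u, z = rng.normal(size=n), rng.normal(size=n)
w = rng.normal(size=(n, 1))
x = 0.8 * z + u + w[:, 0] + rng.normal(size=n)
y = true_effect * x - u + 0.5 * w[:, 0] + rng.normal(size=n)
print(two_stage_least_squares(z, w, x, y))  # close to 2.0; naive OLS is biased
```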
2016
Many applications call for learning causal models from relational data. We investigate Relational Causal Models (RCM) under relational counterparts of adjacency-faithfulness and orientation-faithfulness, yielding a simple approach to identifying a subset of relational d-separation queries needed for determining the structure of an RCM using d-separation against an unrolled DAG representation of the RCM.We provide original theoretical analysis that offers the basis of a sound and efficient algorithm for learning the structure of an RCM from relational data. We describe RCD-Light, a sound and efficient constraint-based algorithm that is guaranteed to yield a correct partially-directed RCM structure with at least as many edges oriented as in that produced by RCD, the only other existing algorithm for learning RCM. We show that unlike RCD, which requires exponential time and space, RCDLight requires only polynomial time and space to orient the dependencies of a sparse RCM.
International Conference on Artificial Intelligence and Statistics, 2020
The discovery of causal relationships is a core part of scientific research. Accordingly, over the past several decades, algorithms have been developed to discover the causal structure for a system of variables from observational data. Learning ancestral graphs is of particular interest due to their ability to represent latent confounding implicitly with bi-directed edges. The well-known FCI algorithm provably recovers an ancestral graph for a system of variables encoding the sound and complete set of causal relationships identifiable from observational data. Additional causal relationships become identifiable with the incorporation of background knowledge; however, it is not known for what types of knowledge FCI remains complete. In this paper, we define tiered background knowledge and show that FCI is sound and complete with the incorporation of this knowledge.
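Tiered background knowledge, such as a temporal ordering of variables, can be encoded as a set of forbidden causal directions: no variable in a later tier may cause a variable in a strictly earlier tier. A minimal sketch of generating such constraints for an FCI-style learner follows; the function name and the tier example are illustrative, not from the paper.

```python
def forbidden_by_tiers(tiers):
    """Given tiers (earliest first), return the ordered pairs (a, b) such that
    'a causes b' is forbidden: nothing in a later tier may cause anything
    in a strictly earlier tier."""
    rank = {v: i for i, tier in enumerate(tiers) for v in tier}
    return {(a, b) for a in rank for b in rank if rank[a] > rank[b]}

# Example: demographics precede exposures, which precede outcomes.
tiers = [["age", "sex"], ["smoking"], ["cancer"]]
print(sorted(forbidden_by_tiers(tiers)))
# [('cancer', 'age'), ('cancer', 'sex'), ('cancer', 'smoking'),
#  ('smoking', 'age'), ('smoking', 'sex')]
```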
We address the problem of constraint-based causal discovery with mixed data types, such as (but not limited to) continuous, binary, multinomial, and ordinal variables. We use likelihood-ratio tests based on appropriate regression models and show how to derive symmetric conditional independence tests. Such tests can then be directly used by existing constraint-based methods with mixed data, such as the PC and FCI algorithms for learning Bayesian networks and maximal ancestral graphs, respectively. In experiments on simulated Bayesian networks, we employ the PC algorithm with different conditional independence tests for mixed data and show that the proposed approach outperforms alternatives in terms of learning accuracy.
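For a continuous target, one such likelihood-ratio test compares nested linear regressions with and without the other variable; for binary or ordinal targets the same pattern applies with logistic or ordinal models. The sketch below shows the Gaussian case and one simple way to symmetrize, taking the larger of the two directional p-values; the paper's regression models and combination rule may differ.

```python
import numpy as np
from scipy.stats import chi2

def lrt_gaussian(target, other, z):
    """Likelihood-ratio test of target _||_ other | z via nested linear models."""
    n = len(target)
    ones = np.ones((n, 1))
    full = np.column_stack([ones, z, other])
    restricted = np.column_stack([ones, z])
    def rss(design):
        resid = target - design @ np.linalg.lstsq(design, target, rcond=None)[0]
        return resid @ resid
    stat = n * (np.log(rss(restricted)) - np.log(rss(full)))  # ~ chi2(1) asympt.
    return chi2.sf(stat, df=1)

def symmetric_ci_test(x, y, z):
    """Run the test in both directions and combine; taking the maximum
    p-value is one simple symmetrization."""
    return max(lrt_gaussian(x, y, z), lrt_gaussian(y, x, z))

rng = np.random.default_rng(2)
z = rng.normal(size=(1000, 1))
x = z[:, 0] + rng.normal(size=1000)
y = z[:, 0] + rng.normal(size=1000)
print(symmetric_ci_test(x, y, z))  # large p-value: x _||_ y given z
```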
New Generation Computing, 2016
Learning causal models hidden in the background of observational data has been a difficult issue. Dealing with latent common causes and selection bias when constructing causal models from real data is often necessary, because observing all relevant variables is difficult. Ancestral graph models are effective and useful for representing causal models with some information about such latent variables. The causal faithfulness condition, which is usually assumed for determining the models, is known to often be weakly violated, in a statistical sense, for finite data. One of the authors developed a constraint-based causal learning algorithm that is robust against such weak violations while assuming no latent variables. In this study, we applied and extended the ideas of that algorithm to the inference of ancestral graph models. The practical validity and effectiveness of the algorithm are confirmed using some standard datasets, in comparison with the FCI and RFCI algorithms.
JMLR workshop and conference proceedings, 2016
We present an algorithm for estimating bounds on causal effects from observational data which combines graphical model search with simple linear regression. We assume that the underlying system can be represented by a linear structural equation model with no feedback, and we allow for the possibility of latent variables. Under assumptions standard in the causal search literature, we use conditional independence constraints to search for an equivalence class of ancestral graphs. Then, for each model in the equivalence class, we perform the appropriate regression (using causal structure information to determine which covariates to include in the regression) to estimate a set of possible causal effects. Our approach is based on the "IDA" procedure of Maathuis et al. (2009), which assumes that all relevant variables have been measured (i.e., no unmeasured confounders). We generalize their work by relaxing this assumption, which is often violated in applied contexts. We validat...
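The per-model regression step can be sketched as follows: each graph in the equivalence class dictates an adjustment set, each set yields one OLS estimate, and the spread of estimates gives the bounds. The graph search itself is not shown; the adjustment sets are passed in, and the function names are illustrative.

```python
import numpy as np

def effect_for_adjustment_set(data, x, y, adjust):
    """OLS coefficient of column x in a regression of column y on x plus
    the covariates dictated by one graph in the equivalence class."""
    n = data.shape[0]
    design = np.column_stack([np.ones(n), data[:, x], data[:, list(adjust)]])
    beta = np.linalg.lstsq(design, data[:, y], rcond=None)[0]
    return beta[1]

def effect_bounds(data, x, y, adjustment_sets):
    """adjustment_sets: one covariate set per model in the equivalence class
    (obtained from the graph search, which is not shown here)."""
    effects = [effect_for_adjustment_set(data, x, y, s) for s in adjustment_sets]
    return min(effects), max(effects)

rng = np.random.default_rng(3)
z = rng.normal(size=1000)
x = z + rng.normal(size=1000)
y = 1.5 * x + z + rng.normal(size=1000)
data = np.column_stack([x, y, z])          # columns: 0 = x, 1 = y, 2 = z
print(effect_bounds(data, x=0, y=1, adjustment_sets=[set(), {2}]))
# roughly (1.5, 2.0): adjusting for z recovers 1.5; omitting it inflates the estimate
```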
Discovering causal relationships in large databases of observational data is challenging. The pioneering work in this area was rooted in the theory of Bayesian network (BN) learning, which, however, is an NP-complete problem. Hence several constraint-based algorithms have been developed to efficiently discover causal relationships in large databases. These methods usually use the idea of BN learning, directly or indirectly, and are focused on causal relationships with single cause variables. In this paper, we propose an approach to mine causal rules in large databases of binary variables. Our method expands the scope of causality discovery to causal relationships with multiple cause variables, and we utilise partial association tests to exclude non-causal associations, ensuring the high reliability of discovered causal rules. Furthermore, an efficient algorithm is designed for performing the tests in large databases. We assess the method with a set of real-world diagnostic data. The results show that our method can effectively discover interesting causal rules in large databases.
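A standard instantiation of a partial association test for binary variables is the Cochran-Mantel-Haenszel statistic over 2x2 tables stratified by the controlled variables; whether the paper uses exactly this form is an assumption of the sketch below.

```python
import numpy as np
from scipy.stats import chi2

def cmh_partial_association(tables):
    """Cochran-Mantel-Haenszel test over stratified 2x2 tables.
    tables: array of shape (k, 2, 2), one 2x2 count table per stratum of the
    controlled variables. Returns (statistic, p-value); a small p-value means
    the association persists after controlling for the strata."""
    t = np.asarray(tables, dtype=float)
    a = t[:, 0, 0]                                    # top-left cell per stratum
    row1, col1, n = t[:, 0, :].sum(1), t[:, :, 0].sum(1), t.sum((1, 2))
    expected = row1 * col1 / n                        # E[a] under no association
    row2, col2 = n - row1, n - col1
    var = row1 * row2 * col1 * col2 / (n**2 * (n - 1))
    stat = (abs(a.sum() - expected.sum()) - 0.5) ** 2 / var.sum()  # continuity corr.
    return stat, chi2.sf(stat, df=1)

# Two strata in which the association persists after controlling.
tables = [[[30, 10], [10, 30]],
          [[25, 15], [15, 25]]]
print(cmh_partial_association(tables))  # small p: association not explained away
```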
2013
We propose new principles and techniques to speed up constraint-based algorithms for learning dependency structures from data. The novelty of the proposed framework comes from rules for accelerating inductive inference, which can radically reduce the search space for skeleton inference. These acceleration rules enable fast identification of the skeleton by recognizing the presence (or absence) of an edge and by recognizing some variables as non-members (or obligate members) of a supposed separator. We demonstrate that an algorithm equipped with the proposed rules can learn Bayesian networks (of moderate density) many times faster than the well-known PC algorithm. The improvement extends to non-recursive graphical models, i.e., causal networks with hidden variables.
Lecture Notes in Statistics, 1993
A discovery problem is composed of a set of alternative structures, one of which is the source of data, but any of which, for all the investigator knows before the inquiry, could be the structure from which the data are obtained. There is something to be found out about the actual structure, whichever it is. It may be that we want to settle a particular hypothesis that is true in some of the possible structures and false in others, or it may be that we want to know the complete theory of a certain sort of phenomenon. In this book, and in much of the social sciences and epidemiology, the alternative structures in a discovery problem are typically directed acyclic graphs paired with joint probability distributions on their vertices. We usually want to know something about the structure of the graph that represents causal influences, and we may also want to know about the distribution of values of variables in the graph for a given population. A discovery problem also includes a characterization of a kind of evidence; for example, data may be available for some of the variables but not others, and the data may include the actual probability or conditional independence relations or, more realistically, simply the values of the variables for random samples. Our theoretical discussions will usually consider discovery problems in which the data include the true conditional independence relations among the measured variables, but our examples and applications will always involve inferences from statistical samples. A method solves a discovery problem in the limit if as the sample size increases without bound the method converges to the true answer to the question or to the true theory, whatever
The PC algorithm learns maximally oriented causal Bayesian networks. However, there is no equivalent complete algorithm for learning the structure of relational models, a more expressive generalization of Bayesian networks. Recent developments in the theory and representation of relational models support lifted reasoning about conditional independence. This enables a powerful constraint for orienting bivariate dependencies and forms the basis of a new algorithm for learning structure. We present the relational causal discovery (RCD) algorithm that learns causal relational models. We prove that RCD is sound and complete, and we present empirical results that demonstrate effectiveness.
2022
Unobserved confounding is the main obstacle to causal effect estimation from observational data. Instrumental variables (IVs) are widely used for causal effect estimation when there exist latent confounders. With the standard IV method, when a given IV is valid, unbiased estimation can be obtained, but the validity requirement of standard IV is strict and untestable. Conditional IV has been proposed to relax the requirement of standard IV by conditioning on a set of observed variables (known as a conditioning set for a conditional IV). However, the criterion for finding a conditioning set for a conditional IV needs complete causal structure knowledge or a directed acyclic graph (DAG) representing the causal relationships of both observed and unobserved variables. This makes it impossible to discover a conditioning set directly from data. In this paper, by leveraging maximal ancestral graphs (MAGs) in causal inference with latent variables, we propose a new type of IV, ancestral IV i...