Figure 3From: Generalized Thompson sampling for sequential decision-making and causal inferenceThompson sampling for causal induction. (Left) Posterior distribution P(θ|·) for 10 runs when the true system is given by Θ=θ. (Right) Posterior distribution P(θ|·) for 10 runs when the true system is given by Θ=¬θ. In both cases the agent is able to identify the causal structure of the environment with high confidence when P(θ|⋯) is close to one or zero respectively.Back to article page