21cmLikelihoods

Neural density estimators (NDEs) for the cosmic 21-cm power spectrum likelihoods.

Classical Bayesian inference with the 21-cm power spectrum (PS) commonly assumes the following:

the likelihood shape is Gaussian,
the covariance matrix is usually fixed and pre-calculated at some fiducial parameter values,
often only the diagonal of the covariance is used, ignoring off-diagonal correlations,
the Gaussian mean at each point in parameter space is estimated from only one realization.

These assumptions are made mostly to reduce computational cost, and they can have a significant impact on the final posterior.

To bypass all of these, we use Simulation-Based Inference (SBI). It can be summarized in two main steps:

draw a parameter sample from some distribution (possibly the prior) - $\tilde{\boldsymbol{\theta}} \sim \pi(\boldsymbol{\theta})$,
draw a data sample using a realistic data simulator - $\tilde{\boldsymbol{d}} \sim \mathcal{L}(\boldsymbol{d} | \tilde{\boldsymbol{\theta}})$,
Repeat many times.
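
The sampling loop above can be sketched in a few lines of NumPy. The simulator here (`toy_simulator`) is a hypothetical stand-in, not part of this package; a real application would call a 21-cm simulator instead. The uniform prior and the 5-bin data vector are likewise assumptions chosen only for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)


def toy_simulator(theta, rng):
    """Hypothetical stand-in for a realistic 21-cm PS simulator.

    Returns a noisy 5-bin mock 'power spectrum' whose mean depends on theta.
    """
    k = np.linspace(0.1, 1.0, 5)
    mean = theta[0] * k + theta[1]
    return mean + 0.1 * rng.standard_normal(k.size)


n_samples = 1000
# Step 1: draw parameter samples from a uniform prior on [0, 1]^2.
param_samples = rng.uniform(0.0, 1.0, size=(n_samples, 2))
# Step 2: draw one data realization for each parameter sample.
data_samples = np.stack([toy_simulator(t, rng) for t in param_samples])
```

The resulting arrays have one row per (parameter, data sample) pair, which is the layout assumed in the rest of these sketches.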

A database of (parameter, data sample) pairs follows the joint distribution $P(\boldsymbol{d}, \boldsymbol{\theta}) = \mathcal{L}(\boldsymbol{d} | \boldsymbol{\theta}) \cdot \pi(\boldsymbol{\theta})$. By training a NN-parameterized likelihood NDE $\mathcal{L}_{\text{NN}}(\boldsymbol{d} | \boldsymbol{\theta})$ to minimize the KL divergence, we recover a data-driven likelihood estimator. Once it is trained, one can use standard MCMC (or nested sampling) to recover the posterior for a particular observed data set $\boldsymbol{d}_{\text{obs}}$.
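
The training objective can be written out explicitly. This is a sketch that assumes the KL divergence is taken from the true likelihood to the NN likelihood and averaged over $\pi(\boldsymbol{\theta})$; $N$, the number of simulated pairs, is notation introduced here:

$$\mathbb{E}_{\pi(\boldsymbol{\theta})}\left[ D_{\text{KL}}\left(\mathcal{L} \parallel \mathcal{L}_{\text{NN}}\right) \right] = \mathbb{E}_{P(\boldsymbol{d}, \boldsymbol{\theta})}\left[ \ln \mathcal{L}(\boldsymbol{d} | \boldsymbol{\theta}) - \ln \mathcal{L}_{\text{NN}}(\boldsymbol{d} | \boldsymbol{\theta}) \right] \approx \text{const} - \frac{1}{N} \sum_{n=1}^{N} \ln \mathcal{L}_{\text{NN}}(\tilde{\boldsymbol{d}}_n | \tilde{\boldsymbol{\theta}}_n) .$$

The first term does not depend on the network, so under this assumption minimizing the KL divergence amounts to maximizing the mean log-likelihood of the simulated pairs.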

See the examples and the article for more details.

Implemented likelihoods

We implement three main likelihood categories, by relaxing classical inference constraints.

Mean constraint

To estimate the mean better, a feed-forward NN takes the parameters $\boldsymbol{\theta}$ and outputs the mean: $\boldsymbol{\mu}_{\text{NN}}(\boldsymbol{\theta}) = \text{NN}(\boldsymbol{\theta})$.

The possible Gaussian likelihoods are then:

$$\mathcal{L}_{\text{NN}}(\boldsymbol{d} | \boldsymbol{\theta}) = \mathcal{N}(\boldsymbol{d}| \boldsymbol{\mu}_{\text{NN}}(\boldsymbol{\theta}), \boldsymbol{\sigma}^2(\boldsymbol{\theta}_{\text{fid}})) , $$

$$\mathcal{L}_{\text{NN}}(\boldsymbol{d} | \boldsymbol{\theta}) = \mathcal{N}(\boldsymbol{d}| \boldsymbol{\mu}_{\text{NN}}(\boldsymbol{\theta}), \Sigma(\boldsymbol{\theta}_{\text{fid}})) . $$

Here $\boldsymbol{\sigma}^2(\boldsymbol{\theta}_{\text{fid}})$ and $\Sigma(\boldsymbol{\theta}_{\text{fid}})$ represent the variance and covariance estimated at the fiducial parameter values.

In code, one can create such likelihoods as:

import numpy as np
from py21cmlikelihoods import ConditionalGaussian

fiducial_covariance = np.load("cov.npy")

NDE = ConditionalGaussian(
    n_parameters = 2, 
    n_data = 5, 
    covariance = fiducial_covariance,
)

where fiducial_covariance can be 1D or 2D, depending on whether a diagonal or a full covariance is needed.
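
To make the 1D versus 2D distinction concrete, here is a NumPy sketch that evaluates a Gaussian log-likelihood with either a vector of variances or a full covariance matrix. The function `gaussian_log_likelihood` is illustrative only and is not part of py21cmlikelihoods; how the package evaluates the density internally is not shown here.

```python
import numpy as np


def gaussian_log_likelihood(d, mu, covariance):
    """Log of N(d | mu, covariance) for a 1D (variances) or 2D (full) covariance."""
    d, mu, covariance = (np.asarray(x, dtype=float) for x in (d, mu, covariance))
    r = d - mu
    n = r.size
    if covariance.ndim == 1:
        # Diagonal case: independent Gaussians in every data bin.
        log_det = np.sum(np.log(covariance))
        chi2 = np.sum(r**2 / covariance)
    else:
        # Full case: a Cholesky factor gives both the determinant and chi^2.
        chol = np.linalg.cholesky(covariance)
        z = np.linalg.solve(chol, r)
        log_det = 2.0 * np.sum(np.log(np.diag(chol)))
        chi2 = z @ z
    return -0.5 * (chi2 + log_det + n * np.log(2.0 * np.pi))
```

Passing a vector of variances and passing the equivalent diagonal matrix give the same value; the 1D form simply avoids the matrix factorization.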

Covariance constraint

Likewise, we can also estimate the (co)variance matrix with a NN. In this scenario, the network can output one of the following:

$$\boldsymbol{\mu}_{\text{NN}}(\boldsymbol{\theta}), \boldsymbol{\sigma}^2_{\text{NN}}(\boldsymbol{\theta}) = \text{NN}(\boldsymbol{\theta}) ,$$

$$\boldsymbol{\mu}_{\text{NN}}(\boldsymbol{\theta}), \Sigma_{\text{NN}}(\boldsymbol{\theta}) = \text{NN}(\boldsymbol{\theta}) , $$

with their respective likelihoods:

$$\mathcal{L}_{\text{NN}}(\boldsymbol{d} | \boldsymbol{\theta}) = \mathcal{N}(\boldsymbol{d}| \boldsymbol{\mu}_{\text{NN}}(\boldsymbol{\theta}), \boldsymbol{\sigma}^2_{\text{NN}}(\boldsymbol{\theta})) ,$$

$$\mathcal{L}_{\text{NN}}(\boldsymbol{d} | \boldsymbol{\theta}) = \mathcal{N}(\boldsymbol{d}| \boldsymbol{\mu}_{\text{NN}}(\boldsymbol{\theta}), \Sigma_{\text{NN}}(\boldsymbol{\theta})) .$$

In code:

NDE_diagonal = ConditionalGaussian(
    n_parameters = 2, 
    n_data = 5, 
    diagonal_covariance = True,
)

NDE_full = ConditionalGaussian(
    n_parameters = 2, 
    n_data = 5, 
    diagonal_covariance = False,
)
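
A network output is unconstrained, while a covariance matrix must be symmetric and positive-definite. One common way to bridge the two is a Cholesky parameterization, sketched below. This is an assumption for illustration, not a confirmed description of what py21cmlikelihoods does internally, and `covariance_from_raw` is a hypothetical helper.

```python
import numpy as np


def covariance_from_raw(raw, n_data):
    """Map n_data * (n_data + 1) / 2 unconstrained numbers to an SPD matrix."""
    raw = np.asarray(raw, dtype=float)
    chol = np.zeros((n_data, n_data))
    # Fill the lower triangle (diagonal included) with the raw outputs.
    chol[np.tril_indices(n_data)] = raw
    # Exponentiate the diagonal so that it is strictly positive.
    diag = np.diag_indices(n_data)
    chol[diag] = np.exp(chol[diag])
    # Sigma = L L^T is symmetric positive-definite by construction.
    return chol @ chol.T


n_data = 5
raw_output = np.linspace(-1.0, 1.0, n_data * (n_data + 1) // 2)
sigma = covariance_from_raw(raw_output, n_data)
```

For the diagonal case the same idea reduces to exponentiating (or applying a softplus to) `n_data` raw outputs to obtain positive variances.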

Gaussian constraint

Finally, we can relax the Gaussian constraint as well. This can be done in a parametric way by using Gaussian mixture networks, or in a non-parametric way with Conditional Masked Autoregressive Flows (CMAF).

Gaussian mixture network

The setup here is exactly the same as in the previous cases, except that the NN outputs a Gaussian mixture:

$$\boldsymbol{\mu}_{\text{NN}, 1}(\boldsymbol{\theta}), \Sigma_{\text{NN}, 1}(\boldsymbol{\theta}), \phi_1(\boldsymbol{\theta}), \ldots, \boldsymbol{\mu}_{\text{NN}, K}(\boldsymbol{\theta}), \Sigma_{\text{NN}, K}(\boldsymbol{\theta}), \phi_K(\boldsymbol{\theta}) = \text{NN}(\boldsymbol{\theta}) ,$$

where $\boldsymbol{\mu}_{\text{NN}, i}(\boldsymbol{\theta}), \Sigma_{\text{NN}, i}(\boldsymbol{\theta})$ describe the mean and covariance of the $i$-th Gaussian and $\phi_i(\boldsymbol{\theta})$ is its relative (positive) weight, with $\sum_i \phi_i(\boldsymbol{\theta}) = 1$. The full likelihood can therefore be written as:

$$\mathcal{L}_{\text{NN}}(\boldsymbol{d} | \boldsymbol{\theta}) = \sum_{i=1}^K \phi_i(\boldsymbol{\theta}) \cdot \mathcal{N}(\boldsymbol{d}| \boldsymbol{\mu}_{\text{NN}, i}(\boldsymbol{\theta}), \Sigma_{\text{NN}, i}(\boldsymbol{\theta})) .$$

In code:

from py21cmlikelihoods import ConditionalGaussianMixture

NDE = ConditionalGaussianMixture(
    n_parameters = 2, 
    n_data = 5, 
    n_components = 3,
)
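
For reference, the mixture likelihood above can be evaluated numerically as follows. This is a NumPy sketch with an illustrative function name (`mixture_log_likelihood`), not the package's implementation; it works in log space with the log-sum-exp trick so that components with very small densities do not underflow.

```python
import numpy as np


def mixture_log_likelihood(d, weights, means, covariances):
    """log sum_i phi_i N(d | mu_i, Sigma_i), evaluated with log-sum-exp."""
    d = np.asarray(d, dtype=float)
    log_terms = []
    for phi, mu, cov in zip(weights, means, covariances):
        r = d - np.asarray(mu, dtype=float)
        cov = np.asarray(cov, dtype=float)
        _, log_det = np.linalg.slogdet(cov)
        chi2 = r @ np.linalg.solve(cov, r)
        log_gauss = -0.5 * (chi2 + log_det + r.size * np.log(2.0 * np.pi))
        log_terms.append(np.log(phi) + log_gauss)
    log_terms = np.array(log_terms)
    # Subtract the maximum before exponentiating for numerical stability.
    m = log_terms.max()
    return m + np.log(np.sum(np.exp(log_terms - m)))
```

With a single component of weight 1 this reduces to the ordinary Gaussian log-likelihood.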

Conditional Masked Autoregressive Flows

CMAF is a non-parametric density estimator that is highly expressive in the shape of the final distribution. A minimal example is the following:

from py21cmlikelihoods import ConditionalMaskedAutoregressiveFlow

NDE = ConditionalMaskedAutoregressiveFlow(
    n_dim = 5,
    cond_n_dim = 2,
)
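
To give some intuition for what such a flow computes, here is a toy NumPy sketch of a single conditional affine autoregressive layer. It is a deliberately simplified illustration, not the package's architecture: the conditioner is linear with random (untrained) weights, the autoregressive structure is imposed with a strictly lower-triangular mask, and all names below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
n_dim, cond_n_dim = 5, 2

# Strictly lower-triangular mask: output i only sees data dimensions < i.
mask = np.tril(np.ones((n_dim, n_dim)), k=-1)
# Random, untrained weights for the shift (mu) and log-scale (alpha).
W_mu, W_alpha = 0.1 * rng.normal(size=(2, n_dim, n_dim))
V_mu, V_alpha = 0.1 * rng.normal(size=(2, n_dim, cond_n_dim))


def maf_log_prob(d, theta):
    """log p(d | theta) under one affine autoregressive layer."""
    mu = (W_mu * mask) @ d + V_mu @ theta
    alpha = (W_alpha * mask) @ d + V_alpha @ theta
    # Map the data to the base (standard normal) space.
    u = (d - mu) * np.exp(-alpha)
    log_base = -0.5 * np.sum(u**2 + np.log(2.0 * np.pi))
    # The Jacobian is triangular, so log|det| = -sum(alpha).
    return log_base - np.sum(alpha)
```

Stacking several such layers, with nonlinear masked networks as conditioners, is what gives a real flow its flexibility.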

Training NDE likelihood

To train an NDE, simply format the training set and call the training function.

import numpy as np
from py21cmlikelihoods.utils import prepare_dataset

# Simulated data samples and the parameter samples that produced them.
data_samples = np.load("data.npy")
param_samples = np.load("params.npy")
batch_size = 100
training_set = prepare_dataset(NDE, data_samples, param_samples, batch_size)

# Train the NDE on the prepared dataset.
NDE.train(
    epochs = 100,
    dataset = training_set,
)
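
Once a likelihood is trained, the posterior for an observation can be sampled with standard MCMC. The sketch below is a plain Metropolis sampler in NumPy. The `log_likelihood` function is a hypothetical stand-in (a unit-variance Gaussian), because the call that evaluates the trained NDE is not shown in this README; the flat prior and step size are also assumptions.

```python
import numpy as np


def log_likelihood(d_obs, theta):
    """Hypothetical stand-in for a trained NDE: unit-variance Gaussian, mean theta."""
    return -0.5 * np.sum((d_obs - theta) ** 2)


def log_prior(theta):
    """Flat prior on [-5, 5] in every parameter."""
    return 0.0 if np.all(np.abs(theta) <= 5.0) else -np.inf


def metropolis(d_obs, theta0, n_steps=5000, step=0.5, seed=0):
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float)
    log_post = log_prior(theta) + log_likelihood(d_obs, theta)
    chain = np.empty((n_steps, theta.size))
    for i in range(n_steps):
        proposal = theta + step * rng.standard_normal(theta.size)
        log_prior_new = log_prior(proposal)
        if np.isfinite(log_prior_new):
            log_post_new = log_prior_new + log_likelihood(d_obs, proposal)
            # Accept with probability min(1, posterior ratio).
            if np.log(rng.uniform()) < log_post_new - log_post:
                theta, log_post = proposal, log_post_new
        chain[i] = theta
    return chain


d_obs = np.array([1.0, -0.5])
chain = metropolis(d_obs, theta0=np.zeros(2))
```

In practice a dedicated sampler (an ensemble MCMC or a nested sampler) would replace this loop; only the log-likelihood call changes.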

Installation

To install and use the code, clone the repository and run

pip install -e .

For the full setup needed to run the examples, see the conda environment.yml and install it as

conda env create -f environment.yml
conda activate 21cmLikelihoods
pip install -e .

Acknowledging

If you use the code in your research, please cite the original paper:

@ARTICLE{Prelogovic2023,
       author = {{Prelogovi{\'c}}, David and {Mesinger}, Andrei},
        title = "{Exploring the likelihood of the 21-cm power spectrum with simulation-based inference}",
      journal = {\mnras},
     keywords = {cosmology: theory, dark ages, reionization, first stars, methods: data analysis, methods: statistical, Astrophysics - Cosmology and Nongalactic Astrophysics, Astrophysics - Astrophysics of Galaxies},
         year = 2023,
        month = jul,
          doi = {10.1093/mnras/stad2027},
archivePrefix = {arXiv},
       eprint = {2305.03074},
 primaryClass = {astro-ph.CO},
       adsurl = {https://ui.adsabs.harvard.edu/abs/2023MNRAS.tmp.1955P},
      adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}
