Noise models¶
In socca, the likelihood function to be used during the inference process is fully specified by the noise model associated with the input data. This encodes the assumed statistical properties of the noise affecting the observations and therefore plays a central role in defining how model predictions are compared to data themselves.
As shown in “Getting started”, any noise model should be linked to the input data by passing the noise object to the corresponding socca.data.Image instance:
>>> img = Image(..., noise=noise)
At present, socca provides a limited but representative set of Gaussian noise models, covering the most common use cases encountered in astronomical imaging and map-based analyses. Below is a brief description of the noise models currently implemented in socca.
Note
Several noise models are under active development and testing, and will be added to socca in future releases. These include, among others, Poisson noise models for photon-counting data (e.g., X-ray observations).
Important
If the noise model is not explicitly specified when instantiating a socca.data.Image object, socca will use the Normal noise model by default and compute the per-pixel standard deviation from the median absolute deviation of all pixels in the input data that are not masked out.
NormalRI: radio-interferometric noise¶
The NormalRI noise model implements an image-space approximation to the Fourier-space likelihood for radio-interferometric data. Rather than working directly in the visibility domain, this model uses the dirty beam (i.e., the interferometric response function) to approximate the noise covariance structure in image space. This approach enables efficient likelihood evaluation while accounting for the correlated noise introduced by the interferometric imaging process. The specific implementation is based on the derivations by Powell et al. (2020) and Zhang et al. (2025), to which we refer to for an extensive discussion of the theoretical framework.
Important
When using the NormalRI noise model, the input data should be a naturally-weighted dirty image (i.e., without any deconvolution applied) and should not be corrected for the primary beam (i.e., the antenna pattern) response. If the primary beam is expected to have a significant effect on the signal, a model should be provided as the response attribute of the input socca.data.Image object.
The noise amplitude can be specified using the same keyword arguments as the Normal noise model (see above), but does not support image-like inputs. If no noise amplitude is provided, the per-pixel standard deviation is estimated from the median absolute deviation of the unmasked pixels.
>>> from socca.noise import NormalRI
>>>
>>> # With explicit noise level
>>> noise = NormalRI(sigma=0.1)
>>>
>>> # With automatic noise estimation
>>> noise = NormalRI()
Warning
The NormalRI noise model is designed for map-based analyses of radio-interferometric data, where the dirty beam encodes the sampling of the Fourier plane by the interferometer. It provides a computationally efficient alternative to full visibility-domain fitting, at the cost of an approximate treatment of the noise correlations. Current tests indicate that this approach is accurate enough for sources that are not too extended compared to the maximum recoverable scale in the specific interferometric dataset. For more extended sources, the approximation may break down and a full visibility-domain analysis may be necessary.
Attention
Some tests have exposed some stability issues in the case of priors that allow for sources with characteristic scales much larger than the maximum recoverable scale and the field of view of the interferometric dataset. In such cases, the sampler might explore regions of the parameter space where the model predictions are dominated by large-scale features that are not well constrained by the data, leading to an unstable behavior of the sampling process. This isssue is currently under investigation and will be addressed in future releases of socca. In the meantime, we recommend using more informative priors that restrict the parameter space to regions where the model predictions are reasonably constrained by the data, especially when dealing with sources that are expected to be extended compared to the maximum recoverable scale.