I am a Research Fellow in the
Center for Computational Mathematics at the Flatiron Institute in New York, where I work at the interface between (astro)physics and data science.
I develop statistical methods for astrophysics, cosmology, and beyond using signal processing and machine learning. I tackle various problems including generative modeling, inference, denoising, and source separation.
These problems naturally emerged from my applied research in modeling interstellar dust emission, analyzing cosmic microwave background data, and studying galaxy clustering (as part of the SimBIG collaboration).
Lately, I have been particularly focused on deep generative models and their application to scientific endeavors. I am also actively involved in the Polymathic AI initiative, which aims to leverage these models for the development of foundation models for science.
Before my fellowship, I earned a Ph.D. in Astrophysics in 2021 from the École Normale Supérieure, Paris. Prior to that, I graduated from the Ecole Polytechnique (X2014) and obtained a Master's degree in Astrophysics from the Observatoire de Paris. Download my CV.
*Bruno is my first name, and Régaldo-Saint Blancard is my last name. You can shorten my last name to Régaldo, but please don't shorten it to Blancard!
Research Highlights
Bayesian Blind Denoising with Gibbs Diffusion
February 2024
Blind denoising problems are not exclusive to natural image processing; they are also prevalent in many scientific applications where the noise distribution is unknown or hard to model. In our new preprint, we introduce GDiff, a novel solution to blind denoising in a fully Bayesian context. By combining Gibbs sampling and a diffusion model, we build a rigorous method to sample the posterior distribution of the signal and the noise parameters for any kind of diffusion-based signal prior!
We show that GDiff is directly relevant to the analysis of cosmic microwave background (CMB) data, by taking an original view on the problem of separating the CMB from its foregrounds. Have you ever thought of the CMB as the noise of a blind denoising problem, and the foregrounds as the signal? From that perspective, we show that GDiff can directly separate dust and CMB while solving cosmological inference at the same time! Stay tuned for future applications to observational data!
Update 05/24: Accepted at ICML 2024!
Removing Dust from CMB Observations with Diffusion Models
October 2023
Diffusion models have revolutionized the modeling of natural images. Can they also help us to analyze cosmic microwave background (CMB) data? Thanks to my talented intern David Heurtel-Depeiges, and the collaboration of Blaskeley Burkhart and Ruben Ohana, we make a first demonstration of the potential of diffusion models for the separation of Galactic dust and CMB. We show that dust+CMB observations can be seen as the result of a diffusion process that can be reversed in time, thus naturally solving source separation.
We are already working on the next step: a diffusion-based approach for cosmological inference. Stay tuned!
Update 11/23: Spotlight talk at ML4PS NeurIPS 2023 Workshop!
Stacking for Simulation-Based Inference
October 2023
With simulation-based inference, it is typical to end up with a multitude of models/approximations of the same target posterior distribution. This usually results from the investigation of different inference algorithms, different architectures, or can simply be due to the randomness of initialization and stochastic gradients. While most practitioners usually choose to select the best of their models, with Yuling Yao and Justin Domke, we show that there is much better to do, and it's called stacking. We show that models can all be combined at once in a systematic way to improve precision, calibration, coverage, and bias at the same time. Check out our new preprint on Simulation-Based Stacking!
Update 01/24: Accepted at AISTATS 2024!
SimBIG Collaboration: Second Wave of Papers
October 2023
We are taking simulation-based inference for the analysis of galaxy clustering to the next level with our second release of papers! We now explore galaxy clustering data through the lenses of the wavelet scattering transform, convolutional networks, and bispectrum statistics. For each of these, we get new cosmological constraints leveraging non-linear information from the data. Check out our new website for more information!
With Michael Eickenberg, we led the wavelet scattering transform (WST) analysis. The WST statistics capture a wealth of non-Gaussian information from the data improving constraints on cosmological parameters. However, we show in our paper that these statistics might be too rich as they can also capture unrealistic specifics of the forward models, raising model misspecifications issues when applied to observational data. Our next challenge will be to address this in detail!
Update 02/24: Accepted in PRD!
Polymathic AI and Multiple Physics Pretraining
October 2023
I am lucky to be part of the amazing Polymathic AI initiative which aims to create a foundation model for advancing scientific discovery. We recently released a series of paper, check out our blog to find out about it!
In particular, in a project led by Michael McCabe, we introduce “Multiple Physics Pretraining”, an autoregressive task-agnostic pretraining approach for physical surrogate modeling. In this paper, we notably show that a single transformer model trained on a broad range of physical tasks can perform better than task-specific models on a variety of downstream applications.
Statistical Component Separation for Targeted Signal Recovery in Noisy Mixtures
June 2023
SimBIG: Simulation-Based Inference of Galaxies
November 2022
Glad to announce the release of the two first papers of the SimBIG collaboration (led by ChangHoon Hahn): letter, mock challenge. The SimBIG framework enables the analysis of cosmological information from galaxy surveys on small nonlinear scales using simulation-based inference. It relies on the SimBIG forward model, which connects the cosmological parameters to realistic mock galaxy surveys. Take a look at how this model compares to BOSS data!
Update 10/23: Published in PNAS and JCAP!
Generative Models of Multi-frequency Dust Emission Maps
August 2022
Check out our recent paper, where we use the Wavelet Phase Harmonic statistics to build generative models of multi-frequency dust emission maps from a single example. Want to try this on your own data? Take a look at the code associated with the paper.
Update 01/23: Published in the Astrophysical Journal!
Wavelet Moments for Cosmological Parameter Estimation
April 2022
I was recently involved in Eickenberg et al. paper, which introduced a new set of wavelet statistics, called "Wavelet Moments", to extract non-Gaussian information from 3D cosmological fields. Fisher forecasts based on the Quijote simulations show that these statistics improve constraints on the cosmological parameters by a factor 5 to 10 with respect to the power spectrum baseline.
Ph.D. Thesis: Statistical Modeling of the Polarized Emission of Interstellar Dust
November 2021
I conducted my Ph.D. research at the LPENS, École Normale Supérieure, Paris, under the supervision of François Levrier and François Boulanger. My work was motivated by challenges in analyzing cosmic microwave background (CMB) data. I focused on the statistical modeling of one of the CMB foregrounds, namely the emission of interstellar dust. These foregrounds constitute major obstacles for the next generation of CMB experiments. I developed data-driven models using the wavelet scattering transform — a technique closely related to the mathematics of convolutional neural networks. You can learn more about this in my Ph.D. thesis.
Selected Papers
D. Heurtel-Depeiges, C. C. Margossian, R. Ohana & B. Régaldo-Saint Blancard; Listening to the Noise: Blind Denoising with Gibbs Diffusion; ICML (2024).
ArXivProceedings
D. Heurtel-Depeiges, B. Burkhart, R. Ohana & B. Régaldo-Saint Blancard; Removing Dust from CMB Observations with Diffusion Models; ML4PS Workshop at NeurIPS - Spotlight (2023).
ArXiv
Y. Yao, B. Régaldo-Saint Blancard & J. Domke; Simulation Based Stacking; AISTATS (2024).
ArXivProceedings
B. Régaldo-Saint Blancard, C. Hahn, S. Ho, J. Hou, P. Lemos, E. Massara, C. Modi, A. Moradinezhad Dizgah, L. Parker, Y. Yao & M. Eickenberg; SimBIG: Galaxy Clustering Analysis with the Wavelet Scattering Transform; Physical Review D (2024).
ArXivDOI
C. Hahn, P. Lemos, L. Parker, B. Régaldo-Saint Blancard, M. Eickenberg, S. Ho, J. Hou, E. Massara, C. Modi, A. Moradinezhad Dizgah & D. Spergel; Cosmological constraints from non-Gaussian and nonlinear galaxy clustering using the SimBIG inference framework; Nature Astronomy (2024)
ArXivDOI
M. McCabe, B. Régaldo-Saint Blancard, L. Holden Parker, R. Ohana, M. Cranmer, A. Bietti, M. Eickenberg, S. Golkar, G. Krawezik, F. Lanusse, M. Pettee, T. Tesileanu, K. Cho & S. Ho; Multiple Physics Pretraining for Physical Surrogate Models; NeurIPS (2024) - also Best Paper Award @ AI4Science Workshop NeurIPS 2023
ArXiv
B. Régaldo-Saint Blancard & M. Eickenberg; Statistical Component Separation for Targeted Signal Recovery in Noisy Mixtures; Transactions on Machine Learning Research (2024).
ArXiv
C. Hahn, M. Eickenberg, S. Ho, J. Hou, P. Lemos, E. Massara, C. Modi, A. Moradinezhad Dizgah, B. Régaldo-Saint Blancard & M. Abidi; SimBIG: A Forward Modeling Approach To Analyzing Galaxy Clustering; Proceedings on National Academy of Sciences (2023).
ArXivDOI
B. Régaldo-Saint Blancard, E. Allys, C. Auclair, F. Boulanger, M. Eickenberg, F. Levrier, L. Vacher & S. Zhang; Generative Models of Multi-channel Data from a Single Example - Application to Dust Emission; The Astrophysical Journal (2023).
ArXivDOI
N. Jeffrey, F. Boulanger, B. D. Wandelt, B. Regaldo-Saint Blancard, E. Allys & F. Levrier; Single frequency CMB B-mode inference with realistic foregrounds from a single training image; Monthly Notices of the Royal Astronomical Society: Letters (2021).
ArXivDOI
B. Regaldo-Saint Blancard, E. Allys, F. Boulanger, F. Levrier & N. Jeffrey; A new approach for the statistical denoising of Planck interstellar dust polarization data; Astronomy & Astrophysics: Letters (2021).
ArXivDOI
B. Regaldo-Saint Blancard, F. Levrier, E. Allys, E. Bellomi & F. Boulanger; Statistical description of dust polarized emission from the diffuse interstellar medium - A RWST approach; Astronomy & Astrophysics (2020).
ArXivDOI
I am committed to promoting open-source practices to facilitate the reproducibility of my research. You can find all the open-source projects I have been involved in on my GitHub page. I have also developed a few Python packages that you might find useful for your research:
Python package for GPU-accelerated computations of Wavelet Scattering Statistics for 3D fields and Galaxy Surveys.
Teaching
2018 - 2021: Teaching assistant at the École Normale Supérieure, Paris, for the course "Numerical methods for differential equations in Physics" (Master's level, faculty: L. Tuckerman). Exercises.
2019 - 2021: Lecturer at the École Normale Supérieure, Paris, for the course "Physique pour Tous" ("Physics for All") intended for a broad non-scientific audience.
2014 - 2015: Educational coordinator for homework assistance program at Association Le Rocher (primary and secondary levels).
Talks
I try to keep track of some of my past talks in my CV. Fun fact, I gave a TEDx talk during my Ph.D. on the topic "Un Univers sans limite ?" (in French, at Pôle Universitaire Léonard de Vinci, Paris-La Défense). You can watch it here!
Contact
Address
Flatiron Institute
CCM, 308
162 Fifth Avenue
New York, NY 10010
United States