Machine learning isotope shifts in molecular energy levels

April 17, 20262604.16073

Marco G. Barnfield, Oleg L. Polyansky, Sergei N. Yurchenko, Jonathan Tennyson

astro-ph.EPastro-ph.IMphysics.chem-phphysics.comp-ph

TLDR

This paper uses machine learning to improve predictions of isotope shifts in molecular energy levels, crucial for exoplanet atmosphere analysis.

Key contributions

Developed ML framework to correct errors in isotopologue extrapolation (IE) for molecular energy levels.
A neural network reduced CO2 energy prediction errors by over 87% compared to original IE method.
Novel transfer learning approach improved CO energy predictions by over 93% using CO2 data.
Establishes a scalable, data-driven method to refine molecular line lists for exoplanet studies.

Why it matters

Accurate molecular line lists for minor isotopologues are vital for diagnosing exoplanet atmospheres but suffer from scarce experimental data. This work offers a scalable, data-driven machine learning solution to significantly improve these predictions, bridging the gap between theoretical calculations and experimental precision.

Original Abstract

Recent advances in the use of High-Resolution Cross-Correlation Spectroscopy (HRCCS) to detect molecular species in exoplanet atmospheres, presents a new challenge for the accuracy of reference spectroscopic line lists. While parent isotopologues of key atmospheric tracers are often well-characterized, minor isotopologues, crucial for diagnosing planetary formation histories and evolution, suffer from a scarcity of experimental data, often leading to reliance on less accurate theoretical predictions. In this work, a comprehensive machine learning framework is designed to mitigate these inaccuracies by modelling the residual errors of the isotopologue extrapolation (IE) method used within the ExoMol project. A fully connected neural network architecture for carbon dioxide (CO$_2$) is shown to predict energy corrections with high fidelity, reducing the mean absolute error (MAE) relative to the original IE approach for more than 87\% of the levels when benchmarked against empirical (\Marvel) energies. Furthermore, development of a novel hybrid, molecule-aware transfer learning architecture is presented that successfully propagates correction patterns from the data-rich CO$_2$ system to the data-poor carbon monoxide (CO) system. This transfer learning approach yields MAE improvements in over 93\% of CO samples, demonstrating that physical correction factors related to isotopic substitution can be generalized across chemically related molecular systems. Updated and improved line lists are presented for 11 CO$_2$ isotopologues and energy levels for excited states of CO isotopologues are predicted. The methodology establishes a scalable, data-driven paradigm for refining molecular line lists, helping to bridge the gap between theoretical calculations and experimental precision.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers