论文标题
评估主人大气检索的监督机器学习
Assessment of Supervised Machine Learning for Atmospheric Retrieval of Exoplanets
论文作者
论文摘要
从光谱观测中对外球星的大气检索需要广泛探索高度退化和高维参数空间,以准确限制大气参数。检索方法通常使用采样算法(例如马尔可夫链蒙特卡洛(MCMC)或嵌套采样)进行贝叶斯参数估计和统计推断。最近,已经尝试使用机器学习算法来补充或替换完全贝叶斯方法。尽管取得了很多进展,但这些方法有时仍无法准确地从当代贝叶斯检索中繁殖结果。我们目前的工作的目的是研究机器学习对大气检索的功效。作为一个案例研究,我们使用随机森林监督机器学习算法,该算法先前已应用,并使用其近红外传输光谱成功地检索了热木星WASP-12B的大气检索。我们使用相同的方法和相同的半分析模型重现了先前的结果,并随后扩展了此方法以开发一种新算法,该算法与完全贝叶斯检索更接近。我们将这种新方法与完全数值的大气模型相结合,并与另一种热木星HD 209458B的传输光谱的贝叶斯检索表现出了极好的一致性。尽管取得了成功,并达到了高计算效率,但我们仍然发现,机器学习方法对高维参数空间的计算效率很高,这些空间通常通过具有适度的计算资源的贝叶斯检索来探索。我们讨论了未来的交易和潜在途径。
Atmospheric retrieval of exoplanets from spectroscopic observations requires an extensive exploration of a highly degenerate and high-dimensional parameter space to accurately constrain atmospheric parameters. Retrieval methods commonly conduct Bayesian parameter estimation and statistical inference using sampling algorithms such as Markov Chain Monte Carlo (MCMC) or Nested Sampling. Recently several attempts have been made to use machine learning algorithms either to complement or replace fully Bayesian methods. While much progress has been made, these approaches are still at times unable to accurately reproduce results from contemporary Bayesian retrievals. The goal of our present work is to investigate the efficacy of machine learning for atmospheric retrieval. As a case study, we use the Random Forest supervised machine learning algorithm which has been applied previously with some success for atmospheric retrieval of the hot Jupiter WASP-12b using its near-infrared transmission spectrum. We reproduce previous results using the same approach and the same semi-analytic models, and subsequently extend this method to develop a new algorithm that results in a closer match to a fully Bayesian retrieval. We combine this new method with a fully numerical atmospheric model and demonstrate excellent agreement with a Bayesian retrieval of the transmission spectrum of another hot Jupiter, HD 209458b. Despite this success, and achieving high computational efficiency, we still find that the machine learning approach is computationally prohibitive for high-dimensional parameter spaces that are routinely explored with Bayesian retrievals with modest computational resources. We discuss the trade offs and potential avenues for the future.