Improving the discovery of near-Earth objects with machine-learning methods
Journal:
arXiv
Published Date:
May 17, 2025
Abstract
We present a comprehensive analysis of the digest2 parameters for candidates
of the Near-Earth Object Confirmation Page (NEOCP) that were reported between
2019 and 2024. Our study proposes methods for significantly reducing the
inclusion of non-NEO objects on the NEOCP. Despite the substantial increase in
near-Earth object (NEO) discoveries in recent years, only about half of the
NEOCP candidates are ultimately confirmed as NEOs. Therefore, much observing
time is spent following up on non-NEOs. Furthermore, approximately 11% of the
candidates remain unconfirmed because the follow-up observations are
insufficient. These are nearly 600 cases per year. To reduce false positives
and minimize wasted resources on non-NEOs, we refine the posting criteria for
NEOCP based on a detailed analysis of all digest2 scores. We investigated 30
distinct digest2 parameter categories for candidates that were confirmed as
NEOs and non-NEOs. From this analysis, we derived a filtering mechanism based
on selected digest2 parameters that were able to exclude 20% of the non-NEOs
from the NEOCP while maintaining a minimal loss of true NEOs. We also
investigated the application of four machine-learning (ML) techniques, that is,
the gradient-boosting machine (GBM), the random forest (RF) classifier, the
stochastic gradient descent (SGD) classifier, and neural networks (NN) to
classify NEOCP candidates as NEOs or non-NEOs. Based on digest2 parameters as
input, our ML models achieved a precision of approximately 95% in
distinguishing between NEOs and non-NEOs. Results. Combining the digest2
parameter filter with an ML-based classification model, we demonstrate a
significant reduction in non-NEOs on the NEOCP that exceeds 80%, while limiting
the loss of NEO discovery tracklets to 5.5%. Importantly, we show that most
follow-up tracklets of initially misclassified NEOs are later correctly
identified as NEOs.