RowDetr: End-to-End Row Detection Using Polynomials
Journal:
arXiv
Published Date:
Dec 13, 2024
Abstract
Crop row detection is essential for enabling autonomous navigation in
GPS-denied environments, such as under-canopy agricultural settings.
Traditional methods often struggle with occlusions, variable lighting
conditions, and the structural variability of crop rows. To address these
challenges, RowDetr, a novel end-to-end neural network architecture, is
introduced for robust and efficient row detection. A new dataset of
approximately 6,900 images is curated, capturing a diverse range of real-world
agricultural conditions, including occluded rows, uneven terrain, and varying
crop densities. Unlike previous approaches, RowDetr leverages smooth polynomial
functions to precisely delineate crop boundaries in the image space, ensuring a
more structured and interpretable representation of row geometry. A key
innovation of this approach is PolyOptLoss, a novel energy-based loss function
designed to enhance learning robustness, even in the presence of noisy or
imperfect labels. This loss function significantly improves model stability and
generalization by optimizing polynomial curve fitting directly in image space.
Extensive experiments demonstrate that RowDetr significantly outperforms
existing frameworks, including Agronav and RowColAttention, across key
performance metrics. Additionally, RowDetr achieves a sixfold speedup over
Agronav, making it highly suitable for real-time deployment on
resource-constrained edge devices. To facilitate better comparisons across
future studies, lane detection metrics from autonomous driving research are
adapted, providing a more standardized and meaningful evaluation framework for
crop row detection. This work establishes a new benchmark in under-canopy