论文标题
在低资源场景中进行有效检测的本地化分组实例
Localizing Grouped Instances for Efficient Detection in Low-Resource Scenarios
论文作者
论文摘要
通常评估了最先进的检测系统,以详细检索图像中密集分布的物体,各种各样的外观和语义类别的能力进行评估。正交的,许多真实的对象检测应用程序,例如在遥感中,需要处理仅包含单个类的几个小对象的大图像,它们在整个空间中散布着异质。此外,它们通常受到严格的计算限制,例如电池容量有限和计算能力。为了解决这些更实用的方案,我们提出了一种新型的灵活检测方案,该方案有效地适应了可变的对象大小和密度:我们依靠一系列检测阶段,每个阶段都有能够预测对象和个体组的能力。与检测级联相似,这种多阶段架构通过在检测过程中早期丢弃图像的大区域来避免计算工作。分组对象的能力提供了进一步的计算和内存节省,因为它允许在早期阶段使用较低的图像分辨率,在该阶段,小组比个人更容易被检测到,因为它们更为显着。我们在两个航空图像数据集上报告了实验结果,并表明所提出的方法在三个不同的骨干架构上始终如一地比标准的单杆检测器准确但计算更有效。
State-of-the-art detection systems are generally evaluated on their ability to exhaustively retrieve objects densely distributed in the image, across a wide variety of appearances and semantic categories. Orthogonal to this, many real-life object detection applications, for example in remote sensing, instead require dealing with large images that contain only a few small objects of a single class, scattered heterogeneously across the space. In addition, they are often subject to strict computational constraints, such as limited battery capacity and computing power. To tackle these more practical scenarios, we propose a novel flexible detection scheme that efficiently adapts to variable object sizes and densities: We rely on a sequence of detection stages, each of which has the ability to predict groups of objects as well as individuals. Similar to a detection cascade, this multi-stage architecture spares computational effort by discarding large irrelevant regions of the image early during the detection process. The ability to group objects provides further computational and memory savings, as it allows working with lower image resolutions in early stages, where groups are more easily detected than individuals, as they are more salient. We report experimental results on two aerial image datasets, and show that the proposed method is as accurate yet computationally more efficient than standard single-shot detectors, consistently across three different backbone architectures.