论文标题
评估行人横断预测指标的跨围场概括
Assessing Cross-dataset Generalization of Pedestrian Crossing Predictors
论文作者
论文摘要
行人交叉预测一直是积极研究的话题,导致许多新的算法解决方案。尽管随着时间的推移测量这些解决方案的总体进展往往越来越建立,这是由于新的公开基准和标准化的评估程序,但知道现有的预测因素对看不见的数据的反应如何仍然是一个未解决的问题。这种评估是必须进行的,因为应设置可维修的交叉行为预测因子在各种情况下工作,而不会因错误预测而损害行人安全。为此,我们根据直接的跨数据库评估进行了一项研究。我们的实验表明,当前最新的行人行为预测在跨数据库评估方案中概括的情况下,无论他们在直接训练测试设置设置中的稳健性如何。根据我们的观察,我们认为行人交叉预测的未来,例如可靠且可推广的实现不应与量身定制模型有关,该模型的可用数据很少,并且在经典的火车测试场景中进行了测试,并希望推断出其在现实生活中的行为。它应该是在跨数据集中评估模型,同时考虑其在域移位下的不确定性估计。
Pedestrian crossing prediction has been a topic of active research, resulting in many new algorithmic solutions. While measuring the overall progress of those solutions over time tends to be more and more established due to the new publicly available benchmark and standardized evaluation procedures, knowing how well existing predictors react to unseen data remains an unanswered question. This evaluation is imperative as serviceable crossing behavior predictors should be set to work in various scenarii without compromising pedestrian safety due to misprediction. To this end, we conduct a study based on direct cross-dataset evaluation. Our experiments show that current state-of-the-art pedestrian behavior predictors generalize poorly in cross-dataset evaluation scenarii, regardless of their robustness during a direct training-test set evaluation setting. In the light of what we observe, we argue that the future of pedestrian crossing prediction, e.g. reliable and generalizable implementations, should not be about tailoring models, trained with very little available data, and tested in a classical train-test scenario with the will to infer anything about their behavior in real life. It should be about evaluating models in a cross-dataset setting while considering their uncertainty estimates under domain shift.