Cross-Modal Attention for Accurate Pedestrian ...
Document type :
Autre communication scientifique (congrès sans actes - poster - séminaire...): Communication dans un congrès avec actes
Title :
Cross-Modal Attention for Accurate Pedestrian Trajectory Prediction
Author(s) :
Zaier, Mayssa [Auteur]
Centre for Digital Systems [CERI SN - IMT Nord Europe]
Wannous, Hazem [Auteur]
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Centre for Digital Systems [CERI SN - IMT Nord Europe]
Drira, Hassen [Auteur]
Laboratoire des sciences de l'ingénieur, de l'informatique et de l'imagerie [ICube]
Boonaert, Jacques [Auteur]
Centre for Digital Systems [CERI SN - IMT Nord Europe]
Centre for Digital Systems [CERI SN - IMT Nord Europe]
Wannous, Hazem [Auteur]
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Centre for Digital Systems [CERI SN - IMT Nord Europe]
Drira, Hassen [Auteur]
Laboratoire des sciences de l'ingénieur, de l'informatique et de l'imagerie [ICube]
Boonaert, Jacques [Auteur]
Centre for Digital Systems [CERI SN - IMT Nord Europe]
Conference title :
34th British Machine Vision Conference 2023
City :
Aberdeen
Country :
Royaume-Uni
Start date of the conference :
2023-11-20
HAL domain(s) :
Informatique [cs]
English abstract : [en]
Accurately predicting human behavior is essential for a variety of applications, including self-driving cars, surveillance systems, and social robots. However, predicting human movement is challenging due to the complexity ...
Show more >Accurately predicting human behavior is essential for a variety of applications, including self-driving cars, surveillance systems, and social robots. However, predicting human movement is challenging due to the complexity of physical environments and social interactions. Most studies focus on static environmental information, while ignoring the dynamic visual information available in the scene. To address this issue, we propose a novel approach called Cross-Modal Attention Trajectory Prediction (CMATP) able to predict human paths based on observed trajectory and dynamic scene context. Our approach uses a bimodal transformer network to capture complex spatio-temporal interactions and incorporates both pedestrian trajectory data and contextual information. Our approach achieves state-of-the-art performance on three real-world pedestrian prediction datasets, making it a promising solution for improving the safety and reliability of pedestrian detection and tracking systems. The code to reproduce our results is available at this link.Show less >
Show more >Accurately predicting human behavior is essential for a variety of applications, including self-driving cars, surveillance systems, and social robots. However, predicting human movement is challenging due to the complexity of physical environments and social interactions. Most studies focus on static environmental information, while ignoring the dynamic visual information available in the scene. To address this issue, we propose a novel approach called Cross-Modal Attention Trajectory Prediction (CMATP) able to predict human paths based on observed trajectory and dynamic scene context. Our approach uses a bimodal transformer network to capture complex spatio-temporal interactions and incorporates both pedestrian trajectory data and contextual information. Our approach achieves state-of-the-art performance on three real-world pedestrian prediction datasets, making it a promising solution for improving the safety and reliability of pedestrian detection and tracking systems. The code to reproduce our results is available at this link.Show less >
Language :
Anglais
Peer reviewed article :
Oui
Audience :
Internationale
Popular science :
Non
Collections :
Source :
Files
- document
- Open access
- Access the document
- BMVC%202023%20Submission_467.pdf
- Open access
- Access the document