Skip to main content
NC State Home
A one-armed, research robot working in the Intelligent Cognitive Ergonomics Lab.

Pedestrian Behavior Prediction Models

Learn about why automated vehicles excel on highways but struggle in cities

Last Updated: 01/22/2026 | All information is accurate and up-to-date

Background and Objectives

  • In most cases, automated vehicles (AVs) can drive smoothly on highways and freeways.
  • However, AVs still face challenges when it comes to driving in urban settings
    • One key challenge: “Unpredictable” and rapidly changing behaviors of pedestrians and vulnerable road users
    • The existing driving strategy is over-conservative targeting to avoid crashes based on short-term kinematics calculations
  • The main research objective is to better predict pedestrian behaviors with deep-learning algorithms to support driving decision-making during pedestrian encounters in complex road scenes

Two-Tower Ego-Centric Pedestrian Trajectory Prediction

Key Features

  1. Multi-modal inputs;
  2. Two-tower model to decompose egocentric pedestrian trajectories based on ego-vehicle and pedestrian movements;
  3. Inferences of pedestrian future moving directions.
A diagram of the Two-tower Ego-centric Pedestrian Trajectory prediction model structure.
Model Structure
Multiple images of pedestrians crossing roads.
Prediction Results at 1.5 Seconds

Algorithm Results on JAAD Benchmark Dataset

MethodAverage Displacement ErrorFinal Displacement Error
PIE22.8349.44
BiPed21.1348.88
Two-Tower Model17.9241.33

PIE: Rasouli, A., Kotseruba, I., Kunic, T. and Tsotsos, J.K., 2019. Pie: A large-scale dataset and models for pedestrian intention estimation and trajectory prediction. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 6262-6271).

BiPed: Rasouli, A.; Rohani, M.; and Luo, J. 2021. Bifold and Semantic Reasoning for Pedestrian Behavior Prediction. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 15600–15610.

Algorithm Results on PSI Benchmark Dataset

MethodAverage Displacement ErrorFinal Displacement Error
PIE35.3961.50
Two-Tower Model22.3446.63

Dual-View Pedestrian Trajectory Prediction

Key Features

  1. Multi-modal inputs;
  2. Predicting bird’s eye trajectory, ego-centric trajectory, and pedestrian actions simultaneously.
  3. Multi-task learning to improve prediction accuracy.
A diagram of the Dual-View Pedestrian Trajectory Prediction model structure.
Model Structure
Multiple images of pedestrians crossing roads and the model predicting their trajectory.
Prediction Results at 2 Seconds

Algorithm Results on nuScenes Benchmark Dataset

Bitrap: Yao, Y., Atkins, E., Johnson-Roberson, M., Vasudevan, R. and Du, X., 2021. Bitrap: Bi-directional pedestrian trajectory prediction with multi-modal goal estimation. IEEE Robotics and Automation Letters, 6(2), pp.1463-1470.

SGNet: Wang, C., Wang, Y., Xu, M. and Crandall, D.J., 2022. Stepwise goal-driven networks for trajectory prediction. IEEE Robotics and Automation Letters7(2), pp.2716-2723.

Pedestrian Intention Prediction Models

Key Features of VR-GCN

  1. Scene graph with 32 objects;
  2. CNN + GCN + LSTM as the main structure;
  3. Pose information incorporated;
Diagram of the Graph-Convolution-Network-based Pedestrian Intent Prediction Model
Graph-Convolutional-Network-based Pedestrian Intent Prediction (VR-GCN)

Key Features of TrEP

  1. Transformer-based feature extraction and encoding;
  2. Evidential learning for robust performance and calibrated uncertainty.
Diagram of Transformer-based Evidential Learning for Intent Prediction model
Transformer-based Evidential Learning for Intent Prediction (TrEP)

Algorithm Results on nuScenesBenchmark Dataset

MethodAccuracyBalanced AccuracyF1
VR-GCN0.740.610.64
TrEP0.850.770.90

Intention Prediction Demo

Pedestrian Intention and Trajectory Prediction