Generic Prediction Architecture Considering both Rational and Irrational Driving Behaviors

Liting Sun; Masayoshi Tomizuka; Yeping Hu

arxiv: 1907.10170 · v1 · pith:IUSK4DJJnew · submitted 2019-07-23 · 💻 cs.RO · cs.LG· stat.ML

Generic Prediction Architecture Considering both Rational and Irrational Driving Behaviors

Yeping Hu , Liting Sun , Masayoshi Tomizuka This is my paper

Pith reviewed 2026-05-24 17:05 UTC · model grok-4.3

classification 💻 cs.RO cs.LGstat.ML

keywords trajectory predictionautonomous vehicleshybrid prediction modelrational and irrational behaviorslearning-based modelsplanning-based modelsunseen scenariosroundabout scenario

0 comments

The pith

A hybrid architecture combines learning-based and planning-based models to predict both rational and irrational vehicle trajectories.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a prediction system for autonomous vehicles that accounts for the full range of human driving behaviors. By combining models that learn from data with those that assume rational planning, it aims to handle uncertainties in other drivers' actions. This matters because accurate predictions of future paths allow safer trajectory planning, especially when encountering situations not represented in the training data. The approach generates continuous trajectories and maintains consistent performance across different scenarios.

Core claim

The proposed generic prediction architecture leverages the advantages from both learning-based and planning-based prediction models to address various rationalities in human behavior. It is able to predict continuous trajectories that well-reflect possible future situations of other drivers. Moreover, the prediction performance remains stable under various unseen driving scenarios, as demonstrated in a real-world roundabout case study.

What carries the argument

The generic prediction architecture that combines learning-based and planning-based models, allowing a customizable balance between rational and irrational behaviors.

If this is right

Continuous trajectories are generated that reflect possible future situations of other drivers.
Prediction performance remains stable under various unseen driving scenarios.
The balance between rational and irrational behaviors can be customized.
Reasonable results are produced even in the presence of unseen and corner scenarios.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This structure may reduce the need for scenario-specific retraining when deploying autonomous vehicle systems in new areas.
Similar hybrid combinations could apply to prediction tasks involving other agents or humans beyond traffic.
The customizable balance might support adaptation to regional driving differences without collecting new datasets.

Load-bearing premise

That a generic combination of learning-based and planning-based models can be constructed such that the balance between rational and irrational behaviors is customizable and produces stable generalization to unseen scenarios without post-hoc tuning or data-specific adjustments.

What would settle it

A demonstration that prediction accuracy drops or becomes unstable in a new unseen scenario outside the roundabout case study, or that adjusting the rational-irrational balance requires data-specific adjustments to maintain performance.

Figures

Figures reproduced from arXiv: 1907.10170 by Liting Sun, Masayoshi Tomizuka, Yeping Hu.

**Figure 1.** Figure 1: Illustration of the Frenet frame and vehicle boundaries. The ´ red point represents the vehicle’s center of mass. The shape of the vehicle is approximated by three circles [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗

**Figure 2.** Figure 2: This plot shows the process of proposed architecture. Note that trajectories directly sampled from the learning-based method [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Visualized prediction results of a selected scene at different [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: Generalization results at an unseen roundabout entrance. [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

**Figure 5.** Figure 5: Selected artificial test scenarios and the corresponding results. The pink dot [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗

**Figure 7.** Figure 7: Different sampled trajectories and their corresponding costs [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗

**Figure 8.** Figure 8: Visualization of the overall framework at different stages [PITH_FULL_IMAGE:figures/full_fig_p007_8.png] view at source ↗

read the original abstract

Accurately predicting future behaviors of surrounding vehicles is an essential capability for autonomous vehicles in order to plan safe and feasible trajectories. The behaviors of others, however, are full of uncertainties. Both rational and irrational behaviors exist, and the autonomous vehicles need to be aware of this in their prediction module. The prediction module is also expected to generate reasonable results in the presence of unseen and corner scenarios. Two types of prediction models are typically used to solve the prediction problem: learning-based model and planning-based model. Learning-based model utilizes real driving data to model the human behaviors. Depending on the structure of the data, learning-based models can predict both rational and irrational behaviors. But the balance between them cannot be customized, which creates challenges in generalizing the prediction results. Planning-based model, on the other hand, usually assumes human as a rational agent, i.e., it anticipates only rational behavior of human drivers. In this paper, a generic prediction architecture is proposed to address various rationalities in human behavior. We leverage the advantages from both learning-based and planning-based prediction models. The proposed approach is able to predict continuous trajectories that well-reflect possible future situations of other drivers. Moreover, the prediction performance remains stable under various unseen driving scenarios. A case study under a real-world roundabout scenario is provided to demonstrate the performance and capability of the proposed prediction architecture.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The hybrid idea addresses a real AV prediction gap but the stability claim hangs on one qualitative roundabout example with no metrics or ablations.

read the letter

The main takeaway is that this paper identifies a genuine issue—learning predictors can't tune the rational-irrational mix while planning ones ignore irrational cases—and offers a hybrid architecture to fix it, but the evidence for stable generalization to unseen scenarios is limited to a single qualitative case study. That framing is worth noting even if the execution stays thin. What the work does is lay out the limitations of each approach clearly: data-driven models pick up mixed behaviors from real trajectories but lock in whatever balance the training set has, while optimization-based planners default to rational agents. Their generic setup aims to let the user adjust that balance and still output continuous trajectories. The roundabout demonstration shows plausible paths that cover possible futures, which at least illustrates the intent. The soft spot is the lack of support for the central claim. The stress-test note is right that there's no multi-scenario quantitative benchmark, no held-out dataset results, no ablation on the rationality weighting, and no numbers on prediction error or stability. A single real-world roundabout with trajectory plots doesn't test whether the hybrid holds up without post-hoc tuning across corner cases. Without those, the weakest assumption—that the combination produces customizable and stable performance—remains untested. This paper is for people already working on motion forecasting in robotics who want to see one more attempt at blending the two families of methods. It won't give them a deployable system or strong empirical backing. It deserves peer review because the topic matters for safety and the authors engage the literature on the problem, even if the current results need substantial expansion on validation and implementation details before it would influence practice.

Referee Report

2 major / 0 minor

Summary. The paper proposes a generic hybrid prediction architecture for autonomous vehicles that combines learning-based models (to capture both rational and irrational behaviors from data) with planning-based models (assuming rational agents). It claims this allows customizable balance between behavior types, generates continuous trajectories reflecting possible futures, and yields stable performance on unseen and corner scenarios, with demonstration via a single real-world roundabout case study.

Significance. If the hybrid construction and stability claims were substantiated with quantitative multi-scenario validation, the work could address a practical gap in AV prediction by enabling tunable rationality without retraining, improving robustness to behavioral uncertainty.

major comments (2)

[Abstract] Abstract: The central claim that 'the prediction performance remains stable under various unseen driving scenarios' is supported only by qualitative trajectory plots from one roundabout case study. No held-out quantitative metrics (e.g., ADE/FDE), cross-dataset tests, ablation on the rationality weighting parameter, or sensitivity analysis are reported, leaving the generalization assertion untested.
[Abstract] Abstract: The description of how the hybrid architecture is constructed (e.g., the mechanism for combining learning-based and planning-based outputs, or how the customizable balance between rational/irrational behaviors is implemented and tuned) is absent. Without this, it is impossible to assess whether the architecture avoids post-hoc data-specific adjustments as asserted.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments, which help clarify how to better present the hybrid architecture and its claims. We respond to each major comment below.

read point-by-point responses

Referee: [Abstract] Abstract: The central claim that 'the prediction performance remains stable under various unseen driving scenarios' is supported only by qualitative trajectory plots from one roundabout case study. No held-out quantitative metrics (e.g., ADE/FDE), cross-dataset tests, ablation on the rationality weighting parameter, or sensitivity analysis are reported, leaving the generalization assertion untested.

Authors: We agree that the stability claim on unseen scenarios would be more robust with quantitative support beyond the single roundabout case study. The provided case study was chosen because it contains multiple unseen and corner behaviors within one environment, but we acknowledge the limitation of relying on qualitative plots alone. In revision we will add held-out ADE/FDE metrics on additional data splits, an ablation study varying the rationality weighting parameter, and sensitivity analysis to directly address this concern. revision: yes
Referee: [Abstract] Abstract: The description of how the hybrid architecture is constructed (e.g., the mechanism for combining learning-based and planning-based outputs, or how the customizable balance between rational/irrational behaviors is implemented and tuned) is absent. Without this, it is impossible to assess whether the architecture avoids post-hoc data-specific adjustments as asserted.

Authors: The abstract is intentionally concise, but we accept that it omits key implementation details. The body of the manuscript describes the fusion of learning-based trajectory distributions with planning-based rational constraints and the tunable weighting parameter that controls the rational/irrational balance. We will revise the abstract to include a brief statement of this mechanism and the tuning procedure so that the hybrid construction is clear from the abstract alone. revision: yes

Circularity Check

0 steps flagged

No circularity in derivation chain; architectural proposal with empirical demonstration.

full rationale

The paper presents a hybrid architecture proposal that combines existing learning-based and planning-based prediction models to handle rational/irrational behaviors. No equations, fitted parameters, or derivations are shown that reduce by construction to inputs or prior self-citations. The stability claim is supported by a single case study rather than a mathematical derivation, making the work self-contained against external benchmarks with no load-bearing self-referential steps.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only abstract available; no free parameters, axioms, or invented entities can be extracted. The central claim rests on the unstated premise that a hybrid model can be made to generalize without scenario-specific engineering.

pith-pipeline@v0.9.0 · 5779 in / 1064 out tokens · 16277 ms · 2026-05-24T17:05:47.621199+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

32 extracted references · 32 canonical work pages · 3 internal anchors

[1]

Convolutional Social Pooling for Vehicle Trajectory Prediction

N. Deo and M. M. Trivedi, “Convolutional social pooling for vehicle trajectory prediction,” arXiv preprint arXiv:1805.06771 , 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[2]

Probabilistic analysis of dynamic scenes and collision risks assessment to improve driving safety,

C. Laugier, I. E. Paromtchik, M. Perrollaz, M. Yong, J.-D. Yoder, C. Tay, K. Mekhnacha, and A. N `egre, “Probabilistic analysis of dynamic scenes and collision risks assessment to improve driving safety,” IEEE Intelligent Transportation Systems Magazine , vol. 3, no. 4, pp. 4–19, 2011

work page 2011
[3]

Probabilistic Vehicle Trajectory Prediction over Occupancy Grid Map via Recurrent Neural Network

B. Kim, C. M. Kang, S. H. Lee, H. Chae, J. Kim, C. C. Chung, and J. W. Choi, “Probabilistic vehicle trajectory prediction over occupancy grid map via recurrent neural network,” arXiv preprint arXiv:1704.07049, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[4]

Probabilistic prediction of vehicle semantic intention and motion,

Y . Hu, W. Zhan, and M. Tomizuka, “Probabilistic prediction of vehicle semantic intention and motion,” inIntelligent Vehicles Symposium (IV), 2018 IEEE. IEEE, 2018, pp. 307–313

work page 2018
[5]

A probabilistic model for estimating driver behaviors and vehicle trajectories in trafﬁc environments,

T. Gindele, S. Brechtel, and R. Dillmann, “A probabilistic model for estimating driver behaviors and vehicle trajectories in trafﬁc environments,” inIntelligent Transportation Systems (ITSC), 2010 13th International IEEE Conference on . IEEE, 2010, pp. 1625–1631

work page 2010
[6]

Generating lane-change trajectories of individual drivers,

Y . Nishiwaki, C. Miyajima, N. Kitaoka, R. Terashima, T. Wakita, and K. Takeda, “Generating lane-change trajectories of individual drivers,” in Vehicular Electronics and Safety, 2008. ICVES 2008. IEEE International Conference on . IEEE, 2008, pp. 271–275

work page 2008
[7]

Accurate and diverse sampling of sequences based on a best of many sample objective,

A. Bhattacharyya, B. Schiele, and M. Fritz, “Accurate and diverse sampling of sequences based on a best of many sample objective,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 8485–8493

work page 2018
[8]

A framework for probabilistic generic trafﬁc scene prediction,

Y . Hu, W. Zhan, and M. Tomizuka, “A framework for probabilistic generic trafﬁc scene prediction,” in Intelligent Transportation Systems (ITSC). IEEE, 2018, pp. 2790–2796

work page 2018
[9]

Multi-modal probabilistic predic- tion of interactive behavior via an interpretable model,

Y . Hu, W. Zhan, and M. Tomizuka, “Multi-modal probabilistic predic- tion of interactive behavior via an interpretable model,” in Proceedings of the 2019 IEEE Intelligent Vehicles Symposium (IV2019) , 2019

work page 2019
[10]

Junior: The stanford entry in the urban challenge,

M. Montemerlo, J. Becker, S. Bhat, H. Dahlkamp, D. Dolgov, S. Et- tinger, D. Haehnel, T. Hilden, G. Hoffmann, B. Huhnke et al., “Junior: The stanford entry in the urban challenge,” Journal of ﬁeld Robotics, vol. 25, no. 9, pp. 569–597, 2008

work page 2008
[11]

Path planning for autonomous vehicles in unknown semi-structured environments,

D. Dolgov, S. Thrun, M. Montemerlo, and J. Diebel, “Path planning for autonomous vehicles in unknown semi-structured environments,” The International Journal of Robotics Research , vol. 29, no. 5, pp. 485–501, 2010

work page 2010
[12]

Time-bounded lattice for efﬁcient planning in dynamic environments,

A. Kushleyev and M. Likhachev, “Time-bounded lattice for efﬁcient planning in dynamic environments,” in 2009 IEEE International Conference on Robotics and Automation . IEEE, 2009, pp. 1662– 1668

work page 2009
[13]

Real-time motion planning with applications to autonomous urban driving,

Y . Kuwata, J. Teo, G. Fiore, S. Karaman, E. Frazzoli, and J. P. How, “Real-time motion planning with applications to autonomous urban driving,” IEEE Transactions on Control Systems Technology , vol. 17, no. 5, pp. 1105–1118, 2009

work page 2009
[14]

Optimal motion planning with the half-car dynamical model for autonomous high-speed driving,

J. hwan Jeon, R. V . Cowlagi, S. C. Peters, S. Karaman, E. Frazzoli, P. Tsiotras, and K. Iagnemma, “Optimal motion planning with the half-car dynamical model for autonomous high-speed driving,” in2013 American Control Conference. IEEE, 2013, pp. 188–193

work page 2013
[15]

Threat-aware path planning in uncertain urban environments,

G. S. Aoude, B. D. Luders, D. S. Levine, and J. P. How, “Threat-aware path planning in uncertain urban environments,” in 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems . IEEE, 2010, pp. 6058–6063

work page 2010
[16]

When is a linear control system optimal?

R. E. Kalman, “When is a linear control system optimal?” Journal of Basic Engineering, vol. 86, no. 1, pp. 51–60, 1964

work page 1964
[17]

Algorithms for inverse reinforcement learning

A. Y . Ng, S. J. Russell et al. , “Algorithms for inverse reinforcement learning.” in Icml, vol. 1, 2000, p. 2

work page 2000
[18]

Apprenticeship learning via inverse rein- forcement learning,

P. Abbeel and A. Y . Ng, “Apprenticeship learning via inverse rein- forcement learning,” in Proceedings of the twenty-ﬁrst international conference on Machine learning . ACM, 2004, p. 1

work page 2004
[19]

Maximum entropy inverse reinforcement learning

B. D. Ziebart, A. L. Maas, J. A. Bagnell, and A. K. Dey, “Maximum entropy inverse reinforcement learning.” in Aaai, vol. 8. Chicago, IL, USA, 2008, pp. 1433–1438

work page 2008
[20]

Continuous inverse optimal control with locally optimal examples,,

S. Levine and V . Koltun, “Continuous inverse optimal control with locally optimal examples,,” in the 29th International Conference on Machine Learning (ICML-12) , 2012

work page 2012
[21]

Socially compliant mobile robot navigation via inverse reinforcement learning,

H. Kretzschmar, M. Spies, C. Sprunk, and W. Burgard, “Socially compliant mobile robot navigation via inverse reinforcement learning,” The International Journal of Robotics Research , vol. 35, no. 11, pp. 1289–1307, 2016

work page 2016
[22]

Courteous autonomous cars,

L. Sun, W. Zhan, M. Tomizuka, and A. D. Dragan, “Courteous autonomous cars,” in 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) . IEEE, 2018, pp. 663–670

work page 2018
[23]

Probabilistic prediction of interactive driving behavior via hierarchical inverse reinforcement learning,

L. Sun, W. Zhan, and M. Tomizuka, “Probabilistic prediction of interactive driving behavior via hierarchical inverse reinforcement learning,” in 2018 21st International Conference on Intelligent Trans- portation Systems (ITSC) . IEEE, 2018, pp. 2111–2117

work page 2018
[24]

Behavior planning of autonomous cars with social perception,

L. Sun, W. Zhan, C.-Y . Chan, and M. Tomizuka, “Behavior planning of autonomous cars with social perception,” in Proceedings of the 2019 IEEE Intelligent Vehicles Symposium (IV2019) , vol. 1, 2019, p. 2

work page 2019
[25]

Risk-sensitive inverse reinforcement learning via coherent risk models

A. Majumdar, S. Singh, A. Mandlekar, and M. Pavone, “Risk-sensitive inverse reinforcement learning via coherent risk models.” in Robotics: Science and Systems , 2017

work page 2017
[26]

Interpretable modelling of driving behaviors in interactive driving scenarios based on cumu- lative prospect theory,

L. Sun, W. Zhan, Y . Hu, and M. Tomizuka, “Interpretable modelling of driving behaviors in interactive driving scenarios based on cumu- lative prospect theory,” in Proceedings of the 2019 IEEE Intelligent Transporation Systems Conference (ITSC2019) , vol. 1, 2019, p. 2

work page 2019
[27]

Scaling up dynamic time warping for datamining applications,

E. J. Keogh and M. J. Pazzani, “Scaling up dynamic time warping for datamining applications,” in Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining . ACM, 2000, pp. 285–289

work page 2000
[28]

Learning structured output representa- tion using deep conditional generative models,

K. Sohn, H. Lee, and X. Yan, “Learning structured output representa- tion using deep conditional generative models,” in Advances in neural information processing systems , 2015, pp. 3483–3491

work page 2015
[29]

Auto-Encoding Variational Bayes

D. P. Kingma and M. Welling, “Auto-encoding variational bayes,” arXiv preprint arXiv:1312.6114 , 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013
[30]

Does the chimpanzee have a theory of mind?

D. Premack and G. Woodruff, “Does the chimpanzee have a theory of mind?” Behavioral and brain sciences , vol. 1, no. 4, pp. 515–526, 1978

work page 1978
[31]

Constructing a highly interactive vehicle motion dataset

W. Zhan, L. Sun, D. Wang, Y . Jin, and M. Tomizuka, “Constructing a highly interactive vehicle motion dataset.” in IROS2019, vol. 1, 2019, p. 2

work page 2019
[32]

Interaction dataset: An international, adversarial and cooperative motion dataset in interactive scenarios with semantic maps,

W. Zhan, L. Sun, D. Wang, H. Shi, A. Clausse, M. Naumann, J. Kum- merle, H. Konigshof, C. Stiller, A. de La Fortelle, and M. Tomizuka, “Interaction dataset: An international, adversarial and cooperative motion dataset in interactive scenarios with semantic maps,” 2019

work page 2019

[1] [1]

Convolutional Social Pooling for Vehicle Trajectory Prediction

N. Deo and M. M. Trivedi, “Convolutional social pooling for vehicle trajectory prediction,” arXiv preprint arXiv:1805.06771 , 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[2] [2]

Probabilistic analysis of dynamic scenes and collision risks assessment to improve driving safety,

C. Laugier, I. E. Paromtchik, M. Perrollaz, M. Yong, J.-D. Yoder, C. Tay, K. Mekhnacha, and A. N `egre, “Probabilistic analysis of dynamic scenes and collision risks assessment to improve driving safety,” IEEE Intelligent Transportation Systems Magazine , vol. 3, no. 4, pp. 4–19, 2011

work page 2011

[3] [3]

Probabilistic Vehicle Trajectory Prediction over Occupancy Grid Map via Recurrent Neural Network

B. Kim, C. M. Kang, S. H. Lee, H. Chae, J. Kim, C. C. Chung, and J. W. Choi, “Probabilistic vehicle trajectory prediction over occupancy grid map via recurrent neural network,” arXiv preprint arXiv:1704.07049, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[4] [4]

Probabilistic prediction of vehicle semantic intention and motion,

Y . Hu, W. Zhan, and M. Tomizuka, “Probabilistic prediction of vehicle semantic intention and motion,” inIntelligent Vehicles Symposium (IV), 2018 IEEE. IEEE, 2018, pp. 307–313

work page 2018

[5] [5]

A probabilistic model for estimating driver behaviors and vehicle trajectories in trafﬁc environments,

T. Gindele, S. Brechtel, and R. Dillmann, “A probabilistic model for estimating driver behaviors and vehicle trajectories in trafﬁc environments,” inIntelligent Transportation Systems (ITSC), 2010 13th International IEEE Conference on . IEEE, 2010, pp. 1625–1631

work page 2010

[6] [6]

Generating lane-change trajectories of individual drivers,

Y . Nishiwaki, C. Miyajima, N. Kitaoka, R. Terashima, T. Wakita, and K. Takeda, “Generating lane-change trajectories of individual drivers,” in Vehicular Electronics and Safety, 2008. ICVES 2008. IEEE International Conference on . IEEE, 2008, pp. 271–275

work page 2008

[7] [7]

Accurate and diverse sampling of sequences based on a best of many sample objective,

A. Bhattacharyya, B. Schiele, and M. Fritz, “Accurate and diverse sampling of sequences based on a best of many sample objective,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 8485–8493

work page 2018

[8] [8]

A framework for probabilistic generic trafﬁc scene prediction,

Y . Hu, W. Zhan, and M. Tomizuka, “A framework for probabilistic generic trafﬁc scene prediction,” in Intelligent Transportation Systems (ITSC). IEEE, 2018, pp. 2790–2796

work page 2018

[9] [9]

Multi-modal probabilistic predic- tion of interactive behavior via an interpretable model,

Y . Hu, W. Zhan, and M. Tomizuka, “Multi-modal probabilistic predic- tion of interactive behavior via an interpretable model,” in Proceedings of the 2019 IEEE Intelligent Vehicles Symposium (IV2019) , 2019

work page 2019

[10] [10]

Junior: The stanford entry in the urban challenge,

M. Montemerlo, J. Becker, S. Bhat, H. Dahlkamp, D. Dolgov, S. Et- tinger, D. Haehnel, T. Hilden, G. Hoffmann, B. Huhnke et al., “Junior: The stanford entry in the urban challenge,” Journal of ﬁeld Robotics, vol. 25, no. 9, pp. 569–597, 2008

work page 2008

[11] [11]

Path planning for autonomous vehicles in unknown semi-structured environments,

D. Dolgov, S. Thrun, M. Montemerlo, and J. Diebel, “Path planning for autonomous vehicles in unknown semi-structured environments,” The International Journal of Robotics Research , vol. 29, no. 5, pp. 485–501, 2010

work page 2010

[12] [12]

Time-bounded lattice for efﬁcient planning in dynamic environments,

A. Kushleyev and M. Likhachev, “Time-bounded lattice for efﬁcient planning in dynamic environments,” in 2009 IEEE International Conference on Robotics and Automation . IEEE, 2009, pp. 1662– 1668

work page 2009

[13] [13]

Real-time motion planning with applications to autonomous urban driving,

Y . Kuwata, J. Teo, G. Fiore, S. Karaman, E. Frazzoli, and J. P. How, “Real-time motion planning with applications to autonomous urban driving,” IEEE Transactions on Control Systems Technology , vol. 17, no. 5, pp. 1105–1118, 2009

work page 2009

[14] [14]

Optimal motion planning with the half-car dynamical model for autonomous high-speed driving,

J. hwan Jeon, R. V . Cowlagi, S. C. Peters, S. Karaman, E. Frazzoli, P. Tsiotras, and K. Iagnemma, “Optimal motion planning with the half-car dynamical model for autonomous high-speed driving,” in2013 American Control Conference. IEEE, 2013, pp. 188–193

work page 2013

[15] [15]

Threat-aware path planning in uncertain urban environments,

G. S. Aoude, B. D. Luders, D. S. Levine, and J. P. How, “Threat-aware path planning in uncertain urban environments,” in 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems . IEEE, 2010, pp. 6058–6063

work page 2010

[16] [16]

When is a linear control system optimal?

R. E. Kalman, “When is a linear control system optimal?” Journal of Basic Engineering, vol. 86, no. 1, pp. 51–60, 1964

work page 1964

[17] [17]

Algorithms for inverse reinforcement learning

A. Y . Ng, S. J. Russell et al. , “Algorithms for inverse reinforcement learning.” in Icml, vol. 1, 2000, p. 2

work page 2000

[18] [18]

Apprenticeship learning via inverse rein- forcement learning,

P. Abbeel and A. Y . Ng, “Apprenticeship learning via inverse rein- forcement learning,” in Proceedings of the twenty-ﬁrst international conference on Machine learning . ACM, 2004, p. 1

work page 2004

[19] [19]

Maximum entropy inverse reinforcement learning

B. D. Ziebart, A. L. Maas, J. A. Bagnell, and A. K. Dey, “Maximum entropy inverse reinforcement learning.” in Aaai, vol. 8. Chicago, IL, USA, 2008, pp. 1433–1438

work page 2008

[20] [20]

Continuous inverse optimal control with locally optimal examples,,

S. Levine and V . Koltun, “Continuous inverse optimal control with locally optimal examples,,” in the 29th International Conference on Machine Learning (ICML-12) , 2012

work page 2012

[21] [21]

Socially compliant mobile robot navigation via inverse reinforcement learning,

H. Kretzschmar, M. Spies, C. Sprunk, and W. Burgard, “Socially compliant mobile robot navigation via inverse reinforcement learning,” The International Journal of Robotics Research , vol. 35, no. 11, pp. 1289–1307, 2016

work page 2016

[22] [22]

Courteous autonomous cars,

L. Sun, W. Zhan, M. Tomizuka, and A. D. Dragan, “Courteous autonomous cars,” in 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) . IEEE, 2018, pp. 663–670

work page 2018

[23] [23]

Probabilistic prediction of interactive driving behavior via hierarchical inverse reinforcement learning,

L. Sun, W. Zhan, and M. Tomizuka, “Probabilistic prediction of interactive driving behavior via hierarchical inverse reinforcement learning,” in 2018 21st International Conference on Intelligent Trans- portation Systems (ITSC) . IEEE, 2018, pp. 2111–2117

work page 2018

[24] [24]

Behavior planning of autonomous cars with social perception,

L. Sun, W. Zhan, C.-Y . Chan, and M. Tomizuka, “Behavior planning of autonomous cars with social perception,” in Proceedings of the 2019 IEEE Intelligent Vehicles Symposium (IV2019) , vol. 1, 2019, p. 2

work page 2019

[25] [25]

Risk-sensitive inverse reinforcement learning via coherent risk models

A. Majumdar, S. Singh, A. Mandlekar, and M. Pavone, “Risk-sensitive inverse reinforcement learning via coherent risk models.” in Robotics: Science and Systems , 2017

work page 2017

[26] [26]

Interpretable modelling of driving behaviors in interactive driving scenarios based on cumu- lative prospect theory,

L. Sun, W. Zhan, Y . Hu, and M. Tomizuka, “Interpretable modelling of driving behaviors in interactive driving scenarios based on cumu- lative prospect theory,” in Proceedings of the 2019 IEEE Intelligent Transporation Systems Conference (ITSC2019) , vol. 1, 2019, p. 2

work page 2019

[27] [27]

Scaling up dynamic time warping for datamining applications,

E. J. Keogh and M. J. Pazzani, “Scaling up dynamic time warping for datamining applications,” in Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining . ACM, 2000, pp. 285–289

work page 2000

[28] [28]

Learning structured output representa- tion using deep conditional generative models,

K. Sohn, H. Lee, and X. Yan, “Learning structured output representa- tion using deep conditional generative models,” in Advances in neural information processing systems , 2015, pp. 3483–3491

work page 2015

[29] [29]

Auto-Encoding Variational Bayes

D. P. Kingma and M. Welling, “Auto-encoding variational bayes,” arXiv preprint arXiv:1312.6114 , 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013

[30] [30]

Does the chimpanzee have a theory of mind?

D. Premack and G. Woodruff, “Does the chimpanzee have a theory of mind?” Behavioral and brain sciences , vol. 1, no. 4, pp. 515–526, 1978

work page 1978

[31] [31]

Constructing a highly interactive vehicle motion dataset

W. Zhan, L. Sun, D. Wang, Y . Jin, and M. Tomizuka, “Constructing a highly interactive vehicle motion dataset.” in IROS2019, vol. 1, 2019, p. 2

work page 2019

[32] [32]

Interaction dataset: An international, adversarial and cooperative motion dataset in interactive scenarios with semantic maps,

W. Zhan, L. Sun, D. Wang, H. Shi, A. Clausse, M. Naumann, J. Kum- merle, H. Konigshof, C. Stiller, A. de La Fortelle, and M. Tomizuka, “Interaction dataset: An international, adversarial and cooperative motion dataset in interactive scenarios with semantic maps,” 2019

work page 2019