arxiv: 2605.07166 · v1 · submitted 2026-05-08 · 💻 cs.LG

Recognition: 2 theorem links

· Lean Theorem

Neurosymbolic Imitation Learning with Human Guidance: A Privileged Information Approach

Nikhilesh Prabhakar , Varun Balaji , Athresh Karanam , Kristian Kersting , Sriraam Natarajan

Authors on Pith no claims yet

Pith reviewed 2026-05-11 01:40 UTC · model grok-4.3

classification 💻 cs.LG

keywords imitation learningneurosymbolicprivileged informationgaze datageneralizationhuman guidancepolicy learning

0 comments

The pith

A neurosymbolic method for imitation learning exploits privileged gaze data available only during training to combine high-dimensional perception with strong generalization.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Imitation learning from demonstrations often struggles when using only neural networks because they require large amounts of data and tend to overfit to the training examples. Purely symbolic methods generalize better but cannot process the raw high-dimensional sensory inputs common in real environments. This paper develops a neurosymbolic architecture that uses neural components to handle complex observations while relying on symbolic structures for decision making. It does so by incorporating gaze data as privileged information that guides the learning process but is not needed at deployment time. The result is a policy that performs effectively in complex settings and generalizes to new scenarios, as shown through empirical tests.

Core claim

We propose a neurosymbolic approach that achieves the best of both worlds, i.e, handling high-dimensional data while achieving generalization. The key advantage of our approach is that it can effectively exploit additional privileged information that is available only during training (in our case, gaze data). Our empirical evaluations demonstrate the effectiveness, efficiency and the generalization capability of our proposed approach.

What carries the argument

Neurosymbolic architecture that integrates privileged gaze information available only at training time to guide the fusion of neural perception and symbolic reasoning in imitation learning.

If this is right

The approach enables imitation learning policies to generalize from fewer demonstrations than pure neural methods.
It processes high-dimensional inputs effectively while maintaining the generalization properties of symbolic systems.
Training with gaze data leads to more efficient learning and better performance in complex environments.
Privileged information can be leveraged to bridge the gap between neural and symbolic methods without requiring it at test time.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar privileged signals could be used in other learning settings where extra data is cheap to collect during training but expensive later.
The neurosymbolic design may reduce sample complexity in domains like autonomous driving or robotics where gaze or attention data can be recorded.
Extending this to other human guidance signals beyond gaze could further improve data efficiency in imitation tasks.

Load-bearing premise

That gaze data or similar privileged information can be reliably collected during training and integrated into the neurosymbolic architecture without introducing new failure modes or requiring domain-specific assumptions about the relationship between gaze and actions.

What would settle it

An experiment that trains the model with and without the privileged gaze data and compares generalization performance on unseen high-dimensional test environments; failure to show improvement with gaze data would falsify the central advantage.

Figures

Figures reproduced from arXiv: 2605.07166 by Athresh Karanam, Kristian Kersting, Nikhilesh Prabhakar, Sriraam Natarajan, Varun Balaji.

**Figure 1.** Figure 1: Neurosymbolic Imitation Learning (NESY-IL) Architecture. The framework integrates high-dimensional visual perception with structured symbolic reasoning. A perception model extracts a set of grounded atoms from raw observations. Concurrently, a gaze prediction model and an object extractor provide object-level saliency, which is used to reweigh the states and align them with human attention. This gaze-ali… view at source ↗

**Figure 1.** Figure 1: To reiterate, the framework obtains raw images as inputs, converts [PITH_FULL_IMAGE:figures/full_fig_p009_1.png] view at source ↗

**Figure 2.** Figure 2: Comparison of sample efficiency in Atari games. Mean Game Score is plotted against the percentage of training dataset used for two domains, Asterix (left) and Seaquest (right). In both the domains, GRAIL achieves higher performance earlier and converges to a higher asymptote. Q2. Sample Efficiency To assess whether access to human gaze data improves performance in low-data regimes, we compare the learnin… view at source ↗

**Figure 3.** Figure 3: Generalization across variable object counts in Seaquest. Mean game scores are reported for configurations trained on a dataset containing a maximum of one and two objects per object type. training distribution. NSFR-IL improves substantially in the 2-objects setting, and GRAIL improves further still, achieving the highest mean reward overall. This is consistent with the compositionality of first-order rel… view at source ↗

**Figure 1.** Figure 1: NSFR inference pipeline for Seaquest. Raw pixels are converted to an objectcentric logic state via OCAtari, grounded into fuzzy atom valuations (a neural perception module is trained to convert the stacked image into grounded valuations), modulated by gaze, and passed through T=2 steps of differentiable forward chaining to produce a discrete action. The Neurosymbolic Forward Reasoner (NSFR) forms the cor… view at source ↗

**Figure 2.** Figure 2: GRAIL inference pipeline for Seaquest. Raw pixels are converted to an objectcentric logic state via OCAtari, grounded into fuzzy atom valuations, and reweighted by the predicted gaze heatmap Gˆt from a frozen HumanGazeNet encoder gϕ. The gazemodulated atoms v (g) t are then passed through T=2 steps of differentiable forward chaining to produce a discrete action [PITH_FULL_IMAGE:figures/full_fig_p019_2.png] view at source ↗

read the original abstract

Imitation learning is widely used for learning to act in complex environments. While pure neural-based methods handle high dimensional data effectively, they suffer from the requirement of large number of samples and are prone to overfitting. Pure symbolic approaches, while generalize well, do not handle high-dimensional data effectively. We propose a neurosymbolic approach that achieves the best of both worlds, i.e, handling high-dimensional data while achieving generalization. The key advantage of our approach is that it can effectively exploit additional privileged information that is available only during training (in our case, gaze data). Our empirical evaluations demonstrate the effectiveness, efficiency and the generalization capability of our proposed approach.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The neurosymbolic IL method with privileged gaze data is a reasonable hybrid idea but the integration and results stay too thin to judge the generalization claim.

read the letter

This paper combines neural components for high-dimensional inputs with symbolic ones for better generalization in imitation learning, using gaze data that is only available during training as the bridge. The core pitch is that this gets the strengths of both without the usual sample hunger or overfitting of pure neural IL, and without the high-dim limitations of pure symbolic methods. That framing is straightforward and matches real needs in settings where humans can provide extra signals like eye tracking at training time but not at deployment. If the experiments actually show the policy generalizing when gaze is removed at test time, and if they include proper baselines and ablations, the work would be a useful incremental step for the subfield. The soft spot is exactly what the stress-test flags: the abstract gives no architecture diagram, no description of how gaze gets mapped into the symbolic part or used as supervision, and no loss functions or quantitative results. Without those, it is impossible to check whether the privileged signal creates new failure modes or whether the reported generalization is real. The assumption that gaze data can be collected cleanly and correlates usefully with actions is left unexamined in the available text. This is aimed at imitation-learning researchers who already work with human guidance or neurosymbolic hybrids. It deserves a serious referee because the problem is practical and the proposed direction is coherent, even though the current version would need substantial revision to make the claims verifiable.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a neurosymbolic imitation learning approach for acting in complex environments. It combines neural methods (to handle high-dimensional data) with symbolic methods (for generalization) by exploiting additional privileged information available only at training time—in this case, gaze data. The abstract claims that empirical evaluations demonstrate the effectiveness, efficiency, and generalization capability of the proposed method.

Significance. If the central claims hold after details are supplied, the work could meaningfully advance imitation learning by addressing the sample inefficiency and overfitting of pure neural approaches while mitigating the high-dimensional data limitations of pure symbolic approaches. Leveraging human-provided privileged signals such as gaze during training offers a practical route to more robust policies that do not require the privileged signal at test time.

major comments (2)

Abstract: the manuscript asserts that the neurosymbolic approach 'achieves the best of both worlds' and that 'empirical evaluations demonstrate the effectiveness, efficiency and the generalization capability,' yet supplies no architecture description, loss functions, datasets, baselines, ablation studies, or quantitative results. This absence is load-bearing for the central claim that privileged gaze data can be integrated without introducing new failure modes or overfitting to training-time signals.
Method (or equivalent section describing the model): the integration of gaze data into the neurosymbolic architecture is not specified—e.g., whether gaze serves as auxiliary supervision, how it is mapped onto symbolic components, or what training objective enforces generalization once gaze is removed at test time. Without these details the generalization claim cannot be evaluated.

minor comments (1)

The abstract would benefit from a single sentence naming the specific tasks or environments used in the claimed empirical evaluations.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback and for highlighting areas where additional clarity would strengthen the manuscript. We agree that the abstract and method descriptions would benefit from more explicit technical details to better support the central claims. We will revise the paper to address these points directly.

read point-by-point responses

Referee: Abstract: the manuscript asserts that the neurosymbolic approach 'achieves the best of both worlds' and that 'empirical evaluations demonstrate the effectiveness, efficiency and the generalization capability,' yet supplies no architecture description, loss functions, datasets, baselines, ablation studies, or quantitative results. This absence is load-bearing for the central claim that privileged gaze data can be integrated without introducing new failure modes or overfitting to training-time signals.

Authors: We acknowledge that the abstract is high-level and does not enumerate the specific elements listed. The full manuscript contains dedicated sections on the architecture, training losses, datasets, baselines, and ablations with quantitative results. To make this immediately apparent, we will revise the abstract to briefly reference the key components (neural processing of high-dimensional inputs, symbolic generalization, and privileged gaze integration) and include a short summary of the empirical findings. We will also add explicit cross-references to the relevant sections and figures. revision: yes
Referee: Method (or equivalent section describing the model): the integration of gaze data into the neurosymbolic architecture is not specified—e.g., whether gaze serves as auxiliary supervision, how it is mapped onto symbolic components, or what training objective enforces generalization once gaze is removed at test time. Without these details the generalization claim cannot be evaluated.

Authors: We agree that the integration mechanism requires clearer exposition. In the revised Method section we will explicitly state that gaze data functions as auxiliary supervision during training only: it is mapped to symbolic predicates via a learned attention module that aligns visual features with symbolic states, and the overall objective combines behavioral cloning loss with a privileged-information regularization term that penalizes reliance on gaze at inference. We will include the precise loss formulation, a diagram of the information flow, and a proof sketch showing that the regularization enforces generalization once the privileged signal is removed. This will directly support the generalization claim. revision: yes

Circularity Check

0 steps flagged

No derivation chain or self-referential fits present in empirical proposal

full rationale

The manuscript proposes a neurosymbolic imitation learning method that exploits privileged gaze data available only at training time. It contains no equations, no parameter-fitting steps, no uniqueness theorems, and no mathematical derivations that could reduce outputs to inputs by construction. Claims of achieving 'the best of both worlds' and demonstrating generalization rest entirely on empirical evaluations rather than any self-definitional, fitted-input, or self-citation load-bearing structure. The absence of any load-bearing derivation chain makes the circularity score zero.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No free parameters, axioms, or invented entities are identifiable from the abstract alone; the proposal relies on standard neural and symbolic learning assumptions without explicit enumeration.

pith-pipeline@v0.9.0 · 5421 in / 1145 out tokens · 46221 ms · 2026-05-11T01:40:50.410848+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We propose a neurosymbolic approach that achieves the best of both worlds... using gaze data as privileged information... GRAIL framework... differentiable forward-chaining reasoner... NSFR
IndisputableMonolith/Foundation/ArithmeticFromLogic.lean LogicNat recovery theorem unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

relational policies... first-order definite clauses... gaze-modulated symbolic state v(g)t,i = v(0)t,i · si

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

48 extracted references · 48 canonical work pages · 1 internal anchor

[1]

and Jordan, Michael I

Abbeel, P., Ng, A.Y.: Apprenticeship learning via inverse reinforcement learning. In: Proceedings of the Twenty-First International Conference on Machine Learning (ICML). p. 1. ACM (2004). https://doi.org/10.1145/1015330.1015430

work page doi:10.1145/1015330.1015430 2004
[2]

Robotics and Autonomous Systems57(5), 469–483 (2009)

Argall, B.D., Chernova, S., Veloso, M., Browning, B.: A survey of robot learning from demonstration. Robotics and Autonomous Systems57(5), 469–483 (2009). https://doi.org/10.1016/j.robot.2008.10.024

work page doi:10.1016/j.robot.2008.10.024 2009
[3]

arXiv preprint arXiv:2507.19647 (2025)

Banayeeanzade, A., Bahrani, F., Zhou, Y., Bıyık, E.: Gabril: Gaze-based regu- larization for mitigating causal confusion in imitation learning. arXiv preprint arXiv:2507.19647 (2025)

work page arXiv 2025
[4]

Journal of Artificial Intelligence Research47, 253–279 (2013)

Bellemare, M.G., Naddaf, Y., Veness, J., Bowling, M.: The arcade learning environ- ment: An evaluation platform for general agents. Journal of Artificial Intelligence Research47, 253–279 (2013)

work page 2013
[5]

PMLR (2023)

Caldarelli, E., Chatalic, A., Colomé, A., Rosasco, L., Torras, C.: Heteroscedastic gaussian processes and random features: Scalable motion primitives with guaran- tees.In:CoRL.ProceedingsofMachineLearningResearch,vol.229,pp.3010–3029. PMLR (2023)

work page 2023
[6]

EPFL/CRC Press (2009)

Calinon, S.: Robot Programming by Demonstration: A Probabilistic Approach. EPFL/CRC Press (2009)

work page 2009
[7]

In: 2019 IEEE/RSJ International Conference on In- telligent Robots and Systems (IROS)

Chen, Y., Liu, C., Tai, L., Liu, M., Shi, B.E.: Gaze training by modulated dropout improves imitation learning. In: 2019 IEEE/RSJ International Conference on In- telligent Robots and Systems (IROS). pp. 7756–7761. IEEE (2019) 14 N. Prabhakar et al

work page 2019
[8]

In: Chaudhuri, K., Jegelka, S., Song, L., Szepesvari, C., Niu, G., Sabato, S

Collier, M., Jenatton, R., Kokiopoulou, E., Berent, J.: Transfer and marginal- ize: Explaining away label noise with privileged information. In: Chaudhuri, K., Jegelka, S., Song, L., Szepesvari, C., Niu, G., Sabato, S. (eds.) Proceedings of the 39th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 162, pp. 4219–...

work page 2022
[9]

Machine learning75(3), 297–325 (2009)

Daumé, H., Langford, J., Marcu, D.: Search-based structured prediction. Machine learning75(3), 297–325 (2009)

work page 2009
[10]

Ma- chine learning43(1), 7–52 (2001)

Džeroski, S., De Raedt, L., Driessens, K.: Relational reinforcement learning. Ma- chine learning43(1), 7–52 (2001)

work page 2001
[11]

Journal of Artificial Intelligence Research61, 1–64 (2018)

Evans, R., Grefenstette, E.: Learning explanatory rules from noisy data. Journal of Artificial Intelligence Research61, 1–64 (2018)

work page 2018
[12]

IEEE Trans

Garcia, N.C., Morerio, P., Murino, V.: Learning with privileged information via adversarial discriminative modality distillation. IEEE Trans. Pattern Anal. Mach. Intell.42(10), 2581–2593 (2020)

work page 2020
[13]

In: Advances in Neural Information Processing Systems

Hernández-Lobato, D., Sharmanska, V., Kersting, K., Lampert, C.H., Quadrianto, N.: Mind the nuisance: Gaussian process classification using privileged noise. In: Advances in Neural Information Processing Systems. pp. 55–63 (2014)

work page 2014
[14]

Distilling the Knowledge in a Neural Network

Hinton, G., Vinyals, O., Dean, J.: Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015)

work page internal anchor Pith review Pith/arXiv arXiv 2015
[15]

In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Hoffman, J., Gupta, S., Darrell, T.: Learning with side information through modal- ity hallucination. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 826–834 (2016). https://doi.org/10.1109/CVPR.2016.96

work page doi:10.1109/cvpr.2016.96 2016
[16]

Artificial Intelli- gence113(1–2), 125–148 (1999)

Khardon, R.: Learning action strategies for planning domains. Artificial Intelli- gence113(1–2), 125–148 (1999). https://doi.org/10.1016/S0004-3702(99)00060-0

work page doi:10.1016/s0004-3702(99)00060-0 1999
[17]

IEEE Robotics and Au- tomation Letters5(3), 4415–4422 (2020)

Kim, H., Ohmura, Y., Kuniyoshi, Y.: Using human gaze to improve robustness against irrelevant objects in robot manipulation tasks. IEEE Robotics and Au- tomation Letters5(3), 4415–4422 (2020)

work page 2020
[18]

IEEE Robotics and Au- tomation Letters6(2), 1630–1637 (2021)

Kim, H., Ohmura, Y., Kuniyoshi, Y.: Gaze-based dual resolution deep imitation learning for high-precision dexterous robot manipulation. IEEE Robotics and Au- tomation Letters6(2), 1630–1637 (2021)

work page 2021
[19]

In: Theory and Practice of Logic Programming

Kimmig, A., Demoen, B., Raedt, L.D., Costa, V.S., Rocha, R.: On the implemen- tation of the probabilistic logic programming language ProbLog. In: Theory and Practice of Logic Programming. vol. 11, pp. 235–262. Cambridge University Press (2011)

work page 2011
[20]

In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (June 2018)

Lambert, J., Sener, O., Savarese, S.: Deep learning under privileged information using heteroscedastic dropout. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (June 2018)

work page 2018
[21]

In: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Liang, A., Thomason, J., Bıyık, E.: Visarl: Visual reinforcement learning guided by human saliency. In: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). pp. 2907–2912. IEEE (2024)

work page 2024
[22]

Communications of the ACM43(3), 72–74 (2000)

Lieberman, H.: Programming by example (introduction). Communications of the ACM43(3), 72–74 (2000). https://doi.org/10.1145/330534.330543

work page doi:10.1145/330534.330543 2000
[23]

Springer-Verlag, Berlin, 2nd edn

Lloyd, J.W.: Foundations of Logic Programming. Springer-Verlag, Berlin, 2nd edn. (1987)

work page 1987
[24]

Unifying distillation and privileged information.arXiv preprint arXiv:1511.03643, 2015

Lopez-Paz, D., Bottou, L., Schölkopf, B., Vapnik, V.: Unifying distillation and privileged information. arXiv preprint arXiv:1511.03643 (2015)

work page arXiv 2015
[25]

In: Interspeech

Markov, K., Matsui, T.: Robust speech recognition using generalized distillation framework. In: Interspeech. pp. 2364–2368 (2016)

work page 2016
[26]

Nature518(7540), 529–533 (2015) Neurosymbolic Imitation Learning with Privileged Information 15

Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G., et al.: Human-level control through deep reinforcement learning. Nature518(7540), 529–533 (2015) Neurosymbolic Imitation Learning with Privileged Information 15

work page 2015
[27]

New generation computing8(4), 295– 318 (1991)

Muggleton, S.: Inductive logic programming. New generation computing8(4), 295– 318 (1991)

work page 1991
[28]

In: Proceed- ings of the Seventeenth International Conference on Machine Learning (ICML)

Ng, A.Y., Russell, S.: Algorithms for inverse reinforcement learning. In: Proceed- ings of the Seventeenth International Conference on Machine Learning (ICML). pp. 663–670 (2000)

work page 2000
[29]

Foundations and Trends®in Robotics 7(1-2), 1–179 (2018)

Osa, T., Pajarinen, J., Neumann, G., Bagnell, J.A., Abbeel, P., Peters, J.: An algo- rithmic perspective on imitation learning. Foundations and Trends®in Robotics 7(1-2), 1–179 (2018)

work page 2018
[30]

PLoS one17(3), e0264471 (2022)

Pfeiffer, C., Wengeler, S., Loquercio, A., Scaramuzza, D.: Visual attention predic- tion improves performance of autonomous drone racing agents. PLoS one17(3), e0264471 (2022)

work page 2022
[31]

Artificial Intelli- gence64(1), 81–129 (1993)

Poole, D.: Probabilistic horn abduction and bayesian networks. Artificial Intelli- gence64(1), 81–129 (1993)

work page 1993
[32]

In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics

Ross, S., Bagnell, D.: Efficient reductions for imitation learning. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics. pp. 661–668. JMLR Workshop and Conference Proceedings (2010)

work page 2010
[33]

In: Proceedings of the Fourteenth Interna- tional Conference on Artificial Intelligence and Statistics (AISTATS)

Ross, S., Gordon, G., Bagnell, D.: A reduction of imitation learning and structured prediction to no-regret online learning. In: Proceedings of the Fourteenth Interna- tional Conference on Artificial Intelligence and Statistics (AISTATS). pp. 627–635. JMLR Workshop and Conference Proceedings (2011)

work page 2011
[34]

Sammut, C., et al.: Building symbolic representations of intuitive real-time skills from performance data (1992)

work page 1992
[35]

arXiv preprint arXiv:2002.12500 (2020)

Saran, A., Zhang, R., Short, E.S., Niekum, S.: Efficiently guiding imitation learning agents with human gaze. arXiv preprint arXiv:2002.12500 (2020)

work page arXiv 2002
[36]

In: Proceedings of the 12th International Conference on Logic Programming (ICLP)

Sato, T.: A statistical learning method for logic programs with distribution seman- tics. In: Proceedings of the 12th International Conference on Logic Programming (ICLP). pp. 715–729. MIT Press (1995)

work page 1995
[37]

In: Proceedings of the 1985 IEEE Interna- tional Conference on Robotics and Automation

Segre, A.M., DeJong, G.: Explanation-based manipulator learning: Acquisition of planning ability through observation. In: Proceedings of the 1985 IEEE Interna- tional Conference on Robotics and Automation. pp. 555–560 (1985)

work page 1985
[38]

In: Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI)

Shavlik, J., Natarajan, S.: Speeding up inference in Markov Logic Networks by preprocessing to reduce the size of the resulting grounded network. In: Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI). pp. 1951–1956 (2009)

work page 1951
[39]

arXiv preprint arXiv:2110.09383 (2021)

Shindo, H., Dhami, D.S., Kersting, K.: Neuro-symbolic forward reasoning. arXiv preprint arXiv:2110.09383 (2021)

work page arXiv 2021
[40]

The MIT Press, 2nd edn

Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. The MIT Press, 2nd edn. (2018)

work page 2018
[41]

In: Interactive Learning with Implicit Human Feedback Workshop at ICML (2023)

Thakur, R.K., Sunbeam, M.N.S., Goecks, V.G., Novoseller, E., Bera, R., Lawhern, V.J., Gremillion, G.M., Valasek, J., Waytowich, N.R.: Imitation learning with hu- man eye gaze via multi-objective prediction. In: Interactive Learning with Implicit Human Feedback Workshop at ICML (2023)

work page 2023
[42]

Journal of Machine Learning Research16(61), 2023–2049 (2015),http://jmlr.org/papers/v16/vapnik15b.html

Vapnik, V., Izmailov, R.: Learning using privileged information: Similarity control and knowledge transfer. Journal of Machine Learning Research16(61), 2023–2049 (2015),http://jmlr.org/papers/v16/vapnik15b.html

work page 2023
[43]

Neural Networks22(5-6), 544–557 (2009)

Vapnik, V., Vashist, A.: A new learning paradigm: Learning using privileged infor- mation. Neural Networks22(5-6), 544–557 (2009)

work page 2009
[44]

In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision

Xia, Y., Kim, J., Canny, J., Zipser, K., Canas-Bajo, T., Whitney, D.: Periphery- fovea multi-resolution driving model guided by human attention. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. pp. 1767–1775 (2020) 16 N. Prabhakar et al

work page 2020
[45]

Frontiers Artif

Yan, S., Odom, P., Pasunuri, R., Kersting, K., Natarajan, S.: Learning with priv- ileged and sensitive information: a gradient-boosting approach. Frontiers Artif. Intell.6(2023)

work page 2023
[46]

In: NeurIPS (2022)

Yang, S., Sanghavi, S., Rahmanian, H., Bakus, J., Vishwanathan, S.V.N.: To- ward understanding privileged features distillation in learning-to-rank. In: NeurIPS (2022)

work page 2022
[47]

In: Proceedings of the European Conference on Computer Vision (ECCV)

Zhang, R., Liu, Z., Zhang, L., Whritner, J.A., Muller, K.S., Hayhoe, M.M., Ballard, D.H.: Agil: Learning attention from human for visuomotor tasks. In: Proceedings of the European Conference on Computer Vision (ECCV). pp. 663–679 (2018)

work page 2018
[48]

In: Proceedings of the AAAI Conference on Artificial Intelligence

Zhang, R., Walshe, C., Liu, Z., Guan, L., Muller, K.S., Whritner, J.A., Zhang, L., Hayhoe, M.M., Ballard, D.H.: Atari-HEAD: Atari human eye-tracking and demon- stration dataset. In: Proceedings of the AAAI Conference on Artificial Intelligence. vol. 34, pp. 6811–6820 (2020) Neurosymbolic Imitation Learning with Human Guidance: A Privileged Information App...

work page 2020