$\text{DT}^2$: Decision-Targeted Digital Twins

Harry Amad; Mihaela van der Schaar

arxiv: 2606.25923 · v1 · pith:JHI7YOCNnew · submitted 2026-06-24 · 💻 cs.LG

DT²: Decision-Targeted Digital Twins

Harry Amad , Mihaela van der Schaar This is my paper

Pith reviewed 2026-06-25 20:06 UTC · model grok-4.3

classification 💻 cs.LG

keywords digital twinsdecision-targeted trainingpolicy rankingfitted Q-evaluationoffline reinforcement learningmodel-based decision making

0 comments

The pith

Digital twins trained to minimize transition errors can rank policies suboptimally, but DT² targets rankings to improve selection.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proves that minimizing one-step transition errors in digital twins does not always produce the best models for ranking policies according to expected reward when model capacity is limited. This holds even with expressive model classes in empirical tests. DT² first applies fitted Q-evaluation to offline data to estimate candidate policy values, then trains the twin so its generated rollouts preserve the pairwise rankings implied by those estimates. The resulting models improve policy ranking accuracy and lower decision regret for both training policies and unseen ones, while keeping simulation fidelity comparable to standard training.

Core claim

When model capacity is limited, training DTs to minimise one-step transition errors can produce suboptimal models for ranking sets of policies according to a reward function. DT² uses fitted Q-evaluation to estimate values of candidate policies from offline data. A DT is then trained to generate rollouts that preserve pairwise policy rankings derived from these proxy ground-truth values with an architecture-agnostic loss function.

What carries the argument

Architecture-agnostic loss that trains the digital twin to preserve pairwise policy rankings obtained from fitted Q-evaluation on offline data.

If this is right

DT² yields higher accuracy when ranking policies by expected reward.
Decision regret drops during policy selection from the ranked set.
Gains appear for both policies present in the offline data and for new policies.
Simulation fidelity measured by raw transition error stays comparable to baseline training.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Training objectives for simulators used in decisions should align directly with downstream ranking or selection tasks rather than generic prediction metrics.
The ranking-preservation idea could apply to other model-based planning systems where the end goal is comparative evaluation of action sequences.
Testing the method on continuous policy parameterizations would show whether the pairwise approach scales beyond discrete candidate sets.

Load-bearing premise

Pairwise policy rankings from fitted Q-evaluation on offline data serve as a reliable proxy for true relative policy values.

What would settle it

A controlled benchmark with known true policy values where DT² produces higher decision regret than a transition-error trained twin on the same data and architecture.

Figures

Figures reproduced from arXiv: 2606.25923 by Harry Amad, Mihaela van der Schaar.

**Figure 1.** Figure 1: We visualise two diabetes DTs, Φˆ 1 (top) and Φˆ 2 (bottom), under treatment plans with high, low, and adaptive insulin levels, and we consider the ‘value’ of a treatment plan as the time it spends in the optimal glucose region. Φˆ 1 has lower MSE from the true rollouts, yet it leads to an incorrect treatment plan preference ordering (high ≈ low ≻ adaptive). In contrast, Φˆ 2 leads to a correct preference … view at source ↗

**Figure 2.** Figure 2: When Φ ∈ F/ , the Lsim-optimal DT, Φˆ θ∗ , may land in a region of low decision quality for some candidate policy set Π and reward function r (grey areas). With decision-targeted DT training, we wish to find Φˆ θ′ that resides in a better decision region (green areas), at a small simulation fidelity cost. Proof. See Appendix A.1. From Theorem 3.1 we can see that, when a DT cannot exactly model its target … view at source ↗

**Figure 3.** Figure 3: State delta across action values, as determined by mechanistic models learnt via L MSE sim and LDT2 , compared to the true Φ. Black dots represent constant-action policies in Π. 6.1.3. PARTIALLY OBSERVED DYNAMICS Finally, we investigate a system with partially observable dynamics, where an unobserved latent variable zt dictates 6 [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Regret and Spearman’s correlation for preference orderings from L NLL sim (hashed) and LDT2 (solid) DTs across base architectures in six continuous control environments. We report averages over 10 seeds, with 95% CIs. the transition distribution, ensuring Φ ∈ F/ even with an expressive hypothesis space of three-layer MLPs. We consider a one-dimensional system governed by xt+1 = xt +at +zt, where zt ∼ N (0… view at source ↗

**Figure 5.** Figure 5: plots the Spearman’s correlation and test MSE obtained across these λ settings (N.B. that the λ = 0 setting is equivalent to standard training with L NLL sim only). As expected, we observe a consistent increase in ranking performance and decrease in simulation fidelity as λ increases. Practitioners can therefore effectively use λ to navigate to their desired point along this trade-off during training. Ho… view at source ↗

**Figure 6.** Figure 6: Regret and Spearman’s correlation for preference orderings from L MSE sim (hashed) and LDT2 (solid) DTs across base architectures in six continuous control environments. We report averages over 10 seeds, with 95% CIs. For visual purposes, the top end of some error bars are cropped out. 25 [PITH_FULL_IMAGE:figures/full_fig_p025_6.png] view at source ↗

**Figure 7.** Figure 7: Regret and Spearman’s correlation for preference orderings from DTs trained via L NLL sim DTs (hashed) and LDT2 with ranking losses of our smoothed Kendall’s (solid), a hinge loss (circles), and a ListNet loss (dotted) across different base architectures. We report averages over 10 seeds, with 95% CIs. For visual purposes, the top end of some bars are cropped out. 27 [PITH_FULL_IMAGE:figures/full_fig_p027… view at source ↗

read the original abstract

A digital twin (DT) is a virtual model of a real-world system that can assist decision-making by simulating scenarios induced by different policies. However, typical machine learning-based DTs do not optimise for this use case. We prove that, when model capacity is limited, training DTs to minimise one-step transition errors can produce suboptimal models for ranking sets of policies according to a reward function. We further show that this holds empirically, even with expressive model classes. To address this, we introduce $\text{DT}^2$, a decision-targeted DT training paradigm. Firstly, $\text{DT}^2$ uses fitted Q-evaluation to estimate values of candidate policies from offline data. A DT is then trained to generate rollouts that preserve pairwise policy rankings derived from these proxy ground-truth values with an architecture-agnostic loss function. We empirically demonstrate the efficacy of our method across a range of settings and architectures. $\text{DT}^2$ consistently improves policy ranking and reduces decision regret during policy selection relative to conventional DT training, both for policies used during training and for unseen policies, while maintaining a good level of raw simulation fidelity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

DT² targets policy ranking over transition accuracy in digital twins via FQE-derived rankings, with a limited-capacity proof and reported empirical gains, but the proxy's reliability remains untested.

read the letter

The main thing here is that standard one-step transition training for digital twins can produce models that rank policies poorly even when they fit the data, and DT² tries to correct this by training directly on pairwise rankings from fitted Q-evaluation.

The paper establishes a proof that limited-capacity models minimizing transition error are suboptimal for policy ranking under a reward. It then shows the issue persists empirically with expressive models and introduces DT², which first runs FQE on offline data to get proxy values, derives rankings, and applies an architecture-agnostic loss so the DT's rollouts preserve those rankings. They report consistent improvements in policy ranking accuracy and lower decision regret for both training policies and unseen ones, while simulation fidelity stays reasonable.

The soft spot is the dependence on FQE rankings as reliable targets. If the offline data has poor coverage or FQE introduces bias, the DT learns to match flawed rankings rather than true values. The abstract gives no check against ground-truth policy values from on-policy evaluation or other means, so the central assumption stays unverified. The proof covers only limited capacity; the expressive-model claim still rests on the same proxy without additional safeguards.

This is relevant for people working on model-based RL, control, or digital twins in data-limited or safety-critical domains where the end goal is policy selection rather than pure prediction. The formal result plus cross-architecture experiments give it enough substance to go to peer review, though referees will need to examine the proxy validation and experimental controls closely.

Referee Report

2 major / 2 minor

Summary. The manuscript claims that conventional digital twin (DT) training via one-step transition error minimization can lead to suboptimal models for ranking policies by their values under a reward function, with a proof provided for limited model capacity and empirical evidence even for expressive models. To address this, DT² is introduced, which first applies fitted Q-evaluation (FQE) to offline data to obtain proxy policy values, then trains the DT using an architecture-agnostic loss to preserve the pairwise rankings induced by these proxies. Empirical results demonstrate that DT² improves policy ranking accuracy and reduces decision regret for both in-training and unseen policies compared to standard training, while preserving simulation fidelity.

Significance. If the results hold, this paper makes a significant contribution by highlighting the misalignment between standard simulation objectives and decision-making utility in digital twins, offering a targeted training approach that directly optimizes for policy ranking. The provision of a proof for the limited-capacity case and consistent empirical improvements across multiple settings and architectures are strengths that could guide future work in model-based RL and digital twin applications. The architecture-agnostic nature of the loss is particularly practical.

major comments (2)

[Abstract and method description] The central claim that DT² yields better policy selection relies on the assumption that FQE-derived pairwise rankings serve as a reliable proxy for true policy values. However, the manuscript does not appear to include an independent validation of this proxy (e.g., via on-policy rollouts in environments with known ground truth), which is load-bearing for the decision-targeted objective and the reported reductions in decision regret.
[Proof section (referenced in abstract)] The proof establishes suboptimality only under limited model capacity; the extension to expressive model classes is purely empirical. If the proof technique cannot be extended, this should be explicitly discussed as a limitation of the theoretical contribution.

minor comments (2)

[Abstract] The abstract states that DT² 'consistently improves' but does not quantify the improvements or mention the number of environments/architectures tested; adding such details would strengthen the summary.
[Notation] Ensure that the definition of the loss function in the DT² paradigm is clearly distinguished from standard transition losses, perhaps with an equation number for easy reference.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which highlight important aspects of our theoretical and empirical claims. We address each major comment below and outline planned revisions.

read point-by-point responses

Referee: [Abstract and method description] The central claim that DT² yields better policy selection relies on the assumption that FQE-derived pairwise rankings serve as a reliable proxy for true policy values. However, the manuscript does not appear to include an independent validation of this proxy (e.g., via on-policy rollouts in environments with known ground truth), which is load-bearing for the decision-targeted objective and the reported reductions in decision regret.

Authors: We agree that validating the FQE proxy is important for the decision-targeted objective. While our empirical results already measure final policy selection quality and decision regret using ground-truth values (via on-policy evaluation in the benchmark environments), we did not explicitly compare FQE estimates against these ground-truth values in the original manuscript. In the revision, we will add a dedicated validation subsection reporting the correlation between FQE-derived rankings and true policy values across the environments, thereby confirming the proxy's reliability in the settings studied. revision: yes
Referee: [Proof section (referenced in abstract)] The proof establishes suboptimality only under limited model capacity; the extension to expressive model classes is purely empirical. If the proof technique cannot be extended, this should be explicitly discussed as a limitation of the theoretical contribution.

Authors: The referee is correct: the formal proof applies only to limited-capacity models, with the expressive-model case supported solely by experiments. We will revise the manuscript to explicitly state this scope as a limitation of the theoretical contribution and note that extending the proof technique to general function classes is left for future work. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper's central derivation consists of a capacity-limited proof that one-step transition minimization can be suboptimal for policy ranking, plus an empirical method that explicitly trains on FQE-derived pairwise rankings as an external proxy target via a new loss. This does not reduce any claimed result to its inputs by construction, nor does it involve self-definitional mappings, fitted inputs renamed as predictions, load-bearing self-citations, imported uniqueness theorems, smuggled ansatzes, or renamed known results. The FQE proxy is treated as an independent (if imperfect) benchmark rather than being generated from the DT itself, leaving the derivation self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Review based on abstract only. The central claim rests on the domain assumption that fitted Q-evaluation yields usable proxy rankings and that preserving those rankings improves decisions more than transition accuracy.

axioms (1)

domain assumption Fitted Q-evaluation on offline data produces reliable pairwise policy rankings that should be preserved by the digital twin.
Invoked to generate the proxy ground-truth values used in the DT² loss.

pith-pipeline@v0.9.1-grok · 5730 in / 1311 out tokens · 23866 ms · 2026-06-25T20:06:02.339399+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

167 extracted references · 8 canonical work pages

[1]

Journal of medical Internet research , volume=

Optimal treatment selection in sequential systemic and locoregional therapy of oropharyngeal squamous carcinomas: deep Q-learning with a patient-physician digital twin dyad , author=. Journal of medical Internet research , volume=. 2022 , publisher=

2022
[2]

The International Journal of Advanced Manufacturing Technology , volume=

Digital twin-driven product design, manufacturing and service with big data , author=. The International Journal of Advanced Manufacturing Technology , volume=. 2018 , publisher=

2018
[3]

European Heart Journal , volume =

Corral-Acero, Jorge and Margara, Francesca and Marciniak, Maciej and Rodero, Cristobal and Loncaric, Filip and Feng, Yingjing and Gilbert, Andrew and Fernandes, Joao F and Bukhari, Hassaan A and Wajdan, Ali and Martinez, Manuel Villegas and Santos, Mariana Sousa and Shamohammdi, Mehrdad and Luo, Hongxing and Westphal, Philip and Leeson, Paul and DiAchille...

work page doi:10.1093/eurheartj/ehaa159 2020
[4]

Energy , volume=

Hybrid mechanistic and neural network modeling of nuclear reactors , author=. Energy , volume=. 2023 , publisher=

2023
[5]

Current Opinion in Chemical Engineering , volume=

Hybrid modeling—a key enabler towards realizing digital twins in biopharma? , author=. Current Opinion in Chemical Engineering , volume=. 2021 , publisher=

2021
[6]

The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=

Automatically Learning Hybrid Digital Twins of Dynamical Systems , author=. The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=
[7]

IEEE Transactions on Knowledge and Data Engineering , year=

Promptcast: A new prompt-based learning paradigm for time series forecasting , author=. IEEE Transactions on Knowledge and Data Engineering , year=
[8]

Advances in Neural Information Processing Systems , volume=

Large language models are zero-shot time series forecasters , author=. Advances in Neural Information Processing Systems , volume=. 2024 , url=

2024
[9]

The Twelfth International Conference on Learning Representations , year=

Time-LLM: Time Series Forecasting by Reprogramming Large Language Models , author=. The Twelfth International Conference on Learning Representations , year=
[10]

2024.Foundational Research Gaps and Future Directions for Digital Twins

Foundational Research Gaps and Future Directions for Digital Twins , isbn =. doi:10.17226/26894 , year =

work page doi:10.17226/26894
[11]

Advances in neural information processing systems , volume=

Language models are few-shot learners , author=. Advances in neural information processing systems , volume=. 2020 , url=

2020
[12]

Advances in neural information processing systems , volume=

One fits all: Power general time series analysis by pretrained LM , author=. Advances in neural information processing systems , volume=. 2023 , url=

2023
[13]

Advances in neural information processing systems , volume=

Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting , author=. Advances in neural information processing systems , volume=. 2021 , url=

2021
[14]

arXiv preprint arXiv:2004.04906 , year=

Dense passage retrieval for open-domain question answering , author=. arXiv preprint arXiv:2004.04906 , year=

Pith/arXiv arXiv 2004
[15]

2024 , url =

Novel Drug Therapy Approvals 2024 , journal =. 2024 , url =

2024
[16]

Journal of Rare Diseases , volume=

Emerging biomarkers for precision diagnosis and personalized treatment of cystic fibrosis , author=. Journal of Rare Diseases , volume=. 2024 , publisher=

2024
[17]

International conference on machine learning , pages=

Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting , author=. International conference on machine learning , pages=. 2022 , organization=

2022
[18]

arXiv preprint arXiv:2211.14730 , year=

A time series is worth 64 words: Long-term forecasting with transformers , author=. arXiv preprint arXiv:2211.14730 , year=

Pith/arXiv arXiv
[19]

The Thirty-Fifth

Haoyi Zhou and Shanghang Zhang and Jieqi Peng and Shuai Zhang and Jianxin Li and Hui Xiong and Wancai Zhang , title =. The Thirty-Fifth
[20]

arXiv preprint arXiv:2402.19072 , year=

Timexer: Empowering transformers for time series forecasting with exogenous variables , author=. arXiv preprint arXiv:2402.19072 , year=

arXiv
[21]

Statistics and computing , volume=

Genetic programming as a means for programming computers by natural selection , author=. Statistics and computing , volume=. 1994 , publisher=

1994
[22]

Brunton and Joshua L

Steven L. Brunton and Joshua L. Proctor and J. Nathan Kutz , title =. Proceedings of the National Academy of Sciences , volume =. 2016 , doi =

2016
[23]

International Conference on Learning Representations , year=

D-code: Discovering closed-form odes from observed trajectories , author=. International Conference on Learning Representations , year=
[24]

Journal of Computational physics , volume=

Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations , author=. Journal of Computational physics , volume=. 2019 , publisher=

2019
[25]

Journal of Statistical Mechanics: Theory and Experiment , volume=

Augmenting physical models with deep networks for complex dynamics forecasting , author=. Journal of Statistical Mechanics: Theory and Experiment , volume=. 2021 , publisher=

2021
[26]

Advances in Neural Information Processing Systems , volume=

Integrating expert ODEs into neural ODEs: pharmacology and disease progression , author=. Advances in Neural Information Processing Systems , volume=. 2021 , url=

2021
[27]

arXiv preprint arXiv:2202.03881 , year=

Robust hybrid learning with expert augmentation , author=. arXiv preprint arXiv:2202.03881 , year=

arXiv
[28]

arXiv preprint arXiv:2310.04948 , year=

Tempo: Prompt-based generative pre-trained transformer for time series forecasting , author=. arXiv preprint arXiv:2310.04948 , year=

arXiv
[29]

The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=

Med-Real2Sim: Non-Invasive Medical Digital Twins using Physics-Informed Self-Supervised Learning , author=. The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=
[30]

Advances in Neural Information Processing Systems , volume=

Synctwin: Treatment effect estimation with longitudinal outcomes , author=. Advances in Neural Information Processing Systems , volume=
[31]

New England Journal of Medicine , volume=

A CFTR potentiator in patients with cystic fibrosis and the G551D mutation , author=. New England Journal of Medicine , volume=. 2011 , publisher=

2011
[32]

International Conference on Learning Representations , year=

TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis , author=. International Conference on Learning Representations , year=
[33]

arXiv preprint arXiv:2407.13278 , year=

Deep Time Series Models: A Comprehensive Survey and Benchmark , author=. arXiv preprint arXiv:2407.13278 , year=

Pith/arXiv arXiv
[34]

Proceedings of the AAAI conference on artificial intelligence , volume=

Are transformers effective for time series forecasting? , author=. Proceedings of the AAAI conference on artificial intelligence , volume=
[35]

Advances in Neural Information Processing Systems , volume=

Non-stationary transformers: Exploring the stationarity in time series forecasting , author=. Advances in Neural Information Processing Systems , volume=
[36]

arXiv preprint arXiv:2405.14616 , year=

Timemixer: Decomposable multiscale mixing for time series forecasting , author=. arXiv preprint arXiv:2405.14616 , year=

arXiv
[37]

IEEE transactions on knowledge and data engineering , volume=

Learning under concept drift: A review , author=. IEEE transactions on knowledge and data engineering , volume=. 2018 , publisher=

2018
[38]

Frontiers of Computer Science , volume=

Large language models make sample-efficient recommender systems , author=. Frontiers of Computer Science , volume=. 2025 , publisher=

2025
[39]

npj Digital Medicine , volume=

Large language models forecast patient health trajectories enabling digital twins , author=. npj Digital Medicine , volume=. 2025 , publisher=

2025
[40]

NPJ Digital Medicine , volume=

Digital twins for health: a scoping review , author=. NPJ Digital Medicine , volume=. 2024 , publisher=

2024
[41]

doi: https://doi.org/10.1016/0364-0213(90)90002-E

Finding structure in time , journal =. 1990 , issn =. doi:https://doi.org/10.1016/0364-0213(90)90002-E , url =

work page doi:10.1016/0364-0213(90)90002-e 1990
[42]

Supervised Sequence Labelling with Recurrent Neural Networks , pages=

Long Short-Term Memory , author=. Supervised Sequence Labelling with Recurrent Neural Networks , pages=. 2012 , publisher=

2012
[43]

International Conference on Machine Learning , pages=

Causal transformer for estimating counterfactual outcomes , author=. International Conference on Machine Learning , pages=. 2022 , organization=

2022
[44]

Advances in neural information processing systems , volume=

Neural ordinary differential equations , author=. Advances in neural information processing systems , volume=. 2018 , url=

2018
[45]

Advances in neural information processing systems , volume=

Augmented neural odes , author=. Advances in neural information processing systems , volume=. 2019 , url=

2019
[46]

Advances in Neural Information Processing Systems , volume=

Physics-integrated variational autoencoders for robust and interpretable generative modeling , author=. Advances in Neural Information Processing Systems , volume=. 2021 , url=

2021
[47]

UPRISE : Universal Prompt Retrieval for Improving Zero-Shot Evaluation

Cheng, Daixuan and Huang, Shaohan and Bi, Junyu and Zhan, Yuefeng and Liu, Jianfeng and Wang, Yujing and Sun, Hao and Wei, Furu and Deng, Weiwei and Zhang, Qi. UPRISE : Universal Prompt Retrieval for Improving Zero-Shot Evaluation. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023. doi:10.18653/v1/2023.emnlp-main.758

work page doi:10.18653/v1/2023.emnlp-main.758 2023
[48]

Findings of the Association for Computational Linguistics ACL 2024 , pages=

se2: Sequential example selection for in-context learning , author=. Findings of the Association for Computational Linguistics ACL 2024 , pages=. 2024 , url=

2024
[49]

nature , volume=

Learning representations by back-propagating errors , author=. nature , volume=. 1986 , publisher=

1986
[50]

arXiv preprint arXiv:2009.04278 , year=

Dynode: Neural ordinary differential equations for dynamics modeling in continuous control , author=. arXiv preprint arXiv:2009.04278 , year=

arXiv 2009
[51]

Journal of the American statistical Association , volume=

Strictly proper scoring rules, prediction, and estimation , author=. Journal of the American statistical Association , volume=. 2007 , publisher=

2007
[52]

arXiv preprint arXiv:1807.03748 , year=

Representation learning with contrastive predictive coding , author=. arXiv preprint arXiv:1807.03748 , year=

Pith/arXiv arXiv
[53]

International conference on artificial neural networks , pages=

Multi-dimensional recurrent neural networks , author=. International conference on artificial neural networks , pages=. 2007 , organization=

2007
[54]

Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell

Bender, Emily M. and Gebru, Timnit and McMillan-Major, Angelina and Shmitchell, Shmargaret , title =. 2021 , isbn =. doi:10.1145/3442188.3445922 , booktitle =

work page doi:10.1145/3442188.3445922 2021
[55]

arXiv preprint arXiv:2412.13663 , year=

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference , author=. arXiv preprint arXiv:2412.13663 , year=

Pith/arXiv arXiv
[56]

International Conference on Learning Representations (ICLR) , year =

Adam: A Method for Stochastic Optimization , author =. International Conference on Learning Representations (ICLR) , year =. 1412.6980 , archivePrefix =

Pith/arXiv arXiv
[57]

Advances in neural information processing systems , volume=

Pytorch: An imperative style, high-performance deep learning library , author=. Advances in neural information processing systems , volume=. 2019 , url=

2019
[58]

arXiv preprint arXiv:1704.08863 , year=

On weight initialization in deep neural networks , author=. arXiv preprint arXiv:1704.08863 , year=

Pith/arXiv arXiv
[59]

Thomas Wolf and Lysandre Debut and Victor Sanh and Julien Chaumond and Clement Delangue and Anthony Moi and Pierric Cistac and Tim Rault and Rémi Louf and Morgan Funtowicz and Joe Davison and Sam Shleifer and Patrick von Platen and Clara Ma and Yacine Jernite and Julien Plu and Canwen Xu and Teven Le Scao and Sylvain Gugger and Mariama Drame and Quentin L...

2020
[60]

Transactions of the Association for Computational Linguistics , volume=

Lost in the middle: How language models use long contexts , author=. Transactions of the Association for Computational Linguistics , volume=. 2024 , publisher=

2024
[61]

The Thirteenth International Conference on Learning Representations , year=

No Equations Needed: Learning System Dynamics Without Relying on Closed-Form ODEs , author=. The Thirteenth International Conference on Learning Representations , year=
[62]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=

Context versus Prior Knowledge in Language Models , author=. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=. 2024 , organization=

2024
[63]

Nature Computational Science , volume=

Digital twins in medicine , author=. Nature Computational Science , volume=. 2024 , publisher=

2024
[64]

European Proceedings of Social and Behavioural Sciences , year=

Impact of digital twin technology on the financial performance of corporations , author=. European Proceedings of Social and Behavioural Sciences , year=
[65]

Communications Earth & Environment , volume=

Digital twins of the Earth with and for humans , author=. Communications Earth & Environment , volume=. 2024 , publisher=

2024
[66]

2024 , issn =

Artificial intelligence in digital twins—A systematic literature review , journal =. 2024 , issn =. doi:https://doi.org/10.1016/j.datak.2024.102304 , url =

work page doi:10.1016/j.datak.2024.102304 2024
[67]

Scientific reports , volume=

Prediction of treatment response for combined chemo-and radiation therapy for non-small cell lung cancer patients using a bio-mathematical model , author=. Scientific reports , volume=. 2017 , publisher=

2017
[68]

International Conference on Learning Representations , year=

Estimating counterfactual treatment outcomes over time through adversarially balanced representations , author=. International Conference on Learning Representations , year=
[69]

International Conference on Machine Learning , pages=

Continuous-Time Modeling of Counterfactual Outcomes Using Neural Controlled Differential Equations , author=. International Conference on Machine Learning , pages=. 2022 , organization=

2022
[70]

Cochrane Database of Systematic Reviews , number=

Dornase alfa for cystic fibrosis , author=. Cochrane Database of Systematic Reviews , number=. 2021 , publisher=

2021
[71]

Respiratory Research , volume=

Predicting lung function decline in cystic fibrosis: the impact of initiating ivacaftor therapy , author=. Respiratory Research , volume=. 2024 , publisher=

2024
[72]

ACM computing surveys , volume=

Survey of hallucination in natural language generation , author=. ACM computing surveys , volume=. 2023 , publisher=

2023
[73]

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=

Neural Machine Translation of Rare Words with Subword Units , author=. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=. 2016 , organization=

2016
[74]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics , pages=

On Faithfulness and Factuality in Abstractive Summarization , author=. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics , pages=. 2020 , organization=

2020
[75]

arXiv preprint arXiv:2305.14201 , year=

Goat: Fine-tuned llama outperforms gpt-4 on arithmetic tasks , author=. arXiv preprint arXiv:2305.14201 , year=

arXiv
[76]

arXiv preprint arXiv:2302.13971 , year=

Llama: Open and efficient foundation language models , author=. arXiv preprint arXiv:2302.13971 , year=

Pith/arXiv arXiv
[77]

Nature , volume=

Self-organizing neural network that discovers surfaces in random-dot stereograms , author=. Nature , volume=. 1992 , publisher=

1992
[78]

2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06) , volume=

Dimensionality reduction by learning an invariant mapping , author=. 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06) , volume=. 2006 , organization=

2006
[79]

International conference on machine learning , pages=

A simple framework for contrastive learning of visual representations , author=. International conference on machine learning , pages=. 2020 , organization=

2020
[80]

Active Task Disambiguation with

Kasia Kobalczyk and Nicol. Active Task Disambiguation with. The Thirteenth International Conference on Learning Representations , year=

Showing first 80 references.

[1] [1]

Journal of medical Internet research , volume=

Optimal treatment selection in sequential systemic and locoregional therapy of oropharyngeal squamous carcinomas: deep Q-learning with a patient-physician digital twin dyad , author=. Journal of medical Internet research , volume=. 2022 , publisher=

2022

[2] [2]

The International Journal of Advanced Manufacturing Technology , volume=

Digital twin-driven product design, manufacturing and service with big data , author=. The International Journal of Advanced Manufacturing Technology , volume=. 2018 , publisher=

2018

[3] [3]

European Heart Journal , volume =

Corral-Acero, Jorge and Margara, Francesca and Marciniak, Maciej and Rodero, Cristobal and Loncaric, Filip and Feng, Yingjing and Gilbert, Andrew and Fernandes, Joao F and Bukhari, Hassaan A and Wajdan, Ali and Martinez, Manuel Villegas and Santos, Mariana Sousa and Shamohammdi, Mehrdad and Luo, Hongxing and Westphal, Philip and Leeson, Paul and DiAchille...

work page doi:10.1093/eurheartj/ehaa159 2020

[4] [4]

Energy , volume=

Hybrid mechanistic and neural network modeling of nuclear reactors , author=. Energy , volume=. 2023 , publisher=

2023

[5] [5]

Current Opinion in Chemical Engineering , volume=

Hybrid modeling—a key enabler towards realizing digital twins in biopharma? , author=. Current Opinion in Chemical Engineering , volume=. 2021 , publisher=

2021

[6] [6]

The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=

Automatically Learning Hybrid Digital Twins of Dynamical Systems , author=. The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=

[7] [7]

IEEE Transactions on Knowledge and Data Engineering , year=

Promptcast: A new prompt-based learning paradigm for time series forecasting , author=. IEEE Transactions on Knowledge and Data Engineering , year=

[8] [8]

Advances in Neural Information Processing Systems , volume=

Large language models are zero-shot time series forecasters , author=. Advances in Neural Information Processing Systems , volume=. 2024 , url=

2024

[9] [9]

The Twelfth International Conference on Learning Representations , year=

Time-LLM: Time Series Forecasting by Reprogramming Large Language Models , author=. The Twelfth International Conference on Learning Representations , year=

[10] [10]

2024.Foundational Research Gaps and Future Directions for Digital Twins

Foundational Research Gaps and Future Directions for Digital Twins , isbn =. doi:10.17226/26894 , year =

work page doi:10.17226/26894

[11] [11]

Advances in neural information processing systems , volume=

Language models are few-shot learners , author=. Advances in neural information processing systems , volume=. 2020 , url=

2020

[12] [12]

Advances in neural information processing systems , volume=

One fits all: Power general time series analysis by pretrained LM , author=. Advances in neural information processing systems , volume=. 2023 , url=

2023

[13] [13]

Advances in neural information processing systems , volume=

Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting , author=. Advances in neural information processing systems , volume=. 2021 , url=

2021

[14] [14]

arXiv preprint arXiv:2004.04906 , year=

Dense passage retrieval for open-domain question answering , author=. arXiv preprint arXiv:2004.04906 , year=

Pith/arXiv arXiv 2004

[15] [15]

2024 , url =

Novel Drug Therapy Approvals 2024 , journal =. 2024 , url =

2024

[16] [16]

Journal of Rare Diseases , volume=

Emerging biomarkers for precision diagnosis and personalized treatment of cystic fibrosis , author=. Journal of Rare Diseases , volume=. 2024 , publisher=

2024

[17] [17]

International conference on machine learning , pages=

Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting , author=. International conference on machine learning , pages=. 2022 , organization=

2022

[18] [18]

arXiv preprint arXiv:2211.14730 , year=

A time series is worth 64 words: Long-term forecasting with transformers , author=. arXiv preprint arXiv:2211.14730 , year=

Pith/arXiv arXiv

[19] [19]

The Thirty-Fifth

Haoyi Zhou and Shanghang Zhang and Jieqi Peng and Shuai Zhang and Jianxin Li and Hui Xiong and Wancai Zhang , title =. The Thirty-Fifth

[20] [20]

arXiv preprint arXiv:2402.19072 , year=

Timexer: Empowering transformers for time series forecasting with exogenous variables , author=. arXiv preprint arXiv:2402.19072 , year=

arXiv

[21] [21]

Statistics and computing , volume=

Genetic programming as a means for programming computers by natural selection , author=. Statistics and computing , volume=. 1994 , publisher=

1994

[22] [22]

Brunton and Joshua L

Steven L. Brunton and Joshua L. Proctor and J. Nathan Kutz , title =. Proceedings of the National Academy of Sciences , volume =. 2016 , doi =

2016

[23] [23]

International Conference on Learning Representations , year=

D-code: Discovering closed-form odes from observed trajectories , author=. International Conference on Learning Representations , year=

[24] [24]

Journal of Computational physics , volume=

Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations , author=. Journal of Computational physics , volume=. 2019 , publisher=

2019

[25] [25]

Journal of Statistical Mechanics: Theory and Experiment , volume=

Augmenting physical models with deep networks for complex dynamics forecasting , author=. Journal of Statistical Mechanics: Theory and Experiment , volume=. 2021 , publisher=

2021

[26] [26]

Advances in Neural Information Processing Systems , volume=

Integrating expert ODEs into neural ODEs: pharmacology and disease progression , author=. Advances in Neural Information Processing Systems , volume=. 2021 , url=

2021

[27] [27]

arXiv preprint arXiv:2202.03881 , year=

Robust hybrid learning with expert augmentation , author=. arXiv preprint arXiv:2202.03881 , year=

arXiv

[28] [28]

arXiv preprint arXiv:2310.04948 , year=

Tempo: Prompt-based generative pre-trained transformer for time series forecasting , author=. arXiv preprint arXiv:2310.04948 , year=

arXiv

[29] [29]

The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=

Med-Real2Sim: Non-Invasive Medical Digital Twins using Physics-Informed Self-Supervised Learning , author=. The Thirty-eighth Annual Conference on Neural Information Processing Systems , year=

[30] [30]

Advances in Neural Information Processing Systems , volume=

Synctwin: Treatment effect estimation with longitudinal outcomes , author=. Advances in Neural Information Processing Systems , volume=

[31] [31]

New England Journal of Medicine , volume=

A CFTR potentiator in patients with cystic fibrosis and the G551D mutation , author=. New England Journal of Medicine , volume=. 2011 , publisher=

2011

[32] [32]

International Conference on Learning Representations , year=

TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis , author=. International Conference on Learning Representations , year=

[33] [33]

arXiv preprint arXiv:2407.13278 , year=

Deep Time Series Models: A Comprehensive Survey and Benchmark , author=. arXiv preprint arXiv:2407.13278 , year=

Pith/arXiv arXiv

[34] [34]

Proceedings of the AAAI conference on artificial intelligence , volume=

Are transformers effective for time series forecasting? , author=. Proceedings of the AAAI conference on artificial intelligence , volume=

[35] [35]

Advances in Neural Information Processing Systems , volume=

Non-stationary transformers: Exploring the stationarity in time series forecasting , author=. Advances in Neural Information Processing Systems , volume=

[36] [36]

arXiv preprint arXiv:2405.14616 , year=

Timemixer: Decomposable multiscale mixing for time series forecasting , author=. arXiv preprint arXiv:2405.14616 , year=

arXiv

[37] [37]

IEEE transactions on knowledge and data engineering , volume=

Learning under concept drift: A review , author=. IEEE transactions on knowledge and data engineering , volume=. 2018 , publisher=

2018

[38] [38]

Frontiers of Computer Science , volume=

Large language models make sample-efficient recommender systems , author=. Frontiers of Computer Science , volume=. 2025 , publisher=

2025

[39] [39]

npj Digital Medicine , volume=

Large language models forecast patient health trajectories enabling digital twins , author=. npj Digital Medicine , volume=. 2025 , publisher=

2025

[40] [40]

NPJ Digital Medicine , volume=

Digital twins for health: a scoping review , author=. NPJ Digital Medicine , volume=. 2024 , publisher=

2024

[41] [41]

doi: https://doi.org/10.1016/0364-0213(90)90002-E

Finding structure in time , journal =. 1990 , issn =. doi:https://doi.org/10.1016/0364-0213(90)90002-E , url =

work page doi:10.1016/0364-0213(90)90002-e 1990

[42] [42]

Supervised Sequence Labelling with Recurrent Neural Networks , pages=

Long Short-Term Memory , author=. Supervised Sequence Labelling with Recurrent Neural Networks , pages=. 2012 , publisher=

2012

[43] [43]

International Conference on Machine Learning , pages=

Causal transformer for estimating counterfactual outcomes , author=. International Conference on Machine Learning , pages=. 2022 , organization=

2022

[44] [44]

Advances in neural information processing systems , volume=

Neural ordinary differential equations , author=. Advances in neural information processing systems , volume=. 2018 , url=

2018

[45] [45]

Advances in neural information processing systems , volume=

Augmented neural odes , author=. Advances in neural information processing systems , volume=. 2019 , url=

2019

[46] [46]

Advances in Neural Information Processing Systems , volume=

Physics-integrated variational autoencoders for robust and interpretable generative modeling , author=. Advances in Neural Information Processing Systems , volume=. 2021 , url=

2021

[47] [47]

UPRISE : Universal Prompt Retrieval for Improving Zero-Shot Evaluation

Cheng, Daixuan and Huang, Shaohan and Bi, Junyu and Zhan, Yuefeng and Liu, Jianfeng and Wang, Yujing and Sun, Hao and Wei, Furu and Deng, Weiwei and Zhang, Qi. UPRISE : Universal Prompt Retrieval for Improving Zero-Shot Evaluation. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023. doi:10.18653/v1/2023.emnlp-main.758

work page doi:10.18653/v1/2023.emnlp-main.758 2023

[48] [48]

Findings of the Association for Computational Linguistics ACL 2024 , pages=

se2: Sequential example selection for in-context learning , author=. Findings of the Association for Computational Linguistics ACL 2024 , pages=. 2024 , url=

2024

[49] [49]

nature , volume=

Learning representations by back-propagating errors , author=. nature , volume=. 1986 , publisher=

1986

[50] [50]

arXiv preprint arXiv:2009.04278 , year=

Dynode: Neural ordinary differential equations for dynamics modeling in continuous control , author=. arXiv preprint arXiv:2009.04278 , year=

arXiv 2009

[51] [51]

Journal of the American statistical Association , volume=

Strictly proper scoring rules, prediction, and estimation , author=. Journal of the American statistical Association , volume=. 2007 , publisher=

2007

[52] [52]

arXiv preprint arXiv:1807.03748 , year=

Representation learning with contrastive predictive coding , author=. arXiv preprint arXiv:1807.03748 , year=

Pith/arXiv arXiv

[53] [53]

International conference on artificial neural networks , pages=

Multi-dimensional recurrent neural networks , author=. International conference on artificial neural networks , pages=. 2007 , organization=

2007

[54] [54]

Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell

Bender, Emily M. and Gebru, Timnit and McMillan-Major, Angelina and Shmitchell, Shmargaret , title =. 2021 , isbn =. doi:10.1145/3442188.3445922 , booktitle =

work page doi:10.1145/3442188.3445922 2021

[55] [55]

arXiv preprint arXiv:2412.13663 , year=

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference , author=. arXiv preprint arXiv:2412.13663 , year=

Pith/arXiv arXiv

[56] [56]

International Conference on Learning Representations (ICLR) , year =

Adam: A Method for Stochastic Optimization , author =. International Conference on Learning Representations (ICLR) , year =. 1412.6980 , archivePrefix =

Pith/arXiv arXiv

[57] [57]

Advances in neural information processing systems , volume=

Pytorch: An imperative style, high-performance deep learning library , author=. Advances in neural information processing systems , volume=. 2019 , url=

2019

[58] [58]

arXiv preprint arXiv:1704.08863 , year=

On weight initialization in deep neural networks , author=. arXiv preprint arXiv:1704.08863 , year=

Pith/arXiv arXiv

[59] [59]

Thomas Wolf and Lysandre Debut and Victor Sanh and Julien Chaumond and Clement Delangue and Anthony Moi and Pierric Cistac and Tim Rault and Rémi Louf and Morgan Funtowicz and Joe Davison and Sam Shleifer and Patrick von Platen and Clara Ma and Yacine Jernite and Julien Plu and Canwen Xu and Teven Le Scao and Sylvain Gugger and Mariama Drame and Quentin L...

2020

[60] [60]

Transactions of the Association for Computational Linguistics , volume=

Lost in the middle: How language models use long contexts , author=. Transactions of the Association for Computational Linguistics , volume=. 2024 , publisher=

2024

[61] [61]

The Thirteenth International Conference on Learning Representations , year=

No Equations Needed: Learning System Dynamics Without Relying on Closed-Form ODEs , author=. The Thirteenth International Conference on Learning Representations , year=

[62] [62]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=

Context versus Prior Knowledge in Language Models , author=. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=. 2024 , organization=

2024

[63] [63]

Nature Computational Science , volume=

Digital twins in medicine , author=. Nature Computational Science , volume=. 2024 , publisher=

2024

[64] [64]

European Proceedings of Social and Behavioural Sciences , year=

Impact of digital twin technology on the financial performance of corporations , author=. European Proceedings of Social and Behavioural Sciences , year=

[65] [65]

Communications Earth & Environment , volume=

Digital twins of the Earth with and for humans , author=. Communications Earth & Environment , volume=. 2024 , publisher=

2024

[66] [66]

2024 , issn =

Artificial intelligence in digital twins—A systematic literature review , journal =. 2024 , issn =. doi:https://doi.org/10.1016/j.datak.2024.102304 , url =

work page doi:10.1016/j.datak.2024.102304 2024

[67] [67]

Scientific reports , volume=

Prediction of treatment response for combined chemo-and radiation therapy for non-small cell lung cancer patients using a bio-mathematical model , author=. Scientific reports , volume=. 2017 , publisher=

2017

[68] [68]

International Conference on Learning Representations , year=

Estimating counterfactual treatment outcomes over time through adversarially balanced representations , author=. International Conference on Learning Representations , year=

[69] [69]

International Conference on Machine Learning , pages=

Continuous-Time Modeling of Counterfactual Outcomes Using Neural Controlled Differential Equations , author=. International Conference on Machine Learning , pages=. 2022 , organization=

2022

[70] [70]

Cochrane Database of Systematic Reviews , number=

Dornase alfa for cystic fibrosis , author=. Cochrane Database of Systematic Reviews , number=. 2021 , publisher=

2021

[71] [71]

Respiratory Research , volume=

Predicting lung function decline in cystic fibrosis: the impact of initiating ivacaftor therapy , author=. Respiratory Research , volume=. 2024 , publisher=

2024

[72] [72]

ACM computing surveys , volume=

Survey of hallucination in natural language generation , author=. ACM computing surveys , volume=. 2023 , publisher=

2023

[73] [73]

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=

Neural Machine Translation of Rare Words with Subword Units , author=. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=. 2016 , organization=

2016

[74] [74]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics , pages=

On Faithfulness and Factuality in Abstractive Summarization , author=. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics , pages=. 2020 , organization=

2020

[75] [75]

arXiv preprint arXiv:2305.14201 , year=

Goat: Fine-tuned llama outperforms gpt-4 on arithmetic tasks , author=. arXiv preprint arXiv:2305.14201 , year=

arXiv

[76] [76]

arXiv preprint arXiv:2302.13971 , year=

Llama: Open and efficient foundation language models , author=. arXiv preprint arXiv:2302.13971 , year=

Pith/arXiv arXiv

[77] [77]

Nature , volume=

Self-organizing neural network that discovers surfaces in random-dot stereograms , author=. Nature , volume=. 1992 , publisher=

1992

[78] [78]

2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06) , volume=

Dimensionality reduction by learning an invariant mapping , author=. 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06) , volume=. 2006 , organization=

2006

[79] [79]

International conference on machine learning , pages=

A simple framework for contrastive learning of visual representations , author=. International conference on machine learning , pages=. 2020 , organization=

2020

[80] [80]

Active Task Disambiguation with

Kasia Kobalczyk and Nicol. Active Task Disambiguation with. The Thirteenth International Conference on Learning Representations , year=