MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data

Xiaoqian Jiang; Yaobin Ling; Yejin Kim

arxiv: 2406.10521 · v5 · submitted 2024-06-15 · 💻 cs.LG · cs.AI

MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data

Yaobin Ling , Xiaoqian Jiang , Yejin Kim This is my paper

Pith reviewed 2026-05-23 23:50 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords synthetic tabular datalarge language modelsgenerative adversarial networksmulti-agent systemsdata privacysmall sample sizesdownstream task utilitydata scarcity

0 comments

The pith

Large language models can be arranged as a multi-agent GAN to generate higher-quality synthetic tabular data from small samples than existing methods.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to show that large language models can be coordinated to copy the generator-discriminator structure of a generative adversarial network for creating synthetic tabular data. By feeding the data generation process itself into the models as context and using one LLM to optimize the others, the approach aims to avoid the large data volumes that traditional neural GANs require. This matters because many domains, especially healthcare, face data scarcity due to privacy rules or collection costs, yet still need realistic data for research and model training. Experiments on both public and private datasets indicate the resulting synthetic tables support better performance on downstream tasks than prior synthetic-data methods while avoiding release of the original records.

Core claim

By casting synthetic tabular data generation as a multi-agent LLM process that emulates GAN architecture, supplies the generation process as contextual information, and employs an LLM as optimizer, the framework produces synthetic data that yields higher utility on downstream tasks than state-of-the-art alternatives when only small real samples are available, while preserving privacy of the source data.

What carries the argument

The MALLM-GAN framework in which separate large language models serve as generator, discriminator, and optimizer inside an adversarial loop.

If this is right

Models trained on the generated synthetic tables achieve higher accuracy or other performance measures on classification, regression, or prediction tasks than models trained on tables from competing synthetic-data techniques.
Only synthetic records are released, so the privacy of the original small samples is not compromised.
The method remains effective even when the number of real records is too low for conventional neural GAN training.
Results hold across both publicly available benchmark tables and private domain-specific collections.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could lower barriers to data-driven work in fields where gathering large labeled sets is expensive or restricted by regulation.
As language-model capabilities improve, the same multi-agent structure might transfer to generating other structured data such as time series or relational records.
The optimizer LLM could be replaced or augmented with domain-specific rules to further constrain the synthetic output toward known statistical properties of the target domain.

Load-bearing premise

Large language models can be reliably directed to play generator, discriminator, and optimizer roles in a repeated adversarial loop that produces useful tabular data without the large training sets normally needed for neural networks.

What would settle it

Downstream task accuracy or utility metrics on multiple small-sample tabular datasets remain no higher when models are trained on MALLM-GAN synthetic data than when trained on synthetic data from current leading methods.

Figures

Figures reproduced from arXiv: 2406.10521 by Xiaoqian Jiang, Yaobin Ling, Yejin Kim.

**Figure 2.** Figure 2: DCR between the synthetic data and the real data. DCR were calculated based on training [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗

**Figure 3.** Figure 3: An example of trajectory of causal structure in data generation process over adversarial [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗

**Figure 4.** Figure 4: Real and synthetic data distribution with three categorical conditions and one numerical [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗

**Figure 5.** Figure 5: Number n of examples and MLE. 1 2 3 4 5 Adult 5, 6, 10 5, 7, 12 4, 6, 9 4, 6, 10 4, 6, 11 Insurance 31, 93, 301 44, 66, 453 32,60, 182 33, 73, 167 29, 55, 168 Asia 0, 0, 0 0, 0, 0 0, 0, 0 0, 0, 0 0, 0, 0 ATACH2 61, 73, 88 72, 87, 97 69, 78, 89 66, 75, 94 67, 83, 104 ERICH 53, 61, 79 59, 78, 96 59, 74, 101 56, 72, 96 50, 64, 88 [PITH_FULL_IMAGE:figures/full_fig_p019_5.png] view at source ↗

**Figure 6.** Figure 6: Trajectory of causal structure in data generation process over adversarial optimization. [PITH_FULL_IMAGE:figures/full_fig_p019_6.png] view at source ↗

read the original abstract

In the era of big data, access to abundant data is crucial for driving research forward. However, such data is often inaccessible due to privacy concerns or high costs, particularly in healthcare domain. Generating synthetic (tabular) data can address this, but existing models typically require substantial amounts of data to train effectively, contradicting our objective to solve data scarcity. To address this challenge, we propose a novel framework to generate synthetic tabular data, powered by large language models (LLMs) that emulates the architecture of a Generative Adversarial Network (GAN). By incorporating data generation process as contextual information and utilizing LLM as the optimizer, our approach significantly enhance the quality of synthetic data generation in common scenarios with small sample sizes. Our experimental results on public and private datasets demonstrate that our model outperforms several state-of-art models regarding generating higher quality synthetic data for downstream tasks while keeping privacy of the real data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper frames multiple LLMs as generator, discriminator, and optimizer to mimic a GAN for small-sample tabular synthesis, but the abstract supplies zero metrics, baselines, or implementation details so the performance claims cannot be checked.

read the letter

The one thing to take away is that this work tries to replace the usual neural GAN training loop with a multi-agent LLM setup for generating synthetic tabular data when real samples are scarce. They serialize tables to text, give generation context to the models, and let an LLM optimizer refine the process iteratively, claiming this beats existing methods on both public and private datasets while preserving privacy.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes MALLM-GAN, a multi-agent LLM framework that emulates a GAN architecture (with LLMs serving as generator, discriminator, and optimizer) to synthesize tabular data from small samples. Data generation is incorporated as context and an LLM optimizer refines the process; the central claim is that this yields higher-quality synthetic data than SOTA models on public and private datasets for downstream tasks while preserving privacy.

Significance. If substantiated with rigorous experiments, the work could meaningfully advance synthetic data methods for data-scarce, privacy-sensitive domains such as healthcare by sidestepping the large-sample requirements of gradient-based neural GANs. The multi-agent LLM orchestration of adversarial dynamics is a novel direction that, if shown to add value beyond LLM priors, would be of broad interest.

major comments (2)

[Abstract] Abstract: the claim that the model 'outperforms several state-of-art models' and 'significantly enhance the quality of synthetic data generation in common scenarios with small sample sizes' is unsupported by any metrics, baselines, sample sizes, evaluation protocol, or implementation details, rendering the central empirical claim impossible to assess.
[Experimental section (or equivalent)] No section demonstrates convergence properties of the LLM-prompting loop, variance across stochastic runs, or that the multi-agent setup improves upon the base LLM's pretraining knowledge; this is load-bearing for the claim that the architecture emulates effective GAN dynamics on small tabular samples without the usual data-volume requirements.

minor comments (1)

[Abstract] Abstract: 'enhance' should be 'enhances' for subject-verb agreement.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address the major points below and will revise the manuscript accordingly to strengthen the presentation of empirical claims.

read point-by-point responses

Referee: [Abstract] Abstract: the claim that the model 'outperforms several state-of-art models' and 'significantly enhance the quality of synthetic data generation in common scenarios with small sample sizes' is unsupported by any metrics, baselines, sample sizes, evaluation protocol, or implementation details, rendering the central empirical claim impossible to assess.

Authors: We agree that the abstract would benefit from greater specificity. The full manuscript reports results on public and private tabular datasets using downstream task performance as the primary metric, with comparisons to SOTA baselines under small-sample regimes. In revision we will update the abstract to reference the evaluation protocol, sample-size ranges, and key quantitative improvements. revision: yes
Referee: [Experimental section (or equivalent)] No section demonstrates convergence properties of the LLM-prompting loop, variance across stochastic runs, or that the multi-agent setup improves upon the base LLM's pretraining knowledge; this is load-bearing for the claim that the architecture emulates effective GAN dynamics on small tabular samples without the usual data-volume requirements.

Authors: We acknowledge the value of these analyses for substantiating the adversarial multi-agent dynamics. The current manuscript presents end-to-end results but does not include convergence diagnostics, run-to-run variance, or an explicit ablation versus a single-LLM baseline. We will add a dedicated experimental subsection reporting (i) iteration-wise convergence of the prompting loop, (ii) standard deviations over repeated stochastic executions, and (iii) an ablation isolating the contribution of the multi-agent GAN emulation over a non-adversarial LLM baseline. revision: yes

Circularity Check

0 steps flagged

No circularity detected; claims rest on experimental comparisons

full rationale

The paper introduces a multi-agent LLM framework that emulates GAN architecture for small-sample tabular synthesis and supports its claims solely through empirical results on public/private datasets showing outperformance versus SOTA models on downstream tasks. No equations, fitted parameters renamed as predictions, self-citations as load-bearing premises, or ansatzes imported via prior work appear in the provided text; the derivation chain is absent and the work is self-contained as an experimental proposal.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities; the multi-agent LLM setup is described at a conceptual level without enumerated assumptions or new postulated objects.

pith-pipeline@v0.9.0 · 5694 in / 1117 out tokens · 20235 ms · 2026-05-23T23:50:01.006497+00:00 · methodology

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Evaluating Inter-Column Logical Relationships in Synthetic Tabular Data Generation
cs.LG 2025-02 unverdicted novelty 6.0

Proposes three metrics for inter-column logical relationships in synthetic tabular data and reports that current generators often fail to preserve them on an industrial dataset.

Reference graph

Works this paper leans on

48 extracted references · 48 canonical work pages · cited by 1 Pith paper · 4 internal anchors

[1]

Medical cost personal datasets, May 2018

work page 2018
[2]

Distributional learning of variational autoencoder: Application to synthetic data generation

AN, S., AND JEON , J.-J. Distributional learning of variational autoencoder: Application to synthetic data generation. In Advances in Neural Information Processing Systems (2023), A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, and S. Levine, Eds., vol. 36, Curran Associates, Inc., pp. 57825–57851

work page 2023
[3]

Towards principled methods for training generative adversarial networks, 2017

ARJOVSKY , M., AND BOTTOU , L. Towards principled methods for training generative adversarial networks, 2017

work page 2017
[4]

BECKER , B., AND KOHAVI , R. Adult. UCI Machine Learning Repository, 1996. DOI: https://doi.org/10.24432/C5XW20

work page doi:10.24432/c5xw20 1996
[5]

E., G ERBER , M., AND ROBERT , C

BERNTON , E., J ACOB , P. E., G ERBER , M., AND ROBERT , C. P. Approximate Bayesian Computation with the Wasserstein Distance. Journal of the Royal Statistical Society Series B: Statistical Methodology 81, 2 (02 2019), 235–269

work page 2019
[6]

Language models are realistic tabular data generators

BORISOV , V., S ESSLER , K., L EEMANN , T., PAWELCZYK , M., AND KASNECI , G. Language models are realistic tabular data generators. In The Eleventh International Conference on Learning Representations (2023)

work page 2023
[7]

D., D HARIWAL , P., NEELAKANTAN , A., S HYAM, P., S ASTRY, G., A SKELL , A., ET AL

BROWN , T., M ANN , B., R YDER , N., S UBBIAH , M., K APLAN , J. D., D HARIWAL , P., NEELAKANTAN , A., S HYAM, P., S ASTRY, G., A SKELL , A., ET AL . Language models are few-shot learners. Advances in neural information processing systems 33 (2020), 1877–1901

work page 2020
[8]

Smote: Synthetic minority over-sampling technique

CHAWLA , N., B OWYER , K., H ALL , L., AND KEGELMEYER , W. Smote: Synthetic minority over-sampling technique. J. Artif. Intell. Res. (JAIR) 16 (06 2002), 321–357

work page 2002
[9]

V., B OWYER , K

CHAWLA , N. V., B OWYER , K. W., H ALL , L. O., AND KEGELMEYER , W. P. Smote: synthetic minority over-sampling technique. Journal of artificial intelligence research 16(2002), 321– 357

work page 2002
[10]

A., Z HANG , J., H U, Z., Q I, Y., N ICKLEACH , S., S OCOLINSKY , D., S ENGAMEDU , S., AND FALOUTSOS , C

FANG , X., X U, W., TAN, F. A., Z HANG , J., H U, Z., Q I, Y., N ICKLEACH , S., S OCOLINSKY , D., S ENGAMEDU , S., AND FALOUTSOS , C. Large language models(llms) on tabular data: Prediction, generation, and understanding – a survey, 2024

work page 2024
[11]

GRINSZTAJN , L., O YALLON , E., AND VAROQUAUX , G. Why do tree-based models still outperform deep learning on typical tabular data? In Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track(2022)

work page 2022
[12]

S., AND ROYSDON , P

GULATI , M. S., AND ROYSDON , P. F. TabMT: Generating tabular data with masked trans- formers. In Thirty-seventh Conference on Neural Information Processing Systems (2023)

work page 2023
[13]

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

GUO, T., C HEN , X., W ANG , Y., C HANG , R., P EI, S., C HAWLA , N. V., W IEST , O., AND ZHANG , X. Large language model based multi-agents: A survey of progress and challenges. arXiv preprint arXiv:2402.01680 (2024)

work page internal anchor Pith review Pith/arXiv arXiv 2024
[14]

Tabllm: Few-shot classification of tabular data with large language models

HEGSELMANN , S., B UENDIA , A., L ANG , H., A GRAWAL , M., J IANG , X., AND SONTAG , D. Tabllm: Few-shot classification of tabular data with large language models. In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics(25–27 Apr 2023), F. Ruiz, J. Dy, and J.-W. van de Meent, Eds., vol. 206 of Proceedings of Machine L...

work page 2023
[15]

K., R ENDA , A., AND CARBIN , M

HOPKINS , A. K., R ENDA , A., AND CARBIN , M. Can llms generate random numbers? evaluating llm sampling in controlled domains. In ICML 2023 Workshop: Sampling and Optimization in Discrete Space (2023)

work page 2023
[16]

J., L AM, M

HUANG , J.- T., L I, E. J., L AM, M. H., L IANG , T., WANG , W., Y UAN, Y., J IAO, W., WANG , X., T U, Z., AND LYU, M. R. How far are we on the decision-making of llms? evaluating llms’ gaming ability in multi-agent environments. arXiv preprint arXiv:2403.11807 (2024). 11

work page arXiv 2024
[17]

B., C HESS , B., C HILD , R., GRAY, S., R ADFORD , A., W U, J., AND AMODEI , D

KAPLAN , J., M CCANDLISH , S., H ENIGHAN , T., B ROWN , T. B., C HESS , B., C HILD , R., GRAY, S., R ADFORD , A., W U, J., AND AMODEI , D. Scaling laws for neural language models, 2020

work page 2020
[18]

Causal reasoning and large language models: Opening a new frontier for causality

KICIMAN , E., N ESS , R., S HARMA , A., AND TAN, C. Causal reasoning and large language models: Opening a new frontier for causality. arXiv preprint arXiv:2305.00050 (2023)

work page arXiv 2023
[19]

Tabddpm: Mod- elling tabular data with diffusion models

KOTELNIKOV , A., B ARANCHUK , D., R UBACHEV , I., AND BABENKO , A. Tabddpm: Mod- elling tabular data with diffusion models. In International Conference on Machine Learning (2023), PMLR, pp. 17564–17579

work page 2023
[20]

Retrieval- augmented generation for knowledge-intensive nlp tasks, 2021

LEWIS , P., P EREZ , E., P IKTUS , A., P ETRONI , F., K ARPUKHIN , V., G OYAL, N., K ÜTTLER , H., L EWIS , M., TAU YIH, W., R OCKTÄSCHEL , T., R IEDEL , S., AND KIELA , D. Retrieval- augmented generation for knowledge-intensive nlp tasks, 2021

work page 2021
[21]

Cancergpt for few shot drug pair synergy prediction using large pretrained language models

LI, T., S HETTY , S., K AMATH , A., J AISWAL , A., J IANG , X., D ING , Y., AND KIM, Y. Cancergpt for few shot drug pair synergy prediction using large pretrained language models. npj Digital Medicine 7, 1 (Feb 2024), 40

work page 2024
[22]

B., T ANG , K., A RONOWSKI , J., F ANN , Y., S AVITZ , S

LING , Y., T ARIQ , M. B., T ANG , K., A RONOWSKI , J., F ANN , Y., S AVITZ , S. I., J IANG , X., AND KIM, Y. An interpretable framework to identify responsive subgroups from clinical trials regarding treatment effects: Application to treatment of intracerebral hemorrhage. PLOS Digital Health 3, 5 (05 2024), 1–17

work page 2024
[23]

Causal discovery with language models as imperfect experts, 07 2023

LONG , S., P ICHÉ , A., Z ANTEDESCHI , V., SCHUSTER , T., AND DROUIN , A. Causal discovery with language models as imperfect experts, 07 2023

work page 2023
[24]

T., Z HANG , S., C ARIGNAN , D., E DGAR , R., F USI , N., K ING , N., LARSON , J., L I, Y., L IU, W., L UO, R., M CKINNEY , S

NORI , H., L EE, Y. T., Z HANG , S., C ARIGNAN , D., E DGAR , R., F USI , N., K ING , N., LARSON , J., L I, Y., L IU, W., L UO, R., M CKINNEY , S. M., N ESS , R. O., P OON , H., Q IN, T., USUYAMA , N., W HITE , C., AND HORVITZ , E. Can generalist foundation models outcompete special-purpose tuning? case study in medicine. November 2023

work page 2023
[25]

Gpt-4 technical report, 2023

O PENAI. Gpt-4 technical report, 2023

work page 2023
[26]

Causality, 2 ed

P EARL , J. Causality, 2 ed. Cambridge University Press, Cambridge, UK, 2009

work page 2009
[27]

I., P ALESCH , Y

QURESHI , A. I., P ALESCH , Y. Y., BARSAN , W. G., H ANLEY , D. F., H SU, C. Y., M ARTIN , R. L., M OY, C. S., S ILBERGLEIT , R., S TEINER , T., S UAREZ , J. I., T OYODA , K., W ANG , Y., YAMAMOTO , H., AND YOON , B.-W. Intensive blood-pressure lowering in patients with acute cerebral hemorrhage. New England Journal of Medicine 375, 11 (2016), 1033–1043

work page 2016
[28]

Zero-shot text-to-image generation, 2021

RAMESH , A., P AVLOV, M., G OH, G., G RAY, S., V OSS , C., R ADFORD , A., C HEN , M., AND SUTSKEVER , I. Zero-shot text-to-image generation, 2021

work page 2021
[29]

Mathematical discoveries from program search with large language models

ROMERA -PAREDES , B., B AREKATAIN , M., N OVIKOV, A., B ALOG , M., K UMAR , M., DUPONT , E., R UIZ , F., E LLENBERG , J., W ANG , P., FAWZI , O., K OHLI , P., AND FAWZI , A. Mathematical discoveries from program search with large language models. Nature 625 (12 2023)

work page 2023
[30]

Learning Bayesian Networks with the bnlearn R Package

SCUTARI , M. Learning bayesian networks with the bnlearn r package. arXiv preprint arXiv:0908.3817 (2009)

work page internal anchor Pith review Pith/arXiv arXiv 2009
[31]

Curated llm: Synergy of llms and data curation for tabular augmentation in ultra low-data regimes, 2024

SEEDAT, N., H UYNH , N., VAN BREUGEL , B., AND VAN DER SCHAAR , M. Curated llm: Synergy of llms and data curation for tabular augmentation in ultra low-data regimes, 2024

work page 2024
[32]

V., AND DUPRIEZ , O

SOLATORIO , A. V., AND DUPRIEZ , O. Realtabformer: Generating realistic relational and tabular data using transformers, 2023

work page 2023
[33]

Causation, prediction, and search

SPIRTES , P., G LYMOUR , C., AND SCHEINES , R. Causation, prediction, and search. MIT press, 2001

work page 2001
[34]

An algorithm for causal inference in the presence of latent variables and selection bias (vol

SPIRTES , P., M EEK , C., AND RICHARDSON , T. An algorithm for causal inference in the presence of latent variables and selection bias (vol. 1), 1999. 12

work page 1999
[35]

Multi-Agent Collaboration: Harnessing the Power of Intelligent LLM Agents

TALEBIRAD , Y., AND NADIRI , A. Multi-agent collaboration: Harnessing the power of intelligent llm agents. arXiv preprint arXiv:2306.03314 (2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023
[36]

E., AND ALIFERIS , C

TSAMARDINOS , I., B ROWN , L. E., AND ALIFERIS , C. F. The max-min hill-climbing bayesian network structure learning algorithm. Machine learning 65 (2006), 31–78

work page 2006
[37]

E., AND ALIFERIS , C

TSAMARDINOS , I., B ROWN , L. E., AND ALIFERIS , C. F. The max-min hill-climbing bayesian network structure learning algorithm. Machine Learning 65, 1 (Oct 2006), 31–78

work page 2006
[38]

Scalable causal structure learning: Scoping review of traditional and deep learning algorithms and new opportunities in biomedicine

UPADHYAYA, P., Z HANG , K., L I, C., J IANG , X., AND KIM, Y. Scalable causal structure learning: Scoping review of traditional and deep learning algorithms and new opportunities in biomedicine. JMIR Med Inform 11 (Jan 2023), e38266

work page 2023
[39]

L., O SBORNE , J., B ROWN , M

WOO, D., R OSAND , J., K IDWELL , C., M CCAULEY, J. L., O SBORNE , J., B ROWN , M. W., WEST, S. E., R ADEMACHER , E. W., W ADDY, S., R OBERTS , J. N., ET AL . The ethnic/racial variations of intracerebral hemorrhage (erich) study protocol. Stroke 44, 10 (2013), e120–e125

work page 2013
[40]

Interpretation for variational autoencoder used to generate financial synthetic tabular data

WU, J., P LATANIOTIS , K., L IU, L., A MJADIAN , E., AND LAWRYSHYN , Y. Interpretation for variational autoencoder used to generate financial synthetic tabular data. Algorithms 16, 2 (2023)

work page 2023
[41]

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

WU, Q., B ANSAL , G., Z HANG , J., W U, Y., Z HANG , S., Z HU, E., L I, B., J IANG , L., Z HANG , X., AND WANG , C. Autogen: Enabling next-gen llm applications via multi-agent conversation framework. arXiv preprint arXiv:2308.08155 (2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023
[42]

Modeling tabular data using conditional GAN

XU, L., S KOULARIDOU , M., C UESTA -I NFANTE , A., AND VEERAMACHANENI , K. Modeling tabular data using conditional GAN. Curran Associates Inc., Red Hook, NY , USA, 2019

work page 2019
[43]

V., Z HOU , D., AND CHEN , X

YANG , C., WANG , X., L U, Y., LIU, H., L E, Q. V., Z HOU , D., AND CHEN , X. Large language models as optimizers

work page
[44]

Using bayesian networks to create synthetic data, 2009

YOUNG , J., G RAHAM , P., AND PENNY, R. Using bayesian networks to create synthetic data, 2009

work page 2009
[45]

Unified language representation for question answering over text, tables, and images

YU, B., F U, C., Y U, H., H UANG , F., AND LI, Y. Unified language representation for question answering over text, tables, and images. In Findings of the Association for Computational Lin- guistics: ACL 2023 (Toronto, Canada, July 2023), A. Rogers, J. Boyd-Graber, and N. Okazaki, Eds., Association for Computational Linguistics, pp. 4756–4765

work page 2023
[46]

Dag-gnn: Dag structure learning with graph neural networks, 2019

YU, Y., C HEN , J., G AO, T., AND YU, M. Dag-gnn: Dag structure learning with graph neural networks, 2019

work page 2019
[47]

Generative table pre-training empowers models for tabular prediction

ZHANG , T., W ANG , S., Y AN, S., J IAN , L., AND LIU, Q. Generative table pre-training empowers models for tabular prediction. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (Singapore, Dec. 2023), H. Bouamor, J. Pino, and K. Bali, Eds., Association for Computational Linguistics, pp. 14836–14854

work page 2023
[48]

treatment

ZHENG , X., A RAGAM , B., R AVIKUMAR , P., AND XING , E. P. Dags with no tears: Continuous optimization for structure learning, 2018. 13 SUPPLEMENTARY 1 E XAMPLE OF PROMPTS AND OUTPUT Here, we provided example of generator prompt and optimizer prompt. Note that generator prompt evolves over the iteration. 1 System role: 2 % Specify role and task 3 You are...

work page 2018

[1] [1]

Medical cost personal datasets, May 2018

work page 2018

[2] [2]

Distributional learning of variational autoencoder: Application to synthetic data generation

AN, S., AND JEON , J.-J. Distributional learning of variational autoencoder: Application to synthetic data generation. In Advances in Neural Information Processing Systems (2023), A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, and S. Levine, Eds., vol. 36, Curran Associates, Inc., pp. 57825–57851

work page 2023

[3] [3]

Towards principled methods for training generative adversarial networks, 2017

ARJOVSKY , M., AND BOTTOU , L. Towards principled methods for training generative adversarial networks, 2017

work page 2017

[4] [4]

BECKER , B., AND KOHAVI , R. Adult. UCI Machine Learning Repository, 1996. DOI: https://doi.org/10.24432/C5XW20

work page doi:10.24432/c5xw20 1996

[5] [5]

E., G ERBER , M., AND ROBERT , C

BERNTON , E., J ACOB , P. E., G ERBER , M., AND ROBERT , C. P. Approximate Bayesian Computation with the Wasserstein Distance. Journal of the Royal Statistical Society Series B: Statistical Methodology 81, 2 (02 2019), 235–269

work page 2019

[6] [6]

Language models are realistic tabular data generators

BORISOV , V., S ESSLER , K., L EEMANN , T., PAWELCZYK , M., AND KASNECI , G. Language models are realistic tabular data generators. In The Eleventh International Conference on Learning Representations (2023)

work page 2023

[7] [7]

D., D HARIWAL , P., NEELAKANTAN , A., S HYAM, P., S ASTRY, G., A SKELL , A., ET AL

BROWN , T., M ANN , B., R YDER , N., S UBBIAH , M., K APLAN , J. D., D HARIWAL , P., NEELAKANTAN , A., S HYAM, P., S ASTRY, G., A SKELL , A., ET AL . Language models are few-shot learners. Advances in neural information processing systems 33 (2020), 1877–1901

work page 2020

[8] [8]

Smote: Synthetic minority over-sampling technique

CHAWLA , N., B OWYER , K., H ALL , L., AND KEGELMEYER , W. Smote: Synthetic minority over-sampling technique. J. Artif. Intell. Res. (JAIR) 16 (06 2002), 321–357

work page 2002

[9] [9]

V., B OWYER , K

CHAWLA , N. V., B OWYER , K. W., H ALL , L. O., AND KEGELMEYER , W. P. Smote: synthetic minority over-sampling technique. Journal of artificial intelligence research 16(2002), 321– 357

work page 2002

[10] [10]

A., Z HANG , J., H U, Z., Q I, Y., N ICKLEACH , S., S OCOLINSKY , D., S ENGAMEDU , S., AND FALOUTSOS , C

FANG , X., X U, W., TAN, F. A., Z HANG , J., H U, Z., Q I, Y., N ICKLEACH , S., S OCOLINSKY , D., S ENGAMEDU , S., AND FALOUTSOS , C. Large language models(llms) on tabular data: Prediction, generation, and understanding – a survey, 2024

work page 2024

[11] [11]

GRINSZTAJN , L., O YALLON , E., AND VAROQUAUX , G. Why do tree-based models still outperform deep learning on typical tabular data? In Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track(2022)

work page 2022

[12] [12]

S., AND ROYSDON , P

GULATI , M. S., AND ROYSDON , P. F. TabMT: Generating tabular data with masked trans- formers. In Thirty-seventh Conference on Neural Information Processing Systems (2023)

work page 2023

[13] [13]

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

GUO, T., C HEN , X., W ANG , Y., C HANG , R., P EI, S., C HAWLA , N. V., W IEST , O., AND ZHANG , X. Large language model based multi-agents: A survey of progress and challenges. arXiv preprint arXiv:2402.01680 (2024)

work page internal anchor Pith review Pith/arXiv arXiv 2024

[14] [14]

Tabllm: Few-shot classification of tabular data with large language models

HEGSELMANN , S., B UENDIA , A., L ANG , H., A GRAWAL , M., J IANG , X., AND SONTAG , D. Tabllm: Few-shot classification of tabular data with large language models. In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics(25–27 Apr 2023), F. Ruiz, J. Dy, and J.-W. van de Meent, Eds., vol. 206 of Proceedings of Machine L...

work page 2023

[15] [15]

K., R ENDA , A., AND CARBIN , M

HOPKINS , A. K., R ENDA , A., AND CARBIN , M. Can llms generate random numbers? evaluating llm sampling in controlled domains. In ICML 2023 Workshop: Sampling and Optimization in Discrete Space (2023)

work page 2023

[16] [16]

J., L AM, M

HUANG , J.- T., L I, E. J., L AM, M. H., L IANG , T., WANG , W., Y UAN, Y., J IAO, W., WANG , X., T U, Z., AND LYU, M. R. How far are we on the decision-making of llms? evaluating llms’ gaming ability in multi-agent environments. arXiv preprint arXiv:2403.11807 (2024). 11

work page arXiv 2024

[17] [17]

B., C HESS , B., C HILD , R., GRAY, S., R ADFORD , A., W U, J., AND AMODEI , D

KAPLAN , J., M CCANDLISH , S., H ENIGHAN , T., B ROWN , T. B., C HESS , B., C HILD , R., GRAY, S., R ADFORD , A., W U, J., AND AMODEI , D. Scaling laws for neural language models, 2020

work page 2020

[18] [18]

Causal reasoning and large language models: Opening a new frontier for causality

KICIMAN , E., N ESS , R., S HARMA , A., AND TAN, C. Causal reasoning and large language models: Opening a new frontier for causality. arXiv preprint arXiv:2305.00050 (2023)

work page arXiv 2023

[19] [19]

Tabddpm: Mod- elling tabular data with diffusion models

KOTELNIKOV , A., B ARANCHUK , D., R UBACHEV , I., AND BABENKO , A. Tabddpm: Mod- elling tabular data with diffusion models. In International Conference on Machine Learning (2023), PMLR, pp. 17564–17579

work page 2023

[20] [20]

Retrieval- augmented generation for knowledge-intensive nlp tasks, 2021

LEWIS , P., P EREZ , E., P IKTUS , A., P ETRONI , F., K ARPUKHIN , V., G OYAL, N., K ÜTTLER , H., L EWIS , M., TAU YIH, W., R OCKTÄSCHEL , T., R IEDEL , S., AND KIELA , D. Retrieval- augmented generation for knowledge-intensive nlp tasks, 2021

work page 2021

[21] [21]

Cancergpt for few shot drug pair synergy prediction using large pretrained language models

LI, T., S HETTY , S., K AMATH , A., J AISWAL , A., J IANG , X., D ING , Y., AND KIM, Y. Cancergpt for few shot drug pair synergy prediction using large pretrained language models. npj Digital Medicine 7, 1 (Feb 2024), 40

work page 2024

[22] [22]

B., T ANG , K., A RONOWSKI , J., F ANN , Y., S AVITZ , S

LING , Y., T ARIQ , M. B., T ANG , K., A RONOWSKI , J., F ANN , Y., S AVITZ , S. I., J IANG , X., AND KIM, Y. An interpretable framework to identify responsive subgroups from clinical trials regarding treatment effects: Application to treatment of intracerebral hemorrhage. PLOS Digital Health 3, 5 (05 2024), 1–17

work page 2024

[23] [23]

Causal discovery with language models as imperfect experts, 07 2023

LONG , S., P ICHÉ , A., Z ANTEDESCHI , V., SCHUSTER , T., AND DROUIN , A. Causal discovery with language models as imperfect experts, 07 2023

work page 2023

[24] [24]

T., Z HANG , S., C ARIGNAN , D., E DGAR , R., F USI , N., K ING , N., LARSON , J., L I, Y., L IU, W., L UO, R., M CKINNEY , S

NORI , H., L EE, Y. T., Z HANG , S., C ARIGNAN , D., E DGAR , R., F USI , N., K ING , N., LARSON , J., L I, Y., L IU, W., L UO, R., M CKINNEY , S. M., N ESS , R. O., P OON , H., Q IN, T., USUYAMA , N., W HITE , C., AND HORVITZ , E. Can generalist foundation models outcompete special-purpose tuning? case study in medicine. November 2023

work page 2023

[25] [25]

Gpt-4 technical report, 2023

O PENAI. Gpt-4 technical report, 2023

work page 2023

[26] [26]

Causality, 2 ed

P EARL , J. Causality, 2 ed. Cambridge University Press, Cambridge, UK, 2009

work page 2009

[27] [27]

I., P ALESCH , Y

QURESHI , A. I., P ALESCH , Y. Y., BARSAN , W. G., H ANLEY , D. F., H SU, C. Y., M ARTIN , R. L., M OY, C. S., S ILBERGLEIT , R., S TEINER , T., S UAREZ , J. I., T OYODA , K., W ANG , Y., YAMAMOTO , H., AND YOON , B.-W. Intensive blood-pressure lowering in patients with acute cerebral hemorrhage. New England Journal of Medicine 375, 11 (2016), 1033–1043

work page 2016

[28] [28]

Zero-shot text-to-image generation, 2021

RAMESH , A., P AVLOV, M., G OH, G., G RAY, S., V OSS , C., R ADFORD , A., C HEN , M., AND SUTSKEVER , I. Zero-shot text-to-image generation, 2021

work page 2021

[29] [29]

Mathematical discoveries from program search with large language models

ROMERA -PAREDES , B., B AREKATAIN , M., N OVIKOV, A., B ALOG , M., K UMAR , M., DUPONT , E., R UIZ , F., E LLENBERG , J., W ANG , P., FAWZI , O., K OHLI , P., AND FAWZI , A. Mathematical discoveries from program search with large language models. Nature 625 (12 2023)

work page 2023

[30] [30]

Learning Bayesian Networks with the bnlearn R Package

SCUTARI , M. Learning bayesian networks with the bnlearn r package. arXiv preprint arXiv:0908.3817 (2009)

work page internal anchor Pith review Pith/arXiv arXiv 2009

[31] [31]

Curated llm: Synergy of llms and data curation for tabular augmentation in ultra low-data regimes, 2024

SEEDAT, N., H UYNH , N., VAN BREUGEL , B., AND VAN DER SCHAAR , M. Curated llm: Synergy of llms and data curation for tabular augmentation in ultra low-data regimes, 2024

work page 2024

[32] [32]

V., AND DUPRIEZ , O

SOLATORIO , A. V., AND DUPRIEZ , O. Realtabformer: Generating realistic relational and tabular data using transformers, 2023

work page 2023

[33] [33]

Causation, prediction, and search

SPIRTES , P., G LYMOUR , C., AND SCHEINES , R. Causation, prediction, and search. MIT press, 2001

work page 2001

[34] [34]

An algorithm for causal inference in the presence of latent variables and selection bias (vol

SPIRTES , P., M EEK , C., AND RICHARDSON , T. An algorithm for causal inference in the presence of latent variables and selection bias (vol. 1), 1999. 12

work page 1999

[35] [35]

Multi-Agent Collaboration: Harnessing the Power of Intelligent LLM Agents

TALEBIRAD , Y., AND NADIRI , A. Multi-agent collaboration: Harnessing the power of intelligent llm agents. arXiv preprint arXiv:2306.03314 (2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023

[36] [36]

E., AND ALIFERIS , C

TSAMARDINOS , I., B ROWN , L. E., AND ALIFERIS , C. F. The max-min hill-climbing bayesian network structure learning algorithm. Machine learning 65 (2006), 31–78

work page 2006

[37] [37]

E., AND ALIFERIS , C

TSAMARDINOS , I., B ROWN , L. E., AND ALIFERIS , C. F. The max-min hill-climbing bayesian network structure learning algorithm. Machine Learning 65, 1 (Oct 2006), 31–78

work page 2006

[38] [38]

Scalable causal structure learning: Scoping review of traditional and deep learning algorithms and new opportunities in biomedicine

UPADHYAYA, P., Z HANG , K., L I, C., J IANG , X., AND KIM, Y. Scalable causal structure learning: Scoping review of traditional and deep learning algorithms and new opportunities in biomedicine. JMIR Med Inform 11 (Jan 2023), e38266

work page 2023

[39] [39]

L., O SBORNE , J., B ROWN , M

WOO, D., R OSAND , J., K IDWELL , C., M CCAULEY, J. L., O SBORNE , J., B ROWN , M. W., WEST, S. E., R ADEMACHER , E. W., W ADDY, S., R OBERTS , J. N., ET AL . The ethnic/racial variations of intracerebral hemorrhage (erich) study protocol. Stroke 44, 10 (2013), e120–e125

work page 2013

[40] [40]

Interpretation for variational autoencoder used to generate financial synthetic tabular data

WU, J., P LATANIOTIS , K., L IU, L., A MJADIAN , E., AND LAWRYSHYN , Y. Interpretation for variational autoencoder used to generate financial synthetic tabular data. Algorithms 16, 2 (2023)

work page 2023

[41] [41]

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

WU, Q., B ANSAL , G., Z HANG , J., W U, Y., Z HANG , S., Z HU, E., L I, B., J IANG , L., Z HANG , X., AND WANG , C. Autogen: Enabling next-gen llm applications via multi-agent conversation framework. arXiv preprint arXiv:2308.08155 (2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023

[42] [42]

Modeling tabular data using conditional GAN

XU, L., S KOULARIDOU , M., C UESTA -I NFANTE , A., AND VEERAMACHANENI , K. Modeling tabular data using conditional GAN. Curran Associates Inc., Red Hook, NY , USA, 2019

work page 2019

[43] [43]

V., Z HOU , D., AND CHEN , X

YANG , C., WANG , X., L U, Y., LIU, H., L E, Q. V., Z HOU , D., AND CHEN , X. Large language models as optimizers

work page

[44] [44]

Using bayesian networks to create synthetic data, 2009

YOUNG , J., G RAHAM , P., AND PENNY, R. Using bayesian networks to create synthetic data, 2009

work page 2009

[45] [45]

Unified language representation for question answering over text, tables, and images

YU, B., F U, C., Y U, H., H UANG , F., AND LI, Y. Unified language representation for question answering over text, tables, and images. In Findings of the Association for Computational Lin- guistics: ACL 2023 (Toronto, Canada, July 2023), A. Rogers, J. Boyd-Graber, and N. Okazaki, Eds., Association for Computational Linguistics, pp. 4756–4765

work page 2023

[46] [46]

Dag-gnn: Dag structure learning with graph neural networks, 2019

YU, Y., C HEN , J., G AO, T., AND YU, M. Dag-gnn: Dag structure learning with graph neural networks, 2019

work page 2019

[47] [47]

Generative table pre-training empowers models for tabular prediction

ZHANG , T., W ANG , S., Y AN, S., J IAN , L., AND LIU, Q. Generative table pre-training empowers models for tabular prediction. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (Singapore, Dec. 2023), H. Bouamor, J. Pino, and K. Bali, Eds., Association for Computational Linguistics, pp. 14836–14854

work page 2023

[48] [48]

treatment

ZHENG , X., A RAGAM , B., R AVIKUMAR , P., AND XING , E. P. Dags with no tears: Continuous optimization for structure learning, 2018. 13 SUPPLEMENTARY 1 E XAMPLE OF PROMPTS AND OUTPUT Here, we provided example of generator prompt and optimizer prompt. Note that generator prompt evolves over the iteration. 1 System role: 2 % Specify role and task 3 You are...

work page 2018