VIST-GPT: Ushering in the era of visual storytelling with LLMs?arXiv preprint

Mohamed Gado, Towhid Taliee, Muhammad Danish Memon, Dmitry Ignatov, Radu Timofte · 2025 · arXiv 2504.19267

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Delta-Based Neural Architecture Search: LLM Fine-Tuning via Code Diffs

cs.LG · 2026-05-06 · unverdicted · novelty 7.0

Fine-tuned 7B LLMs generating unified diffs for neural architecture refinement achieve 66-75% valid rates and 64-66% mean first-epoch accuracy, outperforming full-generation baselines by large margins while cutting output length by 75-85%.

Closed-Loop LLM Discovery of Non-Standard Channel Priors in Vision Models

cs.CV · 2026-01-13 · unverdicted · novelty 6.0

Closed-loop LLM search with AST-generated examples discovers non-standard channel widths that improve vision model performance over initial architectures on CIFAR-100.

Enhancing LLM-Based Neural Network Generation: Few-Shot Prompting and Efficient Validation for Automated Architecture Design

cs.CV · 2025-12-30 · conditional · novelty 6.0

Three-example few-shot prompting optimizes LLM-generated vision architectures while a whitespace-normalized hash provides 100x faster duplicate detection than AST parsing across seven benchmarks.

Preparation of Fractal-Inspired Computational Architectures for Advanced Large Language Model Analysis

cs.LG · 2025-11-10 · unverdicted · novelty 4.0 · 2 refs

FractalNet automatically generates and tests over 1,200 CNN architectures based on recursive fractal templates, achieving up to 80.18% accuracy on CIFAR-10 after five training epochs.

citing papers explorer

Showing 4 of 4 citing papers.

Delta-Based Neural Architecture Search: LLM Fine-Tuning via Code Diffs cs.LG · 2026-05-06 · unverdicted · none · ref 11
Fine-tuned 7B LLMs generating unified diffs for neural architecture refinement achieve 66-75% valid rates and 64-66% mean first-epoch accuracy, outperforming full-generation baselines by large margins while cutting output length by 75-85%.
Closed-Loop LLM Discovery of Non-Standard Channel Priors in Vision Models cs.CV · 2026-01-13 · unverdicted · none · ref 5
Closed-loop LLM search with AST-generated examples discovers non-standard channel widths that improve vision model performance over initial architectures on CIFAR-100.
Enhancing LLM-Based Neural Network Generation: Few-Shot Prompting and Efficient Validation for Automated Architecture Design cs.CV · 2025-12-30 · conditional · none · ref 7
Three-example few-shot prompting optimizes LLM-generated vision architectures while a whitespace-normalized hash provides 100x faster duplicate detection than AST parsing across seven benchmarks.
Preparation of Fractal-Inspired Computational Architectures for Advanced Large Language Model Analysis cs.LG · 2025-11-10 · unverdicted · none · ref 3 · 2 links
FractalNet automatically generates and tests over 1,200 CNN architectures based on recursive fractal templates, achieving up to 80.18% accuracy on CIFAR-10 after five training epochs.

VIST-GPT: Ushering in the era of visual storytelling with LLMs?arXiv preprint

fields

years

verdicts

representative citing papers

citing papers explorer