hub Mixed citations

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, Philipp Fischer, Thomas Brox · 2015 · cs.CV · arXiv 1505.04597

Mixed citation behavior. Most common role is background (43%).

85 Pith papers citing it

Background 43% of classified citations

open full Pith review browse 85 citing papers arXiv PDF

abstract

There is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. The architecture consists of a contracting path to capture context and a symmetric expanding path that enables precise localization. We show that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks. Using the same network trained on transmitted light microscopy images (phase contrast and DIC) we won the ISBI cell tracking challenge 2015 in these categories by a large margin. Moreover, the network is fast. Segmentation of a 512x512 image takes less than a second on a recent GPU. The full implementation (based on Caffe) and the trained networks are available at http://lmb.informatik.uni-freiburg.de/people/ronneber/u-net .

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 11 method 6 baseline 3 dataset 1

citation-polarity summary

background 9 use method 6 baseline 3 unclear 2 use dataset 1

claims ledger

abstract There is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. The architecture consists of a contracting path to capture context and a symmetric expanding path that enables precise localization. We show that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segme

co-cited works

representative citing papers

REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations

cs.CL · 2026-05-12 · unverdicted · novelty 8.0

REALISTA optimizes continuous combinations of valid editing directions in latent space to produce realistic adversarial prompts that elicit hallucinations more effectively than prior methods, including on large reasoning models.

LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR

cs.CV · 2026-05-11 · unverdicted · novelty 7.0

LatentHDR generates structurally consistent panoramic HDR images by producing one scene latent with a diffusion backbone then deterministically mapping it to multiple exposure latents via a lightweight conditional head.

EchoXFlow: A Beamspace Echocardiography Dataset for Cardiac Motion, Flow, and Function

cs.CV · 2026-05-06 · unverdicted · novelty 7.0

EchoXFlow is a new dataset of 37,125 beamspace echocardiography recordings with separable modalities, Doppler data, ECG, and clinical annotations that enables acquisition-aware learning not possible with standard scan-converted videos.

Generative diffusion models for spatiotemporal influenza forecasting

cs.LG · 2026-04-27 · unverdicted · novelty 7.0

Influpaint uses generative diffusion models on image-encoded influenza data to produce realistic and diverse epidemic trajectories that match leading ensemble methods in accuracy.

VitaminP: cross-modal learning enables whole-cell segmentation from routine histology

cs.CV · 2026-04-26 · unverdicted · novelty 7.0

VitaminP uses paired H&E-mIF data to train a model that transfers molecular boundary information, enabling accurate whole-cell segmentation directly from routine H&E histology across 34 cancer types.

Physics-informed, Generative Adversarial Design of Funicular Shells

cs.CE · 2026-04-17 · unverdicted · novelty 7.0

A modified DCGAN with an auxiliary discriminator using the membrane factor generates stable, previously unseen funicular shells optimized for pure compression in three dimensions.

Machine Learning Phase Field Reconstruction in a Bose-Einstein Condensate

cond-mat.quant-gas · 2026-04-10 · unverdicted · novelty 7.0

A U-Net-based ML pipeline reconstructs the complete phase field and quantized vortex charges in 2D Bose-Einstein condensates from density snapshots alone, using synthetic training data from projected Gross-Pitaevskii simulations.

Dual Triangle Attention: Effective Bidirectional Attention Without Positional Embeddings

q-bio.QM · 2026-04-09 · unverdicted · novelty 7.0

Dual Triangle Attention achieves effective bidirectional attention with built-in positional inductive bias via dual triangular masks, outperforming standard bidirectional attention on position-sensitive tasks and showing strong masked language modeling results with or without positional embeddings.

Diffusion Processes on Implicit Manifolds

cs.LG · 2026-04-08 · unverdicted · novelty 7.0 · 2 refs

Defines diffusion processes on implicit data manifolds via proximity-graph approximations to the infinitesimal generator and carré-du-champ operator, proves convergence in law to the continuous manifold process, and provides an Euler-Maruyama integrator validated on synthetic and MNIST manifolds.

Contour Refinement using Discrete Diffusion in Low Data Regime

cs.CV · 2026-02-05 · unverdicted · novelty 7.0

A CNN-based discrete diffusion method refines sparse contours from segmentation masks using simplified denoising steps and minimal post-processing, outperforming baselines on small medical and environmental datasets while running 3.5 times faster.

Radio-Interferometric Image Reconstruction with Denoising Diffusion Restoration Models

astro-ph.IM · 2026-01-22 · unverdicted · novelty 7.0

A diffusion model trained on real radio galaxy images reconstructs high-fidelity interferometric observations from VLA, EHT, and ALMA simulations and outperforms CLEAN on gridded visibilities.

SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis

cs.CV · 2025-12-17 · unverdicted · novelty 7.0

SemanticBridge provides a new 3D dataset for bridge component segmentation and quantifies sensor-induced domain gaps that drop model performance by up to 11.4% mIoU.

Visual Diffusion Models are Geometric Solvers

cs.CV · 2025-10-24 · unverdicted · novelty 7.0

Standard visual diffusion models operating in pixel space can approximate solutions to the inscribed square, Steiner tree, and simple polygon problems.

Deep Learning for CMB Foreground Removal and Beam Deconvolution: A U-Net GAN Approach

astro-ph.IM · 2025-08-29 · unverdicted · novelty 7.0

A U-Net GAN reconstructs CMB T and E maps from Planck-like simulations with foregrounds and systematics, achieving under 1% error outside the Galactic region and demonstrating first-time correction for non-circular beams and asymmetric scans.

SinkSAM-Net: Knowledge-Driven Self-Supervised Sinkhole Segmentation Using Topographic Priors and Segment Anything Model

cs.CV · 2024-10-02 · unverdicted · novelty 7.0

SinkSAM-Net uses topographic priors and SAM with coordinate-wise bounding box jittering to create pseudo-labels for iterative self-supervised training of an EfficientNetV2-UNet, reaching about 95% of fully supervised performance on sinkhole datasets.

Normalizing flows for all-orders QED corrections in lattice field theory

hep-lat · 2026-05-21 · unverdicted · novelty 6.0

Normalizing flows enable all-order QED corrections in lattice scalar QED in 2-4 dimensions with reduced variance and transferability from small to large lattices.

Learning to Think in Physics: Breaking Shortcut Learning in Scientific Diffusion via Representation Alignment

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

REPA-P aligns intermediate representations in diffusion models with physical states using first-principles PDE residuals to accelerate convergence and boost out-of-distribution robustness on PDE tasks.

SegRAG: Training-Free Retrieval-Augmented Semantic Segmentation

cs.CV · 2026-05-17 · unverdicted · novelty 6.0 · 2 refs

SegRAG is a training-free retrieval-augmented framework that extracts class-specific point prompts from a filtered DINOv3 feature bank to boost SAM3 semantic segmentation performance on standard and agricultural benchmarks.

A General B\'ezier Tree Encoding Counterfactual Framework for Retinal-Vessel-Mediated Disease Analysis

eess.IV · 2026-05-13 · unverdicted · novelty 6.0

BTECF encodes retinal vessels as Bézier trees to enable targeted, parameter-level counterfactual interventions on vessel geometry for causal analysis of vascular diseases.

EDGER: EDge-Guided with HEatmap Refinement for Generalizable Image Forgery Localization

cs.CV · 2026-05-12 · unverdicted · novelty 6.0

A dual-branch system using frequency edge cues and CLIP-based synthetic patch detection for accurate, resolution-independent image forgery localization.

Geometry-aware Prototype Learning for Cross-domain Few-shot Medical Image Segmentation

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

GeoProto enriches appearance prototypes with geometric offsets from an ordinal shape branch to improve cross-domain few-shot medical image segmentation.

Don't Fix the Basis -- Learn It: Spectral Representation with Adaptive Basis Learning for PDEs

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

ABLE learns a spatially adaptive Parseval frame from data via an ancillary density to replace fixed bases in spectral neural operators for PDEs.

StereoPolicy: Improving Robotic Manipulation Policies via Stereo Perception

cs.RO · 2026-05-11 · unverdicted · novelty 6.0

StereoPolicy fuses stereo image pairs via a Stereo Transformer on pretrained 2D encoders to boost robotic manipulation policies, showing gains over monocular, RGB-D, point cloud, and multi-view methods in simulations and real-robot tests.

Diffusion model for SU(N) gauge theories

hep-lat · 2026-05-07 · unverdicted · novelty 6.0

Implicit score matching trains diffusion models that successfully sample SU(3) Wilson gauge configurations on lattices, with a Hamiltonian-dynamics corrector needed for strong coupling.

citing papers explorer

Showing 50 of 85 citing papers.

REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations cs.CL · 2026-05-12 · unverdicted · none · ref 29 · internal anchor
REALISTA optimizes continuous combinations of valid editing directions in latent space to produce realistic adversarial prompts that elicit hallucinations more effectively than prior methods, including on large reasoning models.
LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR cs.CV · 2026-05-11 · unverdicted · none · ref 37 · internal anchor
LatentHDR generates structurally consistent panoramic HDR images by producing one scene latent with a diffusion backbone then deterministically mapping it to multiple exposure latents via a lightweight conditional head.
EchoXFlow: A Beamspace Echocardiography Dataset for Cardiac Motion, Flow, and Function cs.CV · 2026-05-06 · unverdicted · none · ref 38 · internal anchor
EchoXFlow is a new dataset of 37,125 beamspace echocardiography recordings with separable modalities, Doppler data, ECG, and clinical annotations that enables acquisition-aware learning not possible with standard scan-converted videos.
Generative diffusion models for spatiotemporal influenza forecasting cs.LG · 2026-04-27 · unverdicted · none · ref 18 · internal anchor
Influpaint uses generative diffusion models on image-encoded influenza data to produce realistic and diverse epidemic trajectories that match leading ensemble methods in accuracy.
VitaminP: cross-modal learning enables whole-cell segmentation from routine histology cs.CV · 2026-04-26 · unverdicted · none · ref 46 · internal anchor
VitaminP uses paired H&E-mIF data to train a model that transfers molecular boundary information, enabling accurate whole-cell segmentation directly from routine H&E histology across 34 cancer types.
Physics-informed, Generative Adversarial Design of Funicular Shells cs.CE · 2026-04-17 · unverdicted · none · ref 35 · internal anchor
A modified DCGAN with an auxiliary discriminator using the membrane factor generates stable, previously unseen funicular shells optimized for pure compression in three dimensions.
Machine Learning Phase Field Reconstruction in a Bose-Einstein Condensate cond-mat.quant-gas · 2026-04-10 · unverdicted · none · ref 60 · internal anchor
A U-Net-based ML pipeline reconstructs the complete phase field and quantized vortex charges in 2D Bose-Einstein condensates from density snapshots alone, using synthetic training data from projected Gross-Pitaevskii simulations.
Dual Triangle Attention: Effective Bidirectional Attention Without Positional Embeddings q-bio.QM · 2026-04-09 · unverdicted · none · ref 49 · internal anchor
Dual Triangle Attention achieves effective bidirectional attention with built-in positional inductive bias via dual triangular masks, outperforming standard bidirectional attention on position-sensitive tasks and showing strong masked language modeling results with or without positional embeddings.
Diffusion Processes on Implicit Manifolds cs.LG · 2026-04-08 · unverdicted · none · ref 56 · 2 links · internal anchor
Defines diffusion processes on implicit data manifolds via proximity-graph approximations to the infinitesimal generator and carré-du-champ operator, proves convergence in law to the continuous manifold process, and provides an Euler-Maruyama integrator validated on synthetic and MNIST manifolds.
Contour Refinement using Discrete Diffusion in Low Data Regime cs.CV · 2026-02-05 · unverdicted · none · ref 36 · internal anchor
A CNN-based discrete diffusion method refines sparse contours from segmentation masks using simplified denoising steps and minimal post-processing, outperforming baselines on small medical and environmental datasets while running 3.5 times faster.
Radio-Interferometric Image Reconstruction with Denoising Diffusion Restoration Models astro-ph.IM · 2026-01-22 · unverdicted · none · ref 14 · internal anchor
A diffusion model trained on real radio galaxy images reconstructs high-fidelity interferometric observations from VLA, EHT, and ALMA simulations and outperforms CLEAN on gridded visibilities.
SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis cs.CV · 2025-12-17 · unverdicted · none · ref 69 · internal anchor
SemanticBridge provides a new 3D dataset for bridge component segmentation and quantifies sensor-induced domain gaps that drop model performance by up to 11.4% mIoU.
Visual Diffusion Models are Geometric Solvers cs.CV · 2025-10-24 · unverdicted · none · ref 38 · internal anchor
Standard visual diffusion models operating in pixel space can approximate solutions to the inscribed square, Steiner tree, and simple polygon problems.
Deep Learning for CMB Foreground Removal and Beam Deconvolution: A U-Net GAN Approach astro-ph.IM · 2025-08-29 · unverdicted · none · ref 30 · internal anchor
A U-Net GAN reconstructs CMB T and E maps from Planck-like simulations with foregrounds and systematics, achieving under 1% error outside the Galactic region and demonstrating first-time correction for non-circular beams and asymmetric scans.
SinkSAM-Net: Knowledge-Driven Self-Supervised Sinkhole Segmentation Using Topographic Priors and Segment Anything Model cs.CV · 2024-10-02 · unverdicted · none · ref 18 · internal anchor
SinkSAM-Net uses topographic priors and SAM with coordinate-wise bounding box jittering to create pseudo-labels for iterative self-supervised training of an EfficientNetV2-UNet, reaching about 95% of fully supervised performance on sinkhole datasets.
Normalizing flows for all-orders QED corrections in lattice field theory hep-lat · 2026-05-21 · unverdicted · none · ref 57 · internal anchor
Normalizing flows enable all-order QED corrections in lattice scalar QED in 2-4 dimensions with reduced variance and transferability from small to large lattices.
Learning to Think in Physics: Breaking Shortcut Learning in Scientific Diffusion via Representation Alignment cs.LG · 2026-05-20 · unverdicted · none · ref 15 · internal anchor
REPA-P aligns intermediate representations in diffusion models with physical states using first-principles PDE residuals to accelerate convergence and boost out-of-distribution robustness on PDE tasks.
SegRAG: Training-Free Retrieval-Augmented Semantic Segmentation cs.CV · 2026-05-17 · unverdicted · none · ref 7 · 2 links · internal anchor
SegRAG is a training-free retrieval-augmented framework that extracts class-specific point prompts from a filtered DINOv3 feature bank to boost SAM3 semantic segmentation performance on standard and agricultural benchmarks.
A General B\'ezier Tree Encoding Counterfactual Framework for Retinal-Vessel-Mediated Disease Analysis eess.IV · 2026-05-13 · unverdicted · none · ref 42 · internal anchor
BTECF encodes retinal vessels as Bézier trees to enable targeted, parameter-level counterfactual interventions on vessel geometry for causal analysis of vascular diseases.
EDGER: EDge-Guided with HEatmap Refinement for Generalizable Image Forgery Localization cs.CV · 2026-05-12 · unverdicted · none · ref 18 · internal anchor
A dual-branch system using frequency edge cues and CLIP-based synthetic patch detection for accurate, resolution-independent image forgery localization.
Geometry-aware Prototype Learning for Cross-domain Few-shot Medical Image Segmentation cs.CV · 2026-05-11 · unverdicted · none · ref 2 · internal anchor
GeoProto enriches appearance prototypes with geometric offsets from an ordinal shape branch to improve cross-domain few-shot medical image segmentation.
Don't Fix the Basis -- Learn It: Spectral Representation with Adaptive Basis Learning for PDEs cs.LG · 2026-05-11 · unverdicted · none · ref 18 · internal anchor
ABLE learns a spatially adaptive Parseval frame from data via an ancillary density to replace fixed bases in spectral neural operators for PDEs.
StereoPolicy: Improving Robotic Manipulation Policies via Stereo Perception cs.RO · 2026-05-11 · unverdicted · none · ref 88 · internal anchor
StereoPolicy fuses stereo image pairs via a Stereo Transformer on pretrained 2D encoders to boost robotic manipulation policies, showing gains over monocular, RGB-D, point cloud, and multi-view methods in simulations and real-robot tests.
Diffusion model for SU(N) gauge theories hep-lat · 2026-05-07 · unverdicted · none · ref 18 · internal anchor
Implicit score matching trains diffusion models that successfully sample SU(3) Wilson gauge configurations on lattices, with a Hamiltonian-dynamics corrector needed for strong coupling.
Leveraging Image Generators to Address Training Data Scarcity: The Gen4Regen Dataset for Forest Regeneration Mapping cs.CV · 2026-05-07 · conditional · none · ref 50 · internal anchor
Mixing real UAV imagery with 2101 AI-generated image-mask pairs improves semantic segmentation F1 scores for fine-grained forest species by over 15 percentage points overall and up to 30 points for rare classes.
A CNN--Transformer Denoiser for low-$S/N$ Galaxy Spectra: Stellar Population Recovery in Synthetic Tests astro-ph.GA · 2026-05-06 · unverdicted · none · ref 20 · internal anchor
A hybrid CNN-Transformer denoiser trained on synthetic spectra substantially reduces noise and improves stellar population recovery for low-S/N galaxy observations in controlled tests.
Approaching human parity in the quality of automated organoid image segmentation cs.CV · 2026-05-04 · conditional · none · ref 20 · internal anchor
A composite SAM-based method segments organoid images with accuracy matching or approaching inter-observer variability among human annotators.
When Less Is More: Simplicity Beats Complexity for Physics-Constrained InSAR Phase Unwrapping cs.CV · 2026-04-28 · accept · none · ref 6 · internal anchor
A vanilla U-Net with 7.76M parameters achieves R²=0.834 and RMSE=1.01 cm on a global InSAR benchmark, beating larger attention models by 34% in R² and 51% in RMSE while running 2.5× faster.
MG-NECOLA: A Field-Level Emulator for $f(R)$ Gravity and Massive Neutrino Cosmologies astro-ph.CO · 2026-04-21 · conditional · none · ref 47 · internal anchor
A field-level CNN emulator converts MG-PICOLA runs into near N-body accuracy for f(R) gravity and neutrino cosmologies, achieving sub-percent errors on power spectra and bispectra while generalizing beyond its training set.
From Boundaries to Semantics: Prompt-Guided Multi-Task Learning for Petrographic Thin-section Segmentation cs.CV · 2026-04-16 · unverdicted · none · ref 4 · internal anchor
Petro-SAM adapts SAM via a Merge Block for polarized views plus multi-scale fusion and color-entropy priors to jointly achieve grain-edge and lithology segmentation in petrographic images.
Self-supervised Pretraining of Cell Segmentation Models cs.CV · 2026-04-12 · unverdicted · none · ref 15 · internal anchor
DINOCell achieves a SEG score of 0.784 on LIVECell by self-supervised domain adaptation of DINOv2, improving 10.42% over SAM-based models and showing strong zero-shot transfer.
GIF: A Conditional Multimodal Generative Framework for IR Drop Imaging in Chip Layouts cs.CV · 2026-04-11 · unverdicted · none · ref 20 · internal anchor
GIF fuses geometrical image features and logical graph topology in a conditional diffusion model to generate high-quality IR drop images for chip layouts, outperforming prior ML methods on CircuitNet-N28 with SSIM 0.78, Pearson 0.95, PSNR 21.77, and NMAE 0.026.
ELT: Elastic Looped Transformers for Visual Generation cs.CV · 2026-04-10 · unverdicted · none · ref 59 · internal anchor
Elastic Looped Transformers share weights across recurrent blocks and apply intra-loop self-distillation to deliver 4x parameter reduction while matching competitive FID and FVD scores on ImageNet and UCF-101.
MRI-to-CT synthesis using drifting models eess.IV · 2026-03-30 · unverdicted · none · ref 35 · internal anchor
Drifting models outperform diffusion, CNN, VAE, and GAN baselines in MRI-to-CT synthesis on two pelvis datasets with higher SSIM/PSNR, lower RMSE, and millisecond one-step inference.
A theory of learning data statistics in diffusion models, from easy to hard stat.ML · 2026-03-13 · unverdicted · none · ref 22 · internal anchor
Diffusion models exhibit a distributional simplicity bias, learning pairwise input statistics at linear sample complexity while fourth-order cumulants require cubic complexity unless sharing correlated latent structure.
SHANG++: Robust Stochastic Acceleration under Multiplicative Noise math.OC · 2026-03-10 · unverdicted · none · ref 32 · internal anchor
SHANG++ delivers faster convergence and stronger robustness to multiplicative noise in stochastic optimization for both convex and strongly convex problems, with explicit parameters and competitive deep-learning results.
Forecasting implied volatility surface with generative diffusion models q-fin.CP · 2025-11-10 · unverdicted · none · ref 14 · 2 links · internal anchor
A conditioned diffusion model with SNR-weighted arbitrage penalty generates one-day-ahead arbitrage-free implied volatility surfaces and outperforms baselines on market data.
Recovering Sub-threshold S-wave Arrivals in Deep Learning Phase Pickers via Shape-Aware Loss physics.geo-ph · 2025-11-10 · unverdicted · none · ref 14 · internal anchor
A shape-aware loss strategy recovers sub-threshold S-wave arrivals in deep learning seismic phase pickers by treating labels as coherent shapes, achieving a 64% increase in effective detections.
DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions cs.LG · 2025-09-23 · unverdicted · none · ref 24 · internal anchor
DAWM introduces a modular diffusion world model with an inverse dynamics model to produce complete synthetic transitions that improve conservative offline RL algorithms like TD3BC and IQL on D4RL tasks.
Flow marching for a generative PDE foundation model cs.LG · 2025-09-23 · unverdicted · none · ref 53 · internal anchor
Flow Marching jointly samples noise and physical time to learn a velocity field for generative PDE modeling, paired with a latent autoencoder and efficient transformer for large-scale pretraining on 2.5M trajectories.
Label Dropout: Improved Deep Learning Echocardiography Segmentation Using Multiple Datasets With Domain Shift and Partial Labelling cs.CV · 2024-03-12 · unverdicted · none · ref 11 · internal anchor
Label dropout mitigates shortcut learning in multi-dataset partially labelled echocardiography segmentation, improving Dice scores by 62% and 25% on two cardiac structures.
SDXL-Lightning: Progressive Adversarial Diffusion Distillation cs.CV · 2024-02-21 · conditional · none · ref 52 · internal anchor
SDXL-Lightning uses progressive adversarial distillation to reach new state-of-the-art quality in one-step and few-step 1024px text-to-image generation from the SDXL base model.
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation cs.RO · 2024-01-04 · conditional · none · ref 73 · internal anchor
A low-cost whole-body teleoperation system enables effective imitation learning for complex bimanual mobile manipulation by co-training on mobile and static demonstration datasets.
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets cs.CV · 2023-11-25 · conditional · none · ref 73 · internal anchor
Stable Video Diffusion scales latent video diffusion models via text-to-image pretraining, video pretraining on curated data, and high-quality finetuning to produce competitive text-to-video and image-to-video results while enabling motion LoRA and multi-view 3D applications.
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis cs.CV · 2023-07-04 · conditional · none · ref 39 · internal anchor
SDXL improves upon prior Stable Diffusion versions through a larger UNet backbone, dual text encoders, novel conditioning, and a refinement model, producing higher-fidelity images competitive with black-box state-of-the-art generators.
Learning to Synthesize: Robust Phase Retrieval at Low Photon counts eess.IV · 2019-07-26 · unverdicted · none · ref 73 · internal anchor
The learning to synthesize method produces high-resolution, artifact-free phase reconstructions resilient to low photon flux by separately learning low and high frequency bands and then synthesizing them.
Fully-automated deep learning-powered system for DCE-MRI analysis of brain tumors eess.IV · 2019-07-18 · unverdicted · none · ref 36 · internal anchor
An end-to-end DL pipeline automates DCE-MRI analysis for brain tumors, introduces a cubic vascular input function model that lowers fitting error, and processes scans in under 3 minutes on one GPU while claiming state-of-the-art accuracy on BraTS and QIBA benchmarks plus 44 clinical cases.
Deep learning-based color holographic microscopy eess.IV · 2019-07-15 · unverdicted · none · ref 38 · internal anchor
A GAN framework reconstructs high-fidelity color images from a single three-wavelength hologram, shown on stained lung and prostate tissue sections.
Generative Modeling by Estimating Gradients of the Data Distribution cs.LG · 2019-07-12 · unverdicted · none · ref 46 · internal anchor
Score-based generative modeling via multi-noise-level score matching and annealed Langevin dynamics produces samples on par with GANs and sets a new inception score record on CIFAR-10.
Accurate Nuclear Segmentation with Center Vector Encoding cs.CV · 2019-07-09 · unverdicted · none · ref 15 · internal anchor
A bottom-up nuclear segmentation method using Center Vector Encoding outperforms prior state-of-the-art approaches.

U-Net: Convolutional Networks for Biomedical Image Segmentation

hub tools

citation-role summary

citation-polarity summary

claims ledger

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer