GPROF-IR is a CNN-based retrieval that uses temporal context in geostationary IR observations to produce precipitation estimates with lower error than prior IR methods and climatological consistency with PMW retrievals for integration into IMERG V08.
super hub Mixed citations
U-Net: Convolutional Networks for Biomedical Image Segmentation, pp.\ 234–241
Mixed citation behavior. Most common role is background (50%).
hub tools
citation-role summary
citation-polarity summary
authors
co-cited works
representative citing papers
AttentionBender applies 2D transforms to cross-attention maps in video diffusion transformers, producing distributed distortions and glitch aesthetics that reveal entangled attention mechanisms while serving as both an XAI probe and creative tool.
A LoRA-adapted conditional diffusion surrogate for electromagnetic calorimeter showers matches key observables within 2% RMSE and reproduces directional trends in design-utility gradients.
OOD-SEG reframes multi-class segmentation from sparse positive-only annotations as pixel-wise positive-unlabelled learning solved by integrating out-of-distribution detection techniques, with a proposed cross-validation evaluation on surgical imaging datasets.
Proposes a cyclic 2.5D perceptual loss with manufacturer SUVR standardization for T1w MRI to tau PET synthesis, reporting improved regional agreement on ADNI and SCAN cohorts across U-Net, UNETR, SwinUNETR, CycleGAN, and Pix2Pix.
A Jacobian sensitivity curve computed at initialization identifies the narrowest U-Net configuration that avoids performance collapse, matching nnU-Net accuracy with 400-1600x fewer parameters on six medical datasets.
Commutativity regularization mitigates transient error amplification in autoregressive neural simulators by penalizing non-normality and non-commutativity of Jacobians, yielding stable long-horizon rollouts.
Spectral analysis of activations and gradients provides new diagnostics that link batch size to representation geometry, early covariance tails to token efficiency, and spectral shifts to learning dynamics in decoder-only LLMs, backed by a mechanistic model.
A recurrent Vision Transformer hypernetwork injects context into Flux Neural Operators to infer and solve unseen conservation laws while preserving robustness and long-time stability.
SIAM achieves state-of-the-art whole-head MRI segmentation of 16 structures including extra-cerebral tissues by training on synthetic data from just six manual templates, matching or exceeding prior methods on 301 scans across eight heterogeneous datasets.
Cross-domain transfer of remote-sensing HSI foundation models improves proximal sensing semantic segmentation over in-domain training and narrows the gap to cross-modality methods on the HS3-Bench benchmark.
VSLP infers dense segmentations from global label proportions via a pre-trained transformer for initial confidence maps followed by variational optimization using Wasserstein fidelity and a learned regularizer, outperforming prior weakly supervised methods on histopathology datasets.
The ICPR 2026 LRLPR competition on real low-quality license plate images drew 99 valid submissions, with the winning team reaching 82.13% recognition rate and four teams exceeding 80%.
A tornado outbreak with simultaneous tornadic supercells occurred in the Philippines within an easterly severe weather regime, documented as the first known instance there.
FlowForge predicts flow fields via staged local updates with a shared lightweight predictor, matching or exceeding baselines in accuracy while improving robustness to noise and reducing latency.
PSIRNet produces diagnostic-quality free-breathing PSIR LGE cardiac MRI from a single interleaved IR/PD acquisition over two heartbeats using a physics-guided deep learning network trained on over 800,000 slices.
CATMIL augments nnU-Net with component-adaptive Tversky and MIL-based lesion supervision to raise Dice scores, small-lesion recall, and error control on the MSLesSeg dataset.
RABC-Net achieves 86.58% DICE and 79.47% JAC on skin lesion segmentation across ISIC-2017, ISIC-2018, and PH2 using only pseudo-labels and no manual masks for training or adaptation.
RSEdit adapts off-the-shelf text-to-image models into a collection of editing systems that follow text instructions while keeping geospatial structure intact in remote sensing images.
ASTERIS, a self-supervised spatiotemporal denoising algorithm, improves astronomical detection limits by 1 magnitude at 90% completeness while identifying three times more redshift >9 galaxy candidates in JWST images.
DiffuMeta uses diffusion transformers and algebraic language representations to generate diverse 3D shell metamaterials with targeted stress-strain responses under large deformations including buckling and contact.
Biased noise sampling for rectified flows combined with a bidirectional text-image transformer architecture yields state-of-the-art high-resolution text-to-image results that scale predictably with model size.
Biot-PINN embeds Biot poroelasticity into a neural network to decode photoacoustic signals for cancellous bone microstructure grading at 97% accuracy.
An EnsCGP coarse surrogate plus U-Net-ASPP corrector emulates LISFLOOD-FP flood depths on a 256x256 grid around one Chicago gauge, achieving R² ≈ 0.99 and MAE < 0.01 m on held-out events while matching the gauge depth at that single pixel.
citing papers explorer
-
Spectral Lens: Activation and Gradient Spectra as Diagnostics of LLM Optimization
Spectral analysis of activations and gradients provides new diagnostics that link batch size to representation geometry, early covariance tails to token efficiency, and spectral shifts to learning dynamics in decoder-only LLMs, backed by a mechanistic model.