
Deep Learning for Classical Japanese Literature

2018 · cs.CV · arXiv 1812.01718

20 Pith papers cite this work. Polarity classification is still indexing.

abstract

Much of machine learning research focuses on producing models which perform well on benchmark tasks, in turn improving our understanding of the challenges associated with those tasks. From the perspective of ML researchers, the content of the task itself is largely irrelevant, and thus there have increasingly been calls for benchmark tasks to more heavily focus on problems which are of social or cultural relevance. In this work, we introduce Kuzushiji-MNIST, a dataset which focuses on Kuzushiji (cursive Japanese), as well as two larger, more challenging datasets, Kuzushiji-49 and Kuzushiji-Kanji. Through these datasets, we wish to engage the machine learning community into the world of classical Japanese literature. Dataset available at https://github.com/rois-codh/kmnist

citation-role summary

dataset 3

citation-polarity summary

use dataset 3

representative citing papers

Partitioning for Intrinsic Model Inversion Resistance in Collaborative Inference

cs.IT · 2025-06-18 · conditional · novelty 7.0

The authors identify a Golden Partition Zone based on an intra-class variance shift in entropy bounds that enables intrinsic model inversion resistance when partitioning neural networks for collaborative inference.

Quantum Interval Bound Propagation for Certified Training of Quantum Neural Networks

quant-ph · 2026-05-01 · unverdicted · novelty 7.0

QIBP adapts interval bound propagation to quantum neural networks for certified adversarial robustness via interval and affine arithmetic implementations.

Grokking of Diffusion Models: Case Study on Modular Addition

cs.LG · 2026-04-20 · unverdicted · novelty 7.0

Diffusion models show grokking on modular addition by composing periodic operand representations in simple data regimes or by separating arithmetic computation from visual denoising across timesteps in varied regimes.

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring

cs.CL · 2026-05-16 · unverdicted · novelty 6.0

E-PMQ improves 4-bit quantization accuracy on merged models by 8-42 points across CLIP and GLUE tasks through expert-guided calibration and merged-weight anchoring.

Bayesian Model Merging

cs.LG · 2026-05-13 · unverdicted · novelty 6.0

Bayesian Model Merging introduces a bi-level optimization framework that merges task-specific models via closed-form Bayesian regression with an anchor prior and global hyperparameter search, outperforming baselines and nearly matching expert averages on up to 20-task vision and 5-task language Merg

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

cs.CL · 2026-03-03 · unverdicted · novelty 6.0

ACE-Merging estimates task input covariances from parameter differences to enable closed-form data-free merging that reduces interference and outperforms prior baselines on vision and language tasks.

Realistic Handwritten Multi-Digit Writer (MDW) Number Recognition Challenges

cs.CV · 2025-11-30 · unverdicted · novelty 6.0

New MDW benchmarks demonstrate that isolated digit classifiers struggle with multi-digit numbers from the same writer, necessitating task-specific metrics and advanced methods.

Reducing Class Bias In Data-Balanced Datasets Through Hardness-Based Resampling

cs.LG · 2025-04-09 · unverdicted · novelty 6.0

Hardness-Based Resampling reduces class recall gaps in balanced datasets by up to 32% on CIFAR-10 and 16% on CIFAR-100 by prioritizing harder samples over random or frequency-based selection.

Model Merging: Foundations and Algorithms

cs.LG · 2026-05-02 · unverdicted · novelty 6.0

New cycle-consistent optimization, task vector theory, singular vector decompositions, adaptive routing, and efficient evolutionary search provide foundations for merging neural network weights across tasks.

Possibilistic Predictive Uncertainty for Deep Learning

cs.LG · 2026-05-01 · unverdicted · novelty 6.0

DAPPr introduces a possibilistic framework that projects parameter posteriors to predictions via supremum and approximates them with Dirichlet possibility functions to yield efficient, closed-form epistemic uncertainty estimates.

Efficient Mutation Testing of Quantum Machine Learning Models

quant-ph · 2026-04-30 · unverdicted · novelty 6.0

New mutation operators and directed mutant generation produce more diverse faulty quantum neural network circuits than prior techniques, as shown in experiments.

Controlled Steering-Based State Preparation for Adversarial-Robust Quantum Machine Learning

quant-ph · 2026-04-30 · unverdicted · novelty 6.0

A passive steering method for quantum state preparation improves adversarial accuracy in QML models by up to 40% across tested cases.

Task Alignment: A simple and effective proxy for model merging in computer vision

cs.CV · 2026-04-14 · unverdicted · novelty 6.0

Task alignment serves as an efficient proxy for hyperparameter selection in model merging, accelerating the process by orders of magnitude while preserving performance in vision models with heterogeneous decoders.

Continual Distillation of Teachers from Different Domains

cs.LG · 2026-04-10 · conditional · novelty 6.0

SE2D stabilizes continual distillation across heterogeneous teachers by preserving logits on external unlabeled data to mitigate unseen knowledge forgetting.

Algebraic Machine Learning for Small-to-Medium Datasets Is Competitive against Strong Standard Baselines

cs.LG · 2026-05-21 · unverdicted · novelty 5.0

AML outperforms cross-validated baselines including CNNs on 50-2000 example image datasets and is comparable to XGBoost/LightGBM on tabular data using only training data and no task-dependent hyperparameters.

Stimulus symmetries can confound representational similarity analyses

q-bio.NC · 2026-05-20 · unverdicted · novelty 5.0

Stimulus symmetries render many neural representations functionally equivalent yet produce qualitatively different RSMs, including drifting ones from SGD or regularization in image-encoding networks.

Risk-Consistent Multiclass Learning from Random Label-Subset Membership Queries

cs.LG · 2026-05-08 · unverdicted · novelty 5.0

The paper introduces risk-consistent multiclass learning from random label-subset queries by deriving an unbiased risk estimator under ERM, plus non-negative and absolute-value corrections, with generalization bounds and consistency results.

Dendritic Neural Networks with Equilibrium Propagation

cs.LG · 2026-05-01 · unverdicted · novelty 5.0

Dendritic EP matches standard EP on simple tasks but significantly outperforms it on KMNIST and FMNIST, and in deeper models, approaching the performance of backpropagation-trained dendritic networks.

Context-Aware Multipath Networks

cs.CV · 2019-07-26 · unverdicted · novelty 4.0

CAMNet uses data-dependent routing across parallel tensors in a multi-path network to outperform equivalent single-path, multi-path, and deeper networks on classification and pixel-labeling tasks for individual, sequential, and combined datasets.

Unlocking the Potential of Continual Model Merging: An ODE Perspective

cs.LG · 2026-05-19 · 2 refs

citing papers explorer

Showing 20 of 20 citing papers.

Partitioning for Intrinsic Model Inversion Resistance in Collaborative Inference
cs.IT · 2025-06-18 · conditional · none · ref 4 · internal anchor

Quantum Interval Bound Propagation for Certified Training of Quantum Neural Networks
quant-ph · 2026-05-01 · unverdicted · none · ref 23

Grokking of Diffusion Models: Case Study on Modular Addition
cs.LG · 2026-04-20 · unverdicted · none · ref 4

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring
cs.CL · 2026-05-16 · unverdicted · none · ref 46 · internal anchor

Bayesian Model Merging
cs.LG · 2026-05-13 · unverdicted · none · ref 47 · internal anchor

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation
cs.CL · 2026-03-03 · unverdicted · none · ref 45 · internal anchor

Realistic Handwritten Multi-Digit Writer (MDW) Number Recognition Challenges
cs.CV · 2025-11-30 · unverdicted · none · ref 4 · internal anchor

Reducing Class Bias In Data-Balanced Datasets Through Hardness-Based Resampling
cs.LG · 2025-04-09 · unverdicted · none · ref 58 · internal anchor

Model Merging: Foundations and Algorithms
cs.LG · 2026-05-02 · unverdicted · none · ref 31

Possibilistic Predictive Uncertainty for Deep Learning
cs.LG · 2026-05-01 · unverdicted · none · ref 12

Efficient Mutation Testing of Quantum Machine Learning Models
quant-ph · 2026-04-30 · unverdicted · none · ref 20

Controlled Steering-Based State Preparation for Adversarial-Robust Quantum Machine Learning
quant-ph · 2026-04-30 · unverdicted · none · ref 24

Task Alignment: A simple and effective proxy for model merging in computer vision
cs.CV · 2026-04-14 · unverdicted · none · ref 11

Continual Distillation of Teachers from Different Domains
cs.LG · 2026-04-10 · conditional · none · ref 6

Algebraic Machine Learning for Small-to-Medium Datasets Is Competitive against Strong Standard Baselines
cs.LG · 2026-05-21 · unverdicted · none · ref 27 · internal anchor

Stimulus symmetries can confound representational similarity analyses
q-bio.NC · 2026-05-20 · unverdicted · none · ref 23 · internal anchor

Risk-Consistent Multiclass Learning from Random Label-Subset Membership Queries
cs.LG · 2026-05-08 · unverdicted · none · ref 29

Dendritic Neural Networks with Equilibrium Propagation
cs.LG · 2026-05-01 · unverdicted · none · ref 14

Context-Aware Multipath Networks
cs.CV · 2019-07-26 · unverdicted · none · ref 3 · internal anchor

Unlocking the Potential of Continual Model Merging: An ODE Perspective
cs.LG · 2026-05-19 · unreviewed · ref 2 · 2 links · internal anchor
