
In 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, pages 836–838

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.CL 1

years

2025 1

verdicts

UNVERDICTED 1

representative citing papers

SEDD: Scalable and Efficient Dataset Deduplication with GPUs

cs.CL · 2025-01-02 · unverdicted · novelty 5.0

SEDD delivers a distributed GPU deduplication system that reports up to 158x speedup over CPU baselines and 7.8x over NeMo Curator on 30M documents while preserving MinHash fidelity above 0.95 Jaccard.

citing papers explorer

Showing 1 of 1 citing paper.

  • SEDD: Scalable and Efficient Dataset Deduplication with GPUs · cs.CL · 2025-01-02 · unverdicted · none · ref 10
