pith. sign in

Trafilatura: A web scraping library and command-line tool for text discovery and extraction

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

citation-role summary

method 1 other 1

citation-polarity summary

fields

cs.CL 1 cs.LG 1

years

2024 2

verdicts

UNVERDICTED 2

polarities

unclear 1 use method 1

representative citing papers

InternLM2 Technical Report

cs.CL · 2024-03-26 · unverdicted · novelty 5.0

InternLM2 is a new open-source LLM that outperforms prior versions on 30 benchmarks and long-context tasks through scaled pre-training to 32k tokens and a conditional online RLHF alignment strategy.

citing papers explorer

Showing 2 of 2 citing papers.

  • DataComp-LM: In search of the next generation of training sets for language models cs.LG · 2024-06-17 · unverdicted · none · ref 17

    DCLM-Baseline dataset lets a 7B model reach 64% 5-shot MMLU accuracy after 2.6T tokens, beating prior open-data models by 6.6 points on MMLU with 40% less compute.

  • InternLM2 Technical Report cs.CL · 2024-03-26 · unverdicted · none · ref 171

    InternLM2 is a new open-source LLM that outperforms prior versions on 30 benchmarks and long-context tasks through scaled pre-training to 32k tokens and a conditional online RLHF alignment strategy.