pith. sign in

arxiv: 2606.20550 · v1 · pith:D3SXP7EBnew · submitted 2026-06-18 · 💻 cs.DL · cs.HC· cs.IR

Easy Reads: A Python program for making Scientific Papers on arXiv more Reader Friendly and Accessible

Pith reviewed 2026-06-26 14:50 UTC · model grok-4.3

classification 💻 cs.DL cs.HCcs.IR
keywords arxivtexreadabilitypythonaccessibilityscientific papersformatting
0
0 comments X

The pith

A Python program fetches arXiv papers and edits their TeX source to let users set larger fonts and fewer columns.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces Easy Reads, an open-source Python tool that takes an arXiv URL and automatically downloads the TeX source. It then applies user-specified changes to font size and column count before producing a new version of the paper. The stated goal is to reduce the strain from dense, small-font, double-column layouts common in scientific articles. This is presented as a practical step toward making open-access papers more usable for a wider range of readers. The work focuses on end-to-end automation from URL to reformatted output.

Core claim

Easy Reads is an automated, end-to-end Python program that fetches a paper from arXiv via its URL, processes the source TeX file, and allows custom formatting of font size and number of columns to improve readability and accessibility while keeping the original content intact.

What carries the argument

The Python script that retrieves the TeX source from arXiv and modifies its formatting commands for font size and column layout.

If this is right

  • Readers can obtain single-column versions with larger text from any double-column arXiv paper.
  • The process runs from a single URL input to a ready-to-read output file.
  • Because the tool is open-source, users can adapt it for papers in specific fields such as physics.
  • The approach targets the common formatting features that make papers compact but hard to read.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Extending the script to handle additional layout elements such as figure placement could broaden its usefulness.
  • A web interface or browser extension built on the same logic would remove the need to run Python locally.
  • Testing across hundreds of papers from different arXiv categories would show how often manual fixes are still required.

Load-bearing premise

Automatically editing the TeX source will produce clean, usable output without introducing errors, layout problems, or loss of content.

What would settle it

Apply the program to a sample of arXiv papers and inspect whether the generated files contain formatting errors, missing figures, or broken equations.

read the original abstract

Scientific papers are frequently dense and characterized by features such as small fonts and line spacing, double columns of text, and tightly arranged figures. While these features make papers more compact, they can hinder readability, make them less accessible, and can strain the reader. arXiv is a premier open-access repository for scientific papers across different fields and is used extensively by researchers, including those in the physics and astrophysics communities. Easy Reads is an automated, end-to-end, open-source Python program that helps address the stated challenge by making papers from arXiv more reader-friendly and accessible. Easy Reads can automatically fetch a paper from arXiv via its URL and work with the source TeX file to allow custom formatting of the paper features, primarily the font size, and the number of columns used. The main goal of Easy Reads is to facilitate ease of reading of scientific papers.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The paper describes Easy Reads, an open-source Python program that automatically fetches a scientific paper from arXiv via its URL and modifies the source TeX file to support custom font sizes and column counts, with the goal of improving readability and accessibility of dense, double-column papers.

Significance. If the program reliably handles arbitrary arXiv TeX sources without introducing compilation failures or content/layout artifacts, it could provide a practical tool for enhancing accessibility, especially for readers preferring larger fonts or single-column layouts. The open-source nature is a positive, but the complete absence of code, parsing details, error handling, or any validation means the significance cannot be assessed beyond the stated intent.

major comments (2)
  1. [Abstract] Abstract: the central claim that Easy Reads is an 'automated, end-to-end' program that 'work[s] with the source TeX file' to enable custom formatting is unsupported, as the manuscript supplies no description of the parsing strategy, preamble editing logic, handling of class options (e.g., revtex, aastex), or preservation of figures/tables/macros.
  2. [Abstract] Abstract: no evaluation, user testing, compilation success rates on real arXiv submissions, or demonstration of output quality is provided, leaving the assumption that reformatting improves readability without artifacts untested and the tool's functionality unverified.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed comments. The manuscript is a short tool description paper, and we agree that additional technical details and basic validation are needed to support the claims. We will revise the manuscript to address both points.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that Easy Reads is an 'automated, end-to-end' program that 'work[s] with the source TeX file' to enable custom formatting is unsupported, as the manuscript supplies no description of the parsing strategy, preamble editing logic, handling of class options (e.g., revtex, aastex), or preservation of figures/tables/macros.

    Authors: We agree the current manuscript provides no implementation details. The full source code is open-source on GitHub, but the paper itself only states the high-level goal. In revision we will add a dedicated Implementation section describing the TeX parsing approach (using regex and AST-based modifications where possible), preamble editing logic, handling of common classes such as revtex and aastex, and how figures, tables, and user macros are preserved. This will directly support the 'automated, end-to-end' claim. revision: yes

  2. Referee: [Abstract] Abstract: no evaluation, user testing, compilation success rates on real arXiv submissions, or demonstration of output quality is provided, leaving the assumption that reformatting improves readability without artifacts untested and the tool's functionality unverified.

    Authors: We acknowledge that the manuscript contains no empirical evaluation or success-rate statistics. As a tool-description paper the focus was on availability rather than benchmarking. In the revised version we will include a short Validation section reporting compilation success on a curated set of 20 recent arXiv submissions across physics and computer science, note observed failure modes, and state the limitations of the current approach. Full user studies remain outside the scope of this work. revision: yes

Circularity Check

0 steps flagged

No circularity: tool-description paper with no derivations or self-referential claims

full rationale

The manuscript is a description of an open-source Python program that fetches arXiv TeX sources and edits font/column settings. No equations, predictions, fitted parameters, or derivation chain exist. No self-citations are invoked as load-bearing premises. The central claim is a functional description rather than a result derived from inputs, so no step reduces to its own construction. This matches the default expectation of a non-circular paper.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No mathematical model, free parameters, axioms, or invented entities are introduced. The contribution is a software utility whose value depends on implementation details not detailed in the abstract.

pith-pipeline@v0.9.1-grok · 5678 in / 994 out tokens · 22780 ms · 2026-06-26T14:50:18.845841+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

39 extracted references · 30 canonical work pages

  1. [2]

    Impacting literacy through the power of a font:

    Seward, Reneé and Breit, Allison and Keelor, Jennifer and Halovick, Jenny and Creaghead, Nancy and Fernández, Oscar and Srivastava, Akshat and Medrano, Frida and Roberts, Spencer and Healey, Shannon and Plattner, James and Juju Stojanovic, Lepa , month = jun, year =. Impacting literacy through the power of a font:. doi:10.21606/drs.2024.303 , language =

  2. [3]

    Contemporary Educational Technology , author =

    The. Contemporary Educational Technology , author =. 2014 , file =. doi:10.30935/cedtech/6122 , abstract =

  3. [4]

    The Electronic Library , author =

    How reading in single- and multiple-column types influence our cognitive load: an. The Electronic Library , author =. 2019 , pages =. doi:10.1108/EL-01-2019-0006 , abstract =

  4. [5]

    Optimizing

    Teixeira, Ana Rita and Brito-Costa, Sónia and de Almeida, Hugo , editor =. Optimizing. Human. 2024 , keywords =. doi:10.1007/978-3-031-60107-1_5 , abstract =

  5. [6]

    How to make open-source code citable with

  6. [7]

    How to choose a license for open scientific data and code? , url =

  7. [8]

    ACM Computing Surveys , author =

    Reading text from computer screens , volume =. ACM Computing Surveys , author =. 1987 , pages =. doi:10.1145/45075.46162 , abstract =

  8. [9]

    Australian Academic & Research Libraries , author =

    Screen and. Australian Academic & Research Libraries , author =. 2016 , pages =. doi:10.1080/00048623.2016.1227661 , abstract =

  9. [10]

    Annals of Library and Information Studies , month = mar, year =

    Reading on paper and screens: advantages, disadvantages, and digital divide , volume =. Annals of Library and Information Studies , month = mar, year =. doi:10.56042/alis.v69i1.54490 , language =

  10. [11]

    International Journal of Human--Computer Interaction , volume =

    Advantages of print reading over screen reading: A comparison of visual patterns, reading performance, and reading attitudes across paper, computers, and tablets , author =. International Journal of Human--Computer Interaction , volume =. 2021 , month = oct, doi =

  11. [12]

    Displays , author =

    Reading performance and visual fatigue when using electronic paper displays in long-duration reading tasks under various lighting conditions , volume =. Displays , author =. 2013 , pages =. doi:10.1016/j.displa.2013.06.001 , abstract =

  12. [13]

    Document Design , author =

  13. [14]

    The Electronic Library , author =

    A comparison of the influence of electronic books and paper books on reading comprehension, eye fatigue, and perception , volume =. The Electronic Library , author =. 2012 , pages =. doi:10.1108/02640471211241663 , abstract =

  14. [15]

    ACM Transactions on Computer-Human Interaction , author =

    Towards. ACM Transactions on Computer-Human Interaction , author =. 2022 , pages =. doi:10.1145/3502222 , abstract =

  15. [16]

    Ophthalmic and Physiological Optics , author =

    The effect of font and line width on reading speed in people with mild to moderate vision loss , volume =. Ophthalmic and Physiological Optics , author =. 2006 , pages =. doi:10.1111/j.1475-1313.2006.00409.x , abstract =

  16. [17]

    Frontiers in Psychology , author =

    The. Frontiers in Psychology , author =. 2018 , pages =. doi:10.3389/fpsyg.2018.01577 , abstract =

  17. [18]

    Darroch, Iain and Goodman, Joy and Brewster, Stephen and Gray, Phil , editor =. The. Human-. 2005 , note =. doi:10.1007/11555261_23 , abstract =

  18. [19]

    Rello, Luz and Pielot, Martin and Marcos, Mari-Carmen , month = may, year =. Make. Proceedings of the 2016

  19. [20]

    PLoS ONE , author =

    The. PLoS ONE , author =. 2013 , pages =. doi:10.1371/journal.pone.0074061 , abstract =

  20. [21]

    Scrolling behaviour with single- and multi-column layout , isbn =

    Braganza, Cameron and Marriott, Kim and Moulder, Peter and Wybrow, Michael and Dwyer, Tim , month = apr, year =. Scrolling behaviour with single- and multi-column layout , isbn =. Proceedings of the 18th international conference on. doi:10.1145/1526709.1526821 , abstract =

  21. [22]

    Chung, Susana T L , file =. The

  22. [23]

    The Journal of Open Source Software , keywords =

    Journal of Open Source Software , author =. 2021 , pages =. doi:10.21105/joss.02825 , abstract =

  23. [24]

    , month = jun, year =

    Pasha, Imad and Miller, Tim B. , month = jun, year =. pysersic:. doi:10.48550/arXiv.2306.05454 , abstract =

  24. [25]

    Journal of Open Source Software , author =

    The. Journal of Open Source Software , author =. 2020 , note =. doi:10.21105/joss.02554 , abstract =

  25. [26]

    Clinical and Experimental Optometry , author =

    Management of digital eye strain , volume =. Clinical and Experimental Optometry , author =. 2019 , pages =. doi:10.1111/cxo.12798 , abstract =

  26. [27]

    Computer vision syndrome (a.k.a

    Rosenfield, Mark , file =. Computer vision syndrome (a.k.a. digital eye strain) , abstract =

  27. [28]

    BMJ Open Ophthalmology , author =

    Digital eye strain: prevalence, measurement and amelioration , volume =. BMJ Open Ophthalmology , author =. 2018 , pages =. doi:10.1136/bmjophth-2018-000146 , abstract =

  28. [29]

    Work , author =

    Computer vision syndrome:. Work , author =. 2015 , pages =. doi:10.3233/WOR-152162 , abstract =

  29. [30]

    Displays , author =

    Legibility and visual fatigue affected by text direction, screen size and character size on color. Displays , author =. 2013 , pages =. doi:10.1016/j.displa.2012.11.006 , abstract =

  30. [31]

    She Ji: The Journal of Design, Economics, and Innovation , author =

    Readable,. She Ji: The Journal of Design, Economics, and Innovation , author =. 2021 , pages =. doi:10.1016/j.sheji.2021.10.005 , abstract =

  31. [32]

    Ophthalmology and Therapy , author =

    Digital. Ophthalmology and Therapy , author =. 2022 , pages =. doi:10.1007/s40123-022-00540-9 , language =

  32. [33]

    Computer Methods and Programs in Biomedicine 233, 107493

    Digital eye strain in young screen users:. Preventive Medicine , author =. 2023 , pages =. doi:10.1016/j.ypmed.2023.107493 , abstract =

  33. [34]

    British Journal of Educational Technology , author =

    The phenomenology of on‐screen reading:. British Journal of Educational Technology , author =. 2011 , pages =. doi:10.1111/j.1467-8535.2009.01043.x , abstract =

  34. [35]

    portal: Libraries and the Academy , author =

    The. portal: Libraries and the Academy , author =. 2015 , pages =. doi:10.1353/pla.2015.0013 , abstract =

  35. [37]

    Ophthalmic and Physiological Optics , author =

    Computer vision syndrome: a review of ocular causes and potential treatments , volume =. Ophthalmic and Physiological Optics , author =. 2011 , pages =. doi:10.1111/j.1475-1313.2011.00834.x , abstract =

  36. [38]

    Metacognition and Learning , author =

    Association and dissociation between judgments of learning and memory:. Metacognition and Learning , author =. 2022 , pages =. doi:10.1007/s11409-021-09287-3 , abstract =

  37. [39]

    Proceedings of the National Academy of Sciences , author =

    Reconciling print-size and display-size constraints on reading , volume =. Proceedings of the National Academy of Sciences , author =. 2020 , pages =

  38. [40]

    Journal of Human Ergology , volume =

    Selection of the optimum font type and size interface for on screen continuous reading by young adults: an ergonomic approach , author =. Journal of Human Ergology , volume =. 2011 , month = dec, doi =

  39. [41]

    Proceedings of the Human Factors and Ergonomics Society Annual Meeting , author =

    Eye. Proceedings of the Human Factors and Ergonomics Society Annual Meeting , author =. 2008 , pages =. doi:10.1177/154193120805201962 , abstract =