Assessing socio-economic climate impacts from text data

Agla\'e J\'ez\'equel; Barbara Pernici; Bram Valkenborg; Brielen Madureira; Damien Delforge; Gabriele Messori; Jakob Zscheischler; Jan Sodoge; Jingxian Wang; Joakim Nivre

arxiv: 2605.20793 · v1 · pith:LFM7OQS2new · submitted 2026-05-20 · 💻 cs.CL

Assessing socio-economic climate impacts from text data

Mariana Madruga de Brito , Brielen Madureira , Ta\'is Maria Nunes Carvalho , Damien Delforge , Agla\'e J\'ez\'equel , Murathan Kurfal{\i} , Ni Li , Gabriele Messori

show 10 more authors

Joakim Nivre Barbara Pernici Niko Speybroeck Stefano Terzi Wim Thiery Bram Valkenborg Jingxian Wang Shorouq Zahra Jakob Zscheischler Jan Sodoge

This is my paper

Pith reviewed 2026-05-21 05:10 UTC · model grok-4.3

classification 💻 cs.CL

keywords text-as-datasocio-economic impactsclimate hazardsnatural language processingdisaster risk managementimpact assessmentattribution studiesmethodological recommendations

0 comments

The pith

Synthesizing common practices and challenges in text-as-data methods enables more transparent and comparable datasets for socio-economic impacts of climate hazards.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that advances in natural language processing now support the extraction of socio-economic impact information from large volumes of news, social media, and reports on events like floods and droughts. Research in this area remains scattered, with inconsistent ways of defining impacts, correcting for biases in timing and location, and choosing analysis techniques. By compiling existing approaches, spelling out the specific difficulties tied to text sources, and laying out targeted recommendations, the work aims to guide the creation of stronger datasets. These datasets would feed more reliably into studies of disaster risk and efforts to connect hazards with their human and economic effects. A reader following the logic would expect clearer standards to reduce fragmentation and improve the evidence base for practical decisions.

Core claim

The central claim is that the absence of shared guidelines for defining impacts, managing temporal and spatial biases, and selecting modeling strategies in text-as-data work on socio-economic climate effects creates limits on transparency and cross-study comparability, and that a synthesis of practices plus concrete recommendations can directly support the building of robust datasets suited to disaster risk management and attribution studies.

What carries the argument

Synthesis of existing text-as-data practices paired with identification of domain-specific challenges in impact definition, bias correction, and post-processing, followed by recommendations to improve dataset construction.

If this is right

Text-derived datasets will exhibit greater transparency in how impacts are identified and quantified.
Comparability across studies will increase because shared practices reduce differences in bias handling and modeling choices.
Disaster risk management applications will receive more reliable input on the scale and distribution of socio-economic losses.
Attribution studies linking specific hazards to observed impacts will rest on datasets with clearer provenance and reduced methodological noise.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Standardized datasets created under these guidelines could support large-scale comparisons of impact patterns across different hazard types and regions.
The recommendations may prove especially useful when scaling up to newer large language models for automated extraction tasks.
Adoption could indirectly improve coordination between text-based impact research and traditional survey or satellite-based damage assessments.

Load-bearing premise

That researchers will adopt the identified challenges and proposed recommendations to produce measurably more consistent and usable impact datasets without separate empirical tests of those recommendations.

What would settle it

An experiment that applies the paper's recommendations to multiple independent text corpora on the same climate events and then measures whether the resulting impact datasets show higher agreement on event classification, timing, and magnitude than datasets built without the recommendations.

read the original abstract

Recent advances in natural language processing (NLP) and large language models (LLMs) have enabled the systematic use of large-scale textual data from news, social media, and reports to create datasets with socio-economic impacts of climate hazards such as floods, droughts, storms, and multi-hazard events. As the field of text-as-data for impact assessment expands, so does its methodological complexity. Yet research remains fragmented, with no clear guidelines for defining what constitutes an impact, handling temporal and spatial biases, and selecting appropriate modeling and post-processing strategies. This lack of coherence limits transparency and comparability across studies. Here, we address this gap by synthesising common practices, describing key challenges specific to the use of text-as-data methods for analyzing socio-economic impact data, and proposing recommendations to address them. By providing guidance on best practices, we aim to support the construction of robust text-derived socio-economic impact datasets that can more accurately inform disaster risk management and attribution studies.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript synthesizes common practices in applying text-as-data methods (NLP and LLMs) to extract socio-economic impacts of climate hazards from news, social media, and reports. It identifies challenges in impact definition, temporal/spatial biases, and modeling/post-processing choices, then proposes recommendations intended to improve transparency and cross-study comparability for use in disaster risk management and attribution research.

Significance. If the synthesis is accurate and the recommendations are adopted, the work could reduce fragmentation in an emerging interdisciplinary area and support more consistent text-derived impact datasets. The paper usefully maps an otherwise scattered literature and explicitly credits the value of existing studies while highlighting gaps in standardization.

major comments (2)

[§4] §4 (Recommendations): The central claim that the proposed recommendations will support construction of more robust datasets rests on the untested assumption that guidance alone produces measurable gains in transparency or bias reduction. No re-processing of prior studies, before/after metric comparisons (e.g., inter-annotator agreement or alignment with ground-truth records), or pilot validation is reported.
[§3.2] §3.2 (Challenges, temporal/spatial biases subsection): The discussion of biases is descriptive but does not quantify their prevalence across the reviewed studies or demonstrate how the listed recommendations would mitigate them in practice.

minor comments (2)

[Abstract] The abstract and introduction would benefit from an explicit statement of how many studies were reviewed and the search strategy used for the synthesis.
[§2] Notation for impact metrics (e.g., any abbreviations introduced in §2) should be defined at first use and collected in a table for reference.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed report. The comments correctly identify that this is a synthesis paper whose recommendations are not accompanied by new empirical validation. We address each major comment below and indicate the revisions we will make.

read point-by-point responses

Referee: [§4] §4 (Recommendations): The central claim that the proposed recommendations will support construction of more robust datasets rests on the untested assumption that guidance alone produces measurable gains in transparency or bias reduction. No re-processing of prior studies, before/after metric comparisons (e.g., inter-annotator agreement or alignment with ground-truth records), or pilot validation is reported.

Authors: We agree that the manuscript does not contain new empirical tests of the recommendations. As a synthesis of existing practices and challenges, the paper derives recommendations from patterns observed across the reviewed literature rather than from controlled before/after experiments. We will revise §4 and add a new Limitations subsection that explicitly states the recommendations remain untested in this work, clarifies that the language is 'aim to support' rather than 'will produce', and outlines concrete avenues for future validation (e.g., re-annotating a small set of studies with and without the proposed guidelines and reporting changes in inter-annotator agreement or alignment with official loss records). revision: partial
Referee: [§3.2] §3.2 (Challenges, temporal/spatial biases subsection): The discussion of biases is descriptive but does not quantify their prevalence across the reviewed studies or demonstrate how the listed recommendations would mitigate them in practice.

Authors: The subsection is intentionally qualitative because many source papers do not report bias diagnostics in a standardized way. To strengthen the section we will add a summary table that tallies, for each reviewed study, whether temporal or spatial bias issues were discussed or mitigated. We will also insert explicit cross-references from each recommendation in §4 back to the bias types it targets, citing concrete examples from the literature where analogous steps improved dataset quality. revision: yes

Circularity Check

0 steps flagged

No circularity: literature synthesis with no derivations or fitted predictions

full rationale

The paper is a review synthesizing common practices in text-as-data methods for socio-economic climate impacts, identifying challenges such as impact definition and biases, and offering recommendations. It contains no equations, derivations, data-fitting steps, or predictions that could reduce to inputs by construction. No self-definitional loops, fitted-input predictions, or load-bearing self-citations appear in the described structure. The central claim rests on external literature synthesis rather than internal redefinition, making the work self-contained against external benchmarks with minimal circularity burden.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Review and synthesis paper; draws on domain assumptions about the state of the field rather than introducing new fitted parameters or entities.

axioms (1)

domain assumption The field of text-as-data for socio-economic climate impact assessment is fragmented, with no clear guidelines for defining impacts, handling temporal and spatial biases, or selecting modeling and post-processing strategies.
This premise is explicitly stated in the abstract as the motivation for the work.

pith-pipeline@v0.9.0 · 5774 in / 1214 out tokens · 49553 ms · 2026-05-21T05:10:50.420386+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

3 extracted references · 3 canonical work pages

[1]

Global Risks Report 2025

World Economic Forum. Global Risks Report 2025. https://reports.weforum.org/docs/WEF_Global_Risks_Report_2025.pdf. 2. Naumann, G., Cammalleri, C., Mentaschi, L. & Feyen, L. Increased economic drought impacts in Europe with anthropogenic warming. Nature Climate Change 11 , 485–491 (2021). 3. Donatti, C. I. et al. Global hotspots of climate-related disaster...

work page doi:10.21203/rs.3.rs-6095740/v1 2025
[2]

Fu, X. et al. Community evolutional network for situation awareness using social media. IEEE Access 8 , 39225–39240 (2020). 28. Xia, Y. et al. A question and answering service of typhoon disasters based on the T5 large language model. ISPRS Int. J. Geoinf. 13 , 165 (2024). 29. Karami, A., Shah, V., Vaezi, R. & Bansal, A. Twitter speaks: A case of national...

work page doi:10.1145/3328529.3328545 2020
[3]

several hazards

Singer, E., Endreny, P. & Glassman, M. B. Media coverage of disasters: Effect of geographic location. Journalism Quarterly 68 , 48–58 (1991). 60. Zhang, C., Yang, Y. & Mostafavi, A. Revealing Unfairness in social media contributors’ attention to vulnerable urban areas during disasters. Int. J. Disaster Risk Reduct. 58 , 102160 (2021). 61. Huang, R., Cases...

work page doi:10.18653/v1/d16-1005 1991

[1] [1]

Global Risks Report 2025

World Economic Forum. Global Risks Report 2025. https://reports.weforum.org/docs/WEF_Global_Risks_Report_2025.pdf. 2. Naumann, G., Cammalleri, C., Mentaschi, L. & Feyen, L. Increased economic drought impacts in Europe with anthropogenic warming. Nature Climate Change 11 , 485–491 (2021). 3. Donatti, C. I. et al. Global hotspots of climate-related disaster...

work page doi:10.21203/rs.3.rs-6095740/v1 2025

[2] [2]

Fu, X. et al. Community evolutional network for situation awareness using social media. IEEE Access 8 , 39225–39240 (2020). 28. Xia, Y. et al. A question and answering service of typhoon disasters based on the T5 large language model. ISPRS Int. J. Geoinf. 13 , 165 (2024). 29. Karami, A., Shah, V., Vaezi, R. & Bansal, A. Twitter speaks: A case of national...

work page doi:10.1145/3328529.3328545 2020

[3] [3]

several hazards

Singer, E., Endreny, P. & Glassman, M. B. Media coverage of disasters: Effect of geographic location. Journalism Quarterly 68 , 48–58 (1991). 60. Zhang, C., Yang, Y. & Mostafavi, A. Revealing Unfairness in social media contributors’ attention to vulnerable urban areas during disasters. Int. J. Disaster Risk Reduct. 58 , 102160 (2021). 61. Huang, R., Cases...

work page doi:10.18653/v1/d16-1005 1991