pith. sign in

arxiv: 2406.14075 · v2 · submitted 2024-06-20 · 💻 cs.CL

EXCEEDS: Extracting Complex Events via Nugget-based Grid Modeling in Scientific Domain

Pith reviewed 2026-05-23 23:38 UTC · model grok-4.3

classification 💻 cs.CL
keywords event extractionscientific domaingrid modelingnugget encodingdocument-level eventsSciEvents datasetcomplex eventsend-to-end framework
0
0 comments X

The pith

Encoding dense nuggets into a grid matrix allows end-to-end extraction of complex events from scientific documents.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper seeks to improve event extraction in the scientific domain, which features denser nuggets of information and more complex event structures than other fields. It introduces the SciEvents dataset containing 2,508 documents and 24,381 events, created through multi-stage annotation. The EXCEEDS framework encodes these nuggets into a grid matrix and treats extraction as a grid modeling task. This matters because it could make it possible to automatically parse the dense web of events in research papers, supporting better domain understanding and knowledge synthesis.

Core claim

By constructing a tailored dataset for scientific events and proposing an end-to-end framework that encodes dense nuggets into a grid matrix, the task of extracting complex, multi-event structures from documents is simplified to nugget-based grid modeling, leading to state-of-the-art results on the new benchmark.

What carries the argument

Nugget-based grid modeling, which encodes dense nuggets into a grid matrix to represent and extract complex events.

If this is right

  • Event extraction in science can be performed in a single end-to-end model rather than staged pipelines.
  • The SciEvents dataset serves as a public resource for developing and evaluating scientific event extraction systems.
  • Complex information forms in documents become manageable through matrix-based representation of nuggets.
  • Automated understanding of scientific literature gains a structured event layer.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Grid-based nugget encoding might transfer to other domains with high information density, such as legal or technical texts.
  • The approach highlights the value of treating information nuggets as the basic units for modeling rather than full sentences or paragraphs.
  • Document-level multi-event extraction could benefit from similar matrix simplifications in related NLP tasks.

Load-bearing premise

The multi-stage manual annotation and quality control process yields reliable labels that faithfully capture the denser nuggets and more complex information forms in scientific documents.

What would settle it

Running EXCEEDS on a new scientific corpus annotated independently would show performance no better than existing methods, or the annotation labels would vary substantially across different annotator teams.

Figures

Figures reproduced from arXiv: 2406.14075 by Bo Wang, Heyan Huang, Xian-Ling Mao, Xiao Liu, Yi-Fan Lu.

Figure 1
Figure 1. Figure 1: An Instance of Document-level Event Extraction [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Dataset Construction Process of SciEvents [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Distribution of Event Types in SciEvents [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Distribution of Argument Types in SciEvents [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: The Architecture of EXCEEDS In SciEvents, we add event correlation task to extract the hierarchy of events. Specifically, the trigger 𝑡𝑠 of sub-event 𝑒𝑠 = {𝑒𝑡𝑠 , 𝑡𝑠 , 𝐴𝑠 } will be regarded as an argument of main-event 𝑒𝑚 = {𝑒𝑡𝑚, 𝑡𝑚, 𝐴𝑚} with a certain argument type 𝑎𝑡𝑠 , i.e. {𝑡𝑠 , 𝑎𝑡𝑠 } ∈ 𝐴𝑚. 5 METHOD The architecture of our framework is illustrated in [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗
read the original abstract

It is crucial to understand a specific domain by events. Extensive event extraction research has been conducted in many domains such as news, finance, and biology. However, event extraction in scientific domain is still insufficiently supported by comprehensive datasets and tailored methods. Compared with other domains, scientific domain has two characteristics: (1) denser nuggets and events, and (2) more complex information forms. To solve the above problem, considering these two characteristics, we first construct SciEvents, a large-scale multi-event document-level dataset with a schema tailored for scientific domain. It consists of 2,508 documents and 24,381 events under multi-stage manual annotation and quality control. Then, we propose EXCEEDS, an end-to-end scientific event extraction framework by encoding dense nuggets into a grid matrix and simplifying complex event extraction as a nugget-based grid modeling task. Experiments on SciEvents demonstrate state-of-the-art performances of EXCEEDS. Both the SciEvents dataset and the EXCEEDS framework are released publicly to facilitate future research.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper claims that scientific-domain event extraction is underserved due to two characteristics—denser nuggets/events and more complex information forms—relative to news/finance/biology. It introduces SciEvents (2,508 documents, 24,381 events) constructed via multi-stage manual annotation with a tailored schema, proposes EXCEEDS (an end-to-end grid-modeling framework that encodes nuggets into a matrix and reduces complex extraction to nugget-based grid modeling), and reports SOTA results on SciEvents, with public release of both dataset and framework.

Significance. If the dataset reliably encodes the stated domain characteristics and the grid-modeling gains are attributable to them rather than annotation artifacts, the work supplies a needed benchmark and method for scientific event extraction. The explicit public release of the dataset and framework is a concrete strength that supports reproducibility and follow-on research.

major comments (2)
  1. [Dataset construction] Dataset construction section: the multi-stage manual annotation and quality control process is described (yielding 2,508 documents and 24,381 events) but supplies no inter-annotator agreement figures, no nugget-density statistics relative to other domains, and no error analysis on complex information forms. This is load-bearing for the central claim that SciEvents faithfully captures the two motivating characteristics so that EXCEEDS gains can be attributed to domain handling rather than label artifacts.
  2. [Experiments] Experiments section: the SOTA claim is asserted without reported metrics, baselines, or ablation results on the grid-modeling components (e.g., nugget encoding or matrix formulation), so the data-to-claim link for the framework cannot be verified from the supplied experimental details.
minor comments (1)
  1. [Abstract] Abstract states SOTA performance but contains no quantitative results, baselines, or ablation details; this reduces immediate clarity even if the full experimental section supplies them.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback and the recommendation for major revision. We address each major comment below and will revise the manuscript accordingly to strengthen the presentation of the dataset and experiments.

read point-by-point responses
  1. Referee: [Dataset construction] Dataset construction section: the multi-stage manual annotation and quality control process is described (yielding 2,508 documents and 24,381 events) but supplies no inter-annotator agreement figures, no nugget-density statistics relative to other domains, and no error analysis on complex information forms. This is load-bearing for the central claim that SciEvents faithfully captures the two motivating characteristics so that EXCEEDS gains can be attributed to domain handling rather than label artifacts.

    Authors: We agree that inter-annotator agreement figures, comparative nugget-density statistics, and error analysis are important to substantiate the dataset's domain characteristics. The multi-stage annotation process was intended to capture denser nuggets and complex forms, but these quantitative validations were not included. In the revised manuscript we will add IAA scores, nugget-density comparisons against existing datasets from other domains, and error analysis on complex information forms. revision: yes

  2. Referee: [Experiments] Experiments section: the SOTA claim is asserted without reported metrics, baselines, or ablation results on the grid-modeling components (e.g., nugget encoding or matrix formulation), so the data-to-claim link for the framework cannot be verified from the supplied experimental details.

    Authors: We acknowledge that the experimental details require expansion for full verifiability. While the manuscript reports state-of-the-art results on SciEvents, we will revise the Experiments section to explicitly list all evaluation metrics, the complete set of baselines, and ablation studies isolating the contributions of nugget encoding and the matrix formulation. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical dataset construction plus model evaluation on that dataset

full rationale

The paper constructs SciEvents via multi-stage manual annotation (2,508 docs, 24,381 events) and proposes EXCEEDS as a grid-modeling framework, then reports experimental SOTA results on SciEvents. No equations, derivations, fitted-parameter predictions, or self-citation chains appear in the provided text. The central claims rest on direct empirical measurement rather than any reduction of outputs to inputs by construction. This is the standard self-contained pattern for dataset+model papers and receives the default non-circularity finding.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Central claim rests on the assumption that the new dataset annotations are high-quality and that grid modeling is an appropriate simplification for scientific events; no free parameters, axioms, or invented entities are specified in the abstract.

axioms (1)
  • domain assumption Scientific domain exhibits denser nuggets and more complex information forms than other domains, requiring tailored datasets and methods.
    Invoked in abstract to motivate construction of SciEvents and EXCEEDS.

pith-pipeline@v0.9.0 · 5720 in / 1100 out tokens · 19052 ms · 2026-05-23T23:38:00.136422+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

63 extracted references · 63 canonical work pages

  1. [1]

    Jacqueline Aguilar, Charley Beller, Paul McNamee, Benjamin Van Durme, Stephanie Strassel, Zhiyi Song, and Joe Ellis. 2014. A Comparison of the Events and Relations Across ACE, ERE, TAC-KBP, and FrameNet Annotation Standards. In Proceedings of the Second Workshop on EVENTS: Definition, Detection, Corefer- ence, and Representation. Association for Computati...

  2. [2]

    Yunmo Chen, Tongfei Chen, Seth Ebner, Aaron Steven White, and Benjamin Van Durme. 2019. Reading the manual: Event extraction as definition compre- hension. arXiv preprint arXiv:1912.01586 (2019)

  3. [3]

    Xinya Du and Claire Cardie. 2020. Event extraction by answering (almost) natural questions. arXiv preprint arXiv:2004.13625 (2020)

  4. [4]

    Seth Ebner, Patrick Xia, Ryan Culkin, Kyle Rawlins, and Benjamin Van Durme

  5. [5]

    InProceedings of the 58th Annual Meeting of the Association for Computational Linguistics

    Multi-Sentence Argument Linking. InProceedings of the 58th Annual Meeting of the Association for Computational Linguistics . Association for Computational Linguistics, Online, 8057–8077. https://doi.org/10.18653/v1/2020.acl-main.718

  6. [6]

    Jun Gao, Huan Zhao, Changlong Yu, and Ruifeng Xu. 2023. Exploring the feasi- bility of chatgpt for event extraction. arXiv preprint arXiv:2303.03836 (2023)

  7. [7]

    Jun Gao, Huan Zhao, Yice Zhang, Wei Wang, Changlong Yu, and Ruifeng Xu

  8. [8]

    arXiv preprint arXiv:2310.05092 (2023)

    Benchmarking Large Language Models with Augmented Instructions for Fine-grained Information Extraction. arXiv preprint arXiv:2310.05092 (2023)

  9. [9]

    Jeremy Getman, Joe Ellis, Stephanie Strassel, Zhiyi Song, and Jennifer Tracey

  10. [10]

    Laying the Groundwork for Knowledge Base Population: Nine Years of Linguistic Resources for TAC KBP. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hi- toshi Isahara, Bente Maegaard, Joseph Mariani,...

  11. [11]

    Ralph Grishman. 2015. Information Extraction. IEEE Intelligent Systems 30, 5 (2015), 8–15. https://doi.org/10.1109/MIS.2015.68

  12. [12]

    Luis Guzman-Nateras, Viet Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, and Thien Nguyen. 2022. Event Detection for Suicide Understanding. In Findings of the Association for Computational Linguistics: NAACL 2022 . Association for Computational Linguistics, Seattle, United States, 1952–1961. https://doi.org/10. 18653/v1/2022.findings-naacl.150

  13. [13]

    Jin-Dong Kim, Yue Wang, Toshihisa Takagi, and Akinori Yonezawa. 2011. Overview of Genia Event Task in BioNLP Shared Task 2011. In Proceedings of BioNLP Shared Task 2011 Workshop. Association for Computational Linguistics, Portland, Oregon, USA, 7–15. https://aclanthology.org/W11-1802

  14. [14]

    Fayuan Li, Weihua Peng, Yuguang Chen, Quan Wang, Lu Pan, Yajuan Lyu, and Yong Zhu. 2020. Event extraction as multi-turn question answering. In Findings of the Association for Computational Linguistics: EMNLP 2020 . 829–838

  15. [15]

    Jingye Li, Hao Fei, Jiang Liu, Shengqiong Wu, Meishan Zhang, Chong Teng, Donghong Ji, and Fei Li. 2022. Unified named entity recognition as word-word re- lation classification. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 10965–10973

  16. [16]

    Qingquan Li, Qifan Zhang, Junjie Yao, and Yingjie Zhang. 2020. Event Extraction for Criminal Legal Text. In 2020 IEEE International Conference on Knowledge Graph (ICKG). 573–580. https://doi.org/10.1109/ICBK50248.2020.00086

  17. [17]

    Sha Li, Heng Ji, and Jiawei Han. 2021. Document-Level Event Argument Ex- traction by Conditional Generation. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Hu- man Language Technologies, Kristina Toutanova, Anna Rumshisky, Luke Zettle- moyer, Dilek Hakkani-Tur, Iz Beltagy, Steven Betha...

  18. [18]

    Xinyu Li, Fayuan Li, Lu Pan, Yuguang Chen, Weihua Peng, Quan Wang, Yajuan Lyu, and Yong Zhu. 2020. DuEE: a large-scale dataset for Chinese event extraction in real-world scenarios. In Natural Language Processing and Chinese Computing: 9th CCF International Conference, NLPCC 2020, Zhengzhou, China, October 14–18, 2020, Proceedings, Part II 9 . Springer, 534–545

  19. [19]

    Xin Liang, Dawei Cheng, Fangzhou Yang, Yifeng Luo, Weining Qian, and Aoying Zhou. 2020. F-HMTC: Detecting Financial Events for Investment Decisions Based on Neural Hierarchical Multi-Label Text Classification. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI-20 , Christian Bessiere (Ed.). International Jo...

  20. [20]

    Ying Lin, Heng Ji, Fei Huang, and Lingfei Wu. 2020. A Joint Neural Model for Information Extraction with Global Features. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics , Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel Tetreault (Eds.). Association for Computational Linguistics, Online, 7999–8009. https://d...

  21. [21]

    Ruibo Liu, Jason Wei, Chenyan Jia, and Soroush Vosoughi. 2021. Modulating Language Models with Emotions. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (Eds.). Association for Computational Linguistics, Online, 4332–4339. https://doi.org/10.18653/v1/2021.findings-acl.379

  22. [22]

    Xiao Liu, Heyan Huang, Ge Shi, and Bo Wang. 2022. Dynamic Prefix-Tuning for Generative Template-based Event Extraction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (Eds.). Association for Computational Linguistics, Dublin, Irel...

  23. [23]

    Xiao Liu, Heyan Huang, and Yue Zhang. 2019. Open Domain Event Extraction Using Neural Latent Variable Models. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics . Association for Computational Linguistics, Florence, Italy, 2860–2871. https://doi.org/10.18653/v1/P19-1276

  24. [24]

    Xiao Liu, Zhunchen Luo, and Heyan Huang. 2018. Jointly multiple events extraction via attention-based graph information aggregation. arXiv preprint arXiv:1809.09078 (2018)

  25. [25]

    Ilya Loshchilov and Frank Hutter. 2018. Decoupled Weight Decay Regularization. In International Conference on Learning Representations

  26. [26]

    Jie Lou, Yaojie Lu, Dai Dai, Wei Jia, Hongyu Lin, Xianpei Han, Le Sun, and Hua Wu. 2023. Universal Information Extraction as Unified Semantic Matching. arXiv preprint arXiv:2301.03282 (2023)

  27. [27]

    Yaojie Lu, Hongyu Lin, Jin Xu, Xianpei Han, Jialong Tang, Annan Li, Le Sun, Meng Liao, and Shaoyi Chen. 2021. Text2Event: Controllable sequence-to-structure generation for end-to-end event extraction. arXiv preprint arXiv:2106.09232 (2021)

  28. [28]

    Yaojie Lu, Qing Liu, Dai Dai, Xinyan Xiao, Hongyu Lin, Xianpei Han, Le Sun, and Hua Wu. 2022. Unified Structure Generation for Universal Information Extraction. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (Eds.). Association for...

  29. [29]

    Yaojie Lu, Qing Liu, Dai Dai, Xinyan Xiao, Hongyu Lin, Xianpei Han, Le Sun, and Hua Wu. 2022. Unified structure generation for universal information extraction. arXiv preprint arXiv:2203.12277 (2022)

  30. [30]

    Mingyu Derek Ma, Alexander Taylor, Wei Wang, and Nanyun Peng. 2023. DICE: Data-Efficient Clinical Event Extraction with Generative Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (Eds.). Association for Computational Linguistics,...

  31. [31]

    Yubo Ma, Zehao Wang, Yixin Cao, Mukai Li, Meiqi Chen, Kun Wang, and Jing Shao. 2022. Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument Extraction. InProceedings of the 60th Annual Meeting of the Asso- ciation for Computational Linguistics (Volume 1: Long Papers) , Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (Eds.)...

  32. [32]

    Hieu Man Duc Trong, Duc Trong Le, Amir Pouran Ben Veyseh, Thuat Nguyen, and Thien Huu Nguyen. 2020. Introducing a New Dataset for Event Detection in Cybersecurity Texts. InProceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 5381–5390. https://doi.org/10.18653/v1...

  33. [33]

    Claire Nédellec, Robert Bossy, Jin-Dong Kim, Jung-jae Kim, Tomoko Ohta, Sampo Pyysalo, and Pierre Zweigenbaum. 2013. Overview of BioNLP Shared Task

  34. [34]

    Association for Computational Linguistics, Sofia, Bulgaria, 1–7

    In Proceedings of the BioNLP Shared Task 2013 Workshop . Association for Computational Linguistics, Sofia, Bulgaria, 1–7. https://aclanthology.org/W13- 2001

  35. [35]

    Minh Van Nguyen, Viet Dac Lai, and Thien Huu Nguyen. 2021. Cross-Task Instance Representation Interactions and Label Dependencies for Joint Informa- tion Extraction with Graph Convolutional Networks. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Kristina T...

  36. [36]

    Thien Huu Nguyen, Kyunghyun Cho, and Ralph Grishman. 2016. Joint event extraction via recurrent neural networks. In Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies. 300–309

  37. [37]

    Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, and Nanyun Peng. 2023. GENEVA: Benchmarking Generalizability for Event Argument Ex- traction with Hundreds of Event Types and Argument Roles. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Vol- ume 1: Long Papers) , Anna Rogers, Jordan Boyd-Graber, and N...

  38. [38]

    Hao Peng, Xiaozhi Wang, Feng Yao, Kaisheng Zeng, Lei Hou, Juanzi Li, Zhiyuan Liu, and Weixing Shen. 2023. The Devil is in the Details: On the Pitfalls of Event Extraction Evaluation. In Findings of the Association for Computational Linguistics: ACL 2023, Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (Eds.). Association for Computational Linguistics,...

  39. [39]

    Amir Pouran Ben Veyseh, Minh Van Nguyen, Franck Dernoncourt, and Thien Nguyen. 2022. MINION: a Large-Scale and Diverse Dataset for Multilingual Event Detection. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . Association for Computational Linguistics, Seatt...

  40. [40]

    Bhanu Pratap Singh Rawat, Samuel Kovaly, Hong Yu, and Wilfred Pigeon. 2022. ScAN: Suicide Attempt and Ideation Events Dataset. InProceedings of the 2022 Con- ference of the North American Chapter of the Association for Computational Linguis- tics: Human Language Technologies. Association for Computational Linguistics, Seattle, United States, 1029–1040. ht...

  41. [41]

    Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri, Oier Lopez de Lacalle, German Rigau, and Eneko Agirre. 2023. GoLLIE: Annotation Guidelines improve Zero- Shot Information-Extraction. arXiv preprint arXiv:2310.03668 (2023)

  42. [42]

    Taneeya Satyapanich, Francis Ferraro, and Tim Finin. 2020. Casie: Extracting cybersecurity event information from text. In Proceedings of the AAAI conference on artificial intelligence, Vol. 34. 8749–8757

  43. [43]

    Shirong Shen, Guilin Qi, Zhen Li, Sheng Bi, and Lusheng Wang. 2020. Hierarchical Chinese Legal event extraction via Pedal Attention Mechanism. In Proceedings of the 28th International Conference on Computational Linguistics . International Committee on Computational Linguistics, Barcelona, Spain (Online), 100–113. https://doi.org/10.18653/v1/2020.coling-main.9

  44. [44]

    Matthew Sims, Jong Ho Park, and David Bamman. 2019. Literary Event Detection. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, Italy, 3623–3634. https://doi.org/10.18653/v1/P19-1353

  45. [45]

    Zhiyi Song, Ann Bies, Stephanie Strassel, Tom Riese, Justin Mott, Joe Ellis, Jonathan Wright, Seth Kulick, Neville Ryant, and Xiaoyi Ma. 2015. From Light to Rich ERE: Annotation of Entities, Relations, and Events. In Proceedings of the The 3rd Workshop on EVENTS: Definition, Detection, Coreference, and Repre- sentation. Association for Computational Lingu...

  46. [46]

    Zhaoyue Sun, Jiazheng Li, Gabriele Pergola, Byron Wallace, Bino John, Nigel Greene, Joseph Kim, and Yulan He. 2022. PHEE: A Dataset for Pharmacovigilance Event Extraction from Text. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing , Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (Eds.). Association for Computation...

  47. [47]

    Wei Tang, Benfeng Xu, Yuyue Zhao, Zhendong Mao, Yifeng Liu, Yong Liao, and Haiyong Xie. 2022. UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing , Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (Eds.). Association for Computationa...

  48. [48]

    MeiHan Tong, Bin Xu, Shuai Wang, Meihuan Han, Yixin Cao, Jiangqi Zhu, Siyu Chen, Lei Hou, and Juanzi Li. 2022. DocEE: A Large-Scale and Fine-grained Bench- mark for Document-level Event Extraction. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Hu- man Language Technologies. Associatio...

  49. [49]

    David Wadden, Ulme Wennberg, Yi Luan, and Hannaneh Hajishirzi. 2019. Entity, relation, and event extraction with contextualized span representations. arXiv preprint arXiv:1909.03546 (2019)

  50. [50]

    Christopher Walker, Stephanie Strassel, Julie Medero, and Kazuaki Maeda. 2006. ACE 2005 Multilingual Training Corpus. Techbical report, Linguistic Data Consor- tium (2006)

  51. [51]

    Bo Wang, Heyan Huang, Xiaochi Wei, Ge Shi, Xiao Liu, Chong Feng, Tong Zhou, Shuaiqiang Wang, and Dawei Yin. 2023. Boosting Event Extraction with Denoised Structure-to-Text Augmentation. InFindings of the Association for Computational Linguistics: ACL 2023, Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (Eds.). Association for Computational Linguistic...

  52. [52]

    Xiaozhi Wang, Ziqi Wang, Xu Han, Wangyi Jiang, Rong Han, Zhiyuan Liu, Juanzi Li, Peng Li, Yankai Lin, and Jie Zhou. 2020. MAVEN: A Massive General Domain Event Detection Dataset. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) . Association for Computational Linguistics, Online, 1652–1671. https://doi.org/...

  53. [53]

    Xiao Wang, Weikang Zhou, Can Zu, Han Xia, Tianze Chen, Yuansen Zhang, Rui Zheng, Junjie Ye, Qi Zhang, Tao Gui, et al. 2023. InstructUIE: Multi-task Instruc- tion Tuning for Unified Information Extraction. arXiv preprint arXiv:2304.08085 (2023)

  54. [54]

    Kaiwen Wei, Xian Sun, Zequn Zhang, Jingyuan Zhang, Guo Zhi, and Li Jin. 2021. Trigger is not sufficient: Exploiting frame-aware knowledge for implicit event argument extraction. In Proceedings of the 59th Annual Meeting of the Associa- tion for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: ...

  55. [55]

    Xiang Wei, Xingyu Cui, Ning Cheng, Xiaobin Wang, Xin Zhang, Shen Huang, Pengjun Xie, Jinan Xu, Yufeng Chen, Meishan Zhang, et al . 2023. Zero-shot information extraction via chatting with chatgpt. arXiv preprint arXiv:2302.10205 (2023)

  56. [56]

    Wei Xiang and Bang Wang. 2019. A Survey of Event Extraction From Text. IEEE Access 7 (2019), 173111–173137. https://doi.org/10.1109/ACCESS.2019.2956831

  57. [57]

    Hang Yang, Yubo Chen, Kang Liu, Yang Xiao, and Jun Zhao. 2018. DCFEE: A Document-level Chinese Financial Event Extraction System based on Automat- ically Labeled Training Data. In Proceedings of ACL 2018, System Demonstra- tions. Association for Computational Linguistics, Melbourne, Australia, 50–55. https://doi.org/10.18653/v1/P18-4009

  58. [58]

    Sen Yang, Dawei Feng, Linbo Qiao, Zhigang Kan, and Dongsheng Li. 2019. Ex- ploring pre-trained language models for event extraction and generation. In Proceedings of the 57th annual meeting of the association for computational lin- guistics. 5284–5294

  59. [59]

    Feng Yao, Chaojun Xiao, Xiaozhi Wang, Zhiyuan Liu, Lei Hou, Cunchao Tu, Juanzi Li, Yun Liu, Weixing Shen, and Maosong Sun. 2022. LEVEN: A Large- Scale Chinese Legal Event Detection Dataset. In Findings of the Association for Computational Linguistics: ACL 2022 . Association for Computational Linguistics, Dublin, Ireland, 183–201. https://doi.org/10.18653/...

  60. [60]

    Juntao Yu, Bernd Bohnet, and Massimo Poesio. 2020. Named Entity Recognition as Dependency Parsing. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel Tetreault (Eds.). Association for Computational Linguistics, Online, 6470–6476. https://doi.org/10.18653/v1/2020....

  61. [61]

    Tongtao Zhang, Heng Ji, and Avirup Sil. 2019. Joint entity and event extraction with generative adversarial imitation learning. Data Intelligence 1, 2 (2019), 99–120

  62. [62]

    Shun Zheng, Wei Cao, Wei Xu, and Jiang Bian. 2019. Doc2EDAG: An End-to-End Document-level Framework for Chinese Financial Event Extraction. In Proceed- ings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP- IJCNLP). Association for Computational Ling...

  63. [63]

    Yang Zhou, Yubo Chen, Jun Zhao, Yin Wu, Jiexin Xu, and Jinlong Li. 2021. What the role is vs. what plays the role: Semi-supervised event argument extraction via dual question answering. In Proceedings of the AAAI conference on artificial intelligence, Vol. 35. 14638–14646. A SCHEMA OF SCIEVENTS The schema of SciEvents consists of nugget types and event ty...