{"record_type":"pith_number_record","schema_url":"https://pith.science/schemas/pith-number/v1.json","pith_number":"pith:2026:GRSHXLAPPZ4K24XJVCFR4ZUMKG","short_pith_number":"pith:GRSHXLAP","schema_version":"1.0","canonical_sha256":"34647bac0f7e78ad72e9a88b1e668c51ae4341c83f72dc127016ea52fc3b95c5","source":{"kind":"arxiv","id":"2605.14355","version":1},"attestation_state":"computed","paper":{"title":"Herculean: An Agentic Benchmark for Financial Intelligence","license":"http://creativecommons.org/licenses/by/4.0/","headline":"","cross_cats":["cs.CL"],"primary_cat":"cs.AI","authors_text":"Alejandro Lopez-Lira, Anke Xu, Arman Cohan, Ayesha Gull, Fan Zhang, Fengbin Zhu, Fengran Mo, Fuyuan Lyu, Haohang Li, Haolun Wu, Huan He, Jerry Huang, Jiahuan Pei, Jian-Yun Nie, Jimin Huang, Junichi Tsujii, Kaleb E Smith, Lingfei Qian, Linhai Ma, Mingquan Lin, Mingyang Jiang, Mohsinul Kabir, Muhammad Usman Safder, Nuo Chen, Peng Lu, Polydoros Giannouris, Prayag Tiwari, Qiyuan Zhang, Rania Elbadry, Ruoyu Xiang, Shuyao Wang, Sophia Ananiadou, Tianshi Cai, Victor Gutierrez Basulto, Vincent Jim Zhang, Weijin Liu, Wenbo Cao, Xiao-Yang Liu, Xiaoyu Wang, Xi Chen, Xue Liu, Xueqing Peng, Xuguang Ai, Yangyang Yu, Yankai Chen, Yan Wang, Ye Yuan, Yi Han, Yijia Zhao, Yilun Zhao, Yixiang Zheng, Yonghan Yang, Youzhong Dong, Yuechen Jiang, Yuehua Tang, Yueru He, Yupeng Cao, Yuqing Guo, Yuyang Dai, Yuyan Wang, Zhiwei Liu, Zhuohan Xie, Zichen Zhao, Zimu Wang","submitted_at":"2026-05-14T04:30:49Z","abstract_excerpt":"As AI agents improve, the central question is no longer whether they can solve isolated well-defined financial tasks, but whether they can reliably carry out financial professional work. Existing financial benchmarks offer only a partial view of this ability, as they primarily evaluate static competencies such as question answering, retrieval, summarization, and classification. We introduce Herculean, the first skilled benchmark for agentic financial intelligence spanning four representative workflows, including Trading, Hedging, Market Insights, and Auditing. Each workflow is instantiated as "},"verification_status":{"content_addressed":true,"pith_receipt":true,"author_attested":false,"weak_author_claims":0,"strong_author_claims":0,"externally_anchored":false,"storage_verified":false,"citation_signatures":0,"replication_records":0,"graph_snapshot":true,"references_resolved":false,"formal_links_present":false},"canonical_record":{"source":{"id":"2605.14355","kind":"arxiv","version":1},"metadata":{"license":"http://creativecommons.org/licenses/by/4.0/","primary_cat":"cs.AI","submitted_at":"2026-05-14T04:30:49Z","cross_cats_sorted":["cs.CL"],"title_canon_sha256":"ac969ace12c79aa02d9cce5c1e3a2a4b633aa7a3083f41963e3572ed990b5d24","abstract_canon_sha256":"9bc9109498c100953b38d2e444f28f58328b29ffb99442d029d40c617df4b467"},"schema_version":"1.0"},"receipt":{"kind":"pith_receipt","key_id":"pith-v1-2026-05","algorithm":"ed25519","signed_at":"2026-05-17T23:39:08.017388Z","signature_b64":"eiuP0hIxgVAUKQz/xPierKUzkttg0Vt3j6beGlUo+MqZLFuivGku1vYfL9bNoNiuUimaEYzcaLeHmsIO5JyfBw==","signed_message":"canonical_sha256_bytes","builder_version":"pith-number-builder-2026-05-17-v1","receipt_version":"0.3","canonical_sha256":"34647bac0f7e78ad72e9a88b1e668c51ae4341c83f72dc127016ea52fc3b95c5","last_reissued_at":"2026-05-17T23:39:08.016685Z","signature_status":"signed_v1","first_computed_at":"2026-05-17T23:39:08.016685Z","public_key_fingerprint":"8d4b5ee74e4693bcd1df2446408b0d54"},"graph_snapshot":{"paper":{"title":"Herculean: An Agentic Benchmark for Financial Intelligence","license":"http://creativecommons.org/licenses/by/4.0/","headline":"","cross_cats":["cs.CL"],"primary_cat":"cs.AI","authors_text":"Alejandro Lopez-Lira, Anke Xu, Arman Cohan, Ayesha Gull, Fan Zhang, Fengbin Zhu, Fengran Mo, Fuyuan Lyu, Haohang Li, Haolun Wu, Huan He, Jerry Huang, Jiahuan Pei, Jian-Yun Nie, Jimin Huang, Junichi Tsujii, Kaleb E Smith, Lingfei Qian, Linhai Ma, Mingquan Lin, Mingyang Jiang, Mohsinul Kabir, Muhammad Usman Safder, Nuo Chen, Peng Lu, Polydoros Giannouris, Prayag Tiwari, Qiyuan Zhang, Rania Elbadry, Ruoyu Xiang, Shuyao Wang, Sophia Ananiadou, Tianshi Cai, Victor Gutierrez Basulto, Vincent Jim Zhang, Weijin Liu, Wenbo Cao, Xiao-Yang Liu, Xiaoyu Wang, Xi Chen, Xue Liu, Xueqing Peng, Xuguang Ai, Yangyang Yu, Yankai Chen, Yan Wang, Ye Yuan, Yi Han, Yijia Zhao, Yilun Zhao, Yixiang Zheng, Yonghan Yang, Youzhong Dong, Yuechen Jiang, Yuehua Tang, Yueru He, Yupeng Cao, Yuqing Guo, Yuyang Dai, Yuyan Wang, Zhiwei Liu, Zhuohan Xie, Zichen Zhao, Zimu Wang","submitted_at":"2026-05-14T04:30:49Z","abstract_excerpt":"As AI agents improve, the central question is no longer whether they can solve isolated well-defined financial tasks, but whether they can reliably carry out financial professional work. Existing financial benchmarks offer only a partial view of this ability, as they primarily evaluate static competencies such as question answering, retrieval, summarization, and classification. We introduce Herculean, the first skilled benchmark for agentic financial intelligence spanning four representative workflows, including Trading, Hedging, Market Insights, and Auditing. Each workflow is instantiated as "},"claims":{"count":0,"items":[],"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"source":{"id":"2605.14355","kind":"arxiv","version":1},"verdict":{"id":null,"model_set":{},"created_at":null,"strongest_claim":"","one_line_summary":"","pipeline_version":null,"weakest_assumption":"","pith_extraction_headline":""},"references":{"count":0,"sample":[],"resolved_work":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57","internal_anchors":0},"formal_canon":{"evidence_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"author_claims":{"count":0,"strong_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"builder_version":"pith-number-builder-2026-05-17-v1"},"aliases":[{"alias_kind":"arxiv","alias_value":"2605.14355","created_at":"2026-05-17T23:39:08.016798+00:00"},{"alias_kind":"arxiv_version","alias_value":"2605.14355v1","created_at":"2026-05-17T23:39:08.016798+00:00"},{"alias_kind":"doi","alias_value":"10.48550/arxiv.2605.14355","created_at":"2026-05-17T23:39:08.016798+00:00"},{"alias_kind":"pith_short_12","alias_value":"GRSHXLAPPZ4K","created_at":"2026-05-18T12:33:37.589309+00:00"},{"alias_kind":"pith_short_16","alias_value":"GRSHXLAPPZ4K24XJ","created_at":"2026-05-18T12:33:37.589309+00:00"},{"alias_kind":"pith_short_8","alias_value":"GRSHXLAP","created_at":"2026-05-18T12:33:37.589309+00:00"}],"events":[],"event_summary":{},"paper_claims":[],"inbound_citations":{"count":0,"internal_anchor_count":0,"sample":[]},"formal_canon":{"evidence_count":0,"sample":[],"anchors":[]},"links":{"html":"https://pith.science/pith/GRSHXLAPPZ4K24XJVCFR4ZUMKG","json":"https://pith.science/pith/GRSHXLAPPZ4K24XJVCFR4ZUMKG.json","graph_json":"https://pith.science/api/pith-number/GRSHXLAPPZ4K24XJVCFR4ZUMKG/graph.json","events_json":"https://pith.science/api/pith-number/GRSHXLAPPZ4K24XJVCFR4ZUMKG/events.json","paper":"https://pith.science/paper/GRSHXLAP"},"agent_actions":{"view_html":"https://pith.science/pith/GRSHXLAPPZ4K24XJVCFR4ZUMKG","download_json":"https://pith.science/pith/GRSHXLAPPZ4K24XJVCFR4ZUMKG.json","view_paper":"https://pith.science/paper/GRSHXLAP","resolve_alias":"https://pith.science/api/pith-number/resolve?arxiv=2605.14355&json=true","fetch_graph":"https://pith.science/api/pith-number/GRSHXLAPPZ4K24XJVCFR4ZUMKG/graph.json","fetch_events":"https://pith.science/api/pith-number/GRSHXLAPPZ4K24XJVCFR4ZUMKG/events.json","actions":{"anchor_timestamp":"https://pith.science/pith/GRSHXLAPPZ4K24XJVCFR4ZUMKG/action/timestamp_anchor","attest_storage":"https://pith.science/pith/GRSHXLAPPZ4K24XJVCFR4ZUMKG/action/storage_attestation","attest_author":"https://pith.science/pith/GRSHXLAPPZ4K24XJVCFR4ZUMKG/action/author_attestation","sign_citation":"https://pith.science/pith/GRSHXLAPPZ4K24XJVCFR4ZUMKG/action/citation_signature","submit_replication":"https://pith.science/pith/GRSHXLAPPZ4K24XJVCFR4ZUMKG/action/replication_record"}},"created_at":"2026-05-17T23:39:08.016798+00:00","updated_at":"2026-05-17T23:39:08.016798+00:00"}