{"record_type":"pith_number_record","schema_url":"https://pith.science/schemas/pith-number/v1.json","pith_number":"pith:2019:TWRCAKGS4IGSO7ICWGKMBLKNLO","short_pith_number":"pith:TWRCAKGS","schema_version":"1.0","canonical_sha256":"9da22028d2e20d277d02b194c0ad4d5ba8f27c7c55035039891dec57383ef5e4","source":{"kind":"arxiv","id":"1906.01830","version":1},"attestation_state":"computed","paper":{"title":"ArSentD-LEV: A Multi-Topic Corpus for Target-based Sentiment Analysis in Arabic Levantine Tweets","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"","cross_cats":["cs.IR","cs.LG","stat.ML"],"primary_cat":"cs.CL","authors_text":"(2) American University of Beirut, (3) American University of Beirut, (4) Qatar University, Alaa Khaddaj (2), Artificial Intelligence Laboratory, Beirut, Cambridge, Computer Engineering Department, Computer Science, Computer Science Department, Doha, Electrical, Engineering Department, Hazem Hajj (2), Khaled Bashir Shaban (4) ((1) MIT Computer Science, Lebanon, MA, Qatar), Ramy Baly (1), USA, Wassim El-Hajj (3)","submitted_at":"2019-05-25T13:31:52Z","abstract_excerpt":"Sentiment analysis is a highly subjective and challenging task. Its complexity further increases when applied to the Arabic language, mainly because of the large variety of dialects that are unstandardized and widely used in the Web, especially in social media. While many datasets have been released to train sentiment classifiers in Arabic, most of these datasets contain shallow annotation, only marking the sentiment of the text unit, as a word, a sentence or a document. In this paper, we present the Arabic Sentiment Twitter Dataset for the Levantine dialect (ArSenTD-LEV). Based on findings fr"},"verification_status":{"content_addressed":true,"pith_receipt":true,"author_attested":false,"weak_author_claims":0,"strong_author_claims":0,"externally_anchored":false,"storage_verified":false,"citation_signatures":0,"replication_records":0,"graph_snapshot":true,"references_resolved":false,"formal_links_present":false},"canonical_record":{"source":{"id":"1906.01830","kind":"arxiv","version":1},"metadata":{"license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","primary_cat":"cs.CL","submitted_at":"2019-05-25T13:31:52Z","cross_cats_sorted":["cs.IR","cs.LG","stat.ML"],"title_canon_sha256":"d6a9678a3cd64b571d59dad93354893c84d34b13e6e0f39d061da01bc0c44979","abstract_canon_sha256":"44f47ce8163aa60dbe626229cb8cb5967c3cd0d6d93fd6f51bf93b317aff185f"},"schema_version":"1.0"},"receipt":{"kind":"pith_receipt","key_id":"pith-v1-2026-05","algorithm":"ed25519","signed_at":"2026-05-17T23:44:06.093391Z","signature_b64":"+oeSm5GavwcRJi5OVR88KMljjwp3bDit8kTJIolWd0sYHmT27FIqVoL6weXCCk2NYNAAvz1GxvXW4oYqRrymAA==","signed_message":"canonical_sha256_bytes","builder_version":"pith-number-builder-2026-05-17-v1","receipt_version":"0.3","canonical_sha256":"9da22028d2e20d277d02b194c0ad4d5ba8f27c7c55035039891dec57383ef5e4","last_reissued_at":"2026-05-17T23:44:06.092756Z","signature_status":"signed_v1","first_computed_at":"2026-05-17T23:44:06.092756Z","public_key_fingerprint":"8d4b5ee74e4693bcd1df2446408b0d54"},"graph_snapshot":{"paper":{"title":"ArSentD-LEV: A Multi-Topic Corpus for Target-based Sentiment Analysis in Arabic Levantine Tweets","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"","cross_cats":["cs.IR","cs.LG","stat.ML"],"primary_cat":"cs.CL","authors_text":"(2) American University of Beirut, (3) American University of Beirut, (4) Qatar University, Alaa Khaddaj (2), Artificial Intelligence Laboratory, Beirut, Cambridge, Computer Engineering Department, Computer Science, Computer Science Department, Doha, Electrical, Engineering Department, Hazem Hajj (2), Khaled Bashir Shaban (4) ((1) MIT Computer Science, Lebanon, MA, Qatar), Ramy Baly (1), USA, Wassim El-Hajj (3)","submitted_at":"2019-05-25T13:31:52Z","abstract_excerpt":"Sentiment analysis is a highly subjective and challenging task. Its complexity further increases when applied to the Arabic language, mainly because of the large variety of dialects that are unstandardized and widely used in the Web, especially in social media. While many datasets have been released to train sentiment classifiers in Arabic, most of these datasets contain shallow annotation, only marking the sentiment of the text unit, as a word, a sentence or a document. In this paper, we present the Arabic Sentiment Twitter Dataset for the Levantine dialect (ArSenTD-LEV). Based on findings fr"},"claims":{"count":0,"items":[],"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"source":{"id":"1906.01830","kind":"arxiv","version":1},"verdict":{"id":null,"model_set":{},"created_at":null,"strongest_claim":"","one_line_summary":"","pipeline_version":null,"weakest_assumption":"","pith_extraction_headline":""},"references":{"count":0,"sample":[],"resolved_work":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57","internal_anchors":0},"formal_canon":{"evidence_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"author_claims":{"count":0,"strong_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"builder_version":"pith-number-builder-2026-05-17-v1"},"aliases":[{"alias_kind":"arxiv","alias_value":"1906.01830","created_at":"2026-05-17T23:44:06.092852+00:00"},{"alias_kind":"arxiv_version","alias_value":"1906.01830v1","created_at":"2026-05-17T23:44:06.092852+00:00"},{"alias_kind":"doi","alias_value":"10.48550/arxiv.1906.01830","created_at":"2026-05-17T23:44:06.092852+00:00"},{"alias_kind":"pith_short_12","alias_value":"TWRCAKGS4IGS","created_at":"2026-05-18T12:33:30.264802+00:00"},{"alias_kind":"pith_short_16","alias_value":"TWRCAKGS4IGSO7IC","created_at":"2026-05-18T12:33:30.264802+00:00"},{"alias_kind":"pith_short_8","alias_value":"TWRCAKGS","created_at":"2026-05-18T12:33:30.264802+00:00"}],"events":[],"event_summary":{},"paper_claims":[],"inbound_citations":{"count":0,"internal_anchor_count":0,"sample":[]},"formal_canon":{"evidence_count":0,"sample":[],"anchors":[]},"links":{"html":"https://pith.science/pith/TWRCAKGS4IGSO7ICWGKMBLKNLO","json":"https://pith.science/pith/TWRCAKGS4IGSO7ICWGKMBLKNLO.json","graph_json":"https://pith.science/api/pith-number/TWRCAKGS4IGSO7ICWGKMBLKNLO/graph.json","events_json":"https://pith.science/api/pith-number/TWRCAKGS4IGSO7ICWGKMBLKNLO/events.json","paper":"https://pith.science/paper/TWRCAKGS"},"agent_actions":{"view_html":"https://pith.science/pith/TWRCAKGS4IGSO7ICWGKMBLKNLO","download_json":"https://pith.science/pith/TWRCAKGS4IGSO7ICWGKMBLKNLO.json","view_paper":"https://pith.science/paper/TWRCAKGS","resolve_alias":"https://pith.science/api/pith-number/resolve?arxiv=1906.01830&json=true","fetch_graph":"https://pith.science/api/pith-number/TWRCAKGS4IGSO7ICWGKMBLKNLO/graph.json","fetch_events":"https://pith.science/api/pith-number/TWRCAKGS4IGSO7ICWGKMBLKNLO/events.json","actions":{"anchor_timestamp":"https://pith.science/pith/TWRCAKGS4IGSO7ICWGKMBLKNLO/action/timestamp_anchor","attest_storage":"https://pith.science/pith/TWRCAKGS4IGSO7ICWGKMBLKNLO/action/storage_attestation","attest_author":"https://pith.science/pith/TWRCAKGS4IGSO7ICWGKMBLKNLO/action/author_attestation","sign_citation":"https://pith.science/pith/TWRCAKGS4IGSO7ICWGKMBLKNLO/action/citation_signature","submit_replication":"https://pith.science/pith/TWRCAKGS4IGSO7ICWGKMBLKNLO/action/replication_record"}},"created_at":"2026-05-17T23:44:06.092852+00:00","updated_at":"2026-05-17T23:44:06.092852+00:00"}