Extracting Knowledge from an Arabic-English Machine-Readable Dictionary Using Information Extraction

Aly A. Fahmy; Diaa M. Fayed; Mohsen A. Rashwan; Wafaa K. Fayed

arxiv: 2606.28457 · v1 · pith:XTF2437Onew · submitted 2026-06-26 · 💻 cs.CL

Extracting Knowledge from an Arabic-English Machine-Readable Dictionary Using Information Extraction

Diaa M. Fayed , Aly A. Fahmy , Mohsen A. Rashwan , Wafaa K. Fayed This is my paper

Pith reviewed 2026-06-30 01:39 UTC · model grok-4.3

classification 💻 cs.CL

keywords Arabic-English dictionaryinformation extractionlexical knowledgemachine-readable dictionaryn-gram analysisKWIC analysissynonym extractionhyponym relations

0 comments

The pith

Hand-crafted rules based on n-gram and KWIC patterns extract lexical information from the Al-Mawrid Arabic-English dictionary with high precision.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The authors developed a method to automatically extract morphologic, syntactic, and semantic lexical information from a machine-readable version of the Arabic-English Al-Mawrid dictionary. They first applied n-gram analysis and key-word-in-context analysis to discover patterns that signal the target information. Hand-crafted rules then performed the extraction, supplemented by punctuation marks and heuristics to pull synonyms from subentries. The approach delivered high precision on all extracted types, high recall on synonyms, and lower recall on the remaining types. The work also documented that the dictionary contains substantial numbers of derivations, synonyms, domain labels, and hyponym or hypernym relations.

Core claim

By using n-gram and KWIC analysis to identify lexical patterns and then applying hand-crafted rule-based information extraction, the study extracted morphologic information such as derivations, syntactic information, and semantic information such as domain labels and hyponym/hypernym relations from the Al-Mawrid dictionary, while also harvesting synonyms through punctuation and heuristics.

What carries the argument

n-gram and KWIC pattern discovery followed by hand-crafted rule-based information extraction, which identifies morphologic, syntactic, or semantic information in dictionary entries.

If this is right

Large lexical resources for NLP can be built automatically from existing machine-readable dictionaries.
The Al-Mawrid dictionary supplies usable quantities of derivations as morphologic information.
Synonyms can be extracted reliably with high recall using simple punctuation heuristics.
Domain labels and hyponym/hypernym relations provide semantic structure that the method captures precisely.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same pattern-discovery step could be reused on other bilingual dictionaries to reduce manual rule writing.
Lower recall on some relation types points to an opportunity for adding more patterns or statistical methods to increase coverage.
The documented volume of relations suggests the dictionary could serve as a primary source for Arabic lexical databases without starting from scratch.

Load-bearing premise

The dictionary entries follow consistent enough formatting that the discovered n-gram patterns and hand-crafted rules match the intended information without substantial mismatches from variations.

What would settle it

A manual audit of a random sample of extracted items against the original dictionary text to measure whether the reported precision remains high and whether the counts of derivations and relations match the stated quantities.

read the original abstract

Natural language processing (NLP) applications need large and rich amount of linguistic knowledge. Furthermore, electronic language sources such as dictionaries, encyclopedia, and corpora became available. So, automatic methods are emerged to extract lexical information from those sources to overcome the knowledge acquisition bottleneck. We presented a method to automatically extract lexical information from a machine-readable version of the Arabic-English Al-Mawrid dictionary. We used n-gram analysis and key-word-in-context (KWIC) analysis to discover lexical patterns that manifest morphologic, syntactic, or semantic information. Then, we used hand-crafted rule-based information extraction to extract that information. Furthermore, we used punctuation marks and some heuristics to extract a set of synonyms in a subentry. This study registered high precision for all types of information, high recall for synonyms, and low recall for the other information. The study also showed that the Al-Mawrid has significant amount of derivations (morphologic information) and synonyms, domain labels, and hyponym/hypernym relations (semantic information).

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Routine IE methods applied to one dictionary with precision/recall claims that lack any evaluation details or error analysis.

read the letter

The paper runs n-gram analysis, KWIC, punctuation heuristics, and hand-crafted rules on the machine-readable Al-Mawrid Arabic-English dictionary to pull out derivations, synonyms, domain labels, and hyponym/hypernym relations. It reports high precision across the board, high recall only for synonyms, and low recall otherwise, plus some counts of what the dictionary contains.

Nothing in the method is new. These are standard techniques from decades of dictionary mining work. The actual output is an application to this specific resource and the lexical counts it yields.

The soft spot is the complete absence of evaluation details. The abstract states the precision and recall numbers but supplies no test set size, no description of how correctness was judged, no inter-annotator numbers, and no error analysis. Without those, the performance claims cannot be checked for bias or scope. The hand-crafted rules rest on the untested premise that surface patterns map cleanly to the intended categories; any formatting variation in the dictionary would directly affect the reported figures.

This is for Arabic NLP groups that need lexical data from this dictionary and might want the extracted relations if they were released. Readers seeking new algorithms or reproducible empirical results will find little of value.

I would not bring it to a reading group and would not cite it. It does not merit peer review in its current state because the central empirical claims cannot be assessed.

Referee Report

2 major / 0 minor

Summary. The manuscript presents a rule-based information extraction method to derive lexical information including morphological derivations, synonyms, domain labels, and hyponym/hypernym relations from the machine-readable Al-Mawrid Arabic-English dictionary. The approach uses n-gram and KWIC analysis to identify patterns, applies hand-crafted rules for extraction, and employs punctuation heuristics for synonyms in subentries. It claims high precision for all extracted information types, high recall specifically for synonyms, low recall for other types, and notes the dictionary's substantial content of these relations.

Significance. If the extraction accuracy claims are substantiated with proper validation, this work would offer a practical contribution to Arabic natural language processing by demonstrating how existing machine-readable dictionaries can be leveraged to build lexical resources, potentially reducing the effort required for knowledge acquisition in low-resource language settings.

major comments (2)

[Abstract and Results] Abstract and Results: The manuscript states specific precision and recall outcomes but supplies no evaluation details such as dataset size, how the gold standard was created, inter-annotator agreement, or error analysis. Without these the reported numbers cannot be assessed for bias or scope, directly undermining the central empirical claims about extraction performance.
[Method] Method description: The hand-crafted rules, n-gram analysis, KWIC patterns, and punctuation heuristics are presented as reliably identifying the intended morphologic/syntactic/semantic relations. No validation, concrete examples of rule applications, or analysis of failure cases due to dictionary formatting variations are provided, leaving the weakest assumption untested and making all quantitative results dependent on an unverified premise.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments, which identify key areas where additional detail is needed to support the empirical claims. We will revise the manuscript to address both major points by expanding the description of the evaluation process and the method validation.

read point-by-point responses

Referee: [Abstract and Results] Abstract and Results: The manuscript states specific precision and recall outcomes but supplies no evaluation details such as dataset size, how the gold standard was created, inter-annotator agreement, or error analysis. Without these the reported numbers cannot be assessed for bias or scope, directly undermining the central empirical claims about extraction performance.

Authors: We agree that the current manuscript lacks sufficient detail on the evaluation methodology. In the revised version we will add a dedicated subsection under Results that specifies the size of the evaluation dataset, the procedure used to construct the gold standard (including how entries were sampled and annotated), any inter-annotator agreement measures, and a summary error analysis broken down by information type. These additions will allow readers to evaluate the scope and potential biases of the reported precision and recall figures. revision: yes
Referee: [Method] Method description: The hand-crafted rules, n-gram analysis, KWIC patterns, and punctuation heuristics are presented as reliably identifying the intended morphologic/syntactic/semantic relations. No validation, concrete examples of rule applications, or analysis of failure cases due to dictionary formatting variations are provided, leaving the weakest assumption untested and making all quantitative results dependent on an unverified premise.

Authors: We acknowledge the need for greater transparency in the method. The revised manuscript will include concrete examples of the n-gram and KWIC patterns that were identified, step-by-step illustrations of how selected hand-crafted rules were applied to sample dictionary entries, and a discussion of observed failure modes related to formatting inconsistencies in the Al-Mawrid dictionary. We will also report any internal checks performed on rule reliability. revision: yes

Circularity Check

0 steps flagged

No circularity; extraction claims rest on direct application of rules to dictionary text

full rationale

The paper applies n-gram analysis, KWIC, punctuation heuristics and hand-crafted rules to a machine-readable dictionary to extract lexical relations. No equations, fitted parameters, predictions, or self-citation chains appear in the derivation. Results (precision/recall figures and counts of derivations/synonyms/etc.) are produced by running the described procedures on the input text; they do not reduce to the inputs by construction. This is the normal non-circular case for a rule-based IE study.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields no explicit free parameters, axioms, or invented entities; the work assumes standard dictionary formatting and that lexical patterns are detectable via surface n-grams and keywords.

pith-pipeline@v0.9.1-grok · 5721 in / 993 out tokens · 10851 ms · 2026-06-30T01:39:52.793558+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

48 extracted references

[1]

Using a Bilingual Dictionary to Create Semantic Networks,

T. Fontenelle, “Using a Bilingual Dictionary to Create Semantic Networks,” International Journal of Lexicography, vol. 10, no. 4, pp. 275, 1997

1997
[2]

Structuring the Text of the Oxford English Dictionary through Finite State Transduction,

R. Kazman, “Structuring the Text of the Oxford English Dictionary through Finite State Transduction,” Doctoral, Dept. of Computer Science, University of Waterloo, 1986

1986
[3]

Machine Readable Dictionaries: What Have We learned, Where Do We Go

N. Ide, and J. Véronis, "Machine Readable Dictionaries: What Have We learned, Where Do We Go." pp. 137–146
[4]

Automated Induction of a Lexical Sublanguage Grammar Using a Hybrid System of Corpus -and Knowledge -based Techniques,

G. J. Wilms, “Automated Induction of a Lexical Sublanguage Grammar Using a Hybrid System of Corpus -and Knowledge -based Techniques,” Dissertation, Mississippi State University, 1995

1995
[5]

Web Text Corpus for Natural Language Processing

V. Liu, and J. R. Curran, "Web Text Corpus for Natural Language Processing."
[6]

Web as Corpus

A. Kilgarriff, "Web as Corpus." pp. 342 -344
[7]

Using Lexical Patterns for Extracting Hyponyms from the Web,

R. Ortega -Mendoza, L. Villaseñor -Pineda, and M. Montes -y-Gómez, "Using Lexical Patterns for Extracting Hyponyms from the Web," MICAI 2007: Advances in Artificial Intelligence , Lecture Notes in Computer Science A. Gelbukh and Á. Kuri Morales, eds., pp. 904 -911: Springer Berlin Heidelberg, 2007

2007
[8]

Structural Patterns vs. String Patterns for Extracting Semantic Information from Dictionaries

S. Montemagni, and L. Vanderwende, "Structural Patterns vs. String Patterns for Extracting Semantic Information from Dictionaries." pp. 546-552
[9]

Tools and Methods for Computational Lexicology,

R. Byrd, N. Calzolari, M. Chodorow, J. Klavans, M. Neff, and O. Rizk, “Tools and Methods for Computational Lexicology,” Computational Linguistics, vol. 13, no. 3-4, pp. 219-240, 1987

1987
[10]

Barnbrook, Defining Language: a Local Grammar of Definition Sentences, Amsterdam: J

G. Barnbrook, Defining Language: a Local Grammar of Definition Sentences, Amsterdam: J. Benjamins., 2002

2002
[11]

Automatically Deriving Structured Knowledge Bases from On -Line Dictionaries,

W. Dolan, L. Vanderwende, and S. D. Richardson, “Automatically Deriving Structured Knowledge Bases from On -Line Dictionaries,” in PACLING 93, Simon Fraser University, Vancouver, BC., 1993, pp. 5 - 14

1993
[12]

Conceptual Semantics for Nouns,

H. v. d. Vliet, “Conceptual Semantics for Nouns,” Proceedings EURALEX'94, pp. 216-225, 1994

1994
[13]

Parsing vs. Text Processing in the Analysis of Dictionary Definitions

T. Ahlswede, and M. Evens, "Parsing vs. Text Processing in the Analysis of Dictionary Definitions." pp. 217 -224
[14]

Extracting Semantic Hierarchies from a Large On-line Dictionary

S. C. Martin, J. B. Roy, and E. H. George, "Extracting Semantic Hierarchies from a Large On-line Dictionary." pp. 299 -304
[15]

Semantically Significant Patterns in Dictionary Definitions

J. Markowitz, T. Ahlswede, and M. Evens, "Semantically Significant Patterns in Dictionary Definitions." pp. 112 -119
[16]

Extraction of Semantic Information from an Ordinary English Dictionary and its Evaluation

J.-i. Nakramura, and M. Nagao, "Extraction of Semantic Information from an Ordinary English Dictionary and its Evaluation." pp. 459 -464
[17]

Automatic Generation of Thesaurus from Arabic Lexical Resources,

S. M. Eid, “Automatic Generation of Thesaurus from Arabic Lexical Resources,” Ph.D., Electronics and Communication Engineering, Faculty of Engineering, Cairo University, Giza, Egypt, 2010

2010
[18]

Computeriz ing a Machine Readable Dictionary

G. J. Wilms, "Computeriz ing a Machine Readable Dictionary." pp. 306 - 313
[19]

Providing Machine Tractable Dictionary Tools,

Y. Wilks, D. Fass, C. -m. Guo, J. E. McDonald, T. Plate, and B. M. Slator, “Providing Machine Tractable Dictionary Tools,” Machine Translation, vol. 5, no. 2, pp. 99-154, 1990

1990
[20]

Automatic Acq uisition of Lexical Knowledge from Machine-Readable Dictionaries,

G. R. Claramunt, “Automatic Acq uisition of Lexical Knowledge from Machine-Readable Dictionaries,” Ph.D., Departament de Llenguatges i Sistemes Informàtics, Universitat Politècnica de Catalunya, Barcelona, 1998

1998
[21]

Processing Dictionary Definitions with Phrasal Pattern Hierarchies,

H. Alshawi, “Processing Dictionary Definitions with Phrasal Pattern Hierarchies,” Computational Linguistics, vol. 13, no. 3 -4, pp. 195 -202, 1987

1987
[22]

Syntactic and Semantic Analysis of Definitions in a Machine-readable Dictionary,

T. Ahlswede, “Syntactic and Semantic Analysis of Definitions in a Machine-readable Dictionary,” Ph.D., Illinois Institute of Technology, 1988

1988
[23]

Ambiguity in the Acquisiti on of Lexical Information

L. Vanderwende, "Ambiguity in the Acquisiti on of Lexical Information." pp. 174-179
[24]

MindNet: Acquiring and Structuring Semantic Information from Text

S. D. Richardson, W. B. Dolan, and L. Vanderwende, "MindNet: Acquiring and Structuring Semantic Information from Text." pp. 1098 - 1102
[25]

MindNet: An Automatically-created Lexical Resource

L. Vanderwende, G. Kacmarcik, H. Suzuki, and A. Menezes, "MindNet: An Automatically-created Lexical Resource."
[26]

Disambiguating Prepositional Phrase Attachments by Using On -Line Dictionary Definitions,

K. Jensen, and J. Binot, “Disambiguating Prepositional Phrase Attachments by Using On -Line Dictionary Definitions,” Computational Linguistics, vol. 13, no. 3-4, pp. 251-260, 1987

1987
[27]

Rule-based Information Extraction is Dead! Long Live Rule -based Information Extraction Systems!

L. Chiticariu, Y. Li, and F. R. Reiss, "Rule-based Information Extraction is Dead! Long Live Rule -based Information Extraction Systems!." pp. 827-832
[28]

Introducing the Arabic WordNet Project

W. Black, S. Elkateb, and P. Vossen, "Introducing the Arabic WordNet Project." pp. 295-299
[29]

A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields ,

M. Attia, M. Rashwan, A. Ragheb, M. Al -Badrashiny, H. Al -Basoumy, and S. Abdou, "A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields ," Advances in Natural Language Processing, pp. 65-76: Springer, 2008

2008
[30]

Automatic Extraction of Ontological Relations from Arabic Text,

M. G. A. Zamil, and Q. Al -Radaideh, “Automatic Extraction of Ontological Relations from Arabic Text,” Journal of King Saud University-Computer and Information Sciences , 2014

2014
[31]

Towards Structuring an Arabic -English Machine -Readable Dictionary Using Parsing Expression Grammars,

D. M. Fayed, A. A. Fahmy, M. A. Rashwan, and W. K. Fayed, “Towards Structuring an Arabic -English Machine -Readable Dictionary Using Parsing Expression Grammars,” International Journal of Computational Linguistics Research, vol. 5, no. 1, pp. 1-13, 2014

2014
[32]

Arabic -English Domain Terminology Extraction from Aligned Corpora

W. Lahbib, I. Bounhas, and B. Elayeb, "Arabic -English Domain Terminology Extraction from Aligned Corpora." pp. 745 -759
[33]

A Hybrid Approach for Arabic Semantic Relation Extraction

W. Lahbib, I. Bounhas, B. Elayeb, F. Evrard, and Y. Slimani, "A Hybrid Approach for Arabic Semantic Relation Extraction."
[34]

Automatic Extraction of Arabic Multiword Expressions

M. Attia, L. Tounsi, P. Pecina, J. van Genabith, and A. Toral, "Automatic Extraction of Arabic Multiword Expressions." pp. 19 -27
[35]

A Multi -Word Term Extraction Program for Arabic Language

S. Boulaknadel, B. Daille , and D. Aboutajdine, "A Multi -Word Term Extraction Program for Arabic Language."
[36]

Automatic Extraction of Arabic Multi - Word Terms

K. Al Khatib, and A. Badarneh, "Automatic Extraction of Arabic Multi - Word Terms." pp. 411-418
[37]

An Automatic Noun Compound Extraction from Arabic Corpus

A. M. Saif, and M. Aziz, "An Automatic Noun Compound Extraction from Arabic Corpus." pp. 224-230
[38]

Attia, L

M. Attia, L. Tounsi, and J. v. Genabith, Automatic Lexical Resource Acquisition for Constructing an LMF Compatible Lexicon of Modern Standard Arabic, DCU, Dublin, Ireland, 2010

2010
[39]

An Automatically Built Named Entity Lexicon for Arabic

M. Attia, A. Toral, L. Tounsi, M. Monachini, and J. van Gen abith, "An Automatically Built Named Entity Lexicon for Arabic ."
[40]

Knowledge Extraction from Machine -Readable Dictionaries: An Evaluation

N. Ide, and J. Véronis, "Knowledge Extraction from Machine -Readable Dictionaries: An Evaluation." pp. 19 -34
[41]

A Taxonomy for English Nouns and Verbs

R. A. Amsler, "A Taxonomy for English Nouns and Verbs." pp. 133 - 138
[42]

Info rmation Extraction: Techniques, Advances and Challenge,

H. Ji, "Info rmation Extraction: Techniques, Advances and Challenge," 2012

2012
[43]

Introduction to Information Extraction Technology: Tutorial,

D. Appelt, and D. Israel, “Introduction to Information Extraction Technology: Tutorial,” in IJCAI 1999, 1999

1999
[44]

Information Extraction,

S. Sarawagi, “Information Extraction,” Foundations and trends in databases, vol. 1, no. 3, pp. 261-377, 2008

2008
[45]

Adaptive Information Extraction and Sublanguage Analysis

R. Grishman, "Adaptive Information Extraction and Sublanguage Analysis."
[46]

لغة التعريف في المعجم العربي العام الحديث: إشكالية الصياغة والمحتوى، رؤية لغوية حاسوبية

A. Hawarry, W. Kamel, and M. Rashwan, “لغة التعريف في المعجم العربي العام الحديث: إشكالية الصياغة والمحتوى، رؤية لغوية حاسوبية ”, 2008, Ph.D., Department of Arabic Language and Literatures, Faculty of Arts, Cairo University, Giza, Egypt

2008
[47]

Information Extraction: Techniques and Challenges,

R. Grishman, "Information Extraction: Techniques and Challenges," Information Extraction A Multidisciplinary Approach to an Emerging Information Technology , pp. 10-27: Springer, 1997

1997
[48]

A Formal Framework for Evaluation of Information Extraction,

A. De Sitter, T. Calders, and W. Daelemans, “A Formal Framework for Evaluation of Information Extraction,” Online http://www. cnts. ua. ac. be/Publications/2004/DCD04 , 2004

2004

[1] [1]

Using a Bilingual Dictionary to Create Semantic Networks,

T. Fontenelle, “Using a Bilingual Dictionary to Create Semantic Networks,” International Journal of Lexicography, vol. 10, no. 4, pp. 275, 1997

1997

[2] [2]

Structuring the Text of the Oxford English Dictionary through Finite State Transduction,

R. Kazman, “Structuring the Text of the Oxford English Dictionary through Finite State Transduction,” Doctoral, Dept. of Computer Science, University of Waterloo, 1986

1986

[3] [3]

Machine Readable Dictionaries: What Have We learned, Where Do We Go

N. Ide, and J. Véronis, "Machine Readable Dictionaries: What Have We learned, Where Do We Go." pp. 137–146

[4] [4]

Automated Induction of a Lexical Sublanguage Grammar Using a Hybrid System of Corpus -and Knowledge -based Techniques,

G. J. Wilms, “Automated Induction of a Lexical Sublanguage Grammar Using a Hybrid System of Corpus -and Knowledge -based Techniques,” Dissertation, Mississippi State University, 1995

1995

[5] [5]

Web Text Corpus for Natural Language Processing

V. Liu, and J. R. Curran, "Web Text Corpus for Natural Language Processing."

[6] [6]

Web as Corpus

A. Kilgarriff, "Web as Corpus." pp. 342 -344

[7] [7]

Using Lexical Patterns for Extracting Hyponyms from the Web,

R. Ortega -Mendoza, L. Villaseñor -Pineda, and M. Montes -y-Gómez, "Using Lexical Patterns for Extracting Hyponyms from the Web," MICAI 2007: Advances in Artificial Intelligence , Lecture Notes in Computer Science A. Gelbukh and Á. Kuri Morales, eds., pp. 904 -911: Springer Berlin Heidelberg, 2007

2007

[8] [8]

Structural Patterns vs. String Patterns for Extracting Semantic Information from Dictionaries

S. Montemagni, and L. Vanderwende, "Structural Patterns vs. String Patterns for Extracting Semantic Information from Dictionaries." pp. 546-552

[9] [9]

Tools and Methods for Computational Lexicology,

R. Byrd, N. Calzolari, M. Chodorow, J. Klavans, M. Neff, and O. Rizk, “Tools and Methods for Computational Lexicology,” Computational Linguistics, vol. 13, no. 3-4, pp. 219-240, 1987

1987

[10] [10]

Barnbrook, Defining Language: a Local Grammar of Definition Sentences, Amsterdam: J

G. Barnbrook, Defining Language: a Local Grammar of Definition Sentences, Amsterdam: J. Benjamins., 2002

2002

[11] [11]

Automatically Deriving Structured Knowledge Bases from On -Line Dictionaries,

W. Dolan, L. Vanderwende, and S. D. Richardson, “Automatically Deriving Structured Knowledge Bases from On -Line Dictionaries,” in PACLING 93, Simon Fraser University, Vancouver, BC., 1993, pp. 5 - 14

1993

[12] [12]

Conceptual Semantics for Nouns,

H. v. d. Vliet, “Conceptual Semantics for Nouns,” Proceedings EURALEX'94, pp. 216-225, 1994

1994

[13] [13]

Parsing vs. Text Processing in the Analysis of Dictionary Definitions

T. Ahlswede, and M. Evens, "Parsing vs. Text Processing in the Analysis of Dictionary Definitions." pp. 217 -224

[14] [14]

Extracting Semantic Hierarchies from a Large On-line Dictionary

S. C. Martin, J. B. Roy, and E. H. George, "Extracting Semantic Hierarchies from a Large On-line Dictionary." pp. 299 -304

[15] [15]

Semantically Significant Patterns in Dictionary Definitions

J. Markowitz, T. Ahlswede, and M. Evens, "Semantically Significant Patterns in Dictionary Definitions." pp. 112 -119

[16] [16]

Extraction of Semantic Information from an Ordinary English Dictionary and its Evaluation

J.-i. Nakramura, and M. Nagao, "Extraction of Semantic Information from an Ordinary English Dictionary and its Evaluation." pp. 459 -464

[17] [17]

Automatic Generation of Thesaurus from Arabic Lexical Resources,

S. M. Eid, “Automatic Generation of Thesaurus from Arabic Lexical Resources,” Ph.D., Electronics and Communication Engineering, Faculty of Engineering, Cairo University, Giza, Egypt, 2010

2010

[18] [18]

Computeriz ing a Machine Readable Dictionary

G. J. Wilms, "Computeriz ing a Machine Readable Dictionary." pp. 306 - 313

[19] [19]

Providing Machine Tractable Dictionary Tools,

Y. Wilks, D. Fass, C. -m. Guo, J. E. McDonald, T. Plate, and B. M. Slator, “Providing Machine Tractable Dictionary Tools,” Machine Translation, vol. 5, no. 2, pp. 99-154, 1990

1990

[20] [20]

Automatic Acq uisition of Lexical Knowledge from Machine-Readable Dictionaries,

G. R. Claramunt, “Automatic Acq uisition of Lexical Knowledge from Machine-Readable Dictionaries,” Ph.D., Departament de Llenguatges i Sistemes Informàtics, Universitat Politècnica de Catalunya, Barcelona, 1998

1998

[21] [21]

Processing Dictionary Definitions with Phrasal Pattern Hierarchies,

H. Alshawi, “Processing Dictionary Definitions with Phrasal Pattern Hierarchies,” Computational Linguistics, vol. 13, no. 3 -4, pp. 195 -202, 1987

1987

[22] [22]

Syntactic and Semantic Analysis of Definitions in a Machine-readable Dictionary,

T. Ahlswede, “Syntactic and Semantic Analysis of Definitions in a Machine-readable Dictionary,” Ph.D., Illinois Institute of Technology, 1988

1988

[23] [23]

Ambiguity in the Acquisiti on of Lexical Information

L. Vanderwende, "Ambiguity in the Acquisiti on of Lexical Information." pp. 174-179

[24] [24]

MindNet: Acquiring and Structuring Semantic Information from Text

S. D. Richardson, W. B. Dolan, and L. Vanderwende, "MindNet: Acquiring and Structuring Semantic Information from Text." pp. 1098 - 1102

[25] [25]

MindNet: An Automatically-created Lexical Resource

L. Vanderwende, G. Kacmarcik, H. Suzuki, and A. Menezes, "MindNet: An Automatically-created Lexical Resource."

[26] [26]

Disambiguating Prepositional Phrase Attachments by Using On -Line Dictionary Definitions,

K. Jensen, and J. Binot, “Disambiguating Prepositional Phrase Attachments by Using On -Line Dictionary Definitions,” Computational Linguistics, vol. 13, no. 3-4, pp. 251-260, 1987

1987

[27] [27]

Rule-based Information Extraction is Dead! Long Live Rule -based Information Extraction Systems!

L. Chiticariu, Y. Li, and F. R. Reiss, "Rule-based Information Extraction is Dead! Long Live Rule -based Information Extraction Systems!." pp. 827-832

[28] [28]

Introducing the Arabic WordNet Project

W. Black, S. Elkateb, and P. Vossen, "Introducing the Arabic WordNet Project." pp. 295-299

[29] [29]

A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields ,

M. Attia, M. Rashwan, A. Ragheb, M. Al -Badrashiny, H. Al -Basoumy, and S. Abdou, "A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields ," Advances in Natural Language Processing, pp. 65-76: Springer, 2008

2008

[30] [30]

Automatic Extraction of Ontological Relations from Arabic Text,

M. G. A. Zamil, and Q. Al -Radaideh, “Automatic Extraction of Ontological Relations from Arabic Text,” Journal of King Saud University-Computer and Information Sciences , 2014

2014

[31] [31]

Towards Structuring an Arabic -English Machine -Readable Dictionary Using Parsing Expression Grammars,

D. M. Fayed, A. A. Fahmy, M. A. Rashwan, and W. K. Fayed, “Towards Structuring an Arabic -English Machine -Readable Dictionary Using Parsing Expression Grammars,” International Journal of Computational Linguistics Research, vol. 5, no. 1, pp. 1-13, 2014

2014

[32] [32]

Arabic -English Domain Terminology Extraction from Aligned Corpora

W. Lahbib, I. Bounhas, and B. Elayeb, "Arabic -English Domain Terminology Extraction from Aligned Corpora." pp. 745 -759

[33] [33]

A Hybrid Approach for Arabic Semantic Relation Extraction

W. Lahbib, I. Bounhas, B. Elayeb, F. Evrard, and Y. Slimani, "A Hybrid Approach for Arabic Semantic Relation Extraction."

[34] [34]

Automatic Extraction of Arabic Multiword Expressions

M. Attia, L. Tounsi, P. Pecina, J. van Genabith, and A. Toral, "Automatic Extraction of Arabic Multiword Expressions." pp. 19 -27

[35] [35]

A Multi -Word Term Extraction Program for Arabic Language

S. Boulaknadel, B. Daille , and D. Aboutajdine, "A Multi -Word Term Extraction Program for Arabic Language."

[36] [36]

Automatic Extraction of Arabic Multi - Word Terms

K. Al Khatib, and A. Badarneh, "Automatic Extraction of Arabic Multi - Word Terms." pp. 411-418

[37] [37]

An Automatic Noun Compound Extraction from Arabic Corpus

A. M. Saif, and M. Aziz, "An Automatic Noun Compound Extraction from Arabic Corpus." pp. 224-230

[38] [38]

Attia, L

M. Attia, L. Tounsi, and J. v. Genabith, Automatic Lexical Resource Acquisition for Constructing an LMF Compatible Lexicon of Modern Standard Arabic, DCU, Dublin, Ireland, 2010

2010

[39] [39]

An Automatically Built Named Entity Lexicon for Arabic

M. Attia, A. Toral, L. Tounsi, M. Monachini, and J. van Gen abith, "An Automatically Built Named Entity Lexicon for Arabic ."

[40] [40]

Knowledge Extraction from Machine -Readable Dictionaries: An Evaluation

N. Ide, and J. Véronis, "Knowledge Extraction from Machine -Readable Dictionaries: An Evaluation." pp. 19 -34

[41] [41]

A Taxonomy for English Nouns and Verbs

R. A. Amsler, "A Taxonomy for English Nouns and Verbs." pp. 133 - 138

[42] [42]

Info rmation Extraction: Techniques, Advances and Challenge,

H. Ji, "Info rmation Extraction: Techniques, Advances and Challenge," 2012

2012

[43] [43]

Introduction to Information Extraction Technology: Tutorial,

D. Appelt, and D. Israel, “Introduction to Information Extraction Technology: Tutorial,” in IJCAI 1999, 1999

1999

[44] [44]

Information Extraction,

S. Sarawagi, “Information Extraction,” Foundations and trends in databases, vol. 1, no. 3, pp. 261-377, 2008

2008

[45] [45]

Adaptive Information Extraction and Sublanguage Analysis

R. Grishman, "Adaptive Information Extraction and Sublanguage Analysis."

[46] [46]

لغة التعريف في المعجم العربي العام الحديث: إشكالية الصياغة والمحتوى، رؤية لغوية حاسوبية

A. Hawarry, W. Kamel, and M. Rashwan, “لغة التعريف في المعجم العربي العام الحديث: إشكالية الصياغة والمحتوى، رؤية لغوية حاسوبية ”, 2008, Ph.D., Department of Arabic Language and Literatures, Faculty of Arts, Cairo University, Giza, Egypt

2008

[47] [47]

Information Extraction: Techniques and Challenges,

R. Grishman, "Information Extraction: Techniques and Challenges," Information Extraction A Multidisciplinary Approach to an Emerging Information Technology , pp. 10-27: Springer, 1997

1997

[48] [48]

A Formal Framework for Evaluation of Information Extraction,

A. De Sitter, T. Calders, and W. Daelemans, “A Formal Framework for Evaluation of Information Extraction,” Online http://www. cnts. ua. ac. be/Publications/2004/DCD04 , 2004

2004