Introducing the Arabic WordNet project 论文
摘要
Arabic is the official language of hundreds of millions of people in twenty Middle East and northern African countries, and is the religious language of all Muslims of various ethnicities around the world.Surprisingly little has been done in the field of computerised language and lexical resources.It is therefore motivating to develop an Arabic (WordNet) lexical resource that discovers the richness of Arabic as described in Elkateb (2005).This paper describes our approach towards building a lexical resource in Standard Arabic.Arabic WordNet (AWN) will be based on the design and contents of the universally accepted Princeton WordNet (PWN) and will be mappable straightforwardly onto PWN 2.0 and EuroWordNet (EWN), enabling translation on the lexical level to English and dozens of other languages.Several tools specific to this task will be developed.AWN will be a linguistic resource with a deep formal semantic foundation.Besides the standard wordnet representation of senses, word meanings are defined with a machine understandable semantics in first order logic.The basis for this semantics is the Suggested Upper Merged Ontology (SUMO) and its associated domain ontologies.We will greatly extend the ontology and its set of mappings to provide formal terms and definitions equivalent to each synset.