AWN-similarity: Towards developing free open-source frameworks for measuring Arabic semantic similarity under Windows / Linux operating systems

Faaza A. Almarsoomi, Israa A. Alwan

Abstract


Arabic is a highly systematic language where its words exhibit elegant and rigorous logic. The field of Arabic word semantic similarity becomes more challenging due to its higher complexity and subtlety. This research is concerned with investigating the development of free open-source frameworks containing packages to calculate the semantic similarity between two Arabic words or concepts. These packages are known as AWN-ConceptSimilarity and AWN-WordSimilarity. The developed packages implement seven semantic similarity algorithms. One of these algorithms was proposed for Arabic and the rest were proposed for English where successfully adapted to Arabic using an Arabic lexical database, Arabic wordnet.
The functionality of the developed packages is validated using two-word similarity benchmarks datasets previously produced for Arabic. The results of the validation process indicate that the developed frameworks represent an important contribution to the Arabic semantic similarity field. Moreover, the developed packages are reliable to use and embed them with Arabic researchers' projects for improving or comparing their methodologies.

Full Text:

PDF


DOI: http://dx.doi.org/10.21533/pen.v9i1.1791

Refbacks

  • There are currently no refbacks.


Copyright (c) 2021 Faaza A. Almarsoomi, Israa A. Alwan

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

ISSN: 2303-4521

Digital Object Identifier DOI: 10.21533/pen

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License