AWN-similarity: Towards developing free open-source frameworks for measuring Arabic semantic similarity under Windows / Linux operating systems
DOI:
https://doi.org/10.21533/pen.v9.i1.723
Abstract
Arabic is a highly systematic language whose words exhibit elegant and rigorous logic. Measuring semantic similarity between Arabic words is challenging because of the language's complexity and subtlety. This research investigates the development of free, open-source frameworks containing packages that calculate the semantic similarity between two Arabic words or concepts. These packages are named AWN-ConceptSimilarity and AWN-WordSimilarity. The developed packages implement seven semantic similarity algorithms. One of these algorithms was proposed for Arabic; the rest were proposed for English and were successfully adapted to Arabic using an Arabic lexical database, Arabic WordNet.
The functionality of the developed packages is validated using two word-similarity benchmark datasets previously produced for Arabic. The results of the validation process indicate that the developed frameworks represent an important contribution to the Arabic semantic similarity field. Moreover, the developed packages are reliable, and researchers working on Arabic can embed them in their own projects to improve or compare their methodologies.
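The abstract does not name the seven algorithms, so the following is only a minimal sketch of one well-known WordNet-style measure (Wu-Palmer similarity) of the kind commonly adapted from English to other languages. The toy hierarchy, the concept labels, and the function names are all assumptions made for illustration; none of them come from Arabic WordNet or from the AWN-ConceptSimilarity and AWN-WordSimilarity packages.

```python
# Hypothetical toy hypernym (is-a) hierarchy: child -> parent.
# This is NOT data from Arabic WordNet; it only illustrates the idea.
HYPERNYM = {
    "entity": None,
    "animal": "entity",
    "plant": "entity",
    "cat": "animal",
    "dog": "animal",
    "tree": "plant",
}


def path_to_root(concept):
    """Return the list of concepts from `concept` up to the root, inclusive."""
    path = []
    while concept is not None:
        path.append(concept)
        concept = HYPERNYM[concept]
    return path


def depth(concept):
    """Depth counted in nodes, so the root has depth 1."""
    return len(path_to_root(concept))


def lowest_common_subsumer(a, b):
    """Return the deepest concept that is an ancestor of (or equal to) both a and b."""
    ancestors_a = set(path_to_root(a))
    # Walking upward from b, the first shared node is the deepest shared one.
    for node in path_to_root(b):
        if node in ancestors_a:
            return node
    raise ValueError("concepts do not share a root")


def wu_palmer(a, b):
    """Wu-Palmer similarity: 2 * depth(lcs) / (depth(a) + depth(b))."""
    lcs = lowest_common_subsumer(a, b)
    return 2 * depth(lcs) / (depth(a) + depth(b))


if __name__ == "__main__":
    for pair in [("cat", "dog"), ("cat", "tree"), ("cat", "cat")]:
        print(pair, round(wu_palmer(*pair), 3))
```

In this sketch, concepts that share a deep common ancestor (such as "cat" and "dog" under "animal") score higher than concepts that meet only at the root. A real implementation would replace the toy dictionary with lookups against a lexical database such as Arabic WordNet.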
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.