Being one of the six official languages of the united nations ever since 1973, the Arabic language lies among the world's six major ones. It is the language of the Holy Quran and spoken by more than 273 million people around the world. Arabic dialects consist of several branches: the classical (Language of Quran), the modern standard (used in newspapers, books...) and the local dialects (vary considerably among different countries). For instance, the increasing need for Arabic corpus, the Arabic corpus remains deficient to ...
Read More
Being one of the six official languages of the united nations ever since 1973, the Arabic language lies among the world's six major ones. It is the language of the Holy Quran and spoken by more than 273 million people around the world. Arabic dialects consist of several branches: the classical (Language of Quran), the modern standard (used in newspapers, books...) and the local dialects (vary considerably among different countries). For instance, the increasing need for Arabic corpus, the Arabic corpus remains deficient to support various Arabic linguistic researches, and the majority of Arabic corpora are limited in sources, types, genres or even not freely available, and the high costs of building or licensing a corpora could be an obstacle for many young researchers or even some institution in several parts of the world. Due to this lack, one of the aims of this paper is to build a new free Arabic Multipurpose corpus which we call "Silver Multipurpose Arabic Corpus" with a large size, collected over many years from multiple sources, covering all types and genres.
Read Less