Arabic natural language processing: Data science application in big data environment

Sherif M. Saif

doi:10.30574/wjarr.2024.24.2.3602

eISSN: 2581-9615 || CODEN: WJARAI || Impact Factor 8.2 || CrossRef DOI

Research and review articles are invited for publication in July 2026 (Volume 31, Issue 1) Submit manuscript

Arabic natural language processing: Data science application in big data environment

Sherif M. Saif ^*

Department of Computers and Systems, Electronics Research Institute, Cairo, Egypt.

Research Article

World Journal of Advanced Research and Reviews, 2024, 24(02), 2283-2293

Article DOI: 10.30574/wjarr.2024.24.2.3602

DOI url: https://doi.org/10.30574/wjarr.2024.24.2.3602

Publication history

Received on 16 October 2024; revised on 22 November 2024; accepted on 25 November 2024

Abstract

In the era of Big Data and Data Science, Text analysis within, Natural Language Processing (NLP), suffers from the curse of high dimensionality. The use of NLP in applications such as speech processing, semantic webs, and word processing has become a main element in today’s Artificial Intelligence and Big Data Applications. A natural language parsing system must incorporate three components of natural language, namely, lexicon, morphology, and syntax. As Arabic is highly derivational, each component requires extensive exploitation of the associated linguistic characteristics. Parsing Arabic sentences still has open challenges due to several reasons including the relatively free word order of Arabic, the length of sentences, and the omission of diacritics (vowels) in written Arabic and the frequency of pro-drop phenomena. This research exploits Visual Prolog to provide a scalable platform for Arabic parser and explains the details of the used lexicon and parser and shows the scalability of the system to address more functions.

Keywords

Arabic NLP; Data Science; Big Data; Prolog; Parser

Download Article PDF

https://wjarr.com/sites/default/files/fulltext_pdf/WJARR-2024-3602.pdf

Preview Article PDF

How to cite this article

Sherif M. Saif. Arabic natural language processing: Data science application in big data environment. World Journal of Advanced Research and Reviews, 2024, 24(2), 2283-2293. Article DOI: https://doi.org/10.30574/wjarr.2024.24.2.3602

Copyright © Author(s). All rights reserved. This article is published under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the original author(s) and source, a link to the license is provided, and any changes made are indicated.

All statements, opinions, and data contained in this publication are solely those of the individual author(s) and contributor(s). The journal, editors, reviewers, and publisher disclaim any responsibility or liability for the content, including accuracy, completeness, or any consequences arising from its use.

Developed & Designed by VS Infosolution

Arabic natural language processing: Data science application in big data environment

Sherif M. Saif ^*

Preview Article PDF

Get Certificates

Issue details

Arabic natural language processing: Data science application in big data environment

Sherif M. Saif *

Preview Article PDF

Get Certificates

Issue details

Sherif M. Saif ^*