Home
World Journal of Advanced Research and Reviews
International Journal with High Impact Factor for fast publication of Research and Review articles

Main navigation

  • Home
    • Journal Information
    • Editorial Board Members
    • Reviewer Panel
    • Abstracting and Indexing
    • Journal Policies
    • Our CrossMark Policy
    • Publication Ethics
    • Issue in Progress
    • Current Issue
    • Past Issues
    • Instructions for Authors
    • Article processing fee
    • Track Manuscript Status
    • Get Publication Certificate
    • Join Editorial Board
    • Join Reviewer Panel
  • Contact us
  • Downloads

eISSN: 2581-9615 || CODEN: WJARAI || Impact Factor 8.2 ||  CrossRef DOI

Research and review articles are invited for publication in March 2026 (Volume 29, Issue 3) Submit manuscript

Bangla hate speech detection by embedding and hybrid machine learning algorithms

Breadcrumb

  • Home
  • Bangla hate speech detection by embedding and hybrid machine learning algorithms

Md Habibur Rahman Papel 1, *, Udoy Chandra Dey 2, Toufiq Hasan Turza 2, Avijit Datta 3, Tanu Sarkar 2 and Pranta Dey 4

1 Student, Department of Electrical and Electronics Engineering, European University of Bangladesh, Bangladesh.
2 Student, Department of Computer Science and Engineering, Daffodil International University, Bangladesh.
3 Student, Department of Computer Science and Engineering, East West University, Bangladesh.
4 Student, Department of Computer Science and Technology, Dhaka Polytechnic Institute, Bangladesh.
 
Research Article
World Journal of Advanced Research and Reviews, 2024, 24(02), 1486-1496
Article DOI: 10.30574/wjarr.2024.24.2.3399
DOI url: https://doi.org/10.30574/wjarr.2024.24.2.3399
 
Received on 07 October 2024; revised on 14 November 2024; accepted on 16 November 2024
 
The worrisome increase in hate speech in Bangladesh, driven by the rapidly expanding social media user base, has adversely affected cybersecurity and online safety. Most hate speech detection technologies overlook less commonly spoken languages, such as Bangla, in favour of more widely used ones. This project aims to address the gap by identifying hate speech in Bangla through the utilisation of sophisticated word embedding techniques (Word2Vec, GloVe, and FastText) and machine learning algorithms (Multinomial Naive Bayes, Random Forest, K-Nearest Neighbours, and Extreme Gradient Boosting). We used a dataset of 30,000 annotated messages, encompassing both positive and hate speech, to train and test the models. FastText combined with hybrid RF and SVM yielded the highest performance among the models, with an accuracy of 96.03%. The system's generalisability and usefulness were evident when it identified hate speech not included in the training data. Enhancing the identification of harmful content in Bangla is essential for the burgeoning digital ecosystem in Bangladesh, hence contributing to cybersecurity. This endeavour establishes a foundation for future study by efficiently employing advanced word embedding and machine learning techniques to identify hate speech in Bangla. It could significantly influence the advancement of more secure digital environments and the overall condition of internet security.
 
Glove; Fasttext; Word2Vec; Hybrid; Hate speech; Normal speech; Word embedding; Cybersecurity; Tokenization; Stop word; NLP.
 
https://wjarr.com/sites/default/files/fulltext_pdf/WJARR-2024-3399.pdf

Preview Article PDF

Md Habibur Rahman Papel, Udoy Chandra Dey, Toufiq Hasan Turza, Avijit Datta, Tanu Sarkar and Pranta Dey. Bangla hate speech detection by embedding and hybrid machine learning algorithms. World Journal of Advanced Research and Reviews, 2024, 24(2), 1486-1496. Article DOI: https://doi.org/10.30574/wjarr.2024.24.2.3399

Copyright © Author(s). All rights reserved. This article is published under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the original author(s) and source, a link to the license is provided, and any changes made are indicated.


All statements, opinions, and data contained in this publication are solely those of the individual author(s) and contributor(s). The journal, editors, reviewers, and publisher disclaim any responsibility or liability for the content, including accuracy, completeness, or any consequences arising from its use.

Get Certificates

Get Publication Certificate

Download LoA

Check Corssref DOI details

Issue details

Issue Cover Page

Editorial Board

Table of content

Copyright © 2026 World Journal of Advanced Research and Reviews - All rights reserved

Developed & Designed by VS Infosolution