Department of Information Technology, Sreenidhi Institute of Science and Technology (Autonomous), Hyderabad, India.
World Journal of Advanced Research and Reviews, 2025, 26(02), 4445-4453
Article DOI: 10.30574/wjarr.2025.26.2.2102
Received on 16 April 2025; revised on 27 May 2025; accepted on 30 May 2025
Salary prediction is not just a number, it’s a decision-maker for job seekers, employers, and HR teams, shaping expectations and negotiations. Traditional models rely on structured data like job titles, experience levels, and locations, but overlook job descriptions, where real insights hide—skills, responsibilities, and industry-specific language. This study bridges that gap, combining structured and unstructured data for a more intuitive model. TF-IDF extracts key terms, assigning weights to highlight critical information, while structured data undergoes preprocessing through one-hot encoding and feature scaling. An ensemble learning approach strengthens predictions—Random Forest captures patterns, XGBoost refines them, and Linear Regression serves as a baseline. A meta-model, like Logistic Regression, optimally weighs predictions, enhancing accuracy. Evaluated through Accuracy, Macro Average F1-score, and Weighted Average F1-score, the model outperforms standalone approaches, achieving superior classification performance. The results demonstrate that integrating TF-IDF with ensemble learning provides a more accurate, scalable, and interpretable salary prediction system, ready for real-world applications.
Salary Prediction; TF-IDF (Term Frequency-Inverse Document Frequency); Ensemble Learning; Machine Learning
Preview Article PDF
Anil Pisolla, Sameer Baig Moghul, Triveni Gaddam and N Ch Sriman Narayana Iyengar. Salary Prediction Using TF-IDF and Ensemble Machine Learning: A Lightweight and Interpretable Approach. World Journal of Advanced Research and Reviews, 2025, 26(2), 4445-4453. Article DOI: https://doi.org/10.30574/wjarr.2025.26.2.2102