Vol. 3 No. 2 (2025): Issue - 02
Articles

A Hybrid ANN and XGBoost Approach to Urban Air Quality Classification

Posina Anusha
Department of Computer Science and Engineering, Annamacharya Institute of Technology and Sciences, Tirupati, Andra Pradesh, India

Published 2025-08-18

Keywords

  • Air Quality Classification,
  • Urban Zone Identification,
  • Hybrid Model,
  • Environmental Data Analysis,
  • Nonlinear Feature Extraction,
  • Urban Planning,
  • Data-Driven Decision Making,
  • Artificial Neural Networks
  • ...More
    Less

How to Cite

Posina Anusha. (2025). A Hybrid ANN and XGBoost Approach to Urban Air Quality Classification. Milestone Transactions on Medical Technometrics, 3(2), 260–273. https://doi.org/10.5281/zenodo.16892754

Abstract

Public health is severely threatened by urban air pollution, particularly in densely populated and rapidly industrialising cities.  For risk management, environmental monitoring, and targeted policymaking, urban zones must be accurately classified according to pollution levels.  This paper proposes a classification framework that integrates the ensemble-based decision-making power of Gradient Boosting With the nonlinear feature extraction capabilities of neural networks, a publicly available dataset containing over 52,000 daily air quality records from six major cities was used. The model was designed to distinguish between industrial and residential urban areas based on six major pollutants: PM2.5, PM10, CO, NO₂, SO₂, and O₃. The proposed two-stage architecture first transforms input features through an ANN to capture complex pollutant interactions, then feeds the learned representations into an XGBoost classifier for final prediction. The performance of this hybrid model was compared to that of several well-known classifiers, including standalone ANN, standalone XGBoost, Support Vector Machine, and Logistic Regression.  With an accuracy of 99.98%, the suggested ANN–XGBoost model outperformed all baseline techniques.  At 99.92%, 99.96%, and 99.98%, respectively, precision, recall, and F1-score were likewise exceptionally high, demonstrating exceptional classification performance and generalization ability.

References

  1. Meka, A. H., Sugiarto, A., & Hidayat, W. (2025). The impact of traffic in the Medan industrial estate on the social and economic community of Amplas village, Percut Sei Tuan, Deli Serdang Regency, North Sumatra. Journal of Information Technology, Computer Science and Electrical Engineering, 2(1), 1–7.
  2. Vallero, D. A. (2025). Fundamentals of air pollution. Academic Press.
  3. Aljafen, B. N., Shaikh, N., AlKhalifah, J. M., & Meo, S. A. (2025). Effect of environmental pollutants particulate matter (PM2.5, PM10), nitrogen dioxide (NO2), sulfur dioxide (SO2), carbon monoxide (NO), and ground-level ozone (O3) on epilepsy. BMC Neurology, 25(1), 133.
  4. Kronzer, V. L., Yang, Y., Roul, P., Crooks, J. L., Crowson, C. S., Davis, J. M., III, Sparks, J. A., Pierce, J. R., O’Dell, K., Sauer, B. C., et al. (2025). Associations of fire smoke and other pollutants with incident rheumatoid arthritis and rheumatoid arthritis–associated interstitial lung disease. Arthritis & Rheumatology. Advance online publication.
  5. Maspul, K. A., & Ardhin, M. (2025). ASEAN unbound: Igniting the digital renaissance. Journal of Regional Economics and Development, 2(3), 22–22.
  6. Olugbami, O. O., Ogundeko, O., Lawan, M., & Foster, E. (2025). Harnessing data for impact: Transforming public health interventions through evidence-based decision-making. World Journal of Advanced Research and Reviews, 25(1), 2085–2103.
  7. He, Y., He, H., Li, H., & Yang, J. (2025). Dirty environment, dark mood: Exploring the link between perceived environmental pollution and depression risk. Journal of Community Psychology, 53(1), e23181.
  8. Singh, S., & Suthar, G. (2025). Machine learning and deep learning approaches for PM2.5 prediction: A study on urban air quality in Jaipur, India. Earth Science Informatics, 18(1), 97.
  9. Omer, M., Ali, S. J., Raza, S. M., Le, D.-T., & Choo, H. (2025). Integrating temporal analysis with hybrid machine learning and deep learning models for enhanced air quality prediction. In 2025 19th International Conference on Ubiquitous Information Management and Communication (IMCOM) (pp. 1–7). IEEE.
  10. Idroes, G. M., Noviandy, T. R., Maulana, A., Zahriah, Z., Suhendrayatna, Suhartono, E., Khairan, K., Kusumo, F., Helwani, Z., & Abd Rahman, S. (2023). Urban air quality classification using machine learning approach to enhance environmental monitoring. Leuser Journal of Environmental Studies, 1(2), 62–68.
  11. Yadav, N., Sorek-Hamer, M., Von Pohle, M., Asanjan, A. A., Sahasrabhojanee, A., Suel, E., Arku, R. E., Lingenfelter, V., Brauer, M., Ezzati, M., et al. (2024). Using deep transfer learning and satellite imagery to estimate urban air quality in data-poor regions. Environmental Pollution, 342, 122914.
  12. Imam, M., Adam, S., Dev, S., & Nesa, N. (2024). Air quality monitoring using statistical learning models for sustainable environment. Intelligent Systems with Applications, 22, 200333.
  13. Saini, R. (2025). A review on artificial neural networks for structural analysis. Journal of Vibration Engineering & Technologies, 13(2), 142.
  14. Mohammed, B., & Hamza, C. (2025). A robust estimation of blasting-induced flyrock using machine learning decision tree algorithms: Random forest, gradient boosting machine, and XGBoost. Mining, Metallurgy & Exploration, 1–16. Advance online publication.
  15. Kaur, S., Gupta, S., Gupta, D., Juneja, S., Nauman, A., Khan, M., Husain, I., Islam, A., & Mallik, S. (2025). High-accuracy lung disease classification via logistic regression and advanced feature extraction techniques. Egyptian Informatics Journal, 29, 100596.
  16. Jabardi, M. (2025). Support vector machines: Theory, algorithms, and applications. Infocommunications Journal, 17(1).
  17. Ahmed, S. T., & Fathima, A. S. (2024). Medical ChatBot assistance for primary clinical guidance using machine learning techniques. Procedia Computer Science, 233, 279-287.
  18. Kumar, A., Satheesha, T. Y., Salvador, B. B. L., Mithileysh, S., & Ahmed, S. T. (2023). Augmented Intelligence enabled Deep Neural Networking (AuDNN) framework for skin cancer classification and prediction using multi-dimensional datasets on industrial IoT standards. Microprocessors and Microsystems, 97, 104755.