Karşılıklı Bilgi Kullanılarak Etkin Deǧişkenleri Belirlemek ve Destek Vektör Makineleri ile Sülfür Dioksit Konsantrasyonunun Tahmin Modelini Geliştirmek

Translated title of the contribution: Identifying effective variables using mutual information and building predictive models of sulfur dioxide concentration with support vector machines
  • C. Okan Sakar
  • , Olcay Kursun
  • , Huseyin Ozdemir
  • , Goksel Demir*
  • , Senay Yalcin
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Sulfur dioxide (SO2) is an issue of increasing public concern due to its recognized adverse effects on human health. Therefore, accurate SO2 prediction models are very important tools in developing public warning strategies. The goal of this study is to identify the relevance of meteorological and air pollutant variables using a classical and widely used measure of dependence, Shannon's Mutual Information (MI), and to build an accurate SO2 prediction model using the relevant variables as inputs. Specifically, features ranked by MI measure are tested on how much joint predictive power they have of the target using a popular machine learning tool, support vector machines (SVM), and in comparison to multilayer perceptron (MLP), which is the most commonly used machine learning tool in previous studies for the prediction and analysis of air pollutants. It was found that the SVM model gave a higher correlation coefficient (r) and less root mean squared error (RMSE) than MLP for both test and validation sets. The predictive model used 6 input variables for both data sets as the relevant features for maximum SO2 concentration prediction at time t+1, which are the average SO2, maximum SO2, outdoor temperature (OT), average nitrogen dioxide (NO2), average ozone (O3), and average wind speed at time t. The results of this study indicate that MI can be used efficiently in determining the importance of input variables in the prediction of SO2 concentration and SVM is a popular machine learning tool well suited for use in air pollution modeling.

Translated title of the contributionIdentifying effective variables using mutual information and building predictive models of sulfur dioxide concentration with support vector machines
Original languageTurkish
Pages (from-to)102-112
Number of pages11
JournalEkoloji
Issue number76
DOIs
Publication statusPublished - 2010
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Fingerprint

Dive into the research topics of 'Identifying effective variables using mutual information and building predictive models of sulfur dioxide concentration with support vector machines'. Together they form a unique fingerprint.

Cite this