Abstract
The imbalanced construction dataset reduces the accuracy of the machine learning model. This issue that addressed by recent construction management research through different sampling approaches. Despite their advantages, the utilized sampling approaches are reducing the reliability of the prediction model, while posing the risk of artificial bias. The objective of this study is to address the challenge of imbalanced datasets in construction progress prediction models using a novel variational autoencoder (VAE) that generates synthetic data for underrepresented classes. The VAE's encoder-decoder architecture, along with its latent space components, is optimized for this task. A comparative analysis using decision tree-based ML models, including grid search optimization, substantiated the effectiveness of the VAE approach. The results indicate that the hybrid dataset benefited the ML models from the addition of the synthesized dataset, showing 2% improvements in performance metrics across most models. The synthetic data generated by VAEs contributes to the construction of more balanced datasets, which, in turn, can lead to more reliable and accurate predictive models. The enhanced accuracy of the VAE-ML model addresses the class imbalance problem and improves the reliability of construction productivity predictions and related resource allocation plans.
Original language | English |
---|---|
Title of host publication | Proceedings of the 10th International Conference on Civil Engineering |
Editors | Guangliang Feng |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 538-545 |
Number of pages | 8 |
ISBN (Print) | 9789819743544 |
DOIs | |
Publication status | Published - 2024 |
Event | 10th International Conference on Civil Engineering, ICCE2023 - Nanchang, China Duration: 24 Dec 2023 → 25 Dec 2023 |
Publication series
Name | Lecture Notes in Civil Engineering |
---|---|
Volume | 526 LNCE |
ISSN (Print) | 2366-2557 |
ISSN (Electronic) | 2366-2565 |
Conference
Conference | 10th International Conference on Civil Engineering, ICCE2023 |
---|---|
Country/Territory | China |
City | Nanchang |
Period | 24/12/23 → 25/12/23 |
Bibliographical note
Publisher Copyright:© The Author(s) 2024.
Keywords
- Generative Model
- Imbalanced Construction Dataset
- Machine Learning (ML)
- Variational Autoencoder (VAE)