Abstract
Accurate soil type classification is fundamental to geotechnical engineering, yet traditional laboratory methods are often time consuming and labor intensive. This study investigates the potential of a Transformer-based deep learning framework for the automated classification of complex soil compositions. An image database for geotechnical analysis is constructed using six distinct geotechnical samples comprising gravel, sand, silt, and clay systematically blended into 80 ternary mixtures. To address the inherent class imbalances in the multi-component dataset, the Synthetic Minority Oversampling Technique (SMOTE) is employed, ensuring robust representation across all categories. The proposed framework utilizes a Vision Transformer (ViT) architecture, leveraging its self-attention mechanism to capture both intricate textural patterns and long-range structural dependencies within the soil matrices. Experimental results demonstrate that the SMOTE–ViT pipeline achieved an overall accuracy of 95.83%, with high precision and recall across diverse ternary compositions. This interdisciplinary approach provides a scalable and high-precision alternative for soil characterization, offering significant potential for real-time decision-making in geotechnical investigation workflows.
| Original language | English |
|---|---|
| Article number | 4063 |
| Journal | Applied Sciences (Switzerland) |
| Volume | 16 |
| Issue number | 9 |
| DOIs | |
| Publication status | Published - May 2026 |
Bibliographical note
Publisher Copyright:© 2026 by the authors.
Keywords
- deep learning
- image processing
- SMOTE
- soil classification
- soil type recognition
- ViT
Fingerprint
Dive into the research topics of 'A SMOTE–ViT Framework for Advanced Soil Classification on a Self-Generated Geotechnical Image Database'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver