Özet
Effective extraction of domain-specific terms and named entities is a key challenge in text mining. This paper investigates the use of the k-means clustering algorithm for unsupervised extraction of unigrams and named entities from text data. The approach groups terms based on their vector representations, enabling the identification of semantically similar words without labeled data. Experiments conducted on the ACTER (Annotated Corpora for Term Extraction Research) corpus evaluate the method using precision, recall, and F1-score. Results show average scores of 25.79% precision, 40.05% recall and 30.47% F1-score, with optimal performance achieved using 40 to 60 clusters. Future work will explore algorithm optimization and comparisons with alternative extraction techniques.
| Orijinal dil | İngilizce |
|---|---|
| Ana bilgisayar yayını başlığı | Selected Papers from the International Conference on Artificial Intelligence - FICAILY2025 - Current Research, Industry Trends, and Innovations |
| Editörler | Ali Othman Albaji |
| Yayınlayan | Springer Science and Business Media Deutschland GmbH |
| Sayfalar | 375-386 |
| Sayfa sayısı | 12 |
| ISBN (Basılı) | 9783032002310 |
| DOI'lar | |
| Yayın durumu | Yayınlandı - 2026 |
| Etkinlik | International Conference on AI: Current Research, Industry Trends, and Innovations, FICAILY 2025 - Tripoli, Libya Süre: 9 Tem 2025 → 10 Tem 2025 |
Yayın serisi
| Adı | Studies in Computational Intelligence |
|---|---|
| Hacim | 1229 SCI |
| ISSN (Basılı) | 1860-949X |
| ISSN (Elektronik) | 1860-9503 |
???event.eventtypes.event.conference???
| ???event.eventtypes.event.conference??? | International Conference on AI: Current Research, Industry Trends, and Innovations, FICAILY 2025 |
|---|---|
| Ülke/Bölge | Libya |
| Şehir | Tripoli |
| Periyot | 9/07/25 → 10/07/25 |
Bibliyografik not
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
Parmak izi
Automatic Unsupervised Extraction of Unigrams of Terms and Named Entities Using the K-Means Clustering Algorithm' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.Alıntı Yap
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver