Predicting Stack Overflow Question Tags: A Multi-Class, Multi-Label Classification

Eray Mert Kavuk, Ayse Tosun

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

5 Citations (Scopus)

Abstract

This work proposes to predict the tags assigned to posts on the Stack Overflow platform. The raw data, obtained from stackexchange.com, includes more than 50K posts and the associated tags given by users. The posts' questions and titles are pre-processed, and the sentences in the posts are then transformed into features via Latent Dirichlet Allocation. Because the problem is a multi-class, multi-label classification, we propose 1) one-against-all models for the 15 most popular tags, and 2) a combined multi-tag classifier that finds the top K tags for a single post. Three algorithms are used to train the one-against-all classifiers, which decide to what extent a post belongs to a tag. The probabilities of each post belonging to a tag are then combined, using the best-performing algorithm, to give the results of the multi-tag classifier. The performance is compared with a baseline approach (kNN). Our multi-tag classifier achieves 55% recall and 39% F1-score.
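As a rough illustration of the combination step described in the abstract, the sketch below takes per-tag probabilities (as one-against-all classifiers might produce them), keeps the top K tags for one post, and scores that prediction with recall and F1. The function names, the tag names, and the probability values are all hypothetical, and the abstract does not say how the authors aggregate metrics across posts, so this is only a minimal per-post sketch under those assumptions.

```python
from typing import Dict, List, Set, Tuple


def top_k_tags(tag_probs: Dict[str, float], k: int) -> List[str]:
    """Return the k tags with the highest one-against-all probability."""
    # Sort by descending probability; break ties alphabetically for determinism.
    ranked = sorted(tag_probs.items(), key=lambda item: (-item[1], item[0]))
    return [tag for tag, _ in ranked[:k]]


def recall_and_f1(predicted: List[str], actual: Set[str]) -> Tuple[float, float]:
    """Compute recall and F1 for a single post's predicted tags."""
    hits = len(set(predicted) & actual)
    recall = hits / len(actual) if actual else 0.0
    precision = hits / len(predicted) if predicted else 0.0
    if precision + recall == 0:
        return recall, 0.0
    return recall, 2 * precision * recall / (precision + recall)


# Hypothetical probabilities for one post (assumed values, not from the paper).
example_probs = {"python": 0.81, "pandas": 0.64, "java": 0.07, "c#": 0.03}

predicted = top_k_tags(example_probs, k=3)
recall, f1 = recall_and_f1(predicted, actual={"python", "pandas"})
print(predicted, recall, round(f1, 2))
```

A larger K tends to raise recall while lowering precision, which is one plausible reason a top-K tagger can report recall well above its F1-score.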

Original language: English
Title of host publication: Proceedings - 2020 IEEE/ACM 42nd International Conference on Software Engineering Workshops, ICSEW 2020
Publisher: Association for Computing Machinery, Inc
Pages: 489-493
Number of pages: 5
ISBN (Electronic): 9781450379632
Publication status: Published - 27 Jun 2020
Event: 42nd IEEE/ACM International Conference on Software Engineering Workshops, ICSEW 2020 - Seoul, Korea, Republic of
Duration: 27 Jun 2020 to 19 Jul 2020

Publication series

Name: Proceedings - 2020 IEEE/ACM 42nd International Conference on Software Engineering Workshops, ICSEW 2020

Conference

Conference: 42nd IEEE/ACM International Conference on Software Engineering Workshops, ICSEW 2020
Country/Territory: Korea, Republic of
City: Seoul
Period: 27/06/20 to 19/07/20

Bibliographical note

Publisher Copyright:
© 2020 ACM.
