A taxonomy based semantic similarity of documents using the cosine measure

Ainura Madylova*, Şule Gündüz Öǧüdücü

*Bu çalışma için yazışmadan sorumlu yazar

Araştırma sonucu: ???type-name???Konferans katkısıbilirkişi

31 Atıf (Scopus)

Özet

In this paper, we present a new method for calculating semantic similarities between documents. This method is based on cosine similarity calculation between concept vectors of documents obtained from a taxonomy of words that captures IS-A relations. The calculation of semantic similarities between documents is a very time consuming task, since it is necessary first to calculate semantic similarities between each pair of words that appear on different documents. In this paper, we present a new method to calculate semantic similarities between documents which results in faster computational time. Both a taxonomy based semantic similarity and cosine similarity are employed. First, the concept vectors of documents are obtained by extending the terms in the document vectors with their corresponding IS-A concepts. Cosine similarity is then calculated between those concept vectors of documents. Thus, the overall similarity between documents is a combination of cosine similarity and semantic similarity. The proposed semantic similarity is tested in document clustering problem. The experimental results show that our method achieves a good performance.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığı2009 24th International Symposium on Computer and Information Sciences, ISCIS 2009
Sayfalar129-134
Sayfa sayısı6
DOI'lar
Yayın durumuYayınlandı - 2009
Etkinlik2009 24th International Symposium on Computer and Information Sciences, ISCIS 2009 - Guzelyurt, Cyprus
Süre: 14 Eyl 200916 Eyl 2009

Yayın serisi

Adı2009 24th International Symposium on Computer and Information Sciences, ISCIS 2009

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???2009 24th International Symposium on Computer and Information Sciences, ISCIS 2009
Ülke/BölgeCyprus
ŞehirGuzelyurt
Periyot14/09/0916/09/09

Parmak izi

A taxonomy based semantic similarity of documents using the cosine measure' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap