TY - GEN
T1 - A new approach for mutation analysis using data mining techniques
AU - Kaya, Hüseyin
AU - Öǧüdücü, Şule Gündüz
PY - 2010
Y1 - 2010
N2 - In this study, a new method is proposed to be used in diagnostic process of genetic disorders to determine the mutations in DNA sequences. The contribution of our method is that it uses chromatograms without applying a base calling method in order to decrease the errors produced during the base calling step. Given reference and unknown chromatograms, our method searches for possible mutations in the unknown chromatogram against the reference one. Our approach first extracts feature vectors of both chromatograms by applying a two dimensional transformation to every data frame sliding through the chromatograms. The feature vectors are then used to obtain similarity matrix proceeded by applying dynamic programming from which differences between them are displayed. Difference plot can be used either for manual screening or automated mutation detection. We test our method on a freely available dataset. The results show that our method can successfully align two chromatograms and higlight the differences caused by mutations.
AB - In this study, a new method is proposed to be used in diagnostic process of genetic disorders to determine the mutations in DNA sequences. The contribution of our method is that it uses chromatograms without applying a base calling method in order to decrease the errors produced during the base calling step. Given reference and unknown chromatograms, our method searches for possible mutations in the unknown chromatogram against the reference one. Our approach first extracts feature vectors of both chromatograms by applying a two dimensional transformation to every data frame sliding through the chromatograms. The feature vectors are then used to obtain similarity matrix proceeded by applying dynamic programming from which differences between them are displayed. Difference plot can be used either for manual screening or automated mutation detection. We test our method on a freely available dataset. The results show that our method can successfully align two chromatograms and higlight the differences caused by mutations.
UR - http://www.scopus.com/inward/record.url?scp=78651256169&partnerID=8YFLogxK
U2 - 10.1109/CISIM.2010.5643664
DO - 10.1109/CISIM.2010.5643664
M3 - Conference contribution
AN - SCOPUS:78651256169
SN - 9781424478170
T3 - 2010 International Conference on Computer Information Systems and Industrial Management Applications, CISIM 2010
SP - 205
EP - 210
BT - 2010 International Conference on Computer Information Systems and Industrial Management Applications, CISIM 2010
T2 - 2010 International Conference on Computer Information Systems and Industrial Management Applications, CISIM 2010
Y2 - 8 October 2010 through 10 October 2010
ER -