Abstract
In medical genetics, each genetic variant is evaluated as an independent entity regarding its clinical importance. However, in most complex diseases, variant combinations in specific gene networks, rather than the presence of a particular single variant, predominates. In the case of complex diseases, disease status can be evaluated by considering the success level of a team of specific variants. We propose a high dimensional modelling based method to analyse all the variants in a gene network together, which we name 'Computational Gene Network Analysis' (CoGNA). To evaluate our method, we selected two gene networks, mTOR and TGF-ββ. For each pathway, we generated 400 control and 400 patient group samples. mTOR and TGF-ββ pathways contain 31 and 93 genes of varying sizes, respectively. We produced Chaos Game Representation images for each gene sequence to obtain 2-D binary patterns. These patterns were arranged in succession, and a 3-D tensor structure was achieved for each gene network. Features for each data sample were acquired by exploiting Enhanced Multivariance Products Representation to 3-D data. Features were split as training and testing vectors. Training vectors were employed to train a Support Vector Machines classification model. We achieved more than 96% and 99% classification accuracies for mTOR and TGF-ββ networks, respectively, using a limited amount of training samples.
Original language | English |
---|---|
Pages (from-to) | 2959-2969 |
Number of pages | 11 |
Journal | IEEE/ACM Transactions on Computational Biology and Bioinformatics |
Volume | 20 |
Issue number | 5 |
DOIs | |
Publication status | Published - 1 Sept 2023 |
Bibliographical note
Publisher Copyright:© 2004-2012 IEEE.
Keywords
- Gene network analysis
- chaos game representation
- enhanced multivariance products representation
- high dimensional modelling
- support vector machines