
From ideal to practical: Heterogeneity of student-generated variant lists highlights hidden reproducibility gaps

  • Istanbul Technical University
  • Yildiz Technical University
  • Health Institutes of Türkiye
  • Dokuz Eylul University

Research output: Contribution to journal › Article › peer-review

Abstract

Next-generation sequencing (NGS) technologies offer detailed and inexpensive identification of the genetic structure of living organisms. The massive volume of data they produce requires advanced computational resources for analysis. However, the rapid accumulation of data and the urgent need for analysis tools have led to the development of imperfect software solutions. Given the immense potential of these tools in clinical applications and the recent discussions of a reproducibility crisis in science and technology, they must be thoroughly examined. Typically, NGS data analysis tools are benchmarked under homogeneous conditions, with well-trained personnel and ideal hardware and data environments. In the real world, however, these analyses are carried out under heterogeneous conditions in terms of computing environments and experience levels. This difference is mostly overlooked; studies that examine NGS workflows generated under varied conditions would therefore be highly valuable. Moreover, a detailed assessment of the difficulties faced by trainees would allow educational programs to be improved for better NGS analysis training. Considering these needs, we designed an elective undergraduate bioinformatics course project for computer engineering students at Istanbul Technical University. Students were tasked with running and comparing 12 different somatic variant calling pipelines on the recently published SEQC2 dataset. On examining the results, we found that, although they appeared correct, the final variant lists created by different student groups displayed a high level of heterogeneity. Notably, the operating systems and installation methods were the most influential factors in variant-calling performance. Here, we present detailed evaluations of our case study and provide insights for better bioinformatics training.

Original language: English
Article number: e1013552
Journal: PLoS Computational Biology
Volume: 21
Issue number: 10
DOIs
Publication status: Published - 16 Oct 2025

Bibliographical note

Publisher Copyright:
© 2025 Ertürk et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

UN SDGs

This output contributes to the following UN Sustainable Development Goal(s)

  1. SDG 4 - Quality Education
