Özet
This paper presents a novel full autonomous multilingual audio transcription system tailored to Kazakh, Russian, and English. The proposed solution integrates a language detection module based on SpeechBrain with a transcription engine using Vosk, and employs FFmpeg for robust audio preprocessing. The system automatically detects the language from the initial 10 seconds of an audio stream, selects the corresponding acoustic model, and produces an accurate text transcription. Experimental evaluations on both synthetic and real audio data indicate that our approach achieves competitive performance in terms of accuracy (with word error rates ranging from 5 to 10% under optimal conditions) and processing speed, while operating entirely on local resources without dependency on cloud services. These features make it particularly suitable for applications in digital forensics and other domains that require secure real-time transcription capabilities.
| Orijinal dil | İngilizce |
|---|---|
| Ana bilgisayar yayını başlığı | Proceedings - 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer |
| Editörler | Hyun Yoe, Ha Jin Hwang, Meonghun Lee, Rackwoo Kim, Ryugap Lim, Sungtaek Lee, Seaeul Kim, Simon Xu, Miguel Garcia-Ruiz, Wenyin Feng, A B M Bodrul Alam, Randy Lin, Ajmery Sultana, Faria Khandaker, Mahreen Nasir, Ken Higuchi, Shinichiro Mori, Teruhisa Hochin |
| Yayınlayan | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Elektronik) | 9798331512583 |
| DOI'lar | |
| Yayın durumu | Yayınlandı - 2025 |
| Etkinlik | 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer - Busa, Korea, Republic of Süre: 25 Haz 2025 → 27 Haz 2025 |
Yayın serisi
| Adı | Proceedings - 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer |
|---|
???event.eventtypes.event.conference???
| ???event.eventtypes.event.conference??? | 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer |
|---|---|
| Ülke/Bölge | Korea, Republic of |
| Şehir | Busa |
| Periyot | 25/06/25 → 27/06/25 |
Bibliyografik not
Publisher Copyright:©2025 IEEE.
Parmak izi
Ai-based offline speech recognition for kazakh, russian and english languages' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.Alıntı Yap
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver