Show simple item record

dc.contributor.authorGENÇYILMAZ, İzel Zeynep
dc.contributor.authorKARAOĞLAN, Kürşat Mustafa
dc.date.accessioned2025-08-12T12:06:27Z
dc.date.available2025-08-12T12:06:27Z
dc.date.issued2024
dc.identifier.issn2147-3129
dc.identifier.urihttp://dspace.beu.edu.tr:8080/xmlui/handle/123456789/15656
dc.description.abstractThe Conversion of Speech to Text (CoST) is crucial for developing automated systems to understand and process voice commands. Studies have focused on developing this task, especially for Turkish-specific voice commands, a strategic language in the international arena. However, researchers face various challenges, such as Turkish's suffixed structure, phonological features and unique letters, dialect and accent differences, word stress, word-initial vowel effects, background noise, gender-based sound variations, and dialectal differences. To address the challenges above, this study aims to convert speech data consisting of Turkish-specific audio clips, which have been limitedly researched in the literature, into texts with highperformance accuracy using different Machine Learning (ML) models, especially models such as Convolutional Neural Network and Convolutional Recurrent Neural Network (CRNN). For this purpose, experimental studies were conducted on a dataset of 26,485 Turkish audio clips, and performance evaluation was performed with various metrics. In addition, hyperparameters were optimized to improve the model's performance in experimental studies. A performance of over 97% has been achieved according to the F1-score metric. The highest performance results were obtained with the CRNN approach. In conclusion, this study provides valuable insights into the strengths and limitations of various ML models applied to CoST. In addition to potentially contributing to a wide range of applications, such as supporting hard-of-hearing individuals, facilitating notetaking, automatic captioning, and improving voice command recognition systems, this study is one of the first in the literature on CoST in Turkish.tr_TR
dc.language.isoEnglishtr_TR
dc.publisherBitlis Eren Üniversitesitr_TR
dc.rightsinfo:eu-repo/semantics/openAccesstr_TR
dc.subjectSpeech to Text Conversion,tr_TR
dc.subjectNatural Language Processing, Convolutional Neural Network,tr_TR
dc.subjectConvolutional Recurrent Neural Network,tr_TR
dc.subjectMachine Learning.tr_TR
dc.subjectLearning,tr_TR
dc.subjectDeeptr_TR
dc.titleOptimizing Speech to Text Conversion in Turkish: An Analysis of Machine Learning Approachestr_TR
dc.typeArticletr_TR
dc.identifier.issue2tr_TR
dc.relation.journalBitlis Eren Üniversitesi Fen Bilimleri Dergisitr_TR
dc.identifier.volume13tr_TR


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record