• Login
    View Item 
    •   DSpace Home
    • 2-DERGİLER
    • 03) Bitlis Eren Üniversitesi Fen Bilimleri Dergisi
    • Cilt 15, Sayı 1 (2026)
    • View Item
    •   DSpace Home
    • 2-DERGİLER
    • 03) Bitlis Eren Üniversitesi Fen Bilimleri Dergisi
    • Cilt 15, Sayı 1 (2026)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    A CNN–NCP BASED HYBRID DEEP LEARNING MODEL FOR SPEECH-DRIVEN GENDER CLASSIFICATION

    Thumbnail
    View/Open
    Tam Metin/Full Text (576.3Kb)
    Date
    2026
    Author
    OLGUN, Sevda
    BALIM, Caner
    OLGUN, Nevzat
    Metadata
    Show full item record
    Abstract
    Speech is one of the most natural and effective forms of human communication, carrying both linguistic and non-linguistic information. It plays a crucial role in many applications such as gender classification, biometric authentication, and personalized human-computer interaction. This study aims to investigate the contribution of a hybrid deep learning model based on Neural Circuit Policies (NCP), inspired by biological neural systems, for gender classification on Turkish speech data, by evaluating its performance in terms of accuracy and computational efficiency in comparison with conventional recurrent models. Mel-Frequency Cepstral Coefficients (MFCC) and log-Mel spectrogram features are combined to simultaneously capture the spectral and temporal properties of speech signals. These features are learned as low-level acoustic patterns via Conv1D layers. Longterm temporal dependencies are modeled using Liquid Time Constant (LTC) cells defined within the NCP architecture. To evaluate the generalizability of the model, the experiments were conducted under a speaker-independent setup, and ablation studies were performed by removing different components of the architecture to clearly assess the contribution of the NCP component. Cross-validation was applied on the Mozilla Common Voice 12.0 Turkish dataset during the experiments. The Conv1D+NCP model achieved 99.29% accuracy and 99.28% F1-score, while the LSTM-based model yielded slightly lower results. The NCPbased model offers high performance and computational efficiency with fewer parameters, making it a powerful alternative for real-time applications.
    URI
    http://dspace.beu.edu.tr:8080/xmlui/handle/123456789/16744
    Collections
    • Cilt 15, Sayı 1 (2026) [40]





    Creative Commons License
    DSpace@BEU by Bitlis Eren University Institutional Repository is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 Unported License..

    DSpace software copyright © 2002-2016  DuraSpace
    Contact Us | Send Feedback
    Theme by 
    Atmire NV
     

     




    | Yönerge | Rehber | İletişim |

    sherpa/romeo

    Browse

    All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsBy TypeThis CollectionBy Issue DateAuthorsTitlesSubjectsBy Type

    My Account

    LoginRegister

    DSpace software copyright © 2002-2016  DuraSpace
    Contact Us | Send Feedback
    Theme by 
    Atmire NV