DSpace Repository

Turkish Speech Recognition Based On Deep Neural Networks

dc.creator KIMANUKA, Ussen Abre
dc.creator BUYUK, Osman
dc.date 2018-10-05T00:00:00Z
dc.date.accessioned 2019-07-09T12:00:27Z
dc.date.available 2019-07-09T12:00:27Z
dc.identifier http://dergipark.org.tr/sdufenbed/issue/39695/470071
dc.identifier.uri http://acikerisim.sdu.edu.tr/xmlui/handle/123456789/46759
dc.description In this paper, we develop a Turkish speech recognition (SR) system using deep neural networks and compare it with the previous state-of-the-art Gaussian mixture model-hidden Markov model (GMM-HMM) method, using the same Turkish speech dataset and the same large-vocabulary Turkish corpus. Most SR systems deployed today, worldwide and in Turkey in particular, use hidden Markov models (HMMs) to handle the temporal variability of speech. Gaussian mixture models are used to estimate how well each state of each HMM fits a short frame of coefficients representing the acoustic input. A feed-forward deep neural network (DNN) is another way to estimate this fit: the network takes several frames of coefficients as input and outputs posterior probabilities over HMM states. DNNs have been shown to outperform the traditional GMM-HMM approach in other languages such as English and German. Turkish is an agglutinative language, and the lack of large amounts of speech data complicates the design of a high-performing SR system. We expect deep neural networks to improve performance, but not to match the results reported for English, because less speech data is available. We present various architectural and training techniques for the Turkish DNN-based models. The models are tested on a Turkish database collected from mobile devices. In the experiments, we observe that the Turkish DNN-HMM system reduces the word error rate by approximately 2.5% compared with the traditional GMM-HMM system.
dc.format application/pdf
dc.publisher Süleyman Demirel University
dc.publisher Süleyman Demirel Üniversitesi
dc.relation http://dergipark.org.tr/download/article-file/552974
dc.source Volume: 22, Issue: Special, pp. 319-329
dc.source 1308-6529
dc.subject Turkish speech recognition; Deep neural network; Gaussian mixture model; Hidden Markov model; GMM-HMM; DNN-HMM
dc.title Turkish Speech Recognition Based On Deep Neural Networks
dc.type info:eu-repo/semantics/article
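
The description above says the DNN takes several frames of coefficients as input and outputs posterior probabilities over HMM states. A minimal sketch of that forward pass follows. It is not the authors' model: the 13 coefficients per frame, the context of 2 frames on each side, the layer sizes, the 5 HMM states, and the random untrained weights are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import math
import random


def splice(frames, context):
    """Stack each frame with `context` neighbours on each side.

    Edge frames are padded by repeating the first or last frame.
    """
    n = len(frames)
    spliced = []
    for t in range(n):
        window = []
        for k in range(t - context, t + context + 1):
            idx = min(max(k, 0), n - 1)  # clamp to the valid frame range
            window.extend(frames[idx])
        spliced.append(window)
    return spliced


def dense(x, weights, biases):
    """Affine layer: one output per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, biases)]


def relu(v):
    return [max(0.0, a) for a in v]


def softmax(z):
    """Numerically stable softmax, so the outputs sum to 1."""
    m = max(z)
    exps = [math.exp(a - m) for a in z]
    total = sum(exps)
    return [e / total for e in exps]


def init_layer(rng, n_in, n_out):
    """Random (untrained) weights; a real system would learn these."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def state_posteriors(frames, layers, context):
    """Return one posterior distribution over HMM states per frame."""
    result = []
    for x in splice(frames, context):
        h = x
        for weights, biases in layers[:-1]:
            h = relu(dense(h, weights, biases))
        weights, biases = layers[-1]
        result.append(softmax(dense(h, weights, biases)))
    return result


# Illustrative sizes only (assumptions, not taken from the paper).
N_COEFFS, CONTEXT, N_HIDDEN, N_STATES, N_FRAMES = 13, 2, 16, 5, 8

rng = random.Random(0)
frames = [[rng.gauss(0.0, 1.0) for _ in range(N_COEFFS)] for _ in range(N_FRAMES)]
input_dim = N_COEFFS * (2 * CONTEXT + 1)
layers = [
    init_layer(rng, input_dim, N_HIDDEN),
    init_layer(rng, N_HIDDEN, N_HIDDEN),
    init_layer(rng, N_HIDDEN, N_STATES),
]
posteriors = state_posteriors(frames, layers, CONTEXT)
```

In a hybrid DNN-HMM decoder these per-frame posteriors would typically be divided by the state priors to obtain scaled likelihoods, which then take the place of the GMM scores. That step is omitted here.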
