AUTOMATIC SPEECH RECOGNITION-A SURVEY
Main Article Content
Abstract
Speech recognition is the next big step that the technology needs to take for general users. An Automatic Speech Recognition (ASR) will play a major role in focusing new technology to users. Applications of ASR are speech to text conversion, voice input in aircraft, data entry, voice user interfaces such as voice dialing. Speech recognition involves extracting features from the input signal and classifying them to classes using pattern matching model. This can be done using feature extraction method. This paper involves a general study of automatic speech recognition and various methods to generate an ASR system. General techniques that can be used to implement an ASR includes artificial neural networks, Hidden Markov model, acoustic – phonetic approach
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
IJCERT Policy:
The published work presented in this paper is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. This means that the content of this paper can be shared, copied, and redistributed in any medium or format, as long as the original author is properly attributed. Additionally, any derivative works based on this paper must also be licensed under the same terms. This licensing agreement allows for broad dissemination and use of the work while maintaining the author's rights and recognition.
By submitting this paper to IJCERT, the author(s) agree to these licensing terms and confirm that the work is original and does not infringe on any third-party copyright or intellectual property rights.
References
Naoki Hirayama, Koichiro Yoshino, Katsutoshi Itoyama, Shinsuke Mori, and Hiroshi G. Okuno,”Automatic Speech Recognition for Mixed Dialect Utterances by Mixing Dialect Language Models” IEEE Transactions on Audio, speech and language processing , VOL. 23, NO. 2, FEBRUARY 2015.
N. Hirayama, K. Yoshino, K. Itoyama, S. Mori, and H. G. Okuno, Automatic estimation of dialect mixing ratio for dialect speech recognition, IEEE in Proc. Interspeech 13, 2013.
N. Hirayama, S. Mori, and H. G. Okuno, Statistical method of building dialect language models for ASR systems,” IEEE in Proc. COLING , 2012.
Rongfeng Su,Xunying Liu,Lan Wang ”Automatic Complexity Control of Generalized Variable Parameter HMMs for Noise Robust Speech Recognition,” IEEE Transactions on Audio, speech and language processing, 2015.
Cumani S, Laface P.”Large-Scale Training of Pair wise Support Vector Machine for Speaker Recognition” IEEE transactions on audio, speech and language processing 2014.
Sajeer Karattil,”A novel approach of implementation of speech recognition using neural networks for information retrieval”, International journal in Science and Technology vol 8 issue 33 Dec 2015.