AUTOMATIC SPEECH RECOGNITION-A SURVEY

Julna Nazer; Sajeer K.

PDF

Published: Apr 30, 2016

Keywords:

Automatic speech recognition, feature extraction, neural networks, hidden Markov model, acoustic phonetic approach.

Julna Nazer

Sajeer K.

Abstract

Speech recognition is the next big step that the technology needs to take for general users. An Automatic Speech Recognition (ASR) will play a major role in focusing new technology to users. Applications of ASR are speech to text conversion, voice input in aircraft, data entry, voice user interfaces such as voice dialing. Speech recognition involves extracting features from the input signal and classifying them to classes using pattern matching model. This can be done using feature extraction method. This paper involves a general study of automatic speech recognition and various methods to generate an ASR system. General techniques that can be used to implement an ASR includes artificial neural networks, Hidden Markov model, acoustic – phonetic approach

How to Cite

[1]

Julna Nazer and Sajeer K., “AUTOMATIC SPEECH RECOGNITION-A SURVEY”, Int. J. Comput. Eng. Res. Trends, vol. 3, no. 4, pp. 190–193, Apr. 2016.

Issue

Vol. 3 No. 4 (2016): April (2016) Issue

Section

Survey

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

IJCERT Policy:

The published work presented in this paper is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. This means that the content of this paper can be shared, copied, and redistributed in any medium or format, as long as the original author is properly attributed. Additionally, any derivative works based on this paper must also be licensed under the same terms. This licensing agreement allows for broad dissemination and use of the work while maintaining the author's rights and recognition.

By submitting this paper to IJCERT, the author(s) agree to these licensing terms and confirm that the work is original and does not infringe on any third-party copyright or intellectual property rights.

References

Naoki Hirayama, Koichiro Yoshino, Katsutoshi Itoyama, Shinsuke Mori, and Hiroshi G. Okuno,”Automatic Speech Recognition for Mixed Dialect Utterances by Mixing Dialect Language Models” IEEE Transactions on Audio, speech and language processing , VOL. 23, NO. 2, FEBRUARY 2015.

N. Hirayama, K. Yoshino, K. Itoyama, S. Mori, and H. G. Okuno, Automatic estimation of dialect mixing ratio for dialect speech recognition, IEEE in Proc. Interspeech 13, 2013.

N. Hirayama, S. Mori, and H. G. Okuno, Statistical method of building dialect language models for ASR systems,” IEEE in Proc. COLING , 2012.

Rongfeng Su,Xunying Liu,Lan Wang ”Automatic Complexity Control of Generalized Variable Parameter HMMs for Noise Robust Speech Recognition,” IEEE Transactions on Audio, speech and language processing, 2015.

Cumani S, Laface P.”Large-Scale Training of Pair wise Support Vector Machine for Speaker Recognition” IEEE transactions on audio, speech and language processing 2014.

Sajeer Karattil,”A novel approach of implementation of speech recognition using neural networks for information retrieval”, International journal in Science and Technology vol 8 issue 33 Dec 2015.

AUTOMATIC SPEECH RECOGNITION-A SURVEY

Abstract

References

QUICK LINKS

FOR AUTHORS

FOR REVIEWERS

JOURNAL CONTENTS

DOWNLOADS

Article Sidebar

Main Article Content

Abstract

Article Details

References