Voice Recognition System using Template Matching

Download Full Text
Luqman Gbadamosi
Published Date:
September 05, 2013
Volume 3, Issue 5
13 - 17

chebyshev’s inequality, discrete fourier transform, frequency spectra, voice recognition
Luqman Gbadamosi, " Voice Recognition System using Template Matching". International Journal of Research in Computer Science, 3 (5): pp. 13-17, September 2013. doi:10.7815/ijorcs.35.2013.070 Other Formats


It is easy for human to recognize familiar voice but using computer programs to identify a voice when compared with others is a herculean task. This is due to the problem that is encountered when developing the algorithm to recognize human voice. It is impossible to say a word the same way in two different occasions. Human speech analysis by computer gives different interpretation based on varying speed of speech delivery. This research paper gives detail description of the process behind implementation of an effective voice recognition algorithm. The algorithm utilize discrete Fourier transform to compare the frequency spectra of two voice samples because it remained unchanged as speech is slightly varied. Chebyshev inequality is then used to determine whether the two voices came from the same person. The algorithm is implemented and tested using MATLAB.

  1. Kinnunen, Tomi; Li, Haizhou. "An overview of text-independent speaker recognition: From features to super vectors". Speech Communication 52 (1): 12–40. doi:10.1016/j.specom.2009.08.009
  2. Homayoon Beigi, “Speaker Recognition, Biometrics / Book 1, Jucheng Yang (ed.), Intech Open Access Publisher, 2011, pp. 3-28, ISBN 978-953-307-618-8. doi: 10.1007/978-0-387-77592-0
  3. Duhamel, P. and M. Vetterli, "Fast Fourier Transforms: A Tutorial Review and a State of the Art," Signal Processing, Vol. 19, April 1990, pp. 259-299. doi: 10.1016/0165-1684(90)90158-U
  4. Oppenheim, A. V. and R. W. Schafer, “Discrete-Time Signal Processing”, Prentice-Hall, 1989, p. 611.
  5. Oppenheim, A. V. and R. W. Schafer, Discrete-Time Signal Processing, Prentice-Hall, 1989, p. 619.
  6. Rader, C. M., "Discrete Fourier Transforms when the Number of Data Samples Is Prime," Proceedings of the IEEE, Vol. 56, June 1968 (Current Version: June 2005), pp. 1107-1108. doi: 10.1109/PROC.1968.6477
  7. Oppenheim, A. V. and R.W. Schafer. Discrete-Time Signal Processing, Englewood Cliffs, NJ: Prentice-Hall, 1989, pp. 311-312.
  8. ITU-T Recommendation G.711, "Pulse Code Modulation (PCM) of Voice Frequencies," General Aspects of Digital Transmission Systems; Terminal Equipments, International Telecommunication Union (ITU), 1993.
  9. Beigi, Homayoon (2011). “ Fundamentals of Speaker Recognition.”. [Online]. Available: http:// www.wikipedia.org/wiki/speaker_recognition.
  10. Course project (Fall 2009 ) “Voice Recognition Using MATLAB”. California State University Northridge during the semester. [Online]. Available: http://www.cnx.org/content/m33347/1.3/module_export?format=zip
  11. “Article on Human Voice” [Online]. Available: http://www.wikipedia.org/wiki/Human voice.
  12. “Techniques of Voice Recognition System” [Online]. Available:http://www.hitl.washington.edu/scllw/EVE/I.D.2.d.VoiceRecognition.htm
  13. “Probability Tutorials on Chebyshevs-Inequality” [Online]. Available: http://www.statistics.about.com/od/ probHelpandTutorials/a/Chebyshevs-Inequality.htm.
  14. Sangram Bana, Dr. Davinder Kaur, “Fingerprint Recognition System using Image Segmentation”. International Journal of Advanced Engineering Sciences and technologies Vol No. 5, Issue No. 1, 012 – 023
  15. Kapil Sharma, H.P Sinha & R.K Aggarwal “Comparative study of speech Recognition System using various feature extraction techniques”. International Journal of Information Technology and Knowledge Management Vol 3, No2, pp. 695-698

  • Chouvatut, Varin, and Supachaya Prathan. "The flexible and adaptive X-mark detection for the simple answer sheets." Computer Science and Engineering Conference (ICSEC), 2014 International. IEEE, 2014.