IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences 2008 E91-A(3):775-781; doi:10.1093/ietfec/e91-a.3.775
Copyright © 2008 The Institute of Electronics, Information and Communication Engineers

Regular Section -- Papers -- Speech and Hearing

Robust Speech Spectra Restoration against Unspecific Noise Conditions for Pitch Detection

Xin XU¹, Noboru HAYASAKA¹ and Yoshikazu MIYANAGA¹

¹ The authors are with the Graduate School of Information Science and Technology, Hokkaido University, Sapporo-shi, 060-0814 Japan. E-mail: steven{at}csm.ist.hokudai.ac.jp

This paper proposes a new algorithm, Adaptive Running Spectrum Filtering (ARSF), to restore the amplitude spectra of speech corrupted by additive noise. Based on a noise estimate obtained beforehand, adaptive filtering is applied to the speech modulation spectra according to the noise conditions, so that the periodic structures in the amplitude spectra are preserved against noise distortion. Because these spectral structures carry the fundamental frequency, which is the inverse of the pitch period, ARSF is incorporated into pitch detection to improve its accuracy. Experimental results show that, compared with conventional methods, the proposed method significantly improves the robustness of pitch detection across several noise types and SNRs.
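The idea of filtering modulation spectra according to a prior noise estimate can be illustrated with a minimal sketch. It is not the ARSF algorithm itself, whose filter design is not given in this abstract. The sketch assumes that the first few frames contain noise only, uses a moving average along time (a simple low-pass filter on each frequency bin's amplitude trajectory) as a stand-in for the modulation-domain filter, and picks the filter length from a hypothetical SNR rule. All function names and thresholds are illustrative assumptions.

```python
import math


def estimate_snr_db(frames, noise_frames=5):
    """Rough SNR estimate in dB.

    Assumption: the first `noise_frames` frames contain noise only.
    `frames` is a list of amplitude spectra (one list of bins per frame).
    """
    rest = frames[noise_frames:]
    noise_power = sum(a * a for f in frames[:noise_frames] for a in f) / noise_frames
    total_power = sum(a * a for f in rest for a in f) / len(rest)
    speech_power = max(total_power - noise_power, 1e-12)
    return 10.0 * math.log10(speech_power / max(noise_power, 1e-12))


def smoothing_length(snr_db):
    """Hypothetical adaptation rule: filter more strongly at lower SNR."""
    if snr_db >= 20.0:
        return 1  # nearly clean: leave trajectories untouched
    if snr_db >= 10.0:
        return 3
    return 5


def running_spectrum_filter(frames, length):
    """Moving average over time, applied to each frequency bin separately.

    Filtering a bin's amplitude trajectory across frames is filtering in
    the modulation-frequency domain; a moving average is a low-pass there.
    """
    n_frames, n_bins, half = len(frames), len(frames[0]), length // 2
    filtered = []
    for t in range(n_frames):
        lo, hi = max(0, t - half), min(n_frames, t + half + 1)
        filtered.append(
            [sum(frames[k][b] for k in range(lo, hi)) / (hi - lo) for b in range(n_bins)]
        )
    return filtered


if __name__ == "__main__":
    # Toy spectrogram: 2 noise-only frames, then frames with a strong bin 1.
    spectrogram = [[0.5, 0.5, 0.5], [0.5, 0.4, 0.6],
                   [0.5, 3.0, 0.6], [0.4, 3.2, 0.5], [0.6, 2.9, 0.5]]
    snr = estimate_snr_db(spectrogram, noise_frames=2)
    restored = running_spectrum_filter(spectrogram, smoothing_length(snr))
    print(round(snr, 2), restored[3])
```

In this toy setting the filter length grows as the estimated SNR drops, which mirrors the adaptive behaviour described above. A real system would more likely use a band-pass design on the modulation spectra and a more careful noise estimator.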

Key Words: ARSF, noise robust, pitch detection, modulation spectra


Manuscript received April 2, 2007. Manuscript revised July 24, 2007.


