How to estimate a speech sound's fundamental frequency
Hello,
This code has a parameter P.hop % samples - interval between estimates (default: 32)
I read that the frame (interval) needs to be longer than 2 fundamental periods to capture the characteristics of the speech, but not so long that the estimate becomes inaccurate.
If my speech file's fundamental frequency is about 200 Hz, then its fundamental period is 1/200 sec?
And the file's sampling rate is 44100 Hz, so I need to set the frame longer than 1/200*2*44100 = 441 samples?
The output of YIN is R.f0 % fundamental frequency in octaves re: 440 Hz. So if I want the speech file's fundamental frequency contour, I compute F0=2.^R.f0*440; is F0 then the fundamental frequency contour of the speech?
But if I do that, some values in the F0 vector are not correct (higher than 500 Hz, up to about 1000~2000 Hz). Is that because those frames are unvoiced speech, so the estimate is completely wrong? And can I just set the values larger than 500 to NaN in the F0 vector, to indicate that the unvoiced parts of the speech have no fundamental frequency?
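The conversion and masking described in the question can be sketched as follows. This is only an illustration under stated assumptions: f0_oct is a hypothetical stand-in for R.f0 (built here from made-up frequencies), and the 500 Hz ceiling is the value proposed in the question, not a recommended setting.

```matlab
% Hypothetical stand-in for R.f0: values in octaves re: 440 Hz.
% The frequencies below are made up for illustration only.
f0_oct = log2([200 210 1500 190 900] / 440);

% Octaves re: 440 Hz -> Hz (same formula as in the question).
F0 = 440 * 2.^f0_oct;

% ASSUMED ceiling from the question: treat anything above 500 Hz
% as an unreliable estimate and mark it as missing.
maxF0 = 500;
F0(F0 > maxF0) = NaN;
```

Note that a fixed ceiling only discards implausibly high estimates. An unvoiced frame whose estimate happens to fall below the ceiling would still be kept, so this is a heuristic rather than a voicing detector.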
Thanks.
Answers (1)
Wayne King
on 18 Feb 2013
Edited: Wayne King
on 18 Feb 2013
Often, you want to lowpass filter your speech waveform before you attempt to extract the fundamental frequency.
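As a rough illustration of that lowpass step, here is a minimal sketch that uses only base MATLAB functions (a windowed-sinc FIR filter applied with conv). The 900 Hz cutoff, the filter length, and the synthetic test signal are all assumptions made for the example, not values from this thread. With the Signal Processing Toolbox, a design based on butter and filtfilt would be a common alternative.

```matlab
% Windowed-sinc FIR lowpass using only base MATLAB functions.
fs = 44100;                 % sampling rate from the question (Hz)
fc = 900;                   % ASSUMED cutoff (Hz), not from the thread
M  = 200;                   % ASSUMED half-length; filter has 2*M+1 taps

% Synthetic stand-in for speech: 200 Hz "voice" plus a 5 kHz component.
t = (0:fs-1)'/fs;
voiced = sin(2*pi*200*t);
x = voiced + 0.5*sin(2*pi*5000*t);

% Ideal lowpass impulse response, tapered by a Hamming window.
n = (-M:M)';
h = sin(2*pi*fc/fs*n) ./ (pi*n);   % 0/0 at n = 0, patched on the next line
h(n == 0) = 2*fc/fs;
h = h .* (0.54 + 0.46*cos(pi*n/M));
h = h / sum(h);                    % unity gain at DC

% 'same' keeps the output aligned with the input (no added delay).
y = conv(x, h, 'same');

% Largest deviation from the 200 Hz component, ignoring the edges.
mid = (M+1):(numel(x)-M);
maxErr = max(abs(y(mid) - voiced(mid)));
```

Away from the first and last M samples, y should closely follow the 200 Hz component while the 5 kHz component is strongly attenuated. For real speech, the cutoff would need to sit above the highest fundamental you expect.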
Have you seen these two examples in the Signal Processing Toolbox documentation:
As for your questions:
"If my speech file's fundamental frequency is about 200 Hz, then its fundamental period is 1/200 sec?
And the file's sampling rate is 44100 Hz, so I need to set the frame longer than 1/200*2*44100 = 441 samples?"
A 200 Hz oscillation sampled at 44.1 kHz has a period of 220.5 samples.
dt = 1/44100;   % sampling interval in seconds
T = 1/200;      % fundamental period in seconds
N = T/dt        % samples per period (220.5)
So 2 periods would be 441 samples as you state. However, in my experience you need more than 2 periods to make accurate estimates, so I would consider increasing that length if you can. 441 samples is only 0.01 seconds, or 10 milliseconds. You should be able to find vowel sounds in the speech signal longer than that in duration.
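The window-length arithmetic above generalizes to any number of periods. The sketch below assumes 4 periods purely as an example of "more than 2"; nPeriods, winLen, and winDur are illustrative names, not parameters of any particular pitch estimator.

```matlab
fs       = 44100;   % sampling rate from the question (Hz)
f0       = 200;     % approximate fundamental from the question (Hz)
nPeriods = 4;       % ASSUMED: more than the 2-period minimum

samplesPerPeriod = fs / f0;                   % 220.5 samples
winLen = ceil(nPeriods * samplesPerPeriod);   % window length in samples
winDur = winLen / fs;                         % window duration in seconds
```

With these numbers the window is 882 samples, or 20 milliseconds. If the pitch varies, it is safer to size the window from the lowest fundamental you expect, since a lower pitch means a longer period.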