TY - JOUR
T1 - Adaptive Block Transform Coding of Speech Based on LPC Vector Quantization
AU - Hussain, Yunus
AU - Farvardin, Nariman
PY - 1991/12
Y1 - 1991/12
N2 - In this paper we describe several adaptive block transform speech coding systems based on vector quantization of LPC parameters. In order to account for the power fluctuations, the speech signal is normalized to have a unit-energy prediction residual. The temporal variations in the short-term spectrum, on the other hand, are taken into account by vector quantizing the LPC parameters associated with the vector of speech samples and transmitting the codeword index. Also, a variation of the scheme in which the pitch information is used to better estimate the spectrum is considered. For each block, based on the codevector associated with the input vector, an optimum bit assignment map is used to quantize the transform coefficients. We consider two types of zero-memory quantizers for encoding the transform coefficients, namely the Lloyd-Max quantizer and the entropy-coded quantizer. The performance of these schemes is compared with other adaptive transform coding schemes. We show by means of simulations that the system based on entropy-coded quantizer design leads to very high performance and in most cases as much as 5-dB performance improvement in terms of segmental signal-to-noise ratio is observed over the adaptive block transform coding scheme of Zelinski and Noll. The effect of incorporating the pitch information in the coder is studied and numerical results are presented. The effects of the bit rate and the size of the codebook on the performance of the systems are also studied in detail.
AB - In this paper we describe several adaptive block transform speech coding systems based on vector quantization of LPC parameters. In order to account for the power fluctuations, the speech signal is normalized to have a unit-energy prediction residual. The temporal variations in the short-term spectrum, on the other hand, are taken into account by vector quantizing the LPC parameters associated with the vector of speech samples and transmitting the codeword index. Also, a variation of the scheme in which the pitch information is used to better estimate the spectrum is considered. For each block, based on the codevector associated with the input vector, an optimum bit assignment map is used to quantize the transform coefficients. We consider two types of zero-memory quantizers for encoding the transform coefficients, namely the Lloyd-Max quantizer and the entropy-coded quantizer. The performance of these schemes is compared with other adaptive transform coding schemes. We show by means of simulations that the system based on entropy-coded quantizer design leads to very high performance and in most cases as much as 5-dB performance improvement in terms of segmental signal-to-noise ratio is observed over the adaptive block transform coding scheme of Zelinski and Noll. The effect of incorporating the pitch information in the coder is studied and numerical results are presented. The effects of the bit rate and the size of the codebook on the performance of the systems are also studied in detail.
UR - http://www.scopus.com/inward/record.url?scp=0026385368&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0026385368&partnerID=8YFLogxK
U2 - 10.1109/78.107411
DO - 10.1109/78.107411
M3 - Article
AN - SCOPUS:0026385368
SN - 1053-587X
VL - 39
SP - 2611
EP - 2620
JO - IEEE Transactions on Signal Processing
JF - IEEE Transactions on Signal Processing
IS - 12
ER -