JPWO1994007239A1

JPWO1994007239A1 - Audio encoding method and device

Info

Publication number: JPWO1994007239A1
Application number: JP6-507969A
Authority: JP
Inventors: 智彦谷口; 良紀田中; 恭士大田; 秀明栗原
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1992-09-16
Filing date: 1993-09-16
Publication date: 1994-10-06
Anticipated expiration: 2019-05-31

Abstract

(57) [Abstract] This publication contains application data filed before electronic filing was introduced, so no abstract data is recorded.

Description

Detailed Description of the Invention

Speech coding method and apparatus

Technical Field

The present invention relates to a speech coding method and apparatus for compressing the information in a speech signal, and more particularly to a speech coding method and apparatus that use Analysis-by-Synthesis (A-b-S) vector quantization to encode at transmission rates of 4 to 16 kbps.

Background Art

Speech coders that use A-b-S vector quantization, such as Code-Excited Linear Prediction (CELP) coders, have recently been regarded as promising for compressing speech signals while preserving their quality in in-house corporate communication systems, digital mobile radio systems, and similar applications. Such a vector-quantization speech coder (hereinafter simply "coder") applies predictive weighting to each code vector in a codebook to produce a reproduced signal, evaluates the error power between the reproduced signal and the input speech signal, and determines the number (index) of the code vector giving the smallest error, which it transmits to the receiving side.

A coder based on the A-b-S vector quantization scheme applies linear predictive synthesis filtering to each of the roughly 1,000 excitation-signal vector patterns stored in the codebook, and searches those roughly 1,000 patterns for the one that minimizes the error between the reproduced speech signal and the input speech signal to be encoded.

Because a coder must support conversation without perceptible delay, this search has to be performed in real time. The search must therefore be repeated continuously throughout a call at short intervals of, for example, 5 ms.

As described later, however, the search includes complex operations such as filter operations and correlation operations, and the computation they require is enormous, for example several hundred Mops (mega-operations per second). Even the fastest digital signal processors (DSPs) currently available would need several chips to handle this load, which makes miniaturization and low power consumption difficult when the coder is applied to, for example, a portable telephone.

As a speech coding scheme that solves this problem, the present applicant proposed, in Japanese Patent Application No. 3-127669 (Japanese Unexamined Patent Publication No. 4-352200), a tree-structured delta codebook. Instead of storing the code vectors themselves as in the conventional approach, the codebook stores delta vectors, which are differences between signal vectors, and code vectors having a tree structure are generated by successively adding and subtracting these delta vectors.

This scheme greatly reduces the memory required for the codebook. It also greatly reduces the computation, because the filter and correlation operations for each code vector are obtained by applying those operations to the delta vectors and successively adding and subtracting the results.

In this scheme, however, each code vector is generated as a linear combination of a smaller number of delta vectors that serve as basis vectors, so it has no components other than delta-vector components. In other words, within the space over which the vectors to be encoded are distributed (usually 40 to 64 dimensions), the code vectors can be distributed only in a subspace whose dimension is at most the number of delta vectors (usually 8 to 10).

Consequently, even if the basis vectors (delta vectors) are carefully designed from the statistical distribution of the speech signals to be encoded, a tree-structured delta codebook has poorer quantization characteristics than a conventional codebook without structural constraints.

In Japanese Patent Application No. 3-515016, the present applicant therefore focused on two facts. First, when the linear predictive synthesis filter operation is applied to a code vector in order to evaluate distance, the delta-vector components are not amplified equally but with a certain bias. Second, in a tree-structured delta codebook, the contribution each delta vector makes to the code vectors can be changed by changing the order of the delta vectors. The applicant proposed improving the characteristics by applying the filter operation to each delta vector every time the coefficients of the linear predictive synthesis filter are determined, comparing the resulting powers (vector lengths), reordering the delta vectors in descending order of power, and encoding with the reordered tree-structured delta codebook.

Even with this method, however, the code vectors are still generated from only a limited number of delta vectors, so the achievable improvement is limited and further improvement is desired.

Another challenge for speech coders using A-b-S vector quantization is variable bit rate coding. Variable bit rate coding is a coding scheme in which the bit rate of the code can be changed according to conditions such as the spare capacity of the transmission path and the importance of the sound source, so that efficient coding is achieved overall.

To use vector quantization for variable-rate speech coding, codebooks with a number of patterns corresponding to each transmission rate must be prepared, and the coder must switch among them according to the desired transmission rate.

With a conventional codebook that simply lists code vectors, holding each codebook requires N × M words of memory, the product of the vector dimension N and the number of patterns M. Because M grows as 2 raised to the number of bits in the code-vector index, widening the variable range of the transmission rate or controlling the rate in fine steps requires an enormous amount of memory.
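As a rough illustration of this memory growth, the following Python sketch totals the N × M words needed when a separate conventional codebook is kept for each index length. The function names and the chosen index lengths (6 to 10 bits) are assumptions made for this example, not values from the source.

```python
def codebook_words(dim, index_bits):
    """Words needed by one flat codebook: N * M with M = 2 ** index_bits."""
    return dim * (2 ** index_bits)


def total_words(dim, bit_options):
    """Words needed when one flat codebook is stored per supported index length."""
    return sum(codebook_words(dim, bits) for bits in bit_options)


if __name__ == "__main__":
    # Hypothetical rate set: index lengths of 6 to 10 bits, 40-dimensional vectors.
    print(codebook_words(40, 10))
    print(total_words(40, range(6, 11)))
```

Each additional index bit doubles the size of the largest codebook, so finer rate steps or a wider rate range quickly dominate the memory budget.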

In variable-rate transmission, moreover, the transmission network may require that the rate of the already encoded signal be forcibly reduced. In that case the decoder has no choice but to reproduce the speech signal from coded information in which some of the bits generated by the coder have been dropped (bit drop).

In scalar quantization, which is less efficient than vector quantization, measures against bit drop have long been used, such as dropping bits in order from the least significant bit (LSB), or designing the high-rate quantizer so that it includes the quantization levels of the low-rate quantizer (embedded coding).

In vector quantization with a conventional codebook that simply lists code vectors, however, the codebook itself has no structure, so the bits of the code-vector index do not differ in importance: whether the LSB or the MSB is dropped, a completely different vector is retrieved. The measures used for scalar quantization therefore cannot be applied, and bit drop causes severe degradation of sound quality.

Disclosure of the Invention

A first object of the present invention is therefore to provide a speech coding method and apparatus using a tree-structured delta codebook that improve further on the schemes described above.

A second object of the present invention is to provide a vector-quantization speech coding method and apparatus that do not require an enormous amount of memory for the codebook and that allow measures to be taken against bit drop.

According to the present invention, there is provided a speech coding method for encoding an input speech signal vector by the index assigned to the code vector that, among predetermined code vectors, has the minimum distance from the input speech signal vector, the method comprising the steps of:
a) storing a plurality of differential code vectors;
b) multiplying each of the differential code vectors by the matrix of a linear predictive synthesis filter;
c) evaluating the power amplification ratio of each differential code vector multiplied by the matrix;
d) reordering the differential code vectors multiplied by the matrix in order of the magnitude of the evaluated power amplification ratio;
e) selecting a predetermined number of vectors from the reordered vectors in order of the magnitude of the evaluated power amplification ratio;
f) evaluating the distance between the input speech signal vector and each linear-predictive-synthesis-filtered code vector to be generated by successively adding and subtracting the selected vectors on a tree structure; and
g) determining the code vector for which the evaluated distance is minimum.
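Steps b) to e) can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the filter matrix is taken to be the lower-triangular Toeplitz matrix of a hypothetical impulse response `h` (zero-state filtering), the power amplification ratio is taken to be |AΔ|² / |Δ|², and selection is in descending order of that ratio. The names `synth_filter`, `power`, and `select_deltas` are illustrative and do not come from the source.

```python
def synth_filter(h, c):
    """Zero-state filtering y[n] = sum_k h[k] * c[n-k], i.e. y = A c with A lower-triangular Toeplitz."""
    return [sum(h[k] * c[n - k] for k in range(min(n, len(h) - 1) + 1))
            for n in range(len(c))]


def power(v):
    """Squared length of a vector."""
    return sum(a * a for a in v)


def select_deltas(deltas, h, count):
    """Filter every stored delta vector, rank by power amplification ratio, keep the top `count`.

    Returns (original_index, ratio, filtered_vector) tuples, largest ratio first.
    Delta vectors are assumed to be non-zero.
    """
    scored = []
    for idx, delta in enumerate(deltas):
        filtered = synth_filter(h, delta)                              # step b
        scored.append((idx, power(filtered) / power(delta), filtered))  # step c
    scored.sort(key=lambda item: item[1], reverse=True)                # step d
    return scored[:count]                                              # step e


if __name__ == "__main__":
    h = [1.0, 0.9]  # hypothetical impulse response emphasising low frequencies
    deltas = [[1, -1, 1, -1], [1, 1, 1, 1], [1, 0, 0, 0]]
    for idx, ratio, _ in select_deltas(deltas, h, 2):
        print(idx, round(ratio, 4))
```

The selected, already filtered vectors would then feed the tree-structured search of steps f) and g).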

According to the present invention, there is also provided a speech coding apparatus for encoding an input speech signal vector by the index assigned to the code vector that, among predetermined code vectors, has the minimum distance from the input speech signal vector, the apparatus comprising:
means for storing a plurality of differential code vectors;
means for multiplying each of the differential code vectors by the matrix of a linear predictive synthesis filter;
means for evaluating the power amplification ratio of each differential code vector multiplied by the matrix;
means for reordering the differential code vectors multiplied by the matrix in order of the magnitude of the evaluated power amplification ratio;
means for selecting a predetermined number of vectors from the reordered vectors in order of the magnitude of the evaluated power amplification ratio;
means for evaluating the distance between the input speech signal vector and each linear-predictive-synthesis-filtered code vector to be generated by successively adding and subtracting the selected vectors on a tree structure; and
means for determining the code vector for which the evaluated distance is minimum.

According to the present invention, there is also provided a variable-length speech coding method for variable-length coding an input speech signal vector by the variable-bit-length code assigned to the code vector that, among predetermined code vectors, has the minimum distance from the input speech signal vector, the method comprising the steps of:
a) storing a plurality of differential code vectors;
b) evaluating the distance between the input speech signal vector and each code vector to be generated by successively adding and subtracting, on a tree structure, a number of differential code vectors, taken from the first, that corresponds to the desired code bit length;
c) determining the code vector for which the evaluated distance is minimum; and
d) determining the code, of the desired code bit length, to be assigned to the determined code vector.
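The idea of using only the leading differential code vectors for a shorter code can be sketched in Python. This is an illustrative simplification and not the coding rule of the patent: it assumes that each of the first `bits` delta vectors contributes one sign bit (0 for addition, 1 for subtraction), and it omits the synthesis filter and gain. The name `variable_rate_search` is hypothetical.

```python
from itertools import product


def variable_rate_search(x, deltas, bits):
    """Search the code vectors built from the first `bits` delta vectors only.

    Returns (code, squared_distance), where `code` is a bit string of length `bits`.
    """
    used = deltas[:bits]
    best = None
    for signs in product((1, -1), repeat=len(used)):
        # Code vector = signed sum of the leading delta vectors.
        c = [sum(s * d[n] for s, d in zip(signs, used)) for n in range(len(x))]
        dist = sum((a - b) ** 2 for a, b in zip(x, c))
        if best is None or dist < best[1]:
            best = (signs, dist)
    code = "".join("0" if s == 1 else "1" for s in best[0])
    return code, best[1]


if __name__ == "__main__":
    x = [3, 1]
    deltas = [[2, 0], [0, 2], [1, -1]]  # hypothetical stored delta vectors
    print(variable_rate_search(x, deltas, 2))
    print(variable_rate_search(x, deltas, 3))
```

Under this assumed rule the same stored delta vectors serve every bit length, and dropping trailing bits of a longer code still names a vector of the shorter codebook.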

According to the present invention, there is also provided a variable-length speech coding apparatus for variable-length coding an input speech signal vector by the variable-bit-length code assigned to the code vector that, among predetermined code vectors, has the minimum distance from the input speech signal vector, the apparatus comprising:
means for storing a plurality of differential code vectors;
means for evaluating the distance between the input speech signal vector and each code vector to be generated by successively adding and subtracting, on a tree structure, a number of differential code vectors, taken from the first, that corresponds to the desired code bit length;
means for determining the code vector for which the evaluated distance is minimum; and
means for determining the code, of the desired code bit length, to be assigned to the determined code vector.

Brief Description of the Drawings

FIG. 1 is a block diagram showing the concept of the speech production system;
FIG. 2 is a block diagram showing the principle of general CELP speech coding;
FIG. 3 is a block diagram showing, as prior art, the configuration of noise codebook search processing in A-b-S vector quantization;
FIG. 4 is a block diagram modeling the algorithm of the noise codebook search processing;

FIG. 5 is a block diagram for explaining the principle of the delta codebook;
FIGS. 6A and 6B are diagrams for explaining the adaptation scheme of the tree-structured delta codebook;
FIGS. 7A, 7B, and 7C are diagrams for explaining the present invention;
FIG. 8 is a block diagram of the speech coder of the present invention; and
FIGS. 9A and 9B are diagrams for explaining the variable-rate coding scheme of the present invention.

Best Mode for Carrying Out the Invention

Speech consists of voiced and unvoiced sounds. A voiced sound originates from a pulse source produced by vibration of the vocal cords, and becomes voice once the vocal tract characteristics of the individual speaker's throat and mouth are added. An unvoiced sound is produced without vibrating the vocal cords: a simple Gaussian noise sequence serves as the source and becomes voice by passing through the vocal tract. As shown in FIG. 1, the speech production mechanism can therefore be modeled by a pulse source PSC that is the origin of voiced sounds, a noise source NSC that is the origin of unvoiced sounds, and a linear predictive synthesis filter LPCF that adds the vocal tract characteristics to the signal output from each source. Human voice has pitch periodicity; this periodicity corresponds to the periodicity of the pulses output from the pulse source and differs with the speaker and with what is being said.

It follows that if the period of the pulse source and the noise sequence of the noise source corresponding to the input speech can be identified, the input speech can be encoded by the codes (indices) that identify that pulse period and that noise sequence.

As shown in FIG. 2, vectors P obtained by delaying the past values (bP + gC) by different numbers of samples are stored in adaptive codebook 11. A vector bP, obtained by multiplying a vector P from adaptive codebook 11 by gain b, is input to linear predictive synthesis filter 12 and filtered. The filter result bAP is subtracted from the input speech signal X, and error power evaluator 13 selects the vector P in adaptive codebook 11 for which the power of the resulting error signal is smallest, thereby determining the period.

After this, or simultaneously with it, a plurality of noise sequences (each represented by an N-dimensional code vector) are prepared in advance in noise codebook 1. Error power evaluator 5 determines the code vector C for which the error between the reproduced signal vector gAC, obtained by multiplying C by gain g and applying linear predictive synthesis filter 3, and the input signal vector X (an N-dimensional vector) is smallest. The speech can then be encoded by the data (indices) that specify the period and the code vector. The description above with reference to FIG. 2 applies when vector AC and vector AP have been orthogonalized; otherwise, the code vector is determined so as to minimize the error with respect to the vector X - bAP, obtained by subtracting vector bAP from the input signal vector X.

FIG. 3 is a block diagram of a speech transmission (coding) scheme using vector quantization by the A-b-S method, and corresponds to the lower half of FIG. 2. In detail, 1 is a noise codebook that stores M (the codebook size) N-dimensional code vectors C; 2 is an amplifier with gain g; 3 is a linear predictive synthesis filter whose coefficients are determined by linear predictive analysis of the input signal X and which applies linear predictive filtering to the output of amplifier 2; 4 is an error generator that outputs the error between the reproduced signal vector output from linear predictive synthesis filter 3 and the input signal vector; and 5 is an error power evaluator that evaluates this error and finds the code vector that minimizes it.

Unlike ordinary vector quantization, quantization by the A-b-S method multiplies each code vector (C) in noise codebook 1 by the optimal gain (g) and then filters it with linear predictive synthesis filter 3. Error generator 4 obtains the error signal (E) between the reproduced signal vector (gAC) resulting from this filtering and the input signal vector (X). Error power evaluator 5 searches noise codebook 1 using the power of the error signal as the evaluation function (distance measure), finds the code vector with the smallest error power, and the input signal is encoded and transmitted as the code (index) that identifies that code vector.

The error power is given by

$$|E|^2 = |X - gAC|^2 \tag{1}$$

The optimal code vector C and gain g are those that minimize the error power of equation (1). Because signal power varies with the loudness of the voice, the gain g is optimized so that the power of the reproduced signal matches that of the input signal. The optimal gain is found by partially differentiating equation (1) with respect to g and setting the result to zero. That is, from $\partial |E|^2 / \partial g = 0$,

$$g = \frac{X^T AC}{(AC)^T (AC)} \tag{2}$$

Substituting this g into equation (1) gives

$$|E|^2 = |X|^2 - \frac{(X^T AC)^2}{(AC)^T (AC)} \tag{3}$$

If $R_{XC}$ denotes the cross-correlation between the input signal X and the output AC of linear predictive synthesis filter 3, and $R_{CC}$ denotes the autocorrelation of the output AC of linear predictive synthesis filter 3, then

$$R_{XC} = X^T AC \tag{4}$$

$$R_{CC} = (AC)^T (AC) \tag{5}$$

The code vector C that minimizes the error power of equation (3) is the one that maximizes the second term on the right-hand side of equation (3), so it can be expressed as

$$C = \arg\max_{C} \frac{R_{XC}^2}{R_{CC}} \tag{6}$$

and the optimal gain is given, from equation (2) and the cross-correlation and autocorrelation that satisfy equation (6), by

$$g = \frac{R_{XC}}{R_{CC}} \tag{7}$$
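The search defined by equations (6) and (7) can be sketched in Python as an exhaustive loop over a flat codebook. This is a minimal sketch, assuming that the filter matrix A is the lower-triangular Toeplitz matrix of a hypothetical impulse response `h` (zero-state filtering); the names `synth_filter`, `dot`, and `search_codebook` are illustrative.

```python
def synth_filter(h, c):
    """Zero-state filtering y[n] = sum_k h[k] * c[n-k], i.e. y = A c."""
    return [sum(h[k] * c[n - k] for k in range(min(n, len(h) - 1) + 1))
            for n in range(len(c))]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def search_codebook(x, codebook, h):
    """Return (index, gain) maximising R_XC^2 / R_CC, with gain = R_XC / R_CC."""
    best_index, best_score, best_gain = None, None, None
    for index, c in enumerate(codebook):
        ac = synth_filter(h, c)
        rxc = dot(x, ac)    # cross-correlation, equation (4)
        rcc = dot(ac, ac)   # autocorrelation, equation (5)
        if rcc == 0:
            continue        # an all-zero filtered vector cannot be scaled to match x
        score = rxc * rxc / rcc
        if best_score is None or score > best_score:
            best_index, best_score, best_gain = index, score, rxc / rcc
    return best_index, best_gain


if __name__ == "__main__":
    h = [1.0, 0.5]
    codebook = [[1, 0, 0], [0, 1, 0], [1, 1, 1]]
    x = [2.0 * v for v in synth_filter(h, codebook[1])]  # target built from entry 1, gain 2
    print(search_codebook(x, codebook, h))
```

Every entry costs one filtering pass and two dot products, which is the per-vector cost that the tree-structured delta codebook is designed to avoid.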

FIG. 4 is a block diagram modeling the noise codebook search algorithm that, using the above equations, finds the code vector with the minimum error power and encodes the input signal. It comprises an arithmetic unit 6 that computes the cross-correlation $R_{XC}$ ($= X^T AC$), an arithmetic unit 7 that computes the square of the cross-correlation $R_{XC}$, an arithmetic unit 8 that computes the autocorrelation $R_{CC}$ of AC, an arithmetic unit 9 that computes $R_{XC}^2 / R_{CC}$, and an error power evaluator 5 that determines the code vector for which $R_{XC}^2 / R_{CC}$ is maximum, in other words for which the error power is minimum, and outputs the code identifying that code vector. It is equivalent to FIG. 3.

The main operations in this conventional codebook search are (1) filtering of the code vector C, (2) computation of the cross-correlation $R_{XC}$, and (3) computation of the autocorrelation $R_{CC}$. If the order of LPC filter 3 is $N_P$ and the dimension of the vector quantization (code vector) is N, the computation required for (1) to (3) for one code vector is $N_P \cdot N$, N, and N, respectively. The computation required for the codebook search is therefore $(N_P + 2) \cdot N$ per code vector.

A commonly used noise codebook 1 has about 40 dimensions and a codebook size of about 1024 (N = 40, M = 1024), and the analysis order $N_P$ of LPC filter 3 is about 10, so one codebook search requires (10 + 2) · 40 · 1024, approximately 480 × 10^3 (480K with K = 1024), multiply-accumulate operations.

Performing such a codebook search in every speech-coding subframe (5 msec) requires an enormous processing capacity of 96 Mops (mega-operations per second). Even with the fastest digital signal processors currently available (allowable computation of 20 to 40 Mops), a real-time implementation would take several chips.
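The arithmetic behind these figures can be checked with a short Python sketch. The function names are illustrative. Note that the exact product is 491,520 operations per search, which the text rounds to 480K (with K = 1024), so the exact rate comes out slightly above the quoted 96 Mops.

```python
def search_macs(filter_order, dim, codebook_size):
    """Multiply-accumulate count for one full search: (N_P + 2) * N per code vector."""
    return (filter_order + 2) * dim * codebook_size


def required_mops(macs_per_search, subframe_seconds):
    """Processing rate in mega-operations per second for one search per subframe."""
    return macs_per_search / subframe_seconds / 1e6


if __name__ == "__main__":
    macs = search_macs(filter_order=10, dim=40, codebook_size=1024)
    print(macs)                        # exact count per search
    print(macs / 1024)                 # the same count expressed in K = 1024 units
    print(required_mops(macs, 0.005))  # rate for a 5 ms subframe
```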

Moreover, storing and holding such a noise codebook 1 as a table requires a memory capacity of N · M (= 40 · 1024 = 40K words).

Furthermore, in car telephones and portable telephones, which are considered fields of application for speech coders using A-b-S vector quantization, miniaturization and low power consumption of the equipment are essential, and both the enormous computation and the enormous memory capacity are serious obstacles to realizing a speech coder.

For these reasons, in Japanese Patent Application No. 3-127669 (Japanese Unexamined Patent Publication No. 4-352200), the present applicant proposed using the tree-structured delta codebook shown in FIG. 5 in place of the conventional noise codebook, in order to provide a speech coding scheme that reduces both the computation required for the noise codebook search and the memory capacity required to store the noise codebook.

In FIG. 5, an initial vector $C_0$ ($= \Delta_0$), which is a single reference noise sequence, and delta vectors $\Delta_1$ to $\Delta_{L-1}$ (L = 10), which are (L - 1) kinds (levels) of delta noise sequences, are stored in advance in delta codebook 10. By adding the delta vectors $\Delta_1$ to $\Delta_{L-1}$ to, and subtracting them from, the initial vector $C_0$ level by level according to the tree structure, code vectors (codewords) $C_0$ to $C_{1022}$ for $(2^{10} - 1)$ kinds of noise sequences can be expressed successively on the tree. Alternatively, by adding the vector $-C_0$ (or a zero vector) to these code vectors, code vectors (codewords) $C_0$ to $C_{1023}$ for $2^{10}$ kinds of noise sequences can be expressed.

In this way, simply by storing the initial vector $\Delta_0$ and the (L - 1) kinds of delta vectors $\Delta_1$ to $\Delta_{L-1}$ (L = 10) in delta codebook 10, $2^L - 1$ ($= 2^{10} - 1 = M - 1$) kinds of code vectors, or $2^L$ ($= 2^{10} = M$) kinds of code vectors, can be generated one after another. The storage capacity of delta codebook 10 is then L · N (= 10 · N), far smaller than the storage capacity M · N (= 1024 · N) of a conventional noise codebook.
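The expansion of the stored vectors into the full tree can be sketched in Python. This is a minimal sketch that assumes the indexing rule in which node k at level i - 1 has children 2k + 1 (addition of the level-i delta vector) and 2k + 2 (subtraction); the name `build_tree_codebook` is illustrative.

```python
def build_tree_codebook(deltas):
    """Expand [C0, delta_1, ..., delta_{L-1}] into the 2**L - 1 code vectors of the tree.

    Node k (level i-1) has children C[2k+1] = C[k] + delta_i and C[2k+2] = C[k] - delta_i.
    """
    codebook = [list(deltas[0])]  # C0 is the root
    for i in range(1, len(deltas)):
        # Nodes of level i-1 occupy indices 2**(i-1) - 1 .. 2**i - 2.
        for k in range(2 ** (i - 1) - 1, 2 ** i - 1):
            parent = codebook[k]
            codebook.append([p + d for p, d in zip(parent, deltas[i])])  # C[2k+1]
            codebook.append([p - d for p, d in zip(parent, deltas[i])])  # C[2k+2]
    return codebook


if __name__ == "__main__":
    small = build_tree_codebook([[1, 0], [0, 1], [2, 2]])
    print(small)
    # Ten stored vectors expand to 2**10 - 1 code vectors.
    print(len(build_tree_codebook([[1.0] * 40 for _ in range(10)])))
```

Only the L stored vectors need to be kept in memory; the expanded list is built here purely to make the tree indexing visible.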

With tree-structured delta codebook 10 configured in this way, the cross-correlation $R_{XC}^{(j)}$ and the autocorrelation $R_{CC}^{(j)}$ for code vector $C_j$ (j = 0 to 1022 or 1023) can be expressed by the following recurrence relations. When each code vector is written as

$$C_{2k+1} = C_k + \Delta_i, \quad i = 1, 2, \ldots, L-1 \tag{8}$$

or

$$C_{2k+2} = C_k - \Delta_i, \quad 2^{i-1} - 1 \le k < 2^i - 1 \tag{9}$$

the correlations can be expressed as

$$R_{XC}^{(2k+1)} = R_{XC}^{(k)} + X^T (A\Delta_i) \tag{10}$$

or

$$R_{XC}^{(2k+2)} = R_{XC}^{(k)} - X^T (A\Delta_i) \tag{11}$$

and

$$R_{CC}^{(2k+1)} = R_{CC}^{(k)} + (A\Delta_i)^T (A\Delta_i) + 2 (A\Delta_i)^T (AC_k) \tag{12}$$

or

$$R_{CC}^{(2k+2)} = R_{CC}^{(k)} + (A\Delta_i)^T (A\Delta_i) - 2 (A\Delta_i)^T (AC_k) \tag{13}$$

For the cross-correlation $R_{XC}$, therefore, once the cross-correlations $X^T (A\Delta_i)$ have been computed for the delta vectors $\Delta_i$ (i = 0 to L - 1; $\Delta_0 = C_0$), the cross-correlations $R_{XC}^{(j)}$ for all code vectors $C_j$ are obtained immediately by successively adding or subtracting them according to recurrence relations (10) and (11), that is, according to the tree structure of FIG. 5. With a conventional codebook, computing the cross-correlations for the code vectors of all noise sequences required M · N (= 1024 · N) multiply-accumulate operations. With the tree-structured delta codebook, the cross-correlation $R_{XC}^{(j)}$ is not computed directly from each code vector $C_j$ (j = 0, 1, ..., $2^L - 1$); instead, the cross-correlation with each delta vector $\Delta_j$ (j = 0, 1, ..., L - 1) is computed and the results are successively added or subtracted. Only L · N (= 10 · N) multiply-accumulate operations are needed, which greatly reduces the number of operations.
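Recurrence relations (10) and (11) can be sketched in Python as follows. The sketch assumes the delta vectors have already been filtered (the inputs are the vectors AΔ_i) and uses the tree indexing in which node k has children 2k + 1 and 2k + 2; the names `dot` and `tree_cross_correlations` are illustrative.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def tree_cross_correlations(x, filtered_deltas):
    """R_XC for every tree node, using only L dot products plus additions and subtractions.

    filtered_deltas[i] is A * delta_i, with delta_0 = C0.
    """
    d = [dot(x, fd) for fd in filtered_deltas]  # the only multiply-accumulate work: L * N
    rxc = [d[0]]                                # root node C0
    for i in range(1, len(d)):
        for k in range(2 ** (i - 1) - 1, 2 ** i - 1):
            rxc.append(rxc[k] + d[i])           # equation (10), node 2k+1
            rxc.append(rxc[k] - d[i])           # equation (11), node 2k+2
    return rxc


if __name__ == "__main__":
    print(tree_cross_correlations([1, 2], [[1, 0], [0, 1], [2, 2]]))
```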

As for the cross term $(A\Delta_i)^T(AC_k)$ in the third term of autocorrelation equations (12) and (13), if $C_k$ is written as $C_k = \Delta_0 \pm \Delta_1 \pm \Delta_2 \cdots \pm \Delta_{i-1}$, the cross term can be written as

$$(A\Delta_i)^T(AC_k) = (A\Delta_i)^T(A\Delta_0) \pm (A\Delta_i)^T(A\Delta_1) \pm \cdots \pm (A\Delta_i)^T(A\Delta_{i-1}) \qquad (14)$$

The third term is therefore obtained by computing the cross-correlations $(A\Delta_i)^T(A\Delta_m)$ ($m = 0, 1, 2, \ldots, i-1$) between $\Delta_i$ and $\Delta_0, \Delta_1, \ldots, \Delta_{i-1}$ and sequentially adding or subtracting them according to the tree structure of FIG. 5. Furthermore, by computing the autocorrelation $(A\Delta_i)^T(A\Delta_i)$ of each delta vector $\Delta_i$ for the second term and sequentially adding or subtracting according to equations (12) and (13), that is, according to the tree structure of FIG. 5, the autocorrelations $R_{cc}^{(j)}$ of all code vectors $C_j$ are obtained immediately.

That is, with a conventional codebook, computing the autocorrelations requires $M \cdot N$ ($= 1024 \cdot N$) multiply-accumulate operations. With the tree-structured delta codebook, by contrast, the autocorrelation $R_{cc}^{(j)}$ is not computed directly from each code vector $C_j$ ($j = 0, 1, \ldots, 2^L - 1$). It is computed from the autocorrelation of each delta vector $\Delta_j$ ($j = 0, 1, \ldots, L-1$) and the cross-correlations of all combinations of different delta vectors, so that only $L(L+1) \cdot N/2$ ($= 55 \cdot N$) multiply-accumulate operations are needed and the amount of computation is markedly reduced.
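The recurrences can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation of the specification: the function names (`tree_correlations`, `direct_correlations`, `matvec`, `dot`) are hypothetical, vectors are plain lists, and code vectors are indexed so that $C_{2k+1} = C_k + \Delta_i$ and $C_{2k+2} = C_k - \Delta_i$. Only $L$ dot products with $X$ and $L(L+1)/2$ dot products between filtered delta vectors are needed; everything else is scalar addition and subtraction.

```python
def matvec(A, v):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def tree_correlations(X, A, deltas):
    """Return (rxc, rcc, signs) for the 2**L - 1 tree code vectors C_j."""
    L = len(deltas)
    fd = [matvec(A, d) for d in deltas]          # filtered deltas A*Delta_i
    xd = [dot(X, f) for f in fd]                 # X^T(A Delta_i): L dot products
    # (A Delta_i)^T (A Delta_m) for m <= i: L(L+1)/2 dot products
    gram = [[dot(fd[i], fd[m]) for m in range(i + 1)] for i in range(L)]
    rxc, rcc, signs = [xd[0]], [gram[0][0]], [[1]]   # C_0 = Delta_0
    for i in range(1, L):
        for k in range(2 ** (i - 1) - 1, 2 ** i - 1):
            # Cross term (A Delta_i)^T (A C_k) from delta-to-delta terms only.
            cross = sum(s * gram[i][m] for m, s in enumerate(signs[k]))
            for sgn in (1, -1):                  # C_{2k+1}, then C_{2k+2}
                rxc.append(rxc[k] + sgn * xd[i])
                rcc.append(rcc[k] + gram[i][i] + sgn * 2 * cross)
                signs.append(signs[k] + [sgn])
    return rxc, rcc, signs

def direct_correlations(X, A, deltas, signs):
    """Brute-force reference: build each C_j, filter it, then correlate."""
    out = []
    for s in signs:
        c = [sum(sg * d[n] for sg, d in zip(s, deltas)) for n in range(len(X))]
        ac = matvec(A, c)
        out.append((dot(X, ac), dot(ac, ac)))
    return out
```

`direct_correlations` is included only as a reference for checking the recurrences on small inputs; it performs one filtering and two dot products per code vector, which is the cost the tree structure avoids.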

However, because every codeword (code vector) of such a tree-structured delta codebook is generated as a linear combination of the delta vectors, the code vectors have no components other than those of the delta vectors. In other words, within the space in which the vectors to be coded are distributed (typically 40 to 64 dimensions), the code vectors can be distributed only in a subspace whose dimension is at most the number of delta vectors (typically 8 to 10).

Consequently, even if the basis vectors (delta vectors) are carefully designed on the basis of the statistical distribution of the speech signal to be coded, the tree-structured delta codebook has the problem that its quantization characteristics are degraded compared with a conventional codebook that has no structural constraints.

In the CELP speech coder to which the present invention is applied, vector quantization differs from ordinary vector quantization, as described above: the distance is evaluated, and the optimum vector determined, in the space of signal vectors obtained by passing the code vectors through a linear predictive synthesis filter with transfer function $A(z)$.

Accordingly, as shown in FIGS. 6A and 6B, the linear predictive synthesis filter transforms the space of the residual signal (the sphere of FIG. 6A for $L = 3$) into the space of the reproduced signal. In general, as shown in FIG. 6B, the components along the individual axes are not amplified uniformly; the amplification carries a certain distortion.

That is, the characteristic (A) of the linear predictive synthesis filter amplifies each of the delta vectors that make up the codebook by a different amount, so the resulting vectors are not distributed evenly over the whole space.

Furthermore, in the tree-structured delta codebook shown in FIG. 5, the contribution of each delta vector to the code vectors depends on the position at which it is placed in the delta codebook 10. For example, the delta vector $\Delta_1$ placed second contributes to all code vectors in the second layer and below, the third delta vector $\Delta_2$ contributes to all code vectors in the third layer and below, and $\Delta_9$ contributes only to the code vectors in the tenth layer. In other words, changing the order of the delta vectors changes the contribution of each delta vector to the code vectors.

In view of the above, in Japanese Patent Application No. 3-515016 the present applicant first calculated, for the vector $A\Delta_i$ obtained by applying the filter characteristic (A) to each delta vector $\Delta_i$, the power $|A\Delta_i|^2 = (A\Delta_i)^T(A\Delta_i)$ (that is, the amplification factor: if each delta vector is normalized, the power of $A\Delta_i$ is itself the amplification factor). The powers were compared with one another and the delta vectors were reordered in descending order of power. Encoding with the resulting codebook gave improved characteristics compared with the conventional tree-structured delta codebook, whose distribution is biased because the delta vectors are given in a fixed order.

In this case too, however, the number of delta vectors provided is the same as the number actually used, and encoding is performed with those same delta vectors merely reordered, so the degree of freedom of the codebook remains restricted.

To simplify the discussion, consider the case $L = 2$, that is, a tree-structured delta codebook that generates the code vectors $C_0$, $C_1$ ($= \Delta_0 + \Delta_1$) and $C_2$ ($= \Delta_0 - \Delta_1$) from the vector $C_0$ ($= \Delta_0$) and the delta vector $\Delta_1$. As shown in FIG. 7A, if the vectors used as $\Delta_0$ and $\Delta_1$ are restricted to the unit vectors $e_x$ and $e_y$, the generated code vectors are confined to the hatched x-y plane even if the order is exchanged. If, on the other hand, two vectors are chosen as needed from the three linearly independent unit vectors $e_x$, $e_y$ and $e_z$ and used as $\Delta_0$ and $\Delta_1$, the freedom in selecting the subspace is widened, as shown in FIGS. 7A to 7C.

To obtain a further improvement, the present invention therefore provides the delta vector codebook with more delta vector candidates ($L'$ vectors, $L' > L$) than the number of vectors actually used to construct the codebook ($L$ vectors = the initial vector + $(L-1)$ delta vectors). These candidates are reordered by the same operation as described above, and the desired number ($L$) of delta vectors is then selected, starting from the one with the largest amplitude amplification factor, to construct the codebook. A codebook with a higher degree of freedom is thereby obtained, and the quantization characteristics are improved.

The above concerns the coder. The decoder opposite this coder is provided with the same delta vector candidates as the coder and performs the same control, so that it always generates and uses a codebook with the same contents as that of the coder. Compatibility with the coder is thereby ensured.

FIG. 8 is a block diagram showing an embodiment of the speech coding method according to the present invention based on the above concept. In this embodiment, the delta vector codebook 10 is configured to store and hold an initial vector $C_0$ ($= \Delta_0$) representing one reference noise sequence and $(L'-1)$ delta vectors $\Delta_1$ to $\Delta_{L'-1}$ representing N-dimensional delta noise sequences, which is more than the $(L-1)$ delta vectors actually used. The initial vector $C_0$ and the delta vectors $\Delta_1$ to $\Delta_{L'-1}$ are each expressed in N dimensions. That is, the initial vector and the delta vectors are N-dimensional vectors, each coding the amplitudes of N noise samples generated in time sequence.

In this embodiment, the linear predictive synthesis filter 3 is an IIR filter of order $N_p$. The $N \times N$ square matrix $A$ generated from the impulse response of this filter is multiplied by the delta vector $\Delta_i$, thereby applying the filter processing $A$ to $\Delta_i$ and outputting the vector $A\Delta_i$. The $N_p$ coefficients of the IIR filter vary with the input speech signal and are determined each time by a well-known method. That is, because adjacent samples of the input speech signal are correlated, the correlation coefficients between samples are obtained; from these, the partial autocorrelation coefficients known as PARCOR coefficients are obtained; the alpha coefficients of the IIR filter are determined from the PARCOR coefficients; and the $N \times N$ square matrix $A$ is built from the impulse response sequence of the filter and used to filter the vector $\Delta_i$.
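One common way to build such a matrix from an impulse response is sketched below. The details are assumptions made for illustration and are not stated in the text: the filter is taken to be the all-pole filter $1/(1 - \sum_k \alpha_k z^{-k})$ (the sign convention of the alpha coefficients varies between texts), the filter state is taken as zero at the start of each N-sample vector, and $A$ is the resulting lower-triangular Toeplitz matrix. The function names are hypothetical.

```python
def impulse_response(alpha, n):
    """First n impulse-response samples of 1 / (1 - sum_k alpha[k-1] * z^-k).

    The sign convention for alpha is an assumption of this sketch.
    """
    h = []
    for t in range(n):
        acc = 1.0 if t == 0 else 0.0          # unit impulse at t = 0
        for k, a in enumerate(alpha, start=1):
            if t - k >= 0:
                acc += a * h[t - k]           # recursive (IIR) part
        h.append(acc)
    return h

def filter_matrix(h):
    """Lower-triangular Toeplitz matrix A with A[r][c] = h[r - c] for r >= c.

    Multiplying a vector by A equals filtering it from a zero initial state.
    """
    n = len(h)
    return [[h[r - c] if r >= c else 0.0 for c in range(n)] for r in range(n)]
```

For example, `filter_matrix(impulse_response([0.5], 4))` has first column `[1.0, 0.5, 0.25, 0.125]`, which is the response of a first-order filter to a unit impulse.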

The $L'$ filtered vectors $A\Delta_i$ ($i = 0, 1, \ldots, L'-1$) are held in the storage unit 40, and the power $|A\Delta_i|^2 = (A\Delta_i)^T(A\Delta_i)$ is evaluated in the power evaluation unit 42. Because each delta vector $\Delta_i$ is normalized ($|\Delta_i|^2 = (\Delta_i)^T(\Delta_i) = 1$), evaluating the power directly evaluates the amplification produced by the filter processing $A$. Next, on the basis of the results from the power evaluation unit 42, the sorting unit 43 reorders the vectors in descending order of power. In the example of FIG. 6B, for instance, they are reordered as $\Delta_0 = e_z$, $\Delta_1 = e_x$, $\Delta_2 = e_y$.

There are $L'$ vectors $A\Delta_i$ ($i = 0, 1, \ldots, L'-1$) in total after this reordering, but the subsequent encoding process uses only the $L$ vectors $A\Delta_i$ ($i = 0, 1, \ldots, L-1$) that are actually needed.

The selection storage unit 41 therefore selects and stores only $L$ vectors, starting from the one with the largest amplitude amplification factor. In the above example, $\Delta_0 = e_z$ and $\Delta_1 = e_x$ are selected from the delta vectors. Encoding is then performed on the basis of the tree-structured delta codebook formed by these vectors, in exactly the same manner as with the conventional tree-structured delta codebook described above.
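The evaluate, sort and select sequence can be sketched as follows. This is an illustration only: `select_deltas` and its helpers are hypothetical names, and the diagonal matrix in the usage note is an assumed stand-in for a filter that amplifies the z axis most and the y axis least, chosen so that the outcome matches the $e_z$, $e_x$ example in the text.

```python
def matvec(A, v):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def select_deltas(A, candidates, L):
    """Pick the L candidates with the largest power |A*Delta|^2.

    Returns (chosen, filtered): the candidate indices in descending order
    of power, and the corresponding filtered vectors A*Delta.
    """
    filtered = [matvec(A, d) for d in candidates]      # A*Delta_i for all L' candidates
    power = [dot(f, f) for f in filtered]              # (A Delta_i)^T (A Delta_i)
    order = sorted(range(len(candidates)), key=lambda i: power[i], reverse=True)
    chosen = order[:L]                                 # keep only the top L
    return chosen, [filtered[i] for i in chosen]
```

With `A = [[2, 0, 0], [0, 1, 0], [0, 0, 3]]` and the three unit vectors as candidates, `select_deltas(A, candidates, 2)` picks indices 2 and 0, that is $e_z$ followed by $e_x$. Because the decoder must arrive at the same codebook, it would need to apply the same rule, including the same handling of equal powers (Python's `sorted` keeps the original order of ties).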

Details of the Encoding Unit

The following describes in detail the encoding unit 48, which, from the tree-structured delta codebook consisting of the vectors $A\Delta_0, A\Delta_1, A\Delta_2, \ldots, A\Delta_{L-1}$ stored in the selection storage unit 41 and from the input signal vector $X$, finds the index of the code vector $C$ whose distance from the input signal vector $X$ is smallest.

The encoding unit 48 consists of: a calculation unit 50 that computes the cross-correlation $X^T(A\Delta_i)$ between the input signal vector $X$ and each delta vector $\Delta_i$; a calculation unit 52 that computes the autocorrelation $(A\Delta_i)^T(A\Delta_i)$ of each delta vector $\Delta_i$; a calculation unit 54 that computes the cross-correlations $(A\Delta_i)^T(A\Delta_m)$ ($m = 0, 1, 2, \ldots, i-1$) between delta vectors; a calculation unit 55 that computes the cross term $(A\Delta_i)^T(AC_k)$ from the output of calculation unit 54; a calculation unit 56 that accumulates the cross-correlations of the delta vectors from calculation unit 50 to obtain the cross-correlation $R_{xc}$ between the input signal vector $X$ and the code vector $C$; a calculation unit 58 that accumulates the autocorrelation $(A\Delta_i)^T(A\Delta_i)$ of each delta vector output by calculation unit 52 and the cross term $(A\Delta_i)^T(AC_k)$ output by calculation unit 55 to obtain the autocorrelation $R_{cc}$ of each code vector $C$; a calculation unit 60 that computes $R_{xc}^2/R_{cc}$; a minimum-error noise sequence determination unit 62; and a speech coding unit 64.

First, the parameter $i$ indicating the layer being processed is set to 0. In this state, calculation units 50 and 52 compute and output $X^T(A\Delta_0)$ and $(A\Delta_0)^T(A\Delta_0)$, respectively, and calculation units 54 and 55 output 0. The values $X^T(A\Delta_0)$ and $(A\Delta_0)^T(A\Delta_0)$ output by calculation units 50 and 52 are stored in and output from calculation units 56 and 58 as the first-layer cross-correlation $R_{xc}^{(0)}$ and autocorrelation $R_{cc}^{(0)}$, respectively. Calculation unit 60 computes and outputs the value $F(X, C) = R_{xc}^2/R_{cc}$ from $R_{xc}^{(0)}$ and $R_{cc}^{(0)}$.

The minimum-error noise sequence determination unit 62 compares the computed $F(X, C)$ with the maximum value $F_{max}$ of $F(X, C)$ obtained so far (initial value 0). If $F(X, C) > F_{max}$, it updates $F_{max}$ by setting $F_{max} = F(X, C)$ and replaces the code held so far with the code identifying the noise sequence (code vector) that gives this $F_{max}$.
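The reason the largest $F(X, C)$ corresponds to the smallest distance can be written out briefly. The derivation assumes, as is usual in CELP coders, that the filtered code vector is scaled by a gain $g$ chosen to minimize the error; the gain is not discussed in this passage, so this is a reader's sketch rather than text from the specification.

```latex
\begin{aligned}
|X - g\,AC|^2 &= |X|^2 - 2g\,R_{xc} + g^2 R_{cc},
  \qquad R_{xc} = X^T(AC),\quad R_{cc} = (AC)^T(AC), \\
\frac{\partial}{\partial g}\,|X - g\,AC|^2 = 0
  \;&\Rightarrow\; g = \frac{R_{xc}}{R_{cc}}, \\
\min_g |X - g\,AC|^2 &= |X|^2 - \frac{R_{xc}^2}{R_{cc}}.
\end{aligned}
```

Since $|X|^2$ is the same for every code vector, minimizing the error is equivalent to maximizing $F(X, C) = R_{xc}^2/R_{cc}$.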

Next, the parameter $i$ is updated from 0 to 1. In this state, calculation units 50 and 52 compute and output $X^T(A\Delta_1)$ and $(A\Delta_1)^T(A\Delta_1)$, respectively.

Calculation unit 54 computes and outputs $(A\Delta_1)^T(A\Delta_0)$, and calculation unit 55 outputs that value as the cross term $(A\Delta_1)^T(AC_0)$. From the stored $R_{xc}^{(0)}$ and the value $X^T(A\Delta_1)$ output by calculation unit 50, calculation unit 56 computes, outputs and stores the second-layer cross-correlations $R_{xc}^{(1)}$ and $R_{xc}^{(2)}$ according to equations (10) and (11). From the stored $R_{cc}^{(0)}$ and the values $(A\Delta_1)^T(A\Delta_1)$ and $(A\Delta_1)^T(AC_0)$ output by calculation units 52 and 55, respectively, calculation unit 58 computes, outputs and stores the second-layer autocorrelations $R_{cc}^{(1)}$ and $R_{cc}^{(2)}$ according to equations (12) and (13). Calculation unit 60 and the minimum-error noise sequence determination unit 62 operate as for $i = 0$.

Next, the parameter $i$ is updated from 1 to 2. In this state, calculation units 50 and 52 compute and output $X^T(A\Delta_2)$ and $(A\Delta_2)^T(A\Delta_2)$, respectively.

Calculation unit 54 computes and outputs the cross-correlations $(A\Delta_2)^T(A\Delta_1)$ and $(A\Delta_2)^T(A\Delta_0)$ between $\Delta_2$ and $\Delta_1$, $\Delta_0$. From these values, calculation unit 55 computes and outputs the cross terms $(A\Delta_2)^T(AC_k)$ for the parent vectors $C_1$ and $C_2$ according to equation (14).

From the stored $R_{xc}^{(1)}$ and $R_{xc}^{(2)}$ and the value $X^T(A\Delta_2)$ supplied by calculation unit 50, calculation unit 56 computes, outputs and stores the third-layer cross-correlations $R_{xc}^{(3)}$ to $R_{xc}^{(6)}$ according to equations (10) and (11). From the stored $R_{cc}^{(1)}$ and $R_{cc}^{(2)}$ and the values $(A\Delta_2)^T(A\Delta_2)$ and $(A\Delta_2)^T(AC_k)$ output by calculation units 52 and 55, respectively, calculation unit 58 computes, outputs and stores the third-layer autocorrelations $R_{cc}^{(3)}$ to $R_{cc}^{(6)}$ according to equations (12) and (13). Calculation unit 60 and the minimum-error noise sequence determination unit 62 operate as for $i = 0, 1$.

When the above processing has been repeated and completed up to $i = L-1$, the speech coding unit 64 outputs the latest code held in the minimum-error noise sequence determination unit 62 as the index of the code vector whose distance from the input signal vector $X$ is smallest.
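The decision logic of calculation unit 60 and determination unit 62 can be sketched as a running maximum over the correlation values. The function name `best_index` is hypothetical, and the sketch takes the per-code-vector correlations as ready-made lists rather than producing them layer by layer; the guard against a non-positive $R_{cc}$ is an added safety measure, not something described in the text.

```python
def best_index(rxc, rcc):
    """Return (index, F_max) of the code vector maximizing F = Rxc^2 / Rcc.

    rxc[j] and rcc[j] are the cross-correlation and autocorrelation of
    code vector C_j. F_max starts at 0, as in the description above.
    """
    f_max, best = 0.0, None
    for j, (x, c) in enumerate(zip(rxc, rcc)):
        if c <= 0.0:              # skip degenerate vectors (guard added in this sketch)
            continue
        f = x * x / c             # F(X, C) for this code vector
        if f > f_max:             # keep the running maximum and its code
            f_max, best = f, j
    return best, f_max
```

For example, with `rxc = [1.0, 3.0, -1.0]` and `rcc = [1.0, 2.0, 2.0]` the values of $F$ are 1.0, 4.5 and 0.5, so the function returns index 1.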

Note that the result computed by the power evaluation unit 42 can be used as it is for the calculation of $(A\Delta_i)^T(A\Delta_i)$ in calculation unit 52.

Variable-Rate Coding

Using the tree-structured delta codebook described above, or the tree-structured delta codebook improved by the present invention, makes it possible to realize variable-rate coding that does not need the enormous memory required by conventional codebooks and that can also cope with bit drops.

That is, a tree-structured delta codebook $\Delta_0, \Delta_1, \Delta_2, \ldots$ having the structure shown in FIG. 9A is stored. If, as shown in FIG. 9B, encoding is performed using only the first-layer vector $\Delta_0$ so that two code vectors are generated, namely the zero vector and $C_0 = \Delta_0$, then one-bit coding is achieved, with one bit of information indicating whether or not $C_0$ is selected as the index data.

If encoding is performed using the vectors $\Delta_0$ and $\Delta_1$ up to the second layer so that four code vectors are generated, namely the zero vector, $C_0 = \Delta_0$, $C_1 = \Delta_0 + \Delta_1$ and $C_2 = \Delta_0 - \Delta_1$, then two-bit coding is achieved, with two bits of information specifying, as the index data, whether $C_0$ is selected and whether $+\Delta_1$ or $-\Delta_1$ is applied.

Similarly, using the vectors $\Delta_0, \Delta_1, \ldots, \Delta_{i-1}$ up to the $i$-th layer achieves $i$-bit coding. Therefore, with just one tree-structured delta codebook containing the $L$ delta vectors $\Delta_0, \Delta_1, \ldots, \Delta_{L-1}$, the bit length of the generated index data can be varied freely within the range 1 to $L$.

If variable bit-rate coding of 1 to $L$ bits were performed with conventional codebooks, the number of memory words required, for vectors of dimension $N$, would be $N \times (2^0 + 2^1 + \cdots + 2^L) = N \times (2^{L+1} - 1)$. By contrast, if the tree-structured delta codebook of FIG. 9A is used as shown in FIG. 9B, the number of memory words required is $N \times L$.
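As a worked instance of this comparison, take $N = 40$ and $L = 10$. These values are assumed here purely for the arithmetic ($N = 40$ lies within the 40 to 64 dimensions mentioned earlier, and $L = 10$ matches the ten-layer example); they are not figures quoted for this calculation in the text.

```latex
\begin{aligned}
\text{conventional codebooks:}\quad & N\,(2^{L+1}-1) = 40 \times 2047 = 81\,880 \ \text{words}, \\
\text{tree-structured delta codebook:}\quad & N \cdot L = 40 \times 10 = 400 \ \text{words}.
\end{aligned}
```

Under these assumed values the tree-structured delta codebook needs roughly 1/200 of the memory.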

Any of the following can be used as the tree-structured delta codebook: the tree-structured delta codebook without the reordering described above; the tree-structured delta codebook in which the delta vectors are reordered according to the magnitude of the amplification factor due to $A$; and the tree-structured delta codebook in which $L$ delta vectors are selected from $L'$ candidates and used.

The bit rate is easily made variable by stopping the processing in the encoding unit 48 of FIG. 8 at an intermediate layer according to the desired number of bits. For four-bit coding, for example, it suffices to carry out the processing of the encoding unit 48 described above for $i = 0, 1, 2, 3$.
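The set of code vectors available at a given bit length can be sketched as follows. `code_vectors` is a hypothetical helper that builds the vectors explicitly for illustration (the coder described above works on correlations instead); it assumes the indexing $C_{2k+1} = C_k + \Delta_i$, $C_{2k+2} = C_k - \Delta_i$ and places the zero vector first.

```python
def code_vectors(deltas, bits):
    """Zero vector plus the tree code vectors built from the first `bits` deltas.

    Returns 2**bits vectors: [0, C_0, C_1, ..., C_{2**bits - 2}].
    """
    n = len(deltas[0])
    tree = [list(deltas[0])]                       # C_0 = Delta_0
    for i in range(1, bits):                       # stop at the desired layer
        for k in range(2 ** (i - 1) - 1, 2 ** i - 1):
            tree.append([c + d for c, d in zip(tree[k], deltas[i])])   # C_{2k+1}
            tree.append([c - d for c, d in zip(tree[k], deltas[i])])   # C_{2k+2}
    return [[0] * n] + tree
```

With `deltas = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]`, `code_vectors(deltas, 2)` yields `[[0, 0, 0], [1, 0, 0], [1, 1, 0], [1, -1, 0]]`, and this list is a prefix of `code_vectors(deltas, 3)`. This is why a single stored set of delta vectors serves every bit length.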

Embedded Coding

Embedded coding is a coding system in which the decoder can still reproduce speech even if some bits are forcibly dropped on the transmission path. In variable-rate coding with the tree-structured delta codebook described above, it is achieved by constructing the code system so that, when some bits are dropped, the code is reproduced as the parent or an ancestor code vector in the tree structure. For example, the code system is constructed so that, if one bit is dropped from the four-bit code system [$C_0, C_1, \ldots, C_{14}$], $C_{13}$ and $C_{14}$ are reproduced as $C_6$ of the three-bit system, and $C_{11}$ and $C_{12}$ are reproduced as $C_5$ of the three-bit system. Because code vectors in a parent-child relationship have relatively close values, speech can then be reproduced without serious degradation of sound quality.

Tables 1 to 4 show an example of such a code system.

For four bits, for example, the above code system is determined as in the following examples.

$C_{11} = \Delta_0 - \Delta_1 + \Delta_2 + \Delta_3$ has four delta vector elements whose signs, in order from the most significant, are (+, −, +, +); it is therefore represented as "1011".

$C_2 = \Delta_0 - \Delta_1$ has only two delta vector elements, with signs (+, −) in that order. In this case the signs are regarded as (0, 0, +, −), and the code is represented as "0010".

Table 5 shows what happens when a one-bit drop, from four bits to three bits, occurs in information coded in this way.

As can be seen from Table 5 and FIG. 9A, when one bit is dropped, the code is reproduced as the vector one layer higher, that is, as the parent vector.

When two bits are dropped, the code is reproduced as shown in Table 6.

In this case, the code is reproduced as the ancestor vector two layers higher.
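The bit-drop behaviour can be sketched in Python. The mapping used here is inferred from the two codewords quoted in the text ("1011" and "0010"): each sign is written as 1 for + and 0 for −, most significant first, and the word is padded with zeros on the left. How the rest of the tables is filled in is an assumption of this sketch, and the function names are hypothetical. Code vectors are indexed with $C_{2k+1} = C_k + \Delta_i$ and $C_{2k+2} = C_k - \Delta_i$.

```python
def signs_of(j):
    """Sign pattern (+1/-1 per delta vector, Delta_0 first) of code vector C_j."""
    s = []
    while j > 0:
        s.append(1 if j % 2 == 1 else -1)   # odd index: parent + Delta, even: parent - Delta
        j = (j - 1) // 2                    # move up to the parent
    return [1] + s[::-1]                    # Delta_0 always enters with a plus sign

def codeword(j, bits):
    """Codeword of C_j in a `bits`-bit system: '+' -> 1, '-' -> 0, zero-padded on the left."""
    word = "".join("1" if s > 0 else "0" for s in signs_of(j))
    return word.rjust(bits, "0")

def drop_bits(word, n):
    """Drop the n least significant bits, as a bit drop on the transmission path would."""
    return word[:len(word) - n]

def index_of(word):
    """Inverse of codeword(); returns None for the all-zero word (the zero vector)."""
    if "1" not in word:
        return None
    j = 0
    for b in word[word.index("1") + 1:]:    # bits after the leading '1' of Delta_0
        j = 2 * j + (1 if b == "1" else 2)
    return j
```

Under this assumed mapping, `codeword(11, 4)` is `"1011"` and `codeword(2, 4)` is `"0010"`, matching the two examples quoted above. Dropping one bit from the word for $C_{13}$ or $C_{14}$ decodes to $C_6$, and dropping two bits from the word for $C_{11}$ decodes to $C_2$, its ancestor two layers up.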

Tables 7 to 10 show another example of the embedded code system of the present invention.

In this code system as well, the parent vector is reproduced when one bit is dropped, and the ancestor vector two layers higher is reproduced when two bits are dropped.

Claims

[Claims] 1. A speech coding method for encoding an input speech signal vector using an index assigned to a code vector among predefined code vectors that has the smallest distance from the input speech signal vector, the speech coding method comprising the steps of: a) storing a plurality of differential code vectors; b) multiplying each differential code vector by a linear predictive synthesis filter matrix; c) evaluating the power amplification factor of the differential code vector multiplied by the matrix; d) sorting the differential code vectors multiplied by the matrix in order of the magnitude of the evaluated power amplification factor; e) selecting a predetermined number of vectors from the sorted vectors in order of the magnitude of the evaluated power amplification factor; f) evaluating the distance between the input speech signal vector and a linear predictive synthesis filtered code vector to be generated by sequentially adding and subtracting the selected vectors in a tree structure; and g) determining the code vector with the smallest evaluated distance. 2. The method of claim 1, wherein each of the differential code vectors is normalized. 3. 3. 
The method of claim 1, wherein step f) includes calculating the cross-correlation Rxc between the input speech signal vector and the linearly predictive synthesis filtered code vector by calculating the cross-correlation between each of the selected vectors and the input speech signal vector and sequentially adding and subtracting them in a tree structure; calculating the auto-correlation Rcc of the linearly predictive synthesis filtered code vector by calculating the auto-correlation of each of the selected vectors and the cross-correlation of all combinations of different vectors and sequentially adding and subtracting them in a tree structure; and calculating Rxc2/Rcc, which is the square of the cross-correlation Rxc divided by the auto-correlation Rcc, for each code vector; and wherein step g) includes determining the code vector providing the largest Rxc2/Rcc value as the code vector having the smallest distance from the input speech signal vector. A coding device that encodes an input speech signal vector using an index assigned to a code vector among predefined code vectors that has the smallest distance from the input speech signal vector, the device comprising: means for storing a plurality of differential code vectors; means for multiplying each of the differential code vectors by a linear predictive synthesis filter matrix; means for evaluating the power amplification factor of the differential code vector multiplied by the matrix; means for sorting the differential code vectors multiplied by the matrix in order of the magnitude of the evaluated power amplification factor; means for selecting a predetermined number of vectors from the sorted vectors in order of the magnitude of the evaluated power amplification factor; means for evaluating the distance between the input speech signal vector and a linear predictive synthesis filtered code vector to be generated by sequentially adding and subtracting the selected vectors in a tree structure; and means for 
determining the code vector with the smallest evaluated distance. 5. The device according to claim 4, wherein the names of the differential code vectors are normalized. 6. The apparatus 7 according to claim 4, wherein the distance evaluation means includes means for calculating the cross-correlation Rxc between the input speech signal vector and the linear predictive synthesis filtered code vector by calculating the cross-correlation between each of the selected vectors and the input speech signal vector and sequentially adding and subtracting the cross-correlations in a tree structure; means for calculating the auto-correlation Rcc of the linear predictive synthesis filtered code vector by calculating the auto-correlation of each of the selected vectors and the cross-correlation of all combinations of different vectors and sequentially adding and subtracting the cross-correlations in a tree structure; and means for calculating Rxc2/Rcc, which is the square of the cross-correlation Rxc divided by the auto-correlation Rcc, for each code vector; and the code vector determination means includes means for determining the code vector providing the largest Rxc2/Rcc value as the code vector having the smallest distance from the input speech signal vector. 
A variable-length speech coding method for variable-length coding an input speech signal vector using a code of a variable bit length assigned to a code vector among predefined code vectors that has the smallest distance from the input speech signal vector, the method comprising the steps of: a) storing a plurality of differential code vectors; b) evaluating the distance between the input speech signal vector and a code vector to be generated by sequentially adding and subtracting, in a tree structure, a number of differential code vectors corresponding to a desired code bit length, starting from the first; c) determining the code vector with the smallest evaluated distance; and d) determining a code of the desired code bit length to be assigned to the determined code vector. 8. The method of claim 7, further comprising the step of multiplying each differential code vector by a linear predictive synthesis filter matrix, wherein in step b), the distance between the input speech signal vector and a linear predictive synthesis filtered code vector to be generated by sequentially adding and subtracting, in a tree structure, the differential code vectors multiplied by the matrix. 9. 10. 
The method of claim 8, wherein step b) includes: calculating the cross-correlation Rxc between the input speech signal vector and the linear predictive synthesis filtered code vector by calculating the cross-correlation between each of the differential code vectors multiplied by the matrix and the input speech signal vector, and sequentially adding and subtracting these values in a tree structure; calculating the auto-correlation Rcc of the linear predictive synthesis filtered code vector by calculating the auto-correlation of each of the differential code vectors multiplied by the matrix and the cross-correlation of all combinations of different vectors, and sequentially adding and subtracting these values in a tree structure; and evaluating Rxc2/Rcc, which is the square of the cross-correlation Rxc divided by the auto-correlation Rcc, for each code vector; and wherein step c) includes determining the code vector yielding the largest Rxc2/Rcc value as the code vector having the smallest distance from the input speech signal vector. The method of claim 9 further includes a step of evaluating the power amplification factor of the differential code vector multiplied by the matrix and sorting the differential code vectors multiplied by the matrix in order of the magnitude of the evaluated power amplification factor, wherein in step b), addition and subtraction on the tree structure are performed in accordance with the sorted order. 11. The method of claim 10 further includes a step of selecting a predetermined number of vectors from the sorted vectors in order of the magnitude of the evaluated power amplification factor, wherein in step b), addition and subtraction on the tree structure are performed using the selected vectors. 
12. A variable-length speech coding device that performs variable-length coding on an input speech signal vector using a code of variable bit length assigned to the code vector, among predefined code vectors, that has the smallest distance from the input speech signal vector, the variable-length speech coding device comprising: means for storing a plurality of differential code vectors; means for evaluating the distance between the input speech signal vector and each code vector generated by sequentially adding and subtracting, in a tree structure, a number of the differential code vectors corresponding to a desired code bit length, starting from the first differential code vector; means for determining the code vector with the smallest evaluated distance; and means for determining a code of the desired code bit length to be assigned to the determined code vector.
13. The device of claim 12, further comprising means for multiplying each differential code vector by a linear predictive synthesis filter matrix, wherein the distance evaluation means evaluates the distance between the input speech signal vector and each linear predictive synthesis filtered code vector generated by sequentially adding and subtracting, in a tree structure, the differential code vectors multiplied by the matrix.
14. The device of claim 13, wherein the distance evaluation means includes: means for calculating the cross-correlation between the input speech signal vector and each of the differential code vectors multiplied by the matrix, and sequentially adding and subtracting the results in a tree structure to calculate the cross-correlation Rxc between the input speech signal vector and the linear predictive synthesis filtered code vector; means for calculating the auto-correlation Rcc of the linear predictive synthesis filtered code vector by calculating the auto-correlation of each of the differential code vectors multiplied by the matrix and the cross-correlation of all combinations of different vectors, and sequentially adding and subtracting the results in a tree structure; and means for evaluating Rxc²/Rcc, which is the square of the cross-correlation Rxc divided by the auto-correlation Rcc, for each code vector; and wherein the code vector determination means includes means for determining the code vector yielding the largest Rxc²/Rcc value as the code vector having the smallest distance from the input speech signal vector.
15. The device of claim 14, further comprising means for evaluating the power amplification factor of each differential code vector multiplied by the matrix, and means for sorting the differential code vectors multiplied by the matrix in order of the magnitude of the evaluated power amplification factor, wherein the distance evaluation means performs the tree-structured addition and subtraction in the sorted order.
16. The device of claim 15, further comprising means for selecting a predetermined number of vectors from the sorted vectors in order of the magnitude of the evaluated power amplification factor, wherein the distance evaluation means performs the tree-structured addition and subtraction on the selected vectors.
17. The method of claim 7, wherein each code vector is assigned a code such that, when one bit of the code is dropped, the remaining code corresponds to the code vector that is its parent in the tree structure.
18. The device of claim 12, wherein each code vector is assigned a code such that, when one bit of the code is dropped, the remaining code corresponds to the code vector that is its parent in the tree structure.
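The parent-preserving code assignment in the last two claims can be sketched as follows. This is an illustration under stated assumptions, not the patent's own procedure: the code is taken to be the sequence of add/subtract decisions along the tree path (0 = add, 1 = subtract), the first differential code vector is taken as the root, and the dropped bit is assumed to be the last one. The function names `decode` and `parent_code` are hypothetical.

```python
def decode(bits, deltas):
    """Rebuild a code vector from its path bits (assumed: 0 = add, 1 = subtract)."""
    vec = list(deltas[0])  # assumed root: the first differential code vector
    for level, bit in enumerate(bits, start=1):
        sign = 1.0 if bit == 0 else -1.0
        vec = [v + sign * dv for v, dv in zip(vec, deltas[level])]
    return vec


def parent_code(bits):
    """Drop the last bit; the shorter code addresses the parent node."""
    return bits[:-1]


if __name__ == "__main__":
    deltas = [[1.0, 0.0], [0.5, 0.5], [0.25, -0.25]]
    code = (1, 0)
    print(decode(code, deltas))               # child code vector
    print(decode(parent_code(code), deltas))  # its parent, one bit shorter
```

With such an assignment, a decoder that receives a code shortened by one bit still reconstructs a valid, coarser code vector, which is what makes the bit length variable without a separate codebook per rate.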