CN118355658A - Video coding method and apparatus with intra prediction derived based on reference sample lines - Google Patents
Video coding method and apparatus with intra prediction derived based on reference sample lines Download PDFInfo
- Publication number
- CN118355658A CN118355658A CN202280080888.7A CN202280080888A CN118355658A CN 118355658 A CN118355658 A CN 118355658A CN 202280080888 A CN202280080888 A CN 202280080888A CN 118355658 A CN118355658 A CN 118355658A
- Authority
- CN
- China
- Prior art keywords
- reference sample
- sample line
- mode
- current block
- prediction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
技术领域Technical Field
本公开涉及使用基于参考样本线导出的帧内预测的视频译码(coding)方法和装置。The present disclosure relates to a video coding method and apparatus using intra-frame prediction derived based on reference sample lines.
背景技术Background technique
该部分中的陈述仅提供与本公开相关的背景信息,并且不一定构成现有技术。The statements in this section merely provide background information related to the present disclosure and may not constitute prior art.
因为视频数据与音频或静止图像数据相比具有大量数据,所以在没有用于压缩的处理的情况下,视频数据需要大量硬件资源(包括存储器)来存储或传输视频数据。Since video data has a large amount of data compared to audio or still image data, without a process for compression, the video data requires a large amount of hardware resources (including memory) to store or transmit the video data.
因而,编码器通常用于压缩和存储或传输视频数据。解码器接收压缩视频数据,对接收到的压缩视频数据进行解压缩,播放解压缩后的视频数据。视频压缩技术包括H.264/AVC、高效视频译码(High Efficiency Video Coding,HEVC)和多用途视频译码(VersatileVideo Coding,VVC),其与HEVC相比具有约30%或更大的改进的译码效率。Thus, encoders are generally used to compress and store or transmit video data. Decoders receive compressed video data, decompress the received compressed video data, and play the decompressed video data. Video compression technologies include H.264/AVC, High Efficiency Video Coding (HEVC), and Versatile Video Coding (VVC), which have improved decoding efficiency of about 30% or more compared to HEVC.
然而,由于图像大小、分辨率和帧速率逐渐增加,所以要编码的数据量也增加。因而,需要提供比现有压缩技术更高的译码效率和改善的图像增强效果的新压缩技术。However, as image size, resolution and frame rate gradually increase, the amount of data to be encoded also increases. Thus, a new compression technology is needed that provides higher decoding efficiency and improved image enhancement effect than existing compression technologies.
VVC(通用视频译码(versatile video coding))技术在用于根据帧内预测模式预测当前块(也称为“预测单元”或PU)时,根据诸如MIP(基于矩阵的帧内预测)、DC(直流)、Planar、Angular等的模式生成当前块的预测块。如果MIP是预测模式,则在不将当前块根据帧内子分区(ISP)分割成子块的情况下产生预测块。除非MIP是预测模式,否则可以在将当前块分割成子块的情况下或不将当前块分割成子块的情况下产生预测块。不管ISP的参与,可以针对当前块解析预测模式。可将与当前块相同的预测模式用于当前块内的子块以产生用于每一子块的预测块。VVC (versatile video coding) technology, when used to predict a current block (also called a "prediction unit" or PU) according to an intra-frame prediction mode, generates a prediction block for the current block according to modes such as MIP (matrix-based intra-frame prediction), DC (direct current), Planar, Angular, etc. If MIP is the prediction mode, the prediction block is generated without dividing the current block into sub-blocks according to intra-frame sub-partitioning (ISP). Unless MIP is the prediction mode, the prediction block can be generated with or without dividing the current block into sub-blocks. Regardless of the involvement of the ISP, the prediction mode can be parsed for the current block. The same prediction mode as the current block can be used for the sub-blocks within the current block to generate a prediction block for each sub-block.
为了产生这些预测块,可导出参考样本线。此外,参考样本线可以受到滤波,并且然后滤波的参考样本线可以用于根据每个预测模式产生预测块。导出为参考样本线的可为邻近于当前块的左侧的像素线和邻近于当前块的上的像素线。当使用多参考线(MRL)模式时,被选为参考像素线的可以是靠近当前块的左侧和上的三个像素线中的一个上像素线和一个左像素线。随着数据量增加,需要使用参考样本线的进一步先进方法来改进视频译码效率并增强视频质量。In order to generate these prediction blocks, reference sample lines may be derived. In addition, the reference sample lines may be filtered, and then the filtered reference sample lines may be used to generate prediction blocks according to each prediction mode. The reference sample lines derived may be pixel lines adjacent to the left side of the current block and pixel lines adjacent to the top of the current block. When the multi-reference line (MRL) mode is used, the reference pixel lines selected may be an upper pixel line and a left pixel line among three pixel lines adjacent to the left and top of the current block. As the amount of data increases, further advanced methods using reference sample lines are needed to improve video decoding efficiency and enhance video quality.
发明内容Summary of the invention
【技术问题】【technical problem】
本公开试图提供用于通过使用各种导出方案导出当前块的参考样本线以提高视频译码效率和增强视频质量的视频译码方法和装置。视频译码方法和装置通过使用导出的参考样本线来执行当前块的帧内预测。The present disclosure attempts to provide a video decoding method and apparatus for improving video decoding efficiency and enhancing video quality by deriving a reference sample line of a current block using various derivation schemes. The video decoding method and apparatus perform intra-frame prediction of a current block by using the derived reference sample line.
【技术方案】【Technical solutions】
本公开的至少一个方面提供一种由视频解码装置执行以用于对当前块进行帧内预测的方法。所述方法包括从比特流解码当前块的帧内预测模式及当前块的导出模式索引。这里,导出模式索引指示作为固定位置参考样本线模式、可变位置参考样本线模式和参考样本线列表参考模式中的一个的参考样本线导出模式。方法还包括根据导出模式索引确定参考样本线导出模式。方法还包括根据参考样本线导出模式导出当前块的参考样本线。这里,参考样本线包括左参考样本线和上参考样本线。方法还包括通过使用参考样本线中的参考样本,根据帧内预测模式产生当前块的预测块。At least one aspect of the present disclosure provides a method performed by a video decoding device for intra-frame prediction of a current block. The method includes decoding an intra-frame prediction mode of a current block and a derivation mode index of the current block from a bitstream. Here, the derivation mode index indicates a reference sample line derivation mode that is one of a fixed position reference sample line mode, a variable position reference sample line mode, and a reference sample line list reference mode. The method also includes determining a reference sample line derivation mode according to the derivation mode index. The method also includes deriving a reference sample line of the current block according to the reference sample line derivation mode. Here, the reference sample line includes a left reference sample line and an upper reference sample line. The method also includes generating a prediction block of the current block according to the intra-frame prediction mode by using reference samples in the reference sample line.
本公开的另一方面提供一种由视频编码装置执行以用于对当前块进行帧内预测的方法。所述方法包括确定当前块的帧内预测模式及确定当前块的导出模式索引。这里,导出模式索引指示作为固定位置参考样本线模式、可变位置参考样本线模式和参考样本线列表参考模式中的一个的参考样本线导出模式。方法还包括根据所述导出模式索引确定参考样本线导出模式。方法还包括根据参考样本线导出模式导出当前块的参考样本线。这里,参考样本线包括左参考样本线和上参考样本线。该方法还包括通过使用参考样本线中的参考样本,根据帧内预测模式产生当前块的预测块。Another aspect of the present disclosure provides a method performed by a video encoding device for intra-frame prediction of a current block. The method includes determining an intra-frame prediction mode of the current block and determining a derivation mode index of the current block. Here, the derivation mode index indicates a reference sample line derivation mode that is one of a fixed position reference sample line mode, a variable position reference sample line mode, and a reference sample line list reference mode. The method also includes determining a reference sample line derivation mode according to the derivation mode index. The method also includes deriving a reference sample line of the current block according to the reference sample line derivation mode. Here, the reference sample line includes a left reference sample line and an upper reference sample line. The method also includes generating a prediction block of the current block according to the intra-frame prediction mode by using reference samples in the reference sample line.
本公开的又一方面提供一种计算机可读记录介质,存储由视频编码方法产生的比特流。视频编码方法包括确定当前块的帧内预测模式以及确定当前块的导出模式索引。这里,导出模式索引指示作为固定位置参考样本线模式、可变位置参考样本线模式和参考样本线列表参考模式中的一个的参考样本线导出模式。视频编码方法还包括根据导出模式索引确定参考样本线导出模式。视频编码方法还包括根据参考样本线导出模式导出当前块的参考样本线。这里,参考样本线包括左参考样本线和上参考样本线。视频编码方法还包括通过使用参考样本线中的参考样本,根据帧内预测模式产生当前块的预测块。Another aspect of the present disclosure provides a computer-readable recording medium that stores a bitstream generated by a video encoding method. The video encoding method includes determining an intra-frame prediction mode of a current block and determining a derivation mode index of the current block. Here, the derivation mode index indicates a reference sample line derivation mode that is one of a fixed position reference sample line mode, a variable position reference sample line mode, and a reference sample line list reference mode. The video encoding method also includes determining a reference sample line derivation mode according to the derivation mode index. The video encoding method also includes deriving a reference sample line of the current block according to the reference sample line derivation mode. Here, the reference sample line includes a left reference sample line and an upper reference sample line. The video encoding method also includes generating a prediction block of the current block according to the intra-frame prediction mode by using a reference sample in the reference sample line.
【有益效果】【Beneficial Effects】
如上所述,本公开提供通过使用各种导出方案导出当前块的参考样本线的视译码码方法和装置。视频译码方法和装置通过使用导出的参考样本线来执行当前块的帧内预测。因此,视频译码方法和装置提高了视频译码效率并提高了视频质量。As described above, the present disclosure provides a video decoding method and apparatus for deriving a reference sample line of a current block by using various derivation schemes. The video decoding method and apparatus perform intra-frame prediction of the current block by using the derived reference sample line. Therefore, the video decoding method and apparatus improve video decoding efficiency and improve video quality.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1为可实施本公开的技术的视频编码装置的框图。FIG. 1 is a block diagram of a video encoding device that may implement the techniques of this disclosure.
图2示出了用于使用四叉树加二叉树三叉树(QTBTTT)结构来分割块的方法。FIG. 2 illustrates a method for partitioning a block using a quadtree plus binary tree ternary tree (QTBTTT) structure.
图3a和图3b示出包括广角帧内预测模式的多个帧内预测模式。3a and 3b illustrate a plurality of intra prediction modes including a wide-angle intra prediction mode.
图4示出当前块的杏林块。FIG. 4 shows a block diagram of the current block.
图5是可实施本公开的技术的视频解码装置的框图。5 is a block diagram of a video decoding device that may implement the techniques of this disclosure.
图6是根据本公开的至少一个实施方式的详细帧内预测器的框图。FIG. 6 is a block diagram of a detailed intra predictor in accordance with at least one embodiment of the present disclosure.
图7是示出根据本公开的至少一个实施方式的使用可变位置参考样本线模式或参考样本线列表参考模式的第一条件的示图。FIG. 7 is a diagram illustrating a first condition for using a variable position reference sample line mode or a reference sample line list reference mode according to at least one embodiment of the present disclosure.
图8a至图8d是示出根据本公开的至少一个实施方式的在可变位置参考样本线模式中的参考样本线的导出的示图。8a to 8d are diagrams illustrating derivation of reference sample lines in a variable position reference sample line mode, in accordance with at least one embodiment of the present disclosure.
图9是示出根据本公开的至少一个实施方式的在上参考样本线内的细分参考样本线的示图。FIG. 9 is a diagram illustrating subdivided reference sample lines within an upper reference sample line, in accordance with at least one embodiment of the present disclosure.
图10是根据本公开的至少一个实施方式的视频编码方法的流程图。FIG. 10 is a flowchart of a video encoding method according to at least one embodiment of the present disclosure.
图11是根据本公开的至少一个实施方式的视频解码方法的流程图。FIG. 11 is a flowchart of a video decoding method according to at least one embodiment of the present disclosure.
具体实施方式Detailed ways
在下文中,参照附图详细描述本公开的一些实施方式。在以下描述中,相同的参考标号表示相同的元件,尽管元件在不同的附图中示出。此外,在一些实施方式的以下描述中,为了清楚和简洁的目的,当认为使本公开的主题模糊时,可以省略相关已知组件和功能的详细描述。Hereinafter, some embodiments of the present disclosure are described in detail with reference to the accompanying drawings. In the following description, the same reference numerals represent the same elements, although the elements are shown in different drawings. In addition, in the following description of some embodiments, for the purpose of clarity and brevity, detailed descriptions of related known components and functions may be omitted when it is considered that the subject matter of the present disclosure is obscure.
图1是可以实现本公开的技术的视频编码装置的框图。在下文中,参照图1的示出,描述视频编码装置和该装置的部件。FIG1 is a block diagram of a video encoding device that can implement the technology of the present disclosure. In the following, with reference to the illustration of FIG1 , a video encoding device and components of the device are described.
编码装置可包括图片分裂器(picture splitter)110、预测器120、减法器130、变换器140、量化器145、重排单元(rearrangement unit)150、熵编码器155、逆量化器160、逆变换器165、加法器170、环路滤波器单元180和存储器190。The encoding device may include a picture splitter 110, a predictor 120, a subtractor 130, a transformer 140, a quantizer 145, a rearrangement unit 150, an entropy encoder 155, an inverse quantizer 160, an inverse transformer 165, an adder 170, a loop filter unit 180 and a memory 190.
编码装置的每个组件可以实现为硬件或软件或者实现为硬件和软件的组合。此外,每个组件的功能可以实现为软件,并且还可以实现微处理器以执行与每个组件相对应的软件的功能。Each component of the encoding device can be implemented as hardware or software or as a combination of hardware and software. In addition, the function of each component can be implemented as software, and a microprocessor can also be implemented to execute the function of the software corresponding to each component.
一个视频由包括多个图片的一个或多个序列构成。每个图片被分裂为多个区域,并对每个区域执行编码。例如,将一个图片分裂成一个或一个以上片(tile)或/和切片(slice)。在此,一个或多个片可以被定义为片组(tile group)。每一片或/和切片分裂成一或多个译码树单元(CTU)。另外,每一CTU通过树结构分裂成一或多个译码单元(CU)。将应用于每一个译码单元(CU)的信息编码为CU的语法,且将共同应用于包括在一个CTU中的CU的信息编码为CTU的语法。此外,共同应用于一个切片中的所有块的信息被编码为切片报头的语法,并且应用于构成一个或多个图片的所有块的信息被编码到图片参数集(PPS)或图片报头。此外,将多个图片共同参考的信息编码到序列参数集(SPS)。另外,一或多个SPS共同参考的信息被编码到视频参数集(VPS)。进一步,共同应用于一个片或切片组的信息也可编码为片或切瓦片组报头的语法。包括在SPS、PPS、切片报头、片或切片组报头中的语法可以被称为高级语法。A video is composed of one or more sequences including multiple pictures. Each picture is split into multiple regions, and encoding is performed on each region. For example, a picture is split into one or more tiles or/and slices. Here, one or more slices can be defined as a tile group. Each tile or/and slice is split into one or more coding tree units (CTUs). In addition, each CTU is split into one or more coding units (CUs) through a tree structure. The information applied to each coding unit (CU) is encoded as the syntax of the CU, and the information commonly applied to the CU included in a CTU is encoded as the syntax of the CTU. In addition, the information commonly applied to all blocks in a slice is encoded as the syntax of the slice header, and the information applied to all blocks constituting one or more pictures is encoded into a picture parameter set (PPS) or a picture header. In addition, the information commonly referenced by multiple pictures is encoded into a sequence parameter set (SPS). In addition, the information commonly referenced by one or more SPSs is encoded into a video parameter set (VPS). Further, the information commonly applied to a slice or slice group can also be encoded as the syntax of a slice or tile group header. The syntax included in an SPS, a PPS, a slice header, a slice, or a slice group header may be referred to as a high-level syntax.
图片分裂器110确定译码树单元(CTU)的大小。关于CTU的大小(CTU大小)的信息被编码为SPS或PPS的语法并且被传送到视频解码装置。The picture splitter 110 determines the size of a coding tree unit (CTU). Information on the size of the CTU (CTU size) is encoded as a syntax of an SPS or a PPS and transmitted to a video decoding device.
图片分裂器110将构成视频的每个图片分裂成具有预定大小的多个译码树单元(CTU),然后通过使用树结构递归分裂CTU。树结构中的叶节点变成译码单元(CU),其是编码的基本单元。The picture splitter 110 splits each picture constituting a video into a plurality of coding tree units (CTUs) having a predetermined size, and then recursively splits the CTUs by using a tree structure. A leaf node in the tree structure becomes a coding unit (CU), which is a basic unit of encoding.
树结构可以是四叉树(QT),其中,较高节点(或父节点)被分成具有相同大小的四个较低节点(或子节点)。树结构也可以是二叉树(BT),其中较高节点被分成两个较低节点。树结构还可以是三叉树(TT),其中,较高节点以1:2:1的比率被分裂为三个较低节点。树结构还可以是QT结构、BT结构和TT结构中的两个或更多个结构被混合的结构。例如,可以使用四叉树加二叉树(QTBT)结构或者可以使用四叉树加二叉树三叉树(QTBTTT)结构。这里,将二叉树三叉树(BTTT)添加到树结构以被称为多类型树(MTT)。The tree structure can be a quadtree (QT), in which a higher node (or parent node) is divided into four lower nodes (or child nodes) of the same size. The tree structure can also be a binary tree (BT), in which a higher node is divided into two lower nodes. The tree structure can also be a ternary tree (TT), in which a higher node is split into three lower nodes at a ratio of 1:2:1. The tree structure can also be a structure in which two or more of the QT structure, BT structure, and TT structure are mixed. For example, a quadtree plus binary tree (QTBT) structure can be used or a quadtree plus binary tree ternary tree (QTBTTT) structure can be used. Here, a binary tree ternary tree (BTTT) is added to the tree structure to be referred to as a multi-type tree (MTT).
图2是用于描述通过使用QTBTTT结构分裂块的方法的示图。FIG. 2 is a diagram for describing a method of splitting a block by using a QTBTTT structure.
如图2中所示,CTU可以首先被分成QT结构。四叉树分裂可以是递归的,直到分裂块的大小达到在QT中允许的叶节点的最小块大小(MinQTSize)。指示QT结构的每个节点是否被分裂为下层的四个节点的第一标记(QT_split_flag)由熵编码器155编码,并被信号通知给视频解码装置。当QT的叶节点不大于在BT中允许的根节点的最大块大小(MaxBTSize)时,叶节点可进一步被分裂为BT结构和TT结构中的至少一个。在BT结构和/或TT结构中可存在多个分裂方向。例如,可存在两个方向,即,相应节点的块被水平分裂的方向和相应节点的块被垂直分裂的方向。如图2所示,当MTT分裂开始时,指示节点是否被分裂的第二标记(mtt_split_flag)以及另外指示分裂方向(垂直或水平)的标记和/或如果节点被分裂则指示分裂类型(二叉或三叉)的标记由熵编码器155编码并用信号通知给视频解码装置。As shown in Figure 2, the CTU can first be divided into a QT structure. The quadtree splitting can be recursive until the size of the split block reaches the minimum block size (MinQTSize) of the leaf node allowed in QT. The first flag (QT_split_flag) indicating whether each node of the QT structure is split into four nodes of the lower layer is encoded by the entropy encoder 155 and is notified to the video decoding device by a signal. When the leaf node of QT is not larger than the maximum block size (MaxBTSize) of the root node allowed in BT, the leaf node can be further split into at least one of the BT structure and the TT structure. There may be multiple splitting directions in the BT structure and/or the TT structure. For example, there may be two directions, namely, the direction in which the block of the corresponding node is split horizontally and the direction in which the block of the corresponding node is split vertically. As shown in Figure 2, when the MTT splitting starts, a second flag (mtt_split_flag) indicating whether the node is split, as well as a flag indicating the splitting direction (vertical or horizontal) and/or a flag indicating the splitting type (binary or trifurcated) if the node is split, is encoded by the entropy encoder 155 and notified to the video decoding device by a signal.
可选择地,在编码指示每个节点是否被分裂为下层的四个节点的第一标记(QT_split_flag)之前,指示节点是否被分裂的CU分裂标记(split_cu_flag)也可被编码。当CU分裂标记(split_cu_flag)的值指示每一节点未分裂时,对应节点的块变为分裂树结构中的叶节点且变为作为编码的基本单元的CU。当CU分裂标记(split_cu_flag)的值指示每个节点被分裂时,视频编码装置首先通过上述方案开始编码第一标记。Alternatively, before encoding the first flag (QT_split_flag) indicating whether each node is split into four nodes of the lower layer, a CU split flag (split_cu_flag) indicating whether the node is split may also be encoded. When the value of the CU split flag (split_cu_flag) indicates that each node is not split, the block of the corresponding node becomes a leaf node in the split tree structure and becomes a CU as a basic unit of encoding. When the value of the CU split flag (split_cu_flag) indicates that each node is split, the video encoding device first starts encoding the first flag through the above scheme.
当QTBT被用作树结构的另一实施例时,可存在两种类型,即,相应节点的块被水平分裂为具有相同大小的两个块的类型(即,对称水平分裂)和相应节点的块被垂直分裂为具有相同大小的两个块的类型(即,对称垂直分裂)。指示BT结构的每个节点是否被分裂为下层的块的分裂标记(split_flag)和指示分裂类型的分裂类型信息由熵编码器155编码,并被传送到视频解码装置。同时,可以另外存在将对应节点的块分裂为彼此不对称的两个块的类型。非对称形式可包括其中对应节点的块被分裂为具有1:3的大小比的两个矩形块的形式,或者还可包括其中对应节点的块被沿对角线方向分裂的形式。When QTBT is used as another embodiment of a tree structure, there may be two types, namely, a type in which the block of the corresponding node is horizontally split into two blocks of the same size (i.e., symmetrical horizontal splitting) and a type in which the block of the corresponding node is vertically split into two blocks of the same size (i.e., symmetrical vertical splitting). A split flag (split_flag) indicating whether each node of the BT structure is split into blocks of the lower layer and split type information indicating the split type are encoded by the entropy encoder 155 and transmitted to the video decoding device. At the same time, there may be another type in which the block of the corresponding node is split into two blocks that are asymmetric to each other. The asymmetric form may include a form in which the block of the corresponding node is split into two rectangular blocks with a size ratio of 1:3, or may also include a form in which the block of the corresponding node is split in a diagonal direction.
CU可根据来自CTU的QTBT或QTBTTT分裂而具有不同大小。在下文中,对应于待编码或解码的CU(即,QTBTTT的叶节点)的块被称为“当前块”。由于采用QTBTTT分裂,当前块的形状除了正方形形状外,还可以是矩形形状。A CU may have different sizes depending on the QTBT or QTBTTT split from a CTU. Hereinafter, a block corresponding to a CU to be encoded or decoded (i.e., a leaf node of a QTBTTT) is referred to as a "current block". Due to the adoption of QTBTTT splitting, the shape of the current block may be a rectangular shape in addition to a square shape.
预测器120预测当前块以生成预测块。预测器120包括帧内预测器(intrapredictor)122和帧间预测器(inter predictor)124。The predictor 120 predicts the current block to generate a predicted block. The predictor 120 includes an intra predictor 122 and an inter predictor 124.
一般来说,图片中的当前块中的每一个可被预测性译码。通常,当前块的预测可以通过使用帧内预测技术(使用来自包括当前块的图片的数据)或者帧间预测技术(使用来自在包括当前块的图片之前被编译的图片的数据)来执行。帧间预测包括单向预测和双向预测两者。In general, each of the current blocks in a picture may be predictively coded. Typically, the prediction of the current block may be performed using intra-prediction techniques (using data from the picture including the current block) or inter-prediction techniques (using data from a picture coded before the picture including the current block). Inter-prediction includes both unidirectional prediction and bidirectional prediction.
帧内预测器122通过使用在包括当前块的当前图片中位于当前块的相邻块上的像素(参考像素)来预测当前块中的像素。根据预测方向,存在多个帧内预测模式。例如,如图3a所示,多个帧内预测模式可以包括包括平面模式和DC模式的2个非定向模式,并且可以包括65个定向模式。相邻像素和将被使用的算术等式根据每个预测模式被不同地定义。The intra-frame predictor 122 predicts the pixels in the current block by using the pixels (reference pixels) located on the neighboring blocks of the current block in the current picture including the current block. According to the prediction direction, there are multiple intra-frame prediction modes. For example, as shown in FIG. 3a, the multiple intra-frame prediction modes may include 2 non-directional modes including a plane mode and a DC mode, and may include 65 directional modes. The neighboring pixels and the arithmetic equations to be used are defined differently according to each prediction mode.
为了对具有矩形形状的当前块进行有效的定向预测,可以另外使用如图3b中的虚线箭头所示的定向模式(#67至#80,帧内预测模式#-1至#-14)。定向模式可被称作“广角帧内预测模式”。在图3b中,箭头指示用于预测的对应参考样本并且不表示预测方向。预测方向与箭头所示的方向相反。在当前块具有矩形形状时,广角帧内预测模式是在没有附加比特传输的情况下以与特定定向模式相反的方向执行预测的模式。在这种情况下,在广角帧内预测模式中,可通过具有矩形形状的当前块的宽度和高度的比率确定可用于当前块的一些广角帧内预测模式。例如,在当前块具有高度小于宽度的矩形形状时,具有小于45度的角度的宽角度帧内预测模式(帧内预测模式#67至#80)是可用的。在当前块具有宽度大于高度的矩形形状时,可使用具有大于-135度的角度的广角帧内预测模式。In order to perform effective directional prediction for the current block having a rectangular shape, a directional mode (#67 to #80, intra prediction mode #-1 to #-14) as shown by the dotted arrows in FIG. 3b can be used in addition. The directional mode may be referred to as a "wide-angle intra prediction mode". In FIG. 3b, the arrow indicates the corresponding reference sample used for prediction and does not indicate the prediction direction. The prediction direction is opposite to the direction shown by the arrow. When the current block has a rectangular shape, the wide-angle intra prediction mode is a mode in which prediction is performed in the opposite direction to the specific directional mode without additional bit transmission. In this case, in the wide-angle intra prediction mode, some wide-angle intra prediction modes that can be used for the current block can be determined by the ratio of the width and height of the current block having a rectangular shape. For example, when the current block has a rectangular shape with a height less than the width, a wide-angle intra prediction mode (intra prediction mode #67 to #80) with an angle less than 45 degrees is available. When the current block has a rectangular shape with a width greater than the height, a wide-angle intra prediction mode with an angle greater than -135 degrees can be used.
帧内预测器122可以确定将用于对当前块进行编码的帧内预测。在一些实施例中,帧内预测器122可通过使用多个帧内预测模式对当前块进行编码,且还可从测试模式中选择将使用的适当帧内预测模式。例如,帧内预测器122可通过使用针对多个测试的帧内预测模式的速率-失真分析来计算速率-失真值,并且还可在测试模式中选择具有最佳速率-失真特征的帧内预测模式。The intra-frame predictor 122 may determine an intra-frame prediction to be used for encoding the current block. In some embodiments, the intra-frame predictor 122 may encode the current block by using a plurality of intra-frame prediction modes, and may further select an appropriate intra-frame prediction mode to be used from a test mode. For example, the intra-frame predictor 122 may calculate a rate-distortion value by using a rate-distortion analysis for a plurality of tested intra-frame prediction modes, and may further select an intra-frame prediction mode having the best rate-distortion characteristics in the test mode.
帧内预测器122在多个帧内预测模式中选择一个帧内预测模式,并通过使用相邻像素(参考像素)和根据选择的帧内预测模式确定的算术等式来预测当前块体。关于所选择的帧内预测模式的信息由熵编码器155编码,并被传送到视频解码装置。The intra-frame predictor 122 selects an intra-frame prediction mode among a plurality of intra-frame prediction modes and predicts the current block by using adjacent pixels (reference pixels) and an arithmetic equation determined according to the selected intra-frame prediction mode. Information about the selected intra-frame prediction mode is encoded by the entropy encoder 155 and transmitted to the video decoding device.
帧间预测器124通过使用运动补偿处理来产生用于当前块的预测块。帧间预测器124在比当前图片更早地编码和解码的参考图片中搜索与当前块最相似的块,并且通过使用搜索到的块产生用于当前块的预测块。另外,产生运动矢量(MV),其对应于在当前图片中的当前块和在参考图片中的预测块之间的位移。通常,对亮度分量执行运动估计,并且基于亮度分量计算的运动矢量被用于亮度分量和色度分量两者。包括关于参考图片的信息和关于用于预测当前块的运动矢量的信息的运动信息由熵编码器155编码,并被传送到视频解码装置。The inter-frame predictor 124 generates a prediction block for the current block by using a motion compensation process. The inter-frame predictor 124 searches for a block most similar to the current block in a reference picture that is encoded and decoded earlier than the current picture, and generates a prediction block for the current block by using the searched block. In addition, a motion vector (MV) is generated, which corresponds to the displacement between the current block in the current picture and the prediction block in the reference picture. Generally, motion estimation is performed on the luminance component, and the motion vector calculated based on the luminance component is used for both the luminance component and the chrominance component. Motion information including information about the reference picture and information about the motion vector for predicting the current block is encoded by the entropy encoder 155 and transmitted to the video decoding device.
帧间预测器124还可对参考图片或参考块执行内插以便增加预测的准确度。换句话说,两个连续整数样本之间的子样本通过将滤波器系数应用于包括两个整数样本的多个连续整数样本而被内插。当针对内插的参考图片执行搜索与当前块最相似的块的处理时,可以针对运动矢量表示不是整数样本单元精度,而是十进制单元精度。可针对待编码的每一目标区域(例如,例如切片、片、CTU、CU等单元)不同地设定运动矢量的精度或分辨率。当应用这样的自适应运动矢量分辨率(AMVR)时,应针对每一目标区域用信号通知关于待应用于每一目标区域的运动矢量分辨率的信息。例如,当目标区域是CU时,用信号通知关于应用于每个CU的运动矢量分辨率的信息。关于运动矢量分辨率的信息可以是表示将在下面描述的运动矢量差的精度的信息。The inter-frame predictor 124 may also perform interpolation on a reference picture or a reference block in order to increase the accuracy of the prediction. In other words, the subsamples between two consecutive integer samples are interpolated by applying the filter coefficients to a plurality of consecutive integer samples including two integer samples. When the process of searching for the block most similar to the current block is performed for the interpolated reference picture, the decimal unit precision may be used instead of the integer sample unit precision for the motion vector representation. The precision or resolution of the motion vector may be set differently for each target region to be encoded (e.g., units such as slices, slices, CTUs, CUs, etc.). When such an adaptive motion vector resolution (AMVR) is applied, information about the motion vector resolution to be applied to each target region should be signaled for each target region. For example, when the target region is a CU, information about the motion vector resolution applied to each CU is signaled. The information about the motion vector resolution may be information representing the precision of the motion vector difference to be described below.
同时,帧间预测器124可通过使用双向预测来执行帧间预测。在双向预测的情况下,使用两个参考图片和表示在每个参考图片中最类似于当前块的块位置的两个运动矢量。帧间预测器124分别从参考图片列表0(RefPicList0)和参考图片列表1(RefPicList1)选择第一参考图片和第二参考图片。帧间预测器124还在相应参考图片中搜索与当前块最相似的块以产生第一参考块和第二参考块。另外,通过对第一参考块和第二参考块进行平均或加权平均来产生当前块的预测块。另外,包括关于用于预测当前块的两个参考图片的信息和包括关于两个运动矢量的信息的运动信息被传送到熵编码器155。在这里,参考图片列表0可以由在预重构的图片中按照显示顺序在当前图片之前的图片组成,而参考图片列表1可以由在预重构的图片中按照显示顺序在当前图片之后的图片组成。然而,虽然不特别限于此,但是按照显示顺序在当前图片之后的预重构的图片可以附加地包括在参考图片列表0中。相反地,在当前图片之前的预重构的图片也可以被附加地包括在参考图片列表1中。Meanwhile, the inter-frame predictor 124 may perform inter-frame prediction by using bidirectional prediction. In the case of bidirectional prediction, two reference pictures and two motion vectors representing the block position most similar to the current block in each reference picture are used. The inter-frame predictor 124 selects a first reference picture and a second reference picture from a reference picture list 0 (RefPicList0) and a reference picture list 1 (RefPicList1), respectively. The inter-frame predictor 124 also searches for a block most similar to the current block in the corresponding reference picture to generate a first reference block and a second reference block. In addition, a prediction block of the current block is generated by averaging or weighted averaging the first reference block and the second reference block. In addition, motion information including information about two reference pictures used to predict the current block and information about two motion vectors is transmitted to the entropy encoder 155. Here, the reference picture list 0 may be composed of pictures that are before the current picture in the display order in the pre-reconstructed picture, and the reference picture list 1 may be composed of pictures that are after the current picture in the display order in the pre-reconstructed picture. However, although not particularly limited thereto, a pre-reconstructed picture after the current picture in the display order may be additionally included in the reference picture list 0. Conversely, a pre-reconstructed picture before the current picture may also be additionally included in the reference picture list 1.
为了最小化用于编码运动信息所消耗的比特数量,可以使用各种方法。In order to minimize the number of bits consumed for encoding motion information, various methods may be used.
例如,在当前块的参考图片和运动矢量与相邻块的参考图片和运动矢量相同时,能够识别相邻块的信息被编码以将当前块的运动信息传送到视频解码装置。这种方法被称为合并模式。For example, when the reference picture and motion vector of the current block are the same as those of the adjacent block, information capable of identifying the adjacent block is encoded to transmit the motion information of the current block to the video decoding device. This method is called merge mode.
在合并模式中,帧间预测器124从当前块的相邻块中选择预定数量的合并候选块(在下文中,称为“合并候选”)。In the merge mode, the inter predictor 124 selects a predetermined number of merge candidate blocks (hereinafter, referred to as “merge candidates”) from neighboring blocks of the current block.
如在图4中示出的,可以使用在当前图片中邻近于当前块的左块A0、左下块A1、上块B0、右上块B1、以及左上块B2中的全部或者一些作为用于导出合并候选的相邻块。进一步,定位在参考图片(可以与用于预测当前块的参考图片相同或不同)而不是当前块定位在其处的当前图片内的块也可以用作合并候选。例如,在参考图片内与当前块协同定位的块或邻近于该协同定位的块的块可另外用作合并候选。如果通过上文所描述的方法选择的合并候选的数目小于预设数目,那么将零向量添加到合并候选。As shown in FIG. 4 , all or some of the left block A0, the lower left block A1, the upper block B0, the upper right block B1, and the upper left block B2 adjacent to the current block in the current picture may be used as neighboring blocks for deriving merge candidates. Further, blocks located in a reference picture (which may be the same as or different from the reference picture used to predict the current block) other than the current block within the current picture where the current block is located may also be used as merge candidates. For example, a block co-located with the current block within the reference picture or a block adjacent to the co-located block may be additionally used as a merge candidate. If the number of merge candidates selected by the method described above is less than a preset number, a zero vector is added to the merge candidate.
帧间预测器124通过使用相邻块配置包括预定数目的合并候选的合并列表。从包括在合并列表中的合并候选中选择要用作当前块的运动信息的合并候选,并且生成用于识别所选择的候选的合并索引信息。生成的合并索引信息由熵编码器155编码,并被传送到视频解码装置。The inter-frame predictor 124 configures a merge list including a predetermined number of merge candidates by using adjacent blocks. A merge candidate to be used as motion information of the current block is selected from the merge candidates included in the merge list, and merge index information for identifying the selected candidate is generated. The generated merge index information is encoded by the entropy encoder 155 and transmitted to the video decoding device.
合并跳跃模式(merge skip mode)是合并模式的特殊情况。在量化之后,当用于熵编码的所有变换系数接近于零时,仅传输相邻块选择信息而不传输残差信号。通过使用合并跳跃模式,对于具有轻微运动的图像、静止图像、屏幕内容图像等等可以实现相对高的编码效率。Merge skip mode is a special case of merge mode. After quantization, when all transform coefficients for entropy coding are close to zero, only the neighbor block selection information is transmitted without the residual signal. By using merge skip mode, relatively high coding efficiency can be achieved for images with slight motion, still images, screen content images, etc.
在下文中,合并模式和合并跳跃模式被统称为合并/跳跃模式。Hereinafter, the merge mode and the merge skip mode are collectively referred to as the merge/skip mode.
用于编码运动信息的另一方法是高级运动矢量预测(AMVP)模式。Another method for encoding motion information is the Advanced Motion Vector Prediction (AMVP) mode.
在AMVP模式中,帧间预测器124通过使用当前块的相邻块导出用于当前块的运动矢量的运动矢量预测器候选。作为用于导出运动矢量预测器候选的相邻块,可以使用与在图4中示出的当前图片中邻近于当前块的左块A0、左下块A1、上块B0、右上块B1、以及左上块B2中的全部或一些。此外,定位在参考图片(可以与用于预测当前块的参考图片相同或不同)而不是当前块定位在其处的当前图片内的块也可以用作用于导出运动矢量预测器候选的相邻块。例如,可以使用参考图片内与当前块协同定位的块或者邻近于该协同定位的块的块。如果通过上述方法选择的运动矢量预测器候选的数目小于预设数目,则零矢量被添加到运动矢量预测器候选。In the AMVP mode, the inter-frame predictor 124 derives a motion vector predictor candidate for a motion vector of the current block by using the neighboring blocks of the current block. As the neighboring blocks for deriving the motion vector predictor candidates, all or some of the left block A0, the lower left block A1, the upper block B0, the upper right block B1, and the upper left block B2 adjacent to the current block in the current picture shown in FIG. 4 can be used. In addition, a block located in a reference picture (which may be the same as or different from the reference picture used to predict the current block) other than the current block located in the current picture can also be used as a neighboring block for deriving the motion vector predictor candidate. For example, a block co-located with the current block in the reference picture or a block adjacent to the co-located block can be used. If the number of motion vector predictor candidates selected by the above method is less than the preset number, a zero vector is added to the motion vector predictor candidate.
帧间预测器124通过使用相邻块的运动矢量导出运动矢量预测器候选,并通过使用运动矢量预测器候选确定用于当前块的运动矢量的运动矢量预测器。另外,通过从当前块的运动矢量减去运动矢量预测值来计算运动矢量差。The inter predictor 124 derives a motion vector predictor candidate by using the motion vector of the neighboring block, and determines a motion vector predictor for the motion vector of the current block by using the motion vector predictor candidate. In addition, a motion vector difference is calculated by subtracting the motion vector predictor from the motion vector of the current block.
运动矢量预测器可通过将预定义的函数(例如,中心值和平均值计算等)应用于运动矢量预测器候选来获得。在这种情况下,视频解码装置还知道预定义的函数。此外,由于用于导出运动矢量预测器候选的相邻块是编码和解码已经完成的块,所以视频解码装置还可能已经知道相邻块的运动矢量。因此,视频编码装置不需要对用于识别运动矢量预测器候选的信息进行编码。因而,在这种情况下,关于运动矢量差的信息和关于用于预测当前块的参考图片的信息被编码。The motion vector predictor can be obtained by applying a predefined function (e.g., center value and average value calculation, etc.) to the motion vector predictor candidate. In this case, the video decoding device also knows the predefined function. In addition, since the neighboring blocks used to derive the motion vector predictor candidate are blocks for which encoding and decoding have been completed, the video decoding device may also already know the motion vectors of the neighboring blocks. Therefore, the video encoding device does not need to encode the information for identifying the motion vector predictor candidate. Thus, in this case, information about the motion vector difference and information about the reference picture used to predict the current block are encoded.
同时,还可以通过选择运动矢量预测器候选中的任何一个的方案来确定运动矢量预测器。在这种情况下,用于识别所选的运动矢量预测器候选的信息与关于运动矢量差的信息和关于用于预测当前块的参考图片的信息联合地另外编码。Meanwhile, the motion vector predictor may also be determined by selecting any one of the motion vector predictor candidates. In this case, information for identifying the selected motion vector predictor candidate is additionally encoded jointly with information about the motion vector difference and information about the reference picture used to predict the current block.
减法器130通过从当前块中减去由帧内预测器122或帧间预测器124生成的预测块来生成残余块。The subtractor 130 generates a residual block by subtracting a prediction block generated by the intra predictor 122 or the inter predictor 124 from the current block.
变换器140将具有空间域的像素值的残余块中的残差信号转换为频域的变换系数。变换器140可通过将残余块的总大小用作变换单元来对残余块中的残差信号进行变换,或者还可将残余块分裂成多个子块,并可通过将子块用作变换单元来执行变换。可选地,所述残余块被划分为两个子块,分别为变换区域和非变换区域,以仅使用变换区域子块作为变换单元对残差信号进行变换。这里,变换区域子块可以是具有基于水平轴(或垂直轴)的1:1的大小比的两个矩形块之一。在这种情况下,标记(cu_sbt_flag)指示仅子块被变换,方向性(垂直/水平)信息(cu_sbt_horizontal_flag)和/或位置信息(cu_sbt_pos_flag)由熵编码器155编码并用信号通知视频给解码装置。此外,变换区域子块的大小可具有基于水平轴(或垂直轴)的1:3的大小比率。在这种情况下,划分相应分裂的标记(cu_sbt_quad_flag)另外由熵编码器155编码,并用信号通知给视频解码装置。The transformer 140 converts the residual signal in the residual block having the pixel value in the spatial domain into a transform coefficient in the frequency domain. The transformer 140 may transform the residual signal in the residual block by using the total size of the residual block as a transform unit, or may also split the residual block into a plurality of sub-blocks, and may perform the transform by using the sub-block as a transform unit. Optionally, the residual block is divided into two sub-blocks, namely a transform region and a non-transform region, so as to transform the residual signal using only the transform region sub-block as a transform unit. Here, the transform region sub-block may be one of two rectangular blocks having a size ratio of 1:1 based on the horizontal axis (or vertical axis). In this case, the flag (cu_sbt_flag) indicates that only the sub-block is transformed, and the directional (vertical/horizontal) information (cu_sbt_horizontal_flag) and/or the position information (cu_sbt_pos_flag) are encoded by the entropy encoder 155 and signaled to the video to the decoding device. In addition, the size of the transform region sub-block may have a size ratio of 1:3 based on the horizontal axis (or vertical axis). In this case, a flag (cu_sbt_quad_flag) for dividing the corresponding split is additionally encoded by the entropy encoder 155 and signaled to the video decoding apparatus.
同时,变换器140可在水平方向和垂直方向上单独地对残余块执行变换。对于变换,可使用不同类型的变换函数或变换矩阵。例如,用于水平变换和垂直变换的一对变换函数可定义为多变换集合(multiple transform set,MTS)。变换器140可选择在MTS中具有最高变换效率的一个变换函数对,并可在水平和垂直方向中的每一个上变换残余块。MTS中的变换函数对的信息(mts_idx)由熵编码器155编码并用信号通知给视频解码装置。Meanwhile, the transformer 140 may perform transformation on the residual block separately in the horizontal direction and the vertical direction. For the transformation, different types of transformation functions or transformation matrices may be used. For example, a pair of transformation functions for horizontal transformation and vertical transformation may be defined as a multiple transform set (MTS). The transformer 140 may select a pair of transformation functions having the highest transformation efficiency in the MTS, and may transform the residual block in each of the horizontal and vertical directions. Information (mts_idx) of the transformation function pair in the MTS is encoded by the entropy encoder 155 and notified to the video decoding device with a signal.
量化器145使用量化参数量化从变换器140输出的变换系数,并将量化的变换系数输出到熵编码器155。量化器145还可以立即量化相关的残余块,而无需用于任何块或帧的变换。量化器145还可根据变换块中的变换系数的位置应用不同的量化系数(缩放值)。应用于以2维布置的量化的变换系数的量化矩阵可被编码并用信号通知给视频解码装置。The quantizer 145 quantizes the transform coefficients output from the transformer 140 using the quantization parameter and outputs the quantized transform coefficients to the entropy encoder 155. The quantizer 145 can also immediately quantize the relevant residual block without transforming any block or frame. The quantizer 145 can also apply different quantization coefficients (scaling values) according to the position of the transform coefficient in the transform block. The quantization matrix applied to the quantized transform coefficients arranged in 2 dimensions can be encoded and notified to the video decoding device with a signal.
重排单元150可以执行针对量化的残余块的系数值的重新排列。The rearrangement unit 150 may perform rearrangement of coefficient values for the quantized residual block.
重排单元150可以通过使用系数扫描将2D系数阵列改变为1D系数序列。例如,重排单元150可以通过使用Z字形扫描或者对角线扫描将DC系数扫描为高频域系数来输出ID系数序列。根据变换单元的大小和帧内预测模式,还可使用沿列方向扫描2D系数阵列的垂直扫描和沿行方向扫描2D块类型系数的水平扫描来代替Z字形扫描。换句话说,根据变换单元的大小和帧内预测模式,可在Z字形扫描、对角扫描、垂直扫描和水平扫描中确定要使用的扫描方法。The rearrangement unit 150 may change the 2D coefficient array into a 1D coefficient sequence by using coefficient scanning. For example, the rearrangement unit 150 may output an ID coefficient sequence by scanning the DC coefficient into a high frequency domain coefficient using a zigzag scan or a diagonal scan. Depending on the size of the transform unit and the intra prediction mode, a vertical scan that scans the 2D coefficient array in the column direction and a horizontal scan that scans the 2D block type coefficients in the row direction may also be used instead of the zigzag scan. In other words, depending on the size of the transform unit and the intra prediction mode, the scanning method to be used may be determined among zigzag scanning, diagonal scanning, vertical scanning, and horizontal scanning.
熵编码器155通过使用包括基于上下文的自适应二进制算术码(CABAC)、指数哥伦布等的各种编码方案对从重排单元150输出的ID量化的变换系数的序列进行编码来产生比特流。The entropy encoder 155 generates a bitstream by encoding the sequence of ID-quantized transform coefficients output from the rearrangement unit 150 using various encoding schemes including context-based adaptive binary arithmetic coding (CABAC), exponential Golomb, and the like.
此外,熵编码器155对与块分裂相关的信息(诸如,CTU大小、CTU分裂标记、QT分裂标记、MTT分裂类型、MTT分裂方向等)进行编码,以允许视频解码装置将块等分裂到视频编码装置。此外,熵编码器155对关于指示当前块是通过帧内预测还是帧间预测来编码的预测类型的信息进行编码。熵编码器155根据预测类型对帧内预测信息(即,关于帧内预测模式的信息)或帧间预测信息(在合并模式、合并索引的情况下,以及在AMVP模式的情况下,关于参考图片索引和运动矢量差的信息)进行编码。此外,熵编码器155对与量化相关的信息(即,关于量化参数的信息和关于量化矩阵的信息)进行编码。In addition, the entropy encoder 155 encodes information related to block splitting (such as CTU size, CTU splitting mark, QT splitting mark, MTT splitting type, MTT splitting direction, etc.) to allow the video decoding device to split the block, etc. to the video encoding device. In addition, the entropy encoder 155 encodes information about the prediction type indicating whether the current block is encoded by intra-frame prediction or inter-frame prediction. The entropy encoder 155 encodes intra-frame prediction information (i.e., information about intra-frame prediction mode) or inter-frame prediction information (in the case of merge mode, merge index, and in the case of AMVP mode, information about reference picture index and motion vector difference) according to the prediction type. In addition, the entropy encoder 155 encodes information related to quantization (i.e., information about quantization parameters and information about quantization matrices).
逆量化器160对从量化器145输出的量化的变换系数进行去量化以产生变换系数。逆变换器165将从逆量化器160输出的变换系数从频域变换到空间域,以重构残余块。The inverse quantizer 160 dequantizes the quantized transform coefficient output from the quantizer 145 to generate a transform coefficient. The inverse transformer 165 transforms the transform coefficient output from the inverse quantizer 160 from the frequency domain to the spatial domain to reconstruct a residual block.
加法器170将重构的残余块和由预测器120生成的预测块相加以重构当前块。当对下一次序块进行帧内预测时,经重构当前块中的像素可用作参考像素。The adder 170 reconstructs the current block by adding the reconstructed residual block and the prediction block generated by the predictor 120. When intra-prediction is performed on a next sequential block, pixels in the reconstructed current block may be used as reference pixels.
环路滤波器单元180对重构的像素执行滤波,以减少由于基于块的预测和变换/量化而发生的块伪影、振铃伪影、模糊伪影等。作为环路滤波器的环路滤波器单元180可包括解块滤波器182、样本自适应偏移(SAO)滤波器184和自适应环路滤波器(ALF)186的全部或一些。The loop filter unit 180 performs filtering on the reconstructed pixels to reduce block artifacts, ringing artifacts, blur artifacts, etc. that occur due to block-based prediction and transform/quantization. The loop filter unit 180 as a loop filter may include all or some of the deblocking filter 182, the sample adaptive offset (SAO) filter 184, and the adaptive loop filter (ALF) 186.
解块滤波器182对重构的块之间的边界进行滤波以便去除由于块单元编码/解码而发生的块效应,并且SAO滤波器184和ALF 186执行针对解块的滤波视频的附加滤波。SAO滤波器184和ALF 186是用于补偿重构的像素与原始像素之间的差的滤波器,所述差由于有损译码而发生。SAO滤波器184应用偏移作为CTU单元以增强主观图像质量和编码效率。另一方面,ALF 186执行块单元滤波,并且通过划分相应块的边界和变化量的程度来应用不同的滤波器来补偿失真。关于将用于ALF的滤波器系数的信息可被编码并用信号通知给视频解码装置。The deblocking filter 182 filters the boundaries between the reconstructed blocks to remove the block effect caused by block unit encoding/decoding, and the SAO filter 184 and the ALF 186 perform additional filtering for the deblocked filtered video. The SAO filter 184 and the ALF 186 are filters for compensating the difference between the reconstructed pixels and the original pixels, which occurs due to lossy decoding. The SAO filter 184 applies an offset as a CTU unit to enhance subjective image quality and coding efficiency. On the other hand, the ALF 186 performs block unit filtering and applies different filters to compensate for distortion by dividing the boundaries of the corresponding blocks and the degree of the change amount. Information about the filter coefficient to be used for the ALF can be encoded and notified to the video decoding device with a signal.
通过解块滤波器182、SAO滤波器184和ALF 186滤波的重构的块存储在存储器190中。当一个图片中的所有块被重构时,重构的图片可以用作参考图片,用于对之后要编码的图片内的块进行帧间预测。The reconstructed blocks filtered by the deblocking filter 182, the SAO filter 184, and the ALF 186 are stored in the memory 190. When all blocks in one picture are reconstructed, the reconstructed picture can be used as a reference picture for inter-frame prediction of blocks in a picture to be encoded later.
图5是可以实现本公开的技术的视频解码装置的功能框图。在下文中,参考图5,描述视频解码装置和该装置的组件。FIG5 is a functional block diagram of a video decoding device that can implement the technology of the present disclosure. In the following, with reference to FIG5 , a video decoding device and components of the device are described.
视频解码装置可包括熵解码器510、重排单元515、逆量化器520、逆变换器530、预测器540、加法器550、环路滤波器单元560和存储器570。The video decoding device may include an entropy decoder 510 , a rearrangement unit 515 , an inverse quantizer 520 , an inverse transformer 530 , a predictor 540 , an adder 550 , a loop filter unit 560 , and a memory 570 .
与图1的视频编码装置相似,视频解码装置的每个组件可实施为硬件或软件或实施为硬件和软件的组合。此外,每个组件的功能可以实现为软件,并且还可以实现微处理器以执行与每个组件对应的软件的功能。Similar to the video encoding device of Figure 1, each component of the video decoding device can be implemented as hardware or software or as a combination of hardware and software. In addition, the function of each component can be implemented as software, and a microprocessor can also be implemented to execute the function of the software corresponding to each component.
熵解码器510通过对由视频编码装置产生的比特流进行解码以确定将被解码的当前块来提取与块分裂相关的信息,并且提取重构当前块所需的预测信息和关于残差信号的信息。The entropy decoder 510 extracts information related to block splitting by decoding a bitstream generated by a video encoding apparatus to determine a current block to be decoded, and extracts prediction information required to reconstruct the current block and information about a residual signal.
熵解码器510通过从序列参数集(SPS)或图片参数集(PPS)提取关于CTU大小的信息来确定CTU的大小,并且将图片分裂成具有确定大小的CTU。另外,确定CTU为树结构的最高层,即,根节点,并且可以提取CTU的分裂信息,以使用树结构对CTU进行分裂。The entropy decoder 510 determines the size of the CTU by extracting information about the CTU size from a sequence parameter set (SPS) or a picture parameter set (PPS), and splits the picture into CTUs of the determined size. In addition, the CTU is determined to be the highest level of the tree structure, that is, the root node, and the split information of the CTU can be extracted to split the CTU using the tree structure.
例如,当使用QTBTTT结构对CTU进行分裂时,首先提取与QT的分裂相关的第一标记(QT_split_flag),以将每个节点分裂为下层的四个节点。此外,相对于与QT的叶节点相应的节点,提取与MTT的分裂相关的第二标记(mtt_split_flag)、分裂方向(垂直/水平)和/或分裂类型(二叉/三叉),以将相应的叶节点分裂为MTT结构。结果,在QT的叶节点下面的每个节点被递归地分裂成BT或TT结构。For example, when a CTU is split using a QTBTTT structure, a first flag (QT_split_flag) associated with the splitting of the QT is first extracted to split each node into four nodes at the lower layer. In addition, with respect to a node corresponding to a leaf node of the QT, a second flag (mtt_split_flag) associated with the splitting of the MTT, a splitting direction (vertical/horizontal) and/or a splitting type (binary/trifurcated) are extracted to split the corresponding leaf node into an MTT structure. As a result, each node below the leaf node of the QT is recursively split into a BT or TT structure.
作为另一实施例,当通过使用QTBTTT结构来分裂CTU时,提取指示CU是否被分裂的CU分裂标记(split_cu_flag)。当对应的块被分裂时,第一标记(QT_split_flag)也可以被提取。在分裂过程期间,相对于每个节点,0次或更多次的递归MTT分裂可以在0次或更多次的递归QT分裂之后发生。例如,对于CTU,MTT分裂可以立即发生,或者相反,仅多次的QT分裂也可以发生。As another embodiment, when a CTU is split by using a QTBTTT structure, a CU split flag (split_cu_flag) indicating whether the CU is split is extracted. When the corresponding block is split, a first flag (QT_split_flag) may also be extracted. During the splitting process, 0 or more recursive MTT splits may occur after 0 or more recursive QT splits with respect to each node. For example, for a CTU, MTT splitting may occur immediately, or conversely, only multiple QT splits may occur.
作为另一实施例,当使用QTBT结构对CTU进行分裂时,提取与QT的分裂有关的第一标记(QT_split_flag),以将每个节点分裂成下层的四个节点。此外,提取指示与QT的叶节点相对应的节点是否被进一步分裂成BT的分裂标记(split_flag)以及分裂方向信息。As another embodiment, when the CTU is split using the QTBT structure, the first flag (QT_split_flag) related to the splitting of the QT is extracted to split each node into four nodes of the lower layer. In addition, the split flag (split_flag) indicating whether the node corresponding to the leaf node of the QT is further split into BTs and the split direction information are extracted.
同时,当熵解码器510通过使用树结构的分裂确定将被解码的当前块时,熵解码器510提取关于指示当前块是被帧内预测还是被帧间预测的预测类型的信息。当预测类型信息指示帧内预测时,熵解码器510提取用于当前块的帧内预测信息(帧内预测模式)的语法元素。当预测类型信息指示帧间预测时,熵解码器510提取表示用于帧间预测信息的语法元素(即,运动矢量和运动矢量所参考的参考图片)的信息。Meanwhile, when the entropy decoder 510 determines the current block to be decoded by splitting using the tree structure, the entropy decoder 510 extracts information about a prediction type indicating whether the current block is intra-predicted or inter-predicted. When the prediction type information indicates intra-prediction, the entropy decoder 510 extracts syntax elements of intra-prediction information (intra-prediction mode) for the current block. When the prediction type information indicates inter-prediction, the entropy decoder 510 extracts information representing syntax elements for inter-prediction information (i.e., motion vectors and reference pictures to which motion vectors refer).
此外,熵解码器510提取量化相关信息并且提取关于当前块的量化的变换系数的信息作为关于残差信号的信息。Also, the entropy decoder 510 extracts quantization-related information and extracts information on a quantized transform coefficient of the current block as information on a residual signal.
重排单元515可以再次以与由视频编码装置执行的系数扫描顺序相反的顺序,将由熵解码器510熵解码的1D量化的变换系数的序列改变为2D系数阵列(即,块)。The rearrangement unit 515 may change the sequence of 1D quantized transform coefficients entropy-decoded by the entropy decoder 510 into a 2D coefficient array (ie, block) again in the reverse order of the coefficient scanning order performed by the video encoding device.
逆量化器520对量化的变换系数进行去量化,并且通过使用量化参数对量化的变换系数进行去量化。逆量化器520还可将不同的量化系数(缩放值)应用于以2D布置的量化的变换系数。逆量化器520可通过将来自视频编码装置的量化系数的矩阵(缩放值)应用于量化的变换系数的2D阵列来执行逆量化。The inverse quantizer 520 dequantizes the quantized transform coefficients and dequantizes the quantized transform coefficients by using quantization parameters. The inverse quantizer 520 may also apply different quantization coefficients (scaling values) to the quantized transform coefficients arranged in 2D. The inverse quantizer 520 may perform inverse quantization by applying a matrix (scaling value) of quantization coefficients from a video encoding device to a 2D array of quantized transform coefficients.
逆变换器530通过经由将去量化的变换系数从频域逆变换到空间域中来重构残差信号来产生用于当前块的残余块。The inverse transformer 530 generates a residual block for the current block by reconstructing a residual signal by inversely transforming the dequantized transform coefficients from the frequency domain into the spatial domain.
此外,当逆变换器530对变换块的部分区域(子块)进行逆变换时,逆变换器530提取仅变换块的子块被变换的标记(cu_sbt_flag)、子块的方向(垂直/水平)信息(cu_sbt_horizontal_flag)和/或子块的位置信息(cu_sbt_pos_flag)。逆变换器530还将相应子块的变换系数从频域逆变换到空间域中以重构残差信号并且利用值“0”作为残差信号填充未被逆变换的区域以生成用于当前块的最终残余块。In addition, when the inverse transformer 530 inversely transforms a partial region (sub-block) of the transform block, the inverse transformer 530 extracts a flag (cu_sbt_flag) that only the sub-block of the transform block is transformed, direction (vertical/horizontal) information (cu_sbt_horizontal_flag) of the sub-block, and/or position information (cu_sbt_pos_flag) of the sub-block. The inverse transformer 530 also inversely transforms the transform coefficient of the corresponding sub-block from the frequency domain to the spatial domain to reconstruct the residual signal and fills the region that is not inversely transformed with a value of "0" as the residual signal to generate a final residual block for the current block.
此外,当应用MTS时,逆变换器530通过使用从视频编码装置信号通知的MTS信息(mts_idx)来确定在水平方向和垂直方向中的每一个方向上应用的变换索引或变换矩阵。逆变换器530还通过使用所确定的变换函数在水平方向和垂直方向上对变换块中的变换系数执行逆变换。In addition, when MTS is applied, the inverse transformer 530 determines a transform index or a transform matrix applied in each of the horizontal and vertical directions by using MTS information (mts_idx) signaled from the video encoding device. The inverse transformer 530 also performs inverse transform on the transform coefficients in the transform block in the horizontal and vertical directions by using the determined transform function.
预测器540可包括帧内预测器542和帧间预测器544。在当前块的预测类型是帧内预测时激活帧内预测器542,并且在当前块的预测类型是帧间预测时激活帧间预测器544。The predictor 540 may include an intra predictor 542 and an inter predictor 544. The intra predictor 542 is activated when the prediction type of the current block is intra prediction, and the inter predictor 544 is activated when the prediction type of the current block is inter prediction.
帧内预测器542根据从熵解码器510提取的帧内预测模式的语法元素,在多个帧内预测模式之中确定当前块的帧内预测模式。帧内预测器542还通过根据帧内预测模式使用当前块的相邻参考像素来预测当前块。The intra predictor 542 determines an intra prediction mode of a current block among a plurality of intra prediction modes according to a syntax element of the intra prediction mode extracted from the entropy decoder 510. The intra predictor 542 also predicts the current block by using neighboring reference pixels of the current block according to the intra prediction mode.
帧间预测器544通过使用从熵解码器510提取的用于帧间预测模式的语法元素确定当前块的运动矢量和运动矢量所参考的参考图片。The inter predictor 544 determines a motion vector of a current block and a reference picture to which the motion vector refers by using a syntax element for the inter prediction mode extracted from the entropy decoder 510 .
加法器550通过将从逆变换器530输出的残余块和从帧间预测器544或帧内预测器542输出的预测块相加来重构当前块。在对之后要解码的块进行帧内预测时,重构的当前块内的像素被用作参考像素。The adder 550 reconstructs the current block by adding the residual block output from the inverse transformer 530 and the prediction block output from the inter predictor 544 or the intra predictor 542. When intra prediction is performed on a block to be decoded later, pixels within the reconstructed current block are used as reference pixels.
作为环路滤波器的环路滤波器单元560可以包括解块滤波器562、SAO滤波器564和ALF 566。解块滤波器562对重构的块之间的边界执行解块滤波,以便去除由于块单元解码而发生的块效应。SAO滤波器564和ALF 566在解块滤波之后对重构块执行附加滤波,以便补偿重构像素和原始像素之间的差,所述差由于有损编码而发生。通过使用关于从比特流解码的滤波器系数的信息来确定ALF的滤波器系数。The loop filter unit 560 as a loop filter may include a deblocking filter 562, an SAO filter 564, and an ALF 566. The deblocking filter 562 performs deblocking filtering on the boundary between the reconstructed blocks in order to remove the block effect that occurs due to the block unit decoding. The SAO filter 564 and the ALF 566 perform additional filtering on the reconstructed blocks after the deblocking filtering in order to compensate for the difference between the reconstructed pixels and the original pixels that occurs due to the lossy encoding. The filter coefficients of the ALF are determined by using information about the filter coefficients decoded from the bitstream.
通过解块滤波器562、SAO滤波器564和ALF 566滤波的重构的块存储在存储器570中。当一个图片中的所有块被重构时,重构的图片可以用作参考图片,用于对之后要编码的图片内的块进行帧间预测。The reconstructed blocks filtered by the deblocking filter 562, the SAO filter 564, and the ALF 566 are stored in the memory 570. When all blocks in one picture are reconstructed, the reconstructed picture can be used as a reference picture for inter-frame prediction of blocks in a picture to be encoded later.
在一些实施方式中,本公开涉及如上文所描述的对视频图像进行编码和解码。更具体地,本公开提供了用于通过使用各种导出方案导出当前块的参考样本线的视频编码方法和装置。所述视频编码方法和装置使用所导出的参考样本线来执行当前块的帧内预测。In some embodiments, the present disclosure relates to encoding and decoding a video image as described above. More specifically, the present disclosure provides a video encoding method and apparatus for deriving a reference sample line of a current block by using various derivation schemes. The video encoding method and apparatus use the derived reference sample line to perform intra-frame prediction of the current block.
下面的实施方式可以由视频编码装置中的帧内预测器122来执行。它们还可由视频解码装置中的帧内预测器542执行。The following embodiments may be performed by the intra predictor 122 in the video encoding device. They may also be performed by the intra predictor 542 in the video decoding device.
在对当前块进行帧内预测中,视频编码装置可在优化率失真方面产生与本实施方式相关联的信令信息。视频编码装置可使用熵编码器155来编码信令信息且将其传输到视频解码装置。视频解码装置可使用熵解码器510从比特流解码与当前块的帧内预测相关联的信令信息。In performing intra prediction on the current block, the video encoding device may generate signaling information associated with the present embodiment in terms of optimizing rate distortion. The video encoding device may use the entropy encoder 155 to encode the signaling information and transmit it to the video decoding device. The video decoding device may use the entropy decoder 510 to decode the signaling information associated with intra prediction of the current block from the bitstream.
在以下描述中,术语“预测单元”(PU)(具体地,当前预测单元)可以与如上所述的当前块或译码单元(CU)互换地使用,或者预测单元可以指译码单元的一些区域。In the following description, the term 'prediction unit' (PU), specifically, current prediction unit, may be used interchangeably with a current block or coding unit (CU) as described above, or a prediction unit may refer to some area of a coding unit.
此外,标记的真值指示将标记设置为1的情况。此外,标记的假值指示将标记设置为0的情况。In addition, a true value of the flag indicates a case where the flag is set to 1. In addition, a false value of the flag indicates a case where the flag is set to 0.
I.帧间预测的合并模式I. Inter-frame prediction merge mode
以下实施方式可应用于视频编码装置中的帧间预测器124。The following embodiments may be applied to the inter-frame predictor 124 in the video encoding apparatus.
下文描述用于在帧间预测的合并模式中构成运动信息的合并候选列表的方法。为了支持合并模式,视频编码装置中的帧间预测器124可选择预设数目(例如,六个)的合并候选以形成合并候选列表。A method for constituting a merge candidate list of motion information in a merge mode of inter prediction is described below. To support the merge mode, the inter predictor 124 in the video encoding device may select a preset number (eg, six) of merge candidates to form a merge candidate list.
帧间预测器124检测空间合并候选。帧间预测器124从相邻块中搜索空间合并候选,如图4中所示。可选择多达四个空间合并候选。The inter predictor 124 detects a spatial merging candidate. The inter predictor 124 searches for a spatial merging candidate from neighboring blocks, as shown in FIG4. Up to four spatial merging candidates may be selected.
帧间预测器124搜索时间合并候选。帧间预测器124可将与当前块位协同定位且在参考图片(与用以预测当前块相同的图片或不同图片)中而不在目标块所位于的当前图片中的块添加为时间合并候选。可选择一个时间合并候选。The inter-frame predictor 124 searches for temporal merge candidates. The inter-frame predictor 124 may add a block that is co-located with the current block and in a reference picture (the same picture as used to predict the current block or a different picture) but not in the current picture where the target block is located as a temporal merge candidate. One temporal merge candidate may be selected.
帧间预测器124搜索基于历史的运动矢量预测器(HMVP)候选。帧间预测器124可将先前n(其中,n为自然数)个CU的运动矢量存储在表中,且将运动矢量用作合并候选。该表的大小为6,并且以先进先出(FIFO)方案来存储先前CU的运动矢量。这指示多达6个HMVP候选存储在表中。帧间预测器124可将存储在表中的HMVP候选当中的最近运动矢量设定为合并候选。The inter-frame predictor 124 searches for a history-based motion vector predictor (HMVP) candidate. The inter-frame predictor 124 may store the motion vectors of the previous n (where n is a natural number) CUs in a table and use the motion vectors as merge candidates. The size of the table is 6, and the motion vectors of the previous CUs are stored in a first-in-first-out (FIFO) scheme. This indicates that up to 6 HMVP candidates are stored in the table. The inter-frame predictor 124 may set the most recent motion vector among the HMVP candidates stored in the table as a merge candidate.
帧间预测器124搜索成对的平均运动矢量预测器(PAMVP)候选。帧间预测器124可将合并候选列表中的第一候选和第二候选的运动矢量的平均值设定为合并候选。The inter predictor 124 searches for a paired average motion vector predictor (PAMVP) candidate. The inter predictor 124 may set an average of motion vectors of a first candidate and a second candidate in the merge candidate list as a merge candidate.
如果在执行所有上述搜索程序之后无法填充合并候选列表(即,不满足候选的预设数目),那么帧间预测器124添加零运动矢量作为合并候选。If the merge candidate list cannot be filled (ie, the preset number of candidates is not satisfied) after performing all the above-mentioned search procedures, the inter predictor 124 adds a zero motion vector as a merge candidate.
在视频编码装置中构成合并候选列表的前述方法可以在视频解码装置中由帧间预测器544同样地执行。The aforementioned method of constructing a merge candidate list in the video encoding device may be similarly performed by the inter-frame predictor 544 in the video decoding device.
II.参考样本线的导出II. Derivation of reference sample lines
以下实施方式以视频解码装置的帧内预测器542为中心来描述,但可同样地在视频编码装置的帧内预测器122中实现。The following embodiments are described with the intra-frame predictor 542 of the video decoding apparatus as the center, but can also be implemented in the intra-frame predictor 122 of the video encoding apparatus.
图6是根据本公开的至少一个实施方式的详细帧内预测器的框图。FIG. 6 is a block diagram of a detailed intra predictor in accordance with at least one embodiment of the present disclosure.
视频解码装置的帧内预测器542可包括子块分裂-无分裂信息导出器(subblocksplit-no-split information deriver)602、预测模式导出器604、参考样本线导出器606和预测执行器608的全部或部分。The intra predictor 542 of the video decoding apparatus may include all or part of a subblock split-no-split information deriver 602 , a prediction mode deriver 604 , a reference sample line deriver 606 , and a prediction executor 608 .
子块分裂-无分裂信息导出器602可导出关于当前块是否将被分裂成子块的信息。如果当前块被分裂成子块,则参考样本线导出器606和预测执行器608可以通过子块单元执行它们的相关操作。The sub-block split-no-split information deriver 602 may derive information on whether the current block is to be split into sub-blocks. If the current block is split into sub-blocks, the reference sample line deriver 606 and the prediction executor 608 may perform their related operations by sub-block units.
预测模式导出器604可以导出用于当前块的帧内预测模式。这里,预测模式可以是基于矩阵的预测模式、定向预测模式、非定向预测模式(诸如DC或平面)等。例如,预测模式导出器604可从比特流解码帧内预测模式。The prediction mode deriver 604 may derive an intra prediction mode for the current block. Here, the prediction mode may be a matrix-based prediction mode, a directional prediction mode, a non-directional prediction mode (such as DC or plane), etc. For example, the prediction mode deriver 604 may decode the intra prediction mode from the bitstream.
另一方面,子块分裂-无分裂信息导出器602和预测模式导出器604的操作也可由熵解码器510执行。On the other hand, the operations of the sub-block splitting-non-splitting information deriver 602 and the prediction mode deriver 604 may also be performed by the entropy decoder 510 .
参考样本线导出器606可根据预测模式导出所要求位置处的参考样本线。The reference sample line deriver 606 may derive reference sample lines at required positions according to the prediction mode.
预测执行器608可以通过使用导出的参考样本线中的参考样本根据预测模式产生当前块的预测块。The prediction performer 608 may generate a prediction block of the current block according to the prediction mode by using the reference samples in the derived reference sample line.
例如,参考样本线导出器606可解码指示当前块或子块的参考样本线导出模式的导出模式索引。参考样本线导出器606可根据导出模式索引选择固定位置参考样本线模式、可变位置参考样本线模式和参考样本线列表参考模式的导出模式中的一个,如图6中所示出的,且然后使用所选择的导出模式来导出当前块或当前子块的参考样本线。如果满足第一条件到第三条件中的至少一个,那么导出模式索引可由视频编码装置编码以指示可变位置参考样本线模式或参考样本线列表参考模式。另一方面,如果第一条件到第三条件中没有一个被满足,那么导出模式索引可由视频编码装置编码以指示固定位置参考样本线模式。For example, the reference sample line exporter 606 may decode an export mode index indicating a reference sample line export mode of the current block or sub-block. The reference sample line exporter 606 may select one of the export modes of the fixed position reference sample line mode, the variable position reference sample line mode, and the reference sample line list reference mode according to the export mode index, as shown in FIG6, and then use the selected export mode to export the reference sample line of the current block or the current sub-block. If at least one of the first condition to the third condition is satisfied, the export mode index may be encoded by the video encoding device to indicate the variable position reference sample line mode or the reference sample line list reference mode. On the other hand, if none of the first condition to the third condition is satisfied, the export mode index may be encoded by the video encoding device to indicate the fixed position reference sample line mode.
作为另一实施例,参考样本线导出器606可检查是否满足第一条件到第三条件中的至少一个,且如果满足至少一个条件,那么其可使用可变位置参考样本线模式或参考样本线列表参考模式。此时,参考样本线导出器606可解析指示两个模式中的一个的标记。另一方面,如果第一条件到第三条件中没有一个被满足,则参考样本线导出器606可以使用固定位置参考样本线模式。As another embodiment, the reference sample line exporter 606 may check whether at least one of the first to third conditions is satisfied, and if at least one condition is satisfied, it may use the variable position reference sample line mode or the reference sample line list reference mode. At this time, the reference sample line exporter 606 may parse a flag indicating one of the two modes. On the other hand, if none of the first to third conditions is satisfied, the reference sample line exporter 606 may use the fixed position reference sample line mode.
以下,对第一条件至第三条件进行描述。此外,当前块的左上像素的(x,y)坐标被定义为(0,0)。The first to third conditions are described below. In addition, the (x, y) coordinates of the upper left pixel of the current block are defined as (0, 0).
如图7所示,第一条件是在包括当前块的在预设位置(诸如(-1,-1)、(-1,0)、(0,-1)、(-1,puH)和(puW,-1))处的相邻像素的重构的预测单元中的至少预设数量的预测单元已经通过帧间预测重构的时候。在第一条件下,puW和puH表示当前块(即,当前预测单元)的水平长度和垂直长度。7, the first condition is when at least a preset number of prediction units among the reconstructed prediction units including neighboring pixels of the current block at preset positions (such as (-1, -1), (-1, 0), (0, -1), (-1, puH), and (puW, -1)) have been reconstructed by inter-frame prediction. Under the first condition, puW and puH represent the horizontal length and vertical length of the current block (i.e., the current prediction unit).
第二条件是在第一条件中通过帧间预测的重构的预测单元的运动矢量编码模式是如上所述的空间合并模式。The second condition is that the motion vector encoding mode of the prediction unit reconstructed by inter prediction in the first condition is the spatial merging mode as described above.
第三条件是在第一条件下通过帧间预测的重构的预测单元的运动矢量彼此相似。The third condition is that motion vectors of prediction units reconstructed by inter prediction under the first condition are similar to each other.
在参考样本线导出模式中,固定位置参考样本线模式利用常规参考样本线导出方案。参考样本线导出器606可响应于固定位置参考样本线模式的选择以用于将分别邻近于当前块的左边界和上边界的像素线导出为参考样本线。In the reference sample line derivation mode, the fixed position reference sample line mode utilizes a conventional reference sample line derivation scheme. The reference sample line deriver 606 can be responsive to the selection of the fixed position reference sample line mode to derive pixel lines adjacent to the left and top boundaries of the current block as reference sample lines.
图8a至图8d是示出根据本公开的至少一个实施方式的在可变位置参考样本线模式中的参考样本线的导出的示图。8a to 8d are diagrams illustrating derivation of reference sample lines in a variable position reference sample line mode, in accordance with at least one embodiment of the present disclosure.
当在参考样本线导出模式中选择可变位置参考样本线模式时,参考样本线导出器606可导出不邻近于当前块的参考样本线。例如,参考样本线导出器606可分别从远离当前块的左边界或上边界的N个像素的像素线导出左参考样本线和上参考样本线。可选地,参考样本线导出器606可以选择如图8a至图8d中所示出的多个导出方法之一,并且根据所选择的方法导出左参考样本线和上参考样本线。左参考样本线的导出方法索引和上参考样本线的导出方法索引可从视频编码装置用信号发送到视频解码装置。作为另一实施例,一个导出方法索引可被左参考样本线和上参考样本线所共享。When the variable position reference sample line mode is selected in the reference sample line derivation mode, the reference sample line deriver 606 may derive a reference sample line that is not adjacent to the current block. For example, the reference sample line deriver 606 may derive a left reference sample line and an upper reference sample line from a pixel line of N pixels away from the left boundary or the upper boundary of the current block, respectively. Optionally, the reference sample line deriver 606 may select one of a plurality of derivation methods as shown in FIGS. 8a to 8d , and derive the left reference sample line and the upper reference sample line according to the selected method. The derivation method index of the left reference sample line and the derivation method index of the upper reference sample line may be sent from the video encoding device to the video decoding device using a signal. As another embodiment, one derivation method index may be shared by the left reference sample line and the upper reference sample line.
以下参照图8a至图8d描述导出左参考样本线的方法。A method of deriving the left reference sample line is described below with reference to FIGS. 8 a to 8 d .
左参考样本线的第一导出方法将含有当前块的处理单元的左相邻样本当中的与当前块的左上像素共享共同y轴坐标的像素的坐标定义为(xL,yL)。参考样本线导出器606可以得到(xL,yL+α)至(xL,yL+puH×2-1)范围内的样本作为左参考样本线,如图8a中所示。The first derivation method of the left reference sample line defines the coordinates of the pixel that shares the common y-axis coordinate with the upper left pixel of the current block among the left neighboring samples of the processing unit containing the current block as (xL, yL). The reference sample line deriver 606 can obtain samples in the range of (xL, yL+α) to (xL, yL+puH×2-1) as the left reference sample line, as shown in FIG8a.
这里,α可以是预设整数,诸如-1。另外,如上所述,puW和puH表示当前块(即,当前预测单元)的水平长度和垂直长度。Here, α may be a preset integer such as −1. In addition, as described above, puW and puH represent the horizontal length and the vertical length of the current block (ie, the current prediction unit).
处理单元可通过对译码树单元进行分割来产生。一个或多个译码单元可包括在处理单元中,或一个或多个处理单元可包括在译码单元中。示例性的处理单元是虚拟管道数据单元(Virtual Pipeline Data Unit,VPDU)。VPDU是可以由虚拟管道处理的数据单元。VPDU是可以被一次编码和解码的最大单元,并且可以被用于在CTU的大小增加时降低硬件实现的成本。在图8a的实施例中,vpduW和vpduH表示VDPU的宽度和高度。The processing unit may be generated by segmenting the decoding tree unit. One or more decoding units may be included in the processing unit, or one or more processing units may be included in the decoding unit. An exemplary processing unit is a virtual pipeline data unit (VPDU). A VPDU is a data unit that can be processed by a virtual pipeline. A VPDU is the largest unit that can be encoded and decoded at one time, and can be used to reduce the cost of hardware implementation when the size of a CTU increases. In the embodiment of FIG. 8a, vpduW and vpduH represent the width and height of a VDPU.
用于左参考样本线的第二导出方法将在包括(-1,0)像素的预测单元的左部相邻样本之中与当前块的左上部(0,0)像素共享公共y轴坐标的像素的坐标定义为(xL,yL)。参考样本线导出器606可以导出(xL,yL+α)至(xL,yL+puH×2-1)范围内的样本作为左参考样本线,如图8b所示。The second derivation method for the left reference sample line defines the coordinates of the pixel that shares the common y-axis coordinate with the upper left (0, 0) pixel of the current block among the left neighboring samples of the prediction unit including the (-1, 0) pixel as (xL, yL). The reference sample line deriver 606 can derive samples in the range of (xL, yL+α) to (xL, yL+puH×2-1) as the left reference sample line, as shown in FIG8b.
左参考样本线的第三导出方法将处理单元内在当前块之前解码的区域之中的存在于当前块的左侧的任意像素的坐标定义为(xL,yL)。参考样本线导出器606可以得到(xL,yL+α)至(xL,yL+puH×2-1)范围内的样本作为左参考样本线,如图8c中所示。在此情况下,当前块的左上像素的y轴坐标可不同于任意像素的yL坐标。所导出的参考样本线中的像素可全部包括于同一预测单元中。可选地,一些像素可包括在不同预测单元中。The third derivation method of the left reference sample line defines the coordinates of any pixel existing on the left side of the current block in the area decoded before the current block within the processing unit as (xL, yL). The reference sample line deriver 606 can obtain samples in the range of (xL, yL+α) to (xL, yL+puH×2-1) as the left reference sample line, as shown in FIG8c. In this case, the y-axis coordinate of the upper left pixel of the current block may be different from the yL coordinate of any pixel. The pixels in the derived reference sample line may all be included in the same prediction unit. Alternatively, some pixels may be included in different prediction units.
任意像素的坐标(xL,yL)可从视频编码装置用信号发送到视频解码装置。作为另一实施例,可将任意像素的坐标与当前块的左上像素的坐标之间的偏移值从视频编码装置用信号发送到视频解码装置。The coordinates (xL, yL) of any pixel may be sent from the video encoding device to the video decoding device using a signal. As another embodiment, the offset value between the coordinates of any pixel and the coordinates of the upper left pixel of the current block may be sent from the video encoding device to the video decoding device using a signal.
如图8d中所示的左参考样本线的第四导出方法可以使根据第一导出方法、第二导出方法或第三导出方法导出的左参考样本线中的像素的x轴坐标变得不同。包括在左参考样本线中的具有相同x轴坐标和连续y轴坐标的像素被定义为连续的像素组。可以有两个或更多个连续的像素组。可以预先设定连续的像素组的数量,或者可以解析连续的像素组的数量。可选地,当应用特定定向预测模式时,可解析连续像素组的数量。连续的像素组的长度可以是puH、puH+1、puH/2、puH/2+1等。The fourth derivation method of the left reference sample line as shown in Figure 8d can make the x-axis coordinates of the pixels in the left reference sample line derived according to the first derivation method, the second derivation method or the third derivation method different. The pixels with the same x-axis coordinate and continuous y-axis coordinate included in the left reference sample line are defined as a continuous pixel group. There may be two or more continuous pixel groups. The number of continuous pixel groups may be preset, or the number of continuous pixel groups may be parsed. Optionally, when a specific directional prediction mode is applied, the number of continuous pixel groups may be parsed. The length of the continuous pixel group may be puH, puH+1, puH/2, puH/2+1, etc.
对于每个连续像素组,可以解析开始像素的坐标和长度。作为另一实施例,可建立包括开始像素的坐标的列表和包括开始像素的长度的列表。然后,对于每个连续的像素组,可以解析指示开始像素的坐标的索引和指示开始像素的长度的索引。For each continuous pixel group, the coordinates and length of the starting pixel can be parsed. As another embodiment, a list including the coordinates of the starting pixel and a list including the length of the starting pixel can be established. Then, for each continuous pixel group, an index indicating the coordinates of the starting pixel and an index indicating the length of the starting pixel can be parsed.
以下参照图8a至图8d描述获得上参考样本线的方法。A method of obtaining the upper reference sample line is described below with reference to FIGS. 8 a to 8 d .
用于上参考样本线的第一导出方法定义在包括当前块的处理单元的上相邻样本之中与当前块的左上像素共享公共x轴坐标的像素的坐标为(xT,yT)。参考样本线导出器606可以导出在(xT+α,yT)至(xT+puW×2-1,yT)范围内的样本作为上参考样本线,如图8a所示。The first derivation method for the upper reference sample line defines the coordinates of the pixel that shares a common x-axis coordinate with the upper left pixel of the current block among the upper neighboring samples of the processing unit including the current block as (xT, yT). The reference sample line deriver 606 can derive samples in the range of (xT+α, yT) to (xT+puW×2-1, yT) as the upper reference sample line, as shown in FIG8a.
用于上参考样本线的第二导出方法定义在包括(0,-1)像素的预测单元的上相邻样本之中与当前块的左上(0,0)像素共享公共x轴坐标的像素的坐标为(xT,yT)。参考样本线导出器606可以导出在(xT+α,yT)至(xT+puW×2-1,yT)范围内的样本作为上参考样本线,如图8b所示。The second derivation method for the upper reference sample line defines the coordinates of the pixel that shares the common x-axis coordinate with the upper left (0, 0) pixel of the current block among the upper neighboring samples of the prediction unit including the (0, -1) pixel as (xT, yT). The reference sample line deriver 606 can derive samples in the range of (xT+α, yT) to (xT+puW×2-1, yT) as the upper reference sample line, as shown in FIG8b.
用于上参考样本线的第三导出方法将在处理单元内的当前块之前解码的区域之中的存在于当前块的上部的任意像素的坐标定义为(xT,yT)。参考样本线导出器606可以导出在(xT+α,yT)至(xT+puW×2-1,yT)范围内的样本作为上参考样本线,如图8c所示。在这种情况下,当前块的左上像素的x轴坐标可以不同于任意像素的xT坐标。The third derivation method for the upper reference sample line defines the coordinates of any pixel existing in the upper part of the current block in the area decoded before the current block in the processing unit as (xT, yT). The reference sample line deriver 606 can derive samples in the range of (xT+α, yT) to (xT+puW×2-1, yT) as the upper reference sample line, as shown in FIG8c. In this case, the x-axis coordinate of the upper left pixel of the current block may be different from the xT coordinate of any pixel.
任意像素的坐标(xT,yT)可以从视频编码装置用信号通知给视频解码装置。作为另一实施例,可将任意像素的坐标与当前块的左上像素的坐标之间的偏移值从视频编码装置用信号通知给视频解码装置。The coordinates (xT, yT) of any pixel may be notified from the video encoding device to the video decoding device by a signal. As another embodiment, the offset value between the coordinates of any pixel and the coordinates of the upper left pixel of the current block may be notified from the video encoding device to the video decoding device by a signal.
如图8d中所示的上参考样本线的第四导出方法可以使根据第一导出方法、第二导出方法或第三导出方法导出的左参考样本线中的像素的y轴坐标变得不同。包括在上参考样本线中以具有相同的y轴坐标和连续的x轴坐标的像素可以被称为连续的像素组。可以有两个或多个连续的像素组。可以预先设定连续的像素组的数量,或者可以解析连续的像素组的数量。可选地,当应用特定定向预测模式时,可解析连续像素组的数量。连续的像素组的长度可以是puW、puW+1、puW/2、puW/2+1等。The fourth derivation method of the upper reference sample line as shown in Figure 8d can make the y-axis coordinates of the pixels in the left reference sample line derived according to the first derivation method, the second derivation method or the third derivation method different. Pixels included in the upper reference sample line with the same y-axis coordinate and continuous x-axis coordinates can be referred to as a continuous pixel group. There can be two or more continuous pixel groups. The number of continuous pixel groups can be preset, or the number of continuous pixel groups can be parsed. Optionally, when a specific directional prediction mode is applied, the number of continuous pixel groups can be parsed. The length of the continuous pixel group can be puW, puW+1, puW/2, puW/2+1, etc.
对于每个连续的像素组,可以解析开始像素的坐标和长度。作为另一实施例,可建立包括开始像素的坐标的列表和包括开始像素的长度的列表。然后,对于每个连续的像素组,可以解析指示开始像素的坐标的索引和指示开始像素的长度的索引。For each continuous pixel group, the coordinates and length of the starting pixel can be parsed. As another embodiment, a list including the coordinates of the starting pixel and a list including the length of the starting pixel can be established. Then, for each continuous pixel group, an index indicating the coordinates of the starting pixel and an index indicating the length of the starting pixel can be parsed.
图9是示出根据本公开的至少一个实施方式的在上参考样本线内的细分的参考样本线的示图。FIG. 9 is a diagram illustrating subdivided reference sample lines within an upper reference sample line, in accordance with at least one embodiment of the present disclosure.
当根据上参考样本线或者左参考样本线的第一导出方法、第二导出方法、第三导出方法、或者第四导出方法导出上参考样本线或者左参考样本线时,导出的参考样本线中的像素可以全部被包括在相同的预测单元中。可选地,一些像素可包括在不同预测单元中。例如,在如图9中所示的上参考样本线中,范围(xT,yT)至(xT+a-1,yT)中的像素可以包括在预测单元A中,范围(xT+a,yT)至(xT+a+b-1,yT)中的像素可以包括在预测单元B中,并且范围(xT+a+b,yT)至(xT+a+b+c-1,yT)中的像素可以包括在预测单元C中。在下文中,如在图9中所示,包括在参考样本线中的一个预测单元中的参考样本被定义为细分的参考样本线。When an upper reference sample line or a left reference sample line is derived according to the first derivation method, the second derivation method, the third derivation method, or the fourth derivation method of the upper reference sample line or the left reference sample line, the pixels in the derived reference sample line may all be included in the same prediction unit. Optionally, some pixels may be included in different prediction units. For example, in the upper reference sample line as shown in FIG. 9 , pixels in the range (xT, yT) to (xT+a-1, yT) may be included in prediction unit A, pixels in the range (xT+a, yT) to (xT+a+b-1, yT) may be included in prediction unit B, and pixels in the range (xT+a+b, yT) to (xT+a+b+c-1, yT) may be included in prediction unit C. In the following, as shown in FIG. 9 , a reference sample included in one prediction unit in a reference sample line is defined as a subdivided reference sample line.
当根据上参考样本线或左参考样本线的第一导出方法、第二导出方法或第三导出方法导出上参考样本线或左参考样本线时,参考样本线导出器606可对邻近于或接近预测单元之间的边界的像素执行解块滤波。在图9的图示中,(xT,yT)、(xT+a-1,yT)、(xT+a,yT)、(xT+a+b-1,yT)、(xT+a+b,yT)和(xT+a+b+c-1,yT)表示与上参考样本线中的边界相邻的像素。When the upper reference sample line or the left reference sample line is derived according to the first derivation method, the second derivation method, or the third derivation method of the upper reference sample line or the left reference sample line, the reference sample line deriver 606 may perform deblocking filtering on pixels adjacent to or close to the boundary between prediction units. In the illustration of FIG9 , (xT, yT), (xT+a-1, yT), (xT+a, yT), (xT+a+b-1, yT), (xT+a+b, yT), and (xT+a+b+c-1, yT) represent pixels adjacent to the boundary in the upper reference sample line.
参考样本线导出器606可确定是否执行滤波以去除候选像素边界处的块效应。可对满足以下条件中的一个或多个的候选像素边界执行滤波。The reference sample line deriver 606 may determine whether to perform filtering to remove blocking artifacts at candidate pixel boundaries. Filtering may be performed on candidate pixel boundaries that satisfy one or more of the following conditions.
第一条件是在边界的任一侧上的两个预测单元量化参数不同。The first condition is that the two prediction unit quantization parameters on either side of the boundary are different.
第二条件是边界上的两个像素之间的像素值的差大于下阈值且小于上阈值。在这种情况下,可基于两个像素的量化参数确定上阈值。The second condition is that the difference in pixel values between two pixels on the boundary is greater than a lower threshold and less than an upper threshold. In this case, the upper threshold may be determined based on the quantization parameters of the two pixels.
第三条件是经细分的参考样本线的起始像素与结束像素之间的像素值差小于或等于上阈值。The third condition is that the pixel value difference between the start pixel and the end pixel of the subdivided reference sample line is less than or equal to the upper threshold.
参考样本线导出器606可如下对边界的任一侧上的d个像素执行滤波。参考样本线导出器606可通过执行2d个像素的值的加权求和来导出经滤波的值以对上述2d像素之中的任意像素P进行滤波。在这种情况下,用于滤波的像素的位置和乘以每个像素的权重可以通过像素P和边界之间的距离来确定。例如,对于图9中所示的坐标(xT+a,yT)处的像素和坐标(xT+a+1,yT)处的像素,可如等式1和等式2中所示执行滤波。The reference sample line deriver 606 may perform filtering on the d pixels on either side of the boundary as follows. The reference sample line deriver 606 may derive a filtered value by performing a weighted sum of the values of the 2d pixels to filter any pixel P among the 2d pixels. In this case, the position of the pixel used for filtering and the weight multiplied by each pixel may be determined by the distance between the pixel P and the boundary. For example, for the pixel at the coordinate (xT+a, yT) and the pixel at the coordinate (xT+a+1, yT) shown in FIG. 9 , filtering may be performed as shown in Equations 1 and 2.
【等式1】[Equation 1]
I′(xT+a,yT)=(I(xT+a-2,yT)+2·I(xT+a-1,yT)+2·I(xT+a,yT)+2·I(xT+a+1,yT)+I(xT+a+2,yT)+4)>>3I′(xT+a,yT)=(I(xT+a-2,yT)+2·I(xT+a-1,yT)+2·I(xT+a,yT)+2·I(xT+a+1,yT)+I(xT+a+2,yT)+4)>>3
【等式2】[Equation 2]
I′(xT+a+1,yT)=(I(xT+a-1,yT)+I(xT+a,yT)+I(xT+a+1,yT)+I(xT+a+2,yT)+2)>>2I′(xT+a+1,yT)=(I(xT+a-1,yT)+I(xT+a,yT)+I(xT+a+1,yT)+I(xT+a+2,yT)+2)>>2
这里,I()和I'()表示滤波之前和之后的像素值。Here, I() and I'() represent pixel values before and after filtering.
在滤波过程中可使用与像素p的左和右(或上和下)相同或相似数量的像素。像素p越靠近边界,在滤波处理中可以使用越多的近边界像素。权重或滤波器系数可具有低通滤波器的特性。可使用基于像素p与边界之间的距离的滤波器系数,且像素越靠近像素p,滤波器系数的绝对值可乘得越大。The same or similar number of pixels to the left and right (or top and bottom) of pixel p can be used in the filtering process. The closer the pixel p is to the border, the more near-border pixels can be used in the filtering process. The weights or filter coefficients can have the characteristics of a low-pass filter. A filter coefficient based on the distance between pixel p and the border can be used, and the closer the pixel is to pixel p, the larger the absolute value of the filter coefficient can be multiplied.
当在参考样本线导出模式中选择参考样本线列表参考模式时,参考样本线导出器606可以从参考样本线列表选择参考样本线。When the reference sample line list reference mode is selected in the reference sample line export mode, the reference sample line exporter 606 may select a reference sample line from the reference sample line list.
一个参考样本线列表可以用于左参考样本线和上参考样本线。可选地,对于左参考样本线和上参考样本线,可以存在分开的参考样本线列表。另外,对于不同参考样本线长度,可以存在分开的参考样本线列表。One reference sample line list can be used for the left reference sample line and the upper reference sample line. Optionally, there can be separate reference sample line lists for the left reference sample line and the upper reference sample line. In addition, there can be separate reference sample line lists for different reference sample line lengths.
参考样本线导出器606可以使用与参考样本线列表相同的固定列表。可选地,参考样本线导出器606可以以诸如子帧、帧、帧组等为单位初始化并顺序地更新参考样本线列表。可以如下更新参考样本线列表。The reference sample line exporter 606 may use the same fixed list as the reference sample line list. Alternatively, the reference sample line exporter 606 may initialize and sequentially update the reference sample line list in units such as subframes, frames, frame groups, etc. The reference sample line list may be updated as follows.
例如,重构的预测单元的上边界或者左边界样本线可以被添加到参考样本线列表。另外,使用可变位置参考样本线模式或参考样本线列表参考模式,重构的预测单元的上边界或左边界样本线可以被添加到参考样本线列表。For example, the upper boundary or left boundary sample line of the reconstructed prediction unit can be added to the reference sample line list. In addition, using the variable position reference sample line mode or the reference sample line list reference mode, the upper boundary or left boundary sample line of the reconstructed prediction unit can be added to the reference sample line list.
参考样本线导出器606可以通过解析列表参考索引来选择(左或上)参考样本线,该列表参考索引指示包括在参考样本线列表中的参考样本线中的一个。视频解码装置还可解析权重以将参考样本线的每一像素相乘,或可解析偏移以加到参考样本线的每一像素。The reference sample line deriver 606 may select a (left or top) reference sample line by parsing a list reference index indicating one of the reference sample lines included in the reference sample line list. The video decoding device may also parse a weight to multiply each pixel of the reference sample line, or may parse an offset to add to each pixel of the reference sample line.
现在参考图10和图11,描述了基于参考样本线导出使用帧内预测的视频编码方法和视频解码方法。Referring now to FIG. 10 and FIG. 11 , a video encoding method and a video decoding method using intra prediction derived based on a reference sample line are described.
图10是根据本公开的至少一个实施方式的视频编码方法的流程图。FIG. 10 is a flowchart of a video encoding method according to at least one embodiment of the present disclosure.
视频编码装置确定当前块的帧内预测模式(S1000)。The video encoding apparatus determines an intra prediction mode of a current block (S1000).
视频编码装置确定导出模式索引(S1002)。The video encoding apparatus determines a derivation mode index ( S1002 ).
这里,导出模式索引指示参考样本线导出模式,其是固定位置参考样本线模式、可变位置参考样本线模式和参考样本线列表参考模式中的一个。Here, the derivation mode index indicates a reference sample line derivation mode, which is one of a fixed position reference sample line mode, a variable position reference sample line mode, and a reference sample line list reference mode.
在一个实施例中,视频编码装置确定是否满足如上所述的第一条件到第三条件中的至少一个,且如果满足,确定指示可变位置参考样本线模式或参考样本线列表参考模式的导出模式索引。另一方面,如果第一条件到第三条件中无一个满足,那么视频编码装置确定指示固定位置参考样本线模式的导出模式索引。In one embodiment, the video encoding device determines whether at least one of the first to third conditions described above is satisfied, and if satisfied, determines a derived mode index indicating a variable position reference sample line mode or a reference sample line list reference mode. On the other hand, if none of the first to third conditions is satisfied, the video encoding device determines a derived mode index indicating a fixed position reference sample line mode.
作为另一实施例,在检查是否满足第一条件到第三条件中的至少一个时,且如果满足,视频编码装置可使用可变位置参考样本线模式或参考样本线列表参考模式。在优化编码效率方面,视频编码装置可确定指示这两个模式中的一个的标记。另一方面,如果第一条件到第三条件中的任何一个都不满足,则视频编码装置可使用固定位置参考样本线模式。As another embodiment, when checking whether at least one of the first condition to the third condition is satisfied, and if satisfied, the video encoding device may use the variable position reference sample line mode or the reference sample line list reference mode. In terms of optimizing encoding efficiency, the video encoding device may determine a flag indicating one of the two modes. On the other hand, if any one of the first condition to the third condition is not satisfied, the video encoding device may use the fixed position reference sample line mode.
视频编码装置根据导出模式索引确定参考样本线导出模式(S1004)。The video encoding apparatus determines a reference sample line derivation mode according to the derivation mode index ( S1004 ).
作为另一实施例,视频编码装置可基于第一条件到第三条件是否被满足以及指示可变位置参考样本线模式和参考样本线列表参考模式中的一个的标记来确定参考样本线导出模式。As another embodiment, the video encoding device may determine the reference sample line derivation mode based on whether the first to third conditions are satisfied and a flag indicating one of the variable position reference sample line mode and the reference sample line list reference mode.
视频编码装置根据参考样本线导出模式导出当前块的参考样本线(S1006)。这里,参考样本线包括左参考样本线和上参考样本线。The video encoding apparatus derives a reference sample line of the current block according to the reference sample line deriving mode (S1006). Here, the reference sample line includes a left reference sample line and an upper reference sample line.
如果参考样本线导出模式是固定位置参考样本线模式,则视频编码装置将分别与当前块的左边界和上边界相邻的像素线导出为参考样本线。If the reference sample line derivation mode is the fixed position reference sample line mode, the video encoding apparatus derives pixel lines respectively adjacent to the left boundary and the upper boundary of the current block as reference sample lines.
当参考样本线导出模式是可变位置参考样本线模式时,在优化编码效率方面,视频编码装置确定左参考样本线的导出方法索引和上参考样本线的导出方法索引。这里,左参考样本线的导出方法索引和上参考样本线的导出方法索引均指示如上所述的第一导出方法、第二导出方法、第三导出方法或第四导出方法。When the reference sample line derivation mode is the variable position reference sample line mode, in terms of optimizing encoding efficiency, the video encoding device determines the derivation method index of the left reference sample line and the derivation method index of the upper reference sample line. Here, the derivation method index of the left reference sample line and the derivation method index of the upper reference sample line both indicate the first derivation method, the second derivation method, the third derivation method, or the fourth derivation method as described above.
由于已经描述了根据第一导出方法至第四导出方法通过视频编码装置导出参考样本线的方法,所以省略进一步的详细描述。Since the method of deriving the reference sample line by the video encoding apparatus according to the first to fourth derivation methods has been described, further detailed description is omitted.
如果根据第一导出方法、第二导出方法或第三导出方法导出的参考样本线包括在多个预测单元中,则视频编码装置可执行滤波以去除邻近于多个预测单元之间的边界的像素上的块效应。If the reference sample line derived according to the first, second, or third derivation method is included in a plurality of prediction units, the video encoding apparatus may perform filtering to remove blocking effects on pixels adjacent to a boundary between the plurality of prediction units.
如果参考样本线导出模式是参考样本线列表参考模式,那么视频编码装置确定指示包括在参考样本线列表中的一个参考样本线的参考索引。视频编码装置利用参考索引来从参考样本线列表导出左参考样本线和上参考样本线。If the reference sample line derivation mode is the reference sample line list reference mode, the video encoding apparatus determines a reference index indicating a reference sample line included in the reference sample line list. The video encoding apparatus derives a left reference sample line and an upper reference sample line from the reference sample line list using the reference index.
进一步,视频编码装置可更新参考样本线列表。例如,视频编码装置可将重构的预测单元的上边界样本线或左边界样本线添加到参考样本线列表。Furthermore, the video encoding device may update the reference sample line list. For example, the video encoding device may add the upper boundary sample line or the left boundary sample line of the reconstructed prediction unit to the reference sample line list.
视频编码装置通过使用参考样本线中的参考样本根据帧内预测模式生成当前块的预测块(S1008)。The video encoding apparatus generates a prediction block of the current block according to the intra prediction mode by using reference samples in the reference sample line ( S1008 ).
视频编码装置从当前块减去预测块以产生残余块(S1010)。The video encoding apparatus subtracts a prediction block from a current block to generate a residual block (S1010).
视频编码装置对导出模式索引、帧内预测模式和残余块进行编码(S1012)。The video encoding apparatus encodes the derivation mode index, the intra prediction mode, and the residual block (S1012).
图11是根据本公开的至少一个实施方式的视频解码方法的流程图。FIG. 11 is a flowchart of a video decoding method according to at least one embodiment of the present disclosure.
视频解码装置从比特流解码当前块的残余块、当前块的帧内预测模式和导出模式索引(S1100)。The video decoding apparatus decodes a residual block of a current block, an intra prediction mode of the current block, and a derived mode index from a bitstream ( S1100 ).
这里,导出模式索引指示参考样本线导出模式,其是固定位置参考样本线模式、可变位置参考样本线模式和参考样本线列表参考模式中的一个。Here, the derivation mode index indicates a reference sample line derivation mode, which is one of a fixed position reference sample line mode, a variable position reference sample line mode, and a reference sample line list reference mode.
在一个实施例中,如果满足如上文所描述的第一条件到第三条件中的至少一个,导出模式索引可由视频编码装置编码以指示可变位置参考样本线模式或参考样本线列表参考模式。另一方面,如果第一条件到第三条件中无一霍格满足,导出模式索引可由视频编码装置编码以指示固定位置参考样本线模式。In one embodiment, if at least one of the first to third conditions as described above is satisfied, the derivation mode index may be encoded by the video encoding device to indicate the variable position reference sample line mode or the reference sample line list reference mode. On the other hand, if none of the first to third conditions is satisfied, the derivation mode index may be encoded by the video encoding device to indicate the fixed position reference sample line mode.
作为另一实施例,当检查是否满足第一条件到第三条件中的至少一个时确定其满足时,视频解码装置可使用可变位置参考样本线模式或参考样本线列表参考模式。在这种情况下,视频解码装置可解析指示这两个模式中的一个的标记。另一方面,如果第一条件到第三条件中的任何一个都不满足,则视频解码装置可使用固定位置参考样本线模式。As another embodiment, when at least one of the first condition to the third condition is checked to determine whether it is satisfied, the video decoding device may use the variable position reference sample line mode or the reference sample line list reference mode. In this case, the video decoding device may parse a flag indicating one of the two modes. On the other hand, if any one of the first condition to the third condition is not satisfied, the video decoding device may use the fixed position reference sample line mode.
视频解码装置根据导出模式索引确定参考样本线导出模式(S1102)。The video decoding apparatus determines a reference sample line derivation mode according to a derivation mode index ( S1102 ).
作为另一实施例,视频解码装置可基于第一条件到第三条件是否满足以及指示可变位置参考样本线模式和参考样本线列表参考模式中的一个的标记来确定参考样本线导出模式。As another embodiment, the video decoding device may determine the reference sample line derivation mode based on whether the first to third conditions are satisfied and a flag indicating one of the variable position reference sample line mode and the reference sample line list reference mode.
视频解码装置根据参考样本线导出模式导出当前块的参考样本线(S1104)。这里,参考样本线包括左参考样本线和上参考样本线。The video decoding apparatus derives a reference sample line of the current block according to a reference sample line deriving mode (S1104). Here, the reference sample line includes a left reference sample line and an upper reference sample line.
如果参考样本线导出模式是固定位置参考样本线模式,则视频解码装置将分别与当前块的左边界和上边界相邻的像素线导出为参考样本线。If the reference sample line derivation mode is the fixed position reference sample line mode, the video decoding apparatus derives pixel lines respectively adjacent to the left boundary and the upper boundary of the current block as reference sample lines.
当参考样本线导出模式是可变位置参考样本线模式时,视频解码装置从比特流解码左参考样本线的导出方法索引和上参考样本线的导出方法索引。左参考样本线的导出方法索引和上参考样本线的导出方法索引均指示其第一导出方法、第二导出方法、第三导出方法或第四导出方法。When the reference sample line derivation mode is the variable position reference sample line mode, the video decoding apparatus decodes a derivation method index of a left reference sample line and a derivation method index of an upper reference sample line from a bitstream. The derivation method index of the left reference sample line and the derivation method index of the upper reference sample line both indicate their first derivation method, second derivation method, third derivation method, or fourth derivation method.
由于已经描述了根据第一导出方法至第四导出方法通过视频解码装置导出参考样本线的方法,所以省略进一步的详细描述。Since the method of deriving the reference sample line by the video decoding apparatus according to the first to fourth derivation methods has been described, further detailed description is omitted.
如果根据第一导出方法、第二导出方法或第三导出方法导出的参考样本线包括在多个预测单元中,则视频解码装置可执行滤波以去除邻近于多个预测单元之间的边界的像素上的块效应。If the reference sample line derived according to the first, second, or third derivation method is included in a plurality of prediction units, the video decoding apparatus may perform filtering to remove blocking effects on pixels adjacent to a boundary between the plurality of prediction units.
如果参考样本线导出模式是参考样本线列表参考模式,视频解码装置从比特流解码指示包括在参考样本线列表中的一个参考样本线的参考索引。视频解码装置利用参考索引来从参考样本线列表导出左参考样本线和上参考样本线。If the reference sample line derivation mode is the reference sample line list reference mode, the video decoding apparatus decodes a reference index indicating a reference sample line included in the reference sample line list from the bitstream, and the video decoding apparatus uses the reference index to derive a left reference sample line and an upper reference sample line from the reference sample line list.
进一步,视频解码装置可更新参考样本线列表。例如,视频解码装置可将重构的预测单元的上边界样本线或左边界样本线添加到参考样本线列表。Furthermore, the video decoding device may update the reference sample line list. For example, the video decoding device may add the upper boundary sample line or the left boundary sample line of the reconstructed prediction unit to the reference sample line list.
视频解码装置使用参考样本线中的参考样本根据帧内预测模式产生当前块的预测块(S1106)。The video decoding apparatus generates a prediction block of the current block according to the intra prediction mode using the reference samples in the reference sample line ( S1106 ).
视频解码装置将预测块和残余块相加以重构当前块(S1108)。The video decoding apparatus adds the prediction block and the residual block to reconstruct the current block (S1108).
尽管各个流程图中的步骤被描述为被顺序地执行,但是这些步骤仅仅例示本公开的一些实施方式的技术构思。因此,本公开所属领域的普通技术人员可通过改变在各个附图中描述的顺序或者通过并行执行两个以上的步骤来执行步骤。所以,各个流程图中的步骤不限于所示的时间序列顺序。Although the steps in each flowchart are described as being performed sequentially, these steps are merely illustrative of the technical concepts of some embodiments of the present disclosure. Therefore, a person of ordinary skill in the art to which the present disclosure belongs can perform the steps by changing the order described in each figure or by performing two or more steps in parallel. Therefore, the steps in each flowchart are not limited to the time series order shown.
应当理解,以上描述呈现了可以以不同其他方式实现的说明性实施方式。在一些实施方式中描述的功能可以通过硬件、软件、固件和/或其组合来实现。还应理解,本公开中描述的功能部件用“…单元”标记,以强烈强调它们独立实现的可能性。It should be understood that the above description presents illustrative embodiments that can be implemented in different other ways. The functions described in some embodiments can be implemented by hardware, software, firmware and/or a combination thereof. It should also be understood that the functional components described in this disclosure are marked with "... units" to strongly emphasize the possibility of their independent implementation.
同时,在一些实施方式中描述的各种方法或功能可以被实现为存储在能够由一个或多个处理器读取和执行的非暂时性记录介质中的指令。例如,非暂时性记录介质可包括各种类型的记录装置,其中,数据以计算机系统可读的形式存储。例如,非暂时性记录介质可包括存储介质,诸如可擦除可编程只读存储器(EPROM)、闪存驱动器、光盘驱动器、磁性硬盘驱动器和固态驱动器(SSD)等。At the same time, the various methods or functions described in some embodiments may be implemented as instructions stored in a non-transitory recording medium that can be read and executed by one or more processors. For example, the non-transitory recording medium may include various types of recording devices, in which data is stored in a form readable by a computer system. For example, the non-transitory recording medium may include a storage medium such as an erasable programmable read-only memory (EPROM), a flash drive, an optical drive, a magnetic hard drive, and a solid-state drive (SSD).
尽管出于说明性的目的已经描述了本公开的实施方式,然而,本公开所属领域的普通技术人员应当认识到,在不背离本公开的构思和范围的情况下,各种修改、添加、以及替换是可能的。因此,为了简洁和清晰起见,已经描述了本公开的实施方式。本公开的实施方式的技术构思的范围不受图示的限制。因此,本公开所属领域的普通技术人员应当理解,本公开的范围不应受上述明确描述的实施方式的限制,而是受权利要求及其等同物的限制。Although the embodiments of the present disclosure have been described for illustrative purposes, however, those of ordinary skill in the art to which the present disclosure belongs should recognize that various modifications, additions, and substitutions are possible without departing from the concept and scope of the present disclosure. Therefore, for the sake of brevity and clarity, the embodiments of the present disclosure have been described. The scope of the technical concept of the embodiments of the present disclosure is not limited by the diagrams. Therefore, those of ordinary skill in the art to which the present disclosure belongs should understand that the scope of the present disclosure should not be limited by the embodiments explicitly described above, but by the claims and their equivalents.
(参考标号)(reference number)
122 帧内预测器122 Intra-frame predictor
542 帧内预测器542 Intra-frame predictor
602 子块分裂-无分裂信息导出器602 Sub-block Split-No Split Information Exporter
604 预测模式导出器604 Prediction Model Exporter
606 参考样本线导出器606 Reference Sample Line Exporter
606 预测执行器。606 Prediction Executor.
相关申请Related Applications
本申请要求于2021年12月6日提交的韩国专利申请第10-2021-0172938号和于2022年10月11日提交的韩国专利申请第10-2022-0129732号的优先权和权益,通过引用将其每一个的全部公开内容结合在此。This application claims priority to and the benefit of Korean Patent Application No. 10-2021-0172938, filed on December 6, 2021, and Korean Patent Application No. 10-2022-0129732, filed on October 11, 2022, the entire disclosure of each of which is incorporated herein by reference.
Claims (14)
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR10-2021-0172938 | 2021-12-06 | ||
| KR1020220129732A KR20230085063A (en) | 2021-12-06 | 2022-10-11 | Method And Apparatus for Video Coding Using Intra Prediction Based on Derivation of Reference Sample Line |
| KR10-2022-0129732 | 2022-10-11 | ||
| PCT/KR2022/015401 WO2023106599A1 (en) | 2021-12-06 | 2022-10-12 | Video coding method and device, using intra prediction based on reference sample line derivation |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN118355658A true CN118355658A (en) | 2024-07-16 |
Family
ID=91817453
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202280080888.7A Pending CN118355658A (en) | 2021-12-06 | 2022-10-12 | Video coding method and apparatus with intra prediction derived based on reference sample lines |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN118355658A (en) |
-
2022
- 2022-10-12 CN CN202280080888.7A patent/CN118355658A/en active Pending
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR20220125175A (en) | Video coding method and apparatus using intra prediction mode derivation | |
| CN118018718A (en) | Video encoding and decoding equipment and equipment for transmitting bit streams | |
| KR20230160171A (en) | Method And Apparatus for Video Coding Using Fusion Cross-component Prediction | |
| US12401791B2 (en) | Method for generating prediction block by using weighted-sum of intra prediction signal and inter prediction signal, and device using same | |
| KR20220118351A (en) | Method for generating prediction block using weighted sum of intra prediction signal and inter prediction signal and apparatus using the same | |
| KR20240113380A (en) | Method and Apparatus for Video Coding Using Far Reference Pixels | |
| JP2024531644A (en) | Video encoding/decoding method and apparatus | |
| US12120320B2 (en) | Method for adaptively setting resolution, and image decoding apparatus | |
| US20240430451A1 (en) | Method for adaptively setting resolution, and image decoding apparatus | |
| US20260075199A1 (en) | Method and apparatus for video coding that adaptively uses single tree and dual tree in one block | |
| CN121128179A (en) | Methods for extrapolation-based intra-frame prediction | |
| CN119213775A (en) | Method and device for storing motion vector of intra-frame prediction block | |
| US12363279B2 (en) | Method for predicting quantization parameter used in a video encoding/decoding apparatus | |
| KR20240085156A (en) | Method and Apparatus for Video Coding Using Intra Mode Derivation Based on Adaptive Template | |
| CN119999212A (en) | Method and apparatus for video encoding using multi-block based intra-frame template matching prediction | |
| KR20230085063A (en) | Method And Apparatus for Video Coding Using Intra Prediction Based on Derivation of Reference Sample Line | |
| CN117917071A (en) | Method and apparatus for video coding using sub-block coding order change and intra prediction according to sub-block coding order change | |
| CN116918323A (en) | Video encoding and decoding method and apparatus for improving prediction signals of intra-frame prediction | |
| CN118355658A (en) | Video coding method and apparatus with intra prediction derived based on reference sample lines | |
| US12610038B2 (en) | Method and apparatus for video coding using intra prediction based on reference sample line derivation | |
| KR20250151991A (en) | Method for Coding Intra Prediction Mode using History-based MPM Candidate and Apparatus using the same | |
| KR20240037834A (en) | Method and Apparatus for Video Coding Using Single Tree and Dual Tree Adaptively in Single Block | |
| CN118414831A (en) | Video encoding method and apparatus using candidate list of motion vector predictors | |
| KR20240163541A (en) | Method and Apparatus for Video Encoding and Decoding Using Combined Intra/Inter Prediction Mode Based on Subblock | |
| CN118160304A (en) | Video encoding method and apparatus using various block division structures |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination |