CN119815012B

CN119815012B - Method and device for processing image block in immersive media coding

Info

Publication number: CN119815012B
Application number: CN202311299927.5A
Authority: CN
Inventors: 白雨箫; 虞露
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2023-10-09
Filing date: 2023-10-09
Publication date: 2025-12-09
Anticipated expiration: 2043-10-09
Also published as: CN119815012A

Abstract

This invention provides a method for processing image blocks in immersive media encoding, comprising: for at least one image block in a multi-view image, for at least one segmentation line that runs through and is perpendicular to the boundary of the image block, calculating the effective information density of image sub-blocks on both sides of the segmentation line, wherein the effective information is the additional information of the image sub-block relative to other viewpoints in the multi-view image; when the image sub-blocks on both sides of the segmentation line meet a cutting condition, wherein the cutting condition is that the effective information density of one image sub-block is greater than a threshold and the effective information density of the other image sub-block is less than a threshold: cutting along the segmentation line to divide the image block into two image blocks; for an image block containing image sub-blocks with an effective information density less than a threshold, if there is no segmentation line that meets the cutting condition, then the image block is discarded and not assembled into the stitching image; otherwise: no cutting is performed.

Description

Method and device for processing image block in immersive media coding

Technical Field

The invention relates to the field of immersive media coding, in particular to a method and a device for processing multi-view image blocks.

Background

"Immersion" is a subjective assessment that refers to the viewer's sense of being placed inside a virtual scene created and displayed by a multimedia system. As the capabilities of acquisition and display devices have grown year by year, immersive media, a form of visual multimedia that gives viewers a strong sense of immersion, has become a research hotspot in both industry and academia. Multi-view images plus depth information is an effective immersive media representation; it consists of texture images from multiple views and a depth image corresponding to each texture image. Using depth-image-based viewpoint synthesis, this representation can render an image of a target viewpoint from the camera parameters of the target image and the positional relationship between the target viewpoint and the existing viewpoints.

Because considerable information redundancy exists among the multiple views, coding and transmitting a stitched image of multi-view sub-blocks can significantly reduce the pixel rate. Before encoding, the geometric and texture relations among the views are analysed, and a main view (one of the images in the multi-view set that contains complete view information) is used to eliminate as many redundant pixels as possible from the other views, so that the views other than the main view retain only their specific effective information. For coding efficiency, the retained effective pixels are aggregated into rectangular sub-blocks, and the resulting image blocks are assembled into a stitched image for coding and transmission. The decoder uses the decoded image block information to extract all image blocks from the decoded multi-view stitched image, and finally synthesizes the target image block by block.

However, as the field of view of video content grows, the increasing number of source viewpoints exposes the limited processing capability of stitched-image-based coding and transmission schemes when the input data volume is large. In many industrial scenarios the pixel rate of the stitched image is limited, and the necessary sub-block images that remain after inter-view redundancy removal cannot all be placed in an atlas of limited size. The missing sub-block images cause information loss at decoding, so high-quality video cannot be synthesized. To decide which image blocks matter most, the concept of the effective information amount of a pixel is introduced; it measures the degree to which the current pixel does not overlap with pixels in other viewpoints. The higher the effective information amount, the less well the pixel is represented by pixels in other viewpoints, and the more its unique information should be retained in the stitched image.

More specifically, the effective information generally includes two channels, geometric information and texture information. The values of the two channels are independent of each other and are computed as the logarithms of the disparity difference and the luminance difference between the current pixel and the pixel synthesized at the current pixel position from reference pixels in other viewpoints. If the current pixel has no reference pixel, the information amount takes its maximum value, which is determined by the bit widths of the disparity and the luminance. Specifically, with the disparity difference between the current pixel and the reference pixel denoted d_g and the luminance difference denoted d_t, the effective information ρ is:

ρ = log d_g + log d_t

The maximum value ρ_max of the effective information is determined by the geometric bit width b_g and the luminance bit width b_t of the video.

Since the retained pixels are aggregated into image sub-blocks, the effective information of an image sub-block is calculated by accumulating the effective information values of all effective pixels in it:

ρ_block = ∑_{pixel∈block} ρ_pixel

The concept of effective information density μ is further introduced. It is the effective information of a sub-block divided by the number of effective pixels in that sub-block, written here as N:

μ = ρ_block / N
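As a rough illustration of the quantities defined above, the following Python sketch computes the per-pixel effective information, accumulates it over a sub-block, and divides by the number of effective pixels. The function names, the base-2 logarithm, the clamping of differences below 1 up to 1 (so that the logarithm stays defined and non-negative), and the caller-supplied `rho_max` are all assumptions made for this example; the source does not fix any of them.

```python
import math
from typing import Optional, Sequence, Tuple

# One effective pixel: (d_g, d_t) differences against the reference pixel,
# or None when no reference pixel exists in the other viewpoints.
Pixel = Optional[Tuple[float, float]]


def pixel_rho(pixel: Pixel, rho_max: float) -> float:
    """Effective information rho = log d_g + log d_t (base 2 is an assumption)."""
    if pixel is None:
        return rho_max  # no reference pixel: maximum effective information
    d_g, d_t = pixel
    # Assumption: clamp each difference to at least 1 so log2 is defined and >= 0.
    return math.log2(max(d_g, 1.0)) + math.log2(max(d_t, 1.0))


def block_rho(pixels: Sequence[Pixel], rho_max: float) -> float:
    """Sum of the effective information of all effective pixels in a sub-block."""
    return sum(pixel_rho(p, rho_max) for p in pixels)


def block_density(pixels: Sequence[Pixel], rho_max: float) -> float:
    """Effective information density mu = rho_block / number of effective pixels."""
    n = len(pixels)
    return block_rho(pixels, rho_max) / n if n else 0.0


# Hypothetical sub-block with three effective pixels, one without a reference.
example_pixels = [(4.0, 2.0), (1.0, 1.0), None]
example_rho = block_rho(example_pixels, rho_max=16.0)
example_mu = block_density(example_pixels, rho_max=16.0)
```

Passing `rho_max` in from the caller keeps the sketch independent of how the maximum is derived from the bit widths b_g and b_t.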

During sub-block image stitching, the effective information density μ of each sub-block serves as the selection criterion: under the pixel rate limit, image sub-blocks with high effective information density are retained first, which raises the information utilization rate and improves rendering quality at the decoder.

However, effective information may be unevenly distributed within a single image sub-block. When the sub-blocks to retain are chosen by effective information density μ, a local region of high effective information density may be discarded along with the rest of its sub-block, or a local region of low effective information density may be retained along with a kept sub-block, which wastes pixel rate. An image sub-block cutting method based on the distribution of effective information is therefore needed, so that after cutting the effective information within each sub-block is as uniform as possible; this raises the information rate when the stitched image is organized and further improves coding efficiency. The difficulty lies in making the cutting sufficiently thorough while reducing ineffective cuts, and in particular in not cutting when the effective information densities of both sub-blocks exceed the retention threshold, so that the continuity of the image region is preserved and coding efficiency is not harmed.
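The problem of uneven distribution can be made concrete with a small arithmetic example. The numbers below are invented purely for illustration: a sub-block whose two halves have very different densities is judged as a whole against a hypothetical retention threshold.

```python
# Hypothetical sub-block made of two halves with equal effective pixel counts.
left_rho, left_pixels = 1000.0, 100    # left half: density 10.0
right_rho, right_pixels = 100.0, 100   # right half: density 1.0

left_density = left_rho / left_pixels
right_density = right_rho / right_pixels
whole_density = (left_rho + right_rho) / (left_pixels + right_pixels)

threshold = 6.0  # hypothetical retention threshold

# Judged as a whole, the sub-block falls below the threshold and would be dropped,
# even though its left half alone is well above the threshold.
whole_kept = whole_density > threshold
left_kept_if_cut = left_density > threshold
right_kept_if_cut = right_density > threshold
```

Here the whole sub-block has density 5.5 and would be discarded, taking the high-density left half with it; cutting first lets the left half be kept and only the right half be dropped.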

Disclosure of Invention

In order to solve the technical problems, the invention provides a method and a device for processing image blocks in immersive media coding.

For at least one image block in the multi-view image, and for at least one dividing line in the image block that runs through the image block and is perpendicular to its boundary, the effective information densities of the image sub-blocks on the two sides of the dividing line are calculated respectively, wherein the effective information is the additional information of the image sub-blocks relative to other viewpoints in the multi-view image.
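One way to picture this step is to enumerate every straight line that spans a rectangular image block and to compute the density on each side. The sketch below is only an assumed reading: it stores per-pixel effective information in a 2D list `rho` with a boolean mask `valid` marking effective pixels, and it treats the two sides as the whole regions on either side of the line. The data layout and the names `region_density` and `candidate_lines` are hypothetical.

```python
from typing import List, Tuple

Grid = List[List[float]]
Mask = List[List[bool]]


def region_density(rho: Grid, valid: Mask, r0: int, r1: int, c0: int, c1: int) -> float:
    """Density of rows r0..r1-1, columns c0..c1-1, counting effective pixels only."""
    total, count = 0.0, 0
    for r in range(r0, r1):
        for c in range(c0, c1):
            if valid[r][c]:
                total += rho[r][c]
                count += 1
    # Assumption: a side without effective pixels is given density 0.
    return total / count if count else 0.0


def candidate_lines(rho: Grid, valid: Mask) -> List[Tuple[str, int, float, float]]:
    """Return (orientation, position, density_first_side, density_second_side)."""
    h, w = len(rho), len(rho[0])
    lines = []
    for c in range(1, w):  # vertical lines run through the block top to bottom
        lines.append(("vertical", c,
                      region_density(rho, valid, 0, h, 0, c),
                      region_density(rho, valid, 0, h, c, w)))
    for r in range(1, h):  # horizontal lines run through the block left to right
        lines.append(("horizontal", r,
                      region_density(rho, valid, 0, r, 0, w),
                      region_density(rho, valid, r, h, 0, w)))
    return lines


# Hypothetical 2x4 block: low information on the left, high on the right.
rho = [[1.0, 1.0, 9.0, 9.0],
       [1.0, 1.0, 9.0, 9.0]]
valid = [[True] * 4 for _ in range(2)]
lines = candidate_lines(rho, valid)
```

For this block the vertical line at column 2 separates a side of density 1.0 from a side of density 9.0, which is the kind of line the cutting condition is meant to find.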

When the image sub-blocks on the two sides of the dividing line meet the cutting condition, namely that the effective information density of the image sub-block on one side is greater than a threshold and that of the image sub-block on the other side is less than the threshold, the image block is cut along the dividing line into two image blocks. For an image block that contains an image sub-block whose effective information density is less than the threshold, if no dividing line meets the cutting condition, the image block is discarded and is not assembled into the stitched image. Otherwise, the image block is not cut.
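The three outcomes described above (cut, discard, keep uncut) can be sketched as a small decision function. The function names and the representation of a block as a list of per-line density pairs plus a list of sub-block densities are assumptions made for this example.

```python
from typing import List, Tuple


def meets_cutting_condition(mu_a: float, mu_b: float, threshold: float) -> bool:
    """True when one side is strictly above the threshold and the other strictly below."""
    return min(mu_a, mu_b) < threshold < max(mu_a, mu_b)


def decide(line_densities: List[Tuple[float, float]],
           sub_block_densities: List[float],
           threshold: float) -> str:
    """Return 'cut', 'discard' or 'keep' for one image block."""
    if any(meets_cutting_condition(a, b, threshold) for a, b in line_densities):
        return "cut"
    if any(mu < threshold for mu in sub_block_densities):
        # Contains a low-density sub-block, but no line separates it from a high one.
        return "discard"
    return "keep"  # everything is above the threshold: leave the block intact


# Hypothetical densities for three sub-blocks A, B, C laid out in a row.
mu_a, mu_b, mu_c = 1.0, 5.0, 9.0
decision_cut = decide([(mu_a, mu_b), (mu_b, mu_c)], [mu_a, mu_b, mu_c], threshold=3.0)
decision_discard = decide([(mu_a, mu_b)], [mu_a, mu_b], threshold=7.0)
decision_keep = decide([(mu_a, mu_b), (mu_b, mu_c)], [mu_a, mu_b, mu_c], threshold=0.5)
```

The strict inequalities mean that a side whose density equals the threshold satisfies neither half of the condition; how such ties should be handled is not stated in the source.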

Further, for an image block obtained by cutting that contains an image sub-block whose effective information density is less than the threshold, if a dividing line meeting the cutting condition exists, the image block is cut again.

Further, for an image block with several dividing lines whose two-side image sub-blocks meet the cutting condition, the dividing line with the largest difference in effective information density between the image sub-blocks on its two sides is selected as the actual dividing line.
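The selection rule can be sketched as a search over the candidate lines that keeps the qualifying line with the largest density gap. The function name `pick_dividing_line` and the list-of-pairs input are hypothetical, and the behaviour on ties (the first such line wins) is an assumption.

```python
from typing import List, Optional, Tuple


def pick_dividing_line(lines: List[Tuple[float, float]],
                       threshold: float) -> Optional[int]:
    """Index of the qualifying line with the largest density difference, or None."""
    best_index: Optional[int] = None
    best_gap = -1.0
    for i, (mu_a, mu_b) in enumerate(lines):
        qualifies = min(mu_a, mu_b) < threshold < max(mu_a, mu_b)
        gap = abs(mu_a - mu_b)
        if qualifies and gap > best_gap:  # strict '>' keeps the first line on ties
            best_index, best_gap = i, gap
    return best_index


# Hypothetical block with sub-blocks B, A, C in a row (densities 5, 1, 9).
# Line 0 is the B|A boundary, line 1 is the A|C boundary.
chosen = pick_dividing_line([(5.0, 1.0), (1.0, 9.0)], threshold=3.0)
none_qualify = pick_dividing_line([(5.0, 9.0)], threshold=3.0)
```

With these values both boundaries qualify, and the A|C boundary is chosen because its gap of 8 exceeds the B|A gap of 4.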

Further, when the image sub-blocks on the two sides of the dividing line meet the cutting condition, the threshold is updated according to the pixel rate limit of the stitched image and the effective information densities of the image blocks that have not been discarded.
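The source does not spell out the update formula at this point. One possible reading, consistent with the embodiments' statement that the updated threshold is the minimum effective information density of the non-discarded blocks within the pixel rate limit, is sketched below: blocks are admitted in order of decreasing density until the pixel budget is exhausted, and the threshold becomes the density of the last block admitted. The greedy ordering, the `fallback` argument and the function name are assumptions.

```python
from typing import List, Tuple


def update_threshold(candidates: List[Tuple[float, int]],
                     pixel_budget: int,
                     fallback: float) -> float:
    """candidates: (density, pixel_count) of each non-discarded block or sub-block."""
    used = 0
    threshold = fallback  # returned unchanged if not even the densest block fits
    for mu, n_pixels in sorted(candidates, key=lambda c: c[0], reverse=True):
        if used + n_pixels > pixel_budget:
            break  # the budget is exhausted; everything below this density is out
        used += n_pixels
        threshold = mu  # minimum density among the blocks admitted so far
    return threshold


# Hypothetical non-discarded blocks of 100 pixels each with densities 9, 5 and 2.
blocks = [(9.0, 100), (5.0, 100), (2.0, 100)]
new_threshold = update_threshold(blocks, pixel_budget=250, fallback=3.0)
```

With a budget of 250 pixels only the two densest blocks fit, so the threshold moves to 5.0 under this reading.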

Further, the invention provides a device for processing image blocks, which comprises the following modules:

The effective information density calculating module is used for calculating, for a dividing line that runs through an image block of the multi-view image and is perpendicular to its boundary, the effective information densities of the image sub-blocks on the two sides of the dividing line, wherein the effective information is the additional information of the image sub-blocks relative to other viewpoints in the multi-view image.

The threshold judging module is used for judging whether, of the effective information densities of the image sub-blocks on the two sides of the dividing line, one is greater than the threshold and the other is less than the threshold; cutting is performed when this condition is met and is not performed otherwise.

The dividing line decision module is used for selecting, when an image block has several dividing lines whose two-side image sub-blocks meet the cutting condition, the dividing line with the largest difference in effective information density between the image sub-blocks on its two sides as the actual dividing line.

The cutting module is used for cutting along the dividing line when the cutting condition is met.

The assembly and stitching decision module is used for discarding an image block that contains an image sub-block whose effective information density is less than the threshold and that has no dividing line meeting the cutting condition, so that the image block is not assembled into the stitched image.

The threshold updating module is used for updating the threshold according to the pixel rate limit of the stitched image and the effective information densities of the image blocks that have not been discarded, when the image sub-blocks on the two sides of the dividing line meet the cutting condition.

Further, the image block cutting device cuts again an image block that contains an image sub-block whose effective information density is less than the threshold, if a dividing line meeting the cutting condition exists.

The method has the following advantages. By judging at the granularity of image sub-blocks, image blocks can be cut thoroughly and discarded more accurately, so that image blocks with higher effective information density are organized into the stitched image under the limited pixel rate during multi-view stitching; this raises the information utilization rate of the stitched image and improves rendering quality at the decoder. At the same time, because cutting is controlled by the threshold, ineffective cuts are avoided when nothing needs to be discarded, which saves coding bit rate. After each cut the threshold is updated according to the pixel rate limit and the effective information densities of the non-discarded image sub-blocks, which makes the threshold more accurate and improves overall coding efficiency.

Drawings

The principles of the present invention may be explained by the following examples with reference to the drawings.

The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this specification, illustrate only the application and, together with the description, serve to explain the principles of the application.

FIG. 1 is a schematic diagram of three image sub-blocks A, B, C of image block ABC in one embodiment of the invention.

FIG. 2 is a schematic diagram of three image sub-blocks A, B, C of image block ABC in another embodiment of the invention.

FIG. 3 is a schematic diagram of three image sub-blocks A, B, C of image block ABC in another embodiment of the invention.

Fig. 4 is a schematic diagram of a cutting result according to an embodiment of the present invention, wherein the discarded image blocks are within a dashed box.

Fig. 5 is a schematic diagram of a cutting result according to another embodiment of the present invention, wherein the discarded image blocks are within the dashed box.

Fig. 6 is a schematic diagram of a cutting result according to another embodiment of the present invention, wherein the discarded image blocks are within the dashed box.

Fig. 7 is a schematic diagram of a cutting result according to another embodiment of the present invention, wherein the discarded image blocks are within the dashed box.

Fig. 8 is a schematic diagram of a cutting result according to another embodiment of the present invention, wherein the discarded image blocks are within the dashed box.

Fig. 9 is a schematic diagram of a cutting result according to another embodiment of the present invention, wherein the discarded image blocks are within the dashed box.

Detailed Description

Example 1

As shown in Fig. 1, A, B and C are three image sub-blocks of image block ABC; the colors of the image sub-blocks, from light to dark, represent effective information density from low to high.

In this example, the initial threshold lies between the effective information densities of image sub-block A and image sub-block B.

The common boundary of image sub-block A and image sub-block B is a dividing line that runs through the image block and is perpendicular to its boundary. The effective information density of image sub-block A is less than the threshold and that of image sub-block B is greater than the threshold, so the cutting condition is met.

Image block ABC is therefore cut into image block A and image block BC along the common boundary of image sub-block A and image sub-block B. Because the effective information density of image block A is less than the threshold and image block A has no dividing line meeting the cutting condition, image block A is discarded and is not assembled into the stitched image.

Image block ABC is removed, image block BC is added, and the threshold is recalculated; the updated threshold is the minimum effective information density among the non-discarded image blocks and image sub-blocks within the pixel rate limit of the stitched image.

In this example, the updated threshold is less than the effective information density of image sub-block B.

The above process is repeated for image block BC generated by the cut. The common boundary of image sub-block B and image sub-block C is a dividing line that runs through the image block and is perpendicular to its boundary, but the effective information densities of image sub-block B and image sub-block C are both greater than the threshold, so the cutting condition is not met.

The cutting terminates. As shown in Fig. 4, where the discarded image block lies within the dashed box, image block ABC is finally cut into image block A and image block BC, and image block A is discarded and does not enter the stitched image.
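The flow of this example can be traced with a short simulation. The sketch below makes several simplifying assumptions that are not in the source: sub-blocks are identified by single letters laid out in a row, each boundary compares the two adjacent sub-blocks (as the example does), densities are invented numbers, and the threshold after a cut is passed in as a fixed value rather than recomputed from a pixel rate limit.

```python
from typing import Dict, List, Tuple


def line_qualifies(mu_a: float, mu_b: float, t: float) -> bool:
    return min(mu_a, mu_b) < t < max(mu_a, mu_b)


def process(block: str, mu: Dict[str, float],
            threshold: float, updated_threshold: float) -> Tuple[List[str], List[str]]:
    """Return (kept blocks, discarded blocks) for a row of named sub-blocks."""
    kept: List[str] = []
    discarded: List[str] = []
    pending = [(block, threshold)]
    while pending:
        names, t = pending.pop()
        # Boundary i lies between names[i - 1] and names[i].
        lines = [i for i in range(1, len(names))
                 if line_qualifies(mu[names[i - 1]], mu[names[i]], t)]
        if lines:
            cut = max(lines, key=lambda i: abs(mu[names[i - 1]] - mu[names[i]]))
            # Simplification: both pieces are re-examined with the updated threshold.
            pending.append((names[:cut], updated_threshold))
            pending.append((names[cut:], updated_threshold))
        elif any(mu[n] < t for n in names):
            discarded.append(names)  # low-density content and no qualifying line
        else:
            kept.append(names)       # all sub-blocks above the threshold: no cut
    return kept, discarded


# Hypothetical densities: initial threshold 3 lies between A and B,
# updated threshold 4 is below the density of B.
densities = {"A": 1.0, "B": 5.0, "C": 9.0}
kept, discarded = process("ABC", densities, threshold=3.0, updated_threshold=4.0)
```

With these invented values the call returns `(["BC"], ["A"])`, matching the outcome described above: image block A is discarded and image block BC is kept uncut.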

Example 2

In this example, the initial threshold lies between the effective information densities of image sub-block B and image sub-block C.

The common boundary of image sub-block A and image sub-block B is a dividing line that runs through the image block and is perpendicular to its boundary, but the effective information densities of image sub-block A and image sub-block B are both less than the threshold, so the cutting condition is not met there.

Since the threshold lies between the effective information densities of image sub-block B and image sub-block C, their common boundary meets the cutting condition, and image block ABC is cut into image block AB and image block C along it. Image block AB contains image sub-block A, whose effective information density is less than the threshold; because the effective information densities of image sub-blocks A and B are both less than the threshold, no dividing line meeting the cutting condition exists, so image block AB is discarded and is not assembled into the stitched image.

The cutting terminates. As shown in Fig. 5, where the discarded image block lies within the dashed box, image block ABC is finally cut into image block AB and image block C, and image block AB is discarded and does not enter the stitched image.

Image block ABC is removed, image block C is added, and the threshold is recalculated; the updated threshold is the minimum effective information density among the non-discarded image blocks and image sub-blocks within the pixel rate limit of the stitched image.

Example 3

In this example, the initial threshold is less than the effective information density of image sub-block A.

The common boundary of image sub-block A and image sub-block B is a dividing line that runs through the image block and is perpendicular to its boundary, but the effective information densities of image sub-block A and image sub-block B are both greater than the threshold, so the cutting condition is not met.

Therefore no cutting is performed. The cutting terminates and the threshold is not updated.

In this way, ineffective segmentation of image block ABC is avoided and coding bit rate is saved.

Example 4

The common boundary of image sub-block A and image sub-block B is a dividing line that runs through the image block and is perpendicular to its boundary; the effective information density of image sub-block A is less than the threshold and that of image sub-block B is greater than the threshold, so the cutting condition is met. The common boundary of image sub-block B and image sub-block C is also a dividing line that runs through the image block and is perpendicular to its boundary, but the effective information densities of image sub-block B and image sub-block C are both greater than the threshold, so the cutting condition is not met there. Image block ABC is therefore cut into image block A and image block BC, image block A is discarded, and the threshold is recalculated.

In this example, the updated threshold lies between the effective information densities of image sub-block B and image sub-block C.

The above process is repeated for image block BC generated by the cut. The common boundary of image sub-block B and image sub-block C is a dividing line that runs through the image block and is perpendicular to its boundary; the effective information density of image sub-block B is less than the threshold and that of image sub-block C is greater than the threshold, so the cutting condition is met.

Image block BC is therefore cut into image block B and image block C along the common boundary of image sub-block B and image sub-block C. Because the effective information density of image block B is less than the threshold and image block B has no dividing line meeting the cutting condition, image block B is discarded and is not assembled into the stitched image.

The cutting terminates. As shown in Fig. 6, where the discarded image blocks lie within the dashed box, image block ABC is finally cut into image block A, image block B and image block C, and image block A and image block B are discarded and do not enter the stitched image.

Image block BC is removed, image block C is added, and the threshold is recalculated; the updated threshold is the minimum effective information density among the non-discarded image blocks and image sub-blocks within the pixel rate limit of the stitched image.

According to the method, after each cutting, the threshold value is updated according to the limitation of the pixel rate and the effective information density of the image sub-blocks which are not discarded, so that the threshold value is more accurate, the cutting is more sufficient, and the overall coding efficiency is increased.

Example 5

As shown in Fig. 2, A, C and B are three image sub-blocks of image block ACB; the colors of the image sub-blocks, from light to dark, represent effective information density from low to high.

The common boundary of image sub-block A and image sub-block C is a dividing line that runs through the image block and is perpendicular to its boundary. The effective information density of image sub-block A is less than the threshold and that of image sub-block C is greater than the threshold, so the cutting condition is met.

The effective information density difference between image sub-block A and image sub-block C is the larger one, so the common boundary of image sub-block A and image sub-block C is taken as the actual dividing line, and image block ACB is cut into image block A and image block CB. Because the effective information density of image block A is less than the threshold and image block A has no dividing line meeting the cutting condition, image block A is discarded and is not assembled into the stitched image.

Image block ACB is removed, image block CB is added, and the threshold is recalculated; the updated threshold is the minimum effective information density among the non-discarded image blocks and image sub-blocks within the pixel rate limit of the stitched image.

In this example, the updated threshold lies between the effective information densities of image sub-block C and image sub-block B.

The above process is repeated for image block CB generated by the cut. The common boundary of image sub-block C and image sub-block B is a dividing line that runs through the image block and is perpendicular to its boundary; the effective information density of image sub-block B is less than the threshold and that of image sub-block C is greater than the threshold, so the cutting condition is met.

Image block CB is therefore cut into image block C and image block B along the common boundary of image sub-block C and image sub-block B. Because the effective information density of image block B is less than the threshold and image block B has no dividing line meeting the cutting condition, image block B is discarded and is not assembled into the stitched image.

The cutting terminates. As shown in Fig. 7, where the discarded image blocks lie within the dashed box, image block ACB is finally cut into image block A, image block B and image block C, and image block A and image block B are discarded and do not enter the stitched image.

Image block CB is removed, image block C is added, and the threshold is recalculated; the updated threshold is the minimum effective information density among the non-discarded image blocks and image sub-blocks within the pixel rate limit of the stitched image.

Example 6

The common boundary of image sub-block A and image sub-block C is a dividing line that runs through the image block and is perpendicular to its boundary; the effective information density of image sub-block A is less than the threshold and that of image sub-block C is greater than the threshold, so the cutting condition is met. The common boundary of image sub-block C and image sub-block B is also a dividing line that runs through the image block and is perpendicular to its boundary, but the effective information densities of image sub-block C and image sub-block B are both greater than the threshold, so the cutting condition is not met there.

Therefore, the image block ACB is cut into the image block A and the image block CB by taking the common boundary of the image sub-block A and the image sub-block C as a dividing line, and the effective information density of the image block A is smaller than a threshold value and no dividing line meeting the cutting condition exists, so that the image block A is discarded and not assembled into the spliced graph.

In this example, the updated threshold is less than the effective information density of image sub-block B.

The above process is repeated for image block CB generated by the cut. The common boundary of image sub-block C and image sub-block B is a dividing line that runs through the image block and is perpendicular to its boundary, but the effective information densities of image sub-block C and image sub-block B are both greater than the threshold, so the cutting condition is not met.

The cutting terminates. As shown in Fig. 8, where the discarded image block lies within the dashed box, image block ACB is finally cut into image block A and image block CB, and image block A is discarded and does not enter the stitched image.

Example 7

This embodiment of the invention provides a device for processing image blocks in immersive media coding.

As shown in Fig. 3, B, A and C are three image sub-blocks of image block BAC; the colors of the image sub-blocks, from light to dark, represent effective information density from low to high.

The effective information density calculating module takes image sub-block A, image sub-block B and image sub-block C as input and outputs their effective information densities.

The threshold judging module takes the effective information densities of image sub-block A, image sub-block B and image sub-block C as input. The common boundary of image sub-block B and image sub-block A is a dividing line that runs through the image block and is perpendicular to its boundary; the effective information density of image sub-block B is greater than the threshold and that of image sub-block A is less than the threshold, so the cutting condition is met.

The dividing line decision module takes as input the common boundary of image sub-block B and image sub-block A, the common boundary of image sub-block A and image sub-block C, and the effective information density difference between the image sub-blocks on the two sides of each boundary, and outputs the actual dividing line. Since the effective information density difference between image sub-block A and image sub-block C is larger, the common boundary of image sub-block A and image sub-block C is taken as the actual dividing line.

The cutting module takes as input the actual dividing line (the common boundary of image sub-block A and image sub-block C) and the cutting decision, and outputs the cut image blocks. Image block BAC is cut into image block BA and image block C.

The assembly and stitching decision module takes as input image block BA, which contains image sub-block A whose effective information density is less than the threshold. Because image block BA has a dividing line meeting the cutting condition, image block BA is not discarded.

The threshold updating module takes as input the effective information densities of the non-discarded image blocks BA and C and the pixel rate limit, and outputs the updated threshold. Image block BAC is removed, image block BA and image block C are added, and the threshold is recalculated; the updated threshold is the minimum effective information density among the non-discarded image blocks within the pixel rate limit of the stitched image.

In this example, the updated threshold lies between the effective information densities of image sub-block A and image sub-block B.

The device flow is repeated for image block BA generated by the cut. The common boundary of image sub-block B and image sub-block A is a dividing line that runs through the image block and is perpendicular to its boundary; the effective information density of image sub-block B is greater than the threshold and that of image sub-block A is less than the threshold, so the cutting condition is met.

Image block BA is cut into image block B and image block A along the common boundary of image sub-block B and image sub-block A. Because the effective information density of image block A is less than the threshold and image block A has no dividing line meeting the cutting condition, image block A is discarded and is not assembled into the stitched image.

The cutting terminates. As shown in Fig. 9, where the discarded image block lies within the dashed box, image block BAC is finally cut into image block B, image block A and image block C, and image block A is discarded and does not enter the stitched image.

Image block B is added and the threshold is recalculated; the updated threshold is the minimum effective information density among the non-discarded image blocks and image sub-blocks within the pixel rate limit of the stitched image.

By the device, the segmentation of the granularity of the image sub-blocks can be realized, the image block A with low effective information density is accurately discarded, the image block with higher effective information density can be organized under the limit of the limited pixel rate in the multi-view image splicing process, the information utilization rate of a spliced image is improved, and the rendering quality is improved at a decoding end;

It should be noted that the above embodiments merely illustrate the technical solution of the present invention and do not limit it. Although the present invention has been described in detail with reference to the above embodiments, those skilled in the art should understand that the technical solutions described in the embodiments may be modified, or some of their technical features may be equivalently replaced, and that such modifications or substitutions do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.

Claims

1. A method for processing image blocks in immersive media coding, characterized in that it includes:

For at least one image block in a multi-view image, for at least one segmentation line in the image block that runs through the image block and is perpendicular to its boundary, the effective information density of the image sub-blocks on both sides of the segmentation line is calculated respectively. The effective information is the additional information of the image sub-blocks relative to other viewpoints in the multi-view image.

When the image sub-blocks on both sides of the segmentation line meet the cutting condition, the cutting condition being that the effective information density of the image sub-block on one side is greater than a threshold and the effective information density of the image sub-block on the other side is less than the threshold:

The image block is divided into two image blocks by cutting along the dividing line.

For an image block containing an image sub-block with an effective information density less than a threshold, if there is no dividing line that satisfies the cutting conditions, the image block is discarded and not assembled into the stitched image.

Otherwise:

No cutting is performed.

2. The method according to claim 1, wherein for an image block containing an image sub-block with an effective information density less than a threshold, if there is a dividing line that satisfies the cutting condition, it is cut again.

3. The method according to claim 1, characterized in that, for an image block with multiple segmentation lines whose two side image sub-blocks satisfy the cutting conditions, the segmentation line with the largest effective information density difference between the two side image sub-blocks is selected as the actual segmentation line.

4. The method according to claim 1, wherein when the image sub-blocks on both sides of the dividing line satisfy the cutting conditions, the threshold is updated according to the pixel rate limit of the stitched image and the effective information density of the undiscarded image blocks.

5. An apparatus for processing image blocks, characterized in that it comprises the following modules:

Effective information density calculation module: used to calculate, for an image block in a multi-view image, the effective information density of the image sub-blocks on both sides of a segmentation line that runs through the image block and is perpendicular to its boundary. The effective information is the additional information of the image sub-block relative to other viewpoints in the multi-view image.

Input: Image blocks in a multi-view image;

Output: Effective information density of the image sub-blocks on both sides of the segmentation line;

Threshold determination module: used to determine whether the effective information density of the image sub-block on one side of the dividing line is greater than the threshold and the effective information density of the image sub-block on the other side is less than the threshold. If the condition is met, the image block is cut; if the condition is not met, the image block is not cut.

Input: The effective information density on each side of the dividing line, and the threshold;

Output: Cutting decision;

Cutting module: Cuts along the dividing line when the cutting conditions are met;

Input: dividing line, cutting decision;

Output: The cut image blocks;

Assembly and stitching decision module: used to discard image blocks that contain image sub-blocks with an effective information density less than the threshold and that have no dividing line meeting the cutting conditions, and not assemble them into the stitched image;

Input: Image block;

Output: Assembly and stitching decision.

6. The apparatus according to claim 5, wherein for an image block containing image sub-blocks with an effective information density less than a threshold, if there is a dividing line that satisfies the cutting condition, it is cut again.

7. The apparatus according to claim 5, characterized in that it further comprises the following modules:

Segmentation line decision module: For an image block with multiple segmentation lines whose two side image sub-blocks satisfy the cutting conditions, the segmentation line with the largest difference in effective information density between the two side image sub-blocks is selected as the actual segmentation line;

Input: Multiple dividing lines that meet the cutting conditions, and the effective information density difference;

Output: The actual dividing line.

8. The apparatus according to claim 5, characterized in that it further comprises the following modules:

Threshold update module: When the image sub-blocks on both sides of the dividing line meet the cutting conditions, the threshold is updated according to the pixel rate limit of the stitched image and the effective information density of the undiscarded image blocks;

Input: Effective information density of the undiscarded image blocks, pixel rate limit of the stitched image;

Output: The updated threshold.