JPH0830725A

JPH0830725A - Image processing apparatus and method

Info

Publication number: JPH0830725A
Application number: JP6167748A
Authority: JP
Inventors: Kazuyuki Saito; 和之齋藤
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1994-07-20
Filing date: 1994-07-20
Publication date: 1996-02-02

Abstract

PURPOSE:To perform the area judgement of image data at a high speed while suppressing a memory amount. CONSTITUTION:The image data inputted from an input part 1 are supplied to an area division part 105 and the picture elements (reduced picture elements) of low resolution are extracted in the area division part 105 first by ORing the respective picture elements of the area of (m)X(n) picture elements in input pictures. As a result, the memory amount for judging the kind of the images practically becomes 1/(mXn). Then, the picture elements of the low resolution are successively generated and the kinds of the images of the respective positions (areas) of the image data are judged by them.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は画像処理装置及び方法、
詳しくは入力された画像データ中の各領域の像の種類を
判定する画像処理装置及び方法に関するものである。BACKGROUND OF THE INVENTION The present invention relates to an image processing apparatus and method,
More specifically, the present invention relates to an image processing apparatus and method for determining the type of image of each area in input image data.

【０００２】[0002]

【従来の技術】光学的文字認識（ＯＣＲ）装置、複写
機、或いは、ファクシミリ等の装置においては、その処
理を精度良く行うために、入力された画像中の文字領
域、図形、表等の領域を分割することが必要になる。2. Description of the Related Art In a device such as an optical character recognition (OCR) device, a copying machine, or a facsimile, a region such as a character region, a figure, or a table in an input image is processed in order to perform the processing accurately. Will need to be split.

【０００３】領域分割手段として、スペクトル分析方式
の様に、入力画像のフーリエスペクトルを分析し各種領
域に分割する方式を用いたものや、特開昭６４−１５８
８９号公報の様に垂直及び水平方向の射影を交互に繰り
返して取り周辺分布の情報から領域を分割していく方式
を用いたものなどがある。As the area dividing means, one using a method of analyzing the Fourier spectrum of the input image and dividing it into various areas, such as a spectrum analysis method, or Japanese Patent Laid-Open No. 64-158.
As disclosed in Japanese Patent No. 89, there is a method using a method in which projections in the vertical and horizontal directions are alternately repeated to divide an area from information on peripheral distribution.

【０００４】またラベル付けの手段としては連続画素の
輪郭線を追跡する方式を用いたものなどがある。As a labeling means, there is a method using a method of tracing the contour line of continuous pixels.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、上記の
手法のいずれでも、演算処理に多大な時間を費やし、か
つ、画像の画素に対して処理を行うので記憶領域が大規
模化し、複雑な領域構成の画像に対する分割精度が低い
等の問題点があった。However, in any of the above-mentioned methods, a large amount of time is spent on the arithmetic processing and the processing is performed on the pixels of the image, so that the storage area becomes large in scale, and a complicated area structure is formed. However, there is a problem that the division accuracy for the image is low.

【０００６】また、入力画像の解像度が低い場合にも分
割精度が悪くなる等の問題が有り、逆に解像度が高い場
合は画素数が増えラベル付け等の処理時間や記憶領域が
増大するという問題点が有った。Further, there is a problem that the division accuracy deteriorates even when the resolution of the input image is low, and conversely, when the resolution is high, the number of pixels increases and the processing time such as labeling and the storage area increase. There was a point.

【０００７】[0007]

【課題を解決するための手段】及び[Means for Solving the Problems] and

【作用】本発明は上記問題点に鑑みなされたものであ
り、画像データの領域判定を高速に、しかも、メモリ量
を抑えながら行うことを可能にする画像処理装置及び方
法を提供しようとするものである。SUMMARY OF THE INVENTION The present invention has been made in view of the above problems, and an object of the present invention is to provide an image processing apparatus and method capable of performing area determination of image data at high speed while suppressing the amount of memory. Is.

【０００８】この課題を解決するため、例えば本発明の
画像処理装置は以下に示す構成を備える。すなわち、与
えられた画像データの各領域の種類を判定する画像処理
装置であって、前記画像データ中のｍ×ｎ画素領域から
低解像度の１画素を抽出する抽出手段と、低解像度の画
素の連続する広がりから、画像種別の領域を判別する領
域判別手段と、該領域判別手段で判別された領域情報か
ら、当該領域の種別を判断する判断手段とを備える。In order to solve this problem, for example, the image processing apparatus of the present invention has the following configuration. That is, an image processing apparatus for determining the type of each area of given image data, comprising an extracting means for extracting one pixel of low resolution from an m × n pixel area in the image data, and a pixel of low resolution. An area discriminating unit that discriminates an image type region from a continuous spread, and a discriminating unit that determines the type of the region from the region information discriminated by the region discriminating unit.

【０００９】また、本発明に係る好適な実施態様に従え
ば、前記ｍ、ｎは、入力される画像データの解像度が高
いほど、大きな値を持つことが望ましい。これによっ
て、画像データの解像度にかかわらず、低解像度の画素
の重みを略一定にすることができ、ほぼ同じ環境で像判
定が行える。According to a preferred embodiment of the present invention, it is desirable that the m and n have larger values as the resolution of input image data is higher. As a result, the weight of low-resolution pixels can be made substantially constant regardless of the resolution of image data, and image determination can be performed in almost the same environment.

【００１０】また、抽出手段は、ｍ×ｎ画素内に有意な
画素が１つでも存在するとき、当該ｍ×ｎ画素に対する
低解像度の画素を有意が画素として抽出することが望ま
しい。これによって、ｍ×ｎの画素群内に１つでも黒画
素が存在するとき、低解像度の画素を黒画素とすること
ができるので、像域の判定に使用することが可能にな
る。Further, when at least one significant pixel exists in the m × n pixels, the extracting means preferably extracts a low-resolution pixel for the m × n pixels as a significant pixel. Accordingly, when at least one black pixel exists in the m × n pixel group, the low-resolution pixel can be set as a black pixel, and thus it can be used for image area determination.

【００１１】また、抽出手段は、ｍ×ｎの画素の各ドッ
ト状態の論理和を算出する算出手段を含むことが望まし
い。これによって、極めて簡単な構成或いは処理で低解
像度の画素を得ることが可能になる。Further, it is preferable that the extracting means includes a calculating means for calculating a logical sum of dot states of m × n pixels. This makes it possible to obtain low-resolution pixels with an extremely simple configuration or processing.

【００１２】また、領域判別手段は、低解像度の画素の
連続する広がりに対して内接する矩形を領域として判別
することが望ましい。これは、文字や写真、表等は一般
に矩形領域毎に展開されており、これに良好に、しかも
処理が単純なだけ高速に対処することが可能になる。Further, it is desirable that the area discriminating means discriminates a rectangle inscribed with respect to a continuous spread of low-resolution pixels as an area. This is because characters, photographs, tables, etc. are generally developed for each rectangular area, and it is possible to cope with this favorably and at a high speed by simple processing.

【００１３】また、記領域判別手段は、与えられた画像
データ中のｍ×ｎの画素群を１画素とするライン単位
に、各低解像度の有意な画素位置を求める手段と、主走
査方向に連続する有意な低解像度の画素の始まり位置
と、その連続する個数を走査ライン毎に記憶すると共
に、注目ライン中に有意な低解像度の画素がない場合に
は、所定の判別情報を記憶する記憶手段と、該記憶手段
で記憶された内容に従って、有意な低解像度の２次元に
連続する領域を検索する検索手段と、検索して得られた
連続する低解像度の画素群に対して固有のラベルを付す
と共に、当該固有のラベルの画素群の座標位置に基づい
て、当該画素群を内接する矩形領域の座標を得る手段と
を備えることが望ましい。なぜなら、有意な画素が存在
しない部分のみが記憶されるので、その分のメモリ消費
量を抑えることが可能になるからである。Further, the recording area discriminating means is means for obtaining a significant pixel position of each low resolution in line units in which an m × n pixel group in given image data is one pixel, and in the main scanning direction. A memory for storing the start position of consecutive significant low-resolution pixels and the number of consecutive pixels for each scanning line, and storing predetermined discrimination information when there is no significant low-resolution pixel in the line of interest. Means, a searching means for searching a significant low resolution two-dimensional continuous area according to the contents stored in the storage means, and a unique label for the continuous low resolution pixel group obtained by the search. And a means for obtaining the coordinates of the rectangular area inscribed in the pixel group based on the coordinate position of the pixel group of the unique label. This is because only the portion where no significant pixel exists is stored, and it is possible to suppress the amount of memory consumption accordingly.

【００１４】また、判断手段は、前記領域判別手段で判
別された領域の縦、横のサイズ、当該領域内における有
意な低解像度の画素数の占める割合に従って、当該領域
の種類を判断することが望ましい。これによって、各領
域を性格に判定することが可能になる。Further, the judging means can judge the type of the area according to the vertical and horizontal sizes of the area judged by the area judging means and the ratio of the number of pixels of significant low resolution in the area. desirable. This makes it possible to determine each area with a personality.

【００１５】[0015]

【実施例】以下、添付図面に従って本発明に係る実施例
を詳細に説明する。Embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

【００１６】実施例における画像処理装置の構成例を図
１に示す。図中、１０１は画像データを入力する入力部
であり、イメージスキャナである。但し、これに限定さ
れるものではなく、例えば、上位装置（ホストコンピュ
ータ）から画像データを入力したり、ファクシミリ受信
するためのインターフェースであっても良い。要は、画
像データを入力するものでありさえすれば良いからであ
る。FIG. 1 shows an example of the configuration of the image processing apparatus in the embodiment. In the figure, 101 is an input unit for inputting image data, which is an image scanner. However, the present invention is not limited to this, and may be, for example, an interface for inputting image data from a host device (host computer) or receiving a facsimile. The point is that it only needs to input image data.

【００１７】１０２は装置の制御や各部処理の演算を行
う演算処理部（以下、ＣＰＵという）であり、１０３は
ＣＰＵ１０２の制御プログラムや、各種データを記憶し
ておく記憶部である。１０４は演算処理結果や画像処理
結果、及び画像データを出力する出力部である。Reference numeral 102 denotes an arithmetic processing unit (hereinafter referred to as CPU) that controls the apparatus and performs arithmetic operations for processing of each unit, and 103 is a storage unit that stores the control program of the CPU 102 and various data. An output unit 104 outputs the calculation processing result, the image processing result, and the image data.

【００１８】１０５は領域分割処理全体を示す領域分割
部であり、以下の構成を備える。Reference numeral 105 denotes an area dividing unit showing the entire area dividing processing, which has the following configuration.

【００１９】１０６は入力画像を縮小しデータ化する、
画像縮小データ化部、１０７は縮小画素データにラベル
付けを行い同時に初期矩形データを作成するラベリング
部である。１０８はセパレータや表や図形等を検出する
基本属性判別部である。１０９は縦書き・横書き等の文
章の組方向を検出する組方向検出部である。１１０は見
出し類を検出する見出し類検出部である。１１１は矩形
の合併部である。Reference numeral 106 reduces the input image to data and
An image reduced data conversion unit 107 is a labeling unit that labels reduced pixel data and simultaneously creates initial rectangular data. Reference numeral 108 denotes a basic attribute determination unit that detects a separator, a table, a figure, and the like. Reference numeral 109 denotes a set direction detection unit that detects a set direction of a sentence such as vertical writing or horizontal writing. Reference numeral 110 denotes a headline detection unit that detects headlines. 111 is a rectangular merged part.

【００２０】図２は、本実施例の画像処理装置における
画像処理を示すフローチャートで、この処理を実行する
制御プログラムは記憶部１０３に記憶されている。FIG. 2 is a flow chart showing image processing in the image processing apparatus of this embodiment. A control program for executing this processing is stored in the storage unit 103.

【００２１】以下、同フローチャートに従って上記構成
の動作を説明する。The operation of the above configuration will be described below with reference to the flowchart.

【００２２】１）ステップＳ２０１の説明まず画像入力部１０１から元画像を入力する。1) Description of Step S201 First, an original image is input from the image input unit 101.

【００２３】２）ステップＳ２０２の説明次に画像縮小データ化部１０６において、入力画像に対
し、解像度がＲｄｐｉ（ｄｐｉ＝ｄｏｔｓ／ｉｎｃｈ）
以上であれば、縦ｍドット、横ｎドットの１ドット単位
の論理和をとっり、その論理和結果を新たな１画素（縮
小画素）とする。また、入力画像データの解像度がＲｄ
ｐｉ未満であれば縦ｐドット、横ｑドットのドット単位
の論理和をとって、その論路和結果を新た１画素（縮小
画素）とする（但し、ｍ＞ｐ，ｎ＞ｑ）。ここで、解像
度に応じて領域サイズを変化させるのは、高解像度にお
ける１画素と低解像度の１画素と比較した場合に、後者
の画素の方がその意味が重要である理由による。従っ
て、入力される画像データの解像度が様々な場合には、
上記の２段階に分けるのではなく、多段階に分けるよう
にすることが望ましい。尚、入力画像の解像度の判断
は、外部に設けられた不図示の操作パネルから指示して
も良いし、画像データ発生源（ホストコンピュータ等）
がそのヘッダ部分に解像度指定コマンドを付加し、この
コマンドを解釈することで行っても良い。2) Description of Step S202 Next, in the image reduced data conversion unit 106, the resolution of the input image is Rdpi (dpi = dots / inch).
In the above case, the logical sum of vertical m dots and horizontal n dots in 1 dot units is taken, and the logical sum result is set as a new 1 pixel (reduced pixel). The resolution of the input image data is Rd.
If it is less than pi, the logical sum of dot units of vertical p dots and horizontal q dots is taken, and the logical sum result is newly set to 1 pixel (reduced pixel) (however, m> p, n> q). Here, the reason that the area size is changed according to the resolution is that the latter pixel is more important in meaning when compared with one pixel at high resolution and one pixel at low resolution. Therefore, when the resolution of the input image data varies,
It is desirable to divide it into multiple stages instead of the above two stages. Incidentally, the determination of the resolution of the input image may be instructed from an operation panel (not shown) provided outside, or may be an image data generation source (host computer or the like).
May add a resolution specification command to the header part and interpret this command.

【００２４】さて、上記の論理和処理を行うと、入力画
像のｍ×ｎの領域、或いはｐ×ｑの領域中に１ドットで
も黒画素が有れば、その縮小画素は黒となる（ここで言
う“黒”とは、便宜的なものである）。When the above logical sum processing is performed, if even one dot has a black pixel in the m × n area or the p × q area of the input image, the reduced pixel becomes black (here). The term "black" is used for convenience.)

【００２５】上記のように得られた縮小画素は、主走査
方向に連続する縮小画素を一つの塊として扱い、その黒
画素の始まるＸ座標、黒画素数を縮小画素データとして
記憶する（図５）。The reduced pixels obtained as described above are treated as one group of reduced pixels continuous in the main scanning direction, and the X coordinate at which the black pixel starts and the number of black pixels are stored as reduced pixel data (FIG. 5). ).

【００２６】主走査方向に存在する縮小黒画素の塊の画
素データは左から順にリンクさせ、最も左の画素データ
はそのアドレスを縮小画素行ヘッダに記憶する。Pixel data of a group of reduced black pixels existing in the main scanning direction are linked in order from the left, and the address of the leftmost pixel data is stored in the reduced pixel row header.

【００２７】また、各縮小画素行ヘッダにおいてもリン
クさせる事によって１連の縮小画素データとしてアクセ
スできるようにする。Further, each reduced pixel row header is linked so that it can be accessed as a series of reduced pixel data.

【００２８】つまり、各縮小画素行のヘッダには、その
行における次の黒画素の塊の格納されているアドレス
（ポインタ）を記憶していると共に、次の行のヘッダの
アドレスをも記憶している。That is, the header of each reduced pixel row stores the address (pointer) at which the next block of black pixels in that row is stored, and also the address of the header of the next row. ing.

【００２９】以上が、画像縮小データ化部１０６の処理
である。The above is the processing of the image reduced data conversion unit 106.

【００３０】３）ステップＳ２０３の説明次にラベリング部１０７の処理内容を説明する。3) Description of Step S203 Next, the processing contents of the labeling unit 107 will be described.

【００３１】縮小画像の黒画素に対し１行ずつ縮小画素
データを参照することによってラベルを付加し、上下・
斜めで連続している画素データには同一ラベルを付け、
同時に矩形をかたどっていく。ただし、この時横方向の
連続性は縮小画素データによってあらかじめかわってい
るので上下及び斜めの連続性のみを検査すれば良い事に
なる。A label is added to the black pixels of the reduced image by referring to the reduced pixel data line by line, and
The same label is attached to diagonally continuous pixel data,
Simultaneously model the rectangle. However, at this time, since the continuity in the horizontal direction is previously changed by the reduced pixel data, it is only necessary to inspect the continuity in the vertical and diagonal directions.

【００３２】図３を例にとると、１行目（縮小画素で構
成される行）の縮小画素ヘッダを参照し、その縮小画素
データ先頭アドレスから最初の縮小画素データを読み取
ると画素Ａを含む画素データのデータを参照する事にな
る。Taking FIG. 3 as an example, when the reduced pixel header of the first row (row composed of reduced pixels) is referenced and the first reduced pixel data is read from the reduced pixel data start address, pixel A is included. The pixel data will be referred to.

【００３３】最初に検出される画素Ａを含む画素データ
にラベル１が付けられ、この画素Ａの座標（Ｘａ，Ｙ
ａ）を矩形の始点と終点とし、画素数を１、矩形を区別
するための矩形ラベルに画素と同じラベル１を付加して
以上のデータを矩形データ（図４）として記憶部に記憶
する。The label 1 is attached to the pixel data including the pixel A that is detected first, and the coordinates (Xa, Y
a) is the start and end points of the rectangle, the number of pixels is 1, and the same label 1 as the pixel is added to the rectangle label for distinguishing the rectangle, and the above data is stored in the storage unit as rectangle data (FIG. 4).

【００３４】次に次縮小画素データアドレスによって次
の縮小画素データを参照すると画素Ｂを含む画素データ
を参照する事となり、左方向に画素Ａと連続性のないこ
とは画素データが異なる事から明らかである（１行目で
あるのでもちろん上からも連続画素はない）。画素Ｂを
含む画素データにはラベル２が付けられ、この画素Ｂの
座標（Ｘｂ，Ｙｂ）を矩形の始点と終点とし、画素数を
１、矩形を区別するための矩形ラベルに画素と同じラベ
ル２を付加して以上のデータも矩形データ（図４）とし
て記憶部に記憶する。Next, when the next reduced pixel data is referred to by the next reduced pixel data address, the pixel data including the pixel B is referred to, and it is clear that the pixel data is different from the pixel A in the left direction because the pixel data is different. (Since it is the first row, of course, there are no continuous pixels from the top). Label 2 is attached to the pixel data including the pixel B, the coordinates (Xb, Yb) of the pixel B are used as the start point and the end point of the rectangle, the number of pixels is 1, and the rectangular label for distinguishing the rectangle has the same label as the pixel. 2 is added and the above data is also stored in the storage unit as rectangular data (FIG. 4).

【００３５】以上の様にして１行目のラベリングが終了
したら２行目に移る。When the labeling of the first line is completed as described above, the process moves to the second line.

【００３６】２行目の縮小画素ヘッダを参照し、その縮
小画素データ先頭アドレスから最初の縮小画素データを
読み取ると画素Ｃから画素Ｄまでのデータを参照する事
になる。２行目の最初の画素Ｃを含む画素データはラベ
ル１の画素Ａを含む画素データと上から連続しているの
で画素ラベル１を付加し、矩形ラベル１の矩形データに
対し画素数は１加算して計２画素となり、矩形ラベルは
変わらず１のままで、矩形座標の終点のみを（Ｘａ，Ｙ
ａ）から（Ｘｄ，Ｙｃ）（画素Ｄの座標）へと更新する
（始点の座標は変わらない）。When the reduced pixel header of the second row is referred to and the first reduced pixel data is read from the reduced pixel data start address, the data from pixel C to pixel D is referred to. Since the pixel data including the first pixel C in the second row is continuous from the top with the pixel data including the pixel A with label 1, pixel label 1 is added, and the number of pixels is added to the rectangular data with rectangular label 1 by 1. The total number of pixels is 2 pixels, the rectangle label remains unchanged at 1, and only the end point of the rectangle coordinate is (Xa, Y
Update from (a) to (Xd, Yc) (coordinates of pixel D) (the coordinates of the start point do not change).

【００３７】また、同時に画素Ｄを含む画素データは画
素Ｂを含む画素データとも斜めに連続しており、画素Ｂ
を含む画素データのラベルをラベル２からラベル１へと
変更する。そして、矩形ラベル１の矩形データに対し、
矩形ラベル２の画素数を加算して計４画素とし矩形ラベ
ルは変わらず１のままで、矩形座標は画素Ａ，Ｂ，Ｃ，
Ｄをすべて含むように終点のみを（Ｘｄ，Ｙｃ）から
（Ｘｂ，Ｙｄ）へと更新する。そして、矩形ラベル２の
矩形データについては矩形ラベルを０として無効とす
る。At the same time, the pixel data including the pixel D is diagonally continuous with the pixel data including the pixel B.
The label of the pixel data including is changed from label 2 to label 1. Then, for the rectangular data of rectangular label 1,
The number of pixels of the rectangular label 2 is added to make a total of 4 pixels, the rectangular label remains unchanged at 1, and the rectangular coordinates are pixels A, B, C,
Only the end point is updated from (Xd, Yc) to (Xb, Yd) so as to include all D. Then, the rectangular data of the rectangular label 2 is invalidated by setting the rectangular label to 0.

【００３８】以上の様にして２行目が終了したら３行目
に移る。When the second line is completed as described above, the process moves to the third line.

【００３９】３行目の縮小画素行ヘッダを参照し、その
縮小画素データ先頭アドレスから最初の縮小画素データ
を読み取ると画素Ｅのデータを参照する事になる。３行
目の最初の画素Ｅを含む画素データは画素Ｃを含む画素
データと斜めに連続しているので画素ラベル１を付加
し、矩形ラベル１の矩形データに対し画素数は１加算し
て計５画素となり矩形ラベルは変わらず１のままで、矩
形座標は始点をＸａ，Ｙａ）から（Ｘｅ，Ｙａ）へ、お
よび終点を（Ｘｂ，Ｙｄ）から（Ｘｂ，Ｙｅ）へと更新
する。When the reduced pixel row header of the third row is referred to and the first reduced pixel data is read from the reduced pixel data start address, the data of the pixel E is referred to. Since the pixel data including the first pixel E in the third row is diagonally continuous with the pixel data including the pixel C, the pixel label 1 is added, and the number of pixels is added to the rectangular data of the rectangular label 1 to calculate the total. The number of pixels becomes 5 pixels, the rectangular label remains unchanged at 1, and the rectangular coordinates are updated from the starting point of Xa, Ya) to (Xe, Ya) and the ending point of (Xb, Yd) to (Xb, Ye).

【００４０】以下同様にして全縮小画素に対しラベリン
グと矩形のかたどりを行う。この結果、縦横斜めに連続
する縮小黒画素群に対しては同じラベルが付けられ、同
一ラベルで表わされる縮小画素群を内接する矩形の左上
隅の座標データと、右下隅の座標データ、そして、縮小
黒画素数が得られることになる。In the same manner, labeling and rectangular tracing are performed on all the reduced pixels. As a result, the same label is attached to the reduced black pixel groups that are continuous vertically and horizontally and diagonally, and the coordinate data of the upper left corner of the rectangle that inscribes the reduced pixel groups represented by the same label, the coordinate data of the lower right corner, and The reduced number of black pixels will be obtained.

【００４１】４）ステップＳ２０４の説明この処理は、基本属性判別部１０８で行われる。基本属
性判別部１０８は、上記ラベリングと矩形のかたどりの
後、各々の矩形部分が本文に該当する矩形、図形または
写真、表等に該当する矩形、或いはセパレータに該当す
る矩形のいずれであるのかを、そのラベリングされた矩
形の幅Ｗ、高さＨ、面積Ｓ、面積に対する黒画素の数、
すなわち画素密度Ｄ（これらは矩形データを用いて計算
する事により容易に求まる）を用いて区別する。ここ
で、言うセパレータとは、画像の区切り部分、例えば文
章部分と写真部分、或いは文章部分どうしを区切るため
の線（新聞等で文章部分を区切る線等である）を意味す
る。以下、図６のフローチャートに従って説明する。
尚、基本属性判別部１０８内部には、図示しないＣＰＵ
があって、このＣＰＵが同図フローチャートに対応する
プログラムに従って処理を進めることになる。4) Description of Step S204 This process is performed by the basic attribute determination unit 108. After tracing the above labeling and rectangles, the basic attribute determination unit 108 determines whether each rectangle is a rectangle corresponding to a text, a rectangle corresponding to a figure or a photograph, a table, or a rectangle corresponding to a separator. , Width W, height H, area S of the labeled rectangle, the number of black pixels for the area,
That is, the pixel density D (these are easily obtained by calculation using rectangular data) is used for the discrimination. The term "separator" as used herein means a delimiter portion of an image, for example, a text portion and a photograph portion, or a line (for example, a line that delimits a text portion in a newspaper or the like) for demarcating text portions. Hereinafter, a description will be given according to the flowchart of FIG.
A CPU (not shown) is provided inside the basic attribute determination unit 108.
Therefore, the CPU proceeds with the processing according to the program corresponding to the flowchart of FIG.

【００４２】セパレータは、幅Ｗが閾値Ｔｗ１以下でか
つ高さＨが幅Ｗの閾値Ｔｗ２倍以上（図６のＳ６０
３）、もしくは幅Ｗが閾値Ｔｗ１より大きくかつ高さＨ
が幅Ｗの閾値Ｔｗ３倍以上（図６のＳ６０４）であれば
縦長セパレータとして矩形ラベルを「−３」として統一
し、矩形を構成する画素ラベルは矩形ラベルとは別にそ
のまま保持しておく（図６のＳ６１４）。In the separator, the width W is not more than the threshold Tw1 and the height H is not less than twice the threshold Tw of the width W (S60 in FIG. 6).
3) or the width W is larger than the threshold Tw1 and the height H
Is equal to or greater than the threshold value Tw three times the width W (S604 in FIG. 6), the rectangular label is unified as “-3” as the vertically long separator, and the pixel labels forming the rectangle are held as they are separately from the rectangular label (FIG. 6 of S614).

【００４３】また幅と高さを入れ替えたものも上記と同
様にして（図６のＳ６０５，Ｓ６０６）横長セパレータ
として矩形ラベルを「−３」に変更する。Also, in the case where the width and the height are interchanged, the rectangular label is changed to "-3" as the horizontally long separator in the same manner as above (S605 and S606 in FIG. 6).

【００４４】更に、図７に示す様に、画素密度Ｄが閾値
Ｔｄ１以下の時（図６のＳ６０７）はカギ型などの変形
セパレータと見なして矩形ラベルを「−３」に変更する
（図６のＳ６１４）。Further, as shown in FIG. 7, when the pixel density D is less than or equal to the threshold value Td1 (S607 in FIG. 6), the rectangular label is changed to "-3" by considering it as a deformed separator such as a key-shaped one (FIG. 6). S614).

【００４５】また、面積Ｓが閾値Ｔｓ１より大きい時
（図６のＳ６０８）、その画素密度Ｄが閾値Ｔｄ２未満
（図６のＳ６０９）ならば表と見なして矩形ラベルを
「−４」に変更し（図６のＳ６１５）、画素密度Ｄが閾
値Ｔｄ２以上ならば図形または写真とみなして矩形ラベ
ルを「−５」に変更する（図６のＳ６１６）。When the area S is larger than the threshold value Ts1 (S608 in FIG. 6) and the pixel density D is less than the threshold value Td2 (S609 in FIG. 6), it is regarded as a table and the rectangular label is changed to "-4". If the pixel density D is equal to or higher than the threshold value Td2 (S615 in FIG. 6), the rectangle label is changed to “−5” by regarding it as a figure or a photograph (S616 in FIG. 6).

【００４６】つまり、一般に表は、線分で構成され、文
字と比較すると面積は大きい。また、その面積内はほと
んど白であるので、矩形密度は小さい値になる。これに
対し、図形や、写真等は、表に比較して画素密度が大き
くなることに由来している。That is, in general, a table is composed of line segments and has a large area compared to characters. Moreover, since the area is almost white, the rectangular density has a small value. On the other hand, figures, photographs and the like have a higher pixel density than those in the table.

【００４７】また、面積Ｓが閾値Ｔｓ１以下でかつ閾値
Ｔｓ２以上の時で（図６のＳ６１０）、その画素密度Ｄ
が閾値Ｔｄ３以上（図６のＳ６１１）の矩形や、幅Ｗ及
び高さＨが共に閾値Ｔｗ４以上でかつ画素密度ＤがＴｄ
５以上の時（図６のＳ６１３）の矩形も図形または写真
とみなして矩形ラベルを「−５」に変更する。When the area S is less than or equal to the threshold Ts1 and greater than or equal to the threshold Ts2 (S610 in FIG. 6), the pixel density D
Is a threshold value Td3 or more (S611 in FIG. 6), or the width W and height H are both threshold value Tw4 or more and the pixel density D is Td.
When the number is 5 or more (S613 in FIG. 6), the rectangle is regarded as a figure or a photograph, and the rectangle label is changed to "-5".

【００４８】更に、面積Ｓが閾値Ｔｓ１以下でかつ閾値
Ｔｓ２以上の時（図６のＳ６１０）、画素密度Ｄが閾値
Ｔｄ４未満（図６のＳ６１２）の矩形を表とみなし矩形
ラベルを「−４」に変更する（図６のＳ６１５）。Further, when the area S is less than or equal to the threshold Ts1 and greater than or equal to the threshold Ts2 (S610 in FIG. 6), the rectangle having the pixel density D less than the threshold Td4 (S612 in FIG. 6) is regarded as a table and the rectangle label is "-4". To “” (S615 in FIG. 6).

【００４９】以上の様にして、図形または写真、表等に
該当する矩形、セパレータに該当する矩形等を検出し、
残った矩形を本文として矩形ラベルはそのまま（符号が
正なので表等と容易に区別出来る）画素ラベルと同一に
しておく（図６のＳ６１７）。As described above, a rectangle corresponding to a figure, a photograph, a table, etc., a rectangle corresponding to a separator, etc. are detected,
The remaining rectangle is used as the text and the rectangle label is made the same as it is (because the sign is positive, it can be easily distinguished from a table etc.) and the pixel label (S617 of FIG. 6).

【００５０】５）ステップＳ２０５の説明次に組方向検出部１０９の動作処理内容を図８のフロー
チャートに従って説明する。尚、組方向検出部１０９内
にはＣＰＵと、同図のフローチャートに対応するプログ
ラムを記憶しているメモリが設けられてる。5) Description of Step S205 Next, the operation processing contents of the assembly direction detecting unit 109 will be described with reference to the flowchart of FIG. A CPU and a memory that stores a program corresponding to the flowchart of FIG.

【００５１】さて、横書き文章の場合、本文として残っ
た矩形は横方向に縮小黒画素が連続して横長の矩形にな
りやすく、縦書き文章であれば、本文として残った矩形
は縦長になりやすい。In the case of a horizontally written sentence, the rectangle remaining as the body tends to become a horizontally long rectangle in which reduced black pixels are continuous in the horizontal direction, and in the case of a vertically written sentence, the rectangle remaining as the body tends to become vertically long. .

【００５２】そこで本文の矩形の幅Ｗと高さＨの平均値
を算出し（図８のＳ８０１）、平均幅ｗが平均高さｈよ
り大きい場合は横書きの多い章と見なし平均高さｈを１
文字の文字サイズとする（図８のＳ８０２，Ｓ８０
３）。逆に平均高さｈが平均幅ｗより大きい場合は縦書
きの多い文章と見なし平均高さｗを１文字の文字サイズ
とする（図８のＳ８０２，Ｓ８１０）。Therefore, the average value of the width W and the height H of the rectangle of the text is calculated (S801 in FIG. 8), and if the average width w is larger than the average height h, it is regarded as a chapter with many horizontal writing and the average height h is set. 1
Set to the character size of the character (S802, S80 in FIG. 8
3). On the contrary, when the average height h is larger than the average width w, it is regarded as a sentence with many vertical writings and the average height w is set to the character size of one character (S802 and S810 in FIG. 8).

【００５３】次に、組方向とは逆方向に、それぞれの本
文矩形の縮小黒画素のヒストグラムを取り（図８のＳ８
０４）、周辺分布の形状から閾値Ｔｋ以下の場所を段落
の分かれ目とする（図８のＳ８０５，Ｓ８１２、図９参
照）。また、段階ごとに組方向と同じ方向に矩形のヒス
トグラムを取って（図８の８０８，Ｓ８１５）、周辺分
布の形状から黒画素の連続の長さをその段落内文字のそ
の字サイズ（高さサイズ）とし、白画素の連続の長さを
行ピッチとして検出する（図８のＳ８０９，Ｓ８１６お
よび図８）。Next, a histogram of the reduced black pixels of each body rectangle is taken in the opposite direction to the set direction (S8 in FIG. 8).
04), the location of the threshold value Tk or less is set as a paragraph break based on the shape of the peripheral distribution (see S805 and S812 in FIG. 8 and FIG. 9). Also, for each step, a rectangular histogram is taken in the same direction as the set direction (808, S815 in FIG. 8), and from the shape of the marginal distribution, the continuous length of black pixels is determined by the character size (height) of the characters in the paragraph. Size) and the continuous length of white pixels is detected as a row pitch (S809, S816 in FIG. 8 and FIG. 8).

【００５４】６）ステップＳ２０６の説明次に見出し類検出部１１０において、組方向と文字サイ
ズより、見出し類の検出を行う手順を図１０のフローチ
ャートに従って説明する。尚、ここでは本文が縦書きの
場合を説明し、横書きについては説明しないが、以下の
説明からすれば横書きの場合の処理手順は容易に推察さ
れよう。6) Description of Step S206 Next, the procedure for detecting the headings based on the set direction and the character size in the headings detection unit 110 will be described with reference to the flowchart of FIG. It should be noted that the case where the text is written vertically will be described here, and the case where the text is written horizontally will not be described, but from the following description, the processing procedure in the case of horizontal writing can be easily guessed.

【００５５】先ず、本分矩形の中から矩形の幅Ｗが文字
サイズｗよりも閾値Ｔｍ１倍以上大きい矩形を見出し矩
形の候補として検出する（図１０のＳ１００５）。First, from the main rectangle, a rectangle whose width W is larger than the character size w by a threshold value Tm1 times or more is detected as a candidate for a heading rectangle (S1005 in FIG. 10).

【００５６】さらに、矩形の高さＨが文字サイズのｗの
Ｔｍ２倍よりも小さい場合は（図１０のＳ１００６）、
横書きの文字が連続したものを含む場合があるので、本
文矩形と見出し矩形の距離が閾値Ｔ１より近いもの（図
１０のＳ１００７）では見出しから本文に訂正する（図
１０のＳ１００９）。Furthermore, when the height H of the rectangle is smaller than Tm2 times the character size w (S1006 in FIG. 10),
Since there may be a case where the horizontally written characters are continuous, if the distance between the body rectangle and the heading rectangle is shorter than the threshold T1 (S1007 in FIG. 10), the headline is corrected to the body (S1009 in FIG. 10).

【００５７】さらに上記の処理によって残った本文矩形
の中で文字サイズｗの範囲に本文及び見出し類の矩形が
ないものを孤立した矩形として見出しとする（図１０の
Ｓ１０１０〜Ｓ１０１３）。Further, among the body rectangles remaining by the above-mentioned processing, a body body and a headline type rectangle in the range of the character size w are regarded as isolated rectangles as headings (S1010 to S1013 in FIG. 10).

【００５８】以上説明したように、横書きの場合には、
上記処理における９０度回転したものとなるだけである
ので、容易に想到できよう。As described above, in the case of horizontal writing,
It can be easily conceived because it is only rotated by 90 degrees in the above processing.

【００５９】７）ステップＳ２０７の説明次に矩形の合併部１１１の処理内容を説明する。7) Description of Step S207 Next, the processing contents of the rectangular merger 111 will be described.

【００６０】この処理では、何の関係もなくばらばらに
存在したままの本文に該当する矩形及び見出し類の矩形
を合併する。In this processing, the rectangles corresponding to the main text that have been separated and have no relation to each other and the rectangles of the headings are merged.

【００６１】図１１に例に、矩形Ａ１と矩形Ｂとの合併
を考える。矩形Ａ１に対してＸ方向に対してＰｘ、Ｙ方
向に対してＰｙだけ拡張した仮想矩形Ａ１’を考え、Ａ
１′に対してその周囲に接触或は内包する矩形があるか
サーチする。図示の場合、矩形Ｂの様に接触していれば
矩形Ａ１と矩形Ｂを合併して新しく合併矩形Ａ２を作成
し矩形Ａ１の矩形データを更新することで記憶し、矩形
Ｂのデータは無効とする。ただし、この時Ｐｘ，Ｐｙは
文字サイズと行ピッチから求めた値であり、また矩形を
合併してできた矩形が図形や表やセパレータ等と接触す
る場合は、合併すること自体を無効とし元の矩形のまま
としておく。As an example in FIG. 11, consider merging rectangles A1 and B. Consider a virtual rectangle A1 ′ that is expanded by Px in the X direction and Py in the Y direction with respect to the rectangle A1.
1'is searched for a rectangle surrounding or in contact with it. In the case shown in the figure, if they are in contact with each other like rectangle B, rectangles A1 and B are merged to create a new merged rectangle A2 and the rectangle data of rectangle A1 is updated and stored. The data of rectangle B is invalid. To do. However, at this time, Px and Py are values obtained from the character size and the line pitch, and when the rectangle formed by merging the rectangles contacts a figure, a table, a separator, etc., the merging itself is invalidated. The rectangle is left as it is.

【００６２】８）ステップＳ２０８の説明最後に、以上の様にして求めた各種領域の矩形データを
画像データと共に出力部１０４から出力する。8) Description of Step S208 Finally, the rectangular data of various areas obtained as described above is output from the output unit 104 together with the image data.

【００６３】以上説明したように本実施例によれば、入
力画像の解像度によって縮小１画素あたりのドット数を
変更し、縮小すると同時に、主走査方向は黒画素のラン
レングスをリンクして記憶し、それらのデータの先頭ア
ドレスを各行のヘッダ領域に記憶する画像縮小データ化
手段によって入力画像を縮小を行う事によって、ラベ
リング処理の処理時間を短縮、記憶領域等の小規模
化、複雑な領域構成の分割精度の向上等がはかれる効
果がある。As described above, according to this embodiment, the number of dots per reduced pixel is changed and reduced according to the resolution of the input image, and at the same time, the run length of black pixels is linked and stored in the main scanning direction. By reducing the input image by the image reduction data conversion means that stores the start address of the data in the header area of each row, the processing time of the labeling process is shortened, the storage area is downsized, and the complicated area structure is formed. This has the effect of improving the division accuracy of.

【００６４】尚、入力した画像に対する処理は、本発明
の主旨から外れるが、例えば、本文では文字認識し、写
真画像領域であれば誤差拡散法等による処理を施す等が
考えられよう。It should be noted that although the processing on the input image is out of the scope of the present invention, it is conceivable that, for example, characters are recognized in the text and processing is performed by an error diffusion method or the like in the case of a photographic image area.

【００６５】また、上記基本属性判別部１０８で処理内
容で示された各閾値は、入力される画像データの解像度
によって決定されるものであり、固定ではない。Further, the respective threshold values indicated by the processing contents in the basic attribute discrimination section 108 are determined by the resolution of the input image data and are not fixed.

【００６６】また、ステップＳ２０１の画像入力の際に
多値であるかどうかを判断して２値画像に変換する事
で、入力画像がカラー等の多値画像であっても領域分割
処理は可能である。Further, it is possible to perform area division processing even if the input image is a multi-valued image such as color by judging whether it is multi-valued or not when inputting the image in step S201 and converting it into a binary image. Is.

【００６７】また、ステップＳ２０２の画像縮小データ
化処理において、入力画像の画素数が処理速度を損なわ
ない程度に十分少なければ縮小処理を省略し入力画像を
直接画素データ化する事も可能である。Further, in the image reduced data conversion processing of step S202, if the number of pixels of the input image is sufficiently small so as not to impair the processing speed, the reduction processing may be omitted and the input image may be directly converted into pixel data.

【００６８】また、ステップＳ２０４の基本属性検出処
理において、セパレータの縦・横違いや、図形や表等求
める時の閾値の違いなどで矩形ラベルを区別する事で、
更に詳細な領域属性の分類を行う事も可能である。Further, in the basic attribute detection processing of step S204, the rectangular labels are distinguished by the vertical / horizontal difference of the separator, the difference of the threshold value when obtaining figures, tables, etc.
It is also possible to perform more detailed classification of area attributes.

【００６９】また、同じくステップＳ２０４の基本属性
検出処理において、最初から文章のみなどの様に特定の
属性の画像しか入力しない場合は、予めその旨を指定す
ることで、処理を省略する事も可能である。Similarly, in the basic attribute detection processing of step S204, when only an image having a specific attribute is input from the beginning such as a sentence, the processing can be omitted by designating that fact in advance. Is.

【００７０】また、ステップＳ２０５の組方向の検出に
おいて、平均幅、平均高さを求める代わりにそれぞれの
最多値を求めることによって代替する事ができる。Further, in the detection of the set direction in step S205, instead of obtaining the average width and the average height, the maximum value of each can be obtained for substitution.

【００７１】また、同じくステップＳ２０５の組方向の
検出において、段落ごとに組方向を再度確認すれば、縦
書き文章と横書き文章が混在の原稿にも正確に処理を行
うことができる。Similarly, in the detection of the set direction in step S205, if the set direction is reconfirmed for each paragraph, it is possible to accurately perform processing even on a document in which vertical writing sentences and horizontal writing sentences are mixed.

【００７２】また、同じくステップＳ２０５の組方向の
検出において、最初から一定の組方向の文章しか入力し
ないならば、組方向を固定値とする事（不図示の操作パ
ネルから指定する）で代替する事ができる。Similarly, in the detection of the set direction in step S205, if only the text of a fixed set direction is input from the beginning, the set direction is fixed (specified from the operation panel (not shown)). I can do things.

【００７３】また、ステップＳ２０６の見出し類検出に
おいて、最初から見出し類のない画像しか入力しないな
らば、処理を省略する事も可能である。勿論、この場合
には、見出しが無しを操作部より指示しておく。Further, in the heading detection in step S206, if only an image without a heading is input from the beginning, the process can be omitted. Of course, in this case, it is instructed from the operation unit that there is no heading.

【００７４】また、ステップＳ２０７の矩形の合併にお
いて、図１２に示す様に、セパレータ等の矩形と接触し
た場合に、更に実際の縮小画素とも接触指定するか否か
を確認し、接触していなければ合併を実行する事で、傾
斜した画像にも対処できる。Further, in the merge of rectangles in step S207, as shown in FIG. 12, when a rectangle such as a separator is contacted, it is confirmed whether or not the contact is also designated with the actual reduced pixel, and the contact must be made. For example, by performing merger, it is possible to deal with tilted images.

【００７５】また、ステップＳ２０８の最後の出力の際
に、各種領域の矩形データを参照して、必要な領域の画
像データ（勿論、対応する原画像の領域）のみを出力す
ることも可能であり、記憶領域の更なる縮小や、処理時
間の更なる短縮がはかれる。例えば、文字認識装置に入
力画像データを出力するのであれば、本文であると判定
された画像のみをその装置に出力することが可能にな
る。尚、縮小画素の位置から、原画像中の位置を算出す
るのは、極めて簡単である。すなわち、縮小画素が原画
像のＮ×Ｍ画素領域に対応するのであれば、縮小画素の
位置（Ｘ，Ｙ）にそれぞれＮ、Ｍをかけあわせれば良
い。At the final output in step S208, it is also possible to refer to the rectangular data of various areas and output only the image data of the necessary area (of course, the area of the corresponding original image). Further, the storage area can be further reduced and the processing time can be further shortened. For example, if the input image data is output to the character recognition device, only the image determined to be the text can be output to the device. It is extremely easy to calculate the position in the original image from the position of the reduced pixel. That is, if the reduced pixel corresponds to the N × M pixel area of the original image, the positions (X, Y) of the reduced pixel may be multiplied by N and M, respectively.

【００７６】従って、例えば、実施例の装置を用いて、
それに画像データを供給し、所望とする種類の画像のみ
を出力させるようにも構成することができる。Therefore, for example, using the apparatus of the embodiment,
It can also be configured to supply image data to it and output only an image of a desired type.

【００７７】また、上記実施例では、領分割部１０５内
の各構成ユニットに独立したＣＰＵが備えられ、それぞ
れで処理が独立して行われるように説明したが、装置全
体を１つのＣＰＵで動作処理する場合にも適応できるの
は勿論である。Further, in the above-described embodiment, it has been described that each constituent unit in the area dividing unit 105 is provided with an independent CPU, and the processing is independently performed in each of the constituent units. However, the entire apparatus is operated by one CPU. Of course, it can be applied to the case of processing.

【００７８】尚、本発明は、複数の機器から構成される
システムに適用しても、１つの機器から成る装置に適用
しても良い。また、本発明はシステム或は装置にプログ
ラムを供給することによって達成される場合にも適用で
きる。The present invention may be applied to a system composed of a plurality of devices or an apparatus composed of one device. The present invention can also be applied to the case where it is achieved by supplying a program to a system or an apparatus.

【００７９】[0079]

【発明の効果】以上説明した様に本発明によれば、画像
データの領域判定を高速に、しかも、メモリ量を抑えな
がら行うことが可能になる。As described above, according to the present invention, it is possible to determine the area of image data at high speed while suppressing the amount of memory.

【００８０】[0080]

[Brief description of drawings]

【図１】実施例の画像処理装置の概略構成を示す図であ
る。FIG. 1 is a diagram illustrating a schematic configuration of an image processing apparatus according to an embodiment.

【図２】実施例の画像処理装置における画像処理を示す
フローチャートである。FIG. 2 is a flowchart showing image processing in the image processing apparatus of the embodiment.

【図３】実施例のラベリング処理を説明するための図で
ある。FIG. 3 is a diagram for explaining labeling processing according to the embodiment.

【図４】実施例の矩形データ構造を示す図である。FIG. 4 is a diagram showing a rectangular data structure of the embodiment.

【図５】実施例における１ラインにおける縮小画素デー
タのリンク構造と、各ラインのヘッダ部分のリンク構造
を示す図である。FIG. 5 is a diagram showing a link structure of reduced pixel data on one line and a link structure of a header portion of each line in the embodiment.

【図６】実施例の基本属性判別部の処理を示すフローチ
ャートである。FIG. 6 is a flowchart showing a process of a basic attribute determination unit of the embodiment.

【図７】実施例の矩形（領域）の属性の密度Ｄと面積Ｓ
での切り分けを示す図である。FIG. 7 is a density (D) and area (S) of attributes of rectangles (regions) in the embodiment
It is a figure which shows the division in.

【図８】実施例の組方向検出部の処理を示すフローチャ
ートである。FIG. 8 is a flowchart showing a process of a set direction detecting unit of the embodiment.

【図９】実施例の段落検出処理の例を示す図である。FIG. 9 is a diagram illustrating an example of paragraph detection processing according to the embodiment.

【図１０】実施例の見出し類検出部の処理を示すフロー
チャートである。FIG. 10 is a flowchart showing a process of a headline type detection unit of the embodiment.

【図１１】実施例の矩形の合併処理の例を示す図であ
る。FIG. 11 is a diagram illustrating an example of rectangle merging processing according to the embodiment.

【図１２】他の実施例のセパレータ矩形の合併処理の例
を示す図である。FIG. 12 is a diagram illustrating an example of a separator rectangle merging process according to another embodiment.

[Explanation of symbols]

１０１入力部１０２演算処理部（ＣＰＵ）１０３記憶部１０４出力部１０５領域分割部１０６画像縮小データ化部１０７ラベリング部１０８基本属性判別部１０９組方向検出部１１０見出し類検出部１１１矩形の合併部 101 input unit 102 arithmetic processing unit (CPU) 103 storage unit 104 output unit 105 region dividing unit 106 image reduction data converting unit 107 labeling unit 108 basic attribute discriminating unit 109 set direction detecting unit 110 heading detecting unit 111 rectangular merging unit

Claims

[Claims]

1. An image processing apparatus for determining the type of each area of given image data, comprising: an extracting means for extracting one pixel of low resolution from an m × n pixel area in the image data; An image characterized by including an area discriminating means for discriminating an area of an image type from the continuous spread of pixels, and a discriminating means for judging the type of the area from the area information discriminated by the area discriminating means. Processing equipment.

2. The image processing apparatus according to claim 1, wherein the m and n have larger values as the resolution of input image data is higher.

3. The extracting means extracts, when there is at least one significant pixel in the m × n pixel, a low-resolution pixel corresponding to the m × n pixel as a significant pixel. The image processing device according to item 1.

4. The image processing apparatus according to claim 3, wherein the extraction unit includes a calculation unit that calculates a logical sum of dot states of m × n pixels.

5. The image processing apparatus according to claim 1, wherein the area discriminating means discriminates a rectangle inscribed in a continuous spread of low-resolution pixels as an area.

6. The area discrimination means is means for obtaining a significant pixel position of each low resolution in line units in which a pixel group of m × n in given image data is one pixel, and in the main scanning direction. A memory for storing the start position of consecutive significant low-resolution pixels and the number of consecutive pixels for each scanning line, and storing predetermined discrimination information when there is no significant low-resolution pixel in the line of interest. Means, a searching means for searching a significant low-resolution two-dimensional continuous region in accordance with the contents stored in the storage means, and a label unique to the continuous low-resolution pixel group obtained by the search And a means for obtaining the coordinates of a rectangular area inscribed in the pixel group based on the coordinate position of the pixel group of the specific label.
An image processing apparatus according to the item.

7. The determining means determines the type of the area according to the vertical and horizontal sizes of the area determined by the area determining means and the ratio of the number of pixels of significant low resolution in the area. The image processing apparatus according to claim 1, wherein:

8. An image processing method for determining the type of each area of given image data, comprising an extraction step of extracting one pixel of low resolution from an m × n pixel area in the image data, and a low resolution. An image characterized by including an area discrimination step of discriminating an area of an image type from the continuous spread of pixels, and a discrimination step of judging the type of the area from the area information discriminated in the area discrimination step. Processing method.

9. The image processing method according to claim 8, wherein the m and n have larger values as the resolution of input image data is higher.

10. The extracting step extracts, when there is at least one significant pixel in m × n pixels, a low-resolution pixel for the m × n pixel as a significant pixel. Item 8. The image processing method according to Item 8.

11. The image processing method according to claim 10, wherein the extraction step includes calculation means for calculating a logical sum of dot states of m × n pixels.

12. The image processing method according to claim 8, wherein in the area determining step, a rectangle inscribed in a continuous spread of low resolution pixels is determined as an area.

13. The area determining step includes a step of obtaining a significant pixel position of each low resolution in line units in which an m × n pixel group in given image data is one pixel, and in the main scanning direction. A memory for storing the start position of consecutive significant low-resolution pixels and the number of consecutive pixels for each scanning line, and storing predetermined discrimination information when there is no significant low-resolution pixel in the line of interest. A process, a search process for searching a significant low-resolution two-dimensional continuous region in accordance with the contents stored in the storage process, and a unique label for the continuous low-resolution pixel group obtained by the search. And obtaining the coordinates of a rectangular area inscribed in the pixel group based on the coordinate position of the pixel group of the unique label.
The image processing method according to item 2.

14. The determining step determines the type of the area according to the vertical and horizontal sizes of the area determined in the area determining step and the ratio of the number of significant low-resolution pixels in the area. The image processing method according to claim 8, further comprising: