CN114708543A - Examination student positioning method in examination room monitoring video image - Google Patents

Examination student positioning method in examination room monitoring video image Download PDF

Info

Publication number
CN114708543A
CN114708543A CN202210629393.7A CN202210629393A CN114708543A CN 114708543 A CN114708543 A CN 114708543A CN 202210629393 A CN202210629393 A CN 202210629393A CN 114708543 A CN114708543 A CN 114708543A
Authority
CN
China
Prior art keywords
image data
examination room
video image
monitoring video
hair
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210629393.7A
Other languages
Chinese (zh)
Other versions
CN114708543B (en
Inventor
刘说
潘帆
李翔
赵启军
黄珂
杨玲
杨智鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu University of Information Technology
Original Assignee
Chengdu University of Information Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu University of Information Technology filed Critical Chengdu University of Information Technology
Priority to CN202210629393.7A priority Critical patent/CN114708543B/en
Publication of CN114708543A publication Critical patent/CN114708543A/en
Application granted granted Critical
Publication of CN114708543B publication Critical patent/CN114708543B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to the field of image processing, in particular to a method for positioning examinees in an examination room monitoring video image, which mainly comprises the steps of firstly carrying out frame selection marking based on head hair areas of the examinees on a large amount of examination room monitoring video image data containing different examination scenes and different examinees according to the ear visible conditions of the examinees in the examination room monitoring video image data, establishing an examinee head hair area data set, carrying out primary screening based on high false alarm rate target detection on the basis, finally establishing a model based on SSD deep learning target detection, positioning the head hair areas of the examinees, and finally realizing the positioning of the examinees.

Description

一种考场监控视频图像中考生定位方法A method for locating candidates in surveillance video images of examination room

技术领域technical field

本发明属于图像处理领域,具体涉及一种考场监控视频图像中考生定位方法。The invention belongs to the field of image processing, and in particular relates to a method for locating a candidate in a monitoring video image of an examination room.

背景技术Background technique

在世界范围内,考试一直作为重要的检验、选拔手段广泛使用,这是由于其在一定程度上能确保公平、公正。然而为了顺利通过考试,存在各种各样作弊手段,为了保证考试的公平、公正原则,考试监控系统大量应用于各类考试中。然而,考场拥有了视频监控系统,却并不意味着能很好的解决作弊问题。In the world, the examination has been widely used as an important test and selection method, because it can ensure fairness and justice to a certain extent. However, in order to successfully pass the exam, there are various cheating methods. In order to ensure the fairness and impartiality of the exam, the exam monitoring system is widely used in various exams. However, the fact that the examination room has a video surveillance system does not mean that it can solve the problem of cheating very well.

这是由于视频监控虽然能较为完整的记录考场信息,但是是否存在考试作弊行为,仍然需要相关部门投入大量的人力去对这些视频数据进行后期的处理和审查,其中很大比例的视频中是没有作弊行为的,但每一段视频都需要经过相关人员的仔细审查,由此产生了大量的工作量,由此产生了对考场监控视频中考生的行为进行自动识别的需求,而如何对考场监控视频中考生进行定位,则成为了一个必须解决的关键问题。This is because although video surveillance can record the information of the examination room relatively completely, whether there is cheating in the examination still requires relevant departments to invest a lot of manpower to process and review these video data in the later stage. cheating behavior, but each video needs to be carefully reviewed by relevant personnel, resulting in a lot of workload, which has created the need to automatically identify the behavior of candidates in the examination room surveillance video, and how to check the examination room surveillance video. The positioning of middle school students has become a key problem that must be solved.

对于考场监控视频检测定位方法大致可以分为基于背景差分的方法、基于模板匹配的方法、基于图像特征的方法,这些方法存在检测范围有限,对于考场布局的依赖性较大等问题。The detection and positioning methods of surveillance video in the examination room can be roughly divided into methods based on background difference, methods based on template matching, and methods based on image features. These methods have problems such as limited detection range and greater dependence on the layout of the examination room.

发明内容SUMMARY OF THE INVENTION

本发明针对上述现有技术的不足,提出了一种考场监控视频图像中考生定位方法,包括以下步骤:Aiming at the deficiencies of the above-mentioned prior art, the present invention proposes a method for locating a candidate in a monitoring video image of an examination room, comprising the following steps:

步骤1:对包含了不同考试场景、不同考生的大量考场监控视频图像数据进行基于考生头顶部头发区域的框选标记,然后建立考场监控视频图像数据的考生头顶部头发区域数据集;Step 1: Make a frame selection mark based on the hair area on the top of the examinee's head for a large number of examination room surveillance video image data that includes different examination scenes and different candidates, and then establish a data set of the examinee's head hair area of the examination room surveillance video image data;

步骤2:建立针对考场监控视频图像数据中考生头顶部头发区域定位的目标检测深度学习模型,首先对考场监控视频图像数据中的可能为头发的像素点进行筛选,得到预处理图像数据,然后在预处理图像数据上进行基于SSD的深度学习目标检测;Step 2: Establish a target detection deep learning model for locating the hair area on the top of the examinee's head in the surveillance video image data of the examination room. First, screen the pixels that may be hair in the surveillance video image data of the examination room to obtain the preprocessed image data, and then in the Perform SSD-based deep learning target detection on preprocessed image data;

步骤3:将建立的考场监控视频图像数据的考生头顶部头发区域数据集按比例进行划分,分别生成训练数据集和测试数据集,对建立的针对考场监控视频图像数据中考生头顶部头发区域定位的目标检测深度学习模型进行训练和测试,得到最终目标检测模型

Figure DEST_PATH_IMAGE001
;Step 3: Divide the data set of the hair area on the top of the candidate's head according to the proportion of the established monitoring video image data of the examination room, respectively generate a training data set and a testing data set, and locate the hair area on the top of the candidate's head in the established monitoring video image data for the examination room. The target detection deep learning model is trained and tested to obtain the final target detection model
Figure DEST_PATH_IMAGE001
;

步骤4:将考场监控视频初始图像数据输入到最终目标检测模型

Figure DEST_PATH_IMAGE002
中,然后得到对考场监控视频图像数据中的考生定位结果。Step 4: Input the initial image data of the surveillance video of the examination room into the final target detection model
Figure DEST_PATH_IMAGE002
, and then obtain the positioning results of the candidates in the monitoring video image data of the examination room.

进一步的,步骤1:对包含了不同考试场景、不同考生的大量考场监控视频图像数据进行基于考生头顶部头发区域的框选标记,具体方法为:对考场监控视频图像数据中的考生的头顶部头发区域进行基于考生耳朵显露情况及基于近似图像的方框标记。Further, step 1: carry out a frame selection mark based on the hair area on the top of the examinee's head for a large number of examination room surveillance video image data including different examination scenes and different candidates, and the specific method is: the examination room surveillance video image data. The hair area is box marked based on the exposure of the candidate's ears and based on the approximate image.

进一步的,对考场监控视频图像数据中的考生的头顶部头发区域进行基于考生耳朵显露情况方框标记,耳朵显露情况分为:两个耳朵显露,一个耳朵显露,没有显露耳朵。Further, the hair area on the top of the candidate's head in the monitoring video image data of the examination room is marked with a box based on the candidate's ear exposure. The ear exposure is divided into: two ears are exposed, one ear is exposed, and no ears are exposed.

进一步的,对考场监控视频图像数据中的考生的头顶部头发区域进行基于近似图像的方框标记,具体为生成的边框的水平和垂直边都平行于图像数据边缘。Further, the approximate image-based frame marking is performed on the hair area on the top of the examinee's head in the monitoring video image data of the examination room, specifically, the horizontal and vertical sides of the generated frame are parallel to the edge of the image data.

进一步的,视频图像数据中考生两个耳朵显露,则框选区域为:以边缘检测所得头发与额头交界的最低点为框选区域底部,以框选区域底部到边缘检测所得头发最顶部距离的

Figure DEST_PATH_IMAGE003
倍为框选高度,以头发左右两侧与背景交界最长距离的
Figure DEST_PATH_IMAGE004
倍为框选宽度,组成框选区域,变量
Figure DEST_PATH_IMAGE005
Figure DEST_PATH_IMAGE006
为加权系数。Further, if the two ears of the examinee are exposed in the video image data, the frame selection area is: take the lowest point of the junction between the hair and the forehead obtained by edge detection as the bottom of the frame selection area, and take the distance from the bottom of the frame selection area to the top of the hair obtained by edge detection.
Figure DEST_PATH_IMAGE003
times the height of the frame selection, the longest distance between the left and right sides of the hair and the background
Figure DEST_PATH_IMAGE004
times the width of the frame selection, forming a frame selection area, variable
Figure DEST_PATH_IMAGE005
and
Figure DEST_PATH_IMAGE006
is the weighting factor.

进一步的,视频图像数据中考生一个耳朵显露,则框选区域为:以头发与额头交界的最高点和最低点之间的中间值点为框选区域底部,以框选区域底部到头发最顶部距离的

Figure DEST_PATH_IMAGE007
倍为框选高度,以显露的一个耳朵与头发交界的边缘与另一侧头发与背景交界的边缘的最长距离的
Figure DEST_PATH_IMAGE008
倍为框选宽度,组成框选区域,变量
Figure DEST_PATH_IMAGE009
Figure DEST_PATH_IMAGE010
为加权系数。Further, if one ear of the examinee is exposed in the video image data, the frame selection area is: the middle value point between the highest point and the lowest point at the junction of the hair and the forehead is the bottom of the frame selection area, and the bottom of the frame selection area to the top of the hair. distance
Figure DEST_PATH_IMAGE007
times the height of the box selection, to the longest distance between the edge where one ear meets the hair and the edge where the other side meets the background.
Figure DEST_PATH_IMAGE008
times the width of the frame selection, forming a frame selection area, variable
Figure DEST_PATH_IMAGE009
and
Figure DEST_PATH_IMAGE010
is the weighting factor.

进一步的,视频图像数据中考生没有显露耳朵,以头发与额头交界的最高点为框选区域底部,以框选区域底部到头发最顶部的距离为框选高度,以额头在图像数据中显示的水平宽度为框选宽度,组成框选区域。Further, in the video image data, the candidates do not show their ears. The highest point at the junction of the hair and the forehead is the bottom of the frame selection area, the distance from the bottom of the frame selection area to the top of the hair is the frame selection height, and the forehead is displayed in the image data. The horizontal width is the width of the frame selection, forming the frame selection area.

进一步的,步骤2:对考场监控视频图像数据中的可能为头发的像素点进行筛选,得到预处理图像数据,具体方法为:首先对图像数据进行灰度化处理,得到灰度图像数据

Figure DEST_PATH_IMAGE011
;然后对灰度图像数据中的每个像素点的像素值按照
Figure DEST_PATH_IMAGE012
进行取反,得到灰度取反图像数据
Figure DEST_PATH_IMAGE013
,其中
Figure DEST_PATH_IMAGE014
Figure DEST_PATH_IMAGE015
为图像数据
Figure DEST_PATH_IMAGE016
Figure DEST_PATH_IMAGE017
中的横坐标为
Figure DEST_PATH_IMAGE018
,纵坐标为
Figure DEST_PATH_IMAGE019
的像素点的灰度值;对灰度取反图像数据
Figure 606426DEST_PATH_IMAGE017
进行高虚警率的CFAR目标检测,得到筛选后的图像数据,设置阈值
Figure DEST_PATH_IMAGE020
,对筛选后的图像数据进行二值化处理,得到预处理图像数据
Figure DEST_PATH_IMAGE021
。Further, step 2: screening the pixels that may be hair in the monitoring video image data of the examination room to obtain preprocessed image data, and the specific method is: first, perform grayscale processing on the image data to obtain grayscale image data
Figure DEST_PATH_IMAGE011
; Then the pixel value of each pixel in the grayscale image data is calculated according to
Figure DEST_PATH_IMAGE012
Invert to get grayscale inverse image data
Figure DEST_PATH_IMAGE013
,in
Figure DEST_PATH_IMAGE014
,
Figure DEST_PATH_IMAGE015
for image data
Figure DEST_PATH_IMAGE016
,
Figure DEST_PATH_IMAGE017
The abscissa in is
Figure DEST_PATH_IMAGE018
, the ordinate is
Figure DEST_PATH_IMAGE019
The gray value of the pixel point; invert the grayscale image data
Figure 606426DEST_PATH_IMAGE017
Perform CFAR target detection with high false alarm rate, obtain filtered image data, and set thresholds
Figure DEST_PATH_IMAGE020
, binarize the filtered image data to obtain preprocessed image data
Figure DEST_PATH_IMAGE021
.

进一步的,步骤2:在预处理图像数据上进行基于SSD的深度学习目标检测,具体为:将二值检测结果图像数据

Figure DEST_PATH_IMAGE022
作为索引图像,将索引图像中非0像素值像素点在图像中的坐标映射到相应考场监控视频图像数据上,以映射的像素点作为锚框中心点,基于SSD目标检测框架,建立针对考场监控视频图像数据头发区域定位的目标检测模型。Further, step 2: perform deep learning target detection based on SSD on the preprocessed image data, specifically: the binary detection result image data
Figure DEST_PATH_IMAGE022
As an index image, map the coordinates of pixels with non-zero pixel values in the index image to the corresponding monitoring video image data of the examination room, and use the mapped pixels as the center point of the anchor frame. Based on the SSD target detection framework, establish a monitoring system for the examination room. Object detection model for hair region localization in video image data.

进一步的,步骤4:将考场监控视频初始图像数据输入到最终目标检测模型

Figure DEST_PATH_IMAGE023
中,然后得到对考场监控视频图像数据中的考生定位结果,具体为:将考场监控视频初始图像数据输入到最终目标检测模型
Figure DEST_PATH_IMAGE024
中,得到头发区域框选结果,将每个框选区域向下扩展自身范围的
Figure DEST_PATH_IMAGE025
倍,得到更新的区域框选结果,将该更新的区域框选结果定为考生定位结果。Further, step 4: input the initial image data of the surveillance video of the examination room into the final target detection model
Figure DEST_PATH_IMAGE023
, and then obtain the location results of the candidates in the surveillance video image data of the examination room, specifically: input the initial image data of the surveillance video of the examination room into the final target detection model
Figure DEST_PATH_IMAGE024
, get the result of the hair area frame selection, and extend each frame selection area down to its own range
Figure DEST_PATH_IMAGE025
times, to obtain the updated area frame selection result, and set the updated area frame selection result as the candidate positioning result.

本发明解决了以下技术问题:The present invention solves the following technical problems:

1、提出一种根据考场监控视频图像数据中考生的耳朵可见情况,对考场监控视频图像数据进行基于考生头发区域的框选标记方法,提高了考生头发区域数据集的准确性和可靠性。1. This paper proposes a method of frame selection and marking based on the examinee's hair area for the examination room surveillance video image data according to the examinee's ear visibility in the examination room surveillance video image data, which improves the accuracy and reliability of the examinee's hair area data set.

2、通过对考场监控视频图像数据中的可能为头发的像素点进行基于高虚警率的目标检测的初步筛选,有效的提高了考生头发区域检测的准确性。2. Through the preliminary screening of the target detection based on the high false alarm rate for the pixels in the monitoring video image data of the examination room that may be hair, the accuracy of the examinee's hair area detection is effectively improved.

3、对考场监控视频图像数据的二值检测结果作为索引图像,以索引图像为基础进行锚框选择,在提高了考生头发区域检测的准确性的同时又降低了目标检测模型的复杂度。3. The binary detection result of the monitoring video image data of the examination room is used as the index image, and the anchor frame is selected based on the index image, which improves the accuracy of the examinee's hair area detection and reduces the complexity of the target detection model.

附图说明Description of drawings

图1为一种考场监控视频图像中考生定位方法流程图。Figure 1 is a flowchart of a method for locating candidates in a surveillance video image of an examination room.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整的描述,方法流程图如图1所示,包括以下步骤:The technical solutions in the embodiments of the present invention will be described clearly and completely below in conjunction with the accompanying drawings in the embodiments of the present invention. The flow chart of the method is shown in Figure 1 and includes the following steps:

一种考场监控视频图像中考生定位方法,包括:A method for locating candidates in a monitoring video image of an examination room, comprising:

步骤1:对包含了不同考试场景、不同考生的大量考场监控视频图像数据进行基于考生头顶部头发区域的框选标记,然后建立考场监控视频图像数据的考生头顶部头发区域数据集;Step 1: Make a frame selection mark based on the hair area on the top of the examinee's head for a large number of examination room surveillance video image data that includes different examination scenes and different candidates, and then establish a data set of the examinee's head hair area of the examination room surveillance video image data;

步骤2:建立针对考场监控视频图像数据中考生头顶部头发区域定位的目标检测深度学习模型,首先对考场监控视频图像数据中的可能为头发的像素点进行筛选,得到预处理图像数据,然后在预处理图像数据上进行基于SSD的深度学习目标检测;Step 2: Establish a target detection deep learning model for locating the hair area on the top of the examinee's head in the surveillance video image data of the examination room. First, screen the pixels that may be hair in the surveillance video image data of the examination room to obtain the preprocessed image data, and then in the Perform SSD-based deep learning target detection on preprocessed image data;

步骤3:将建立的考场监控视频图像数据的考生头顶部头发区域数据集按比例进行划分,分别生成训练数据集和测试数据集,对建立的针对考场监控视频图像数据中考生头顶部头发区域定位的目标检测深度学习模型进行训练和测试,得到最终目标检测模型

Figure DEST_PATH_IMAGE026
;Step 3: Divide the data set of the hair area on the top of the candidate's head according to the proportion of the established monitoring video image data of the examination room, respectively generate a training data set and a testing data set, and locate the hair area on the top of the candidate's head in the established monitoring video image data for the examination room. The target detection deep learning model is trained and tested to obtain the final target detection model
Figure DEST_PATH_IMAGE026
;

步骤4:将考场监控视频初始图像数据输入到最终目标检测模型

Figure 163047DEST_PATH_IMAGE002
中,然后得到对考场监控视频图像数据中的考生定位结果。Step 4: Input the initial image data of the surveillance video of the examination room into the final target detection model
Figure 163047DEST_PATH_IMAGE002
, and then obtain the positioning results of the candidates in the monitoring video image data of the examination room.

进一步的,步骤1:对包含了不同考试场景、不同考生的大量考场监控视频图像数据进行基于考生头顶部头发区域的框选标记,具体方法为:对考场监控视频图像数据中的考生的头顶部头发区域进行基于考生耳朵显露情况及基于近似图像的方框标记。Further, step 1: carry out a frame selection mark based on the hair area on the top of the examinee's head for a large number of examination room surveillance video image data including different examination scenes and different candidates, and the specific method is: the examination room surveillance video image data. The hair area is box marked based on the exposure of the candidate's ears and based on the approximate image.

进一步的,对考场监控视频图像数据中的考生的头顶部头发区域进行基于考生耳朵显露情况方框标记,耳朵显露情况分为:两个耳朵显露,一个耳朵显露,没有显露耳朵。Further, the hair area on the top of the candidate's head in the monitoring video image data of the examination room is marked with a box based on the candidate's ear exposure. The ear exposure is divided into: two ears are exposed, one ear is exposed, and no ears are exposed.

进一步的,对考场监控视频图像数据中的考生的头顶部头发区域进行基于近似图像的方框标记,具体为生成的边框的水平和垂直边都平行于图像数据边缘。Further, the approximate image-based frame marking is performed on the hair area on the top of the examinee's head in the monitoring video image data of the examination room, specifically, the horizontal and vertical sides of the generated frame are parallel to the edge of the image data.

进一步的,视频图像数据中考生两个耳朵显露,则框选区域为:以边缘检测所得头发与额头交界的最低点为框选区域底部,以框选区域底部到边缘检测所得头发最顶部距离的

Figure 867086DEST_PATH_IMAGE005
倍为框选高度,以头发左右两侧与背景交界最长距离的
Figure DEST_PATH_IMAGE027
倍为框选宽度,组成框选区域,变量
Figure 918087DEST_PATH_IMAGE005
Figure 466880DEST_PATH_IMAGE006
为加权系数。Further, if the two ears of the examinee are exposed in the video image data, the frame selection area is: take the lowest point of the junction between the hair and the forehead obtained by edge detection as the bottom of the frame selection area, and take the distance from the bottom of the frame selection area to the top of the hair obtained by edge detection.
Figure 867086DEST_PATH_IMAGE005
times the height of the frame selection, the longest distance between the left and right sides of the hair and the background
Figure DEST_PATH_IMAGE027
times the width of the frame selection, forming a frame selection area, variable
Figure 918087DEST_PATH_IMAGE005
and
Figure 466880DEST_PATH_IMAGE006
is the weighting factor.

进一步的,视频图像数据中考生一个耳朵显露,则框选区域为:以头发与额头交界的最高点和最低点之间的中间值点为框选区域底部,以框选区域底部到头发最顶部距离的

Figure DEST_PATH_IMAGE028
倍为框选高度,以显露的一个耳朵与头发交界的边缘与另一侧头发与背景交界的边缘的最长距离的
Figure DEST_PATH_IMAGE029
倍为框选宽度,组成框选区域,变量
Figure 86080DEST_PATH_IMAGE009
Figure DEST_PATH_IMAGE030
为加权系数。Further, if one ear of the examinee is exposed in the video image data, the frame selection area is: the middle value point between the highest point and the lowest point at the junction of the hair and the forehead is the bottom of the frame selection area, and the bottom of the frame selection area to the top of the hair. distance
Figure DEST_PATH_IMAGE028
times the height of the box selection, to the longest distance between the edge where one ear meets the hair and the edge where the other side meets the background.
Figure DEST_PATH_IMAGE029
times the width of the frame selection, forming a frame selection area, variable
Figure 86080DEST_PATH_IMAGE009
and
Figure DEST_PATH_IMAGE030
is the weighting factor.

进一步的,视频图像数据中考生没有显露耳朵,以头发与额头交界的最高点为框选区域底部,以框选区域底部到头发最顶部的距离为框选高度,以额头在图像数据中显示的水平宽度为框选宽度,组成框选区域。Further, in the video image data, the candidates do not show their ears. The highest point at the junction of the hair and the forehead is the bottom of the frame selection area, the distance from the bottom of the frame selection area to the top of the hair is the frame selection height, and the forehead is displayed in the image data. The horizontal width is the width of the frame selection, forming the frame selection area.

进一步的,步骤2:对考场监控视频图像数据中的可能为头发的像素点进行筛选,得到预处理图像数据,具体方法为:首先对图像数据进行灰度化处理,得到灰度图像数据

Figure 274485DEST_PATH_IMAGE016
;然后对灰度图像数据中的每个像素点的像素值按照
Figure DEST_PATH_IMAGE031
进行取反,得到灰度取反图像数据
Figure 120389DEST_PATH_IMAGE017
,其中
Figure DEST_PATH_IMAGE032
Figure DEST_PATH_IMAGE033
为图像数据
Figure 382743DEST_PATH_IMAGE016
Figure 297478DEST_PATH_IMAGE017
中的横坐标为
Figure DEST_PATH_IMAGE034
,纵坐标为
Figure DEST_PATH_IMAGE035
的像素点的灰度值;对灰度取反图像数据
Figure 442020DEST_PATH_IMAGE017
进行高虚警率的CFAR目标检测,得到筛选后的图像数据,设置阈值
Figure DEST_PATH_IMAGE036
,对筛选后的图像数据进行二值化处理,得到预处理图像数据
Figure 585557DEST_PATH_IMAGE022
。Further, step 2: screening the pixels that may be hair in the monitoring video image data of the examination room to obtain preprocessed image data, and the specific method is: first, perform grayscale processing on the image data to obtain grayscale image data
Figure 274485DEST_PATH_IMAGE016
; Then the pixel value of each pixel in the grayscale image data is calculated according to
Figure DEST_PATH_IMAGE031
Invert to get grayscale inverse image data
Figure 120389DEST_PATH_IMAGE017
,in
Figure DEST_PATH_IMAGE032
,
Figure DEST_PATH_IMAGE033
for image data
Figure 382743DEST_PATH_IMAGE016
,
Figure 297478DEST_PATH_IMAGE017
The abscissa in is
Figure DEST_PATH_IMAGE034
, the ordinate is
Figure DEST_PATH_IMAGE035
The gray value of the pixel point; invert the grayscale image data
Figure 442020DEST_PATH_IMAGE017
Perform CFAR target detection with high false alarm rate, obtain filtered image data, and set thresholds
Figure DEST_PATH_IMAGE036
, binarize the filtered image data to obtain preprocessed image data
Figure 585557DEST_PATH_IMAGE022
.

进一步的,步骤2:在预处理图像数据上进行基于SSD的深度学习目标检测,具体为:将二值检测结果图像数据

Figure 33243DEST_PATH_IMAGE022
作为索引图像,将索引图像中非0像素值像素点在图像中的坐标映射到相应考场监控视频图像数据上,以映射的像素点作为锚框中心点,基于SSD目标检测框架,建立针对考场监控视频图像数据头发区域定位的目标检测模型。Further, step 2: perform deep learning target detection based on SSD on the preprocessed image data, specifically: the binary detection result image data
Figure 33243DEST_PATH_IMAGE022
As an index image, map the coordinates of pixels with non-zero pixel values in the index image to the corresponding monitoring video image data of the examination room, and use the mapped pixels as the center point of the anchor frame. Based on the SSD target detection framework, establish a monitoring system for the examination room. Object detection model for hair region localization in video image data.

进一步的,步骤4:将考场监控视频初始图像数据输入到最终目标检测模型

Figure 197508DEST_PATH_IMAGE023
中,然后得到对考场监控视频图像数据中的考生定位结果,具体为:将考场监控视频初始图像数据输入到最终目标检测模型
Figure 360505DEST_PATH_IMAGE024
中,得到头发区域框选结果,将每个框选区域向下扩展自身范围的
Figure DEST_PATH_IMAGE037
倍,得到更新的区域框选结果,将该更新的区域框选结果定为考生定位结果。Further, step 4: input the initial image data of the surveillance video of the examination room into the final target detection model
Figure 197508DEST_PATH_IMAGE023
, and then obtain the location results of the candidates in the surveillance video image data of the examination room, specifically: input the initial image data of the surveillance video of the examination room into the final target detection model
Figure 360505DEST_PATH_IMAGE024
, get the result of the hair area frame selection, and extend each frame selection area down to its own range
Figure DEST_PATH_IMAGE037
times, to obtain the updated area frame selection result, and set the updated area frame selection result as the candidate positioning result.

显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的其他实施例,都属于本发明保护的范围。Obviously, the described embodiments are only some, but not all, embodiments of the present invention. Based on the embodiments of the present invention, other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

Claims (10)

1. A method for positioning examinees in an examination room monitoring video image is characterized by comprising the following steps:
step 1: performing frame selection marking based on the head hair area of the examinee on a large amount of examination room monitoring video image data containing different examination scenes and different examinees, and then establishing an examinee head hair area data set of the examination room monitoring video image data;
step 2: establishing a target detection deep learning model for positioning a hair area at the top of an examinee in examination room monitoring video image data, firstly screening pixel points which are possibly hairs in the examination room monitoring video image data to obtain preprocessed image data, and then carrying out deep learning target detection based on SSD on the preprocessed image data;
and step 3: dividing an examinee's head top hair region data set of the established examination room monitoring video image data in proportion to generate a training data set and a testing data set respectively, training and testing the established target detection depth learning model for positioning the examinee's head top hair region in the examination room monitoring video image data to obtain a final target detection model
Figure 669108DEST_PATH_IMAGE001
And 4, step 4: inputting initial image data of examination room monitoring video into final target detection model
Figure 587385DEST_PATH_IMAGE002
Then, the examinee positioning result in the examination room monitoring video image data is obtained.
2. The method for locating the examinee in the examination room monitoring video image according to claim 1, characterized in that the step 1: the method comprises the following steps of performing frame selection marking based on the head hair area of an examinee on a large amount of examination room monitoring video image data containing different examination scenes and different examinees, and specifically comprises the following steps: and carrying out box marking based on the ear exposure condition of the examinee and based on the approximate image on the head and top hair area of the examinee in the examination room monitoring video image data.
3. The method according to claim 2, wherein the examination room monitoring video image is characterized in that the examination room monitoring video image data is subjected to examination room ear exposure condition-based frame marking on the top of the head and hair area of the examinee, and the ear exposure condition is divided into: two ears are exposed, one ear is exposed, and no ear is exposed.
4. The method according to claim 2, wherein the top hair area of the head of the examinee in the examinee surveillance video image data is marked by a box based on an approximate image, and the horizontal and vertical edges of the generated frame are parallel to the edges of the image data.
5. The method according to claim 3, wherein if two ears of the examinee are exposed in the video image data, the frame selection area is: using the lowest point of the boundary between the hair and the forehead obtained by the edge detection as the bottom of the frame selection region, and the distance from the bottom of the frame selection region to the top of the hair obtained by the edge detection
Figure 626361DEST_PATH_IMAGE003
Selecting the height as the frame, and taking the longest distance between the left and right sides of the hair and the background
Figure 251377DEST_PATH_IMAGE004
Doubling the frame selection width to form frame selection area and variable
Figure 579590DEST_PATH_IMAGE005
And
Figure 860530DEST_PATH_IMAGE006
are weighting coefficients.
6. The method of claim 3, wherein if an ear of the test taker in the video image data is exposed, the selection area is: using the middle point between the highest point and the lowest point of the junction between the hair and the forehead as the bottom of the framed selection area, and the distance from the bottom of the framed selection area to the top of the hair
Figure 768443DEST_PATH_IMAGE007
Height is selected as the longest distance between the edge of one exposed ear and hair and the edge of the other side hair and background
Figure 575862DEST_PATH_IMAGE008
Doubling the frame selection width to form frame selection area and variable
Figure 950343DEST_PATH_IMAGE009
And
Figure 780896DEST_PATH_IMAGE010
are weighting coefficients.
7. The method as claimed in claim 3, wherein the examinee is located in the video image data without exposing ears, the highest point of the boundary between the hair and the forehead is the bottom of the frame selection area, the distance from the bottom of the frame selection area to the top of the hair is the frame selection height, and the horizontal width of the forehead displayed in the image data is the frame selection width, thereby forming the frame selection area.
8. The method for positioning the examinee in the examination room monitoring video image according to claim 1, characterized in that the step 2: screening pixel points which are possibly hairs in examination room monitoring video image data to obtain preprocessed image data, wherein the specific method comprises the following steps: firstly, the image data is grayed to obtain the grayscale image data
Figure 289237DEST_PATH_IMAGE011
(ii) a Then, the pixel value of each pixel point in the gray image data is calculated according to
Figure 888846DEST_PATH_IMAGE012
Performing inversion to obtain grayscale inverted image data
Figure 434228DEST_PATH_IMAGE013
Wherein
Figure 814394DEST_PATH_IMAGE014
Figure 798530DEST_PATH_IMAGE015
As image data
Figure 455908DEST_PATH_IMAGE016
Figure 296825DEST_PATH_IMAGE017
On the abscissa of
Figure 101970DEST_PATH_IMAGE018
On the ordinate of
Figure 561901DEST_PATH_IMAGE019
The gray value of the pixel point; inverting gray scale image data
Figure 198419DEST_PATH_IMAGE017
CFAR target detection with high false alarm rate is carried out to obtain screened image data, and a threshold value is set
Figure 413499DEST_PATH_IMAGE020
Carrying out binarization processing on the screened image data to obtain preprocessed image data
Figure 112465DEST_PATH_IMAGE021
9. The method for locating the examinee in the examination room monitoring video image according to claim 1, characterized in that the step 2: performing SSD-based deep learning target detection on the preprocessed image data, specifically: image data of binary detection result
Figure 766300DEST_PATH_IMAGE022
As an index image, mapping the coordinates of non-0 pixel value pixel points in the index image to corresponding examination room monitoring video image data, and using the mapped pixel points as the index imageAnd establishing a target detection model for positioning the hair region of the examination room monitoring video image data based on an SSD target detection framework as the center point of the anchor frame.
10. The method for locating the examinee in the examination room monitoring video image according to claim 1, characterized in that the step 4: inputting initial image data of examination room monitoring video into final target detection model
Figure 195008DEST_PATH_IMAGE023
Then, obtaining the examinee positioning result in the examination room monitoring video image data, specifically: inputting initial image data of examination room monitoring video into final target detection model
Figure 253093DEST_PATH_IMAGE001
In the method, a hair region frame selection result is obtained, and each frame selection region is expanded downwards to the self range
Figure 95147DEST_PATH_IMAGE024
And obtaining an updated region frame selection result, and determining the updated region frame selection result as a test taker positioning result.
CN202210629393.7A 2022-06-06 2022-06-06 A method for locating candidates in surveillance video images of examination room Active CN114708543B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210629393.7A CN114708543B (en) 2022-06-06 2022-06-06 A method for locating candidates in surveillance video images of examination room

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210629393.7A CN114708543B (en) 2022-06-06 2022-06-06 A method for locating candidates in surveillance video images of examination room

Publications (2)

Publication Number Publication Date
CN114708543A true CN114708543A (en) 2022-07-05
CN114708543B CN114708543B (en) 2022-08-30

Family

ID=82177605

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210629393.7A Active CN114708543B (en) 2022-06-06 2022-06-06 A method for locating candidates in surveillance video images of examination room

Country Status (1)

Country Link
CN (1) CN114708543B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116310806A (en) * 2023-02-28 2023-06-23 北京理工大学珠海学院 An integrated management system and method for smart agriculture based on image recognition

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5430809A (en) * 1992-07-10 1995-07-04 Sony Corporation Human face tracking system
CN105678213A (en) * 2015-12-20 2016-06-15 华南理工大学 Dual-mode masked man event automatic detection method based on video characteristic statistics
CN106991360A (en) * 2016-01-20 2017-07-28 腾讯科技(深圳)有限公司 Face identification method and face identification system
CN107451555A (en) * 2017-07-27 2017-12-08 安徽慧视金瞳科技有限公司 A kind of hair based on gradient direction divides to determination methods
CN108260918A (en) * 2018-02-09 2018-07-10 武汉技兴科技有限公司 A kind of human hair information collecting method, device and intelligent clipping device
CN109711377A (en) * 2018-12-30 2019-05-03 陕西师范大学 A method of positioning and counting candidates in a single-frame image of standardized examination room monitoring
CN112686965A (en) * 2020-12-25 2021-04-20 百果园技术(新加坡)有限公司 Skin color detection method, device, mobile terminal and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5430809A (en) * 1992-07-10 1995-07-04 Sony Corporation Human face tracking system
CN105678213A (en) * 2015-12-20 2016-06-15 华南理工大学 Dual-mode masked man event automatic detection method based on video characteristic statistics
CN106991360A (en) * 2016-01-20 2017-07-28 腾讯科技(深圳)有限公司 Face identification method and face identification system
CN107451555A (en) * 2017-07-27 2017-12-08 安徽慧视金瞳科技有限公司 A kind of hair based on gradient direction divides to determination methods
CN108260918A (en) * 2018-02-09 2018-07-10 武汉技兴科技有限公司 A kind of human hair information collecting method, device and intelligent clipping device
CN109711377A (en) * 2018-12-30 2019-05-03 陕西师范大学 A method of positioning and counting candidates in a single-frame image of standardized examination room monitoring
CN112686965A (en) * 2020-12-25 2021-04-20 百果园技术(新加坡)有限公司 Skin color detection method, device, mobile terminal and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
XIAOFEI PENG等: "Teaching Assistant and Class Attendance Analysis Using Surveillance Camera", 《IFTC 2018: DIGITAL TV AND MULTIMEDIA COMMUNICATION》 *
王红梅: "基于视频的人体检测与计数技术研究", 《中国优秀博硕士学位论文全文数据库(硕士)信息科技辑》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116310806A (en) * 2023-02-28 2023-06-23 北京理工大学珠海学院 An integrated management system and method for smart agriculture based on image recognition
CN116310806B (en) * 2023-02-28 2023-08-29 北京理工大学珠海学院 Intelligent agriculture integrated management system and method based on image recognition

Also Published As

Publication number Publication date
CN114708543B (en) 2022-08-30

Similar Documents

Publication Publication Date Title
CN110705583B (en) Cell detection model training method, device, computer equipment and storage medium
CN111260645B (en) Tampered image detection method and system based on block classification deep learning
CN112712273B (en) Handwriting Chinese character aesthetic degree judging method based on skeleton similarity
CN109635875A (en) A kind of end-to-end network interface detection method based on deep learning
CN101615292B (en) Accurate positioning method for human eye on the basis of gray gradation information
CN110689000B (en) Vehicle license plate recognition method based on license plate sample generated in complex environment
CN105869085A (en) Transcript inputting system and method for processing images
CN111753873A (en) An image detection method and device
CN108257125B (en) Depth image quality non-reference evaluation method based on natural scene statistics
CN116433651B (en) Small sample panel defect detection method, system, equipment and storage medium
CN110400293A (en) A no-reference image quality assessment method based on deep forest classification
CN106340000A (en) Bone age assessment method
CN106960433B (en) A full-reference sonar image quality evaluation method based on image entropy and edge
CN113012167A (en) Combined segmentation method for cell nucleus and cytoplasm
CN116596877B (en) A Bone Age Intelligent Assessment Method Based on Attention Mechanism
CN102081742B (en) Method for automatically evaluating writing ability
CN115345858A (en) Method, system and terminal device for evaluating quality of echocardiogram
CN111126330A (en) A method for locating the center of the pupil membrane and a method for detecting the fatigue degree of students listening to a lecture
CN104751160B (en) Galactophore image processing method based on sparse autocoding depth network
CN108764029A (en) Model intelligence inspection method
CN114708543A (en) Examination student positioning method in examination room monitoring video image
CN120450982B (en) Lung cancer image enhancement method based on generative adversarial network
CN115311447A (en) Pointer instrument indicating number identification method based on deep convolutional neural network
KR20200005853A (en) Method and System for People Count based on Deep Learning
CN116708759A (en) Method for detecting abnormal deflection of camera

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant