CN120220116B - A control system and method based on unmanned road patrol tow truck - Google Patents

A control system and method based on unmanned road patrol tow truck

Info

Publication number
CN120220116B
CN120220116B CN202510381147.8A CN202510381147A CN120220116B CN 120220116 B CN120220116 B CN 120220116B CN 202510381147 A CN202510381147 A CN 202510381147A CN 120220116 B CN120220116 B CN 120220116B
Authority
CN
China
Prior art keywords
confidence score
dimensional
vehicle
data
point cloud
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202510381147.8A
Other languages
Chinese (zh)
Other versions
CN120220116A (en
Inventor
包亦轩
陈雷翔
厉质彬
王书琪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujian Pingtan Ruiqian Intelligent Technology Co ltd
Original Assignee
Fujian Pingtan Ruiqian Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujian Pingtan Ruiqian Intelligent Technology Co ltd filed Critical Fujian Pingtan Ruiqian Intelligent Technology Co ltd
Priority to CN202510381147.8A priority Critical patent/CN120220116B/en
Publication of CN120220116A publication Critical patent/CN120220116A/en
Application granted granted Critical
Publication of CN120220116B publication Critical patent/CN120220116B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/56Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G06V20/58Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Image Analysis (AREA)
  • Control Of Position, Course, Altitude, Or Attitude Of Moving Bodies (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention provides a control system and a control method based on an unmanned road guard patrol wrecker, which relate to the technical field of automatic wrecker, and comprise a multi-mode sensing module consisting of an image sensor, a laser radar and a depth sensor, wherein two-dimensional image data, three-dimensional point cloud data and three-dimensional depth data are acquired, data characteristic fusion is carried out by utilizing a layered detection module combining a double-flow depth neural network and a self-attention mechanism, multi-level grading is output, accurate assessment of vehicle damage and intelligent selection of wrecker modes are realized, and a system is continuously optimized through self-adaptive learning. According to the invention, through the multi-sensor cooperation, double-flow deep neural network feature extraction and self-attention dynamic fusion mechanism, high-precision vehicle state analysis and intelligent obstacle clearance decision are realized, intelligent accurate judgment on road fault vehicle damage conditions is realized, secondary damage risks caused by damage evaluation errors are reduced, the frequency of manual intervention is reduced, and the efficiency of road obstacle clearance operation is remarkably improved.

Description

一种基于无人道路值守巡逻清障车的控制系统及方法A control system and method based on unmanned road patrol tow truck

技术领域Technical Field

本发明涉及自动清障技术领域,尤其涉及一种基于无人道路值守巡逻清障车的控制系统及方法。The present invention relates to the technical field of automatic obstacle removal, and in particular to a control system and method based on an unmanned road patrol obstacle removal vehicle.

背景技术Background Art

随着人工智能技术、自动驾驶技术以及智慧交通技术的飞速发展,以无人车辆为核心的智慧化、自动化道路管理系统日益受到关注。其中,无人道路值守巡逻清障车作为一种能够自主巡逻、及时发现道路故障车辆,并快速实施有效处置的设备,已成为智能交通和智慧城市的重要发展方向之一。清障车通常根据车辆的受损情况选择清障方式,如针对受损较轻的故障车辆,清障车通过连接车辆的拖车钩将其牵引至承托部;针对受损严重的故障车辆,清障车通过吊装机构连接故障车辆的四轮,将其吊装至承托部,或通过托举的方式,将车辆的一端抬起并固定,直接拖拽进行清障。With the rapid development of artificial intelligence, autonomous driving, and smart transportation technologies, intelligent and automated road management systems centered around unmanned vehicles are gaining increasing attention. Among them, unmanned road patrol tow trucks, capable of autonomous patrols, timely detection of roadside vehicles, and rapid and effective disposal, have become a key development direction for intelligent transportation and smart cities. Tow trucks typically choose a towing method based on the damage to the vehicle. For example, for a minor tow truck, the tow truck tows the vehicle to a support structure using a tow hook. For a severely damaged tow truck, the tow truck connects to the vehicle's four wheels via a lifting mechanism and hoists it to a support structure. Alternatively, the tow truck lifts and secures one end of the vehicle through a lifting mechanism and then directly tows it for towing.

近年来,研究人员提出了一些自动化或半自动化的清障技术方案,依赖摄像头或激光雷达,通过光学视觉识别车辆外观特征进行识别,然而,当前的无人清障系统普遍存在识别准确性不高、数据处理方法单一等问题,对复杂环境中的车辆受损情况难以做出准确、有效的评估,导致在实际清障作业过程中往往难以避免人工干预。同时,由于传统的无人清障车在数据处理和决策方面缺乏智能性,无法准确区分不同损伤程度对应的清障方式(如牵引、托举或吊装),经常出现误判,从而降低了清障工作的效率,甚至引发二次损坏事故。In recent years, researchers have proposed a number of automated or semi-automated tow truck removal solutions that rely on cameras or lidar to identify vehicles through optical visual recognition of their exterior features. However, current unmanned tow truck systems generally suffer from low recognition accuracy and limited data processing methods. This makes it difficult to accurately and effectively assess vehicle damage in complex environments, making it difficult to avoid manual intervention during actual tow truck removal operations. Furthermore, due to the lack of intelligence in data processing and decision-making, traditional unmanned tow trucks are unable to accurately distinguish between appropriate towing methods (such as towing, lifting, or hoisting) for different degrees of damage. This leads to frequent misjudgments, which reduces tow truck efficiency and can even cause secondary damage accidents.

因此,为满足现代交通管理对清障作业高效性、智能性与可靠性的更高要求,有必要研发一种具备更精确的损伤评估能力、更灵活的作业决策方式以及更智能的数据融合处理机制的无人清障控制系统及方法。Therefore, in order to meet the higher requirements of modern traffic management for efficiency, intelligence and reliability of obstacle removal operations, it is necessary to develop an unmanned obstacle removal control system and method with more accurate damage assessment capabilities, more flexible operation decision-making methods and more intelligent data fusion processing mechanisms.

发明内容Summary of the Invention

为了克服现有技术的缺陷,本发明所要解决的技术问题在于提出一种基于无人道路值守巡逻清障车的控制系统及方法,采用以下技术方案:In order to overcome the defects of the prior art, the technical problem to be solved by the present invention is to propose a control system and method based on an unmanned road patrol tow truck, which adopts the following technical solutions:

本发明一方面提供了一种基于无人道路值守巡逻清障车的控制系统,包括:On one hand, the present invention provides a control system based on an unmanned road patrol tow truck, comprising:

多模态感知模块,至少包括:The multimodal perception module includes at least:

图像传感器,用于获取故障车辆的二维图像数据,并通过OCR提取车身标识信息;Image sensor, used to obtain two-dimensional image data of the faulty vehicle and extract vehicle body identification information through OCR;

激光传感器,用于生成故障车辆外观的三维点云数据;A laser sensor for generating three-dimensional point cloud data of the faulty vehicle's appearance;

深度传感器,用于生成故障车辆局部外观的三维深度数据;A depth sensor for generating three-dimensional depth data of the local appearance of the faulty vehicle;

分层检测模块,包括双流架构的深度神经网络计算模型:Hierarchical detection module, including a deep neural network computing model with a two-stream architecture:

第一流网络,采用卷积神经网络提取上述二维图像数据的图像特征向量Fimg,输出第一置信度评分C1The first stream network uses a convolutional neural network to extract the image feature vector F img of the two-dimensional image data and output a first confidence score C 1 ;

第二流网络,采用点云处理算法,提取上述三维点云数据的点云特征向量Fpc和三维深度数据的深度特征向量Fdep;叠加融合上述二维图像数据和三维点云数据,输出第二置信度评分C2;若上述第二置信度评分C2≤85,则更新融合上述三维点云数据以及三维深度数据,输出第三置信度评分C3The second stream network uses a point cloud processing algorithm to extract the point cloud feature vector Fpc of the 3D point cloud data and the depth feature vector Fdep of the 3D depth data; superimposes and fuses the 2D image data and the 3D point cloud data, and outputs a second confidence score C2 ; if the second confidence score C2 is ≤85, the 3D point cloud data and the 3D depth data are updated and fused, and a third confidence score C3 is output;

特征融合单元,采用自注意力机制加权融合上述图像特征向量Fimg、点云特征向量Fpc以及深度特征向量Fdep,生成融合特征向量FfusionThe feature fusion unit uses a self-attention mechanism to weightedly fuse the image feature vector F img , the point cloud feature vector F pc , and the depth feature vector F dep to generate a fused feature vector F fusion ;

控制处理单元,与上述分层检测模块相连,并根据预设阈值规则选择清障模式,当上述第三置信度评分C3≤85,请求人工介入。The control processing unit is connected to the hierarchical detection module and selects the obstacle removal mode according to a preset threshold rule. When the third confidence score C 3 ≤85, manual intervention is requested.

作进一步改进的,上述图像传感器扫描故障车辆的车头与车尾,上述第一置信度评分C1的计算公式为:As a further improvement, the image sensor scans the front and rear of the faulty vehicle, and the calculation formula of the first confidence score C1 is:

;

其中,Fimg表示第一流卷积神经网络输出的图像特征向量;Where F img represents the image feature vector output by the first-stream convolutional neural network;

W1为权重矩阵,b1为偏置项;W 1 is the weight matrix, b 1 is the bias term;

σ(·)为Sigmoid激活函数,使上述第一置信度评分C1映射至0~100的范围内。σ(·) is the Sigmoid activation function, which maps the above first confidence score C1 to the range of 0~100.

扫描故障车辆的车头与车尾,通常情况下车头与车尾标识有车辆品牌Logo、车辆型号等信息,图像传感器一方面判断车头与车尾的损伤情况,另一方面通过OCR采集故障车辆的车辆信息Scan the front and rear of the faulty vehicle. Usually, the front and rear of the vehicle are marked with information such as the vehicle brand logo and vehicle model. The image sensor determines the damage to the front and rear of the vehicle on the one hand, and collects the vehicle information of the faulty vehicle through OCR on the other hand.

作进一步改进的,上述激光传感器扫描故障车辆的车身两侧及车顶,上述第二置信度评分C2的计算公式为:As a further improvement, the laser sensor scans both sides and the roof of the faulty vehicle, and the calculation formula of the second confidence score C2 is:

;

其中,W2为权重矩阵,b2为偏置项;Among them, W 2 is the weight matrix and b 2 is the bias term;

σ(·)为Sigmoid激活函数,使上述第二置信度评分C2映射至0~100的范围内;σ(·) is the Sigmoid activation function, which maps the second confidence score C 2 to the range of 0~100;

融合特征向量Ffusion(img,pc)由上述二维图像数据与三维点云数据经过自注意力机制融合生成,具体公式为:The fusion feature vector F fusion (img, pc) is generated by fusing the above two-dimensional image data and three-dimensional point cloud data through the self-attention mechanism. The specific formula is:

;

上述权重系数γ 1γ 2满足γ 1+γ 2=1,且当第一置信度评分C1>75时,即故障车辆的车头与车尾受损较轻时,降低权重系数γ 1,提高权重系数γ 2The weight coefficients γ1 and γ2 satisfy γ1 + γ2 = 1, and when the first confidence score C1 > 75 , that is , when the front and rear of the faulty vehicle are slightly damaged , the weight coefficient γ1 is reduced and the weight coefficient γ2 is increased .

作进一步改进的,上述深度传感器扫描故障车辆的车轮和轮轴区域,上述第三置信度评分C3的计算公式为:As a further improvement, the depth sensor scans the wheel and axle area of the faulty vehicle, and the calculation formula of the third confidence score C3 is:

;

其中,W3为权重矩阵,b3为偏置项;Among them, W 3 is the weight matrix and b 3 is the bias term;

σ(·)为Sigmoid激活函数,使上述第三置信度评分C3映射至0~100的范围内;σ(·) is the Sigmoid activation function, which maps the third confidence score C 3 to the range of 0~100;

融合特征向量Ffusion(pc,dep)由上述二维图像数据与三维点云数据经过自注意力机制融合生成,具体公式为:The fusion feature vector F fusion (pc, dep) is generated by fusing the above two-dimensional image data and three-dimensional point cloud data through the self-attention mechanism. The specific formula is:

;

其中,M为掩码矩阵,通过掩码矩阵M将上述三维点云数据与上述三维深度数据的重叠部分更新为上述三维深度数据;Wherein, M is a mask matrix, and the overlapping part of the above-mentioned three-dimensional point cloud data and the above-mentioned three-dimensional depth data is updated to the above-mentioned three-dimensional depth data through the mask matrix M;

上述权重系数γ 2´和γ 3满足γ 2´+γ 3=1,且权重系数γ 3γ 2´。The above weight coefficients γ 2 ´ and γ 3 satisfy γ 2 ´ + γ 3 = 1, and the weight coefficient γ 3 > γ 2 ´.

作进一步改进的,还包括自适应学习模块,与上述控制处理单元相连,通过离线学习和增量学习优化上述特征融合单元中的权重系数。As a further improvement, it also includes an adaptive learning module, which is connected to the above-mentioned control processing unit and optimizes the weight coefficients in the above-mentioned feature fusion unit through offline learning and incremental learning.

作进一步改进的,上述权重系数的离线学习采用最小化均方误差的损失函数:As a further improvement, the offline learning of the above weight coefficients adopts the loss function of minimizing the mean square error:

;

其中,Ftarget,i为人工介入后标注的车辆状态的标准特征向量。Among them, F target,i is the standard feature vector of the vehicle state marked after manual intervention.

作进一步改进的,上述权重系数的增量学习方法为:As a further improvement, the incremental learning method of the above weight coefficients is:

当近期系统检测误差出现持续增加趋势时,采用以下规则更新权重系数:When the recent system detection error shows a continuous increasing trend, the weight coefficient is updated using the following rules:

;

其中,Accuracy为最近5次任务中,系统与测评分与人工核查评分之间误差在±5以内的任务占比;Accuracy is the percentage of tasks in the last five tasks where the error between the system and the test score and the manual verification score is within ±5;

Δ为学习速率,初始限定于0.01~0.05之间;Δ is the learning rate, initially limited to 0.01~0.05;

当Accuracy≥95%时,停止权重系数更新,以保持系统的参数稳定性。When Accuracy ≥ 95%, the weight coefficient update is stopped to maintain the parameter stability of the system.

作进一步改进的,上述控制处理单元选择清障模式的预设规则为:As a further improvement, the preset rule for the control processing unit to select the obstacle removal mode is:

当上述第一置信度评分C1>75且第二置信度评分C2>85时,控制执行牵引模式,以不超过20km/h速度牵引故障车辆;When the first confidence score C 1 is greater than 75 and the second confidence score C 2 is greater than 85, the control executes the towing mode to tow the faulty vehicle at a speed not exceeding 20 km/h;

当第一置信度评分C1>75且第二置信度评分C2≤85时,若上述第三置信度评分C3>85则控制执行托举模式,若第三置信度评分C3≤85则请求人工介入;When the first confidence score C 1 >75 and the second confidence score C 2 ≤85, if the third confidence score C 3 >85, the lifting mode is executed; if the third confidence score C 3 ≤85, manual intervention is requested;

当第一置信度评分C1≤75且第二置信度评分C2>85时,控制执行吊装模式;When the first confidence score C 1 ≤75 and the second confidence score C 2 >85, the control executes the hoisting mode;

当第一置信度评分C1≤75且第二置信度评分C2≤85时,若上述第三置信度评分C3>85则控制执行托举模式,若第三置信度评分C3≤85则请求人工介入。When the first confidence score C 1 ≤75 and the second confidence score C 2 ≤85, if the third confidence score C 3 >85, the lifting mode is executed; if the third confidence score C 3 ≤85, manual intervention is requested.

本发明另一方面提供一种基于无人道路值守巡逻清障车的控制方法,应用于如上任意一项提出的控制系统,包括如下步骤:Another aspect of the present invention provides a control method based on an unmanned road patrol tow truck, which is applied to the control system proposed in any one of the above items, comprising the following steps:

S10:控制上述图像传感器采集故障车辆车头与车尾的二维图像数据,通过第一流网络提取上述二维图像数据的图像特征向量Fimg,并生成第一置信度评分C1S10: Control the image sensor to collect two-dimensional image data of the front and rear of the faulty vehicle, extract an image feature vector F img of the two-dimensional image data through a first flow network, and generate a first confidence score C 1 ;

S20:控制上述激光传感器获得故障车辆车身两侧和车顶区域的三维点云数据,利用第二流网络提取上述三维点云数据的点云特征向量FpcS20: controlling the laser sensor to obtain three-dimensional point cloud data of both sides and the roof of the faulty vehicle, and extracting a point cloud feature vector F pc of the three-dimensional point cloud data using a second stream network;

S21:将上述二维图像数据与三维点云数据通过自注意力机制进行叠加融合,并生成第二置信度评分C2S21: superimposing and fusing the above-mentioned two-dimensional image data and three-dimensional point cloud data through a self-attention mechanism, and generating a second confidence score C 2 ;

S30:根据评分结果选择相应的清障模式:S30: Select the corresponding obstacle removal mode according to the scoring results:

当第一置信度评分C1>75,且第二置信度评分C2>85,时,选择牵引模式,通过连接故障车辆的拖车钩进行清障作业;When the first confidence score C 1 >75 and the second confidence score C 2 >85, the towing mode is selected to perform the towing operation by connecting the tow hook of the faulty vehicle;

当第一置信度评分C1≤75,且第二置信度评分C2>85,时,选择吊装模式,通过连接故障车辆的四轮,将故障车辆吊起,进行清障作业;When the first confidence score C 1 ≤ 75 and the second confidence score C 2 > 85, the hoisting mode is selected to lift the faulty vehicle by connecting the four wheels and perform the towing operation;

S40:若第二置信度评分C2≤85,控制上述深度传感器获取故障车辆车轮和轮轴区域的三维深度数据,利用第二流网络提取上述三维深度数据的深度特征向量FdepS40: If the second confidence score C 2 ≤85, control the depth sensor to acquire three-dimensional depth data of the wheel and axle area of the faulty vehicle, and use the second flow network to extract a depth feature vector F dep of the three-dimensional depth data;

S41:将上述三维点云数据以及三维深度数据通过自注意力机制进行更新融合,并生成第三置信度评分C3S41: updating and fusing the above 3D point cloud data and 3D depth data through a self-attention mechanism, and generating a third confidence score C 3 ;

S50:根据评分结果选择相应的清障模式:S50: Select the corresponding obstacle removal mode based on the scoring results:

当第三置信度评分C3>85,时,选择托举模式,将故障车辆一端抬起并固定,拖拽故障车辆进行清障作业;When the third confidence score C 3 > 85, the lifting mode is selected to lift and fix one end of the faulty vehicle and tow the faulty vehicle for towing.

当第三置信度评分C3≤85,时,请求人工介入;When the third confidence score C 3 ≤85, manual intervention is requested;

S60:实时记录每次任务的评分结果和实际清障作业反馈,通过在线增量学习和离线批量学习的方式优化模型权重系数,以提高后续检测任务的准确性。S60: Records the scoring results of each task and actual clearance feedback in real time, and optimizes the model weight coefficients through online incremental learning and offline batch learning to improve the accuracy of subsequent detection tasks.

与现有技术相比,本发明的有益效果是:Compared with the prior art, the present invention has the following beneficial effects:

其一,本发明的技术方案通过采用图像传感器、激光传感器和深度传感器组成的多模态感知模块,对故障车辆进行全面的感知和数据采集,以二维图像、三维点云以及三维深度数据共同构建车辆损伤的多维表征。多模态的数据获取方式使系统能够克服单一传感器识别信息片面的问题,具体而言,通过图像传感器获取车身前后损伤情况和车辆标识信息,通过第一置信度评分C1判断是否满足牵引条件,进一步通过激光传感器获得车身除了车头与车尾的三维点云数据,与二维图像数据叠加形成表征全车身损伤状态的第二置信度评分C2,在不满足牵引条件的情况下,若第二置信度评分C2评分高于预设值,即车身受损轻微,则通过吊装机构连接故障车辆四轮,将车辆吊起清障,进一步的,当第二置信度评分C2低于预设值,表示车身受损严重,此时若采用吊装可能因车身不平衡而倾覆,因此进一步通过深度传感器扫描车轮与轮轴区域,输出第三置信度C3判断车轮是否能够转动,若能则采用托举模式清障,即将故障车辆一端抬起并固定,拖行搬运。上述技术方案有效提升了对故障车辆损伤情况评估的准确性与全面性,通过判断车辆受损情况,选择合适的清障方式,在显著降低误判概率的同时,也有效避免了因错误决策而引发的二次损坏风险,提升交通管理的整体运行效率和安全性,具有明显的现实应用优势。First, the technical solution of the present invention uses a multimodal perception module composed of image sensors, laser sensors and depth sensors to comprehensively perceive and collect data on faulty vehicles, and jointly constructs a multidimensional representation of vehicle damage with two-dimensional images, three-dimensional point clouds and three-dimensional depth data. The multimodal data acquisition method enables the system to overcome the problem of one-sided information recognition by a single sensor. Specifically, the image sensor obtains the front and rear damage of the vehicle body and vehicle identification information. The first confidence score C1 is used to determine whether the towing conditions are met. The laser sensor further obtains three-dimensional point cloud data of the vehicle body, excluding the front and rear ends. This is superimposed with the two-dimensional image data to form a second confidence score C2 , which represents the damage status of the entire vehicle body. If the towing conditions are not met, if the second confidence score C2 is higher than a preset value, indicating that the vehicle body damage is minor, the four wheels of the faulty vehicle are connected through a lifting mechanism and the vehicle is lifted for towing. Furthermore, if the second confidence score C2 is lower than the preset value, indicating that the vehicle body damage is severe, lifting the vehicle may overturn due to imbalance. Therefore, the depth sensor scans the wheel and axle area and outputs a third confidence score C3 to determine whether the wheel can rotate. If it can, the towing mode is adopted for towing. The above technical solution effectively improves the accuracy and comprehensiveness of the assessment of damage to faulty vehicles. By judging the damage to the vehicle and selecting the appropriate clearance method, it significantly reduces the probability of misjudgment while effectively avoiding the risk of secondary damage caused by wrong decisions, thereby improving the overall operational efficiency and safety of traffic management, and has obvious practical application advantages.

其二,本发明的技术方案包括基于双流架构的分层检测模块,通过深度神经网络分别处理多种模态数据,实现了高效的特征提取与特征融合过程。当第二置信度评分C2不足以提供明确判断时,系统会更新融合三维深度数据,将表征车轮和轮轴的三维深度数据替换到原三维点云数据,生成更精细的第三置信度评分C3,以强化车轮区域的损伤识别精度。通过逐级细化的评估方式有效避免了因损伤评估不充分而导致的作业方式误判,面对损伤较小的情况不调用深度传感器,节省系统资源,大幅提高了系统对复杂场景的适应能力,提升了决策的准确性及清障作业选择的可靠性。Secondly, the technical solution of the present invention includes a hierarchical detection module based on a dual-stream architecture, which processes multiple modal data separately through a deep neural network to achieve an efficient feature extraction and feature fusion process. When the second confidence score C2 is not enough to provide a clear judgment, the system will update the fused three-dimensional depth data, replace the three-dimensional depth data representing the wheels and axles with the original three-dimensional point cloud data, and generate a more refined third confidence score C3 to enhance the damage identification accuracy of the wheel area. The step-by-step refinement of the evaluation method effectively avoids the misjudgment of the operation mode due to insufficient damage assessment. In the case of minor damage, the depth sensor is not called, which saves system resources, greatly improves the system's adaptability to complex scenarios, and improves the accuracy of decision-making and the reliability of obstacle clearance operation selection.

其三,本发明的技术方案还包括自适应学习模块,实现权重系数的动态优化。人工介入后,将处理结果录入系统,生成标准特征向量,通过离线学习修正权重系数,另一方面,当近5次评分结果与人工核验偏差过大,则通过增量学习的方式修正权重系数。通过自适应学习模块有效增强了系统的鲁棒性与环境适应能力,保证了无人清障车控制系统在实际复杂路况下长期运行中的高精度与稳定性。Third, the technical solution of the present invention also includes an adaptive learning module to achieve dynamic optimization of the weight coefficient. After manual intervention, the processing results are entered into the system, a standard feature vector is generated, and the weight coefficient is corrected through offline learning. On the other hand, when the deviation between the last five scoring results and manual verification is too large, the weight coefficient is corrected through incremental learning. The adaptive learning module effectively enhances the robustness and environmental adaptability of the system, ensuring the high precision and stability of the unmanned tow truck control system in long-term operation under actual complex road conditions.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本发明实施方式的技术方案,下面将对实施方式中所需要使用的附图作简单的介绍,应当理解,以下附图仅示出了本发明的某些实施例,因此不应被看作是对范围的限定,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他相关的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the following is a brief introduction to the drawings required for use in the embodiments. It should be understood that the following drawings only illustrate certain embodiments of the present invention and therefore should not be regarded as limiting the scope. For ordinary technicians in this field, other relevant drawings can be obtained based on these drawings without paying any creative work.

图1为本发明系统的框架结构图;FIG1 is a schematic diagram of the framework of the system of the present invention;

图2为本发明方法的步骤流程图;FIG2 is a flow chart of the steps of the method of the present invention;

图3为本发明中清障模式选择规则的流程图。FIG3 is a flow chart of the obstacle removal mode selection rule in the present invention.

具体实施方式DETAILED DESCRIPTION

为了便于本领域技术人员理解,现将实施例结合附图对本发明的结构作进一步详细描述:In order to facilitate understanding by those skilled in the art, the structure of the present invention is further described in detail with reference to the embodiments and the accompanying drawings:

在本发明的描述中,术语“第一”“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。术语“部”、“侧”、“端”等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。In the description of the present invention, the terms "first" and "second" are used for descriptive purposes only and should not be construed as indicating or implying relative importance or implicitly specifying the quantity of the technical features indicated. Terms such as "portion," "side," and "end" indicate positions or relationships based on those shown in the accompanying drawings and are intended solely to facilitate and simplify the description of the present invention. They do not indicate or imply that the devices or components indicated must have, be constructed, or operate in a specific orientation. Therefore, they should not be construed as limitations on the present invention.

如图1所示,本申请提供一种基于无人道路值守巡逻清障车的控制系统,包括多模态感知模块、分层检测模块以及控制处理单元。As shown in FIG1 , the present application provides a control system based on an unmanned road patrol tow truck, comprising a multimodal perception module, a hierarchical detection module, and a control processing unit.

如图1所示,多模态感知模块至少包括图像传感器、激光传感器以及深度传感器,其中,图像传感器用于获取故障车辆的二维图像数据,并通过OCR提取车身标识信息,具体为高分辨率摄像头等光学拍摄设备,能够清晰捕捉故障车辆的车牌号码、车型信息以及车身损伤表面细节。激光传感器用于生成故障车辆外观的三维点云数据,具体为3D激光雷达等设备,通过主动发射激光脉冲并测量反射时间以获取目标物体表面的三维点云数据。激光雷达具有较高的距离和角度测量精度,同时能够不受环境光照变化影响,可高效稳定地获取故障车辆车身外形和损伤部位的三维结构数据。深度传感器具体为结构光扫描仪或TOF相机等深度成像设备,用于生成故障车辆局部外观的三维深度数据,通过光线变形计算目标的深度信息,适合高精度获取车辆底盘区域、车轮和轮轴区域损伤的细节信息,便于进一步判断车辆移动能力,避免因漏检车底损伤而导致的清障作业失败。上述图像传感器、激光传感器以及深度传感器通过清障车车身移动以及车身机械臂控制,扫描故障车辆的车头、车尾、车身左右两侧、车顶等区域,且通过时空标定对齐数据,数据表征误差≤2cm,以便后续数据融合准确。As shown in Figure 1, the multimodal perception module includes at least an image sensor, a laser sensor, and a depth sensor. The image sensor, specifically an optical imaging device such as a high-resolution camera, is used to capture 2D image data of the faulty vehicle and extract vehicle identification information through optical character recognition (OCR). It can clearly capture the faulty vehicle's license plate number, vehicle model, and surface damage details. The laser sensor, specifically a device such as a 3D lidar, generates 3D point cloud data of the faulty vehicle's exterior. This device actively emits laser pulses and measures reflection time to obtain 3D point cloud data of the target object's surface. LiDAR offers high distance and angle measurement accuracy and is unaffected by ambient lighting variations. It can efficiently and stably acquire 3D structural data of the faulty vehicle's exterior and damaged areas. The depth sensor, specifically a depth imaging device such as a structured light scanner or time-of-flight camera, generates 3D depth data of the faulty vehicle's exterior. By calculating target depth information through light deformation, it is suitable for accurately capturing detailed damage information on the vehicle's chassis, wheels, and axles. This facilitates further assessment of the vehicle's mobility and avoids unsuccessful clearance operations due to missed underbody damage. The above-mentioned image sensors, laser sensors and depth sensors are controlled by the movement of the tow truck body and the body robotic arm to scan the front, rear, left and right sides, roof and other areas of the faulty vehicle, and align the data through spatiotemporal calibration. The data representation error is ≤2cm, so that subsequent data fusion is accurate.

进一步的,如图1所示,分层检测模块包括双流架构的深度神经网络计算模型,具体而言:第一流网络采用卷积神经网络,如ResNet模型或DenseNet模型,通过多层卷积操作对输入的二维图像数据逐步提取图像特征向量Fimg,形成第一置信度评分C1。以确定故障车辆车头、车尾损伤程度,也用于判断故障车辆是否满足牵引条件。Furthermore, as shown in Figure 1, the layered detection module includes a deep neural network computing model with a two-stream architecture. Specifically, the first-stream network uses a convolutional neural network, such as a ResNet or DenseNet model. Through multi-layer convolution operations, it gradually extracts image feature vectors F img from the input two-dimensional image data to form a first confidence score C 1 . This score is used to determine the extent of damage to the front and rear of the faulty vehicle and whether the faulty vehicle meets the towing conditions.

第二流网络利用基于PointNet或PointNet++等点云处理神经网络模型,将激光传感器获取的三维点云数据与深度传感器获取的三维深度数据进行融合和特征提取,通过神经网络内部的MLP结构对每个点云数据点进行特征映射、聚类、降维,得到三维点云数据的点云特征向量Fpc和三维深度数据的深度特征向量Fdep。叠加融合二维图像数据和三维点云数据,输出第二置信度评分C2。在一种实施例中,二维图像数据表征车身前后的损伤信息,三维点云数据表征车身两侧和车顶的损伤信息,二者叠加融合则表征车身外观的损伤信息。若第二置信度评分C2≤85,则更新融合三维点云数据以及三维深度数据,输出第三置信度评分C3,用于进一步判断故障车辆损伤程度及适用的清障方式,具体而言,三维深度数据表征车轮及轮轴的局部信息,与三维点云数据的部分表征信息重叠,因此将三维深度数据更新至三维点云数据中,即二者更新融合形成新的三维数据,用于进一步判断故障车辆车轮的损伤程度。The second-stream network utilizes a point cloud processing neural network model based on PointNet or PointNet++ to fuse and extract features from the 3D point cloud data acquired by the laser sensor and the 3D depth data acquired by the depth sensor. The neural network's internal Multi-Layered Propagation (MLP) structure performs feature mapping, clustering, and dimensionality reduction on each point cloud data point, generating a point cloud feature vector F pc for the 3D point cloud data and a depth feature vector F dep for the 3D depth data. The 2D image data and 3D point cloud data are then superimposed and fused to output a second confidence score C 2 . In one embodiment, the 2D image data represents damage information on the front and rear of the vehicle body, while the 3D point cloud data represents damage information on the sides and roof of the vehicle body. The superposition and fusion of the two represents damage information on the exterior of the vehicle body. If the second confidence score C 2 ≤ 85, the 3D point cloud data and 3D depth data are updated and fused to output a third confidence score C 3 , which is used to further determine the damage extent of the faulty vehicle and the applicable clearance method. Specifically, the 3D depth data represents local information about the wheels and axles, which overlaps with some of the representational information of the 3D point cloud data. Therefore, the 3D depth data is updated and incorporated into the 3D point cloud data. That is, the two are updated and fused to form new 3D data, which is used to further determine the damage extent of the faulty vehicle's wheels.

进一步的,分层检测模块还包括特征融合单元,特征融合单元采用基于Transformer架构的自注意力机制,该机制在融合多个数据模态的特征向量时,能根据每个模态特征的重要性自动生成注意力权重系数,以突出关键损伤信息。通过特征融合过程,能够进一步减少单一模态信息缺失或误判带来的负面影响,提升整体车辆损伤评估的准确性与鲁棒性。具体而言,采用自注意力机制加权融合图像特征向量Fimg、点云特征向量Fpc以及深度特征向量Fdep,生成融合特征向量FfusionFurthermore, the hierarchical detection module includes a feature fusion unit. This unit utilizes a self-attention mechanism based on the Transformer architecture. When fusing feature vectors from multiple data modalities, this mechanism automatically generates attention weights based on the importance of each modal feature to highlight key damage information. This feature fusion process further mitigates the negative impact of missing or misjudged information from a single modality, improving the accuracy and robustness of overall vehicle damage assessment. Specifically, the self-attention mechanism is used to weightedly fuse the image feature vector F img , the point cloud feature vector F pc , and the depth feature vector F dep , generating the fused feature vector F fusion .

如图1所示,控制处理单元,与分层检测模块相连,并根据预设阈值规则选择清障模式,当第三置信度评分C3≤85,请求人工介入。As shown in FIG1 , the control processing unit is connected to the hierarchical detection module and selects the obstacle removal mode according to a preset threshold rule. When the third confidence score C 3 ≤85, manual intervention is requested.

如图3所示,控制处理单元选择清障模式的预设规则具体为:As shown in FIG3 , the preset rules for the control processing unit to select the obstacle removal mode are specifically:

当第一置信度评分C1>75且第二置信度评分C2>85时,控制执行牵引模式,以不超过20km/h速度牵引故障车辆。When the first confidence score C 1 >75 and the second confidence score C 2 >85, the control executes the towing mode to tow the faulty vehicle at a speed not exceeding 20 km/h.

该规则中,当第一置信度评分C1>75时判断车头与车尾受损较轻,满足安装拖车钩的条件,因此进一步判断车身受损情况,若第二置信度评分C2>85,则判断可以执行牵引模式。In this rule, when the first confidence score C 1 > 75, it is determined that the damage to the front and rear of the vehicle is minor, meeting the conditions for installing a tow hook. Therefore, the damage to the vehicle body is further determined. If the second confidence score C 2 > 85, it is determined that the towing mode can be executed.

当第一置信度评分C1≤75且第二置信度评分C2>85时,控制执行吊装模式。When the first confidence score C 1 ≤75 and the second confidence score C 2 >85, the control executes the hoisting mode.

在该规则中,当第一置信度评分C1≤75,表示车头与车尾受损较为严重,不满足拖车钩的安装条件,因此进一步判断车身受损情况,若第二置信度评分C2>85,则表示车身受损较轻,执行吊装模式,通过吊装机构固定于车轮区域,将故障车辆吊起并承载于清障车的承载部进行搬运。In this rule, when the first confidence score C 1 ≤ 75, it indicates that the front and rear of the vehicle are severely damaged and do not meet the installation conditions for a tow hook. Therefore, the damage to the vehicle body is further determined. If the second confidence score C 2 > 85, it indicates that the damage to the vehicle body is relatively minor. The lifting mode is executed. The faulty vehicle is fixed to the wheel area through the lifting mechanism, lifted, and placed on the load-bearing part of the tow truck for transportation.

当第一置信度评分C1>75且第二置信度评分C2≤85时,若第三置信度评分C3>85则控制执行托举模式,若第三置信度评分C3≤85则请求人工介入。When the first confidence score C 1 >75 and the second confidence score C 2 ≤85, if the third confidence score C 3 >85, the control executes the lifting mode; if the third confidence score C 3 ≤85, the control requests manual intervention.

当第一置信度评分C1≤75且第二置信度评分C2≤85时,若第三置信度评分C3>85则控制执行托举模式,若第三置信度评分C3≤85则请求人工介入。When the first confidence score C 1 ≤75 and the second confidence score C 2 ≤85, if the third confidence score C 3 >85, the control executes the lifting mode; if the third confidence score C 3 ≤85, the control requests manual intervention.

在该规则中,当第二置信度评分C2≤85则表示车身受损较为严重,损伤部位可能是车身,也可能是车轮,若车身受损严重,吊装时可能因车身不平衡导致故障车辆倾覆,造成二次损伤,若车轮受损严重,牵引时可能因车轮无法转向或方向偏移造成不可预料的风险,因此需要进一步引入第三置信度评分C3进行判断,若第三置信度评分C3>85,则表示车轮损伤较轻(此时无需考虑第一置信度评分C1与第二置信度评分C2),可以采用托举的方式进行清障,即将故障车辆一端抬起,另一端依靠车轮着地支撑,清障车拖拽故障车进行清障。若第三置信度评分C3≤85则表示车轮区域损伤较重,请求人工介入,采用更为复杂清障模式,如吊装车辆底盘、人工固定故障车辆搬运等方式。In this rule, if the second confidence score C 2 ≤ 85, it indicates significant vehicle damage. The damage could be to the vehicle or the wheels. If the vehicle is severely damaged, the imbalance of the vehicle could cause the vehicle to overturn during lifting, resulting in secondary damage. If the wheels are severely damaged, towing could result in unpredictable risks due to wheel loss of steering or directional deviation. Therefore, the third confidence score C 3 is required for further analysis. If the third confidence score C 3 > 85, the wheel damage is minor (the first and second confidence scores C 1 and C 2 are not considered in this case). Lifting is recommended for towing purposes: one end of the vehicle is lifted, the other end supported by the wheels, and the tow truck tows the vehicle. If the third confidence score C 3 ≤ 85, the wheel area is severely damaged, requiring manual intervention and a more complex towing method, such as lifting the vehicle chassis or manually securing the vehicle for transportation.

在一种实施例中,清障车通过图像传感器识别的车辆品牌、车辆型号、车辆识别码等信息,将故障车辆搬运至预设路径上的维修站点或临时存放点,实现自动化清障任务。In one embodiment, a tow truck uses information such as the vehicle brand, vehicle model, and vehicle identification code identified by an image sensor to move the faulty vehicle to a maintenance site or temporary storage point on a preset route, thereby completing an automated tow removal task.

本发明提出一种基于多模态感知与分层置信度评估的无人清障车控制系统,通过多传感器协同、双流深度神经网络特征提取及自注意力动态融合机制,实现高精度车辆状态分析与智能化清障决策,实现了对道路故障车辆损伤情况的智能精准判断,能有效降低因损伤评估失误而引发的二次损坏风险,减少人工介入的频率,具有自动化程度高、稳定性强、环境适应能力强等优点,能够显著提高道路清障作业的效率、安全性与智能化水平,具备广泛的实际应用价值和良好的市场前景。The present invention proposes an unmanned tow truck control system based on multimodal perception and hierarchical confidence assessment. Through multi-sensor collaboration, dual-stream deep neural network feature extraction and self-attention dynamic fusion mechanism, high-precision vehicle status analysis and intelligent tow clearance decision-making are achieved, and intelligent and accurate judgment of the damage status of vehicles with road faults is realized. It can effectively reduce the risk of secondary damage caused by damage assessment errors and reduce the frequency of manual intervention. It has the advantages of high degree of automation, strong stability, and strong environmental adaptability. It can significantly improve the efficiency, safety and intelligence level of road towing operations, and has broad practical application value and good market prospects.

以下通过一具体实施例说明分层检测模块和自适应学习模块的工作原理:The working principles of the hierarchical detection module and the adaptive learning module are described below through a specific embodiment:

图像传感器扫描故障车辆的车头与车尾,第一置信度评分C1的计算公式为:The image sensor scans the front and rear of the faulty vehicle, and the calculation formula for the first confidence score C1 is:

;

其中,Fimg表示第一流卷积神经网络输出的图像特征向量,W1为权重矩阵,与Fimg为同维向量,用于将Fimg映射到评分空间,b1为偏置项,初始值为0;σ(·)为Sigmoid激活函数,其原始线性输出映射区间为[0,1],将其放大100倍,使第一置信度评分C1映射至0~100的范围内。Where F img represents the image feature vector output by the first-stream convolutional neural network, W 1 is the weight matrix, which is a vector of the same dimension as F img and is used to map F img to the score space, b 1 is the bias term with an initial value of 0, and σ (·) is the Sigmoid activation function, whose original linear output mapping interval is [0, 1]. It is amplified by 100 times to map the first confidence score C 1 to the range of 0 to 100.

图像特征向量Fimg的计算式为:The calculation formula of image feature vector F img is:

;

其中,Qimg,Kimg,Vimg为自注意力机制中的查询矩阵Q、键矩阵K以及值矩阵V,由二维图像数据生成。Among them, Q img , K img , and V img are the query matrix Q, key matrix K, and value matrix V in the self-attention mechanism, which are generated from two-dimensional image data.

激光传感器扫描故障车辆的车身两侧及车顶,第二置信度评分C2的计算公式为:The laser sensor scans both sides and the roof of the faulty vehicle. The second confidence score C2 is calculated as follows:

;

其中,W2为权重矩阵,b2为偏置项;σ(·)为Sigmoid激活函数,放大100倍,使第二置信度评分C2映射至0~100的范围内。Among them, W 2 is the weight matrix, b 2 is the bias term; σ (·) is the Sigmoid activation function, which is amplified 100 times to map the second confidence score C 2 to the range of 0~100.

融合特征向量Ffusion(img,pc)由二维图像数据与三维点云数据经过自注意力机制融合生成,具体公式为:The fusion feature vector F fusion (img, pc) is generated by fusing the two-dimensional image data and the three-dimensional point cloud data through the self-attention mechanism. The specific formula is:

;

其中,Qimg,Kimg,Vimg为自注意力机制中的查询矩阵Q、键矩阵K以及值矩阵V,由二维图像数据生成;Qpc,Kpc,Vpc为由三维点云数据生成的查询矩阵Q、键矩阵K以及值矩阵V。Among them, Q img , K img , V img are the query matrix Q, key matrix K, and value matrix V in the self-attention mechanism, which are generated from two-dimensional image data; Q pc , K pc , V pc are the query matrix Q, key matrix K, and value matrix V generated from three-dimensional point cloud data.

权重系数γ 1γ 2满足γ 1+γ 2=1,且当第一置信度评分C1>75时,即故障车辆的车头与车尾受损较轻时,降低权重系数γ 1,提高权重系数γ 2。在一种优选实施例中,γ 1的初始值为0.5,γ 2的初始值为0.5,若C1>75,则γ 1变化为0.3,γ 2变化为0.7。The weight coefficients γ1 and γ2 satisfy γ1 + γ2 = 1. When the first confidence score C1 > 75 , i.e., when the front and rear of the faulty vehicle are less damaged , the weight coefficient γ1 is reduced and the weight coefficient γ2 is increased . In a preferred embodiment, the initial values of γ1 and γ2 are 0.5 and 0.5, respectively. If C1 > 75, γ1 is changed to 0.3 and γ2 is changed to 0.7.

深度传感器扫描故障车辆的车轮和轮轴区域,第三置信度评分C3的计算公式为:The depth sensor scans the wheel and axle area of the faulty vehicle, and the third confidence score C3 is calculated as follows:

;

其中,W3为权重矩阵,b3为偏置项;σ(·)为Sigmoid激活函数,放大100倍,使第三置信度评分C3映射至0~100的范围内。Where W 3 is the weight matrix, b 3 is the bias term, and σ(·) is the Sigmoid activation function, which is amplified by 100 times to map the third confidence score C 3 to the range of 0~100.

融合特征向量Ffusion(pc,dep)由二维图像数据与三维点云数据经过自注意力机制融合生成,具体公式为:The fusion feature vector F fusion (pc, dep) is generated by fusing the two-dimensional image data and the three-dimensional point cloud data through the self-attention mechanism. The specific formula is:

;

其中,M为掩码矩阵,通过掩码矩阵M将三维点云数据与三维深度数据的重叠部分更新为三维深度数据。在该实施例中,重叠部分为车轮与轮轴的侧表面,在另一种实施例中,深度传感器还用于探测车辆底盘,则车辆底盘的三维深度数据不作为三维点云数据的更新内容。Where M is a mask matrix, and the overlapping portion of the 3D point cloud data and the 3D depth data is updated to 3D depth data using the mask matrix M. In this embodiment, the overlapping portion is the side surface of the wheel and the axle. In another embodiment, the depth sensor is also used to detect the vehicle chassis, in which case the 3D depth data of the vehicle chassis is not included in the update of the 3D point cloud data.

权重系数γ 2´和γ 3满足γ 2´+γ 3=1,且权重系数γ 3γ 2´,作为优选,γ 2´=0.3,γ 3=0.7。The weight coefficients γ 2 ´ and γ 3 satisfy γ 2 ´ + γ 3 = 1, and the weight coefficient γ 3 > γ 2 ´. Preferably, γ 2 ´ = 0.3, γ 3 = 0.7.

本系统还包括自适应学习模块,与控制处理单元相连,通过离线学习和增量学习优化特征融合单元中的权重系数。The system also includes an adaptive learning module connected to the control processing unit, which optimizes the weight coefficients in the feature fusion unit through offline learning and incremental learning.

权重系数的离线学习采用最小化均方误差的损失函数:The offline learning of weight coefficients adopts the loss function of minimizing the mean square error:

;

其中,Ftarget,i为人工介入后标注的车辆状态的标准特征向量,可以是历史平均数据转化成的标准特征向量,也可以是专业人员人为标定的标准特征向量。通过N组数据,前向计算融合特征向量Ffusion(·),更新上述权重矩阵、权重参数等可学习变量。Here, F target,i is the standard feature vector of the vehicle state annotated after manual intervention. This can be a standard feature vector converted from historical average data or manually calibrated by professionals. Using N sets of data, the fused feature vector F fusion (·) is forward-calculated to update the aforementioned learnable variables, including the weight matrix and weight parameters.

权重系数的增量学习方法为:The incremental learning method of weight coefficients is:

当近期系统检测误差出现持续增加趋势时,采用以下规则更新权重系数:When the recent system detection error shows a continuous increasing trend, the weight coefficient is updated using the following rules:

;

其中,Accuracy为最近5次任务中,系统与测评分与人工核查评分之间误差在±5以内的任务占比;Accuracy is the percentage of tasks in the last five tasks where the error between the system and the test score and the manual verification score is within ±5;

Δ为学习速率,初始限定于0.01~0.05之间;Δ is the learning rate, initially limited to 0.01~0.05;

当Accuracy≥95%时,停止权重系数更新,以保持系统的参数稳定性。When Accuracy ≥ 95%, the weight coefficient update is stopped to maintain the parameter stability of the system.

在一些实施例中,学习速率可随Accuracy动态调整,例如:In some embodiments, the learning rate can be dynamically adjusted with the accuracy, for example:

若Accuracy=93%,Δ降低至0.01,稳定系统;If Accuracy = 93%, Δ is reduced to 0.01, stabilizing the system;

若Accuracy=70%,Δ提高至0.05,加速收敛。If Accuracy = 70%, Δ is increased to 0.05 to accelerate convergence.

如图2所示,本发明另一方面提供一种基于无人道路值守巡逻清障车的控制方法,应用于上述的控制系统,包括如下步骤:As shown in FIG2 , another aspect of the present invention provides a control method based on an unmanned road patrol tow truck, which is applied to the above-mentioned control system and includes the following steps:

S10:控制图像传感器采集故障车辆车头与车尾的二维图像数据,通过第一流网络提取二维图像数据的图像特征向量Fimg,并生成第一置信度评分C1S10: Controlling the image sensor to collect two-dimensional image data of the front and rear of the faulty vehicle, extracting an image feature vector F img of the two-dimensional image data through a first-stream network, and generating a first confidence score C 1 ;

S20:控制激光传感器获得故障车辆车身两侧和车顶区域的三维点云数据,利用第二流网络提取三维点云数据的点云特征向量FpcS20: Controlling the laser sensor to obtain three-dimensional point cloud data of both sides of the body and the roof area of the faulty vehicle, and using the second stream network to extract the point cloud feature vector F pc of the three-dimensional point cloud data;

S21:将二维图像数据与三维点云数据通过自注意力机制进行叠加融合,并生成第二置信度评分C2S21: superimpose and fuse the two-dimensional image data and the three-dimensional point cloud data through the self-attention mechanism, and generate a second confidence score C 2 ;

S30:根据评分结果选择相应的清障模式:S30: Select the corresponding obstacle removal mode according to the scoring results:

当第一置信度评分C1>75,且第二置信度评分C2>85,时,选择牵引模式,通过连接故障车辆的拖车钩进行清障作业;When the first confidence score C 1 >75 and the second confidence score C 2 >85, the towing mode is selected to perform the towing operation by connecting the tow hook of the faulty vehicle;

当第一置信度评分C1≤75,且第二置信度评分C2>85,时,选择吊装模式,通过连接故障车辆的四轮,将故障车辆吊起,进行清障作业;When the first confidence score C 1 ≤ 75 and the second confidence score C 2 > 85, the hoisting mode is selected to lift the faulty vehicle by connecting the four wheels and perform the towing operation;

S40:若第二置信度评分C2≤85,控制深度传感器获取故障车辆车轮和轮轴区域的三维深度数据,利用第二流网络提取三维深度数据的深度特征向量FdepS40: If the second confidence score C 2 ≤85, control the depth sensor to acquire three-dimensional depth data of the wheel and axle area of the faulty vehicle, and use the second flow network to extract a depth feature vector F dep of the three-dimensional depth data;

S41:将三维点云数据以及三维深度数据通过自注意力机制进行更新融合,并生成第三置信度评分C3S41: updating and fusing the 3D point cloud data and the 3D depth data through a self-attention mechanism, and generating a third confidence score C 3 ;

S50:根据评分结果选择相应的清障模式:S50: Select the corresponding obstacle removal mode based on the scoring results:

当第三置信度评分C3>85,时,选择托举模式,将故障车辆一端抬起并固定,拖拽故障车辆进行清障作业;When the third confidence score C 3 > 85, the lifting mode is selected to lift and fix one end of the faulty vehicle and tow the faulty vehicle for towing.

当第三置信度评分C3≤85,时,请求人工介入;When the third confidence score C 3 ≤85, manual intervention is requested;

S60:实时记录每次任务的评分结果和实际清障作业反馈,通过在线增量学习和离线批量学习的方式优化模型权重系数,以提高后续检测任务的准确性。S60: Records the scoring results of each task and actual clearance feedback in real time, and optimizes the model weight coefficients through online incremental learning and offline batch learning to improve the accuracy of subsequent detection tasks.

以上所述仅为本发明的优选实施方式而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The foregoing description is merely a preferred embodiment of the present invention and is not intended to limit the present invention. Those skilled in the art will readily appreciate that various modifications and variations are possible. Any modifications, equivalent substitutions, or improvements made within the spirit and principles of the present invention are intended to be within the scope of protection of the present invention.

Claims (9)

1. Control system based on unmanned road on duty patrol clearance car, characterized in that includes:
the multimode sensing module at least comprises:
the image sensor is used for acquiring two-dimensional image data of the fault vehicle and extracting vehicle body identification information through OCR;
A laser sensor for generating three-dimensional point cloud data of the appearance of the faulty vehicle;
a depth sensor for generating three-dimensional depth data of a local appearance of the faulty vehicle;
the layering detection module comprises a deep neural network calculation model with a double-flow architecture:
the first flow network is used for extracting an image feature vector F img of the two-dimensional image data by adopting a convolutional neural network and outputting a first confidence score C 1;
The second flow network adopts a point cloud processing algorithm to extract a point cloud characteristic vector F pc of the three-dimensional point cloud data and a depth characteristic vector F dep of the three-dimensional depth data, superimposes and fuses the two-dimensional image data and the three-dimensional point cloud data to output a second confidence score C 2, and updates and fuses the three-dimensional point cloud data and the three-dimensional depth data to output a third confidence score C 3 if the second confidence score C 2 is less than or equal to 85;
The feature fusion unit adopts a self-attention mechanism to weight and fuse the image feature vector F img, the point cloud feature vector F pc and the depth feature vector F dep to generate a fusion feature vector F fusion;
And the control processing unit is connected with the layering detection module, selects an obstacle clearance mode according to a preset threshold rule, and requests manual intervention when the third confidence score C 3 is less than or equal to 85.
2. The unmanned road patrol wrecker-based control system of claim 1, wherein the image sensor scans the head and tail of a faulty vehicle, and the first confidence score C 1 is calculated as:
;
Wherein F img represents an image feature vector output by the first stream convolutional neural network;
W 1 is a weight matrix, and b 1 is a bias term;
Sigma (·) is a Sigmoid activation function, mapping the first confidence score C 1 to a range of 0-100.
3. The unmanned road guard patrol wrecker-based control system of claim 2, wherein the laser sensor scans both sides and roof of the body of the faulty vehicle, and the second confidence score C 2 is calculated as:
;
wherein W 2 is a weight matrix, and b 2 is a bias term;
Sigma (·) is a Sigmoid activation function, so that the second confidence score C 2 is mapped to a range of 0-100;
The fusion feature vector F fusion(img,pc) is generated by fusion of the two-dimensional image data and the three-dimensional point cloud data through a self-attention mechanism, and the specific formula is as follows:
;
The weight coefficients gamma 1 and gamma 2 meet the requirement that gamma 12 =1, and when the first confidence score C 1 is greater than 75, namely the head and tail of the fault vehicle are damaged slightly, the weight coefficient gamma 1 is reduced, and the weight coefficient gamma 2 is improved.
4. A control system based on an unmanned road duty patrol wrecker as claimed in claim 3, wherein the depth sensor scans the wheel and axle area of the faulty vehicle, and the third confidence score C 3 is calculated as:
;
Wherein W 3 is a weight matrix, and b 3 is a bias term;
Sigma (·) is a Sigmoid activation function, so that the third confidence score C 3 is mapped to a range of 0-100;
The fusion feature vector F fusion(pc,dep) is generated by fusion of the two-dimensional image data and the three-dimensional point cloud data through a self-attention mechanism, and the specific formula is as follows:
;
Wherein M is a mask matrix, the overlapped part of the three-dimensional point cloud data and the three-dimensional depth data is updated into the three-dimensional depth data through the mask matrix M, as the laser sensor scans the two sides and the top of the vehicle body, the depth sensor scans the wheels and the wheel shafts, and certain overlapped areas exist in the two scanning areas, the mask matrix M with the same dimension as the data feature vector dimension is introduced, wherein the element is 0 or 1, specifically defined as that when one element of the mask matrix M is 1, the corresponding position is simultaneously scanned by the laser sensor and the depth sensor, the position is replaced by the three-dimensional depth data of the depth sensor,
The weight coefficients γ 2 ́ and γ 3 satisfy γ 2´+γ3 =1, and the weight coefficient γ 3>γ2 ́.
5. The unmanned road patrol wrecker-based control system of claim 1, further comprising an adaptive learning module, in communication with the control processing unit, for optimizing the weight coefficients in the feature fusion unit by offline learning and incremental learning.
6. The unmanned road patrol wrecker-based control system of claim 5, wherein the offline learning of the weight coefficients employs a loss function that minimizes mean square error:
;
wherein F target,i is a standard feature vector of the vehicle state noted after the human intervention.
7. The control system based on the unmanned road guard patrol wrecker as claimed in claim 6, wherein the incremental learning method of the weight coefficient is as follows:
When the recent system detection error has a continuous increasing trend, the weight coefficient is updated by adopting the following rule:
;
among the latest 5 tasks, accuracy is the task duty ratio that the error between the system and the measurement score and the artificial check score is within +/-5;
delta is learning rate and is initially limited to 0.01-0.05;
and stopping updating the weight coefficient when Accuracy is more than or equal to 95% so as to maintain the parameter stability of the system.
8. The control system based on the unmanned road guard patrol wrecker as claimed in claim 1, wherein the preset rules for selecting the wrecker mode by the control processing unit are:
When the first confidence score C 1 >75 and the second confidence score C 2 >85, controlling to execute a traction mode to traction the faulty vehicle at a speed of not more than 20 km/h;
when the first confidence score C 1 is more than 75 and the second confidence score C 2 is less than or equal to 85, controlling to execute a lifting mode if the third confidence score C 3 is more than 85, and requesting manual intervention if the third confidence score C 3 is less than or equal to 85;
When the first confidence score C 1 is less than or equal to 75 and the second confidence score C 2 is more than 85, controlling to execute a hoisting mode;
when the first confidence score C 1 is less than or equal to 75 and the second confidence score C 2 is less than or equal to 85, controlling to execute a lifting mode if the third confidence score C 3 >85, and requesting manual intervention if the third confidence score C 3 is less than or equal to 85.
9. A control method based on unmanned road patrol wrecker, which is applied to the control system as claimed in any one of claims 1-8, and is characterized by comprising the following steps:
S10, controlling the image sensor to acquire two-dimensional image data of the head and the tail of a fault vehicle, extracting an image feature vector F img of the two-dimensional image data through a first flow network, and generating a first confidence score C 1;
S20, controlling the laser sensor to obtain three-dimensional point cloud data of two sides of a vehicle body and a roof area of a fault vehicle, and extracting a point cloud feature vector F pc of the three-dimensional point cloud data by using a second flow network;
S21, superposing and fusing the two-dimensional image data and the three-dimensional point cloud data through a self-attention mechanism, and generating a second confidence score C 2;
s30, selecting a corresponding obstacle clearance mode according to the grading result:
When the first confidence score C 1 is more than 75 and the second confidence score C 2 is more than 85, selecting a traction mode, and performing obstacle clearing operation through a trailer hook connected with a fault vehicle;
When the first confidence score C 1 is less than or equal to 75 and the second confidence score C 2 is more than 85, selecting a hoisting mode, hoisting the fault vehicle by connecting four wheels of the fault vehicle, and performing obstacle clearing operation;
S40, if the second confidence score C 2 is less than or equal to 85, controlling the depth sensor to acquire three-dimensional depth data of the wheel and axle area of the fault vehicle, and extracting a depth feature vector F dep of the three-dimensional depth data by using a second flow network;
S41, updating and fusing the three-dimensional point cloud data and the three-dimensional depth data through a self-attention mechanism, and generating a third confidence score C 3;
s50, selecting a corresponding obstacle clearance mode according to the grading result:
When the third confidence score C 3 is more than 85, selecting a lifting mode, lifting and fixing one end of the fault vehicle, and dragging the fault vehicle to perform obstacle clearing operation;
When the third confidence score C 3 is less than or equal to 85, requesting manual intervention;
And S60, recording the grading result of each task and the actual obstacle clearance operation feedback in real time, and optimizing the model weight coefficient in an online increment learning and offline batch learning mode so as to improve the accuracy of the subsequent detection task.
CN202510381147.8A 2025-03-28 2025-03-28 A control system and method based on unmanned road patrol tow truck Active CN120220116B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202510381147.8A CN120220116B (en) 2025-03-28 2025-03-28 A control system and method based on unmanned road patrol tow truck

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202510381147.8A CN120220116B (en) 2025-03-28 2025-03-28 A control system and method based on unmanned road patrol tow truck

Publications (2)

Publication Number Publication Date
CN120220116A CN120220116A (en) 2025-06-27
CN120220116B true CN120220116B (en) 2025-10-03

Family

ID=96107617

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202510381147.8A Active CN120220116B (en) 2025-03-28 2025-03-28 A control system and method based on unmanned road patrol tow truck

Country Status (1)

Country Link
CN (1) CN120220116B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119687918A (en) * 2025-02-24 2025-03-25 江苏海之隅智慧科技有限公司 Navigation obstacle avoidance method and system in low confidence and feature similarity environment

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10991156B2 (en) * 2018-12-05 2021-04-27 Sri International Multi-modal data fusion for enhanced 3D perception for platforms
US11443147B2 (en) * 2020-12-11 2022-09-13 Argo AI, LLC Systems and methods for object detection using stereovision information
WO2024155584A1 (en) * 2023-01-16 2024-07-25 Strong Force Iot Portfolio 2016, Llc Systems, methods, devices, and platforms for industrial internet of things
CN119672577B (en) * 2024-12-05 2025-06-24 中科方寸知微(南京)科技有限公司 Adaptive obstacle detection method and system for UAV inspection routes based on monocular camera
CN119625279A (en) * 2024-12-09 2025-03-14 广东电网有限责任公司 Multimodal target detection method, device and multimodal recognition system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119687918A (en) * 2025-02-24 2025-03-25 江苏海之隅智慧科技有限公司 Navigation obstacle avoidance method and system in low confidence and feature similarity environment

Also Published As

Publication number Publication date
CN120220116A (en) 2025-06-27

Similar Documents

Publication Publication Date Title
CN110969655B (en) Method, device, equipment, storage medium and vehicle for detecting parking space
CN110243380B (en) A Map Matching Method Based on Multi-sensor Data and Angle Feature Recognition
US20240246570A1 (en) Path planning system and path planning method thereof
CN112233136B (en) Method, system, equipment and storage medium for alignment of container trucks based on binocular recognition
CN116403186A (en) 3D object detection method for autonomous driving based on FPN Swin Transformer and Pointnet++
CN111610191A (en) A road detection and repair system
US12036832B2 (en) Systems and methods for tethering an autonomous electric-powered trailer to a towing nexus
CN120220116B (en) A control system and method based on unmanned road patrol tow truck
WO2024093616A1 (en) Unmanned drayage truck alignment method, apparatus, and device, and readable storage medium
CN115371690B (en) Positioning method, device and movable device based on environmental condition changes
CN114419567B (en) Laser radar target detection method and device based on deep learning
CN113298044B (en) Obstacle detection method, system, device and storage medium based on positioning compensation
CN114299466A (en) Method, device and electronic device for vehicle attitude determination based on monocular camera
CN117975396A (en) Traffic sign line positioning method, system, device and medium
CN118134875A (en) A method and system for calculating the size and severity of road rutting subsidence based on machine learning
CN115457088B (en) Method and system for fixing axle of train
CN115100251A (en) Thermal imager and laser radar-based vehicle front pedestrian detection method and terminal
CN118125011B (en) A vehicle-mounted mechanism controller and control method for a non-ground garbage truck
CN120524696B (en) A method and equipment for guiding vehicles into a flatbed trailer.
CN120800384B (en) A Local Path Planning Method for Undercarriage Inspection Robots Based on Cross-Modal Cooperative Learning
CN121661613B (en) A confidence-based collaborative inspection system and method for unmanned vehicles and robot dogs
US12528321B2 (en) Systems and methods for positioning one or more coupling sensors on an autonomous electric-powered trailer
CN121740071A (en) Path planning method, vehicle running control method, related device and vehicle
CN119148158B (en) A method for recognizing the outline of quay crane container based on improved laser features
US20230314167A1 (en) Method and apparatus for generating map data, and non-transitory computer-readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant