CN116258224A

CN116258224A - Express delivery time prediction method, device, computer equipment and storage medium

Info

Publication number: CN116258224A
Application number: CN202111490832.2A
Authority: CN
Inventors: 武晨; 李珂
Original assignee: SF Technology Co Ltd
Current assignee: SF Technology Co Ltd
Priority date: 2021-12-08
Filing date: 2021-12-08
Publication date: 2023-06-13
Anticipated expiration: 2041-12-08
Also published as: CN116258224B

Abstract

The application provides a method, device, computer equipment and storage medium for predicting the timeliness of a shipment. The method includes: obtaining the shipment attribute information of the target shipment; performing feature engineering processing on the shipment attribute information to obtain the shipment characteristics of the target shipment; inputting the shipment characteristics into The trained timeliness prediction model outputs the predicted probability value of the target shipment under each preset timeliness type; the trained timeliness prediction model is composed of a fully connected layer, a dropout layer, a fusion layer, and a normalized layer; analysis Each prediction probability value is used to determine the logistics timeliness of the target express. Adopting this method can improve the prediction accuracy rate of express delivery timeliness.

Description

Express delivery time prediction method, device, computer equipment and storage medium

技术领域technical field

本申请实施例涉及人工智能技术领域，特别是涉及一种快件时效预测方法、装置、计算机设备及存储介质。The embodiments of the present application relate to the technical field of artificial intelligence, and in particular, to a method, device, computer equipment, and storage medium for express delivery timeliness prediction.

背景技术Background technique

在物流领域中，快件承诺时效通常是根据业务经验粗略估计的，统称为快件时效预测方法，包括但不限于如下几种方式：(1)以城市中一个代表网点的某产品流向的85％快件能够达成的时效，作为这个城市所有网点的该产品该流向的承诺时效；(2)将快件生命周期分成收件端-网点-中转场-中转场-网点-派件端的六点五段的形式，每一段根据网点/中转场的每天的班次规划，根据历史14天数据计算85％的可达成时效，然后进行累加。In the field of logistics, the timeliness of express delivery commitments is usually roughly estimated based on business experience, collectively referred to as express delivery timeliness prediction methods, including but not limited to the following methods: (1) 85% of the express delivery of a certain product in a representative network in the city The timeliness that can be achieved is the promised timeliness of the flow of the product at all outlets in this city; (2) Divide the express delivery life cycle into six points and five stages: receiving end-outlet-transit station-transit station-outlet-delivering end , each section calculates 85% of the achievable time limit according to the daily shift planning of the outlet/transit station, and then accumulates it based on the historical 14-day data.

然而，第一种方式的缺点是对相同城市网点不加区分的给与一个固定承诺时效，会产生一定的偏差，根据件量选择的代表网点的时效并不一定具有代表性；第二种方式的缺点是历史14天数据只能覆盖70％的流向，导致预测时数据缺失，每一段独立计算导致误差累加，5张表每周2次的先后依赖刷新，不仅耗费大量机器资源还存在因某环节调度失败后续环节需要重跑的风险。However, the disadvantage of the first method is that a fixed commitment time limit is given to the outlets in the same city without distinction, which will produce a certain deviation, and the timeliness of the representative outlets selected according to the number of pieces is not necessarily representative; the second method The disadvantage is that the historical 14-day data can only cover 70% of the flow direction, resulting in missing data during forecasting, and the accumulation of errors caused by independent calculations for each segment. The 5 tables need to be refreshed twice a week, which not only consumes a lot of machine resources, but also has some problems. The risk of re-running subsequent links if the link scheduling fails.

因此，传统的快件时效预测方法存在着预测准确率低的问题。Therefore, the traditional method of forecasting the timeliness of express shipments has the problem of low prediction accuracy.

发明内容Contents of the invention

本申请的目的在于提供一种快件时效预测方法、装置、计算机设备及存储介质，用以提升快件时效的预测准确率。The purpose of the present application is to provide a method, device, computer equipment and storage medium for predicting the timeliness of express shipments, so as to improve the prediction accuracy of the timeliness of express shipments.

第一方面，本申请提供一种快件时效预测方法，包括：In the first aspect, the present application provides a method for predicting the timeliness of express shipments, including:

获取目标快件的快件属性信息；Obtain the shipment attribute information of the target shipment;

对快件属性信息进行特征工程处理，得到目标快件的快件特征；Perform feature engineering processing on the attribute information of the shipment to obtain the shipment characteristics of the target shipment;

将快件特征输入至已训练的时效预测模型，输出目标快件在各个预设时效类型下的预测概率值；其中，已训练的时效预测模型是由全连接层、dropout层、融合层以及归一化层构成的；Input the features of the express shipment into the trained timeliness prediction model, and output the predicted probability value of the target shipment under each preset timeliness type; where the trained timeliness prediction model is composed of a fully connected layer, a dropout layer, a fusion layer and a normalized layered

分析各预测概率值，以确定目标快件的物流时效。Analyze each predicted probability value to determine the logistics timeliness of the target express.

在本申请一些实施例中，将快件特征输入至已训练的时效预测模型，输出目标快件在各个预设时效类型下的预测概率值，包括：将快件特征进行特征合并，得到合并后的快件特征；将快件特征输入至已训练的时效预测模型，输出特征内部的第一关联信息；以及将合并后的快件特征输入至已训练的时效预测模型，输出特征之间的第二关联信息；分析第一关联信息和第二关联信息，得到目标快件在各个预设时效类型下的预测概率值。In some embodiments of the present application, the express shipment features are input into the trained aging prediction model, and the predicted probability values of the target shipments under each preset aging type are output, including: combining the express shipment features to obtain the combined express shipment features ; Input the express features into the trained aging prediction model, and output the first correlation information inside the features; and input the combined express features into the trained aging prediction model, and output the second correlation information between the features; analyze the first The first association information and the second association information are used to obtain the predicted probability values of the target express shipment under each preset aging type.

在本申请一些实施例中，将快件特征输入至已训练的时效预测模型，输出特征内部的第一关联信息，包括：将快件特征输入至已训练的时效预测模型，通过全连接层和dropout层对快件特征进行特征分类，得到初始特征向量和初始特征向量的差异化系数；通过融合层对初始特征向量和差异化系数进行融合，得到携带有差异化系数的目标特征向量；通过归一化层对目标特征向量进行归一化处理，得到特征内部的第一关联信息。In some embodiments of the present application, the express feature is input into the trained time-efficiency prediction model, and the first associated information inside the feature is output, including: inputting the express feature into the trained time-efficiency prediction model, through the fully connected layer and the dropout layer Classify the express features to obtain the initial feature vector and the differentiation coefficient of the initial feature vector; fuse the initial feature vector and the differentiation coefficient through the fusion layer to obtain the target feature vector with the differentiation coefficient; through the normalization layer The target feature vector is normalized to obtain the first associated information inside the feature.

在本申请一些实施例中，分析第一关联信息和第二关联信息，得到目标快件在各个预设时效类型下的预测概率值，包括：获取第一关联信息对应的第一特征矩阵，及第二关联信息对应的第二特征矩阵；将第一特征矩阵与第二特征矩阵进行矩阵相乘，得到综合特征；通过携带有sigmoid函数的目标全连接层，对综合特征进行特征分类，得到目标快件在各个预设时效类型下的预测概率值。In some embodiments of the present application, analyzing the first association information and the second association information to obtain the predicted probability value of the target express shipment under each preset aging type includes: obtaining the first characteristic matrix corresponding to the first association information, and the second The second feature matrix corresponding to the two related information; matrix multiplication of the first feature matrix and the second feature matrix to obtain the comprehensive feature; through the target fully connected layer carrying the sigmoid function, the feature classification of the comprehensive feature is obtained to obtain the target express Predicted probability values under each preset aging type.

在本申请一些实施例中，在将快件特征输入至已训练的时效预测模型之前，还包括：构建初始的时效预测模型；基于预设的初始学习率，对初始的时效预测模型进行初步训练，直至训练次数达到预设第一阈值，得到初步训练后的时效预测模型；基于预设的峰值学习率，对初步训练后的时效预测模型进行水平训练，直至训练次数达到预设第二阈值，得到水平训练后的时效预测模型；按照峰值学习率的三角函数曲线，对水平训练后的时效预测模型进行调试训练，得到已训练的时效预测模型。In some embodiments of the present application, before inputting the features of the shipment into the trained aging prediction model, it also includes: constructing an initial aging prediction model; performing preliminary training on the initial aging prediction model based on a preset initial learning rate, Until the number of training times reaches the preset first threshold, the time-effect prediction model after the preliminary training is obtained; based on the preset peak learning rate, the time-effect prediction model after the preliminary training is trained horizontally until the number of training times reaches the preset second threshold, and the time-effect prediction model is obtained The time-effect prediction model after level training; according to the trigonometric function curve of the peak learning rate, the time-effect prediction model after level training is debugged and trained to obtain the trained time-effect prediction model.

在本申请一些实施例中，对快件属性信息进行特征工程处理，得到目标快件的快件特征，包括：提取快件属性信息包括的地址信息、时间信息以及产品信息；对地址信息进行特征工程处理，得到网点信息和距离信息作为快件特征；以及对时间信息进行特征工程处理，得到节日信息和寄件时刻作为快件特征；以及对产品信息进行特征工程处理，得到产品类型信息作为快件特征。In some embodiments of the present application, feature engineering processing is performed on the attribute information of the express to obtain the express features of the target express, including: extracting address information, time information and product information included in the attribute information of the express; performing feature engineering processing on the address information to obtain The outlet information and distance information are used as the express features; and the time information is processed by feature engineering, and the festival information and delivery time are obtained as the express features; and the product information is processed by feature engineering, and the product type information is obtained as the express features.

在本申请一些实施例中，分析各预测概率值，以确定目标快件的物流时效，包括：筛选出各预测概率值中的最大值，得到目标预测概率值；确定目标预测概率值对应的时效类型，作为目标快件的物流时效。In some embodiments of the present application, analyzing each predicted probability value to determine the logistics timeliness of the target express delivery includes: screening out the maximum value of each predicted probability value to obtain the target predicted probability value; determining the timeliness type corresponding to the target predicted probability value , as the logistics timeliness of the target shipment.

第二方面，本申请提供一种快件时效预测装置，包括：In the second aspect, the present application provides a device for predicting the aging of express shipments, including:

信息获取模块，用于获取目标快件的快件属性信息；An information acquisition module, configured to acquire the attribute information of the target express;

特征工程模块，用于对快件属性信息进行特征工程处理，得到目标快件的快件特征；The feature engineering module is used to perform feature engineering processing on the attribute information of the express to obtain the express features of the target express;

模型分析模块，用于将快件特征输入至已训练的时效预测模型，输出目标快件在各个预设时效类型下的预测概率值；其中，已训练的时效预测模型是由全连接层、dropout层、融合层以及归一化层构成的；The model analysis module is used to input the characteristics of the express shipment into the trained aging prediction model, and output the predicted probability value of the target shipment under each preset aging type; wherein, the trained aging prediction model is composed of a fully connected layer, a dropout layer, Composed of fusion layer and normalization layer;

时效确定模块，用于分析各预测概率值，以确定目标快件的物流时效。The timeliness determining module is used for analyzing each predicted probability value to determine the logistics timeliness of the target express.

第三方面，本申请还提供一种计算机设备，包括：In a third aspect, the present application also provides a computer device, including:

一个或多个处理器；one or more processors;

存储器；以及一个或多个应用程序，其中的一个或多个应用程序被存储于存储器中，并配置为由处理器执行以实现上述第一方面的快件时效预测方法。memory; and one or more application programs, wherein one or more application programs are stored in the memory and configured to be executed by the processor to implement the method for predicting the delivery time in the first aspect above.

第四方面，本申请还提供一种计算机可读存储介质，其上存储有计算机程序，计算机程序被处理器进行加载，以执行快件时效预测方法中的步骤。In a fourth aspect, the present application also provides a computer-readable storage medium, on which a computer program is stored, and the computer program is loaded by a processor to execute the steps in the method for express delivery timeliness prediction.

第五方面，本申请实施例提供一种计算机程序产品或计算机程序，该计算机程序产品或计算机程序包括计算机指令，该计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令，处理器执行该计算机指令，使得该计算机设备执行上述第一方面提供的方法。In a fifth aspect, embodiments of the present application provide a computer program product or computer program, where the computer program product or computer program includes computer instructions, and the computer instructions are stored in a computer-readable storage medium. The processor of the computer device reads the computer instruction from the computer-readable storage medium, and the processor executes the computer instruction, so that the computer device executes the method provided by the first aspect above.

上述快件时效预测方法、装置、计算机设备及存储介质，服务器通过获取目标快件的快件属性信息，并对快件属性信息进行特征工程处理，可得到目标快件的快件特征，进而将快件特征输入至已训练的时效预测模型，即可输出得到目标快件在各个预设时效类型下的预测概率值，以此分析各预测概率值，确定目标快件的物流时效。由此，采用本申请提出的时效预测模型对目标快件进行时效预测，无需关注快件中转环节的耗时影响，确保了预测结果的稳定性，最终提升了快件时效的预测准确率。In the aforementioned method, device, computer equipment, and storage medium for predicting the timeliness of a shipment, the server obtains the shipment characteristics of the target shipment by obtaining the shipment attribute information of the target shipment, and performs feature engineering processing on the shipment attribute information, and then inputs the shipment characteristics into the trained The timeliness prediction model can output the predicted probability value of the target express under each preset timeliness type, and analyze each predicted probability value to determine the logistics timeliness of the target express. Therefore, the timeliness prediction model proposed in this application is used to predict the timeliness of the target express shipment, without paying attention to the time-consuming impact of the express delivery link, ensuring the stability of the prediction results, and finally improving the prediction accuracy of the expressage timeliness.

附图说明Description of drawings

为了更清楚地说明本申请实施例中的技术方案，下面将对实施例描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本申请的一些实施例，对于本领域技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings that need to be used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present application. For those skilled in the art, other drawings can also be obtained based on these drawings without any creative effort.

图1为本申请实施例中提供的快件时效预测方法的场景示意图；Fig. 1 is the schematic diagram of the scenario of the method for predicting the express delivery timeliness provided in the embodiment of the present application;

图2为本申请实施例中提供的快件时效预测方法的流程示意图；Fig. 2 is a schematic flow chart of the method for predicting the express delivery timeliness provided in the embodiment of the present application;

图3为本申请实施例中提供的时效预测模型的架构应用流程图；Fig. 3 is the framework application flow diagram of the aging prediction model provided in the embodiment of the present application;

图4是本申请实施例中提供的快件时效预测装置的结构示意图；Fig. 4 is a schematic structural diagram of an express delivery time prediction device provided in an embodiment of the present application;

图5是本申请实施例中提供的计算机设备的结构示意图。Fig. 5 is a schematic structural diagram of a computer device provided in an embodiment of the present application.

具体实施方式Detailed ways

下面将结合本申请实施例中的附图，对本申请实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅仅是本申请一部分实施例，而不是全部的实施例。基于本申请中的实施例，本领域技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本申请保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the application with reference to the drawings in the embodiments of the application. Apparently, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by those skilled in the art without making creative efforts belong to the scope of protection of this application.

在本申请的描述中，术语“第一”、“第二”仅用于描述目的，而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此，限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个所述特征。在本申请的描述中，“多个”的含义是两个或两个以上，除非另有明确具体的限定。In the description of the present application, the terms "first" and "second" are used for descriptive purposes only, and cannot be understood as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more of said features. In the description of the present application, "plurality" means two or more, unless otherwise specifically defined.

在本申请的描述中，术语“例如”一词用来表示“用作例子、例证或说明”。本申请中被描述为“例如”的任何实施例不一定被解释为比其它实施例更优选或更具优势。为了使本领域任何技术人员能够实现和使用本发明，给出了以下描述。在以下描述中，为了解释的目的而列出了细节。应当明白的是，本领域普通技术人员可以认识到，在不使用这些特定细节的情况下也可以实现本发明。在其它实例中，不会对公知的结构和过程进行详细阐述，以避免不必要的细节使本发明的描述变得晦涩。因此，本发明并非旨在限于所示的实施例，而是与符合本申请所公开的原理和特征的最广范围相一致。In the description of this application, the term "for example" is used to mean "serving as an example, illustration or illustration". Any embodiment described in this application as "such as" is not necessarily to be construed as preferred or advantageous over other embodiments. The following description is given to enable any person skilled in the art to make and use the invention. In the following description, details are set forth for purposes of explanation. It should be understood that one of ordinary skill in the art would recognize that the present invention may be practiced without the use of these specific details. In other instances, well-known structures and procedures are not described in detail to avoid obscuring the description of the present invention with unnecessary detail. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed in this application.

在本申请实施例中，本申请实施例提供的快件时效预测方法，可以应用于如图1所示的快件时效预测系统中。其中，快件时效预测系统包括终端102和服务器104。终端102可以是既包括接收和发射硬件的设备，即具有能够在双向通信链路上，执行双向通信的接收和发射硬件的设备。这种设备可以包括：蜂窝或其他通信设备，其具有单线路显示器或多线路显示器或没有多线路显示器的蜂窝或其他通信设备。终端102具体可以是台式终端或移动终端，终端102具体还可以是手机、平板电脑、笔记本电脑中的一种。服务器104可以是独立的服务器，也可以是服务器组成的服务器网络或服务器集群，其包括但不限于计算机、网络主机、单个网络服务器、多个网络服务器集或多个服务器构成的云服务器。其中，云服务器由基于云计算(Cloud Computing)的大量计算机或网络服务器构成。此外，终端102与服务器104之间通过网络建立通信连接，网络具体可以是广域网、局域网、城域网中的任意一种。In the embodiment of the present application, the express delivery aging prediction method provided in the embodiment of the present application can be applied to the express delivery aging prediction system as shown in FIG. 1 . Wherein, the delivery time prediction system includes a terminal 102 and a server 104 . The terminal 102 may be a device including both receiving and transmitting hardware, that is, a device having receiving and transmitting hardware capable of performing bidirectional communication over a bidirectional communication link. Such devices may include cellular or other communication devices having a single line display or a multi-line display or a cellular or other communication device without a multi-line display. Specifically, the terminal 102 may be a desktop terminal or a mobile terminal, and specifically, the terminal 102 may also be one of a mobile phone, a tablet computer, and a notebook computer. The server 104 can be an independent server, or a server network or server cluster composed of servers, including but not limited to computers, network hosts, single network servers, multiple network server sets, or cloud servers composed of multiple servers. Wherein, the cloud server is composed of a large number of computers or network servers based on cloud computing (Cloud Computing). In addition, a communication connection is established between the terminal 102 and the server 104 through a network, and the network may specifically be any one of a wide area network, a local area network, and a metropolitan area network.

本领域技术人员可以理解，图1中示出的应用环境，仅仅是适用于本申请方案的一种应用场景，并不构成对本申请方案应用场景的限定，其他的应用环境还可以包括比图1中所示更多或更少的设备。例如，图1中仅示出1个服务器。可以理解的是，该快件时效预测系统还可以包括一个或多个其他服务器，具体此处不作限定。另外，如图1所示，该快件时效预测系统还可以包括存储器，用于存储数据，如存储目标快件的快件属性信息。Those skilled in the art can understand that the application environment shown in Figure 1 is only an application scenario applicable to the solution of this application, and does not constitute a limitation on the application scenario of the solution of this application. more or less equipment than shown. For example, only one server is shown in FIG. 1 . It can be understood that the delivery time prediction system may also include one or more other servers, which are not specifically limited here. In addition, as shown in FIG. 1 , the express mail aging prediction system may further include a memory for storing data, such as storing express mail attribute information of a target express mail.

需要说明的是，图1所示的快件时效预测系统的场景示意图仅仅是一个示例，本发明实施例描述的快件时效预测系统以及场景是为了更加清楚的说明本发明实施例的技术方案，并不构成对于本发明实施例提供的技术方案的限定，本领域普通技术人员可知，随着快件时效预测系统的演变和新业务场景的出现，本发明实施例提供的技术方案对于类似的技术问题，同样适用。It should be noted that the schematic diagram of the scene of the delivery timeliness prediction system shown in FIG. Constituting a limitation on the technical solutions provided by the embodiments of the present invention, those of ordinary skill in the art know that with the evolution of the express delivery timeliness prediction system and the emergence of new business scenarios, the technical solutions provided by the embodiments of the present invention are equally applicable to similar technical problems Be applicable.

参阅图2，本申请实施例提供了一种快件时效预测方法，本实施例主要以该方法应用于上述图1中的服务器104来举例说明，该方法包括步骤S201至S204，具体如下：Referring to Fig. 2, the embodiment of the present application provides a method for predicting the timeliness of express mail. This embodiment mainly uses the method to be applied to the server 104 in the above Fig. 1 as an example. The method includes steps S201 to S204, as follows:

S201，获取目标快件的快件属性信息。S201. Obtain shipment attribute information of a target shipment.

其中，目标快件可以为选用于时效预测所用的待托运货物，且待托运货物具体为运输速度较快、运费较高的“快件”。需要说明的是，本申请实施例提出采用目标快件进行物流时效预测，本意在于不区分物流领域中的货物类型，即不区分“快件”和“慢件”等，均以“快件”指代物流领域中待托运的货物，但若是在其他实施例中区分“快件”和“慢件”等货物类型，则可采用对应类型的货物信息进行物流时效预测。Wherein, the target express shipment may be the goods to be consigned selected for timeliness prediction, and the goods to be consigned are specifically "express shipments" with faster transportation speed and higher freight costs. It should be noted that the embodiment of this application proposes the use of target express shipments for logistics timeliness prediction. The original intention is not to distinguish the types of goods in the logistics field, that is, to not distinguish between "express shipments" and "slow shipments", and to use "express shipments" to refer to logistics The goods to be consigned in the field, but if the types of goods such as "express" and "slow" are distinguished in other embodiments, the corresponding type of goods information can be used for logistics timeliness prediction.

其中，快件属性信息可以包括地址信息、时间信息以及产品信息，地址信息可以包括寄件地址信息、收件地址信息等；时间信息可以包括寄件时间、该流向历史耗时等；产品信息可以包括产品类型等。Among them, the express attribute information may include address information, time information, and product information, and the address information may include sending address information, receiving address information, etc.; time information may include sending time, time-consuming history of the flow, etc.; product information may include product type etc.

具体实现中，服务器104为了提升快件时效的预测准确率，可首先获取待预测的目标快件的快件属性信息，而针对于快件属性信息的获取方式，包括但不局限于如下方式：1、在普通网络结构中，服务器104从终端102或其他建立有网络连接的设备处接收快件属性信息；2、在预建立的区块链网络中，服务器104可从其他终端节点或服务器节点处同步获取快件属性信息，该区块链网络可以是公有链、私有链等；3、在预置的树状结构中，服务器104可从上级服务器请求得到快件属性信息，或是从下级服务器轮询得到快件属性信息。In the specific implementation, in order to improve the prediction accuracy of the delivery timeliness, the server 104 may first obtain the delivery attribute information of the target delivery to be predicted, and the methods for obtaining the delivery attribute information include but are not limited to the following methods: 1. In the network structure, the server 104 receives the express mail attribute information from the terminal 102 or other devices with network connections; 2. In the pre-established blockchain network, the server 104 can obtain the express mail attribute information synchronously from other terminal nodes or server nodes information, the blockchain network can be a public chain, a private chain, etc.; 3. In the preset tree structure, the server 104 can request the express mail attribute information from the upper-level server, or poll the lower-level server to obtain the express mail attribute information .

除此之外，服务器104还可获取各个快件的地址信息、时间信息以及产品信息，进而整合得到目标快件的快件属性信息。此时，各个快件的地址信息、时间信息以及产品信息可由同一设备获取得到，也可由多个设备分别获取得到。In addition, the server 104 can also obtain the address information, time information and product information of each shipment, and then integrate and obtain the shipment attribute information of the target shipment. At this time, the address information, time information and product information of each express shipment can be obtained by the same device, or can be obtained by multiple devices respectively.

S202，对快件属性信息进行特征工程处理，得到目标快件的快件特征。S202. Perform feature engineering processing on the attribute information of the express to obtain the express features of the target express.

其中，特征工程指的是把原始数据转变为模型的训练数据的过程，它的目的就是获取更好的训练数据特征，使得机器学习模型逼近这个上限。特征工程能使得模型的性能得到提升，有时甚至在简单的模型上也能取得不错的效果。特征工程在机器学习中占有非常重要的作用，一般包括“特征构建、特征提取、特征选择”三个部分。Among them, feature engineering refers to the process of transforming the original data into the training data of the model. Its purpose is to obtain better training data characteristics, so that the machine learning model can approach this upper limit. Feature engineering can improve the performance of the model, and sometimes even achieve good results on simple models. Feature engineering plays a very important role in machine learning, and generally includes three parts: "feature construction, feature extraction, and feature selection".

具体实现中，服务器104执行对快件属性信息的特征工程处理操作，实际可视为特征设计，本申请实施例中的特征设计将突出时刻信息和静态信息，去掉训练集与测试集之间日期差异对模型的影响，提升模型的泛化能力。In a specific implementation, the server 104 performs feature engineering processing operations on the attribute information of the express, which can actually be regarded as feature design. The feature design in the embodiment of the present application will highlight time information and static information, and remove the date difference between the training set and the test set. The impact on the model improves the generalization ability of the model.

在一个实施例中，本步骤包括：提取快件属性信息包括的地址信息、时间信息以及产品信息；对地址信息进行特征工程处理，得到网点信息和距离信息作为快件特征；以及对时间信息进行特征工程处理，得到节日信息和寄件时刻作为快件特征；以及对产品信息进行特征工程处理，得到产品类型信息作为快件特征。In one embodiment, this step includes: extracting the address information, time information and product information included in the attribute information of the express; performing feature engineering processing on the address information to obtain network point information and distance information as express features; and performing feature engineering on the time information Process to get holiday information and delivery time as express features; and perform feature engineering processing on product information to get product type information as express features.

具体实现中，服务器104可对地址信息进行特征工程处理，得到收寄端各自的网点、网点级别、上级网点、网点类型、区部、大区、城市代表网点、行政区代表网点等网点信息，以及城市直线距离、城市路程距离、经纬度、距离类型等距离信息，均可作为快件特征，本申请实施例分析所得的特征可包括11个数值特征和27个类别特征，总计38个快件特征。In a specific implementation, the server 104 can perform feature engineering processing on the address information to obtain information about outlets, outlet levels, upper-level outlets, outlet types, districts, regions, city representative outlets, administrative district representative outlets, etc. of the receiving and sending ends, and Distance information such as city line distance, city distance, longitude and latitude, and distance type can all be used as express features. The features obtained from the analysis of the embodiment of the present application can include 11 numerical features and 27 category features, a total of 38 express features.

例如，收寄端各自的网点可以是某个学校、大厦、社区办公室等；网点级别可按业务规则划分为一级、二级、三级等，业务规则可以是收件量大小、网点覆盖面积、工作人员数量等；网点类型可以包括医院、学校等；区部可以包括城市区部，如深圳市南山区、福田区等；大区可以包括华中、华南、华西、华北等；城市直线距离可以是收寄端之间的地图直线距离，距离类型包括同城、省内、省外、经济圈等。For example, the respective outlets of the receiving and sending ends can be a certain school, building, community office, etc.; the outlets can be divided into first, second, and third grades according to business rules. , the number of staff, etc.; the types of outlets can include hospitals, schools, etc.; districts can include urban districts, such as Nanshan District, Shenzhen, Futian District, etc.; large districts can include Central China, South China, West China, North China, etc.; It is the straight-line distance on the map between the sending and receiving ends, and the distance types include the same city, within the province, outside the province, and the economic circle.

进一步地，服务器104可对时间信息进行特征工程处理，得到是否节假日、城市路程耗时、该流向历史耗时75，80，85分位数等信息，再将“yyyy-MM-dd hh:mm:ss”的寄件时间转化为“hh*3600+mm*60+ss”的float数值，均可作为快件特征。其中，“yyyy-MM-dd hh:mm:ss”代表将时间转换为12小时制，例如，“2018-06-27 03:24:21”。Further, the server 104 can perform feature engineering processing on the time information to obtain information such as whether it is a holiday, the city journey time, and the 75th, 80th, and 85th quantile of the historical time of the flow, and then "yyyy-MM-dd hh:mm The shipping time of ":ss" is converted into the float value of "hh*3600+mm*60+ss", which can be used as the express feature. Among them, "yyyy-MM-dd hh:mm:ss" means converting the time to a 12-hour clock, for example, "2018-06-27 03:24:21".

更进一步地，服务器104可对产品信息进行特征工程处理，得到产品类型信息，如标快、特快等产品类型，作为快件特征。需要说明的是，在本实施例中，所有快件特征都是描述寄件地和收件地的“静态特征”，而寄件地信息和收件地信息一般都是完整的，其他关于距离和历史耗时的特征都可以通过关联现有的已知信息直接获得，所以数据覆盖率可以达到100％，不存在数据缺失问题，也即不会降低快件时效预测的准确率。Furthermore, the server 104 can perform feature engineering processing on product information to obtain product type information, such as standard express, express and other product types, as express features. It should be noted that, in this embodiment, all express features are "static features" describing the place of origin and destination, and the information of the place of origin and destination are generally complete, and other information about distance and Historical time-consuming features can be directly obtained by correlating existing known information, so the data coverage can reach 100%, and there is no problem of missing data, that is, the accuracy of express delivery time prediction will not be reduced.

S203，将快件特征输入至已训练的时效预测模型，输出目标快件在各个预设时效类型下的预测概率值；其中，已训练的时效预测模型是由全连接层、dropout层、融合层以及归一化层构成的。S203. Input the features of the express shipment into the trained time-effect prediction model, and output the predicted probability values of the target shipment under each preset time-effect type; wherein, the trained time-effect prediction model is composed of a fully connected layer, a dropout layer, a fusion layer and a regression layer. It is composed of one chemical layer.

其中，预设时效类型可以是由时间范围构成的时效类型，例如，时效类型包括“2d12”、“2d18”、“2d22”等，“2d12”表示为“今天到明天12点”的时间范围，“2d18”表示为“今天到明天18点”的时间范围，“2d22”表示为“今天到明天22点”的时间范围，以此内推。可以理解的是，具体的时间范围可依据实际业务需求设定，本申请不做具体限定。Wherein, the preset aging type may be an aging type composed of a time range, for example, the aging type includes "2d12", "2d18", "2d22", etc., and "2d12" means the time range of "12 o'clock from today to tomorrow", "2d18" means the time range of "18:00 today to tomorrow", "2d22" means the time range of "22:00 today to tomorrow", and extrapolate accordingly. It can be understood that the specific time range can be set according to actual business requirements, and is not specifically limited in this application.

具体实现中，服务器104分析获取到快件特征之后，可调用已训练的时效预测模型，进而将快件特征输入至已训练的时效预测模型中进行特征分析，以使已训练的时效预测模型输出目标快件在各个预设时效类型下的预测概率值。除此之外，服务器104调用已训练的时效预测模型之前，可对时效预测模型进行模型训练，且本申请提出采用“直线上升—>保持—>曲线下降”的学习率自动化训练时效预测模型，以最大优化模型性能。如此，模型预测目标仅根据收寄两端的信息，并模糊了各种中间中转环节，就端到端式推断快件生命周期的“整体耗时”，即无需分阶段计算并叠加每段耗时，避免了多环节调度相互依赖的缺陷，提升了快件时效的预测准确率。本实施例中涉及的模型训练步骤、模型分析步骤将在下文详细说明。In a specific implementation, after the server 104 analyzes and acquires the characteristics of the express shipment, it can call the trained timeliness prediction model, and then input the expressage features into the trained timeliness prediction model for feature analysis, so that the trained timeliness prediction model outputs the target shipment Predicted probability values under each preset aging type. In addition, before the server 104 invokes the trained timeliness prediction model, model training can be performed on the timeliness prediction model, and this application proposes to automatically train the timeliness prediction model using the learning rate of "straight up -> maintain -> curve down", To maximize model performance. In this way, the model predicts the target only based on the information at the two ends of the delivery, and blurs the various intermediate transit links, and infers the "overall time consumption" of the express life cycle in an end-to-end manner, that is, there is no need to calculate and superimpose each period of time consumption in stages, It avoids the defect of multi-link scheduling interdependence, and improves the prediction accuracy of express delivery timeliness. The model training steps and model analysis steps involved in this embodiment will be described in detail below.

在一个实施例中，本步骤包括：将快件特征进行特征合并，得到合并后的快件特征；将快件特征输入至已训练的时效预测模型，输出特征内部的第一关联信息；以及将合并后的快件特征输入至已训练的时效预测模型，输出特征之间的第二关联信息；分析第一关联信息和第二关联信息，得到目标快件在各个预设时效类型下的预测概率值。In one embodiment, this step includes: performing feature merging on the express features to obtain the combined express features; inputting the express features to the trained aging prediction model, and outputting the first associated information inside the features; and combining the combined The features of the shipment are input to the trained aging prediction model, and the second correlation information between the features is output; the first correlation information and the second correlation information are analyzed to obtain the predicted probability value of the target shipment under each preset aging type.

具体实现中，可参阅图3，为本申请实施例提供的时效预测模型的架构应用流程图，本申请提供的时效预测模型采用独特的网络结构，具体由全连接层、dropout层、融合层以及归一化层构成，其不仅能够实现快件特征的抽象化，还能够自动学习到快件特征不同维度的差异系数。In the specific implementation, please refer to Fig. 3, which is a flowchart of the application of the framework of the time-sensitive prediction model provided by the embodiment of the present application. The normalization layer is composed, which can not only realize the abstraction of express features, but also automatically learn the difference coefficients of different dimensions of express features.

具体而言，服务器104将38个快件特征输入至已训练的时效预测模型之后，各个快件特征将顺序经过全连接层、dropout层、融合层以及归一化层，得到特征内部的差异信息和相关信息，作为第一关联信息；同时，时效预测模型不仅可应用于38个独立的快件特征，也可应用于“合并之后”的总快件特征。也即是说，服务器104可将快件特征进行特征合并，得到合并后的快件特征作为总快件特征，进而将总快件特征输入至已训练的时效预测模型进行分析，同样顺序经过全连接层、dropout层、融合层以及归一化层，即可得到特征之间的差异信息和相关信息，作为第二关联信息。最终，分析第一关联信息和第二关联信息，即可得到目标快件在各个预设时效类型下的预测概率值。Specifically, after the server 104 inputs 38 shipment features into the trained time-efficiency prediction model, each shipment feature will sequentially pass through the fully connected layer, dropout layer, fusion layer, and normalization layer to obtain the internal difference information and correlation of the features. Information, as the first associated information; at the same time, the aging prediction model can not only be applied to the 38 independent express features, but also can be applied to the total express features "after the merger". That is to say, the server 104 can combine the features of the express shipments to obtain the combined express features as the total express features, and then input the total express features into the trained time-efficiency prediction model for analysis, and also go through the fully connected layer, dropout Layer, fusion layer and normalization layer, the difference information and related information between features can be obtained as the second related information. Finally, by analyzing the first associated information and the second associated information, the predicted probability values of the target express shipment under each preset aging type can be obtained.

在一个实施例中，将快件特征输入至已训练的时效预测模型，输出特征内部的第一关联信息，包括：将快件特征输入至已训练的时效预测模型，通过全连接层和dropout层对快件特征进行特征分类，得到初始特征向量和初始特征向量的差异化系数；通过融合层对初始特征向量和差异化系数进行融合，得到携带有差异化系数的目标特征向量；通过归一化层对目标特征向量进行归一化处理，得到特征内部的第一关联信息。In one embodiment, the express feature is input into the trained time-efficiency prediction model, and the first associated information inside the feature is output, including: inputting the express feature into the trained time-efficiency prediction model, and the express is processed through the fully connected layer and the dropout layer. Features are classified to obtain the initial feature vector and the differentiation coefficient of the initial feature vector; the initial feature vector and the differentiation coefficient are fused through the fusion layer to obtain the target feature vector with the differentiation coefficient; the target feature vector is obtained through the normalization layer The feature vector is normalized to obtain the first associated information inside the feature.

具体实现中，服务器104可将快件特征输入至已训练的时效预测模型，以使各个快件特征顺序经过带ReLu激活函数的dense全连接层、无激活函数的dense全连接层、dropout层、16维的dense全连接层，得到16维特征向量作为初始特征向量。此时，若初始特征向量再经过带sigmoid函数的全连接层，则将计算出各个初始特征向量的差异化系数。In a specific implementation, the server 104 can input the express features into the trained time-effect prediction model, so that each express feature can pass through the dense fully connected layer with the ReLu activation function, the dense fully connected layer without the activation function, the dropout layer, and the 16-dimensional The dense fully connected layer obtains a 16-dimensional feature vector as the initial feature vector. At this time, if the initial feature vector passes through the fully connected layer with sigmoid function, the differentiation coefficient of each initial feature vector will be calculated.

进一步地，分别利用“dense-16”计算得出的初始特征向量，以及“Dense(Sigmoid)-16”计算得出的差异化系数经过点乘之后，就得到了带有差异化系数的目标特征向量，使用层归一化技术计算38个目标特征向量相同维度元素的均值和方差，并进行归一化处理，即可得到均值为“0”，方差为“1”的分布，同时学习到特征内部的“差异”和“相关”信息，作为第一关联信息。Further, after using the initial feature vector calculated by "dense-16" and the differentiation coefficient calculated by "Dense(Sigmoid)-16", after dot multiplication, the target feature with the differentiation coefficient is obtained Vector, use the layer normalization technique to calculate the mean and variance of the elements of the same dimension of the 38 target feature vectors, and perform normalization processing, you can get the distribution with the mean value of "0" and the variance of "1", and learn the feature The internal "difference" and "related" information is used as the first associated information.

在一个实施例中，分析第一关联信息和第二关联信息，得到目标快件在各个预设时效类型下的预测概率值，包括：获取第一关联信息对应的第一特征矩阵，及第二关联信息对应的第二特征矩阵；将第一特征矩阵与第二特征矩阵进行矩阵相乘，得到综合特征；通过携带有sigmoid函数的目标全连接层，对综合特征进行特征分类，得到目标快件在各个预设时效类型下的预测概率值。In one embodiment, analyzing the first association information and the second association information to obtain the predicted probability value of the target shipment under each preset aging type includes: obtaining the first feature matrix corresponding to the first association information, and the second association information The second feature matrix corresponding to the information; matrix multiplication of the first feature matrix and the second feature matrix to obtain the comprehensive feature; through the target fully connected layer carrying the sigmoid function, the feature classification of the comprehensive feature is carried out to obtain the target express in each The predicted probability value under the default aging type.

具体实现中，第一关联信息表示为38个快件特征的内部抽象信息，实际经由时效预测模型输出的是“16*38”的第一特征矩阵；同时，第二关联信息表示为38个快件特征的外部抽象信息，实际经由时效预测模型输出的是“16*1”的第一特征矩阵。在此之后，服务器104可将第一特征矩阵进行转置，以使得矩阵维度为“38*16”，再经过“矩阵乘法”与第二特征矩阵进行矩阵相乘，即可实现各个快件特征的交叉融合。最后，将矩阵相乘得到的综合特征经过带sigmoid函数的全连接层，时效预测模型将输出目标快件在各个预设时效类型下的预测概率值。In the specific implementation, the first associated information is expressed as the internal abstract information of 38 express features, and the actual output through the aging prediction model is the "16*38" first feature matrix; at the same time, the second associated information is expressed as 38 express features The external abstract information of , the actual output through the time-effect prediction model is the first feature matrix of "16*1". After that, the server 104 can transpose the first feature matrix so that the dimension of the matrix is "38*16", and then perform matrix multiplication with the second feature matrix through "matrix multiplication", so that the feature of each express can be realized. cross fusion. Finally, the comprehensive features obtained by multiplying the matrices pass through a fully connected layer with a sigmoid function, and the timeliness prediction model will output the predicted probability values of the target shipment under each preset timeliness type.

可以理解的是，本申请实施例设置的激活函数，具体是用来加入非线性因素的，提高神经网络对模型的表达能力，解决线性模型所不能解决的问题，使用ReLu函数可提高计算速度和收敛速度，使用Sigmoid函数能够把输入的连续实值变换为0和1之间的输出。It can be understood that the activation function set in the embodiment of the present application is specifically used to add nonlinear factors, improve the expressive ability of the neural network to the model, and solve problems that cannot be solved by the linear model. Using the ReLu function can improve the calculation speed and Convergence speed, using the Sigmoid function can transform the continuous real value of the input into an output between 0 and 1.

在一个实施例中，在本步骤之前，还包括：构建初始的时效预测模型；基于预设的初始学习率，对初始的时效预测模型进行初步训练，直至训练次数达到预设第一阈值，得到初步训练后的时效预测模型；基于预设的峰值学习率，对初步训练后的时效预测模型进行水平训练，直至训练次数达到预设第二阈值，得到水平训练后的时效预测模型；按照峰值学习率的三角函数曲线，对水平训练后的时效预测模型进行调试训练，得到已训练的时效预测模型。In one embodiment, before this step, it also includes: constructing an initial time-effect prediction model; based on a preset initial learning rate, performing preliminary training on the initial time-effect prediction model until the number of training times reaches a preset first threshold, obtaining The time-effect prediction model after the preliminary training; based on the preset peak learning rate, the time-effect prediction model after the preliminary training is trained horizontally until the number of training times reaches the preset second threshold, and the time-effect prediction model after the level training is obtained; learn according to the peak value The trigonometric function curve of the rate is used to debug and train the time-effect prediction model after level training to obtain the trained time-effect prediction model.

具体实现中，学习速率在模型训练的不同迭代轮数中通常是保持不变的，例如，学习速率往往设置成0.1、0.01、0.005等。其中，较大的学习速率会使模型极易陷入局部最优解，且损失波动较大；而较小的学习速率会使模型收敛过慢，最终无法完全收敛，依旧无法找到全局最优解。因此，本申请提出一种随迭代次数而自动变化的学习速率，即学习速率自动按照“直线上升—>保持—>曲线下降”规则进行自动调节，可避免较大波动，且在损失稳定时加快学习速率，在后续模型接近全局最优解时减小学习率，稳步逼近最优解，避免学习率过大而越过最优解，致使损失变差。In the specific implementation, the learning rate is usually kept constant in different iterations of model training. For example, the learning rate is often set to 0.1, 0.01, 0.005, etc. Among them, a larger learning rate will make the model easily fall into the local optimal solution, and the loss will fluctuate greatly; while a smaller learning rate will make the model converge too slowly, and eventually it will not be able to converge completely, and the global optimal solution will still not be found. Therefore, this application proposes a learning rate that changes automatically with the number of iterations, that is, the learning rate is automatically adjusted according to the rule of "straight up -> maintain -> curve down", which can avoid large fluctuations and speed up when the loss is stable Learning rate, reduce the learning rate when the subsequent model is close to the global optimal solution, and steadily approach the optimal solution, so as to avoid excessive learning rate and cross the optimal solution, resulting in poor loss.

具体而言，初始学习率可设置为“0.001”，经过2轮线性提升到峰值学习率“0.004”，保持2轮水平训练后，按照三角函数曲线规律逐步下降学习率，下降速度可先慢后快，即可得到已训练的时效预测模型。如此，可以给模型优化的不同阶段配置动态适配学习速率，实现了模型向全局最优解的稳步收敛，避免传统方案中因学习速率在训练过程固定不变而导致模型不收敛的情况。Specifically, the initial learning rate can be set to "0.001", after two rounds of linear increase to the peak learning rate "0.004", after maintaining two rounds of level training, the learning rate will be gradually reduced according to the law of the trigonometric function curve, and the speed of decline can be slow first and then Fast, you can get the trained time-effect prediction model. In this way, dynamic adaptive learning rates can be configured for different stages of model optimization, realizing the steady convergence of the model to the global optimal solution, and avoiding the situation in which the model does not converge due to the fixed learning rate during the training process in traditional solutions.

S204，分析各预测概率值，以确定目标快件的物流时效。S204, analyzing each predicted probability value to determine the logistics timeliness of the target express.

其中，物流时效可以包括物流承诺到件时间，也可称为“预计派送时间”，例如，目标快件“A”的物流时效为“今天到明天12点”，目标快件“B”的物流时效为“今天到明天18点”。Among them, the logistics timeliness can include the delivery time of the logistics promise, which can also be called the "estimated delivery time". For example, the logistics timeliness of the target shipment "A" is "today to 12 o'clock tomorrow", and the logistics timeliness of the target shipment "B" is "Today to tomorrow at 18 o'clock".

具体实现中，服务器104基于上述步骤分析得到目标快件在各个预设时效类型下的预测概率值之后，即可进一步分析各个预测概率值，以确定所述目标快件的物流时效，分析方式可以是比较各个预测概率值的大小。In a specific implementation, after the server 104 analyzes and obtains the predicted probability values of the target express shipment under each preset timeliness type based on the above steps, it can further analyze each predicted probability value to determine the logistics timeliness of the target express piece. The analysis method can be to compare The magnitude of each predicted probability value.

在一个实施例中，本步骤包括：筛选出各预测概率值中的最大值，得到目标预测概率值；确定目标预测概率值对应的时效类型，作为目标快件的物流时效。In one embodiment, this step includes: screening out the maximum value of each predicted probability value to obtain the target predicted probability value; determining the timeliness type corresponding to the target predicted probability value as the logistics timeliness of the target express.

具体实现中，可参阅图3，若时效预测模型中设有带sigmoid函数的13维全连接层，则输出的预测概率值有13个，在这13个预测概率值挑选出最大值，即可作为目标快件的物流时效。但可以理解的是，若时效预测模型中设有带sigmoid函数的N维全连接层，则将输出N个预测概率值，N≥2。For specific implementation, please refer to Figure 3. If there is a 13-dimensional fully connected layer with a sigmoid function in the time-effect prediction model, there are 13 output prediction probability values, and the maximum value is selected from these 13 prediction probability values. As the logistics timeliness of the target shipment. However, it is understandable that if there is an N-dimensional fully connected layer with a sigmoid function in the time-effect prediction model, N prediction probability values will be output, and N≥2.

上述实施例中的快件时效预测方法，服务器通过获取目标快件的快件属性信息，并对快件属性信息进行特征工程处理，可得到目标快件的快件特征，进而将快件特征输入至已训练的时效预测模型，即可输出得到目标快件在各个预设时效类型下的预测概率值，以此分析各预测概率值，可确定目标快件的物流时效。由此，采用本申请提出的时效预测模型对目标快件进行时效预测，可使数据覆盖率大幅上升，且避免多环节相互依赖影响，最终在模型性能最佳的状态下提升快件时效的预测准确率。In the method for predicting the timeliness of a shipment in the above embodiment, the server obtains the shipment attribute information of the target shipment and performs feature engineering processing on the shipment attribute information to obtain the shipment characteristics of the target shipment, and then inputs the shipment characteristics into the trained timeliness prediction model , the predicted probability value of the target express shipment under each preset aging type can be outputted, and each predicted probability value can be analyzed to determine the logistics aging effect of the target express shipment. Therefore, using the timeliness prediction model proposed in this application to predict the timeliness of the target shipment can greatly increase the data coverage, avoid the interdependence of multiple links, and finally improve the prediction accuracy of the timeliness of the express piece under the state of the best model performance .

应该理解的是，虽然图2的流程图中的各个步骤按照箭头的指示依次显示，但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明，这些步骤的执行并没有严格的顺序限制，这些步骤可以以其它的顺序执行。而且，图2中的至少一部分步骤可以包括多个子步骤或者多个阶段，这些子步骤或者阶段并不必然是在同一时刻执行完成，而是可以在不同的时刻执行，这些子步骤或者阶段的执行顺序也不必然是依次进行，而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the various steps in the flow chart of FIG. 2 are displayed sequentially according to the arrows, these steps are not necessarily executed sequentially in the order indicated by the arrows. Unless otherwise specified herein, there is no strict order restriction on the execution of these steps, and these steps can be executed in other orders. Moreover, at least some of the steps in FIG. 2 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed at the same time, but may be executed at different times. The execution of these sub-steps or stages The order is not necessarily performed sequentially, but may be performed alternately or alternately with at least a part of other steps or sub-steps or stages of other steps.

为了更好实施本申请实施例提供的快件时效预测方法，在本申请实施例所提出的快件时效预测方法的基础之上，本申请实施例中还提供了一种快件时效预测装置，如图4所示，该快件时效预测装置400包括：In order to better implement the express delivery aging prediction method provided in the embodiment of the present application, on the basis of the express delivery aging prediction method proposed in the embodiment of the present application, an express delivery aging prediction device is also provided in the embodiment of the application, as shown in Figure 4 As shown, the delivery aging prediction device 400 includes:

信息获取模块410，用于获取目标快件的快件属性信息；An information acquisition module 410, configured to acquire the attribute information of the target express;

特征工程模块420，用于对快件属性信息进行特征工程处理，得到目标快件的快件特征；The feature engineering module 420 is used to perform feature engineering processing on the attribute information of the express to obtain the express features of the target express;

模型分析模块430，用于将快件特征输入至已训练的时效预测模型，输出目标快件在各个预设时效类型下的预测概率值；其中，已训练的时效预测模型是由全连接层、dropout层、融合层以及归一化层构成的；The model analysis module 430 is used to input the characteristics of the express shipment into the trained aging prediction model, and output the predicted probability value of the target shipment under each preset aging type; wherein, the trained aging prediction model is composed of a fully connected layer and a dropout layer. , fusion layer and normalization layer;

时效确定模块440，用于分析各预测概率值，以确定目标快件的物流时效。The timeliness determination module 440 is configured to analyze each predicted probability value to determine the logistics timeliness of the target express.

在一个实施例中，模型分析模块430还用于将快件特征进行特征合并，得到合并后的快件特征；将快件特征输入至已训练的时效预测模型，输出特征内部的第一关联信息；以及将合并后的快件特征输入至已训练的时效预测模型，输出特征之间的第二关联信息；分析第一关联信息和第二关联信息，得到目标快件在各个预设时效类型下的预测概率值。In one embodiment, the model analysis module 430 is also used to combine the features of the express shipment to obtain the combined express features; input the express shipment features to the trained aging prediction model, and output the first associated information inside the feature; and The combined express features are input to the trained aging prediction model, and the second correlation information between the features is output; the first correlation information and the second correlation information are analyzed to obtain the predicted probability value of the target shipment under each preset aging type.

在一个实施例中，模型分析模块430还用于将快件特征输入至已训练的时效预测模型，通过全连接层和dropout层对快件特征进行特征分类，得到初始特征向量和初始特征向量的差异化系数；通过融合层对初始特征向量和差异化系数进行融合，得到携带有差异化系数的目标特征向量；通过归一化层对目标特征向量进行归一化处理，得到特征内部的第一关联信息。In one embodiment, the model analysis module 430 is also used to input the features of the shipment into the trained time-effect prediction model, and classify the features of the shipment through the fully connected layer and the dropout layer to obtain the difference between the initial feature vector and the initial feature vector Coefficient; through the fusion layer, the initial feature vector and the differentiation coefficient are fused to obtain the target feature vector carrying the differentiation coefficient; through the normalization layer, the target feature vector is normalized to obtain the first associated information inside the feature .

在一个实施例中，模型分析模块430还用于获取第一关联信息对应的第一特征矩阵，及第二关联信息对应的第二特征矩阵；将第一特征矩阵与第二特征矩阵进行矩阵相乘，得到综合特征；通过携带有sigmoid函数的目标全连接层，对综合特征进行特征分类，得到目标快件在各个预设时效类型下的预测概率值。In one embodiment, the model analysis module 430 is further configured to obtain a first feature matrix corresponding to the first correlation information, and a second feature matrix corresponding to the second correlation information; perform matrix comparison of the first feature matrix and the second feature matrix Multiplying to get the comprehensive feature; through the target fully connected layer with the sigmoid function, the comprehensive feature is classified, and the predicted probability value of the target express shipment under each preset aging type is obtained.

在一个实施例中，快件时效预测装置400还包括模型训练模块，用于构建初始的时效预测模型；基于预设的初始学习率，对初始的时效预测模型进行初步训练，直至训练次数达到预设第一阈值，得到初步训练后的时效预测模型；基于预设的峰值学习率，对初步训练后的时效预测模型进行水平训练，直至训练次数达到预设第二阈值，得到水平训练后的时效预测模型；按照峰值学习率的三角函数曲线，对水平训练后的时效预测模型进行调试训练，得到已训练的时效预测模型。In one embodiment, the express delivery aging prediction device 400 also includes a model training module for constructing an initial aging prediction model; based on a preset initial learning rate, the initial aging prediction model is initially trained until the number of training times reaches the preset The first threshold is to obtain the time-effect prediction model after preliminary training; based on the preset peak learning rate, the time-effect prediction model after preliminary training is trained horizontally until the number of training times reaches the preset second threshold, and the time-effect prediction after level training is obtained Model; according to the trigonometric function curve of the peak learning rate, the time-effect prediction model after level training is debugged and trained to obtain the trained time-effect prediction model.

在一个实施例中，特征工程模块420还用于提取快件属性信息包括的地址信息、时间信息以及产品信息；对地址信息进行特征工程处理，得到网点信息和距离信息作为快件特征；以及对时间信息进行特征工程处理，得到节日信息和寄件时刻作为快件特征；以及对产品信息进行特征工程处理，得到产品类型信息作为快件特征。In one embodiment, the feature engineering module 420 is also used to extract address information, time information and product information included in the attribute information of the express shipment; carry out feature engineering processing on the address information, and obtain network point information and distance information as express mail features; and analyze the time information Perform feature engineering processing to obtain holiday information and delivery time as express features; and perform feature engineering processing on product information to obtain product type information as express features.

在一个实施例中，时效确定模块440还用于筛选出各预测概率值中的最大值，得到目标预测概率值；确定目标预测概率值对应的时效类型，作为目标快件的物流时效。In one embodiment, the timeliness determination module 440 is also used to filter out the maximum value among the predicted probability values to obtain the target predicted probability value; determine the timeliness type corresponding to the target predicted probability value as the logistics timeliness of the target express.

上述实施例中，采用本申请提出的时效预测模型对目标快件进行时效预测，可使数据覆盖率大幅上升，且避免多环节相互依赖影响，最终在模型性能最佳的状态下提升快件时效的预测准确率。In the above-mentioned embodiment, using the timeliness prediction model proposed by this application to predict the timeliness of the target shipment can greatly increase the data coverage, avoid the interdependence of multiple links, and finally improve the timeliness prediction of the express piece under the state of the best model performance Accuracy.

需要说明的是，关于快件时效预测装置的具体限定可以参见上文中对于快件时效预测方法的限定，在此不再赘述。上述快件时效预测装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于电子设备中的处理器中，也可以以软件形式存储于电子设备中的存储器中，以便于处理器调用执行以上各个模块对应的操作。It should be noted that, for the specific limitations of the express mail aging prediction device, please refer to the above-mentioned limitation on the express mail aging prediction method, which will not be repeated here. Each module in the above-mentioned delivery aging prediction device can be fully or partially realized by software, hardware and a combination thereof. The above-mentioned modules can be embedded in or independent of the processor in the electronic device in the form of hardware, and can also be stored in the memory of the electronic device in the form of software, so that the processor can invoke and execute the corresponding operations of the above-mentioned modules.

在本申请一些实施例中，快件时效预测装置400可以实现为一种计算机程序的形式，计算机程序可在如图5所示的计算机设备上运行。计算机设备的存储器中可存储组成该快件时效预测装置400的各个程序模块，比如，图4所示的信息获取模块410、特征工程模块420、模型分析模块430以及时效确定模块440；各个程序模块构成的计算机程序使得处理器执行本说明书中描述的本申请各个实施例的快件时效预测方法中的步骤。例如，图5所示的计算机设备可以通过如图4所示的快件时效预测装置400中的信息获取模块410执行步骤S201。计算机设备可通过特征工程模块420执行步骤S202。计算机设备可通过模型分析模块430执行步骤S203。计算机设备可通过时效确定模块440执行步骤S204。其中，该计算机设备包括通过系统总线连接的处理器、存储器和网络接口。其中，该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质和内存储器。该非易失性存储介质存储有操作系统和计算机程序。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该计算机设备的网络接口用于与外部的计算机设备通过网络连接通信。该计算机程序被处理器执行时以实现一种快件时效预测方法。In some embodiments of the present application, the delivery time prediction apparatus 400 may be implemented in the form of a computer program, and the computer program may be run on a computer device as shown in FIG. 5 . Can be stored in the memory of computer equipment and form each program module of this express delivery aging prediction device 400, such as, the information acquisition module 410 shown in Figure 4, feature engineering module 420, model analysis module 430 and aging determination module 440; The computer program enables the processor to execute the steps in the method for predicting the delivery time of each embodiment of the application described in this specification. For example, the computer equipment shown in FIG. 5 may execute step S201 through the information acquisition module 410 in the delivery time prediction apparatus 400 shown in FIG. 4 . The computer device can execute step S202 through the feature engineering module 420 . The computer device can execute step S203 through the model analysis module 430 . The computer device may execute step S204 through the aging determination module 440 . Wherein, the computer device includes a processor, a memory and a network interface connected through a system bus. Wherein, the processor of the computer device is used to provide calculation and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and computer programs. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium. The network interface of the computer device is used to communicate with external computer devices via a network connection. When the computer program is executed by a processor, a method for predicting the time limit of express shipments is realized.

本领域技术人员可以理解，图5中示出的结构，仅仅是与本申请方案相关的部分结构的框图，并不构成对本申请方案所应用于其上的计算机设备的限定，具体的计算机设备可以包括比图中所示更多或更少的部件，或者组合某些部件，或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in Figure 5 is only a block diagram of a part of the structure related to the solution of this application, and does not constitute a limitation to the computer equipment on which the solution of this application is applied. The specific computer equipment can be More or fewer components than shown in the figures may be included, or some components may be combined, or have a different arrangement of components.

在本申请一些实施例中，提供了一种计算机设备，包括一个或多个处理器；存储器；以及一个或多个应用程序，其中的一个或多个应用程序被存储于存储器中，并配置为由处理器执行上述快件时效预测方法的步骤。此处快件时效预测方法的步骤可以是上述各个实施例的快件时效预测方法中的步骤。In some embodiments of the present application, a computer device is provided, including one or more processors; memory; and one or more application programs, wherein one or more application programs are stored in the memory and configured as The processor executes the steps of the method for predicting the timeliness of express shipments above. Here, the steps in the express delivery aging prediction method may be the steps in the express delivery aging prediction methods in the above-mentioned various embodiments.

在本申请一些实施例中，提供了一种计算机可读存储介质，存储有计算机程序，计算机程序被处理器进行加载，使得处理器执行上述快件时效预测方法的步骤。此处快件时效预测方法的步骤可以是上述各个实施例的快件时效预测方法中的步骤。In some embodiments of the present application, a computer-readable storage medium is provided, which stores a computer program, and the computer program is loaded by a processor, so that the processor executes the steps of the above-mentioned method for express delivery timeliness prediction. Here, the steps in the express delivery aging prediction method may be the steps in the express delivery aging prediction methods in the above-mentioned various embodiments.

本邻域普通技术人员可以理解实现上述实施例方法中的全部或部分流程，是可以通过计算机程序来指令相关的硬件来完成，所述的计算机程序可存储于非易失性计算机可读取存储介质中，该计算机程序在执行时，可包括如上述各方法的实施例的流程。本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用，均可包括非易失性和易失性存储器中的至少一种。非易失性存储器可包括只读存储器(Read-OnlyMemory，ROM)、磁带、软盘、闪存或光存储器等。易失性存储器可包括随机存取存储器(Random Access Memory，RAM)或外部高速缓冲存储器。作为说明而非局限，RAM可以是多种形式，如静态随机存取存储器(Static Random Access Memory，SRAM)或动态随机存取存储器(Dynamic Random Access Memory，DRAM)等。Those of ordinary skill in the art can understand that the realization of all or part of the processes in the methods of the above embodiments can be completed by instructing related hardware through computer programs, and the computer programs can be stored in non-volatile computer-readable storage In the medium, when the computer program is executed, it may include the processes of the embodiments of the above-mentioned methods. Any reference to memory, storage, database or other media used in the various embodiments provided in this application may include at least one of non-volatile and volatile memory. The non-volatile memory may include read-only memory (Read-Only Memory, ROM), magnetic tape, floppy disk, flash memory or optical memory, and the like. Volatile memory may include random access memory (Random Access Memory, RAM) or external cache memory. As an illustration and not a limitation, the RAM can be in various forms, such as Static Random Access Memory (SRAM) or Dynamic Random Access Memory (Dynamic Random Access Memory, DRAM).

以上实施例的各技术特征可以进行任意的组合，为使描述简洁，未对上述实施例中的各个技术特征所有可能的组合都进行描述，然而，只要这些技术特征的组合不存在矛盾，都应当认为是本说明书记载的范围。The technical features of the above embodiments can be combined arbitrarily. To make the description concise, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, they should be It is considered to be within the range described in this specification.

以上对本申请实施例提供的一种快件时效预测方法、装置、计算机设备及存储介质进行了详细介绍，本文中应用了具体个例对本发明的原理及实施方式进行了阐述，以上实施例的说明只是用于帮助理解本发明的方法及其核心思想；同时，对于本领域的技术人员，依据本发明的思想，在具体实施方式及应用范围上均会有改变之处，综上所述，本说明书内容不应理解为对本发明的限制。A method, device, computer equipment, and storage medium for predicting the timeliness of express shipments provided by the embodiments of the present application have been described above in detail. In this paper, specific examples have been used to illustrate the principles and implementation methods of the present invention. The descriptions of the above embodiments are only It is used to help understand the method and its core idea of the present invention; at the same time, for those skilled in the art, according to the idea of the present invention, there will be changes in the specific implementation and scope of application. In summary, this specification The content should not be construed as a limitation of the invention.

Claims

1. A method for express delivery timeliness prediction, comprising:

Obtain the shipment attribute information of the target shipment;

performing feature engineering processing on the attribute information of the express to obtain the express characteristics of the target express;

Input the features of the express shipment into the trained aging prediction model, and output the predicted probability value of the target shipment under each preset aging type; wherein, the trained aging prediction model is composed of a fully connected layer, a dropout layer, Composed of fusion layer and normalization layer;

Each of the predicted probability values is analyzed to determine the logistics timeliness of the target express.

2. The method according to claim 1, wherein the input of the features of the express shipment into the trained aging prediction model, and outputting the predicted probability values of the target express shipment under each preset aging type, include:

Combining the features of the express mail to obtain the merged express mail features;

Inputting the features of the shipment into the trained aging prediction model, and outputting the first associated information inside the features; and

Inputting the combined express features into the trained aging prediction model, and outputting the second correlation information between the features;

The first association information and the second association information are analyzed to obtain the predicted probability values of the target express shipment under each preset aging type.

3. The method according to claim 2, wherein the input of the express features into the trained aging prediction model, and the output of the first associated information inside the features include:

The express feature is input to the trained time-effect prediction model, and the express feature is classified through the fully connected layer and the dropout layer to obtain an initial feature vector and a differentiation coefficient of the initial feature vector;

Fusing the initial feature vector and the differentiation coefficient through the fusion layer to obtain a target feature vector carrying the differentiation coefficient;

The normalization process is performed on the target feature vector through the normalization layer to obtain the first associated information inside the feature.

4. The method according to claim 2, wherein the analysis of the first associated information and the second associated information to obtain the predicted probability value of the target express shipment under each preset aging type includes: :

Obtaining a first feature matrix corresponding to the first associated information, and a second feature matrix corresponding to the second associated information;

performing matrix multiplication of the first feature matrix and the second feature matrix to obtain comprehensive features;

Through the target fully-connected layer carrying the sigmoid function, the feature classification is performed on the comprehensive feature, and the predicted probability value of the target express shipment under each preset aging type is obtained.

5. The method according to any one of claims 1 to 4, further comprising:

Build an initial time-efficiency prediction model;

Preliminary training is performed on the initial time-effect prediction model based on a preset initial learning rate until the number of training times reaches a preset first threshold to obtain a time-effect prediction model after preliminary training;

Based on the preset peak learning rate, perform horizontal training on the time-effect prediction model after the preliminary training, until the number of training times reaches the preset second threshold, and obtain the time-effect prediction model after the level training;

According to the trigonometric function curve of the peak learning rate, the time-effect prediction model after the level training is debugged and trained to obtain the trained time-effect prediction model.

6. The method according to claim 1, wherein said carrying out feature engineering processing to said express piece attribute information to obtain the express piece feature of said target express piece includes:

Extracting the address information, time information and product information included in the attribute information of the express shipment;

performing feature engineering processing on the address information to obtain outlet information and distance information as the express features; and

Perform feature engineering processing on the time information to obtain holiday information and delivery time as the express features; and

Perform feature engineering processing on the product information to obtain product type information as the express feature.

7. The method according to claim 1, wherein the analysis of each of the predicted probability values to determine the logistics timeliness of the target express mail comprises:

Screening out the maximum value of each of the predicted probability values to obtain the target predicted probability value;

The timeliness type corresponding to the target predicted probability value is determined as the logistics timeliness of the target express.

8. A device for predicting the aging of express mail, characterized in that it comprises:

An information acquisition module, configured to acquire the attribute information of the target express;

A feature engineering module, configured to perform feature engineering processing on the attribute information of the shipment to obtain the shipment features of the target shipment;

The model analysis module is used to input the characteristics of the express shipment into the trained aging prediction model, and output the predicted probability value of the target express shipment under each preset aging type; wherein, the trained aging prediction model is composed of all Consists of connection layer, dropout layer, fusion layer and normalization layer;

The timeliness determination module is used to analyze each of the predicted probability values to determine the logistics timeliness of the target express.

9. A computer device, characterized in that the computer device comprises:

one or more processors;

memory; and one or more application programs, wherein the one or more application programs are stored in the memory and configured to be executed by the processor to implement any one of claims 1 to 7 Express delivery time prediction method.

10. A computer-readable storage medium, characterized in that a computer program is stored thereon, and the computer program is loaded by a processor to execute the method for express delivery aging prediction according to any one of claims 1 to 7. step.