CN109035761A

CN109035761A - Travel time estimation method based on auxiliary supervised learning

Info

Publication number: CN109035761A
Application number: CN201810658375.5A
Authority: CN
Inventors: 孙未未; 章瀚元; 吴昊
Original assignee: Fudan University
Current assignee: Fudan University
Priority date: 2018-06-25
Filing date: 2018-06-25
Publication date: 2018-12-18
Anticipated expiration: 2038-06-25
Also published as: CN109035761B

Abstract

The invention belongs to the technical field of intelligent transportation and specifically relates to a travel time estimation method based on auxiliary supervised learning. The method looks for statistical regularities in massive historical trajectory data and estimates the time of an entire trip with an end-to-end deep learning model. Its steps include: a feature extraction and representation stage, in which the trajectory data are preprocessed and their temporal and spatial features, driving state features, and short-term and long-term traffic condition features are extracted; and a training and prediction stage, in which the extracted features are trained on and predicted with a unified bidirectional recurrent neural network. At each step the recurrent neural network outputs the time cost of passing through the current small area, and the sum of the time costs of these small areas is the time cost of the whole path. A bidirectional interval loss function is also introduced to constrain the intermediate time costs. The method can estimate vehicle travel time in a city efficiently and accurately and performs well in a real environment.

Description

Travel Time Estimation Method Based on Auxiliary Supervised Learning

Technical field

The invention belongs to the technical field of intelligent transportation and specifically relates to a travel time estimation method based on auxiliary supervised learning.

Background

Travel time estimation is an essential technology in urban transportation: it helps people plan trips and commutes and supports government planning decisions. The problem is not simple, because travel time is affected by many dynamic factors, such as traffic dynamics, intersection conditions, changes in driver behavior, and periodic patterns in historical data. These factors make travel time estimation uncertain and difficult. With the spread of GPS-enabled mobile devices, large volumes of trajectory data are generated continuously and cover every corner of a city. With such massive historical trajectory data, the regularities behind the data can be mined, and an algorithmic model can learn the periodicity and trend of travel time changes, so that the time cost of a queried trajectory can be inferred more accurately.

Most existing methods take a divide-and-conquer approach and fall into two categories, which decompose a path into a series of either road segments or sub-paths.

(1) Methods based on single road segments:

These methods estimate the average speed of trajectories on each individual road segment, compute the average time cost of the segment from its length, and finally sum the times of all segments to obtain the total time. However, they do not account for the time spent at intersections between road segments. In addition, the estimate relies heavily on high-quality speed data, which trajectory data often cannot provide.

(2) Methods based on sub-paths:

These methods split a path into a series of sub-paths, so that the time cost of intersections is also taken into account. The main idea is to mine and concatenate the abundant common sub-paths found in historical data. Although this approach overcomes many shortcomings of the single-segment approach, it is still based on heuristic design rather than optimizing travel time directly as the objective.

In summary, existing methods fail to reach satisfactory accuracy for two reasons. On the one hand, they do not treat the path as a whole but split it into sub-blocks, and much useful information is lost in the split. Moreover, they do not make full use of the intermediate supervision labels that are specific to trajectory data, namely the timestamp of each intermediate GPS sampling point. On the other hand, with the development of deep learning, more problems can be solved in an integrated end-to-end manner, which is more efficient than traditional heuristic models. Deep learning also has strong representational power: compared with hand-crafted models it can capture more latent features and handle the complex dynamics of the travel time estimation problem.

Summary of the invention

The purpose of the present invention is to address the limitations of the two traditional categories of travel time estimation techniques by proposing a travel time estimation method based on historical trajectories and auxiliary supervised learning, thereby overcoming the deficiencies of the prior art.

The method searches for statistical regularities in massive historical trajectory data and estimates the time of an entire trip as a whole with an end-to-end deep learning model. The basic steps are: a feature extraction and representation stage, in which the trajectory data are preprocessed and features of several kinds are extracted; and a training and prediction stage, in which the extracted features are trained on and predicted with a unified bidirectional recurrent neural network. At each step the recurrent neural network outputs the time cost of passing through the current small area, and the sum of these time costs is the time cost of the whole path. To make training more effective, a bidirectional interval loss function is also introduced to constrain the intermediate time costs.

The proposed travel time estimation method is divided into the following three stages:

(I) Feature extraction and representation stage: preprocess the historical trajectory data and extract its features (including temporal and spatial features, driving state features, and short-term and long-term traffic condition features). The specific steps are:

Step (1): Within the city, divide the area into a fine-grained grid according to latitude and longitude, forming adjacent small rectangular areas. Map each coordinate point of a trajectory, which is a chronologically ordered sequence of GPS coordinates, to its corresponding small area, forming a sequence of grid coordinates. When adjacent trajectory points are far apart and fall into non-contiguous small areas, the intermediate path can be obtained with algorithms such as map matching to fill in the missing areas.
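The grid mapping of step (1) can be sketched as follows. This is a minimal illustration, assuming square cells measured in degrees and an origin at the south-west corner of the city; the names `to_grid`, `trajectory_to_grid_sequence`, `lat_min`, `lon_min`, and `cell_deg` are hypothetical, and collapsing consecutive points that fall into the same cell is an assumption. Filling gaps between non-contiguous cells by map matching is not shown.

```python
from typing import List, Sequence, Tuple

Cell = Tuple[int, int]

def to_grid(lat: float, lon: float,
            lat_min: float, lon_min: float, cell_deg: float) -> Cell:
    """Return (row, col) of the rectangular cell that contains a GPS point."""
    row = int((lat - lat_min) // cell_deg)
    col = int((lon - lon_min) // cell_deg)
    return row, col

def trajectory_to_grid_sequence(points: Sequence[Tuple[float, float]],
                                lat_min: float, lon_min: float,
                                cell_deg: float) -> List[Cell]:
    """Map time-ordered (lat, lon) points to a sequence of grid cells.

    Assumption: consecutive points in the same cell are collapsed into one entry.
    """
    seq: List[Cell] = []
    for lat, lon in points:
        cell = to_grid(lat, lon, lat_min, lon_min, cell_deg)
        if not seq or seq[-1] != cell:
            seq.append(cell)
    return seq
```

A trajectory with several samples inside one cell therefore contributes that cell once, which keeps the sequence length equal to the number of small areas passed.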

Step (2): For each grid, mine features of different kinds. First, embedding vectors are used to mine latent semantic information. Embedding vectors are widely used in fields such as natural language processing and social networks; they represent the semantic information of each word or entity with a low-dimensional real-valued vector and measure the relationship between entities by their distance in the vector space. The present invention uses embedding vectors to represent the semantic information of each small grid area in different places and time periods. This information includes the spatial location information of the city's functional areas (for example residential, commercial, or industrial areas) as well as temporal information such as the morning peak and weekends. Specifically, a low-dimensional vector V_sp represents the space of each grid; a day is divided into multiple time buckets (for example, one bucket per hour), and each trajectory obtains its time vector V_tp from the bucket it falls into. V_sp and V_tp are randomly initialized and then trained together with the model.
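The embedding lookup of step (2) might look like the sketch below. The table sizes (a 5×6 grid, 24 hourly buckets, 4 dimensions), the initialization range, and the names `init_embedding`, `time_bucket`, and `lookup` are illustrative assumptions; the gradient updates that train the tables together with the model are not shown.

```python
import random
from typing import List, Tuple

def init_embedding(num_entries: int, dim: int, seed: int = 0) -> List[List[float]]:
    """Randomly initialised table with one dim-dimensional vector per entry."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)]
            for _ in range(num_entries)]

def time_bucket(seconds_since_midnight: float, bucket_seconds: int = 3600) -> int:
    """Index of the time bucket a time of day falls into (hourly by default)."""
    return int(seconds_since_midnight // bucket_seconds)

# Hypothetical sizes, chosen only for illustration.
N_ROWS, N_COLS, DIM = 5, 6, 4
spatial_table = init_embedding(N_ROWS * N_COLS, DIM, seed=1)  # rows play the role of V_sp
temporal_table = init_embedding(24, DIM, seed=2)              # rows play the role of V_tp

def lookup(cell: Tuple[int, int], seconds_since_midnight: float):
    """Return (V_sp, V_tp) for a grid cell and a time of day."""
    row, col = cell
    v_sp = spatial_table[row * N_COLS + col]
    v_tp = temporal_table[time_bucket(seconds_since_midnight)]
    return v_sp, v_tp
```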

Step (3): A driver's speed and driving behavior change with the driving state. For example, in the middle part of a route the vehicle tends to travel on main roads or elevated roads, where speed is higher; just after departure or near the destination it tends to travel on small roads or in crowded areas, where speed is lower. Specifically, a four-dimensional vector V_dri indicates whether the current driving stage is the departure stage, the midway stage, or the end stage, together with the proportion of the trip already traveled. For example, V_dri = (1, 0, 0, 0.2) means that the driver is in the departure stage, at 20% of the total trip.
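A possible encoding of V_dri is sketched below. The text does not say where one stage ends and the next begins, so the thresholds `start_frac` and `end_frac` are purely hypothetical, as is the function name `driving_state`.

```python
from typing import Tuple

def driving_state(ratio: float,
                  start_frac: float = 0.25,
                  end_frac: float = 0.75) -> Tuple[int, int, int, float]:
    """Encode the driving stage as a one-hot triple plus the travelled proportion.

    ratio is the fraction of the trip already travelled, in [0, 1].
    start_frac and end_frac are assumed stage boundaries, not values from the text.
    """
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must lie in [0, 1]")
    if ratio < start_frac:
        stage = (1, 0, 0)   # departure stage
    elif ratio < end_frac:
        stage = (0, 1, 0)   # midway stage
    else:
        stage = (0, 0, 1)   # end stage
    return stage + (ratio,)
```

With these assumed boundaries, `driving_state(0.2)` gives `(1, 0, 0, 0.2)`, matching the example in the text.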

Step (4): Traffic conditions in an area tend to change periodically and regularly over time. For example, if a road segment is congested from 8:00 to 8:30, it is likely to be congested at 8:35 as well. In other words, traffic conditions in the recent past help predict the current traffic state. This short-term traffic condition feature is defined as V_short. Long-term periodic changes, such as the different patterns of weekdays and weekends, also help predict current traffic conditions. This long-term traffic condition feature is defined as V_long. Specifically,

Definition: the traffic condition of the current small area g_i in the j-th past time interval is described by the triple

$(v_j,\ n_j,\ len_i / v_j)$ (1)

where v_j is the historical average speed, n_j is the number of historical trajectories, and len_i/v_j is a rough estimate of the passing time. Feeding these traffic condition features into a sub-recurrent neural network in historical time order extracts the traffic condition feature.
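The three quantities described above can be computed per cell and per past time interval as in the sketch below. The function name `traffic_feature`, the units (m/s and metres), and the all-zero encoding of an interval without data are assumptions.

```python
from typing import Sequence, Tuple

def traffic_feature(speeds: Sequence[float],
                    cell_length: float) -> Tuple[float, int, float]:
    """Return (average speed, trajectory count, rough passing time) for one interval.

    speeds: speeds in m/s of the historical trajectories seen in the cell.
    cell_length: length len_i of the cell in metres.
    """
    n = len(speeds)
    if n == 0:
        return (0.0, 0, 0.0)      # assumption: no data is encoded as zeros
    v = sum(speeds) / n
    if v <= 0.0:
        return (0.0, n, 0.0)      # assumption: avoid dividing by a zero speed
    return (v, n, cell_length / v)
```

For eight trajectories at 10 m/s in a 200 m cell this yields an average speed of 10 m/s, a count of 8, and a rough passing time of 20 s. One such triple per past interval, in time order, would form the input sequence of the sub-recurrent network.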

In addition, historical data are unevenly distributed over space, and some areas are crossed by few trajectories, which may hurt estimation accuracy. To address this data sparsity, the traffic conditions of adjacent small areas are also taken into account, that is,

Definition (2): the set of grids g_j whose distance from g_i, measured on the grid coordinates x and y, does not exceed d. The past short-term traffic condition features of these grids are collected and fed into the neural network together. Here x and y denote the coordinates of a grid, and g_j denotes a grid other than g_i.
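A sketch of collecting the neighbouring grids follows. The distance measure is not recoverable from the text, so this example assumes the Chebyshev distance on grid coordinates (a square window of half-width d); the name `neighbours` and the grid bounds `n_rows` and `n_cols` are hypothetical.

```python
from typing import List, Tuple

def neighbours(cell: Tuple[int, int], d: int,
               n_rows: int, n_cols: int) -> List[Tuple[int, int]]:
    """Grids within distance d of cell, excluding the cell itself.

    Assumption: distance is max(|dx|, |dy|) on grid coordinates.
    """
    x, y = cell
    result = []
    for nx in range(max(0, x - d), min(n_rows - 1, x + d) + 1):
        for ny in range(max(0, y - d), min(n_cols - 1, y + d) + 1):
            if (nx, ny) != cell:
                result.append((nx, ny))
    return result
```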

(II) Training stage: feed the features extracted from historical trajectory data into a unified bidirectional recurrent neural network (bidirectional LSTM; reference: Graves A, Schmidhuber J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures[J]. Neural Networks, 2005, 18(5-6): 602-610.) for training, with the bidirectional interval loss function as the training constraint. The specific steps are:

Step (1): Construct the recurrent neural network. Let h denote the hidden layer of the network and x the input data. If the input at step t is x_t and the result computed at step t is h_t, then:

$h_t = \phi(x_t \cdot W_x + h_{t-1} \cdot W_h + b)$ (3)

where W_x is the weight matrix of the input data, W_h is the weight matrix of the hidden layer, and b is the bias. φ is a nonlinear activation function, such as the sigmoid, ReLU, or tanh function.

That is, the hidden state can be expressed as a function:

$h_t = f(h_{t-1}, x_t)$ (4)

On this basis, the forget gate is defined as:

$f_t = \sigma(W_f \cdot [h_{t-1}, x_t] + b_f)$ (5)

The input gate is:

$i_t = \sigma(W_i \cdot [h_{t-1}, x_t] + b_i)$ (6)

The output gate is:

$o_t = \sigma(W_o \cdot [h_{t-1}, x_t] + b_o)$ (7)

The memory unit is updated as:

$\tilde{C}_t = \tanh(W_C \cdot [h_{t-1}, x_t] + b_C)$ (8)

$C_t = f_t \cdot C_{t-1} + i_t \cdot \tilde{C}_t$ (9)

The hidden layer is updated as:

$h_t = o_t \cdot \tanh(C_t)$ (10)

where W_f, W_i, W_o, and W_C are the weight matrices of the forget gate, input gate, output gate, and memory unit, and b_f, b_i, b_o, and b_C are the corresponding biases. σ() is a nonlinear activation function such as the sigmoid function, tanh() is the hyperbolic tangent function, and f() denotes an abstract neural network function containing the parameters of each layer. The parameters of the recurrent neural network are denoted W_N. Each weight parameter of the recurrent neural network is initialized from the uniform distribution on [-α, α], where α is a hyperparameter in the range 0.01 to 1.
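One step of the recurrent unit can be sketched in plain Python as below. The gates follow equations (5) to (7) and the hidden update follows equation (10); the memory-unit update is assumed to take the standard LSTM form. The parameter dictionary keys and the helper names `affine` and `lstm_step` are hypothetical, and products between gates and states are element-wise.

```python
import math
from typing import Dict, List, Tuple

def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))

def affine(W: List[List[float]], v: List[float], b: List[float]) -> List[float]:
    """Return W.v + b for a matrix W given as a list of rows."""
    return [sum(w * x for w, x in zip(row, v)) + bias for row, bias in zip(W, b)]

def lstm_step(x_t: List[float], h_prev: List[float], c_prev: List[float],
              p: Dict[str, list]) -> Tuple[List[float], List[float]]:
    """Compute (h_t, C_t) from the input x_t and the previous state."""
    z = h_prev + x_t                                           # [h_{t-1}, x_t]
    f = [sigmoid(a) for a in affine(p["W_f"], z, p["b_f"])]    # forget gate (5)
    i = [sigmoid(a) for a in affine(p["W_i"], z, p["b_i"])]    # input gate (6)
    o = [sigmoid(a) for a in affine(p["W_o"], z, p["b_o"])]    # output gate (7)
    # Assumed standard LSTM candidate and memory update.
    c_tilde = [math.tanh(a) for a in affine(p["W_c"], z, p["b_c"])]
    c = [ft * cp + it * ct for ft, cp, it, ct in zip(f, c_prev, i, c_tilde)]
    h = [ot * math.tanh(ct) for ot, ct in zip(o, c)]           # hidden update (10)
    return h, c
```

Each weight matrix has one row per hidden unit and one column per entry of the concatenated vector [h_{t-1}, x_t].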

The bidirectional recurrent neural network computes with a forward and a backward recurrent neural network at the same time. The forward network takes the grid features extracted in the previous steps in sequence order, while the backward network takes them in reverse order. The advantage is that the network can observe the position of the current grid relative to both the origin and the destination, and thus obtains a global view of the trip. Its hidden variable is defined as the concatenation of the forward and backward networks, $h_t = [\overrightarrow{h}_t, \overleftarrow{h}_t]$, where $\overrightarrow{h}_t$ is the hidden layer of the forward recurrent neural network and $\overleftarrow{h}_t$ is the hidden layer of the backward recurrent neural network.
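The forward and backward passes and the concatenation of their hidden states can be illustrated with a generic wrapper. The step functions are passed in as arguments, so any recurrent cell could be used; the name `run_bidirectional` and the simplification that the state is just the hidden vector are assumptions made for brevity.

```python
from typing import Callable, List, Sequence

Step = Callable[[float, List[float]], List[float]]

def run_bidirectional(xs: Sequence[float], step_fwd: Step, step_bwd: Step,
                      state0: List[float]) -> List[List[float]]:
    """Run a forward and a backward pass and concatenate the states per position."""
    fwd, state = [], state0
    for x in xs:                      # sequence order
        state = step_fwd(x, state)
        fwd.append(state)
    bwd, state = [], state0
    for x in reversed(xs):            # reverse order
        state = step_bwd(x, state)
        bwd.append(state)
    bwd.reverse()                     # align backward states with positions
    return [f + b for f, b in zip(fwd, bwd)]
```

With a toy step that accumulates its inputs, position i receives the sum of everything up to i from the forward pass and the sum of everything from i onward from the backward pass, which mirrors how each grid sees both the origin side and the destination side of the trip.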

Step (2): Concatenate the features extracted from the historical trajectory data, namely the spatial feature V_sp, the temporal feature V_tp, the driving state feature V_dri, and the historical short-term and long-term traffic state features V_short and V_long, into one unified feature vector:

$V = [V_{sp}, V_{tp}, V_{dri}, V_{short}, V_{long}]$

For each small grid passed, the feature vector is fed into the bidirectional recurrent neural network to obtain the time of passing through that grid, W^T·h_i + b. The time cost of the whole trip is:

$\sum_{i=1}^{n} (W^T \cdot h_i + b)$

where W and b are the weight matrix and bias for computing the time cost, n is the number of grids passed, and W^T denotes the transpose of W.
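The per-grid time W^T·h_i + b and its sum over the trip can be written as a short sketch. W is treated as a plain vector of the same length as each hidden vector; the function names and the numbers in the usage note are made up for illustration.

```python
from typing import List, Sequence

def per_cell_time(W: Sequence[float], h: Sequence[float], b: float) -> float:
    """Time cost of one grid: W^T . h + b."""
    return sum(w * x for w, x in zip(W, h)) + b

def total_time(W: Sequence[float], hs: List[Sequence[float]], b: float) -> float:
    """Time cost of the whole trip: the sum of the per-grid time costs."""
    return sum(per_cell_time(W, h, b) for h in hs)
```

For instance, with W = (0.5, 2.0), b = 1.0 and hidden vectors (2.0, 1.0) and (4.0, 0.0), the per-grid times are 4.0 and 3.0 and the trip time is 7.0.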

Step (3): Let T denote the vector of true time costs of the trajectory over the grid sequence. The true time cost vector in forward order is T^f, and the true time cost vector in reverse order is T^b. The time cost vectors estimated by the neural network are:

[formula not reproduced]

The bidirectional interval loss function is used for auxiliary supervised learning, so that the model learns not only the time cost of the whole path but also the transit time of each intermediate stage. The bidirectional interval loss function is defined as:

[formula not reproduced]

where M is a mask indicating whether the trajectory passes through a small area, and [] denotes an element-wise operation on vectors.
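To make the idea of supervising intermediate times concrete, the sketch below shows one plausible form of a bidirectional interval loss. It is not the formula of the patent: it assumes that the estimated vectors are cumulative sums of the per-grid times in forward and reverse order, that the error is squared and masked by M, and that the result is averaged over the supervised entries. All names are hypothetical.

```python
from typing import List, Sequence

def cumulative(values: Sequence[float]) -> List[float]:
    """Running sums of a sequence."""
    out, total = [], 0.0
    for v in values:
        total += v
        out.append(total)
    return out

def bidirectional_interval_loss(step_times: Sequence[float],
                                T_f: Sequence[float], T_b: Sequence[float],
                                mask: Sequence[int]) -> float:
    """Masked mean squared error on forward and backward cumulative times.

    step_times: per-grid time estimates, one per grid in travel order.
    T_f, T_b: true cumulative times in forward and reverse order.
    mask: 1 where a ground-truth timestamp is available, else 0.
    """
    pred_f = cumulative(step_times)
    pred_b = cumulative(list(reversed(step_times)))[::-1]
    num = sum(m * ((pf - tf) ** 2 + (pb - tb) ** 2)
              for m, pf, tf, pb, tb in zip(mask, pred_f, T_f, pred_b, T_b))
    den = 2 * sum(mask)
    return num / den if den else 0.0
```

Under these assumptions, every intermediate timestamp contributes two error terms, one measured from the origin and one measured from the destination, in addition to the error on the total trip time.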

Step (4): The training objective is to minimize the loss function L, that is:

[formula not reproduced]

where θ denotes the trainable parameters of the model, ε denotes the temporal and spatial embedding vectors, and S is the size of the training set. Finally, the parameters of the model are updated and optimized with backpropagation through time. Reference for the backpropagation algorithm: Chauvin Y, Rumelhart D E. Backpropagation: theory, architectures, and applications[M]. Psychology Press, 2013.

(III) Prediction stage: use the bidirectional recurrent neural network to run inference on the features extracted from the query path and estimate the travel time. The specific steps are:

Step (1): Given a real trip without timestamps as the query path, obtain its grid sequence from the actual path traveled. For each small grid passed, use the spatio-temporal features V_sp and V_tp, the driving state feature V_dri, and the historical short-term and long-term traffic state features V_short and V_long obtained in the feature extraction and representation stage as the overall feature representation V of that grid. The spatio-temporal embedding vectors take the values updated during training, and the short-term and long-term traffic state features are extracted with the trained sub-recurrent neural network.

Step (2): For each grid passed, feed the extracted features into the trained bidirectional recurrent neural network to obtain the current hidden variable h_t. The estimated time of passing through the current area is then W^T·h_t + b, and the estimated total time cost is:

$\sum_{t=1}^{n} (W^T \cdot h_t + b)$

where n is the total number of grids passed, W and b are the trained weight matrix and bias for computing the time cost, and W^T denotes the transpose of W.

In general, the method has the following advantages. First, it uses an end-to-end deep learning approach trained on historical data to learn the features of the whole path directly and estimate the overall travel time. A bidirectional interval loss function is defined which, in addition to supervising the overall path time, provides auxiliary supervision of the time cost of intermediate sections. Introducing auxiliary supervision both enriches the sample information of a path and makes the signal propagated by backpropagation during parameter updates more accurate. Second, a feature extraction structure is proposed that extracts dynamic features of different kinds, including spatio-temporal embedding vectors, driving state, and short-term and long-term traffic conditions, so that the travel time of a path can be estimated effectively. Finally, experiments in a real environment show better results than existing methods.

As shown in Table 1, experiments were carried out on real historical trajectory data from two cities, Porto and Shanghai. The method is compared with existing methods: the segment average time method, the sub-path dynamic programming method, the grid fully connected network method, and the grid convolutional network method. The segment average time method computes the average passing time of each road segment and sums these directly. The sub-path dynamic programming method [Yilun Wang, Yu Zheng, and Yexiang Xue. Travel time estimation of a path using sparse trajectories. In Proceedings of the 20th International Conference on Knowledge Discovery and Data Mining (SIGKDD), pages 25–34, 2014.] uses dynamic programming to find the optimal concatenation of sub-paths. The grid fully connected network method and the grid convolutional network method take the whole N×N grid as input and use a fully connected network (Multi-Layer Perceptron) and a convolutional neural network (Convolutional Neural Network), respectively, for optimization and estimation. Three error metrics, MAE, RMSE, and MAPE, are used to evaluate the methods:

$MAE = \frac{1}{n}\sum_{i=1}^{n} |y_i - \hat{y}_i|$

$RMSE = \sqrt{\frac{1}{n}\sum_{i=1}^{n} (y_i - \hat{y}_i)^2}$

$MAPE = \frac{1}{n}\sum_{i=1}^{n} \frac{|y_i - \hat{y}_i|}{y_i}$

where y denotes the true value, ŷ denotes the estimated value, and n denotes the total number of samples. The results in Table 1 show that the method of the invention is considerably better than the compared methods on every metric. For example, on the Shanghai dataset the MAE of the method is only 126 seconds and its MAPE is 13.3%, whereas the best existing method has an MAE of 168 seconds and a MAPE of 19.1%.
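The three error metrics can be computed as in the following sketch, which uses their standard definitions. MAPE is returned as a fraction, so it would be multiplied by 100 to report a percentage; the sample values in the usage note are made up and are not results from Table 1.

```python
import math
from typing import Sequence

def mae(y: Sequence[float], y_hat: Sequence[float]) -> float:
    """Mean absolute error."""
    return sum(abs(a - b) for a, b in zip(y, y_hat)) / len(y)

def rmse(y: Sequence[float], y_hat: Sequence[float]) -> float:
    """Root mean squared error."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y, y_hat)) / len(y))

def mape(y: Sequence[float], y_hat: Sequence[float]) -> float:
    """Mean absolute percentage error as a fraction; true values must be non-zero."""
    return sum(abs(a - b) / a for a, b in zip(y, y_hat)) / len(y)
```

For true travel times of 100 s and 200 s estimated as 110 s and 180 s, the MAE is 15 s, the RMSE is about 15.8 s, and the MAPE is 10%.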

Table 1

Description of drawings

Figure 1 shows a real trajectory sample that contains the timestamp of every intermediate GPS trajectory point; the trip takes 720 s in total.

Figure 2 shows a path sample to be queried; it contains only the path traveled and no timestamp information.

Detailed description

The implementation of the present invention is described below with a concrete example:

The historical trajectory in Figure 1 is used for training, and the travel time of the path in Figure 2 is estimated.

I. Feature extraction and representation stage: preprocess the trajectory data and extract its features. Taking Figure 1 as an example, the specific steps are:

(1) Within the city, divide the area into a fine-grained grid of adjacent small areas. In Figure 1 the map is divided into 5×6 grids. Map each coordinate point of the trajectory sequence to its small area to form a grid sequence, g = {g_1, g_2, …, g_10}.

(2) For each grid, mine features of different kinds. For example, for g_1, randomly initialized vectors V_sp and V_tp represent the spatio-temporal semantic information, that is:

[formula not reproduced]

Next, the four-dimensional vector V_dri indicates whether the current driving stage is the departure stage, the midway stage, or the end stage, together with the proportion already traveled, that is:

[formula not reproduced]

Finally, past short-term and long-term traffic condition information is used to predict the current traffic condition features V_short and V_long. Specifically, the traffic conditions of the current area g_1 are defined for the past 1st to 6th time intervals (5 min each). For example, one such feature indicates that the historical average speed is 10 m/s, that there are 8 historical trajectories, and that the estimated average passing time is 20 s. These traffic condition features are fed into a sub-recurrent neural network in historical time order, and the final hidden-layer vector h_6 is used as the traffic condition feature.

II. Training stage. The specific steps are:

(1) Build the bidirectional recurrent neural network (bidirectional LSTM) model. Randomly initialize the model parameters, including the weight matrices and biases of the forget gate, input gate, and output gate.

(2) Concatenate the features extracted from the historical trajectory data, namely the spatial feature V_sp, the temporal feature V_tp, the driving state feature V_dri, and the historical short-term and long-term traffic state features V_short and V_long, into one unified feature vector. Taking grid g_1 as an example:

[formula not reproduced]

(3) For each small grid passed, feed the feature vector into the bidirectional recurrent neural network to obtain the time of passing through that grid, W^T·h_i + b. The time cost of the whole trip is the sum of these per-grid times:

$\sum_{i=1}^{n} (W^T \cdot h_i + b)$

For example, let W = (0.1, 0.3, …, 0.7), let the hidden-layer variables of the 10 grids be h_1 = (0.8, 0.3, …, 0.2), …, h_10 = (0.7, 0.4, …, 0.5), and let the bias be b = 0.7; then:

[formula not reproduced]

(4) Let T denote the vector of true time costs of the trajectory over the grid sequence. The true time cost vector in forward order is T^f = (70, 120, …, 720), and the true time cost vector in reverse order is T^b = (720, 640, …, 50). The time cost vectors estimated by the neural network are:

[formula not reproduced]

(5) Minimize the loss function L, that is:

[formula not reproduced]

where θ denotes the trainable parameters of the model, ε denotes the temporal and spatial embedding vectors, and S is the size of the training set. Finally, the parameters of the model are updated and optimized with backpropagation through time.

III. Prediction stage. The specific steps are (taking Figure 2 as an example):

(1) Given a real trip without timestamps as the query path, obtain its grid sequence g = {g_1, g_2, …, g_8} from the actual path traveled. For each small grid passed, g_1 to g_8, use the spatio-temporal features V_sp and V_tp, the driving state feature V_dri, and the historical short-term and long-term traffic state features V_short and V_long obtained in the feature extraction and representation stage as the overall feature representation V of that grid. The spatio-temporal embedding vectors take the values updated during training, and the short-term and long-term traffic state features are extracted with the trained sub-recurrent neural network.

(2) For each grid passed, feed the extracted features into the trained bidirectional recurrent neural network to obtain the current hidden variable h_t. The estimated time of passing through the current area is then W^T·h_t + b, and the estimated total time cost is:

$\sum_{t=1}^{8} (W^T \cdot h_t + b)$

where W and b are the parameters obtained in the preceding training process.

Claims

1. A travel time estimation method based on auxiliary supervised learning, characterized in that, is divided into three stages:

(1) Feature extraction and representation stage, preprocessing the historical trajectory data and extracting its various features;

(2) In the training phase, the features extracted from the historical trajectory data are input into a unified bidirectional recurrent neural network for training, and the bidirectional interval loss function is used as the training constraint;

(3) In the prediction stage, a bidirectional recurrent neural network is used to infer the features extracted in the query path and estimate the travel time;

(1) The specific steps in the feature extraction and representation stage are:

Step (1), within the city, fine-grained division of the grid according to latitude and longitude coordinates to form adjacent small rectangular areas; each coordinate point in the track sequence composed of historical GPS coordinates sorted in chronological order Mapped to the corresponding small area to form a sequence composed of grid coordinates;

Step (2), for each grid, mine its different features; first, use embedding vectors to represent the semantic information of each small grid area across different locations and time periods; this information includes the spatial location information of the city's different functional regions as well as temporal information such as morning rush hours and weekends; specifically, a low-dimensional vector represents the spatial vector $V_{sp}$ of each grid, a day is divided into multiple time buckets, and the time bucket into which each trajectory falls gives the time vector $V_{tp}$; $V_{sp}$ and $V_{tp}$ are randomly initialized and then trained together with the model;
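A minimal sketch of the time bucketing and the randomly initialized embedding tables is given below. The number of buckets, the embedding dimension, the initialization range and all names are assumptions for illustration; the training update of the tables is not shown.

```python
import random

def time_bucket(seconds_since_midnight, n_buckets):
    """Index of the time bucket that a time of day falls into."""
    bucket_len = 86400 / n_buckets
    return min(int(seconds_since_midnight // bucket_len), n_buckets - 1)

def init_embeddings(n_items, dim, seed=0):
    """Randomly initialized embedding table; its rows would be updated in training."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(n_items)]

N_BUCKETS, N_CELLS, DIM = 48, 100, 4        # hypothetical sizes
time_table = init_embeddings(N_BUCKETS, DIM)
space_table = init_embeddings(N_CELLS, DIM, seed=1)

v_tp = time_table[time_bucket(8 * 3600 + 15 * 60, N_BUCKETS)]   # a trip at 08:15
v_sp = space_table[42]                                           # grid with id 42
```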

Step (3), a four-dimensional vector $V_{dri}$ is used to indicate whether the current driving stage is the departure stage, the midway stage, or the end stage, together with the proportion of the trip in each stage;
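One possible layout of the four-dimensional driving state vector is sketched below: three indicator entries for the departure, midway and end stages, followed by the fraction of the trip covered so far. The layout and the 20% thresholds are assumptions; the text only states that the vector has four dimensions and what it expresses.

```python
def driving_state(index, n_cells, start_frac=0.2, end_frac=0.2):
    """Return [is_departure, is_midway, is_end, progress] for the index-th grid."""
    progress = (index + 1) / n_cells
    if progress <= start_frac:
        stage = [1.0, 0.0, 0.0]      # departure stage
    elif progress > 1.0 - end_frac:
        stage = [0.0, 0.0, 1.0]      # end stage
    else:
        stage = [0.0, 1.0, 0.0]      # midway stage
    return stage + [progress]

# Driving state vectors for a hypothetical trip that traverses 10 grids.
v_dri_sequence = [driving_state(k, 10) for k in range(10)]
```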

Step (4), define the short-term traffic condition feature as $V_{short}$ and the long-term traffic condition feature as $V_{long}$; specifically, the traffic state of the current small area $g_i$ in the $j$-th past time interval is defined from three quantities: the historical average speed $v_j$, the number of historical trajectories $n_j$, and the roughly estimated transit time $len_i / v_j$; these traffic states are input into a sub-recurrent neural network in order of historical time to extract the traffic condition features;
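The three quantities named for each past interval can be computed as in the following sketch. The function name, the units, and the handling of an interval with no observations (encoded as zeros) are assumptions for illustration.

```python
def traffic_state(speeds, cell_len):
    """Summarize one past interval of one grid as (v_j, n_j, len_i / v_j)."""
    n = len(speeds)
    v = sum(speeds) / n if n else 0.0        # historical average speed
    eta = cell_len / v if v > 0 else 0.0     # rough transit time estimate
    return (v, n, eta)

# Observed speeds (m/s) in three consecutive past intervals of one grid (toy data).
history = [[10.0, 12.0, 8.0], [5.0, 5.0], []]
features = [traffic_state(s, cell_len=200.0) for s in history]
```

The resulting list, ordered by historical time, is the kind of sequence that would be fed to the sub-recurrent network.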

In addition, the traffic condition information of adjacent small areas is considered: the set of grids whose distance from $g_i$ does not exceed $d$ is defined, their past short-term traffic condition features are collected, and these are input into the neural network together; here $x$, $y$ denote the coordinates of a grid, and $g_j$ denotes any grid other than $g_i$;
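The neighbor set can be sketched as below. The distance measure is not recoverable from this text, so the sketch assumes that a grid is a neighbor when both coordinate differences are at most `d`; the function name and the clipping to the grid bounds are likewise assumptions.

```python
def neighbours(cell, d, n_rows, n_cols):
    """Grids other than `cell` whose x and y offsets are both at most d."""
    x, y = cell
    result = []
    for nx in range(max(0, x - d), min(n_rows - 1, x + d) + 1):
        for ny in range(max(0, y - d), min(n_cols - 1, y + d) + 1):
            if (nx, ny) != cell:
                result.append((nx, ny))
    return result

inner = neighbours((5, 5), d=1, n_rows=10, n_cols=10)    # 8 surrounding grids
corner = neighbours((0, 0), d=1, n_rows=10, n_cols=10)   # clipped at the border
```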

(2) The specific steps in the training phase are:

Step (1), build a recurrent neural network; denote the hidden layer of the network by $h$ and the input data by $x$; the input data of step $t$ is $x_t$ and the calculation result obtained in step $t$ is $h_t$, then:

$$h_t = \varphi(x_t \cdot W_x + h_{t-1} \cdot W_h + b) \tag{3}$$

where $W_x$ is the weight matrix of the input data, $W_h$ is the weight matrix of the hidden layer, and $b$ is the bias parameter;

That is, the hidden state is expressed as a function:

$$h_t = f(h_{t-1}, x_t) \tag{4}$$

On this basis, the forget gate is defined as:

$$f_t = \sigma(W_f \cdot [h_{t-1}, x_t] + b_f) \tag{5}$$

The input gate is:

$$i_t = \sigma(W_i \cdot [h_{t-1}, x_t] + b_i) \tag{6}$$

The output gate is:

$$o_t = \sigma(W_o \cdot [h_{t-1}, x_t] + b_o) \tag{7}$$

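The update of the memory unit is:

$$\tilde{C}_t = \tanh(W_C \cdot [h_{t-1}, x_t] + b_C) \tag{8}$$

$$C_t = f_t \cdot C_{t-1} + i_t \cdot \tilde{C}_t \tag{9}$$

The update of the hidden layer is:

$$h_t = o_t \cdot \tanh(C_t) \tag{10}$$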

where $W_f$, $W_i$, $W_o$, $W_C$ are the weight matrices of the forget gate, input gate, output gate and memory unit respectively, and $b_f$, $b_i$, $b_o$, $b_C$ are the corresponding bias parameters; $\sigma(\cdot)$ is a nonlinear activation function; $f(\cdot)$ denotes an abstract neural network function that includes the parameters of each layer; the parameters of the recurrent neural network are denoted $W_N$, and each element is initialized from the uniform distribution on $[-\alpha, \alpha]$, where $\alpha$ is a hyperparameter set in the range 0.01 to 1;
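The gated recurrent step can be sketched in plain Python as follows. The sketch assumes the standard LSTM memory update (a tanh candidate combined with the forget and input gates) and takes $\sigma$ to be the logistic sigmoid; the dictionary keys, the zero-initialized biases and the helper names are hypothetical. Weights are drawn from the uniform distribution on $[-\alpha, \alpha]$ as stated above.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def affine(W, v, b):
    """Compute W·v + b for a matrix W stored as a list of rows."""
    return [sum(w * x for w, x in zip(row, v)) + bi for row, bi in zip(W, b)]

def init_params(n_in, n_hid, alpha=0.1, seed=0):
    """Weights uniform on [-alpha, alpha]; zero biases are an assumption."""
    rng = random.Random(seed)
    params = {}
    for gate in "fioc":   # forget, input, output, memory unit
        params["W" + gate] = [[rng.uniform(-alpha, alpha) for _ in range(n_hid + n_in)]
                              for _ in range(n_hid)]
        params["b" + gate] = [0.0] * n_hid
    return params

def lstm_step(x_t, h_prev, c_prev, p):
    """One recurrent step: gates (5)-(7), memory update, hidden update (10)."""
    z = h_prev + x_t                                         # concatenation [h_{t-1}, x_t]
    f = [sigmoid(a) for a in affine(p["Wf"], z, p["bf"])]    # forget gate
    i = [sigmoid(a) for a in affine(p["Wi"], z, p["bi"])]    # input gate
    o = [sigmoid(a) for a in affine(p["Wo"], z, p["bo"])]    # output gate
    g = [math.tanh(a) for a in affine(p["Wc"], z, p["bc"])]  # candidate memory
    c = [ft * cp + it * gt for ft, cp, it, gt in zip(f, c_prev, i, g)]
    h = [ot * math.tanh(ct) for ot, ct in zip(o, c)]
    return h, c

params = init_params(n_in=3, n_hid=2, alpha=0.1)
h1, c1 = lstm_step([1.0, 2.0, 3.0], [0.0, 0.0], [0.0, 0.0], params)
```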

The bidirectional recurrent neural network computes with both a forward recurrent neural network and a reverse recurrent neural network; the forward network takes the grid features extracted in the previous steps in sequence order, and the reverse network takes the grid features after the sequence is reversed; its hidden variable is defined as the concatenation of the hidden variables of the forward and reverse networks;
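The forward and reverse passes and the concatenation of their hidden variables can be sketched as follows. The step functions are passed in as arguments, so any recurrent cell could be used; the running-sum step in the example is a toy stand-in and not the network of the method.

```python
def run_rnn(step, inputs, h0):
    """Apply h_t = step(h_{t-1}, x_t) along the sequence and return every h_t."""
    states, h = [], h0
    for x in inputs:
        h = step(h, x)
        states.append(h)
    return states

def bidirectional(step_fwd, step_bwd, inputs, h0):
    """Concatenate forward and reverse hidden variables at every position."""
    fwd = run_rnn(step_fwd, inputs, h0)
    bwd = run_rnn(step_bwd, inputs[::-1], h0)[::-1]   # realign to the original order
    return [f + b for f, b in zip(fwd, bwd)]          # per-position list concatenation

# Toy step (assumption): a running sum, so the two directions are easy to check.
running_sum = lambda h, x: [h[0] + x[0]]
hidden = bidirectional(running_sum, running_sum, [[1.0], [2.0], [3.0]], [0.0])
```

With the toy step, each position holds the sum of the inputs seen so far from the front and from the back, which shows that every hidden variable carries context from both directions.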

Step (2), the features extracted from the historical trajectory data, namely the spatial feature $V_{sp}$, the temporal feature $V_{tp}$, the driving state feature $V_{dri}$, and the historical short-term and long-term traffic state features $V_{short}$ and $V_{long}$, are concatenated into a unified feature vector:

$$V = (V_{sp}, V_{tp}, V_{dri}, V_{short}, V_{long}) \tag{11}$$

The feature vector of each traversed small grid is input to the bidirectional recurrent neural network to obtain the transit time through that grid, namely $W^T \cdot h_i + b$; the total travel time cost is then:

$$\sum_{i=1}^{n} (W^T \cdot h_i + b)$$

where $n$ is the total number of grids traversed, $W$ and $b$ are the weight matrix and bias parameter used to calculate the total time cost, and $W^T$ denotes the transpose of $W$;

Step (3), define the real time cost vector of the trajectory passing through each grid of the sequence as $T$; the real time cost vector in forward order is $T^f$, and the real time cost vector in reverse order is $T^b$; the neural network outputs the corresponding estimated time cost vectors;

The bidirectional interval loss function is used to assist the supervised learning of the model, so that it learns not only the time cost of the entire path but also the transit time of each intermediate stage; in the bidirectional interval loss function, $M$ is a mask indicating whether the trajectory passes through each small area, and $[\,]$ denotes an element-wise operation on vectors;
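The exact form of the bidirectional interval loss is not recoverable from this text, so the following is only one plausible reading and not the formula of the method: it assumes that the forward and reverse vectors hold cumulative times from the start and from the end of the trip, and that the loss is the masked mean of the squared relative errors in both directions. All names are hypothetical.

```python
def cumulative(values):
    """Running totals of a list of per-grid times."""
    out, total = [], 0.0
    for v in values:
        total += v
        out.append(total)
    return out

def interval_loss(pred_times, true_times, mask):
    """Masked mean squared relative error on forward and reverse cumulative times."""
    pf, tf = cumulative(pred_times), cumulative(true_times)
    pb = cumulative(pred_times[::-1])[::-1]
    tb = cumulative(true_times[::-1])[::-1]
    num, cnt = 0.0, 0
    for k, m in enumerate(mask):
        if m:   # only grids the trajectory actually passes through contribute
            num += ((pf[k] - tf[k]) / tf[k]) ** 2 + ((pb[k] - tb[k]) / tb[k]) ** 2
            cnt += 2
    return num / cnt if cnt else 0.0

# Toy example (assumption): two traversed grids, times in seconds.
loss = interval_loss([10.0, 20.0], [10.0, 10.0], mask=[1, 1])
```

Under this reading, an error in an intermediate grid is penalized even when the total time happens to be correct, which is the auxiliary supervision that the text describes.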

Step (4), the training objective is to minimize the loss function $L$ over the training set, where $\theta$ denotes the trainable parameters of the model, $\varepsilon$ denotes the temporal and spatial embedding vectors, and $S$ is the size of the training set; finally, the parameters of the model are updated and optimized with the backpropagation through time algorithm;

(3) The specific steps in the prediction stage are as follows:

Step (1), given a real trip without timestamps as the query path, obtain its mapped grid sequence from the actual path traveled; for each traversed small grid, take the spatial and temporal features $V_{sp}$ and $V_{tp}$, the driving state feature $V_{dri}$, and the historical short-term and long-term traffic state features $V_{short}$ and $V_{long}$, all obtained in the feature extraction and representation stage, as the total feature representation $V$ of that grid; the spatio-temporal embedding vectors take the values updated during training; the short-term and long-term traffic state features are extracted with the trained sub-recurrent neural network;

Step (2), at each traversed grid, input the extracted features into the trained bidirectional recurrent neural network to obtain the current hidden variable $h_t$; the estimated time to pass through the current area is then $W^T \cdot h_t + b$, and the estimated total time cost is:

$$\sum_{t=1}^{n} (W^T \cdot h_t + b)$$

where $n$ is the total number of grids traversed, $W$ and $b$ are the weight matrix and bias parameter obtained through training for calculating the total time cost, and $W^T$ denotes the transpose of $W$.