CN109035761A - Travel time estimation method based on back-up surveillance study - Google Patents

Travel time estimation method based on back-up surveillance study Download PDF

Info

Publication number
CN109035761A
CN109035761A CN201810658375.5A CN201810658375A CN109035761A CN 109035761 A CN109035761 A CN 109035761A CN 201810658375 A CN201810658375 A CN 201810658375A CN 109035761 A CN109035761 A CN 109035761A
Authority
CN
China
Prior art keywords
time
grid
neural network
features
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810658375.5A
Other languages
Chinese (zh)
Other versions
CN109035761B (en
Inventor
孙未未
章瀚元
吴昊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fudan University
Original Assignee
Fudan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fudan University filed Critical Fudan University
Priority to CN201810658375.5A priority Critical patent/CN109035761B/en
Publication of CN109035761A publication Critical patent/CN109035761A/en
Application granted granted Critical
Publication of CN109035761B publication Critical patent/CN109035761B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0129Traffic data processing for creating historical data or processing based on historical data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Traffic Control Systems (AREA)

Abstract

本发明属于智能交通技术领域,具体为一种基于辅助监督学习的行程时间估计方法。其从海量历史轨迹数据中寻找统计规律,通过端到端的深度学习模型对整个行程的时间进行整体的估计;步骤包括:特征提取和表示阶段,对轨迹数据进行预处理,分别抽取它的时间和空间特征,驾驶状态特征,短时间和长时间的交通状况特征;训练和预测阶段,将这些提取的特征用统一的双向循环神经网络进行训练和预测;循环神经网络每一步都输出通过当前小区域的时间开销;这些小区域的时间开销的总和即为总路径的时间开销。同时,还引入双向区间损失函数来约束中间时间开销。本方法可高效准确地对城市中的车辆行程时间进行估计,在实际环境下具有较好的效果。

The invention belongs to the technical field of intelligent transportation, in particular to a method for estimating travel time based on auxiliary supervised learning. It looks for statistical laws from massive historical trajectory data, and estimates the time of the entire trip through an end-to-end deep learning model; the steps include: feature extraction and representation stages, preprocessing the trajectory data, extracting its time and Spatial features, driving state features, short-term and long-term traffic condition features; in the training and prediction phase, these extracted features are trained and predicted with a unified two-way cyclic neural network; each step of the cyclic neural network outputs through the current small area The time cost of these small areas is the sum of the time cost of the total path. At the same time, a bidirectional interval loss function is also introduced to constrain the intermediate time overhead. This method can efficiently and accurately estimate the vehicle travel time in the city, and has a good effect in the actual environment.

Description

基于辅助监督学习的行程时间估计方法Travel Time Estimation Method Based on Auxiliary Supervised Learning

技术领域technical field

本发明属于智能交通技术领域,具体涉及一种基于辅助监督学习的行程时间估计方法。The invention belongs to the technical field of intelligent transportation, and in particular relates to a method for estimating travel time based on auxiliary supervised learning.

背景技术Background technique

行程时间估计是城市交通领域一个必不可少的重要技术,可以为人们的出行通勤提供帮助,也可以为政府规划决策提供支持。但这并不是一个简单的小问题,而是会受到各种动态因素的影响,如交通动态,路口状况,司机驾驶行为的变化和历史周期性的数据演化等等。这些因素导致行程时间估计存在不确定性和难度。随着支持GPS的移动设备的发展和普及,目前已经有大量的轨迹数据在源源不断地产生,并且覆盖城市的各个角落。有了这些海量的历史轨迹数据,我们可以挖掘数据背后的内在规律,通过构建算法模型来学习出行程时间的变化的周期和趋势,从而更加准确地推断当前查询轨迹所需的时间开销。Travel time estimation is an essential and important technology in the field of urban transportation, which can provide assistance for people's travel and commuting, and can also provide support for government planning decisions. But this is not a simple small problem, but will be affected by various dynamic factors, such as traffic dynamics, intersection conditions, changes in driver's driving behavior and historical periodic data evolution, etc. These factors lead to uncertainty and difficulty in estimating travel time. With the development and popularization of GPS-enabled mobile devices, a large amount of trajectory data has been continuously generated and covers every corner of the city. With these massive historical trajectory data, we can mine the inherent laws behind the data, and learn the cycle and trend of travel time changes by building an algorithm model, so as to more accurately infer the time required for the current query trajectory.

目前已有的方法大多采用分而治之(divide-and-conquer)的方法,主要是通过将路径分解一系列的路段或者子路径这两类。At present, most of the existing methods adopt a divide-and-conquer method, mainly by decomposing the path into a series of road sections or sub-paths.

(1)基于单一路段的方法:(1) The method based on a single road section:

基于单路段的方法主要通过估计每一条单一路段的轨迹经过时的平均速度,进而根据路段长度计算出经过的平均时间开销,最后将各个路段的时间和累加得到总的时间。但这种方法没有考虑路段之间的路口时间开销。另外,这种估计严重依赖于高质量的速度数据,而这往往在轨迹数据中无法得到。The method based on a single road segment mainly estimates the average speed of the trajectory of each single road segment, and then calculates the average time cost of passing according to the length of the road segment, and finally accumulates the time of each road segment to obtain the total time. But this method does not consider the intersection time overhead between road segments. Additionally, this estimation relies heavily on high-quality velocity data, which is often not available in trajectory data.

(2)基于子路径的方法:(2) Subpath-based method:

基于子路径的方法主要通过将路径分割成一系列的子路径方法,使得路口的时间开销也得到考虑。主要思路都是对历史数据中丰富的公共子路径信息进行拼接和挖掘。尽管这种方法可以克服单一路段方法的许多缺陷,但它仍然是基于启发式设计,而不是直接将行程时间作为算法优化目标。The subpath-based method mainly divides the path into a series of subpaths, so that the time cost of the intersection is also considered. The main idea is to splicing and mining the rich public sub-path information in the historical data. Although this approach can overcome many of the shortcomings of the single-segment approach, it is still based on heuristic design rather than directly taking travel time as the algorithm optimization goal.

总而言之,目前已有的方法无法达到令人满意的准确性有两个方面的原因。一个是它们没有把路径看成一个整体,而是拆分成各个子块。在这一拆分过程中,损失了很多有用的信息。并且,它们没有充分利用轨迹数据特有的中间监督标签,也就是每一个中间GPS采样点的时间戳信息。另一方面,随着深度学习技术的发展和繁荣,更多的问题可以通过端到端一体式地解决,相较于传统启发式模型要更为高效。并且,深度学习有着强大的表征能力,与手工模型相比,可以捕捉到更多的潜在特征,能够处理行程估计问题中各种复杂的动态性。All in all, there are two reasons why currently existing methods cannot achieve satisfactory accuracy. One is that they do not treat the path as a whole, but split it into sub-blocks. In this splitting process, a lot of useful information is lost. Moreover, they do not make full use of the unique intermediate supervision labels of trajectory data, that is, the timestamp information of each intermediate GPS sampling point. On the other hand, with the development and prosperity of deep learning technology, more problems can be solved through end-to-end integration, which is more efficient than traditional heuristic models. Moreover, deep learning has a powerful representation ability. Compared with manual models, it can capture more potential features and can deal with various complex dynamics in travel estimation problems.

发明内容Contents of the invention

本发明的目的是针对传统的两类行程时间估计技术的局限性,提出一种基于辅助监督学习的历史轨迹的行程时间估计方法,以克服现有技术的不足。The purpose of the present invention is to address the limitations of the traditional two types of travel time estimation techniques, and propose a travel time estimation method based on historical trajectories of auxiliary supervised learning to overcome the deficiencies of the prior art.

本发明方法从海量历史轨迹数据中寻找统计规律,通过端到端的深度学习模型对整个行程的时间进行整体的估计。基本步骤包括:特征提取和表示阶段,对轨迹数据进行预处理,分别抽取它的各方面特征;训练和预测阶段,将这些提取的特征用一个统一的双向循环神经网络进行训练和预测;循环神经网络每一步都输出通过当前小区域的时间开销;这些小区域的时间开销的总和即为总路径的时间开销;为了更加有效地进行训练,还引入了双向区间损失函数来约束中间时间开销。The method of the present invention searches for statistical laws from massive historical trajectory data, and performs an overall estimation of the entire travel time through an end-to-end deep learning model. The basic steps include: the feature extraction and representation stage, preprocessing the trajectory data, and extracting its various features; the training and prediction stage, using a unified two-way cyclic neural network to train and predict these extracted features; cyclic neural network Each step of the network outputs the time cost of passing through the current small area; the sum of the time cost of these small areas is the time cost of the total path; in order to train more effectively, a bidirectional interval loss function is also introduced to constrain the intermediate time cost.

本发明提出的基于辅助监督学习的历史轨迹的行程时间估计方法,分为如下三个阶段:The travel time estimation method based on the historical trajectory of auxiliary supervised learning proposed by the present invention is divided into the following three stages:

(一)特征提取和表示阶段,对历史轨迹数据进行预处理,抽取它的各方面特征(包括时间特征和空间特征,驾驶状态特征,短时间和长时间的交通状况特征等)。具体步骤为:(1) Feature extraction and representation stage, preprocessing the historical trajectory data, and extracting its various features (including temporal and spatial features, driving state features, short-term and long-term traffic conditions, etc.). The specific steps are:

步骤(1),在城市范围内,根据经纬度坐标对网格进行细粒度划分,形成一个个相邻的矩形小区域。将按时间顺序排序,由GPS坐标组成的轨迹序列中的每一个坐标点映射到对应的小区域中,形成一个由网格坐标组成的序列。对于相邻轨迹点距离较远,落在不连续的小区域内的情况,可以地图匹配等算法得到中间经过路径,补全这部分不连续的区域信息。In step (1), within the city limits, the grid is fine-grained according to the latitude and longitude coordinates to form adjacent small rectangular areas. Sorting in chronological order, each coordinate point in the trajectory sequence composed of GPS coordinates is mapped to the corresponding small area to form a sequence composed of grid coordinates. For the situation that the adjacent trajectory points are far away and fall in a small discontinuous area, the intermediate path can be obtained by algorithms such as map matching, and the information of this part of the discontinuous area can be supplemented.

步骤(2),对于每一个网格,挖掘它不同方面的特征。首先,使用嵌入向量技术来挖掘潜在语义信息。嵌入向量技术在自然语言处理和社交网络等领域等到了广泛的使用,主要是利用低维的实数向量来代表每一个词或者事物的语义信息,通过向量空间中的距离关系来衡量实物之间的对应关系。本发明利用嵌入向量技术来表征每一个网格小区域在不同空间以及不同时间段的语义信息。这些信息包含了城市不同的功能区域(例如居民区,商业区或工业区等等)空间区位信息,也包括了早高峰,周末等时间信息。具体地,利用低维向量来表示每一个网格的空间向量Vsp,将一天划分成多个时间桶(例如一个小时一个桶),每一条轨迹根据具体落入的时间桶来得到时间向量Vtp。对Vsp和Vtp进行随机初始化,之后在模型训练时跟着模型一起训练。Step (2), for each grid, mining its different aspects of features. First, embedding vector technology is used to mine latent semantic information. Embedded vector technology has been widely used in the fields of natural language processing and social networks. It mainly uses low-dimensional real number vectors to represent the semantic information of each word or thing, and measures the relationship between objects through the distance relationship in the vector space. Correspondence. The present invention uses embedded vector technology to represent the semantic information of each small grid area in different spaces and different time periods. These information include the spatial location information of different functional areas of the city (such as residential areas, commercial areas or industrial areas, etc.), as well as time information such as morning peak hours and weekends. Specifically, a low-dimensional vector is used to represent the space vector V sp of each grid, and a day is divided into multiple time buckets (for example, one bucket per hour), and each trajectory obtains the time vector V according to the specific time bucket it falls into. tp . Randomly initialize V sp and V tp , and then train with the model during model training.

步骤(3),司机在开车时,在不同的行驶状态时,行驶的速度和驾驶行为都会发生变化。例如,车辆在行驶路径的中间部分时,会更倾向于行驶在大路或者高架上,这时速度会更快。而在刚出发或者快到终点时,由于行驶在小路或者人多的区域,往往速度就会变慢。具体地,使用四维向量Vdri来表示当前行驶阶段是出发阶段,中途阶段,还是结束阶段,以及在各个阶段已经行驶的比例。例如,Vdri=(1,0,0,0.2)表示司机行驶在开始阶段,占了总行程的20%。In step (3), when the driver is driving, the driving speed and driving behavior will change in different driving states. For example, when the vehicle is in the middle of the driving path, it will be more inclined to drive on the road or elevated, and the speed will be faster at this time. When just starting or approaching the end, the speed will often slow down due to driving on a small road or in a crowded area. Specifically, the four-dimensional vector V dri is used to indicate whether the current driving stage is a departure stage, a midway stage, or an end stage, and the proportion of the current driving in each stage. For example, V dri =(1, 0, 0, 0.2) means that the driver travels at the beginning stage, which accounts for 20% of the total trip.

步骤(4),在一个区域内的交通状况,往往随着时间演变会有周期性和规律性的变化。例如,如果一个路段在8点到8点半都很堵,那么8点35分它也可能很堵。也就是说,过去短时间内的交通状况信息,对预测当前的交通状态很有帮助。定义该短时间的交通状况特征为Vshort。与此同时,长时间周期性的交通状况变化也能帮助预测当前交通状况,例如工作日和周末的交通状况变化规律。定义该长时间的交通状况特征为Vlong。具体来说,In step (4), traffic conditions in an area tend to change periodically and regularly over time. For example, if a road segment is congested from 8:00 to 8:30, it may also be congested at 8:35. That is to say, the traffic condition information in a short period of time in the past is very helpful for predicting the current traffic condition. The traffic condition characteristic of this short time is defined as V short . At the same time, long-term periodic changes in traffic conditions can also help predict the current traffic conditions, such as the regularity of traffic conditions on weekdays and weekends. The traffic condition characteristic of this long time is defined as V long . Specifically,

定义: definition:

表示在过去第j个时间区间内,当前小区域gi的交通状况,其中vj表示历史平均速度,nj表示历史轨迹数据数量,leni/vj表示粗略估计的通过时间。将这些交通状况特征按照历史时间顺序输入到一个子循环神经网络中,可以抽取出交通状况特征。Indicates the traffic status of the current small area g i in the jth time interval in the past, where v j represents the historical average speed, n j represents the number of historical trajectory data, and len i /v j represents the roughly estimated passing time. These traffic condition features are input into a sub-cycle neural network according to the order of historical time, and the traffic condition features can be extracted.

另外,由于历史数据在不同空间区域分布不均衡,有些区域轨迹经过数量较少,可能会对估计的准确性造成影响。为了解决这一数据稀疏问题,将邻接小区域的交通状况信息也考虑进来,即In addition, due to the uneven distribution of historical data in different spatial regions, some regions have a small number of trajectories, which may affect the accuracy of estimation. In order to solve this data sparsity problem, the traffic status information of adjacent small areas is also taken into account, that is,

定义: definition:

表示距离gi距离不超过d的网格集合,收集它们过去短时的交通状况特征,一起输入到神经网络中。其中,x,y表示网格的坐标,gj表示除gi以外的其他网格。Represents a set of grids whose distance from g i does not exceed d, collects their past short-term traffic condition characteristics, and inputs them into the neural network together. Among them, x, y represent the coordinates of the grid, and g j represents other grids except g i .

(二)训练阶段,将历史轨迹数据中提取的特征输入到一个统一的双向循环神经网络(bidirectional LSTM,参考文献:Graves A,Schmidhuber J.Framewise phonemeclassification with bidirectional LSTM and other neural network architectures[J].Neural Networks,2005,18(5-6):602-610.)进行训练,并且以双向区间损失函数作为训练的约束;具体步骤为:(2) In the training phase, the features extracted from the historical trajectory data are input into a unified bidirectional LSTM, references: Graves A, Schmidhuber J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures[J]. Neural Networks,2005,18(5-6):602-610.) for training, and use the bidirectional interval loss function as the training constraint; the specific steps are:

步骤(1),构建循环神经网络。定义网络隐层为输入数据为那么,第t步的输入数据为xt,第t步得到的计算结果为ht,则有:Step (1), constructing a recurrent neural network. Define the hidden layer of the network as The input data is Then, the input data of step t is x t , and the calculation result obtained in step t is h t , then:

ht=φ(xt·Wx+ht-1·Wh+b) (3)h t =φ(x t ·W x +h t-1 ·W h +b) (3)

其中,是输入数据的权重矩阵(weight matrix),是隐层的权重矩阵,是偏置参数(bias)。φ表示一个非线性激活函数,可以是sigmoid函数,ReLU函数,tanh函数等等。in, is the weight matrix of the input data, is the weight matrix of the hidden layer, is the bias parameter (bias). φ represents a nonlinear activation function, which can be a sigmoid function, a ReLU function, a tanh function, and so on.

也就是说隐状态可以表示为函数:That is to say, the hidden state can be expressed as a function:

ht=f(ht-1,xt) (4)h t =f(h t-1 ,x t ) (4)

在这基础上,定义遗忘门为:On this basis, the forget gate is defined as:

ft=σ(Wf·[ht-1,xt]+bf) (5)f t =σ(W f ·[h t-1 ,x t ]+b f ) (5)

输入门为:The input gate is:

it=σ(Wi·[ht-1,xt]+bi) (6)i t =σ(W i ·[h t-1 ,x t ]+b i ) (6)

输出门为:The output gate is:

ot=σ(Wo[ht-1,xt]+bo) (7)o t =σ(W o [h t-1 ,x t ]+b o ) (7)

记忆单元的更新为:The update of the memory unit is:

隐层的更新为:The update of the hidden layer is:

ht=Ot·tanh(Ct) (10)h t =O t ·tanh(C t ) (10)

其中,Wf、Wi、Wo分别表示遗忘门、输入门、输出门和记忆单元的权重矩阵,bf、bi、bo则是对应的偏置参数。σ()为一个非线性的激活函数,例如是一个sigmoid函数,是一个双曲正切函数,f( )表示一个包含各层参数的抽象神经网络函数。定义循环神经网络对应的参数为WN;从[-α,α]的均匀分布中对循环神经网络中的每个权重参数进行初始化,其中,α是为一个超参数,设定范围为0.01到1。Among them, W f , W i , W o , represent the weight matrix of forget gate, input gate, output gate and memory unit respectively, b f , b i , b o , is the corresponding bias parameter. σ() is a non-linear activation function, such as is a sigmoid function, is a hyperbolic tangent function, and f( ) represents an abstract neural network function including parameters of each layer. Define the parameter corresponding to the cyclic neural network as W N ; initialize each weight parameter in the cyclic neural network from the uniform distribution of [-α,α], where α is a hyperparameter, and the setting range is from 0.01 to 1.

双向循环神经网络同时使用一个正向的循环神经网络和一个反向的循环神经网络进行计算。其中正向循环神经网络根据序列的顺序依次将之前步骤提取的网格特征输入,而反向循环神经网络则将序列逆序后输入网格特征。这么做的优点在于,可以使得神经网络同时观察到当前网格距离起点和终点的位置距离,从而拥有一个整体的特征。定义它的隐变量为正向和反向网络的拼接其中表示正向循环神经网络的隐层,表示反向循环神经网络的隐层。Bidirectional RNNs use both a forward RNN and a reverse RNN for computation. Among them, the forward cyclic neural network sequentially inputs the grid features extracted in the previous steps according to the order of the sequence, while the reverse cyclic neural network inputs the grid features after the sequence is reversed. The advantage of doing this is that the neural network can observe the position distance of the current grid from the starting point and the end point at the same time, so as to have an overall feature. Define its hidden variable as the splicing of forward and reverse networks in Represents the hidden layer of the forward recurrent neural network, Represents the hidden layer of a reverse recurrent neural network.

步骤(2),将历史轨迹数据中提取的特征,即空间特征时间特征驾驶状态特征历史上短时间和长时间的交通状态特征拼接成一个统一的特征向量:Step (2), the features extracted from the historical trajectory data, that is, the spatial features time feature driving status characteristics Historical short-term and long-term traffic status characteristics and Concatenate into a unified eigenvector:

在每一个经过的小网格输入到双向循环神经网络,以得到经过该网格的通过时间,即WT·hi+b。总的行程的时间开销为:Input the bidirectional cyclic neural network into each passing small grid to obtain the passing time of the grid, that is, W T h i +b. total trip time cost for:

定义分别为计算总时间开销的权重矩阵和偏置参数。WT表示W矩阵的转置。definition are the weight matrix and bias parameters for calculating the total time overhead, respectively. W T denotes the transpose of the W matrix.

步骤(3),定义轨迹经过各个网格序列的真实时间开销向量为T。顺序的真实时间开销向量为Tf,逆序的真实时间开销向量为Tb。则神经网络估计得到的时间开销向量为:Step (3), define the real time cost vector of the trajectory passing through each grid sequence as T. The sequential real time cost vector is T f , and the reverse real time cost vector is T b . Then the time cost vector estimated by the neural network is:

使用双向区间损失函数对模型进行辅助监督学习,使其不仅学习整条路径的时间开销,同时可以学习各个中间阶段的通行时间。定义双向区间损失函数为:The model is assisted with supervised learning using a bidirectional interval loss function, so that it not only learns the time cost of the entire path, but also learns the transit time of each intermediate stage. Define the two-way interval loss function as:

其中,M表示轨迹是否经过小区域的掩码,[]表示向量每个元素间的操作。Among them, M indicates whether the trajectory passes through the mask of the small area, and [] indicates the operation between each element of the vector.

步骤(4),训练的目标是,最小化损失函数L,即:Step (4), the training goal is to minimize the loss function L, namely:

其中,θ表示模型的训练参数,ε表示时间和空间上的嵌入向量,S是训练集的大小。最后,使用基于时间顺序的反向传播算法对模型进行参数的更新和优化。反向传播算法参考文献:Chauvin Y,Rumelhart D E.Backpropagation:theory,architectures,andapplications[M].Psychology Press,2013.Among them, θ represents the training parameters of the model, ε represents the embedding vector in time and space, and S is the size of the training set. Finally, the parameters of the model are updated and optimized using the backpropagation algorithm based on time sequence. Backpropagation Algorithm References: Chauvin Y, Rumelhart D E. Backpropagation: theory, architectures, and applications [M]. Psychology Press, 2013.

(三)预测阶段,用双向循环神经网络对查询路径中提取的特征进行推断并估计行程时间;具体步骤为:(3) In the prediction stage, use the bidirectional recurrent neural network to infer the features extracted from the query path and estimate the travel time; the specific steps are:

步骤(1),给定一条没有时间戳标记的真实行程作为查询路径,根据经过的实际路径,得到其映射的网格序列。对于每一个经过的小网格,使用特征提取和表示阶段抽取得到的时空特征Vsp和Vtp,驾驶状态特征Vdri,和历史上短时间和长时间的交通状态特征Vshort和Vlong,作为该网格的总特征表示V。其中,时空特征的嵌入向量使用经过训练过程的参数更新后的向量信息。短时和长时间的交通状态特征使用经过训练的子循环神经网络进行特征挖掘。Step (1), given a real itinerary without a time stamp as the query path, according to the actual path passed, get its mapped grid sequence. For each passing small grid, use the spatio-temporal features V sp and V tp obtained in the feature extraction and representation stages, the driving state features V dri , and the historical short-term and long-term traffic state features V short and V long , V is represented as the total feature of the grid. Among them, the embedding vector of the spatio-temporal feature uses the vector information updated by the parameters of the training process. Short-term and long-term traffic state features are mined using a trained sub-recurrent neural network.

步骤(2),在每一个经过的网格,将抽取的各方面特征输入到已经经过训练的双向循环神经网络中,得到当前的隐变量ht,那么经过当前区域的估计时间为WT·ht+b。而总的时间开销估计值为:In step (2), in each passing grid, input all aspects of the extracted features into the trained bidirectional cyclic neural network to obtain the current hidden variable h t , then the estimated time to pass through the current area is W T · ht +b. And the total time cost estimate is:

其中,n表示经过的总网格数目,为经过训练得到的,计算总时间开销的权重矩阵和偏置参数。WT表示W矩阵的转置。Among them, n represents the total number of grids passed, Calculate the weight matrix and bias parameters of the total time overhead for training. W T denotes the transpose of the W matrix.

总的来说,本发明方法有以下几个优点。首先,利用端到端(end-to-end)基于历史数据训练的深度学习方法,直接学习出整条路径的特征并估计出整体的通行时间。我们定义了一个双向区间损失函数,可以在监督整体的路径时间的基础上,同时辅助监督通过中间路段的时间开销。这种引入辅助监督的方法既丰富了路径的样本信息,又可以使得反向传播算法对参数更新时传播信号可以更加准确。其次,提出了一个特征抽取结构,通过提取时空嵌入向量,行驶状态,以及短时间和长时间的交通状况等不同维度的动态特征,能有效地估计出路径的通行时间。最后,在实际环境下经过实验验证,具有比已有方法更好的实验结果。In general, the method of the present invention has the following advantages. First, using an end-to-end (end-to-end) deep learning method based on historical data training, directly learn the characteristics of the entire path and estimate the overall transit time. We define a bidirectional interval loss function, which can supervise the overall path time while assisting in supervising the time cost of passing through the intermediate road sections. This method of introducing auxiliary supervision not only enriches the sample information of the path, but also makes the propagation signal more accurate when the backpropagation algorithm updates the parameters. Secondly, a feature extraction structure is proposed, which can effectively estimate the travel time of the route by extracting dynamic features of different dimensions such as spatio-temporal embedding vectors, driving status, and short-term and long-term traffic conditions. Finally, it has been verified by experiments in the actual environment, and has better experimental results than existing methods.

如表1所示,我们用真实的历史轨迹数据进行实验,包括波尔图和上海两个城市。我们用路段平均时间法,子路径动态规划法,网格全连接网络法,网格卷积网络法等已有方法进行对比。其中,路段平均时间法通过统计每个路段的平均通过时间,直接累加得到结果。子路径动态规划法[Yilun Wang,Yu Zheng,and Yexiang Xue.Travel timeestimation of a path using sparse trajecto-ries.In Proceedings of the 20thInternational Conference on Knowledge Discovery and Data Mining(SIGKDD),pages25–34,2014.]利用动态规划找到子路径的最优拼接方法。网格全连接网络法和网格卷积网络法将N×N的整体网格作为输入,分别用全连接网络(Multi-Layer Perceptron)和卷积神经网络(Convolutional Neural Network)进行优化和估计。我们使用MAE,RMSE,MAPE三个误差度量指标衡量方法的好坏。As shown in Table 1, we conduct experiments with real historical trajectory data, including the two cities of Porto and Shanghai. We compare existing methods such as segment average time method, sub-path dynamic programming method, grid fully connected network method, and grid convolution network method. Among them, the road section average time method obtains the result by directly accumulating the average passing time of each road section. Sub-path dynamic programming method[Yilun Wang, Yu Zheng, and Yexiang Xue.Travel timeestimation of a path using sparse trajectory-ries.In Proceedings of the 20thInternational Conference on Knowledge Discovery and Data Mining(SIGKDD),pages25–34,2014.] Using dynamic programming to find the optimal stitching method for subpaths. The grid fully connected network method and the grid convolutional network method take an N×N overall grid as input, and use a fully connected network (Multi-Layer Perceptron) and a convolutional neural network (Convolutional Neural Network) for optimization and estimation, respectively. We use MAE, RMSE, and MAPE three error metrics to measure the quality of the method.

其中,y表示真实值,表示估计值,n表示样本总数。由表1结果可知,本发明方法在各项指标都要远好于已有的对比方法。例如,在上海数据集上,本发明方法估计MAE误差只有126秒,MAPE误差是13.3%,而已有最好方法的MAE误差在168秒,MAPE误差是19.1%。Among them, y represents the real value, Indicates the estimated value, and n indicates the total number of samples. As can be seen from the results in Table 1, the method of the present invention is far better than the existing comparative method in every index. For example, on the Shanghai data set, the method of the present invention estimates that the MAE error is only 126 seconds, and the MAPE error is 13.3%, while the MAE error of the existing best method is 168 seconds, and the MAPE error is 19.1%.

表1Table 1

附图说明Description of drawings

图1表示一个真实轨迹样本,包含每一个中间GPS轨迹点的时间戳信息,一共经过720s。Figure 1 shows a real trajectory sample, including the time stamp information of each intermediate GPS trajectory point, and a total of 720s has passed.

图2表示需要查询的路径样本,仅包含具体经过的路径信息,不包含任何时间戳信息。Figure 2 shows the path samples that need to be queried, which only contain specific path information and do not contain any time stamp information.

具体实施方式Detailed ways

下面结合具体实例来说明本发明的具体实施过程:The specific implementation process of the present invention will be described below in conjunction with specific examples:

如图1中的历史轨迹用于训练,并估计图2中的行程时间。The historical trajectories in Figure 1 are used for training, and the travel time in Figure 2 is estimated.

一、预处理阶段,特征提取和表示阶段,对轨迹数据进行预处理,抽取它的各方面特征。以图1为例,具体步骤为:1. The preprocessing stage, the feature extraction and representation stage, preprocesses the trajectory data and extracts its various features. Taking Figure 1 as an example, the specific steps are:

(1)在城市范围内,进行细粒度网格划分,分成一个个相邻的小区域。如图1中,将地图划分成5×6个网格。将轨迹序列中的每一个坐标点映射到对应的小区域中,形成一个网格序列,即g={g1,g2,…,g10}。(1) In the urban area, fine-grained grid division is carried out, and it is divided into adjacent small areas. As shown in Figure 1, the map is divided into 5×6 grids. Each coordinate point in the trajectory sequence is mapped to a corresponding small area to form a grid sequence, that is, g={g 1 , g 2 ,...,g 10 }.

(2)对于每一个网格,挖掘它不同方面的特征。例如,对于g1,使用随机向量来表示时空语义信息。即:(2) For each grid, mine its characteristics in different aspects. For example, for g 1 , use a random vector and to represent spatiotemporal semantic information. which is:

其次,使用四维向量来表示当前行驶阶段是出发阶段,中途阶段,还是结束阶段,以及在各个阶段已经行驶的比例,即:Second, use the 4D vector To indicate whether the current driving stage is the starting stage, the halfway stage, or the end stage, and the proportion of driving in each stage, namely:

最后,使用过去短时间和长时间的交通状况信息来预测当前的交通状况特征具体来说,定义为过去的第1到6个时间区间(5min)内,当前区域g1的交通状况。例如表示历史平均速度是10m/s,共有8条历史轨迹,平均通过时间估计为20m/s。将将这些交通状况特征按照历史时间顺序输入到一个子循环神经网络中,将最后输出的隐层向量h6作为交通状况特征。Finally, use past short-term and long-term traffic condition information to predict current traffic condition characteristics and Specifically, define It is the traffic condition of the current area g 1 in the past 1st to 6th time interval (5min). E.g Indicates that the historical average speed is 10m/s, there are 8 historical trajectories, and the average passing time is estimated to be 20m/s. Will These traffic condition features are input into a sub-cycle neural network according to the order of historical time, and the final output hidden layer vector h 6 is used as the traffic condition feature.

二、训练阶段,具体步骤为:Second, the training phase, the specific steps are:

(1)建立双向循环神经网络(Bi-directional LSTM)模型。随机初始化模型的各项参数,包括遗忘门,输入门,输出门的矩阵参数和偏置参数。(1) Establish a bidirectional cyclic neural network (Bi-directional LSTM) model. Randomly initialize the parameters of the model, including the forget gate, input gate, matrix parameters and bias parameters of the output gate.

(2)将历史轨迹数据中提取的特征,即空间特征Vsp,时间特征Vsp,驾驶状态特征Vdri,历史上短时间和长时间的交通状态特征Vshort和Vlong,拼接成一个统一的特征向量。以网格g1为例(2) The features extracted from the historical trajectory data, namely the spatial feature V sp , the temporal feature V sp , the driving state feature V dri , the historical short-term and long-term traffic state features V short and V long , are spliced into a unified eigenvectors of . Take grid g 1 as an example

(3)在每一个经过的小网格输入到双向循环神经网络,以得到经过该网格的通过时间,即WT·hi+b。总的行程的时间开销为:(3) Input each passed small grid into the bidirectional cyclic neural network to obtain the passing time of the grid, that is, W T ·h i +b. The total travel time cost is:

例如,定义W=(0.1,0.3,..,0.7),10个网格隐层变量值为h1=(0.8,0.3,…,0.2),…,h10=(0.7,0.4,..0.5),偏置值b=0.7,则:For example, define W=(0.1,0.3,..,0.7), and the 10 grid hidden layer variable values are h 1 =(0.8,0.3,…,0.2),…,h 10 =(0.7,0.4,.. 0.5), bias value b=0.7, then:

(4)定义轨迹经过各个网格序列的真实时间开销向量为T。顺序的真实时间开销向量为Tf=(70,120,…,720),逆序的真实时间开销向量为Tb=(720,640,…,50)。则神经网络估计得到的时间开销向量为:(4) Define the real time cost vector of the trajectory passing through each grid sequence as T. The real time cost vector in sequence is T f =(70,120,...,720), and the real time cost vector in reverse order is T b =(720,640,...,50). Then the time cost vector estimated by the neural network is:

使用双向区间损失函数对模型进行辅助监督学习,使其不仅学习整条路径的时间开销,同时可以学习各个中间阶段的通行时间。定义双向区间损失函数为:The model is assisted with supervised learning using a bidirectional interval loss function, so that it not only learns the time cost of the entire path, but also learns the transit time of each intermediate stage. Define the two-way interval loss function as:

其中,M表示轨迹是否经过小区域的掩码,[]表示向量每个元素间的操作。Among them, M indicates whether the trajectory passes through the mask of the small area, and [] indicates the operation between each element of the vector.

(5)最小化损失函数L,即:(5) Minimize the loss function L, namely:

其中,θ表示模型的训练参数,ε表示时间和空间上的嵌入向量,S是训练集的大小。最后,使用基于时间顺序的反向传播算法对模型进行参数的更新和优化。Among them, θ represents the training parameters of the model, ε represents the embedding vector in time and space, and S is the size of the training set. Finally, the parameters of the model are updated and optimized using the backpropagation algorithm based on time sequence.

三.预测阶段,具体步骤为(以图2为例):3. Prediction stage, the specific steps are (take Figure 2 as an example):

(1)给定一条没有时间戳标记的真实行程作为查询路径g={g1,g2,…,g8},根据经过的实际路径,得到其映射的网格序列。对于每一个经过的小网格g1~g8,使用特征提取和表示阶段抽取得到的时空特征Vsp和Vtp,驾驶状态特征Vdri,和历史上短时间和长时间的交通状态特征Vshort和Vlong,作为该网格的总特征表示V。其中,时空特征的嵌入向量使用经过训练过程的参数更新后的向量信息。短时和长时间的交通状态特征使用经过训练的子循环神经网络进行特征挖掘。(1) Given a real itinerary without a time stamp as the query path g={g 1 ,g 2 ,…,g 8 }, according to the actual path passed, obtain its mapped grid sequence. For each passing small grid g 1 ~ g 8 , the spatio-temporal features V sp and V tp , the driving state features V dri , and the historical short-term and long-term traffic state features V short and V long , representing V as the total feature of the grid. Among them, the embedding vector of the spatio-temporal feature uses the vector information updated by the parameters of the training process. Short-term and long-term traffic state features are mined using a trained sub-recurrent neural network.

(2)在每一个经过的网格,将抽取的各方面特征输入到已经经过训练的双向循环神经网络中,得到当前的隐变量ht,那么经过当前区域的估计时间为WT·ht+b。而总的时间开销估计值为;(2) In each passing grid, input all aspects of the extracted features into the trained bidirectional cyclic neural network to obtain the current hidden variable h t , then the estimated time to pass through the current area is W T h t +b. And the total time overhead estimate is;

其中,W和b均由之前训练过程中得到的参数。Among them, W and b are the parameters obtained in the previous training process.

Claims (1)

1.一种基于辅助监督学习的行程时间估计方法,其特征在于,分为三个阶段:1. A travel time estimation method based on auxiliary supervised learning, characterized in that, is divided into three stages: (一)特征提取和表示阶段,对历史轨迹数据进行预处理,抽取它的各方面特征;(1) Feature extraction and representation stage, preprocessing the historical trajectory data and extracting its various features; (二)训练阶段,将历史轨迹数据中提取的特征输入到一个统一的双向循环神经网络进行训练,并且以双向区间损失函数作为训练的约束;(2) In the training phase, the features extracted from the historical trajectory data are input into a unified bidirectional recurrent neural network for training, and the bidirectional interval loss function is used as the training constraint; (三)预测阶段,用双向循环神经网络对查询路径中提取的特征进行推断并估计行程时间;(3) In the prediction stage, a bidirectional recurrent neural network is used to infer the features extracted in the query path and estimate the travel time; (一)特征提取和表示阶段的具体步骤为:(1) The specific steps in the feature extraction and representation stage are: 步骤(1),在城市范围内,根据经纬度坐标对网格进行细粒度划分,形成一个个相邻的矩形小区域;将由按时间顺序排序的历史GPS坐标组成的轨迹序列中的每一个坐标点映射到对应的小区域中,形成一个由网格坐标组成的序列;Step (1), within the city, fine-grained division of the grid according to latitude and longitude coordinates to form adjacent small rectangular areas; each coordinate point in the track sequence composed of historical GPS coordinates sorted in chronological order Mapped to the corresponding small area to form a sequence composed of grid coordinates; 步骤(2),对于每一个网格,挖掘它不同方面的特征;首先,利用嵌入向量技术来表征每一个网格小区域在不同空间以及不同时间段的语义信息;这些信息包含城市不同的功能区域空间区位信息,也包括早高峰,周末等时间信息;具体地,利用低维向量来表示每一个网格的空间向量Vsp,将一天划分成多个时间桶,每一条轨迹根据具体落入的时间桶来得到时间向量Vtp;对Vsp和Vtp进行随机初始化,之后在模型训练时跟着模型一起训练;Step (2), for each grid, mine its different features; first, use the embedding vector technology to represent the semantic information of each grid small area in different spaces and different time periods; these information include different functions of the city Regional spatial location information also includes time information such as morning peak hours and weekends; specifically, a low-dimensional vector is used to represent the spatial vector V sp of each grid, and a day is divided into multiple time buckets, and each trajectory falls into time bucket to get the time vector V tp ; randomly initialize V sp and V tp , and then train with the model during model training; 步骤(3),使用四维向量Vdri来表示当前行驶阶段是出发阶段,中途阶段,还是结束阶段,以及在各个阶段已经行驶的比例;Step (3), using the four-dimensional vector V dri to indicate whether the current driving stage is the departure stage, the halfway stage, or the end stage, and the proportion of the travel in each stage; 步骤(4),定义短时间交通状况特征为Vshort,定义长时间交通状况特征为Vlong,具体地,Step (4), define the short-term traffic condition characteristic as V short , define the long-term traffic condition characteristic as V long , specifically, 定义: definition: 表示在过去第j个时间区间内,当前小区域gi的交通状况,其中vj表示历史平均速度,nj表示历史轨迹数据数量,leni/vj表示粗略估计的通过时间;将这些交通状况特征按照历史时间顺序输入到一个子循环神经网络中,用以抽取出交通状况特征;Indicates the traffic status of the current small area g i in the jth time interval in the past, where v j represents the historical average speed, n j represents the number of historical trajectory data, len i /v j represents the roughly estimated passing time; these traffic The condition features are input into a sub-cyclic neural network in order of historical time to extract traffic condition features; 另外,考虑邻接小区域的交通状况信息,即In addition, consider the traffic condition information of adjacent small areas, namely 定义: definition: 表示距离gi距离不超过d的网格集合,收集它们过去短时的交通状况特征,一起输入到神经网络中;其中,x,y表示网格的坐标,gj表示除gi以外的其他网格;Indicates the grid set whose distance from g i does not exceed d, collects their past short-term traffic condition characteristics, and inputs them into the neural network together; where x, y represent the coordinates of the grid, and g j represents other grids except g i grid; (二)训练阶段的具体步骤为:(2) The specific steps in the training phase are: 步骤(1),构建循环神经网络;定义网络隐层为输入数据为那么,第t步的输入数据为xt,第t步得到的计算结果为ht,则有:Step (1), build a recurrent neural network; define the hidden layer of the network as The input data is Then, the input data of step t is x t , and the calculation result obtained in step t is h t , then: ht=φ(xt·Wx+ht-1·Wh+b) (3)h t =φ(x t ·W x +h t-1 ·W h +b) (3) 其中,是输入数据的权重矩阵(weight matrix),是隐层的权重矩阵,是偏置参数(bias);in, is the weight matrix of the input data, is the weight matrix of the hidden layer, is the bias parameter (bias); 即隐状态表示为函数:That is, the hidden state is expressed as a function: ht=f(ht-1,xt) (4)h t =f(h t-1 ,x t ) (4) 在这基础上,定义遗忘门为:On this basis, the forget gate is defined as: ft=σ(Wf·[ht-1,xt]+bf) (5)f t =σ(W f ·[h t-1 ,x t ]+b f ) (5) 输入门为:The input gate is: it=σ(Wi·[ht-1,xt]+bi) (6)i t =σ(W i ·[h t-1 ,x t ]+b i ) (6) 输出门为:The output gate is: ot=σ(Wo[ht-1,xt]+bo) (7)o t =σ(W o [h t-1 ,x t ]+b o ) (7) 记忆单元的更新为:The update of the memory unit is: 隐层的更新为:The update of the hidden layer is: ht=Ot·tanh(Ct) (10)h t =O t ·tanh(C t ) (10) 其中,Wf,Wi,Wo,分别表示遗忘门,输入门,输出门和记忆单元的权重矩阵,bf,bi,bo,则是对应的偏置参数;σ()为一个非线性的激活函数;f( )表示一个包含各层参数的抽象神经网络函数,定义循环神经网络对应的参数为WN,从[-α,α]的均匀分布中对每个元素进行初始化,其中,α是为一个超参数,设定范围为0.01到1;Among them, W f ,W i ,W o , represent the weight matrix of forget gate, input gate, output gate and memory unit respectively, b f , b i , b o , is the corresponding bias parameter; σ() is a nonlinear activation function; f( ) represents an abstract neural network function including parameters of each layer, and defines the corresponding parameter of the recurrent neural network as W N , from [-α, Initialize each element in the uniform distribution of α], where α is a hyperparameter with a setting range of 0.01 to 1; 双向循环神经网络同时使用正向的循环神经网络和反向的循环神经网络进行计算;其中正向循环神经网络根据序列的顺序依次将之前步骤提取的网格特征输入,反向循环神经网络则将序列逆序后输入网格特征;定义它的隐变量为正向和反向网络的拼接,即: The two-way cyclic neural network uses both the forward cyclic neural network and the reverse cyclic neural network for calculation; the forward cyclic neural network sequentially inputs the grid features extracted in the previous steps according to the order of the sequence, and the reverse cyclic neural network uses Enter the grid feature after the sequence is reversed; define its hidden variable as the splicing of the forward and reverse networks, namely: 步骤(2),将历史轨迹数据中提取的特征,即空间特征Vsp,时间特征Vsp,驾驶状态特征Vdri,历史上短时间和长时间的交通状态特征Vshort和Vlong,拼接成一个统一的特征向量:In step (2), the features extracted from the historical trajectory data, namely the spatial feature V sp , the temporal feature V sp , the driving state feature V dri , and the historical short-term and long-term traffic state features V short and V long , are spliced into A uniform eigenvector: V=(Vsp,Vsp,Vdri,Vshort,Vlong) (11)V=(V sp , V sp , V dri , V short , V long ) (11) 在每一个经过的小网格输入到双向循环神经网络,以得到经过该网格的通过时间,即WT·hi+b,总的行程的时间开销为:Input each passing small grid to the two-way cyclic neural network to obtain the passing time through the grid, that is, W T h i + b, the total travel time overhead for: 为计算总时间开销的权重矩阵和偏置参数,WT表示W矩阵的转置; To calculate the weight matrix and bias parameters of the total time overhead, W T represents the transposition of the W matrix; 步骤(3),定义轨迹经过各个网格序列的真实时间开销向量为T;顺序的真实时间开销向量为Tf,逆序的真实时间开销向量为Tb;则神经网络估计得到的时间开销向量为:Step (3), define the real time cost vector of the trajectory passing through each grid sequence as T; the real time cost vector of sequence is T f , the real time cost vector of reverse order is T b ; then the time cost vector estimated by the neural network is : 使用双向区间损失函数对模型进行辅助监督学习,使其不仅学习整条路径的时间开销,同时可以学习各个中间阶段的通行时间;定义双向区间损失函数为:Use the two-way interval loss function to assist the model in supervised learning, so that it can not only learn the time cost of the entire path, but also learn the transit time of each intermediate stage; define the two-way interval loss function as: 其中,M表示轨迹是否经过小区域的掩码,[]表示向量每个元素间的操作;Among them, M indicates whether the trajectory passes through the mask of a small area, and [] indicates the operation between each element of the vector; 步骤(4),训练的目标是最小化损失函数L,即:Step (4), the training goal is to minimize the loss function L, namely: 其中,θ表示模型的训练参数,ε表示时间和空间上的嵌入向量,S是训练集的大小;最后,使用基于时间顺序的反向传播算法对模型进行参数的更新和优化;Among them, θ represents the training parameters of the model, ε represents the embedding vector in time and space, and S is the size of the training set; finally, the parameters of the model are updated and optimized using the backpropagation algorithm based on time sequence; (三)预测阶段的具体步骤为:(3) The specific steps in the prediction stage are as follows: 步骤(1),给定一条没有时间戳标记的真实行程作为查询路径,根据经过的实际路径,得到其映射的网格序列;对于每一个经过的小网格,使用特征提取和表示阶段抽取得到的时空特征Vsp和Vtp,驾驶状态特征Vdri,和历史上短时间和长时间的交通状态特征Vshort和Vlong,作为该网格的总特征表示V;其中,时空特征的嵌入向量使用经过训练过程的参数更新后的向量信息;短时间和长时间的交通状态特征使用经过训练的子循环神经网络进行特征挖掘;Step (1), given a real itinerary without a time stamp as the query path, according to the actual path passed, the grid sequence mapped to it is obtained; for each passing small grid, the feature extraction and representation stages are used to obtain Spatio-temporal features V sp and V tp , driving state features V dri , and historical short-term and long-term traffic state features V short and V long , as the total feature representation V of the grid; among them, the embedding vector of spatio-temporal features Use the vector information updated by the parameters of the training process; the short-term and long-term traffic state features use the trained sub-cycle neural network for feature mining; 步骤(2),在每一个经过的网格,将抽取的各方面特征输入到已经经过训练的双向循环神经网络中,得到当前的隐变量ht,那么经过当前区域的估计时间为WT·ht+b,而总的时间开销估计值为:In step (2), in each passing grid, input all aspects of the extracted features into the trained bidirectional cyclic neural network to obtain the current hidden variable h t , then the estimated time to pass through the current area is W T · h t +b, and the estimated total time cost is: 其中,n表示经过的总网格数目,为经过训练得到的,计算总时间开销的权重矩阵和偏置参数,WT表示W矩阵的转置。Among them, n represents the total number of grids passed, is the weight matrix and bias parameters obtained through training to calculate the total time overhead, and W T represents the transpose of the W matrix.
CN201810658375.5A 2018-06-25 2018-06-25 A Travel Time Estimation Method Based on Assisted Supervised Learning Expired - Fee Related CN109035761B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810658375.5A CN109035761B (en) 2018-06-25 2018-06-25 A Travel Time Estimation Method Based on Assisted Supervised Learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810658375.5A CN109035761B (en) 2018-06-25 2018-06-25 A Travel Time Estimation Method Based on Assisted Supervised Learning

Publications (2)

Publication Number Publication Date
CN109035761A true CN109035761A (en) 2018-12-18
CN109035761B CN109035761B (en) 2021-06-04

Family

ID=64611043

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810658375.5A Expired - Fee Related CN109035761B (en) 2018-06-25 2018-06-25 A Travel Time Estimation Method Based on Assisted Supervised Learning

Country Status (1)

Country Link
CN (1) CN109035761B (en)

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109783771A (en) * 2019-01-22 2019-05-21 清华大学 Track sets are converted to processing method, device and the storage medium of image array
CN109799533A (en) * 2018-12-28 2019-05-24 中国石油化工股份有限公司 A kind of method for predicting reservoir based on bidirectional circulating neural network
CN110232169A (en) * 2019-05-09 2019-09-13 北京航空航天大学 Track denoising method based on two-way length memory models and Kalman filtering in short-term
CN110488835A (en) * 2019-08-28 2019-11-22 北京航空航天大学 A kind of unmanned systems intelligence local paths planning method based on double reverse transmittance nerve networks
CN110675632A (en) * 2019-11-11 2020-01-10 重庆邮电大学 Vehicle short-time trajectory prediction control method aiming at multi-feature space and data sparseness
CN110942211A (en) * 2019-12-12 2020-03-31 武汉中海庭数据技术有限公司 Prediction arrival time prediction method and device based on deep neural network
CN111047107A (en) * 2019-12-23 2020-04-21 北京百度网讯科技有限公司 Road traffic time prediction method, device, electronic equipment and storage medium
CN111339156A (en) * 2020-02-07 2020-06-26 京东城市(北京)数字科技有限公司 Long-term determination method and device of business data and computer readable storage medium
CN111721306A (en) * 2019-03-20 2020-09-29 北京嘀嘀无限科技发展有限公司 Road matching method and device, electronic equipment and readable storage medium
CN111738500A (en) * 2020-06-11 2020-10-02 大连海事大学 A deep learning-based navigation time prediction method and device
CN112017436A (en) * 2020-09-09 2020-12-01 中国科学院自动化研究所 Method and system for predicting urban traffic travel time
CN112101132A (en) * 2020-08-24 2020-12-18 西北工业大学 Traffic condition prediction method based on graph embedding model and metric learning
CN112185148A (en) * 2019-07-03 2021-01-05 罗伯特·博世有限公司 Method for evaluating possible trajectories
CN112859898A (en) * 2021-01-18 2021-05-28 中山大学 Aircraft trajectory prediction method based on two-channel bidirectional neural network
CN114445577A (en) * 2021-12-29 2022-05-06 武汉中海庭数据技术有限公司 Method and system for calculating estimated time of arrival based on graph network
CN114638395A (en) * 2022-01-13 2022-06-17 中国科学院大学 Method, system, computer equipment and storage medium for predicting vehicle travel time
CN115394088A (en) * 2022-10-31 2022-11-25 江苏博宇鑫信息科技股份有限公司 Crossing traffic time prediction method based on space-time attention mechanism
CN116542438A (en) * 2023-03-28 2023-08-04 大连海事大学 Bus passenger starting and stopping point estimation and repair method based on non-reference real phase
CN116612631A (en) * 2023-04-01 2023-08-18 复旦大学 Track reduction system based on urban traffic network

Citations (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103280110A (en) * 2013-06-08 2013-09-04 北京云星宇交通工程有限公司 Method and device for predicting expressway travel time
CN104157142A (en) * 2014-08-27 2014-11-19 河海大学 Urban path travel time forecasting method based on floating vehicle data
CN104269059A (en) * 2014-10-15 2015-01-07 河海大学 City path travel time forecasting method based on multi-source data fusion
CN104299442A (en) * 2014-10-15 2015-01-21 河海大学 Urban route travel time forecasting method based on pattern matching
CN104408958A (en) * 2014-11-11 2015-03-11 河海大学 Urban dynamic route travel time predication method
CN104615897A (en) * 2015-02-13 2015-05-13 北方工业大学 Road section travel time estimation method based on low-frequency GPS data
CN105006147A (en) * 2015-06-19 2015-10-28 武汉大学 Road segment travel time deducing method based on road space-time incidence relation
CN105679021A (en) * 2016-02-02 2016-06-15 重庆云途交通科技有限公司 Travel time fusion prediction and query method based on traffic big data
CN106023588A (en) * 2016-06-15 2016-10-12 重庆云途交通科技有限公司 Traffic big data-based travel time extraction, prediction and query method
CN106096767A (en) * 2016-06-07 2016-11-09 中国科学院自动化研究所 A kind of link travel time prediction method based on LSTM
CN106156531A (en) * 2016-08-04 2016-11-23 复旦大学 Travel time estimation method based on low sample history track
CN106940930A (en) * 2016-11-01 2017-07-11 深圳市城市交通规划设计研究中心有限公司 Motorway journeys time prediction system and Forecasting Methodology
CN106981198A (en) * 2017-05-24 2017-07-25 北京航空航天大学 Deep learning network model and its method for building up for predicting travel time
CN107085943A (en) * 2015-12-23 2017-08-22 青岛海信网络科技股份有限公司 A kind of road travel time short term prediction method and system
CN107480786A (en) * 2017-08-07 2017-12-15 复旦大学 Recognition with Recurrent Neural Network track likelihood probability computational methods based on output state limitation
CN107688556A (en) * 2017-07-24 2018-02-13 中山大学 A kind of real-time travel time computation method based on function type principal component analysis
CN107945507A (en) * 2016-10-13 2018-04-20 腾讯科技(深圳)有限公司 Travel Time Estimation Method and device
CN108073941A (en) * 2016-11-17 2018-05-25 江南大学 A kind of image, semantic generation method based on deep learning
KR20180064692A (en) * 2016-12-06 2018-06-15 현대자동차주식회사 A navigator, a vehicle the navigator is installed in, a system for determining a route, a method for controlling the same
WO2018109516A1 (en) * 2016-12-13 2018-06-21 日産自動車株式会社 Automatic driving assistance method and device

Patent Citations (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103280110A (en) * 2013-06-08 2013-09-04 北京云星宇交通工程有限公司 Method and device for predicting expressway travel time
CN104157142A (en) * 2014-08-27 2014-11-19 河海大学 Urban path travel time forecasting method based on floating vehicle data
CN104269059A (en) * 2014-10-15 2015-01-07 河海大学 City path travel time forecasting method based on multi-source data fusion
CN104299442A (en) * 2014-10-15 2015-01-21 河海大学 Urban route travel time forecasting method based on pattern matching
CN104408958A (en) * 2014-11-11 2015-03-11 河海大学 Urban dynamic route travel time predication method
CN104615897A (en) * 2015-02-13 2015-05-13 北方工业大学 Road section travel time estimation method based on low-frequency GPS data
CN105006147A (en) * 2015-06-19 2015-10-28 武汉大学 Road segment travel time deducing method based on road space-time incidence relation
CN107085943A (en) * 2015-12-23 2017-08-22 青岛海信网络科技股份有限公司 A kind of road travel time short term prediction method and system
CN105679021A (en) * 2016-02-02 2016-06-15 重庆云途交通科技有限公司 Travel time fusion prediction and query method based on traffic big data
CN106096767A (en) * 2016-06-07 2016-11-09 中国科学院自动化研究所 A kind of link travel time prediction method based on LSTM
CN106023588A (en) * 2016-06-15 2016-10-12 重庆云途交通科技有限公司 Traffic big data-based travel time extraction, prediction and query method
CN106156531A (en) * 2016-08-04 2016-11-23 复旦大学 Travel time estimation method based on low sample history track
CN107945507A (en) * 2016-10-13 2018-04-20 腾讯科技(深圳)有限公司 Travel Time Estimation Method and device
CN106940930A (en) * 2016-11-01 2017-07-11 深圳市城市交通规划设计研究中心有限公司 Motorway journeys time prediction system and Forecasting Methodology
CN108073941A (en) * 2016-11-17 2018-05-25 江南大学 A kind of image, semantic generation method based on deep learning
KR20180064692A (en) * 2016-12-06 2018-06-15 현대자동차주식회사 A navigator, a vehicle the navigator is installed in, a system for determining a route, a method for controlling the same
WO2018109516A1 (en) * 2016-12-13 2018-06-21 日産自動車株式会社 Automatic driving assistance method and device
CN106981198A (en) * 2017-05-24 2017-07-25 北京航空航天大学 Deep learning network model and its method for building up for predicting travel time
CN107688556A (en) * 2017-07-24 2018-02-13 中山大学 A kind of real-time travel time computation method based on function type principal component analysis
CN107480786A (en) * 2017-08-07 2017-12-15 复旦大学 Recognition with Recurrent Neural Network track likelihood probability computational methods based on output state limitation

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
DONG WANG, JUNBO ZHANG, WEI CAO, JIAN LI, YU ZHENG: "WhenWill You Arrive? Estimating Travel Time Based on Deep Neural Networks", 《MICROSOFT》 *
XIAOLEI MA,ZHIMIN TAO, YINHAI WANG, HAIYANG YU: "Long short-term memory neural network for traffic speed", 《TRANSPORTATION RESEARCH PART C》 *
YANGDONG LIU: "Short-term travel time prediction by deep learning: A comparison of different LSTM-DNN models", 《2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC)》 *
张发明,朱欣焰,呙维,胡涛: "利用浮动车大数据进行稀疏路段行程时间推断", 《武汉大学学报· 信息科学版》 *
杨庆芳,韦学武,林赐云,李志林,刘翔宇: "基于时空贝叶斯模型的行程时间可靠性预测", 《华南理工大学学报( 自然科学版)》 *

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109799533A (en) * 2018-12-28 2019-05-24 中国石油化工股份有限公司 A kind of method for predicting reservoir based on bidirectional circulating neural network
CN109783771A (en) * 2019-01-22 2019-05-21 清华大学 Track sets are converted to processing method, device and the storage medium of image array
CN111721306B (en) * 2019-03-20 2022-07-05 北京嘀嘀无限科技发展有限公司 Road matching method and device, electronic equipment and readable storage medium
CN111721306A (en) * 2019-03-20 2020-09-29 北京嘀嘀无限科技发展有限公司 Road matching method and device, electronic equipment and readable storage medium
CN110232169A (en) * 2019-05-09 2019-09-13 北京航空航天大学 Track denoising method based on two-way length memory models and Kalman filtering in short-term
CN110232169B (en) * 2019-05-09 2022-01-04 北京航空航天大学 Track denoising method based on bidirectional long-time and short-time memory model and Kalman filtering
CN112185148A (en) * 2019-07-03 2021-01-05 罗伯特·博世有限公司 Method for evaluating possible trajectories
CN110488835A (en) * 2019-08-28 2019-11-22 北京航空航天大学 A kind of unmanned systems intelligence local paths planning method based on double reverse transmittance nerve networks
CN110675632A (en) * 2019-11-11 2020-01-10 重庆邮电大学 Vehicle short-time trajectory prediction control method aiming at multi-feature space and data sparseness
CN110675632B (en) * 2019-11-11 2021-11-30 重庆邮电大学 Vehicle short-time trajectory prediction control method aiming at multi-feature space and data sparseness
CN110942211A (en) * 2019-12-12 2020-03-31 武汉中海庭数据技术有限公司 Prediction arrival time prediction method and device based on deep neural network
CN111047107A (en) * 2019-12-23 2020-04-21 北京百度网讯科技有限公司 Road traffic time prediction method, device, electronic equipment and storage medium
CN111047107B (en) * 2019-12-23 2022-05-10 北京百度网讯科技有限公司 Highway transit time prediction method, device, electronic device and storage medium
CN111339156A (en) * 2020-02-07 2020-06-26 京东城市(北京)数字科技有限公司 Long-term determination method and device of business data and computer readable storage medium
CN111339156B (en) * 2020-02-07 2023-09-26 京东城市(北京)数字科技有限公司 Method, apparatus and computer readable storage medium for long-term determination of business data
CN111738500A (en) * 2020-06-11 2020-10-02 大连海事大学 A deep learning-based navigation time prediction method and device
CN111738500B (en) * 2020-06-11 2024-01-12 大连海事大学 A deep learning-based navigation time prediction method and device
CN112101132A (en) * 2020-08-24 2020-12-18 西北工业大学 Traffic condition prediction method based on graph embedding model and metric learning
CN112017436A (en) * 2020-09-09 2020-12-01 中国科学院自动化研究所 Method and system for predicting urban traffic travel time
CN112017436B (en) * 2020-09-09 2021-09-28 中国科学院自动化研究所 Method and system for predicting urban traffic travel time
CN112859898A (en) * 2021-01-18 2021-05-28 中山大学 Aircraft trajectory prediction method based on two-channel bidirectional neural network
CN112859898B (en) * 2021-01-18 2022-03-22 中山大学 Aircraft trajectory prediction method based on two-channel bidirectional neural network
CN114445577A (en) * 2021-12-29 2022-05-06 武汉中海庭数据技术有限公司 Method and system for calculating estimated time of arrival based on graph network
CN114445577B (en) * 2021-12-29 2025-04-29 武汉中海庭数据技术有限公司 A method and system for calculating estimated arrival time based on graph network
CN114638395A (en) * 2022-01-13 2022-06-17 中国科学院大学 Method, system, computer equipment and storage medium for predicting vehicle travel time
CN115394088A (en) * 2022-10-31 2022-11-25 江苏博宇鑫信息科技股份有限公司 Crossing traffic time prediction method based on space-time attention mechanism
CN116542438A (en) * 2023-03-28 2023-08-04 大连海事大学 Bus passenger starting and stopping point estimation and repair method based on non-reference real phase
CN116612631A (en) * 2023-04-01 2023-08-18 复旦大学 Track reduction system based on urban traffic network

Also Published As

Publication number Publication date
CN109035761B (en) 2021-06-04

Similar Documents

Publication Publication Date Title
CN109035761A (en) Travel time estimation method based on back-up surveillance study
Ali et al. Leveraging spatio-temporal patterns for predicting citywide traffic crowd flows using deep hybrid neural networks
CN110491129A (en) The traffic flow forecasting method of divergent convolution Recognition with Recurrent Neural Network based on space-time diagram
Xu et al. AGNP: Network-wide short-term probabilistic traffic speed prediction and imputation
Zhang et al. Predicting citywide crowd flows using deep spatio-temporal residual networks
CN106971547B (en) A Short-Term Traffic Flow Prediction Method Considering Spatio-temporal Correlations
CN110709908B (en) Computer system and method for state prediction of a transportation system
JP7625140B2 (en) Distributed Multi-Task Machine Learning for Traffic Forecasting
CN110414747B (en) Space-time long-short-term urban pedestrian flow prediction method based on deep learning
CN107230351B (en) A short-term traffic flow prediction method based on deep learning
Chen et al. A multiscale-grid-based stacked bidirectional GRU neural network model for predicting traffic speeds of urban expressways
CN104408913B (en) A kind of traffic flow three parameter real-time predicting method considering temporal correlation
CN112785077A (en) Travel demand prediction method and system based on space-time data
Liu et al. A method for short-term traffic flow forecasting based on GCN-LSTM
Mikluščák et al. Using neural networks for route and destination prediction in intelligent transport systems
Song et al. Learn travel time distribution with graph deep learning and generative adversarial network
Awan et al. A novel deep stacking-based ensemble approach for short-term traffic speed prediction
CN113628455B (en) Intersection signal optimization control method considering number of people in vehicle under Internet of vehicles environment
CN113159371B (en) Unknown target feature modeling and demand prediction method based on cross-modal data fusion
Ma et al. Short-term subway passenger flow prediction based on GCN-BiLSTM
CN115565388A (en) Traffic light control method based on multi-channel vehicle detection and three-dimensional feature labeling
Miao et al. A queue hybrid neural network with weather weighted factor for traffic flow prediction
Elmi et al. Travel time prediction in missing data areas: feature-based transfer learning approach
Khodabandelou et al. Attention-based gated recurrent unit for links traffic speed forecasting
Shi et al. Queue length prediction method of short-linked intersections based on long short-term memory

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20210604

CF01 Termination of patent right due to non-payment of annual fee