Nothing Special   »   [go: up one dir, main page]

CN109035761A - Travel time estimation method based on back-up surveillance study - Google Patents

Travel time estimation method based on back-up surveillance study Download PDF

Info

Publication number
CN109035761A
CN109035761A CN201810658375.5A CN201810658375A CN109035761A CN 109035761 A CN109035761 A CN 109035761A CN 201810658375 A CN201810658375 A CN 201810658375A CN 109035761 A CN109035761 A CN 109035761A
Authority
CN
China
Prior art keywords
time
neural network
grid
vector
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810658375.5A
Other languages
Chinese (zh)
Other versions
CN109035761B (en
Inventor
孙未未
章瀚元
吴昊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fudan University
Original Assignee
Fudan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fudan University filed Critical Fudan University
Priority to CN201810658375.5A priority Critical patent/CN109035761B/en
Publication of CN109035761A publication Critical patent/CN109035761A/en
Application granted granted Critical
Publication of CN109035761B publication Critical patent/CN109035761B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0129Traffic data processing for creating historical data or processing based on historical data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention belongs to field of intelligent transportation technology, specially a kind of travel time estimation method based on back-up surveillance study.It finds statistical law from magnanimity historical trajectory data, carries out whole estimation by time of the deep learning model end to end to entire stroke;Step includes: feature extraction and expression stage, is pre-processed to track data, extracts its time and space characteristics, driving condition feature, short time and prolonged traffic condition feature respectively;These features extracted are trained and are predicted with unified bidirectional circulating neural network by trained and forecast period;The each step of Recognition with Recurrent Neural Network all exports the time overhead by current area domain;The summation of the time overhead of these zonules is the time overhead of total path.Meanwhile introducing two-way section loss function also to constrain interlude expense.This method can efficiently and accurately estimate the vehicle travel time in city there is preferable effect under practical circumstances.

Description

Travel time estimation method based on auxiliary supervised learning
Technical Field
The invention belongs to the technical field of intelligent transportation, and particularly relates to a travel time estimation method based on auxiliary supervised learning.
Background
Travel time estimation is an essential important technology in the field of urban traffic, can provide help for travel and commute of people, and can also provide support for government planning decisions. However, this is not a simple and small problem, but is influenced by various dynamic factors, such as traffic dynamics, intersection conditions, changes in driver driving behavior, and historical periodic data evolution. These factors contribute to uncertainty and difficulty in the travel time estimate. With the development and popularization of mobile devices supporting GPS, a great amount of trajectory data is continuously generated and covers all corners of a city. With the massive historical track data, the internal rules behind the data can be mined, and the period and the trend of the change of the travel time are learned by constructing an algorithm model, so that the time overhead required by the current query track can be accurately inferred.
Most of the existing methods adopt a divide-and-conquer (divide-and-conquer) method, mainly by decomposing a path into a series of segments or sub-paths.
(1) Single-road-segment-based methods:
the method based on the single road section mainly estimates the average speed of each single road section when the track passes through, calculates the passing average time cost according to the length of the road section, and finally accumulates the time sum of each road section to obtain the total time. This approach does not take into account the intersection time overhead between road segments. In addition, such estimation relies heavily on high quality velocity data, which is often not available in the trajectory data.
(2) The method based on the sub-path comprises the following steps:
the sub-path based approach mainly divides the path into a series of sub-paths, so that the time overhead of the intersection is also considered. The main idea is to splice and mine rich public sub-path information in historical data. Although this approach may overcome many of the drawbacks of the single-route approach, it is still based on heuristic design rather than directly targeting travel time as an algorithm optimization.
In summary, the methods available today fail to achieve satisfactory accuracy for two reasons. One is that they do not see the path as a whole, but split into sub-blocks. During this splitting process, much useful information is lost. And they do not take full advantage of the intermediate surveillance tags that are unique to the trace data, i.e., the time stamp information for each intermediate GPS sample point. On the other hand, with the development and prosperity of deep learning technology, more problems can be solved in an end-to-end integrated manner, and the method is more efficient compared with the traditional heuristic model. In addition, deep learning has strong characterization capability, and compared with a manual model, the deep learning can capture more potential features and can process various complex dynamics in the travel estimation problem.
Disclosure of Invention
The invention aims to provide a travel time estimation method based on a history track of auxiliary supervision learning aiming at the limitations of the traditional two types of travel time estimation technologies so as to overcome the defects of the prior art.
The method searches for a statistical rule from massive historical track data, and integrally estimates the time of the whole travel through an end-to-end deep learning model. The method comprises the following basic steps: a characteristic extraction and representation stage, wherein the track data is preprocessed, and various characteristics of the track data are respectively extracted; training and predicting stage, training and predicting the extracted features with one unified bidirectional circulating neural network; the time cost of passing through the current small area is output by each step of the recurrent neural network; the sum of the time overhead of the small areas is the time overhead of the total path; for more efficient training, a two-way interval loss function is also introduced to constrain the intermediate time overhead.
The invention provides a travel time estimation method of a historical track based on auxiliary supervised learning, which comprises the following three stages:
and (I) a characteristic extraction and representation stage, namely preprocessing historical track data and extracting various characteristics (including time characteristics, space characteristics, driving state characteristics, short-time and long-time traffic condition characteristics and the like) of the historical track data. The method comprises the following specific steps:
and (1) carrying out fine-grained division on the grids according to longitude and latitude coordinates in the urban range to form adjacent rectangular small areas. And mapping each coordinate point in the track sequence consisting of the GPS coordinates into a corresponding small area according to the chronological sequence to form a sequence consisting of grid coordinates. For the condition that the adjacent track points are far away and fall into discontinuous small areas, the intermediate passing path can be obtained by algorithms such as map matching and the like, and the discontinuous area information is supplemented.
And (2) mining the characteristics of different aspects of each grid. First, latent semantic information is mined using an embedded vector technique. The embedded vector technology is widely used in the fields of natural language processing, social networks and the like, mainly utilizes low-dimensional real number vectors to represent semantic information of each word or thing, and measures the corresponding relation between real objects through the distance relation in a vector space. The invention utilizes an embedded vector technology to represent semantic information of each grid small region in different spaces and different time periods. The information includes spatial location information of different functional areas (such as residential areas, commercial areas or industrial areas, etc.) of the city, and also includes time information of morning rush hours, weekends, etc. In particular, the space vector V of each mesh is represented by a low-dimensional vectorspDividing a day into a plurality of time buckets (e.g., one bucket per hour), each track obtaining a time vector V according to a specific falling time buckettp. To VspAnd VtpA random initialization is performed followed by training with the model as it is trained.
And (3) when a driver drives the vehicle, the driving speed and the driving behavior can be changed in different driving states. For example, the vehicle may be more inclined to travel on a road or overhead in the middle of the travel path, where the speed may be faster. When the vehicle starts or arrives at the end, the speed is often reduced because the vehicle travels on a small road or in an area with many people. In particular, a four-dimensional vector V is useddriTo indicate whether the current driving phase is a departure phase, a midway phase, or an end phase, and the proportion of driving that has been performed in each phase. For example, VdriThe value of (1,0,0,0.2) indicates that the driver is driving at the beginning stage, and accounts for 20% of the total travel.
And (4) the traffic condition in one area is changed periodically and regularly along with the time evolution. For example, if a road segment is very congested from 8 o 'clock to 8 o' clock, 8 o 'clock and 35 o' clockIt may also be very clogged. That is, the traffic condition information in the past in a short time is useful for predicting the current traffic state. Defining the short-time traffic condition as Vshort. Meanwhile, the long-time periodic traffic condition change can also help to predict the current traffic condition, such as the change rule of the traffic condition on weekdays and weekends. Defining the long-term traffic condition as Vlong. In particular, the present invention relates to a method for producing,
defining:
indicating the current small area g in the past jth time intervaliIn which v isjRepresenting historical average speed, njIndicates the amount of historical track data, leni/vjRepresenting the roughly estimated transit time. The traffic condition features are input into a sub-cyclic neural network according to the historical time sequence, and the traffic condition features can be extracted.
In addition, since the historical data are unevenly distributed in different spatial regions, the tracks of some regions pass through a small number of regions, which may affect the accuracy of estimation. To solve this data sparseness problem, traffic condition information of neighboring small areas is also taken into account, i.e., traffic condition information of neighboring small areas is taken into account
Defining:
represents the distance giAnd collecting the traffic condition characteristics of the grids with the distance not exceeding d in the past short time, and inputting the collected traffic condition characteristics into the neural network together. Wherein x, y represent the coordinates of the grid, gjRepresents except giOther meshes than mesh.
(II) a training stage, inputting the features extracted from the historical track data into a uniform bidirectional recurrent Neural network (bidirectional LSTM, reference: Graves A, Schmidhuber J. frame with photonic classification with bidirectional LSTM and other Neural network architectures [ J ]. Neural Networks,2005,18(5-6): 602-610) for training, and taking a bidirectional interval loss function as the constraint of training; the method comprises the following specific steps:
and (1) constructing a recurrent neural network. Defining a network hidden layer asInput data asThen, the input data of the t step is xtAnd the calculation result obtained in the t step is htThen, there are:
ht=φ(xt·Wx+ht-1·Wh+b) (3)
wherein,is a weight matrix (weight matrix) of the input data,is a weight matrix of the hidden layer(s),is the bias parameter (bias). Phi denotes a non-linear activation function, which may be a sigmoid function, a ReLU function, a tanh function, etc.
That is, the hidden state can be expressed as a function:
ht=f(ht-1,xt) (4)
on this basis, define forgetting the door to be:
ft=σ(Wf·[ht-1,xt]+bf) (5)
the input gates are:
it=σ(Wi·[ht-1,xt]+bi) (6)
the output gate is:
ot=σ(Wo[ht-1,xt]+bo) (7)
the memory cell is updated by:
the hidden layer is updated as follows:
ht=Ot·tanh(Ct) (10)
wherein, Wf、Wi、WoWeight matrices representing forgetting gate, input gate, output gate and memory cell, respectively, bf、bi、boThen it is the corresponding bias parameter. σ () is a non-linear activation function, e.g.Is a function of the sigmoid and is,is a hyperbolic tangent function, and f () represents an abstract neural network function containing parameters of each layer. Defining the corresponding parameter of the recurrent neural network as WNfrom [ - α, α ]]Is uniformly dividedeach weight parameter in the recurrent neural network is initialized in the method, wherein α is a hyperparameter and is set to be in a range of 0.01 to 1.
The bidirectional recurrent neural network performs calculation by using a forward recurrent neural network and a reverse recurrent neural network simultaneously. The forward circulation neural network inputs the grid features extracted in the previous step according to the sequence of the sequence, and the reverse circulation neural network inputs the grid features after the sequence is in the reverse order. This has the advantage that the neural network can be made to observe the position distances of the current mesh from the start point and the end point simultaneously, thereby having an overall characteristic. Defining its hidden variables as a concatenation of forward and reverse networksWhereinRepresents a hidden layer of the forward recurrent neural network,representing a hidden layer of the inverse recurrent neural network.
Step (2), extracting the characteristics, namely the spatial characteristics, in the historical track dataTemporal characteristicsDriving state characteristicsHistorically short and long term traffic status characteristicsAndare spliced into aOne unified feature vector:
at each small grid passing through, the two-way recurrent neural network is input to obtain the passing time of the grid, namely WT·hi+ b. Time overhead of total tripComprises the following steps:
definition ofRespectively, a weight matrix and bias parameters for calculating the total time overhead. WTRepresenting the transpose of the W matrix.
And (3) defining the real time overhead vector of the track passing through each grid sequence as T. The sequential real time overhead vector is TfThe real time overhead vector of the reverse order is Tb. The time overhead vector estimated by the neural network is:
and the model is subjected to auxiliary supervised learning by using the bidirectional interval loss function, so that the time overhead of the whole path can be learned, and the transit time of each intermediate stage can be learned. The two-way interval loss function is defined as:
where M denotes whether a trajectory passes through a mask of a small region, [ ] denotes an operation between each element of the vector.
Step (4), the training goal is to minimize the loss function L, i.e.:
where θ represents the training parameters of the model, ε represents the embedding vector in time and space, and S is the size of the training set. Finally, the model is subjected to parameter updating and optimization by using a time sequence-based back propagation algorithm. Back propagation algorithm references: chauvin Y, Rumelhart D E.Backproperation the term, architecures, and applications [ M ]. Psychology Press,2013.
A prediction stage, deducing the features extracted from the query path by using a bidirectional cyclic neural network and estimating the travel time; the method comprises the following specific steps:
and (1) giving a real travel without a timestamp mark as a query path, and obtaining a grid sequence mapped by the real travel according to a passed actual path. For each small grid passing through, using space-time characteristics V obtained by characteristic extraction and expression stage extractionspAnd VtpDriving state characteristic VdriAnd historical short and long term traffic condition characteristics VshortAnd VlongV is indicated as a total feature of the mesh. Wherein the embedded vector of the spatio-temporal features uses vector information updated by parameters of a training process. Short-term and long-term traffic state features are feature mined using a trained sub-cyclic neural network.
Step (2), inputting the extracted various aspects of characteristics into the trained pairs in each gridObtaining the current hidden variable h in the cyclic neural networktThen the estimated time to pass through the current region is WT·ht+ b. The total time overhead estimate is:
where n represents the total number of grids passed,for the trained results, a weight matrix and bias parameters for the total time overhead are calculated. WTRepresenting the transpose of the W matrix.
Overall, the process of the invention has several advantages. Firstly, an end-to-end (end-to-end) deep learning method based on historical data training is utilized to directly learn the characteristics of the whole path and estimate the whole transit time. A bidirectional interval loss function is defined, and the time overhead of passing through an intermediate road section can be monitored in an auxiliary mode on the basis of monitoring the whole path time. The method for introducing the auxiliary supervision enriches the sample information of the path and can ensure that the propagation signal can be more accurate when the back propagation algorithm updates the parameters. Secondly, a feature extraction structure is provided, and the passing time of the path can be effectively estimated by extracting dynamic features of different dimensions such as space-time embedded vectors, driving states, short-time and long-time traffic conditions and the like. Finally, the experimental result is better than that of the existing method through experimental verification in the actual environment.
As shown in table 1, we performed experiments with real historical trajectory data, including two cities, boehr diagram and shanghai. The existing methods such as a road section average time method, a sub-path dynamic programming method, a grid full-connection network method, a grid convolution network method and the like are used for comparison. The road section average time method obtains a result by directly accumulating the average passing time of each road section. Dynamic programming of subpaths [ Yilun Wang, Yu Zheng, and Yixing Xue. travel timing of a path using mapping project-ries in Proceedings of the 20th International Conference on Knowledge Discovery and Data Mining (SIGKDD), pages 25-34,2014 ] uses dynamic programming to find the optimal splicing method of subpaths. The grid full-connection Network method and the grid convolution Network method take an NxN whole grid as input, and respectively carry out optimization and estimation by using a full-connection Network (Multi-Layer Perceptin) and a convolution Neural Network (Convolutional Neural Network). We use three error measurement indexes of MAE, RMSE and MAPE to measure the quality of the method.
Wherein, y represents the true value,representing the estimated value, and n represents the total number of samples. As can be seen from the results in Table 1, the method of the present invention is far better than the prior comparative methods in each index. For example, on the Shanghai data set, the method of the present invention estimates MAE error to be only 126 seconds, with MAPE error of 13.3%, whereas the best method has MAE error of 168 seconds and MAPE error of 19.1%.
TABLE 1
Drawings
Fig. 1 shows a real trace sample containing time stamp information for each intermediate GPS trace point, for a total of 720 s.
Fig. 2 shows a path sample that needs to be queried, and only contains specifically-passed path information, and does not contain any timestamp information.
Detailed Description
The following describes the specific implementation process of the present invention with reference to specific examples:
the historical trajectories as in fig. 1 are used for training and the travel times in fig. 2 are estimated.
The method comprises a preprocessing stage and a feature extraction and representation stage, wherein the track data are preprocessed to extract various features of the track data. Taking fig. 1 as an example, the specific steps are as follows:
(1) in the city range, fine-grained grid division is carried out and divided into small areas which are adjacent to each other. As in fig. 1, the map is divided into 5 × 6 meshes. Mapping each coordinate point in the track sequence into a corresponding small area to form a grid sequence, namely g ═ g { (g } {1,g2,…,g10}。
(2) For each mesh, features of different aspects of it are mined. For example, for g1Using random vectorsAndto represent spatiotemporal semantic information. Namely:
second, using a four-dimensional vectorTo indicate whether the current driving phase is a departure phase, a midway phase, or an end phase, and the proportion of the driving that has been performed in each phase, namely:
finally, the past short and long time traffic condition information is used to predict the current traffic condition characteristicsAndin particular, defineFor the past 1 st to 6 th time interval (5min), the current region g1The traffic situation of (1). For exampleIt is shown that the historical average speed is 10m/s, 8 historical tracks are in total, and the average transit time is estimated to be 20 m/s. Will be provided withInputting the traffic condition characteristics into a subcircular neural network according to historical time sequence, and finally outputting hidden vector h6As a characteristic of traffic conditions.
II, a training stage, which comprises the following specific steps:
(1) and establishing a Bi-directional circulating neural network (Bi-directional LSTM) model. And randomly initializing various parameters of the model, including matrix parameters and bias parameters of a forgetting gate, an input gate and an output gate.
(2) Extracting the feature in the historical track data, namely the spatial feature VspTime characteristic VspDriving state characteristic VdriHistorically short and long time traffic status features VshortAnd VlongAnd splicing the two into a uniform feature vector. With a grid g1For example, as
(3) At each small grid passing through, the two-way recurrent neural network is input to obtain the passing time of the grid, namely WT·hi+ b. The total travel time overhead is:
for example, W is defined as (0.1, 0.3., 0.7), and 10 mesh hidden layer variable values are defined as h1=(0.8,0.3,…,0.2),…,h10When the offset value b is equal to 0.7, (0.7, 0.4.. 0.5) then:
(4) the real time cost vector of the trajectory through each grid sequence is defined as T. The sequential real time overhead vector is TfTrue time cost vector in reverse order of (70,120, …,720) is Tb(720,640, …, 50). The time overhead vector estimated by the neural network is:
and the model is subjected to auxiliary supervised learning by using the bidirectional interval loss function, so that the time overhead of the whole path can be learned, and the transit time of each intermediate stage can be learned. The two-way interval loss function is defined as:
where M denotes whether a trajectory passes through a mask of a small region, [ ] denotes an operation between each element of the vector.
(5) Minimize the loss function L, i.e.:
where θ represents the training parameters of the model, ε represents the embedding vector in time and space, and S is the size of the training set. Finally, the model is subjected to parameter updating and optimization by using a time sequence-based back propagation algorithm.
The prediction stage comprises the following specific steps (taking fig. 2 as an example):
(1) giving a real trip without timestamp marking as query path g ═ g1,g2,…,g8And obtaining a grid sequence mapped by the actual path according to the actual path. For each small grid g of passes1~g8Using spatio-temporal features V extracted in the feature extraction and representation stagesspAnd VtpDriving state characteristic VdriAnd historical short and long term traffic condition characteristics VshortAnd VlongV is indicated as a total feature of the mesh. Wherein the embedded vector of the spatio-temporal features uses vector information updated by parameters of a training process. Short and long term traffic condition characterizationAnd performing feature mining by using the trained sub-cycle neural network.
(2) Inputting the extracted various aspects of characteristics into the trained bidirectional cyclic neural network in each passing grid to obtain the current hidden variable htThen the estimated time to pass through the current region is WT·ht+ b. The total time overhead estimated value is;
where both W and b are parameters obtained from previous training procedures.

Claims (1)

1. A travel time estimation method based on auxiliary supervised learning is characterized by comprising three stages:
the method comprises the steps of (I) feature extraction and representation, wherein historical track data are preprocessed, and various features of the historical track data are extracted;
in the training stage, the features extracted from the historical track data are input into a uniform bidirectional cyclic neural network for training, and a bidirectional interval loss function is used as the constraint of training;
a prediction stage, deducing the features extracted from the query path by using a bidirectional cyclic neural network and estimating the travel time;
the specific steps of the characteristic extraction and representation stage are as follows:
step (1), in an urban area, carrying out fine-grained division on grids according to longitude and latitude coordinates to form adjacent rectangular small areas; mapping each coordinate point in a track sequence consisting of historical GPS coordinates sequenced according to a time sequence into a corresponding small region to form a sequence consisting of grid coordinates;
step (2), for each grid, excavating the characteristics of different aspects of the grid; firstly, representing semantic information of each grid small region in different spaces and different time periods by using an embedded vector technology; the information comprises space location information of different functional areas of a city and also comprises time information such as early peak, weekend and the like; in particular, the space vector V of each mesh is represented by a low-dimensional vectorspDividing a day into a plurality of time buckets, and obtaining a time vector V according to the time bucket in which each track fallstp(ii) a To VspAnd VtpCarrying out random initialization, and then training the model along with the model during model training;
step (3) of using the four-dimensional vector VdriTo indicate whether the current driving phase is a starting phase, a midway phase or an ending phase, and the proportion of driving in each phase;
step (4), defining the short-time traffic condition characteristic as VshortDefining a long-term traffic condition characteristic as VlongIn the case of a liquid crystal display device, in particular,
defining:
indicating the current small area g in the past jth time intervaliIn which v isjRepresenting historical average speed, njIndicates the amount of historical track data, leni/vjRepresenting a coarse estimated transit time; inputting the traffic condition characteristics into a subcircular neural network according to historical time sequence for extractingTaking out the traffic condition characteristics;
in addition, traffic condition information of adjacent small areas is taken into account, i.e.
Defining:
represents the distance giCollecting the traffic condition characteristics of the grid set with the distance not exceeding d in the past short time, and inputting the traffic condition characteristics into the neural network together; wherein x, y represent the coordinates of the grid, gjRepresents except giOther meshes than the mesh;
the second training stage comprises the following specific steps:
step (1), constructing a recurrent neural network; defining a network hidden layer asInput data asThen, the input data of the t step is xtAnd the calculation result obtained in the t step is htThen, there are:
ht=φ(xt·Wx+ht-1·Wh+b) (3)
wherein,is a weight matrix (weight matrix) of the input data,is a weight matrix of the hidden layer(s),is the bias parameter (bias);
i.e. the hidden state is represented as a function:
ht=f(ht-1,xt) (4)
on this basis, define forgetting the door to be:
ft=σ(Wf·[ht-1,xt]+bf) (5)
the input gates are:
it=σ(Wi·[ht-1,xt]+bi) (6)
the output gate is:
ot=σ(Wo[ht-1,xt]+bo) (7)
the memory cell is updated by:
the hidden layer is updated as follows:
ht=Ot·tanh(Ct) (10)
wherein, Wf,Wi,Wo,Weight matrices representing forgetting gate, input gate, output gate and memory cell, respectively, bf,bi,bo,Then it is the corresponding bias parameter; σ () is a nonlinear activation function; f () represents an abstract neural network function containing parameters of each layer, and the corresponding parameter of the recurrent neural network is defined as WNfrom [ - α, α]initializing each element in the uniform distribution, wherein α is a hyper-parameter and is set to be in a range of 0.01 to 1;
the bidirectional cyclic neural network simultaneously uses the forward cyclic neural network and the reverse cyclic neural network for calculation; the forward circulation neural network inputs the grid features extracted in the previous step according to the sequence in turn and reversesInputting the grid characteristics to a recurrent neural network after reversing the sequence; its hidden variables are defined as the concatenation of the forward and reverse networks, i.e.:
step (2), extracting the feature in the historical track data, namely the spatial feature VspTime characteristic VspDriving state characteristic VdriHistorically short and long time traffic status features VshortAnd VlongAnd splicing into a unified feature vector:
V=(Vsp,Vsp,Vdri,Vshort,Vlong) (11)
at each small grid passing through, the two-way recurrent neural network is input to obtain the passing time of the grid, namely WT·hi+ b, total travel time overheadComprises the following steps:
to calculate the weight matrix and bias parameters for the total time overhead, WTRepresents the transpose of the W matrix;
step (3), defining the real time overhead vector of the track passing through each grid sequence as T; the sequential real time overhead vector is TfThe real time overhead vector of the reverse order is Tb(ii) a The time overhead vector estimated by the neural network is:
the model is subjected to auxiliary supervised learning by using a bidirectional interval loss function, so that the time overhead of the whole path can be learned, and the transit time of each intermediate stage can be learned; the two-way interval loss function is defined as:
wherein, M represents whether the track passes through the mask of a small region, and [ ] represents the operation among each element of the vector;
step (4), the goal of training is to minimize the loss function L, i.e.:
wherein, theta represents the training parameter of the model, epsilon represents the embedding vector on time and space, and S is the size of the training set; finally, updating and optimizing parameters of the model by using a time sequence-based back propagation algorithm;
the third step of the prediction stage comprises the following steps:
step (1), a real travel without a timestamp mark is given as a query path, and a grid sequence mapped by the real travel is obtained according to a passed actual path; for each small grid passing through, using space-time characteristics V obtained by characteristic extraction and expression stage extractionspAnd VtpDriving state characteristic VdriAnd historical short and long term traffic condition characteristics VshortAnd VlongV is represented as a total feature of the mesh; the embedded vector of the space-time characteristics uses vector information updated by parameters in a training process; carrying out feature mining on the traffic state features of short time and long time by using the trained sub-cycle neural network;
step (2), inputting the extracted various aspects of characteristics into the trained bidirectional cyclic neural network in each passing grid to obtain the current timeHidden variable h of fronttThen the estimated time to pass through the current region is WT·ht+ b, and the total time overhead estimate is:
where n represents the total number of grids passed,for the trained weight matrix and bias parameters, W, of the total time overheadTRepresenting the transpose of the W matrix.
CN201810658375.5A 2018-06-25 2018-06-25 Travel time estimation method based on auxiliary supervised learning Expired - Fee Related CN109035761B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810658375.5A CN109035761B (en) 2018-06-25 2018-06-25 Travel time estimation method based on auxiliary supervised learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810658375.5A CN109035761B (en) 2018-06-25 2018-06-25 Travel time estimation method based on auxiliary supervised learning

Publications (2)

Publication Number Publication Date
CN109035761A true CN109035761A (en) 2018-12-18
CN109035761B CN109035761B (en) 2021-06-04

Family

ID=64611043

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810658375.5A Expired - Fee Related CN109035761B (en) 2018-06-25 2018-06-25 Travel time estimation method based on auxiliary supervised learning

Country Status (1)

Country Link
CN (1) CN109035761B (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109783771A (en) * 2019-01-22 2019-05-21 清华大学 Track sets are converted to processing method, device and the storage medium of image array
CN109799533A (en) * 2018-12-28 2019-05-24 中国石油化工股份有限公司 A kind of method for predicting reservoir based on bidirectional circulating neural network
CN110232169A (en) * 2019-05-09 2019-09-13 北京航空航天大学 Track denoising method based on two-way length memory models and Kalman filtering in short-term
CN110488835A (en) * 2019-08-28 2019-11-22 北京航空航天大学 A kind of unmanned systems intelligence local paths planning method based on double reverse transmittance nerve networks
CN110675632A (en) * 2019-11-11 2020-01-10 重庆邮电大学 Vehicle short-time trajectory prediction control method aiming at multi-feature space and data sparseness
CN110942211A (en) * 2019-12-12 2020-03-31 武汉中海庭数据技术有限公司 Prediction arrival time prediction method and device based on deep neural network
CN111047107A (en) * 2019-12-23 2020-04-21 北京百度网讯科技有限公司 Road traffic time prediction method, device, electronic equipment and storage medium
CN111339156A (en) * 2020-02-07 2020-06-26 京东城市(北京)数字科技有限公司 Long-term determination method and device of business data and computer readable storage medium
CN111721306A (en) * 2019-03-20 2020-09-29 北京嘀嘀无限科技发展有限公司 Road matching method and device, electronic equipment and readable storage medium
CN111738500A (en) * 2020-06-11 2020-10-02 大连海事大学 Navigation time prediction method and device based on deep learning
CN112017436A (en) * 2020-09-09 2020-12-01 中国科学院自动化研究所 Method and system for predicting urban traffic travel time
CN112101132A (en) * 2020-08-24 2020-12-18 西北工业大学 Traffic condition prediction method based on graph embedding model and metric learning
CN112185148A (en) * 2019-07-03 2021-01-05 罗伯特·博世有限公司 Method for evaluating possible trajectories
CN112859898A (en) * 2021-01-18 2021-05-28 中山大学 Aircraft trajectory prediction method based on two-channel bidirectional neural network
CN115394088A (en) * 2022-10-31 2022-11-25 江苏博宇鑫信息科技股份有限公司 Crossing traffic time prediction method based on space-time attention mechanism
CN116542438A (en) * 2023-03-28 2023-08-04 大连海事大学 Bus passenger starting and stopping point estimation and repair method based on non-reference real phase

Citations (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103280110A (en) * 2013-06-08 2013-09-04 北京云星宇交通工程有限公司 Method and device for predicting expressway travel time
CN104157142A (en) * 2014-08-27 2014-11-19 河海大学 Urban path travel time forecasting method based on floating vehicle data
CN104269059A (en) * 2014-10-15 2015-01-07 河海大学 City path travel time forecasting method based on multi-source data fusion
CN104299442A (en) * 2014-10-15 2015-01-21 河海大学 Urban route travel time forecasting method based on pattern matching
CN104408958A (en) * 2014-11-11 2015-03-11 河海大学 Urban dynamic route travel time predication method
CN104615897A (en) * 2015-02-13 2015-05-13 北方工业大学 Road section travel time estimation method based on low-frequency GPS data
CN105006147A (en) * 2015-06-19 2015-10-28 武汉大学 Road segment travel time deducing method based on road space-time incidence relation
CN105679021A (en) * 2016-02-02 2016-06-15 重庆云途交通科技有限公司 Travel time fusion prediction and query method based on traffic big data
CN106023588A (en) * 2016-06-15 2016-10-12 重庆云途交通科技有限公司 Traffic big data-based travel time extraction, prediction and query method
CN106096767A (en) * 2016-06-07 2016-11-09 中国科学院自动化研究所 A kind of link travel time prediction method based on LSTM
CN106156531A (en) * 2016-08-04 2016-11-23 复旦大学 Travel time estimation method based on low sample history track
CN106940930A (en) * 2016-11-01 2017-07-11 深圳市城市交通规划设计研究中心有限公司 Motorway journeys time prediction system and Forecasting Methodology
CN106981198A (en) * 2017-05-24 2017-07-25 北京航空航天大学 Deep learning network model and its method for building up for predicting travel time
CN107085943A (en) * 2015-12-23 2017-08-22 青岛海信网络科技股份有限公司 A kind of road travel time short term prediction method and system
CN107480786A (en) * 2017-08-07 2017-12-15 复旦大学 Recognition with Recurrent Neural Network track likelihood probability computational methods based on output state limitation
CN107688556A (en) * 2017-07-24 2018-02-13 中山大学 A kind of real-time travel time computation method based on function type principal component analysis
CN107945507A (en) * 2016-10-13 2018-04-20 腾讯科技(深圳)有限公司 Travel Time Estimation Method and device
CN108073941A (en) * 2016-11-17 2018-05-25 江南大学 A kind of image, semantic generation method based on deep learning
KR20180064692A (en) * 2016-12-06 2018-06-15 현대자동차주식회사 A navigator, a vehicle the navigator is installed in, a system for determining a route, a method for controlling the same
WO2018109516A1 (en) * 2016-12-13 2018-06-21 日産自動車株式会社 Automatic driving assistance method and device

Patent Citations (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103280110A (en) * 2013-06-08 2013-09-04 北京云星宇交通工程有限公司 Method and device for predicting expressway travel time
CN104157142A (en) * 2014-08-27 2014-11-19 河海大学 Urban path travel time forecasting method based on floating vehicle data
CN104269059A (en) * 2014-10-15 2015-01-07 河海大学 City path travel time forecasting method based on multi-source data fusion
CN104299442A (en) * 2014-10-15 2015-01-21 河海大学 Urban route travel time forecasting method based on pattern matching
CN104408958A (en) * 2014-11-11 2015-03-11 河海大学 Urban dynamic route travel time predication method
CN104615897A (en) * 2015-02-13 2015-05-13 北方工业大学 Road section travel time estimation method based on low-frequency GPS data
CN105006147A (en) * 2015-06-19 2015-10-28 武汉大学 Road segment travel time deducing method based on road space-time incidence relation
CN107085943A (en) * 2015-12-23 2017-08-22 青岛海信网络科技股份有限公司 A kind of road travel time short term prediction method and system
CN105679021A (en) * 2016-02-02 2016-06-15 重庆云途交通科技有限公司 Travel time fusion prediction and query method based on traffic big data
CN106096767A (en) * 2016-06-07 2016-11-09 中国科学院自动化研究所 A kind of link travel time prediction method based on LSTM
CN106023588A (en) * 2016-06-15 2016-10-12 重庆云途交通科技有限公司 Traffic big data-based travel time extraction, prediction and query method
CN106156531A (en) * 2016-08-04 2016-11-23 复旦大学 Travel time estimation method based on low sample history track
CN107945507A (en) * 2016-10-13 2018-04-20 腾讯科技(深圳)有限公司 Travel Time Estimation Method and device
CN106940930A (en) * 2016-11-01 2017-07-11 深圳市城市交通规划设计研究中心有限公司 Motorway journeys time prediction system and Forecasting Methodology
CN108073941A (en) * 2016-11-17 2018-05-25 江南大学 A kind of image, semantic generation method based on deep learning
KR20180064692A (en) * 2016-12-06 2018-06-15 현대자동차주식회사 A navigator, a vehicle the navigator is installed in, a system for determining a route, a method for controlling the same
WO2018109516A1 (en) * 2016-12-13 2018-06-21 日産自動車株式会社 Automatic driving assistance method and device
CN106981198A (en) * 2017-05-24 2017-07-25 北京航空航天大学 Deep learning network model and its method for building up for predicting travel time
CN107688556A (en) * 2017-07-24 2018-02-13 中山大学 A kind of real-time travel time computation method based on function type principal component analysis
CN107480786A (en) * 2017-08-07 2017-12-15 复旦大学 Recognition with Recurrent Neural Network track likelihood probability computational methods based on output state limitation

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
DONG WANG, JUNBO ZHANG, WEI CAO, JIAN LI, YU ZHENG: "WhenWill You Arrive? Estimating Travel Time Based on Deep Neural Networks", 《MICROSOFT》 *
XIAOLEI MA,ZHIMIN TAO, YINHAI WANG, HAIYANG YU: "Long short-term memory neural network for traffic speed", 《TRANSPORTATION RESEARCH PART C》 *
YANGDONG LIU: "Short-term travel time prediction by deep learning: A comparison of different LSTM-DNN models", 《2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC)》 *
张发明,朱欣焰,呙维,胡涛: "利用浮动车大数据进行稀疏路段行程时间推断", 《武汉大学学报· 信息科学版》 *
杨庆芳,韦学武,林赐云,李志林,刘翔宇: "基于时空贝叶斯模型的行程时间可靠性预测", 《华南理工大学学报( 自然科学版)》 *

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109799533A (en) * 2018-12-28 2019-05-24 中国石油化工股份有限公司 A kind of method for predicting reservoir based on bidirectional circulating neural network
CN109783771A (en) * 2019-01-22 2019-05-21 清华大学 Track sets are converted to processing method, device and the storage medium of image array
CN111721306B (en) * 2019-03-20 2022-07-05 北京嘀嘀无限科技发展有限公司 Road matching method and device, electronic equipment and readable storage medium
CN111721306A (en) * 2019-03-20 2020-09-29 北京嘀嘀无限科技发展有限公司 Road matching method and device, electronic equipment and readable storage medium
CN110232169A (en) * 2019-05-09 2019-09-13 北京航空航天大学 Track denoising method based on two-way length memory models and Kalman filtering in short-term
CN110232169B (en) * 2019-05-09 2022-01-04 北京航空航天大学 Track denoising method based on bidirectional long-time and short-time memory model and Kalman filtering
CN112185148A (en) * 2019-07-03 2021-01-05 罗伯特·博世有限公司 Method for evaluating possible trajectories
CN110488835A (en) * 2019-08-28 2019-11-22 北京航空航天大学 A kind of unmanned systems intelligence local paths planning method based on double reverse transmittance nerve networks
CN110675632A (en) * 2019-11-11 2020-01-10 重庆邮电大学 Vehicle short-time trajectory prediction control method aiming at multi-feature space and data sparseness
CN110675632B (en) * 2019-11-11 2021-11-30 重庆邮电大学 Vehicle short-time trajectory prediction control method aiming at multi-feature space and data sparseness
CN110942211A (en) * 2019-12-12 2020-03-31 武汉中海庭数据技术有限公司 Prediction arrival time prediction method and device based on deep neural network
CN111047107A (en) * 2019-12-23 2020-04-21 北京百度网讯科技有限公司 Road traffic time prediction method, device, electronic equipment and storage medium
CN111047107B (en) * 2019-12-23 2022-05-10 北京百度网讯科技有限公司 Road traffic time prediction method, device, electronic equipment and storage medium
CN111339156A (en) * 2020-02-07 2020-06-26 京东城市(北京)数字科技有限公司 Long-term determination method and device of business data and computer readable storage medium
CN111339156B (en) * 2020-02-07 2023-09-26 京东城市(北京)数字科技有限公司 Method, apparatus and computer readable storage medium for long-term determination of business data
CN111738500A (en) * 2020-06-11 2020-10-02 大连海事大学 Navigation time prediction method and device based on deep learning
CN111738500B (en) * 2020-06-11 2024-01-12 大连海事大学 Navigation time prediction method and device based on deep learning
CN112101132A (en) * 2020-08-24 2020-12-18 西北工业大学 Traffic condition prediction method based on graph embedding model and metric learning
CN112017436B (en) * 2020-09-09 2021-09-28 中国科学院自动化研究所 Method and system for predicting urban traffic travel time
CN112017436A (en) * 2020-09-09 2020-12-01 中国科学院自动化研究所 Method and system for predicting urban traffic travel time
CN112859898A (en) * 2021-01-18 2021-05-28 中山大学 Aircraft trajectory prediction method based on two-channel bidirectional neural network
CN112859898B (en) * 2021-01-18 2022-03-22 中山大学 Aircraft trajectory prediction method based on two-channel bidirectional neural network
CN115394088A (en) * 2022-10-31 2022-11-25 江苏博宇鑫信息科技股份有限公司 Crossing traffic time prediction method based on space-time attention mechanism
CN116542438A (en) * 2023-03-28 2023-08-04 大连海事大学 Bus passenger starting and stopping point estimation and repair method based on non-reference real phase

Also Published As

Publication number Publication date
CN109035761B (en) 2021-06-04

Similar Documents

Publication Publication Date Title
CN109035761B (en) Travel time estimation method based on auxiliary supervised learning
Qiu et al. Nei-TTE: Intelligent traffic time estimation based on fine-grained time derivation of road segments for smart city
Zhang et al. Deeptravel: a neural network based travel time estimation model with auxiliary supervision
Ma et al. A novel STFSA-CNN-GRU hybrid model for short-term traffic speed prediction
Guo et al. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting
CN109658695B (en) Multi-factor short-term traffic flow prediction method
CN115240425B (en) Traffic prediction method based on multi-scale space-time fusion graph network
Thomas et al. Predictions of urban volumes in single time series
Niu et al. A novel spatio-temporal model for city-scale traffic speed prediction
Zhou et al. Urban flow prediction with spatial–temporal neural ODEs
CN115578852B (en) DSTGCN-based traffic prediction method
Olayode et al. Prediction and modeling of traffic flow of human-driven vehicles at a signalized road intersection using artificial neural network model: A South African road transportation system scenario
Rahman et al. Short-term traffic speed prediction for freeways during hurricane evacuation: a deep learning approach
Rajeh et al. Modeling multi-regional temporal correlation with gated recurrent unit and multiple linear regression for urban traffic flow prediction
CN116052427B (en) Inter-city inter-regional mobility prediction method and device based on private car travel track data
Haputhanthri et al. Short-term traffic forecasting using LSTM-based deep learning models
Lu et al. Efficient deep learning based method for multi‐lane speed forecasting: a case study in Beijing
Zhang et al. A hybrid deep learning approach for urban expressway travel time prediction considering spatial-temporal features
Doğan Short-term traffic flow prediction using artificial intelligence with periodic clustering and elected set
Qiao et al. Short-term traffic flow forecast based on parallel long short-term memory neural network
Pramanik et al. Modeling traffic congestion in developing countries using google maps data
Buroni et al. A tutorial on network-wide multi-horizon traffic forecasting with deep learning.
Ye et al. Short-term traffic flow prediction at isolated intersections based on parallel multi-task learning
Zhang et al. Application of long short-term memory neural network for multi-step travel time forecasting on urban expressways
Zhou et al. a Method for Traffic Flow Forecasting in a Large-Scale Road Network Using Multifeatures

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20210604