A Spatiotemporal Feature-Based Approach for the Detection of Unlicensed Taxis in Urban Areas
<p>Calculation process of vehicle operational characteristic indicators.</p> "> Figure 2
<p>Distribution characteristics of the training sample.</p> "> Figure 3
<p>ROC curve of unlicensed-taxi-identification model.</p> "> Figure 4
<p>Probability distribution of vehicles engaging in unlicensed-taxi activities.</p> "> Figure 5
<p>Distribution of average daily mileage for three types of vehicles.</p> "> Figure 6
<p>Operating-time-characteristic distribution: (<b>a</b>) operating days; (<b>b</b>) average daily operating time.</p> "> Figure 7
<p>Distribution characteristics of operating time periods within a day.</p> "> Figure 8
<p>Main operating areas of suspected unlicensed taxis: (<b>a</b>) overall distribution; (<b>b</b>) operating hotspot areas.</p> "> Figure 9
<p>Distribution of traffic surveillance bayonets passed by the suspected unlicensed taxi each day: (<b>a</b>) first pass; (<b>b</b>) last pass.</p> "> Figure 10
<p>Temporal distribution of traffic surveillance bayonets passed by the suspected unlicensed taxi during the statistical period: (<b>a</b>) start time; (<b>b</b>) end time.</p> ">
Abstract
:1. Introduction
- (1)
- This study addresses the limitations of previous research, which was constrained by a narrow range of data types and small sample sizes. Unlike prior studies that mined data from the vehicle’s perspective, this study adopts the perspective of traffic managers, utilizing traffic surveillance bayonets distributed across the road network to collect vehicle passage data. This method facilitates the convenient and accurate gathering of multidimensional traffic data from vehicles on the road, providing a robust data foundation for identifying unlicensed taxis and ensuring the successful implementation of practical applications.
- (2)
- This study avoided relying on a single vehicle characteristic by incorporating multiple features, such as daily average mileage, daily operating time, and the ratio of operating days. Through multidimensional cross-analysis, this approach improved identification accuracy and bolstered the model’s robustness, mitigating biases associated with relying on a single indicator.
- (3)
- To develop effective solutions for the precise regulation of unlicensed taxis, this study analyzed the spatiotemporal distribution of suspected unlicensed taxis, including their operating start- and end-points and times, to identify distribution patterns. This thorough analysis provides a strong basis for traffic management authorities to implement targeted regulations, thereby improving management efficiency.
2. Multi-Source Data Processing
2.1. Data Collection and Filtering
2.1.1. Data Collection
2.1.2. Data Filtering
- (1)
- Step 1: Exclude Special Vehicle Data
- (2)
- Step 2: Eliminating Noisy Data
- (a)
- Retrieval of Noise Data
- (b)
- Similarity Analysis Between Noise Data and Normal Data
- (c)
- Data Supplementation and Removal
- (3)
- Step 3: Invalid Data Filtering
2.2. Training Sample Selection
3. Identification Indicators
3.1. Operational Characteristic Indicators of Unlicensed Taxis
- (1)
- Definition 1: Average Daily Mileage
- (2)
- Definition 2: Average Daily Operating Time
- (3)
- Definition 3: Operating-Days Ratio
3.2. Calculation of Vehicle Operational Characteristic Indicators
3.2.1. Construction of Bayonet Distance Matrix
3.2.2. Segmentation of Passage Records
3.2.3. Calculation of Driving Mileage
- (1)
- The traffic-surveillance-bayonet numbers {tsc1, tsc2, …, tsck} for vehicle j in the i-th trip are extracted as the key nodes of the short-term trip Gi, where k represents the total number of traffic surveillance bayonets passed during the i-th trip.
- (2)
- The key nodes {tsc1, tsc2, …, tsck} of the short-term trip Gi are matched with the bayonet distance matrix to obtain the distances {l1, l2, …, lk−1} between adjacent bayonets. The driving mileage si of the short-term trip Gi can then be calculated, as shown in Equation (11).
- (3)
- The driving mileage si for all short-term trips of vehicle j within the statistical period T is calculated, and the total driving mileage Sj of vehicle j during the statistical period is then determined, as shown in Equation (12).
3.2.4. Calculation of Driving Time
- (1)
- The number of days dj within the statistical period during which vehicle j has driving records, along with the corresponding short-term trips, is calculated. Days without vehicle passage records are marked as 0.
- (2)
- The passage times {ptime1, ptime2, …, ptimek} at the traffic surveillance bayonets for vehicle j during the i-th trip are extracted, and the driving time tsi for that trip is calculated, as shown in Equation (13).
- (3)
- The driving time for all short-term trips of vehicle j within the statistical period T is calculated, and the total operating time To of vehicle j during the statistical period is determined, as shown in Equation (14).
3.3. Identification Indicator Determination
4. Identification Method
4.1. Suspected-Unlicensed-Taxi-Identification Model
4.2. Parameter Estimation
4.3. Model Validation
4.4. Model Performance and Predictive Ability Evaluation
5. Empirical Analysis
5.1. Identification of Suspected Unlicensed Taxis
5.2. Operational Characteristic Analysis
5.2.1. Mileage Characteristics
5.2.2. Operating-Time Characteristics
5.3. Spatiotemporal-Distribution Characteristics Analysis
5.3.1. Overall Spatial-Distribution Characteristics
5.3.2. Temporal- and Spatial-Distribution Characteristics of Travel Origins
6. Conclusions
- (1)
- Based on traffic-surveillance-bayonet data, a distance matrix was constructed and a driving interval threshold was set. The mileage and operating time data were calculated, and variance analysis was conducted to compare the differences between private cars, unlicensed taxis, and compliant taxis, leading to the determination of unlicensed-taxi-identification indicators.
- (2)
- Based on the identified unlicensed-taxi indicators, a binary Logistic regression model was established. The model parameters were estimated using the maximum likelihood method, and the model’s goodness-of-fit and predictive power were evaluated through Hosmer–Lemeshow tests and ROC curve analysis. The results show that the model can effectively predict the likelihood of a vehicle engaging in unlicensed-taxi activities (R = 89.26%, P = 94.74%, ACC = 99.10%, F1 = 91.91%, AUC = 0.994).
- (3)
- Using the information from the identified suspected unlicensed taxis, an analysis of their daily start and end times and location distribution was conducted to provide a basis for precise management by traffic authorities. The results show that the operational characteristics of suspected unlicensed taxis differ from those of private cars, with regular patterns in their daily start and end times and location distributions.
7. Patents
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Hu, Y.; Liu, L.; Xie, Y. Intelligent Recognition Model and its Mechnism for Illiagle Taxis Based on Spatio-Temporal Features. Transp. Energy Conserv. Environ. Prot. 2022, 18, 64–69. [Google Scholar]
- Liu, R.; Li, Y.; Mao, H.; Qian, J.; Wang, J.; Ma, Y. Short-Term Traffic Flow Forecasting at Intersections Based on Frequent Sequence Pattern Mining. Urban Transp. China 2023, 21, 87–98. [Google Scholar]
- Wang, M.; Ding, H.; Li, L.; Xu, R.; Liu, Y. Analysis of non-local vehicle travel and technical parameters time-varying characteristics based on video bayonet data. Acta Sci. Nat. Univ. Sunyatseni 2023, 62, 106–114. [Google Scholar]
- Chen, H.; Yang, C.; Xu, X. Clustering Vehicle Temporal and Spatial Travel Behavior Using License Plate Recognition Data. J. Adv. Transp. 2017, 2017, 1738085. [Google Scholar] [CrossRef]
- Wei, Q.; Wang, J.; Mo, J.; Chen, C. Travel Characteristics Analysis Based on Bayonet System. Traffic Transp. 2019, 35, 33–36. [Google Scholar]
- Long, X.; Su, Y.; Yu, C.; Wu, D. Analyzing Methods of Vehicle’ Travel Using Plate Recognition Data. J. Transp. Syst. Eng. Inf. Technol. 2019, 19, 66–72. [Google Scholar]
- Ruan, S.; Wang, F.; Ma, D.; Jin, S.; Wang, D. Vehicle trajectory extraction algorithm based on license plate recognition data. J. Zhejiang Univ. 2018, 52, 836–844. [Google Scholar]
- Yao, W.; Zhang, M.; Jin, S.; Ma, D. Understanding vehicles commuting pattern based on license plate recognition data. Transp. Res. Part C 2021, 128, 103142. [Google Scholar] [CrossRef]
- Wu, H.; Yang, J.; Yuan, M.; Li, X. Heuristic Optimal Scheduling for Road Traffic Incident Detection Under Computational Constraints. Sensors 2024, 24, 7221. [Google Scholar] [CrossRef]
- Lin, J.; Wang, L. Design and Implementation of Illegal Operating Vehicles’ Reporting System Based on NFC and Android. Tech. Autom. Appl. 2018, 37, 119–122+138. [Google Scholar]
- Wang, N.; Duan, P.; Wu, L. Research of illegal operation vehicle identification based on convolutional neural network. J. Comput. Appl. 2016, 36, 193–196. [Google Scholar]
- Li, R.; Xiao, Y. Identification Method for Unlicensed Taxis in Urban Areas: A Case Study. J. Urban Dev. Manag. 2023, 2, 22–33. [Google Scholar] [CrossRef]
- Shuai, D.; Lan, Z.; Li, Y. Recognising Illegal Operation Vehicles Based on K-mediods and ITS Improved Algorithm. Comput. Appl. Softw. 2016, 33, 154–157+211. [Google Scholar]
- Ma, S.; Sun, C.; Zhu, J. Application of SOM Neural Network in the Identification of Illegal Operating Vehicles. Inf. Comput. 2017, 83–85. [Google Scholar]
- Tian, Q.; Li, Q.; Yu, J.; Yang, C.; Su, J. Research on the Characteristics of Illegal Passenger Vehicles Based on ETC Toll Pipeline Data. Logist. Eng. Manag. 2023, 45, 124–128+161. [Google Scholar]
- Zhao, Q.; Jiang, T.; Zhao, F.; Ma, B. Abnormal vehicle detection based on spatio-temporal big data. Transducer Microsyst. Technol. 2019, 38, 139–142. [Google Scholar]
- Chen, L.; Zheng, L.; Xia, L.; Liu, W.; Sun, D. Detecting and analyzing unlicensed taxis: A case study of Chongqing City. Phys. A Stat. Mech. Its Appl. 2021, 584, 126324. [Google Scholar] [CrossRef]
- Yuan, W.; Deng, P.; Taleb, T.; Wan, J.; Bi, C. An Unlicensed Taxi Identification Model Based on Big Data Analysis. Ieee Trans. Intell. Transp. Syst. 2016, 17, 1703–1713. [Google Scholar] [CrossRef]
- Wang, Y.; Fan, X.; Liu, X.; Zheng, C.; Chen, L.; Wang, C.; Li, J. Unlicensed Taxis Detection Service Based on Large-Scale Vehicles Mobility Data. In Proceedings of the 2017 IEEE International Conference on Web Services (ICWS), Honolulu, HI, USA, 25–30 June 2017; IEEE: Piscataway, NJ, USA, 2017. [Google Scholar]
- Tian, Y.; Yang, J.; Lu, P. Unlicensed Taxi Detection Algorithm Based on Traffic Surveillance Data. In Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management, Chicago, IL, USA, 5 November 2019; Association for Computing Machinery: Broadway, NY, USA, 2019. [Google Scholar]
- Huang, Z.; Li, Q. Research on Identification of Urban Illegal Vehicles Based on Random Forest Model. Electron. Sci. Technol. 2024, 37, 66–71. [Google Scholar]
- Molina-Campoverde, J.J.; Rivera-Campoverde, N.; Molina Campoverde, P.A.; Bermeo Naula, A.K. Urban Mobility Pattern Detection: Development of a Classification Algorithm Based on Machine Learning and GPS. Sensors 2024, 24, 3884. [Google Scholar] [CrossRef] [PubMed]
- Crouzil, A.; Khoudour, L.; Valiere, P.; Cong, D.N.T. Automatic vehicle counting system for traffic monitoring. J. Electron. Imaging 2016, 25, 051207. [Google Scholar] [CrossRef]
- Malinovskiy, Y.; Wu, Y.; Wang, Y. Video-Based Vehicle Detection and Tracking Using Spatiotemporal Maps. Transp. Res. Rec. 2009, 2121, 81–89. [Google Scholar] [CrossRef]
- Xia, L.; Hu, X.; Wang, J. Anomaly detection in traffic surveillance with sparse topic model. J. Cent. South. Univ. 2018, 25, 2245–2257. [Google Scholar] [CrossRef]
- Zou, B.; Yu, Z.; He, Z.; Huang, M.; Chen, K.; Sha, Z. The characteristics of urban road usage using automatic vehicle identification data. Acta Sci. Nat. Univ. Sunyatseni 2022, 61, 132–139. [Google Scholar]
- Zhao, P.; Wu, G.; Wang, X.; Wang, S.; Zan, Y. A Data-driven Method for Identifying Congestion State and Selecting Guided Vehicles for Urban Expressways. J. Transp. Inf. Saf. 2021, 39, 82–90. [Google Scholar]
- Lin, P.; Weng, J.; Hu, S.; Jing, Y.; Yin, B. Day-to-day Similarity of Individual Activity Chain of Public Transport Passengers. J. Transp. Syst. Eng. Inf. Technol. 2020, 20, 178–183+204. [Google Scholar]
- Fu, F.; Fang, Y.; Dong, H.; Chen, W. Automation of Electric Power Systems. Autom. Electr. Power Syst. 2018, 42, 72–80. [Google Scholar]
- Phattarasuda, W.; Surachet, P.; Kunnawee, K.; Kazushi, S.; Suksun, H. Influential Factors Affecting Travelers’ Mode Choice Behavior on Mass Transit in Bangkok, Thailand. Sustainability 2020, 12, 9522. [Google Scholar] [CrossRef]
- Zhang, F.; Ji, Y.; Lv, H.; Ma, X. Analysis of factors influencing delivery e-bikes’ red-light running behavior: A correlated mixed binary logit approach. Accid. Anal. Prev. 2021, 152, 105977. [Google Scholar] [CrossRef] [PubMed]
- Li, J.; Nie, Q.; Liu, Q.; Jia, X. Logistic regression model for stability assessment of karst ground collapse of Tangshan city. Rock. Soil Mech. 2017, 38, 250–256. [Google Scholar]
Number | Data Type | Example | Meaning |
---|---|---|---|
1 | pnum | 1832926172 | The count of vehicles passing through traffic surveillance bayonet |
2 | vnum | Wan F****1 | Vehicle license plate number |
3 | vncol | Blue | Vehicle license plate color |
4 | tscnum | 34060300001190110040 | Traffic surveillance bayonet number |
5 | ptime | 2021/12/1 6:59:05 | The time the vehicle passes through the traffic surveillance bayonet |
6 | pd | 9.0 | Vehicle driving direction |
7 | pspeed | 46 | The speed of the vehicle passing through the traffic surveillance bayonet |
Number | Data Type | Example | Meaning |
---|---|---|---|
1 | tscnum | 340621000011901401 | Traffic-surveillance-bayonet number |
2 | lon | 116.764 | Longitude of the traffic surveillance bayonet |
3 | lat | 33.922 | Latitude of the traffic surveillance bayonet |
Error Identification Information | Cause of Error |
---|---|
05377 | Incomplete License Plate Information |
0B70641 | Misidentification of Leading “0” in Number |
Serial Number | tsc1 | tsc2 | … | tsci | … | tscm |
---|---|---|---|---|---|---|
tsc1 | 0 | l12 | … | l1i | … | l1m |
tsc2 | l21 | 0 | … | l2i | … | l2m |
… | … | … | … | … | … | … |
tsci | li1 | li2 | … | 0 | … | lim |
… | … | … | … | … | … | … |
tscm | lm1 | lm2 | … | lm3 | … | 0 |
Mean | Standard Deviation | F | Sig | Post Hoc Test | ||
---|---|---|---|---|---|---|
Sa | private cars | 21.926 | 14.571 | 4328.400 | 0.00 | I < II; I < III |
confirmed unlicensed taxis | 175.291 | 79.383 | ||||
compliant taxis | 175.058 | 71.649 | ||||
Ta | private cars | 0.738 | 0.426 | 4404.217 | 0.00 | I < II; I < III |
confirmed unlicensed taxis | 8.125 | 3.875 | ||||
compliant taxis | 8.762 | 3.776 | ||||
τ | private cars | 0.561 | 0.342 | 1341.889 | 0.00 | I < II; I < III |
confirmed unlicensed taxis | 0.976 | 0.078 | ||||
compliant taxis | 0.975 | 0.094 |
vnum | Sa/km | Ta/h | τ |
---|---|---|---|
Wan F****9 | 310.027 | 12.464 | 1 |
Wan F****2 | 302.631 | 15.682 | 1 |
… | … | … | … |
Wan L****5 | 1.000 | 0.410 | 0.8 |
Variables. | B | S.E | Wald | df | Sig | Exp (B) |
---|---|---|---|---|---|---|
Sa | 0.027 | 0.014 | 3.936 | 1 | 0.047 | 1.027 |
Ta | 1.488 | 0.361 | 16.997 | 1 | 0.000 | 4.430 |
τ | 5.409 | 1.916 | 7.973 | 1 | 0.005 | 223.487 |
Constant | −11.885 | 1.916 | 38.461 | 1 | 0.000 | 0.000 |
Steps | Chi-Square | df | Sig |
---|---|---|---|
1 | 9.175 | 8 | 0.328 |
Category | Predicted Values | ||
---|---|---|---|
Unlicensed Taxis | Private Cars | ||
Actual values | Unlicensed taxis | 108 | 13 |
Private cars | 6 | 1994 |
vnum | Sa/km | Ta/h | τ |
---|---|---|---|
Wan F****2 | 302.631 | 15.682 | 1 |
Wan F****0 | 297.365 | 15.492 | 1 |
… | … | … | … |
Wan F****2 | 76.443 | 2.956 | 1 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Xiao, Y.; Li, R.; Li, J. A Spatiotemporal Feature-Based Approach for the Detection of Unlicensed Taxis in Urban Areas. Sensors 2024, 24, 8206. https://doi.org/10.3390/s24248206
Xiao Y, Li R, Li J. A Spatiotemporal Feature-Based Approach for the Detection of Unlicensed Taxis in Urban Areas. Sensors. 2024; 24(24):8206. https://doi.org/10.3390/s24248206
Chicago/Turabian StyleXiao, Yun, Rongqiao Li, and Jinyan Li. 2024. "A Spatiotemporal Feature-Based Approach for the Detection of Unlicensed Taxis in Urban Areas" Sensors 24, no. 24: 8206. https://doi.org/10.3390/s24248206
APA StyleXiao, Y., Li, R., & Li, J. (2024). A Spatiotemporal Feature-Based Approach for the Detection of Unlicensed Taxis in Urban Areas. Sensors, 24(24), 8206. https://doi.org/10.3390/s24248206