Nothing Special   »   [go: up one dir, main page]

Next Article in Journal
Knowledge Reasoning- and Progressive Distillation-Integrated Detection of Electrical Construction Violations
Previous Article in Journal
Correction: Oluwasanmi et al. Multi-Head Spatiotemporal Attention Graph Convolutional Network for Traffic Prediction. Sensors 2023, 23, 3836
Previous Article in Special Issue
TEA-GCN: Transformer-Enhanced Adaptive Graph Convolutional Network for Traffic Flow Forecasting
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Spatiotemporal Feature-Based Approach for the Detection of Unlicensed Taxis in Urban Areas

1
School of Urban Construction and Transportation, Hefei University, Hefei 230601, China
2
School of Civil Engineering and Transportation, Northeast Forestry University, Harbin 150040, China
*
Author to whom correspondence should be addressed.
Sensors 2024, 24(24), 8206; https://doi.org/10.3390/s24248206
Submission received: 26 October 2024 / Revised: 20 December 2024 / Accepted: 20 December 2024 / Published: 23 December 2024
(This article belongs to the Special Issue Data and Network Analytics in Transportation Systems)

Abstract

:
Unlicensed taxis seriously disrupt the transportation market order, and threaten passenger safety. Therefore, this paper proposes a method for identifying unlicensed taxis based on travel characteristics. First, the vehicle mileage and operation time are calculated using traffic surveillance bayonet data, and variance analysis is applied to identification indicators for unlicensed taxis. Secondly, the mathematical model for identifying unlicensed taxis is established. The model is validated using the Hosmer–Lemeshow test, confusion matrix and ROC curve analysis. Finally, by applying methods such as geographic information matching, the spatiotemporal distribution characteristics of suspected unlicensed taxis in a city in Anhui Province are identified. The results show that the model effectively identifies suspected unlicensed taxis (ACC = 99.10%). The daily average mileage, daily average operating time, and number of operating days for suspected unlicensed taxis are significantly higher than those for private cars. Additionally, the suspected unlicensed taxis exhibit regular patterns in their travel origin–destination points and temporal distribution, enabling traffic management authorities to implement targeted regulatory measures.

1. Introduction

Unlicensed taxis refers to vehicles engaged in transportation operations without legally obtaining the necessary operating rights, posing significant safety risks to the transportation sector [1]. Currently, many drivers in China are involved in unlicensed taxi operations. Due to the lack of necessary assessments and qualification checks, the quality of drivers is inconsistent. Some even have histories of criminal records, posing significant threats to passenger safety. Additionally, most unlicensed taxis are low-end or poorly maintained vehicles, making it difficult to ensure vehicle quality, which poses potential risks to passenger safety. Traditional methods for identifying unlicensed taxis primarily rely on manual roadside inspections and passenger reports, which not only consume substantial human and material resources but also easily disrupt normal traffic order, leading to social conflicts. Therefore, enhancing traditional methods by leveraging traffic big data analysis to identify unlicensed taxi operations is crucial for optimizing road transportation management.
Traffic surveillance bayonet data contain a vast amount of vehicle travel information, from which vehicle travel characteristics can be effectively extracted, enabling accurate identification of unlicensed vehicles and providing a new approach to managing these vehicles. Some scholars have utilized big data mining techniques to extract vehicle travel information from traffic surveillance bayonet data. Liu et al. [2] analyzed vehicle travel characteristics based on frequent sequence-pattern mining algorithms using bayonet vehicle trajectory sequences. Wang et al. [3] collected hourly scale data on the structure of vehicle types, road types, and time-varying characteristics of travel and technical parameters for non-local vehicles in the road network inside and outside the First Ring of Foshan City using big data mining techniques. Chen et al. [4] proposed a method for completing vehicle travel OD based on license plate recognition data, supplementing and restoring OD at measurement points by distributing link flows to obtain complete vehicle travel paths. Wei et al. [5] established a vehicle travel-demand evaluation index system based on traffic surveillance bayonet data and analyzed vehicle travel characteristics from multiple dimensions. Long et al. [6] extracted vehicle travel-path information from traffic surveillance bayonet data and analyzed vehicle travel patterns at both individual and aggregate levels. Ruan et al. [7] combined the K-shortest path algorithm with the grey relational algorithm to complete decisions and reconstruct travel paths generated from license-plate recognition data. Yao et al. [8] used bayonet license-plate recognition data and employed factor analysis to integrate vehicle travel stability factors, efficiently identifying commuter vehicles. Wu et al. [9] introduced a heuristic optimal scheduling approach to identify abnormal traffic events, and the results demonstrated that this method outperforms existing algorithms. The above studies are based on traffic surveillance bayonet data, and set feature indicators according to vehicle travel patterns to extract commuter vehicle travel characteristics, providing a methodological reference for extracting and identifying the features of suspected unlicensed taxis.
Currently, scholars both domestically and internationally have conducted extensive research on identifying unlicensed taxis. Lin et al. [10] proposed a reporting system for unlicensed vehicles based on Near Field Communication technology and Android, encouraging passengers to report suspected unlicensed taxis and creating an effective environment for their governance. Wang et al. [11] used Convolutional Neural Networks (CNNs) to obtain trajectory data of unlicensed and normal vehicles through simulation experiments, performing feature learning and recognition to improve the identification rate of suspected unlicensed taxis. Li et al. [12] used vehicle mileage as an identification indicator to establish a model for identifying unlicensed taxis and identifying suspected unlicensed taxis through case analysis. Shuai et al. [13] extracted vehicle Radio Frequency Identification data and proposed a k-medoids-based algorithm for identifying unlicensed taxis, which was validated through experiments. Ma et al. [14] collected vehicle operation data using RFID technology and applied the SOM neural network clustering algorithm to establish a mathematical model for identifying unlicensed taxis, achieving favorable results. Tian et al. [15] developed a method for identifying unlicensed taxis based on ETC toll data, employed an improved K-means++ algorithm, and empirically identified unlicensed taxis in Guiyang. The focus of the above studies is on algorithm models, but they primarily rely on data simulation for theoretical validation, and their applicability requires further testing.
With the development of big-data analysis techniques, some scholars have conducted research on identifying unlicensed taxis by analyzing various types of traffic data and extracting different feature indicators. Zhao et al. [16] analyzed vehicle refueling data, extracting temporal and spatial refueling characteristics, and defined vehicles with abnormal refueling patterns as suspected unlicensed taxis. Chen et al. [17] used Electronic Registration Identification data to build a detection model for unlicensed taxis using ensemble learning methods, identifying suspected unlicensed taxis. Yuan et al. [18] extracted vehicle travel data from traffic surveillance bayonets, developed a detection model to identify coarse-grained unlicensed taxis and further applied a feature-trained Support Vector Machine classification model to identify fine-grained unlicensed taxis. Wang et al. [19] extracted two types of behavioral features: daily behaviors and sustainable behaviors, and used three machine learning methods to compare the accuracy and quantity of identified unlicensed taxis by each method. Tian et al. [20] proposed two indicators: “path irregularity” and “time irregularity”, and developed a model to distinguish between commercial and non-commercial vehicles. Huang et al. [21] employed a random forest algorithm to develop a classifier for identifying unlicensed taxis, comparing its performance with that of other models, to validate the accuracy of the proposed method. Juan et al. [22] utilized mobile GPS data to develop a decision-tree machine-learning classification algorithm, which can be applied to urban traffic monitoring. The above studies used machine learning algorithms to identify suspected unlicensed taxis in the study areas, but they did not further analyze the spatiotemporal trajectory characteristics of these vehicles, limiting their ability to provide strong support for precise management by authorities.
Therefore, this study extended previous research by using traffic surveillance bayonet data to gather vehicle passage information. The spatiotemporal characteristic indicators of private cars, confirmed unlicensed taxis and compliant taxis were selected and analyzed using variance analysis to explore their differences, thereby determining the identification indicators for unlicensed taxis. Based on this, a Binary Logistic Regression analysis was conducted to develop a model for identifying unlicensed taxis using significant identification indicators. The high-precision spatiotemporal characteristics of suspected unlicensed taxis were then extracted, and governance measures for these vehicles were proposed. The research results contribute to enriching the methods for identifying and regulating unlicensed taxis, providing strategic guidance for traffic management authorities in formulating policies to combat illegal operations, thereby enhancing law enforcement efficiency and passage safety. The primary contributions of this study are summarized as follows:
(1)
This study addresses the limitations of previous research, which was constrained by a narrow range of data types and small sample sizes. Unlike prior studies that mined data from the vehicle’s perspective, this study adopts the perspective of traffic managers, utilizing traffic surveillance bayonets distributed across the road network to collect vehicle passage data. This method facilitates the convenient and accurate gathering of multidimensional traffic data from vehicles on the road, providing a robust data foundation for identifying unlicensed taxis and ensuring the successful implementation of practical applications.
(2)
This study avoided relying on a single vehicle characteristic by incorporating multiple features, such as daily average mileage, daily operating time, and the ratio of operating days. Through multidimensional cross-analysis, this approach improved identification accuracy and bolstered the model’s robustness, mitigating biases associated with relying on a single indicator.
(3)
To develop effective solutions for the precise regulation of unlicensed taxis, this study analyzed the spatiotemporal distribution of suspected unlicensed taxis, including their operating start- and end-points and times, to identify distribution patterns. This thorough analysis provides a strong basis for traffic management authorities to implement targeted regulations, thereby improving management efficiency.

2. Multi-Source Data Processing

2.1. Data Collection and Filtering

2.1.1. Data Collection

Traffic surveillance bayonet data are characterized by strong timeliness and high quality, accurately reflecting traffic flow and vehicle trajectories. This is one of the essential foundational datasets for traffic big data analysis [23,24,25]. Thees data include vehicle passage data and bayonet location information, with their types and meanings shown in Table 1 and Table 2.
This study collected 42.045 million vehicle passage records over ten days in a city in Anhui Province through traffic surveillance bayonets. Irrelevant data attributes were removed according to the research objectives, and each vehicle passage record r can be represented by Equation (1).
r i = ( v n u m i , v n c o l i , t s c n u m i , p t i m e i , p s p e e d i )
where i represents the i-th vehicle passage data recorded by the traffic surveillance bayonet.
By sorting each vehicle’s passage records in chronological order based on the time they passed through the traffic surveillance bayonets, the resulting set of passage records for each vehicle, denoted as R, can be represented by Equation (2).
R j = r 1 , r 2 , , r i
where Rj represents the set of passage records for vehicle j, ordered by time.

2.1.2. Data Filtering

When collecting vehicle passage data through traffic surveillance bayonets, irrelevant special vehicle passage data may be recorded, and noise data may be generated due to factors such as weather and equipment conditions [26,27]. To improve the accuracy of vehicle travel-information estimation, the raw bayonet data need to be processed as follows:
(1)
Step 1: Exclude Special Vehicle Data
Special vehicles include trucks, police cars, government vehicles, driving test vehicles, taxis, etc., which are unrelated to unlicensed taxi operations. This study filters out special vehicle data based on the vehicle numbering rules in China and the vehicle information provided by the relevant authorities.
(2)
Step 2: Eliminating Noisy Data
When calculating various vehicle indicators based on the data collected from traffic surveillance bayonets, noise data can increase calculation errors and even produce incorrect results. Common types of noise data include missing data, duplicate data, and erroneous data. The types of noise data generated by the traffic surveillance bayonet in this city are shown in Table 3. This study addresses missing data according to the following steps.
(a)
Retrieval of Noise Data
Based on the vehicle passage records processed in Step 1, a binary function ψ is constructed to determine whether the vnumi data in ri is abnormal, as shown in Equation (3).
ψ i = 0 ,   a b n o r m a l   v e h i c l e   l i c e n s e   p l a t e 1 ,   n o r m a l   v e h i c l e   l i c e n s e   p l a t e
(b)
Similarity Analysis Between Noise Data and Normal Data
Common methods for similarity analysis between noise data and normal data include Cosine Similarity, Jaccard Similarity, and Levenshtein Distance. Due to its intuitiveness, simplicity, and sensitivity to editing operations, this study uses Levenshtein Distance for the similarity analysis between noise data and normal data. Levenshtein Distance refers to the minimum number of editing operations required to transform string S into target string T. The editing operations include insertion, deletion, and substitution of individual characters. The calculation formula is shown in Equation (4) [28].
l e v S , T ( i , j ) = max ( i , j ) min l e v S , T ( i 1 , j ) + 1 l e v S , T ( i , j 1 ) + 1 l e v S , T ( i 1 , j 1 ) + 1 ( S i T j ) i f min ( i , j ) = 0 o t h e r w i s e
where Si and Tj represent the i-th and j-th characters of strings S and T, respectively; 1(SiTj) equals 1 when SiTj, otherwise it equals 0.
If S and T are two different strings, their similarity can be calculated using Equation (5).
γ = 1 l e v S , T ( i , j ) max ( L S , L T )
where LS and LT represent the lengths of strings S and T, respectively. The larger the value of γ, the higher the similarity between S and T.
The differentiation between the two sets of license plate data, ψ = 0 and ψ = 1, was quantitatively assessed using the Levenshtein Distance, resulting in the similarity γ between abnormal and normal license plates.
(c)
Data Supplementation and Removal
For each abnormal license plate and normal license plate, a similarity comparison is performed. If the abnormal license plate vnumi has the highest similarity with the normal license plate vnumj, and the passage records of Ri and Rj show temporal continuity, then Ri is merged into Rj for data supplementation. Data that do not meet these conditions are considered invalid and are removed.
(3)
Step 3: Invalid Data Filtering
The distance ΔS between the traffic surveillance bayonets corresponding to two consecutive passage records of vehicle j, as well as the time difference Δt between these records, was calculated. The spatial movement speed v0 = ΔSt of vehicle j within the adjacent traffic surveillance bayonets is then determined. If v0 > 100 km/h, the passage record is marked as invalid and is removed.

2.2. Training Sample Selection

Unlicensed taxis and compliant taxis (including traditional taxis and ride-hailing vehicles) have similar operating patterns, which differ to some extent from those of regular private cars [17]. Analyzing the similarities and differences between them can effectively capture their operational characteristics. Therefore, this study extracts historical driving data from a subset of private cars, confirmed unlicensed taxis, and compliant taxis as the training sample υ, used for identifying indicators and analyzing operational characteristics. The expression is given in Equation (6). The training sample υ consists of the set of passage records from private cars υN, confirmed unlicensed taxis υI, and compliant taxis υL.
υ N = R 1 , R 2 , , R n υ I = R 1 , R 2 , , R m υ L = R 1 , R 2 , , R g
where n, m, and g represent the number of private cars, confirmed unlicensed taxis, and compliant taxis, respectively.
Based on the bayonet data from 10 days, this study selected the passage records of 1852 compliant taxis, 121 confirmed unlicensed taxis, and a random sample of 2000 private cars in the city as the training set for further analysis.

3. Identification Indicators

3.1. Operational Characteristic Indicators of Unlicensed Taxis

Unlicensed taxis typically exhibit the following characteristics: (1) lower vehicle prices; (2) higher maximum passenger capacity; (3) longer driving times; and (4) greater driving distances [16]. Based on this, the definitions of the operational characteristic indicators for unlicensed taxis are provided.
(1)
Definition 1: Average Daily Mileage
The average daily mileage (Sa) is defined as the ratio of the total mileage Si of vehicle i during a statistical period of T days to the number of days di the vehicle was in operation, as shown in Equation (7).
S a i = S i d i = j = 1 T s i , j d i
where si,j represents the mileage of vehicle i on the j-th day during the statistical period T, measured in kilometers.
(2)
Definition 2: Average Daily Operating Time
The average daily operating time (Ta) is defined as the ratio of the total operating time Toi of vehicle i during a statistical period of T days to the number of days di the vehicle was in operation, as shown in Equation (8).
T a i = T o i d i = j = 1 T t i , j d i
where ti,j represents the operating time of vehicle i on the j-th day during the statistical period T.
(3)
Definition 3: Operating-Days Ratio
The operating-days ratio (τ) is defined as the ratio of the number of operating days di of vehicle i during a statistical period of T days to the total statistical period T, as shown in Equation (9). The value of τ ranges from 0 to 1, with a value closer to 1 indicating that the vehicle operated on more days within the statistical period.
τ = d i T

3.2. Calculation of Vehicle Operational Characteristic Indicators

The calculation of identification indicator data for each vehicle within the set of passage records R is conducted, following the steps shown in Figure 1.

3.2.1. Construction of Bayonet Distance Matrix

The bayonet distance matrix for adjacent points is constructed based on the latitude and longitude information from urban traffic surveillance attributes. For adjacent traffic surveillance bayonets tsci and tscj in the city, the shortest path between them is determined based on the urban road network, and the path is divided into several key nodes (x1, y1), (x2, y2), …, (xn, yn). The distance between adjacent nodes is calculated and summed to determine the total distance between adjacent traffic surveillance bayonets tsci and tscj, as shown in Equation (10).
s i , j = 1 i R e * ar cos [ cos ( y n 1 ) * cos ( y n ) * cos ( x n 1 x n ) + sin ( y n 1 ) * sin ( y n ) ]
where Re represents the Earth’s radius, which is 6371.0 km.
The distance matrix for m adjacent traffic surveillance bayonets is established as shown in Table 4. where lim represents the road segment distance between adjacent traffic surveillance bayonet i and bayonet m, measured in kilometers.

3.2.2. Segmentation of Passage Records

Vehicle passage records are segmented based on the time intervals between adjacent traffic surveillance bayonets [29]. Based on the average time for all vehicles to pass through the bayonets, this study sets the time interval threshold ΔT to 15 min. When the time interval for vehicle j to pass through adjacent traffic surveillance bayonets tsci and tsci+1 meets the condition ptimei+1ptimei ≥ ΔT, the passage records read by the adjacent traffic surveillance bayonets tsci and tsci+1 are segmented into two separate short-term trips. Finally, the set of driving records Rj for vehicle j within the statistical period is divided into short-term trips {G1, G2, …, Gm}.

3.2.3. Calculation of Driving Mileage

The key nodes for vehicle j in different short-term trips are extracted, matched with the bayonet distance matrix to determine the mileage s of each trip, and finally, the total mileage S of vehicle j is calculated. The steps are as follows:
(1)
The traffic-surveillance-bayonet numbers {tsc1, tsc2, …, tsck} for vehicle j in the i-th trip are extracted as the key nodes of the short-term trip Gi, where k represents the total number of traffic surveillance bayonets passed during the i-th trip.
(2)
The key nodes {tsc1, tsc2, …, tsck} of the short-term trip Gi are matched with the bayonet distance matrix to obtain the distances {l1, l2, …, lk1} between adjacent bayonets. The driving mileage si of the short-term trip Gi can then be calculated, as shown in Equation (11).
s i = w = 1 k 1 l w
where w represents the distance between the w-th pair of adjacent bayonets in the short-term trip Gi, measured in kilometers.
(3)
The driving mileage si for all short-term trips of vehicle j within the statistical period T is calculated, and the total driving mileage Sj of vehicle j during the statistical period is then determined, as shown in Equation (12).
S j = i = 1 m s i

3.2.4. Calculation of Driving Time

Similar to Section 3.2.3, the short-term trips Gi of vehicle j for each day within the statistical period T are extracted. The driving time ts for each short-term trip of vehicle j is then calculated, and finally, the total operating time To and the number of days dj the vehicle was in operation during the statistical period are determined. The steps are as follows:
(1)
The number of days dj within the statistical period during which vehicle j has driving records, along with the corresponding short-term trips, is calculated. Days without vehicle passage records are marked as 0.
(2)
The passage times {ptime1, ptime2, …, ptimek} at the traffic surveillance bayonets for vehicle j during the i-th trip are extracted, and the driving time tsi for that trip is calculated, as shown in Equation (13).
t s i = p t i m e k p t i m e 1
(3)
The driving time for all short-term trips of vehicle j within the statistical period T is calculated, and the total operating time To of vehicle j during the statistical period is determined, as shown in Equation (14).
T o j = i = 1 m t s i

3.3. Identification Indicator Determination

The average daily mileage Sa, average daily operating time Ta, and operating-days ratio τ for private cars, confirmed unlicensed taxis, and compliant taxis in the training sample υ were calculated, with the distribution shown in Figure 2. The results show that Sa, Ta, and τ effectively characterize the spatiotemporal differences between private cars, confirmed unlicensed taxis, and compliant taxis. Most private cars are concentrated in segments with low average daily mileage and operating time, but their operating days are widely dispersed. Confirmed unlicensed taxis and compliant taxis exhibit a high degree of similarity, with most having significantly higher average daily mileage, operating time, and operating-days ratio, compared to private cars.
A variance analysis was conducted on the training sample υ, comparing private cars, confirmed unlicensed taxis, and compliant taxis across three dimensions. The analysis aimed to explore the extent to which these parameters differentiate the operational characteristics of different vehicle types, with results shown in Table 5. The results show that the average daily mileage for private cars is 21.926 km, while for confirmed unlicensed taxis and compliant taxis it is 175.291 km and 175.058 km, respectively. The average daily operating time for private cars is 0.738 h, while for confirmed unlicensed taxis and compliant taxis it is 8.125 h and 8.762 h, respectively. The operating-days ratio for private cars is 0.561, while for confirmed unlicensed taxis and compliant taxis it is 0.976 and 0.975, respectively. The data distribution for private cars across the three dimensions is significantly lower than that for both confirmed unlicensed taxis and compliant taxis. Therefore, this study uses Sa, Ta, and τ as identification indicators for detecting suspected unlicensed taxis.
Therefore, based on the basic definitions of the identification indicators, the identification indicator data for 252,332 vehicles were calculated. The results are shown in Table 6.

4. Identification Method

Binary Logistic Regression is a probabilistic model primarily used for regression analysis when the dependent variable is binary. The independent variables can be either categorical or continuous. Due to its advantages, including an explicit probability expression, fast solution speed, and high prediction accuracy, it is widely used in the field of traffic behavior analysis and prediction [30,31]. Based on Binary Logistic Regression, this study constructs an unlicensed-taxi-identification model using significant identification indicators. The influence of different indicators on the discrimination of unlicensed taxis is converted into a probability range between 0 and 1, enabling the detection and identification of unlicensed taxis.

4.1. Suspected-Unlicensed-Taxi-Identification Model

Assume that in the filtered bayonet vehicle-passage information, the vehicle usage type Y is a binary variable. When Y = 0, it represents a private car; when Y = 1, it represents an unlicensed taxi. Then P = P (Y = 1) represents the probability that the vehicle is an unlicensed taxi, and P(Y = 0) = 1 − P represents the probability that the vehicle is a private car. x1, x2, …, xm are m factors affecting the vehicle type. According to the binary Logistic model, the probabilities that the vehicle is an unlicensed taxi or a private car are given by Equations (15) and (16), respectively.
P = e β 0 + β 1 x 1 + β 2 x 2 + + β m x m 1 + e β 0 + β 1 x 1 + β 2 x 2 + + β m x m
P ( Y = 0 ) = 1 P = 1 1 + e β 0 + β 1 x 1 + β 2 x 2 + + β m x m
where β0 is the intercept, and β1, β2, …, βm are the partial regression coefficients.

4.2. Parameter Estimation

The parameters in the binary Logistic model are estimated using the Maximum Likelihood Estimation (MLE) method. This process involves first constructing the likelihood function and then determining the parameter estimates that maximize it. First, for the existing sample, the likelihood function is established as shown in Equation (17).
L = i = 1 n P i Y i ( 1 P i ) 1 Y i
where i = 1, 2, …, n represents the i-th vehicle sample; L is the likelihood function, with a range of [0, 1]; Yi is the value of the binary variable corresponding to vehicle i; and Pi is the probability that the vehicle is an unlicensed taxi.
The log-likelihood function of the likelihood function is given in Equation (18). By taking the first derivative of lnL setting it to zero, and then applying the Newton–Raphson iterative method to solve the resulting system of equations, the maximum likelihood estimates and standard errors of the parameters can be obtained.
ln L = i = 1 n [ Y i ln P i + ( 1 Y i ) ln ( 1 P i ) ]
In this study, IBM SPSS Statistics 26 was used to perform binary Logistic regression modeling based on the training sample υ consisting of private cars and unlicensed taxis. The calculation and analysis results of various statistics are shown in Table 7.
The unlicensed-taxi-identification model is obtained as shown in Equation (19), where P represents the probability that the vehicle is an unlicensed taxi. When P ≥ 0.5, the vehicle is considered a suspected unlicensed taxi. x1, x2, and x3 represent the vehicle’s average daily mileage, average daily operating time, and operating-days ratio, respectively.
P = e 11.885 + 0.027 x 1 + 1.488 x + 5.409 x 3 1 + e 11.885 + 0.027 x 1 + 1.488 x + 5.409 x 3

4.3. Model Validation

The Hosmer–Lemeshow test assesses the degree of agreement between the fitted values and the observed values. A significance level greater than 0.05 indicates that the model passes the Hosmer–Lemeshow test. As shown in the Hosmer–Lemeshow test results in Table 8, the significance level is 0.328, which is greater than 0.05, indicating that the binary Logistic model has a good fit.

4.4. Model Performance and Predictive Ability Evaluation

The goodness-of-fit evaluation of the model only suggests that the information regarding private cars and unlicensed taxis in the training sample has been adequately captured, which does not necessarily indicate that the model has strong predictive performance. Therefore, an evaluation of the model’s predictive power is also necessary [32]. Therefore, this study employs recall (R), precision (P), accuracy (ACC), and F1 score (F1) as performance evaluation metrics, and plots the ROC curve to assess the model’s predictive ability.
The formulas for calculating the model performance metrics based on the confusion matrix are presented in Equations (13)–(16). Specifically, TP refers to the number of samples that are actually positive and predicted as positive, i.e., the number of unlicensed taxis correctly identified as unlicensed taxis; FP refers to the number of samples that are actually negative but predicted as positive, i.e., the number of private cars incorrectly predicted as unlicensed taxis; FN refers to the number of samples that are actually positive but predicted as negative, i.e., the number of unlicensed taxis incorrectly predicted as private cars; and TN refers to the number of samples that are actually negative and predicted as negative, i.e., the number of private cars correctly predicted as private cars.
R = T P T P + F N
P = T P T P + F P
A C C = T P + T N T P + T N + F P + F N
F 1 = 2 P R P + R
The confusion matrix obtained from the experimental results is shown in Table 9. The model’s performance-evaluation metrics are 89.26%, 94.74%, 99.10%, and 91.91%, respectively, indicating that the model can effectively distinguish between private cars and unlicensed taxis. Moreover, the model’s accuracy is 99.10%, which is higher than that of related studies [9,14,16,18,19], further demonstrating the high recognition performance of the unlicensed-taxi-identification model constructed using binary Logistic regression.
When evaluating the predictive ability of the unlicensed-taxi-identification model using the ROC curve, the greater the curve’s bulge toward the top-left corner and the closer the area under the curve (AUC) value is to 1, the better the model’s predictive performance. The model’s predicted probabilities are used as test variables, and private cars and unlicensed taxis in the training sample are used as status variables for ROC analysis. The results are shown in Figure 3. As shown in the figure, the AUC is 0.994, indicating that the unlicensed-taxi-identification model has a strong predictive capability for vehicle attributes.

5. Empirical Analysis

5.1. Identification of Suspected Unlicensed Taxis

Based on the unlicensed-taxi-identification model, Figure 4 presents the probability distribution characteristics of 252,332 vehicles engaging in unlicensed-taxi activities. As shown in Figure 4, 251,802 vehicles have a low probability (P < 50%) of engaging in unlicensed-taxi activities, while 530 vehicles have a high probability (P > 50%) of engaging in such activities.
Vehicles with a high probability of engaging in unlicensed-taxi activities are defined as suspected unlicensed taxis, as shown in Table 10.

5.2. Operational Characteristic Analysis

5.2.1. Mileage Characteristics

This study analyzes the mileage characteristics among private cars, suspected unlicensed taxis, and compliant taxis. A total of 99.31% of private cars have an average daily mileage of less than 80 km, while 80.00% of suspected unlicensed taxis and 90.55% of compliant taxis have an average daily mileage of more than 80 km. Overall, compliant taxis have the highest average daily mileage, followed by suspected unlicensed taxis, with private cars having the lowest, as shown in Figure 5.
Considering the practical context, this phenomenon can be explained with regard to the following two aspects. (1) Private cars are primarily used for daily commuting and other personal activities, resulting in lower mileage. In contrast, suspected unlicensed taxis and compliant taxis, as commercial vehicles, require long-term operation for passenger transportation, leading to higher mileage. (2) Compared to suspected unlicensed taxis, compliant taxis not only possess legal operating licenses, but also enjoy greater visibility, which results in more ride orders. Therefore, the overall mileage of compliant taxis is higher than that of suspected unlicensed taxis.

5.2.2. Operating-Time Characteristics

To explore the differences in time distribution among different types of vehicles, this study analyzes the average operating days, average daily operating time, and operating periods within each day, for different vehicle types.
Figure 6a shows the average operating days of different types of vehicles over a 10-day statistical period. As shown in the figure, in the interval where the average operating days are no less than 9 days, suspected unlicensed taxis account for 94.91%, compliant taxis account for 95.25%, and private cars account for 31.04%. Therefore, the average operating days of compliant taxis and suspected unlicensed taxis are similar, both concentrated in the higher range of operating days, while the distribution of operating days for private cars is more even.
Figure 6b shows the average daily operating time of different types of vehicles over a 10-day statistical period. Compared to suspected unlicensed taxis and compliant taxis, private cars have the highest number of vehicles with an average daily operating time of 0 to 2 h, accounting for 98.86% of all private cars, and a maximum operating time of no more than 5 h. The daily operating time of suspected unlicensed taxis mainly ranges from 1 to 10 h, with the majority concentrated in the 3 to 4 h range, accounting for 33.40%. Compliant taxis have a wider range of average daily operating times, ranging from 1 to 15 h.
Figure 7 shows the distribution of daily operating periods for different types of vehicles over a 10-day statistical period. As shown in the figure, the main operating period for private cars and suspected unlicensed taxis is from 7:00 to 21:00 each day, while the main operating period for compliant taxis is from 7:00 to 24:00. During the secondary operating periods (0:00 to 7:00 and 21:00 to 24:00) for private cars and suspected unlicensed taxis, the distribution proportion of suspected unlicensed taxis and compliant taxis is higher than that of private cars. Additionally, the distribution of all three types of vehicles during the morning and evening peak hours (7:00 to 9:00 and 17:00 to 20:00) is higher than at other times, with private cars showing the most significant peak.
In summary, in terms of operating time, suspected unlicensed taxis exhibit characteristics such as a higher number of operating days, longer daily operating times, and a higher proportion of night-time operation, which are similar to those of compliant taxis. In contrast, private cars display lower levels of these characteristics. This is consistent with real-world situations: private cars typically have shorter daily driving times and mainly operate during the day, whereas both compliant taxis and unlicensed taxis need to operate continuously throughout the day and night to transport passengers and earn higher income.

5.3. Spatiotemporal-Distribution Characteristics Analysis

5.3.1. Overall Spatial-Distribution Characteristics

The main operating areas of suspected unlicensed taxis in this city are shown in Figure 8. Figure 8a shows the overall distribution of operating areas for suspected unlicensed taxis in this City. They are mainly concentrated in the main urban area, with the highest density in the central urban area. Figure 8b further analyzes the distribution characteristics of vehicle operations within the main urban area. As shown by the circled regions in the figure, suspected unlicensed taxis are primarily distributed around stations, universities, parks, and commercial areas within the main urban area. This distribution aligns with the common destinations of passengers, suggesting that management authorities should intensify inspections in these areas.

5.3.2. Temporal- and Spatial-Distribution Characteristics of Travel Origins

This study selects a suspected unlicensed taxi and records its first- and last-passage information daily at traffic surveillance bayonets, to examine the temporal and spatial distribution characteristics of the taxi’s travel origins.
During the statistical period, the distribution of traffic surveillance bayonets where the suspected unlicensed taxi passes first and last each day is shown in Figure 9. It can be observed that the first bayonet the taxi passes each day is not unique, but rather, one of three closely spaced bayonets: A, B, or C. However, the last bayonet the taxi passes each day is fixed, and is always bayonet D. Based on the layout of the city’s traffic surveillance bayonets, it can be inferred that bayonets A, B, C, and D are positioned at closely spaced intersections along adjacent road segments. The suspected unlicensed taxi’s daily travel origin and destination points show a high degree of overlap, indicating that this location is likely the driver’s residence. Management authorities can leverage this information, along with the driver’s daily routine, to implement targeted regulation.
Figure 10 shows the temporal distribution of traffic surveillance bayonets where the suspected unlicensed taxi has its first and last passes during the statistical period. As shown in the figure, this suspected unlicensed taxi operated every day during the statistical period, with a wide range of start and end times each day. Additionally, this suspected unlicensed taxi shows a clear concentration in both start and end times, with start times concentrated between 7:40 and 8:00, and end times concentrated between 22:40 and 23:00.
In summary, a targeted enforcement plan can be developed, based on the spatiotemporal-distribution characteristics of suspected unlicensed taxis, to verify whether they are engaged in illegal operations and to ensure precise enforcement.
On the one hand, based on the temporal- and spatial-distribution characteristics of travel origins, traffic management authorities should identify the periods corresponding to the first and last passages of suspected unlicensed taxis through traffic surveillance bayonets, daily. Inspection checkpoints should be preemptively established at intersections or residential areas likely to be traversed by these vehicles. Since the data on suspected unlicensed taxis include vehicle license-plate information, which can be used to retrieve details such as vehicle type and driver identity, precise monitoring can be conducted by comparing whether passing vehicles correspond to the data of the suspected unlicensed taxis. On the other hand, traffic management authorities can utilize the location and time data from the end of the passage record. Based on this, they can retrieve nearby video surveillance footage to verify whether the suspected unlicensed taxi was involved in passenger transport, thereby determining whether it is truly engaged in illegal operations.
The use of the above method enables precise identification and regulation of suspected unlicensed taxis, effectively improving law enforcement efficiency and reducing the costs associated with identifying and managing unlicensed taxis.

6. Conclusions

This study employs traffic surveillance bayonets to collect vehicle passage information and analyzes the spatiotemporal characteristics of private cars, confirmed unlicensed taxis, and compliant passenger vehicles. Based on this analysis, identification indicators are established, and vehicle-operation-characteristic metrics are calculated using the aforementioned method. An unlicensed-taxi-identification model is then applied to obtain data on suspected unlicensed taxis, enabling traffic management authorities to develop targeted regulatory strategies for these vehicles. The main conclusions of the study are as follows:
(1)
Based on traffic-surveillance-bayonet data, a distance matrix was constructed and a driving interval threshold was set. The mileage and operating time data were calculated, and variance analysis was conducted to compare the differences between private cars, unlicensed taxis, and compliant taxis, leading to the determination of unlicensed-taxi-identification indicators.
(2)
Based on the identified unlicensed-taxi indicators, a binary Logistic regression model was established. The model parameters were estimated using the maximum likelihood method, and the model’s goodness-of-fit and predictive power were evaluated through Hosmer–Lemeshow tests and ROC curve analysis. The results show that the model can effectively predict the likelihood of a vehicle engaging in unlicensed-taxi activities (R = 89.26%, P = 94.74%, ACC = 99.10%, F1 = 91.91%, AUC = 0.994).
(3)
Using the information from the identified suspected unlicensed taxis, an analysis of their daily start and end times and location distribution was conducted to provide a basis for precise management by traffic authorities. The results show that the operational characteristics of suspected unlicensed taxis differ from those of private cars, with regular patterns in their daily start and end times and location distributions.
The study provides precise data support for traffic management authorities, enhancing law enforcement efficiency and contributing to improved passage safety. Furthermore, the study offers a foundation for formulating more rational traffic management policies and promoting the standardization and sustainable development of the transport industry. However, there is still room for further development. Future research will build upon existing studies and the results from traffic management authorities to explore the characteristics of groups involved in illegal operations. This will inform the development of a preventive regulatory identification method for unlicensed taxis, facilitating proactive supervision during processes such as new vehicle purchases and second-hand vehicle transactions.

7. Patents

The work related in this study has obtained a Chinese patent, grant number CN202111474233.1.

Author Contributions

Conceptualization, Y.X. and R.L.; methodology, Y.X. and R.L.; software, Y.X. and J.L.; validation, R.L. and J.L.; formal analysis, Y.X. and R.L.; investigation, R.L. and J.L.; resources, Y.X.; data curation, Y.X. and R.L.; writing—original draft preparation, R.L. and J.L.; writing—review and editing, Y.X., and R.L.; visualization, R.L. and J.L.; supervision, Y.X. and R.L.; project administration, Y.X., R.L. and J.L.; funding acquisition, Y.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Science Foundation of Anhui Province, grant number 2208085ME147; the Key Project of the Natural Science Foundation of Universities in Anhui Province, grant number 2022AH051800; and the 2024 Independent Innovation Project of Anhui Public Meteorological Service Center, grant number GFCX202411.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used or analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Hu, Y.; Liu, L.; Xie, Y. Intelligent Recognition Model and its Mechnism for Illiagle Taxis Based on Spatio-Temporal Features. Transp. Energy Conserv. Environ. Prot. 2022, 18, 64–69. [Google Scholar]
  2. Liu, R.; Li, Y.; Mao, H.; Qian, J.; Wang, J.; Ma, Y. Short-Term Traffic Flow Forecasting at Intersections Based on Frequent Sequence Pattern Mining. Urban Transp. China 2023, 21, 87–98. [Google Scholar]
  3. Wang, M.; Ding, H.; Li, L.; Xu, R.; Liu, Y. Analysis of non-local vehicle travel and technical parameters time-varying characteristics based on video bayonet data. Acta Sci. Nat. Univ. Sunyatseni 2023, 62, 106–114. [Google Scholar]
  4. Chen, H.; Yang, C.; Xu, X. Clustering Vehicle Temporal and Spatial Travel Behavior Using License Plate Recognition Data. J. Adv. Transp. 2017, 2017, 1738085. [Google Scholar] [CrossRef]
  5. Wei, Q.; Wang, J.; Mo, J.; Chen, C. Travel Characteristics Analysis Based on Bayonet System. Traffic Transp. 2019, 35, 33–36. [Google Scholar]
  6. Long, X.; Su, Y.; Yu, C.; Wu, D. Analyzing Methods of Vehicle’ Travel Using Plate Recognition Data. J. Transp. Syst. Eng. Inf. Technol. 2019, 19, 66–72. [Google Scholar]
  7. Ruan, S.; Wang, F.; Ma, D.; Jin, S.; Wang, D. Vehicle trajectory extraction algorithm based on license plate recognition data. J. Zhejiang Univ. 2018, 52, 836–844. [Google Scholar]
  8. Yao, W.; Zhang, M.; Jin, S.; Ma, D. Understanding vehicles commuting pattern based on license plate recognition data. Transp. Res. Part C 2021, 128, 103142. [Google Scholar] [CrossRef]
  9. Wu, H.; Yang, J.; Yuan, M.; Li, X. Heuristic Optimal Scheduling for Road Traffic Incident Detection Under Computational Constraints. Sensors 2024, 24, 7221. [Google Scholar] [CrossRef]
  10. Lin, J.; Wang, L. Design and Implementation of Illegal Operating Vehicles’ Reporting System Based on NFC and Android. Tech. Autom. Appl. 2018, 37, 119–122+138. [Google Scholar]
  11. Wang, N.; Duan, P.; Wu, L. Research of illegal operation vehicle identification based on convolutional neural network. J. Comput. Appl. 2016, 36, 193–196. [Google Scholar]
  12. Li, R.; Xiao, Y. Identification Method for Unlicensed Taxis in Urban Areas: A Case Study. J. Urban Dev. Manag. 2023, 2, 22–33. [Google Scholar] [CrossRef]
  13. Shuai, D.; Lan, Z.; Li, Y. Recognising Illegal Operation Vehicles Based on K-mediods and ITS Improved Algorithm. Comput. Appl. Softw. 2016, 33, 154–157+211. [Google Scholar]
  14. Ma, S.; Sun, C.; Zhu, J. Application of SOM Neural Network in the Identification of Illegal Operating Vehicles. Inf. Comput. 2017, 83–85. [Google Scholar]
  15. Tian, Q.; Li, Q.; Yu, J.; Yang, C.; Su, J. Research on the Characteristics of Illegal Passenger Vehicles Based on ETC Toll Pipeline Data. Logist. Eng. Manag. 2023, 45, 124–128+161. [Google Scholar]
  16. Zhao, Q.; Jiang, T.; Zhao, F.; Ma, B. Abnormal vehicle detection based on spatio-temporal big data. Transducer Microsyst. Technol. 2019, 38, 139–142. [Google Scholar]
  17. Chen, L.; Zheng, L.; Xia, L.; Liu, W.; Sun, D. Detecting and analyzing unlicensed taxis: A case study of Chongqing City. Phys. A Stat. Mech. Its Appl. 2021, 584, 126324. [Google Scholar] [CrossRef]
  18. Yuan, W.; Deng, P.; Taleb, T.; Wan, J.; Bi, C. An Unlicensed Taxi Identification Model Based on Big Data Analysis. Ieee Trans. Intell. Transp. Syst. 2016, 17, 1703–1713. [Google Scholar] [CrossRef]
  19. Wang, Y.; Fan, X.; Liu, X.; Zheng, C.; Chen, L.; Wang, C.; Li, J. Unlicensed Taxis Detection Service Based on Large-Scale Vehicles Mobility Data. In Proceedings of the 2017 IEEE International Conference on Web Services (ICWS), Honolulu, HI, USA, 25–30 June 2017; IEEE: Piscataway, NJ, USA, 2017. [Google Scholar]
  20. Tian, Y.; Yang, J.; Lu, P. Unlicensed Taxi Detection Algorithm Based on Traffic Surveillance Data. In Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management, Chicago, IL, USA, 5 November 2019; Association for Computing Machinery: Broadway, NY, USA, 2019. [Google Scholar]
  21. Huang, Z.; Li, Q. Research on Identification of Urban Illegal Vehicles Based on Random Forest Model. Electron. Sci. Technol. 2024, 37, 66–71. [Google Scholar]
  22. Molina-Campoverde, J.J.; Rivera-Campoverde, N.; Molina Campoverde, P.A.; Bermeo Naula, A.K. Urban Mobility Pattern Detection: Development of a Classification Algorithm Based on Machine Learning and GPS. Sensors 2024, 24, 3884. [Google Scholar] [CrossRef] [PubMed]
  23. Crouzil, A.; Khoudour, L.; Valiere, P.; Cong, D.N.T. Automatic vehicle counting system for traffic monitoring. J. Electron. Imaging 2016, 25, 051207. [Google Scholar] [CrossRef]
  24. Malinovskiy, Y.; Wu, Y.; Wang, Y. Video-Based Vehicle Detection and Tracking Using Spatiotemporal Maps. Transp. Res. Rec. 2009, 2121, 81–89. [Google Scholar] [CrossRef]
  25. Xia, L.; Hu, X.; Wang, J. Anomaly detection in traffic surveillance with sparse topic model. J. Cent. South. Univ. 2018, 25, 2245–2257. [Google Scholar] [CrossRef]
  26. Zou, B.; Yu, Z.; He, Z.; Huang, M.; Chen, K.; Sha, Z. The characteristics of urban road usage using automatic vehicle identification data. Acta Sci. Nat. Univ. Sunyatseni 2022, 61, 132–139. [Google Scholar]
  27. Zhao, P.; Wu, G.; Wang, X.; Wang, S.; Zan, Y. A Data-driven Method for Identifying Congestion State and Selecting Guided Vehicles for Urban Expressways. J. Transp. Inf. Saf. 2021, 39, 82–90. [Google Scholar]
  28. Lin, P.; Weng, J.; Hu, S.; Jing, Y.; Yin, B. Day-to-day Similarity of Individual Activity Chain of Public Transport Passengers. J. Transp. Syst. Eng. Inf. Technol. 2020, 20, 178–183+204. [Google Scholar]
  29. Fu, F.; Fang, Y.; Dong, H.; Chen, W. Automation of Electric Power Systems. Autom. Electr. Power Syst. 2018, 42, 72–80. [Google Scholar]
  30. Phattarasuda, W.; Surachet, P.; Kunnawee, K.; Kazushi, S.; Suksun, H. Influential Factors Affecting Travelers’ Mode Choice Behavior on Mass Transit in Bangkok, Thailand. Sustainability 2020, 12, 9522. [Google Scholar] [CrossRef]
  31. Zhang, F.; Ji, Y.; Lv, H.; Ma, X. Analysis of factors influencing delivery e-bikes’ red-light running behavior: A correlated mixed binary logit approach. Accid. Anal. Prev. 2021, 152, 105977. [Google Scholar] [CrossRef] [PubMed]
  32. Li, J.; Nie, Q.; Liu, Q.; Jia, X. Logistic regression model for stability assessment of karst ground collapse of Tangshan city. Rock. Soil Mech. 2017, 38, 250–256. [Google Scholar]
Figure 1. Calculation process of vehicle operational characteristic indicators.
Figure 1. Calculation process of vehicle operational characteristic indicators.
Sensors 24 08206 g001
Figure 2. Distribution characteristics of the training sample.
Figure 2. Distribution characteristics of the training sample.
Sensors 24 08206 g002
Figure 3. ROC curve of unlicensed-taxi-identification model.
Figure 3. ROC curve of unlicensed-taxi-identification model.
Sensors 24 08206 g003
Figure 4. Probability distribution of vehicles engaging in unlicensed-taxi activities.
Figure 4. Probability distribution of vehicles engaging in unlicensed-taxi activities.
Sensors 24 08206 g004
Figure 5. Distribution of average daily mileage for three types of vehicles.
Figure 5. Distribution of average daily mileage for three types of vehicles.
Sensors 24 08206 g005
Figure 6. Operating-time-characteristic distribution: (a) operating days; (b) average daily operating time.
Figure 6. Operating-time-characteristic distribution: (a) operating days; (b) average daily operating time.
Sensors 24 08206 g006
Figure 7. Distribution characteristics of operating time periods within a day.
Figure 7. Distribution characteristics of operating time periods within a day.
Sensors 24 08206 g007
Figure 8. Main operating areas of suspected unlicensed taxis: (a) overall distribution; (b) operating hotspot areas.
Figure 8. Main operating areas of suspected unlicensed taxis: (a) overall distribution; (b) operating hotspot areas.
Sensors 24 08206 g008
Figure 9. Distribution of traffic surveillance bayonets passed by the suspected unlicensed taxi each day: (a) first pass; (b) last pass.
Figure 9. Distribution of traffic surveillance bayonets passed by the suspected unlicensed taxi each day: (a) first pass; (b) last pass.
Sensors 24 08206 g009
Figure 10. Temporal distribution of traffic surveillance bayonets passed by the suspected unlicensed taxi during the statistical period: (a) start time; (b) end time.
Figure 10. Temporal distribution of traffic surveillance bayonets passed by the suspected unlicensed taxi during the statistical period: (a) start time; (b) end time.
Sensors 24 08206 g010
Table 1. Types and meanings of traffic-surveillance-bayonet vehicle passage data.
Table 1. Types and meanings of traffic-surveillance-bayonet vehicle passage data.
NumberData TypeExampleMeaning
1pnum1832926172The count of vehicles passing through traffic surveillance bayonet
2vnumWan F****1Vehicle license plate number
3vncolBlueVehicle license plate color
4tscnum34060300001190110040Traffic surveillance bayonet number
5ptime2021/12/1 6:59:05The time the vehicle passes through the traffic surveillance bayonet
6pd9.0Vehicle driving direction
7pspeed46The speed of the vehicle passing through the traffic surveillance bayonet
Table 2. Types and meanings of traffic-surveillance-bayonet location information data.
Table 2. Types and meanings of traffic-surveillance-bayonet location information data.
NumberData TypeExampleMeaning
1tscnum340621000011901401Traffic-surveillance-bayonet number
2lon116.764Longitude of the traffic surveillance bayonet
3lat33.922Latitude of the traffic surveillance bayonet
Table 3. Type of driving data error.
Table 3. Type of driving data error.
Error Identification InformationCause of Error
05377Incomplete License Plate Information
0B70641Misidentification of Leading “0” in Number
Table 4. Adjacent-traffic-surveillance distance matrix.
Table 4. Adjacent-traffic-surveillance distance matrix.
Serial Numbertsc1tsc2tscitscm
tsc10l12l1il1m
tsc2l210l2il2m
tscili1li20lim
tscmlm1lm2lm30
Table 5. Variance analysis of identification indicators.
Table 5. Variance analysis of identification indicators.
MeanStandard DeviationFSigPost Hoc Test
Saprivate cars21.926 14.571 4328.4000.00I < II;
I < III
confirmed unlicensed taxis175.291 79.383
compliant taxis175.058 71.649
Taprivate cars0.738 0.426 4404.2170.00I < II;
I < III
confirmed unlicensed taxis8.125 3.875
compliant taxis8.762 3.776
τprivate cars0.561 0.342 1341.8890.00I < II;
I < III
confirmed unlicensed taxis0.976 0.078
compliant taxis0.975 0.094
Note: In the post hoc test, I represents private cars, II represents unlicensed taxis, and III represents compliant taxis.
Table 6. Filtered vehicle-identification-indicator data.
Table 6. Filtered vehicle-identification-indicator data.
vnumSa/kmTa/hτ
Wan F****9310.02712.4641
Wan F****2302.63115.6821
Wan L****51.0000.4100.8
Table 7. Maximum-likelihood estimation results.
Table 7. Maximum-likelihood estimation results.
Variables.BS.EWalddfSigExp (B)
Sa0.0270.0143.93610.0471.027
Ta1.4880.36116.99710.0004.430
τ5.4091.9167.97310.005223.487
Constant−11.8851.91638.46110.0000.000
Table 8. Hosmer–Lemeshow test.
Table 8. Hosmer–Lemeshow test.
StepsChi-SquaredfSig
19.17580.328
Table 9. Confusion matrix results.
Table 9. Confusion matrix results.
CategoryPredicted Values
Unlicensed TaxisPrivate Cars
Actual valuesUnlicensed taxis10813
Private cars61994
Table 10. Information on suspected unlicensed taxis.
Table 10. Information on suspected unlicensed taxis.
vnumSa/kmTa/hτ
Wan F****2302.63115.6821
Wan F****0297.36515.4921
Wan F****276.4432.9561
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xiao, Y.; Li, R.; Li, J. A Spatiotemporal Feature-Based Approach for the Detection of Unlicensed Taxis in Urban Areas. Sensors 2024, 24, 8206. https://doi.org/10.3390/s24248206

AMA Style

Xiao Y, Li R, Li J. A Spatiotemporal Feature-Based Approach for the Detection of Unlicensed Taxis in Urban Areas. Sensors. 2024; 24(24):8206. https://doi.org/10.3390/s24248206

Chicago/Turabian Style

Xiao, Yun, Rongqiao Li, and Jinyan Li. 2024. "A Spatiotemporal Feature-Based Approach for the Detection of Unlicensed Taxis in Urban Areas" Sensors 24, no. 24: 8206. https://doi.org/10.3390/s24248206

APA Style

Xiao, Y., Li, R., & Li, J. (2024). A Spatiotemporal Feature-Based Approach for the Detection of Unlicensed Taxis in Urban Areas. Sensors, 24(24), 8206. https://doi.org/10.3390/s24248206

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop