Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Multivariate hierarchical DBSCAN model for enhanced maritime data analytics

Published: 02 July 2024 Publication History

Abstract

Clustering is an important data analytics technique and has numerous use cases. It leads to the determination of insights and knowledge which would not be readily discernible on routine examination of the data. Enhancement of clustering techniques is an active field of research, with various optimisation models being proposed. Such enhancements are also undertaken to address particular issues being faced in specific applications. This paper looks at a particular use case in the maritime domain and how an enhancement of Density-Based Spatial Clustering of Applications with Noise (DBSCAN) clustering results in the apt use of data analytics to solve a real-life issue. Passage of vessels over water is one of the significant utilisations of maritime regions. Trajectory analysis of these vessels helps provide valuable information, thus, maritime movement data and the knowledge extracted from manipulation of this data play an essential role in various applications, viz., assessing traffic densities, identifying traffic routes, reducing collision risks, etc. Optimised trajectory information would help enable safe and energy-efficient green operations at sea and assist autonomous operations of maritime systems and vehicles. Many studies focus on determining trajectory densities but miss out on individual trajectory granularities. Determining trajectories by using unique identities of the vessels may also lead to errors. Using an unsupervised DBSCAN method of identifying trajectories could help overcome these limitations. Further, to enhance outcomes and insights, the inclusion of temporal information along with additional parameters of Automatic Identification System (AIS) data in DBSCAN is proposed. Towards this, a new design and implementation for data analytics called the Multivariate Hierarchical DBSCAN method for better clustering of Maritime movement data, such as AIS, has been developed, which helps determine granular information and individual trajectories in an unsupervised manner. It is seen from the evaluation metrics that the performance of this method is better than other data clustering techniques.

References

[1]
Madhavan, N.,. 2018. Maritime security–are we reaching a tipping point?. https://www.linkedin.com/pulse/maritime-security-we-reaching-tipping-point-nitin-madhavan?trk=portfolio_article-card_title (accessed 23 April 2023).
[2]
IBM, 2023. AI-Powered Automation - Mayflower Autonomous Ship. https://www.ibm.com/cloud/automation/mayflower-autonomous-ship (accessed 23 April 2023).
[3]
Y. Wang, J. Liu, R.W. Liu, Y. Liu, Z. Yuan, Data-driven methods for detection of abnormal ship behavior: progress and trends, Ocean Eng. 271 (2023),.
[4]
FleetMon, 2023. AIS Spoofing. https://help.fleetmon.com/en/articles/4379768-ais-spoofing (accessed 23 April 2023).
[5]
Panbo, 2023. Invalid MMSI numbers, a real problem. https://panbo.com/invalid-mmsi-numbers-a-real-problem (accessed 23 April 2023).
[6]
WorkBoat, 2018. Coast guard warns of inaccurate AIS data. https://www.workboat.com/coastal-inland-waterways/coast-guard-warns-of-inaccurate-ais-data (accessed 23 April 2023).
[7]
B.V. Chowdary, Y. Radhika, A survey on applications of data mining techniques, Int. J. Appl. Eng. Res. 13 (2018) 5384–5392. https://www.ripublication.com/ijaer18/ijaerv13n7_112.pdf.
[8]
D. Janu, K. Singh, S. Kumar, Machine learning for cooperative spectrum sensing and sharing: a survey, Trans. Emerg. Telecommun. Technol. 33 (2022),.
[9]
M. Ester, H.P. Kriegel, J. Sander, X. Xu, A density-based algorithm for discovering clusters in large spatial databases with noise, in: Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining, 1996, pp. 226–231. https://dl.acm.org/doi/10.5555/3001460.3001507.
[10]
W. Jin, L. Yan, J. Jiang, An object association matching method based on V2I system, Int. J. Comput. Intell. Syst. 16 (2023) 132,.
[11]
W. Li, Y. Wang, Y. Shao, G. Hu, D. Li, TrackPuzzle: efficient registration of unlabeled PDR trajectories for learning indoor route graph, Future Gener. Comput. Syst. 149 (2023) 171–183,.
[12]
H. Li, Z. Yang, Towards safe navigation environment: the imminent role of spatio-temporal pattern mining in maritime piracy incidents analysis, Reliab. Eng. Syst. Saf. 238 (2023),.
[13]
Peng, W., Bai, X., Yang, D., Yuen, K.F., Wu, J., 2022. A deep learning approach for port congestion estimation and prediction. https://doi.org/10.1080/03088839.2022.2057608.
[14]
Z. Zhang, G. Ni, Y. Xu, Comparison of trajectory clustering methods based on K-means and DBSCAN, in: Proceedings of the IEEE International Conference on Big Data and Artificial Intelligence, ICIBA, 2020, pp. 557–561,. 2020.
[15]
Z. Chen, J. Guo, Q. Liu, DBSCAN algorithm clustering for massive AIS data based on the Hadoop platform, in: Proceedings of the International Conference on Industrial Informatics - Computing Technology, Intelligent Technology, Industrial Information Integration, ICIICII, 2018, pp. 25–28,. 2017.
[16]
Z. Deng, Z. Wang, T. Zhang, C. Zhang, W. Si, A novel density-based clustering method for effective removal of spurious intersections in bearings-only localization, EURASIP J. Adv. Signal Process. 2023 (2023) 1–21,.
[17]
Wang, X., 2022. Research on the clustering of marine traffic status based on AIS data using DBSCAN [WWW Document]. https://oacis.repo.nii.ac.jp/?action=repository_action_common_download&item_id=2593&item_no=1&attribute_id=20&file_no=1 (accessed 3.30.23).
[18]
I.M.O. Widyantara, I.P.N. Hartawan, A.A.I.N.E. Karyawati, N.I. Er, K.B. Artana, Automatic identification system-based trajectory clustering framework to identify vessel movement pattern, IAES Int. J. Artif. Intell. 12 (2023) 1–11,.
[19]
D. Zhang, Y. Zhang, C. Zhang, Data mining approach for automatic ship-route design for coastal seas using AIS trajectory clustering analysis, Ocean Eng. 236 (2021),.
[20]
C. Wang, G. Li, P. Han, O. Osen, H. Zhang, Impacts of COVID-19 on ship behaviours in port area: an AIS data-based pattern recognition approach, IEEE Trans. Intell. Transp. Syst. 23 (2022) 25127–25138,.
[21]
J.G. Lee, J. Han, K.Y. Whang, Trajectory clustering: a partition-and-group framework, in: Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007, pp. 593–604,.
[22]
M. Etemad, A. Soares, A. Hoseyni, J. Rose, S. Matwin, A trajectory segmentation algorithm based on interpolation-based change detection strategies, in: Proceedings of the CEUR Workshop, 2019, p. 2322.
[23]
V.F. Arguedas, F. Mazzarella, M. Vespe, Spatio-temporal data mining for maritime situational awareness, in: Proceedings of the MTS/IEEE Ocean, 2015,. 2015 - Genova Discov. Sustain. Ocean Energy a New World.
[24]
Y. Li, Y. Zhang, F. Zhu, The method of detecting AIS isolated information based on clustering and distance, in: Proceedings of the 2nd IEEE International Conference on Computer and Communications, 2016, pp. 870–873,.
[25]
G. Pallotta, M. Vespe, K. Bryan, Vessel pattern knowledge discovery from AIS data: a framework for anomaly detection and route prediction, Entropy 15 (2013) 2218–2245,.
[26]
V. Fernandez Arguedas, G. Pallotta, M. Vespe, Maritime traffic networks: from historical positioning data to unsupervised maritime traffic monitoring, IEEE Trans. Intell. Transp. Syst. 19 (2018) 722–732,.
[27]
H.T. Lee, J.S. Lee, H. Yang, I.S. Cho, An AIS data-driven approach to analyze the pattern of ship trajectories in ports using the DBSCAN algorithm, Appl. Sci. 11 (2021) 799,.
[28]
L. Liu, R. Shibasaki, Y. Zhang, N. Kosuge, M. Zhang, Y. Hu, Data-driven framework for extracting global maritime shipping networks by machine learning, Ocean Eng. 269 (2023),.
[29]
S.K. Zhang, G.Y. Shi, Z.J. Liu, Z.W. Zhao, Z.L. Wu, Data-driven based automatic maritime routing from massive AIS trajectories in the face of disparity, Ocean Eng. 155 (2018) 240–250,.
[30]
L. Zhao, G. Shi, J. Yang, An adaptive hierarchical clustering method for ship trajectory data based on DBSCAN algorithm, in: Proceedings of the IEEE 2nd International Conference on Big Data Analysis, ICBDA, 2017, pp. 329–336,. 2017.
[31]
N. Andrienko, G. Andrienko, Visual analytics of movement: an overview of methods, tools and procedures, Inf. Vis. 12 (2013) 3–24,.
[32]
F. Ren, Y. Han, S. Wang, H. Jiang, A novel high-dimensional trajectories construction network based on multi-clustering algorithm, EURASIP J. Wirel. Commun. Netw. (2022),. 2022.
[33]
Z. Liu, B. Zhang, M. Zhang, H. Wang, X. Fu, A quantitative method for the analysis of ship collision risk using AIS data, Ocean Eng. 272 (2023),.
[34]
M. Zhang, F. Conti, H. Le Sourne, D. Vassalos, P. Kujala, D. Lindroth, S. Hirdaris, A method for the direct assessment of ship collision damage and flooding risk in real conditions, Ocean Eng. 237 (2021),.
[35]
M. Marino, L. Cavallaro, E. Castro, R.E. Musumeci, M. Martignoni, F. Roman, E. Foti, New frontiers in the risk assessment of ship collision, Ocean Eng. 274 (2023),.
[36]
Z. Shi, R. Zhen, J. Liu, Fuzzy logic-based modeling method for regional multi-ship collision risk assessment considering impacts of ship crossing angle and navigational environment, Ocean Eng. 259 (2022),.
[37]
R. Zhen, Z. Shi, J. Liu, Z. Shao, A novel arena-based regional collision risk assessment method of multi-ship encounter situation in complex waters, Ocean Eng. 246 (2022),.
[38]
G. Spiliopoulos, K. Chatzikokolakis, D. Zissis, E. Biliri, D. Papaspyros, G. Tsapelas, S. Mouzakitis, Knowledge extraction from maritime spatiotemporal data: an evaluation of clustering algorithms on Big Data, in: Proceedings of the IEEE International Conference on Big Data, Big Data 2017, Institute of Electrical and Electronics Engineers Inc., 2017, pp. 1682–1687,.
[39]
D. Birant, A. Kut, ST-DBSCAN: an algorithm for clustering spatial-temporal data, Data Knowl. Eng. 60 (2007) 208–221,.
[40]
E. Cakmak, M. Plank, D.S. Calovi, A. Jordan, D. Keim, Spatio-temporal clustering benchmark for collective animal behavior, in: Proceedings of the 1st ACM SIGSPATIAL International Workshop on Animal Movement Ecology and Human Mobility, HANIMOB, 2021, 2021,.
[41]
Y. Ren, X. Liu, W. Liu, DBCAMM: a novel density based clustering algorithm via using the Mahalanobis metric, Appl. Soft Comput. J. 12 (2012) 1542–1554,.
[42]
M. Sangeetha, V. Padikkaramu, R.T. Chellan, A novel density based clustering algorithm by incorporating mahalanobis distance, Int. J. Intell. Eng. Syst. 11 (2018) 121–129,.
[43]
L. Wang, P. Chen, L. Chen, J. Mou, Ship AIS trajectory clustering: an HDBSCAN-based approach, J. Mar. Sci. Eng. 9 (2021),.
[44]
X. Xu, D. Cui, Y. Li, Y. Xiao, Research on ship trajectory extraction based on multi-attribute dbscan optimisation algorithm, Pol. Marit. Res. 28 (2021) 136–148,.
[45]
I. Kontopoulos, I. Varlamis, K. Tserpes, A distributed framework for extracting maritime traffic patterns, Int. J. Geogr. Inf. Sci. (2020) 1–26,.
[46]
X. Han, C. Armenakis, M. Jadidi, DBscan optimization for improving marine trajectory clustering and anomaly detection, Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. ISPRS Arch. 43 (2020) 455–461,.
[47]
W. Yan, R. Wen, A.N. Zhang, D. Yang, Vessel movement analysis and pattern discovery using density-based clustering approach, in: Proceedings of the IEEE International Conference on Big Data, Big Data 2016, 2016, pp. 3798–3806,.
[48]
B. Liu, E.N. De Souza, S. Matwin, M. Sydow, Knowledge-based clustering of ship trajectories using density-based approach, in: Proceedings of the IEEE International Conference on Big Data, IEEE Big Data 2014, 2015, pp. 603–608,.
[49]
C. Choi, S.Y. Hong, MDST-DBSCAN: a density-based clustering method for multidimensional spatiotemporal data, ISPRS Int. J. Geo-Inf. 10 (2021),.
[50]
P. Sheng, J. Yin, Extracting shipping route patterns by trajectory clustering model based on automatic identification system data, Sustainability 10 (2018),.
[51]
C. Huang, X. Qi, J. Zheng, R. Zhu, J. Shen, A maritime traffic route extraction method based on density-based spatial clustering of applications with noise for multi-dimensional data, Ocean Eng. 268 (2023),.
[52]
C. Liu, M. Musharraf, F. Li, P. Kujala, A data mining method for automatic identification and analysis of icebreaker assistance operation in ice-covered waters, Ocean Eng. 266 (2022),.
[53]
X. Xu, C. Liu, J. Li, Y. Miao, Trajectory clustering for SVR-based time of arrival estimation, Ocean Eng. 259 (2022),.
[54]
J. Yang, Y. Liu, L. Ma, C. Ji, Maritime traffic flow clustering analysis by density based trajectory clustering with noise, Ocean Eng. 249 (2022),.
[55]
UNSD_MM, 2020. Overview of AIS dataset - AIS Handbook - UN Statistics Wiki [WWW Document]. https://unstats.un.org/wiki/display/AIS/Overview+of+AIS+dataset (accessed 9.24.23).
[56]
AccessAIS - MarineCadastre.gov [WWW Document], 2023. https://marinecadastre.gov/accessais/ (accessed 9.24.23).
[57]
Manimaran, 2019. Clustering evaluation strategies. https://towardsdatascience.com/clustering-evaluation-strategies-98a4006fcfc (accessed 23 April 2023).

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Data & Knowledge Engineering
Data & Knowledge Engineering  Volume 150, Issue C
Mar 2024
247 pages

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 02 July 2024

Author Tags

  1. Data analytics
  2. DBSCAN
  3. Maritime situational awareness
  4. Maritime trajectories
  5. AIS
  6. Machine learning

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 26 Sep 2024

Other Metrics

Citations

Cited By

View all

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media