Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Methods for Analyzing Medical-Order Sequence Variants in Sequential Pattern Mining for Electronic Medical Record Systems

Published: 30 March 2023 Publication History

Abstract

Electronic medical record systems have been adopted by many large hospitals worldwide, enabling the recorded data to be analyzed by various computer-based techniques to gain a better understanding of hospital-based disease treatments. Among such techniques, sequential pattern mining, already widely used for data mining and knowledge discovery in other application domains, has shown great potential for discovering frequent patterns in sequences of disease treatments. However, studies have yet to evaluate the use of medical-order sequence variants, where a “frequent pattern” can include some limited variations to the pattern, or have considered the factors that lead to these variants. Such a study would be meaningful for medical tasks such as improving the quality of a particular treatment method, comparing treatments with multiple hospitals, recommending the best-suited treatment for each patient, and optimizing the running costs in hospitals. This article proposes methods for evaluating medical-order sequence variants and understanding variant factors based on a statistical approach. We consider the safety and efficiency of sequences and related information about the variants, such as gender, age, and test results from hospitals. Our proposal has been demonstrated as effective by experimentally evaluating an electronic medical record system’s real dataset and obtaining feedback from medical workers. The experimental results indicate that the medical treatment history and specimen test results after hospitalization are significant in identifying the factors that lead to variants.

References

[1]
Gomariz Antonio, Campos Manuel, Marin Roque, and Goethals Bart. 2013. ClaSP: An efficient algorithm for mining frequent closed sequences. In Proceedings of the 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 50–61.
[2]
Jay Ayres, Jason Flannick, Johannes Gehrke, and Tomi Yiu. 2002. Sequential pattern mining using a bitmap representation. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 429–435.
[3]
Chetna Chand, Amit Thakkar, and Amit Ganatra. 2012. Target oriented sequential pattern mining using recency and monetary constraints. International Journal of Computer Applications 45, 10 (2012), 12–18.
[4]
Lei Chang, Dongqing Yang, Shiwei Tang, and Tengjiao Wang. 2006. Mining compressed sequential patterns. In Proceedings of the 2nd International Conference on Advanced Data Mining and Applications. Springer, 761–768.
[5]
Zhengping Che, David Kale, Wenzhe Li, Mohammad Taha Bahadori, and Yan Liu. 2015. Deep computational phenotyping. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 507–516.
[6]
Enhong Chen, Huanhuan Cao, Qing Li, and Tieyun Qian. 2008. Efficient strategies for tough aggregate constraint-based sequential pattern mining. Information Sciences 178, 6 (2008), 1498–1518.
[7]
Ding-An Chiang, Yi-Fan Wang, Shao-Lun Lee, and Cheng-Jung Lin. 2003. Goal-oriented sequential pattern for network banking churn analysis. Expert Systems with Applications 25, 3 (2003), 293–302.
[8]
Edward Choi, Mohammad Taha Bahadori, Andy Schuetz, Walter F. Stewart, and Jimeng Sun. 2016. Doctor AI: Predicting clinical events via recurrent neural networks. In Proceedings of the 1st Machine Learning for Healthcare Conference. PMLR, 301–318.
[9]
Edward Choi, Mohammad Taha Bahadori, Elizabeth Searles, Catherine Coffey, Michael Thompson, James Bost, Javier Tejedor-Sojo, and Jimeng Sun. 2016. Multi-layer representation learning for medical concepts. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1495–1504.
[10]
Edward Choi, Mohammad Taha Bahadori, Le Song, Walter F. Stewart, and Jimeng Sun. 2017. GRAM: Graph-based attention model for healthcare representation learning. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 787–795.
[11]
Edward Choi, Andy Schuetz, Walter F. Stewart, and Jimeng Sun. 2017. Using recurrent neural network models for early detection of heart failure onset. Journal of the American Medical Informatics Association 24, 2 (2017), 361–370.
[12]
Pavlos Delias, Michael Doumpos, Evangelos Grigoroudis, Panagiotis Manolitzas, and Nikolaos Matsatsinis. 2015. Supporting healthcare management decisions via robust clustering of event logs. Knowledge-Based Systems 84 (2015), 203–213.
[13]
Philippe Fournier-Viger, Chao Cheng, Zhi Cheng, Jerry Chun-Wei Lin, and Nazha Selmaoui-Folcher. 2019. Mining significant trend sequences in dynamic attributed graphs. Knowledge-Based Systems 182 (2019), 104797.
[14]
Philippe Fournier-Viger, Antonio Gomariz, Manuel Campos, and Rincy Thomas. 2014. Fast vertical mining of sequential patterns using co-occurrence information. In Proceedings of the 18th Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 40–52.
[15]
Philippe Fournier-Viger, Antonio Gomariz, Ted Gueniche, Espérance Mwamikazi, and Rincy Thomas. 2013. TKS: Efficient mining of top-k sequential patterns. In Proceedings of the 9th International Conference on Advanced Data Mining and Applications. Springer, 109–120.
[16]
Philippe Fournier-Viger, Zhitian Li, Jerry Chun-Wei Lin, Rage Uday Kiran, and Hamido Fujita. 2019. Efficient algorithms to identify periodic patterns in multiple sequences. Information Sciences 489 (2019), 205–226.
[17]
Philippe Fournier-Viger, Peng Yang, Zhitian Li, Jerry Chun-Wei Lin, and Rage Uday Kiran. 2020. Discovering rare correlated periodic patterns in multiple sequences. Data & Knowledge Engineering 126 (2020), 101733.
[18]
Fabio Fumarola, Pasqua Fabiana Lanotte, Michelangelo Ceci, and Donato Malerba. 2016. CloFAST: Closed sequential pattern mining using sparse and vertical id-lists. Knowledge and Information Systems 48, 2 (2016), 429–463.
[19]
Wensheng Gan, Jerry Chun-Wei Lin, Jiexiong Zhang, Han-Chieh Chao, Hamido Fujita, and S. Yu Philip. 2020. ProUM: Projection-based utility mining on sequence data. Information Sciences 513 (2020), 222–240.
[20]
Wensheng Gan, Jerry Chun-Wei Lin, Jiexiong Zhang, Hongzhi Yin, Philippe Fournier-Viger, Han-Chieh Chao, and Philip S. Yu. 2021. Utility mining across multi-dimensional sequences. ACM Transactions on Knowledge Discovery from Data 15, 5 (2021), 1–24.
[21]
Wensheng Gan, Jerry Chun-Wei Lin, Jiexiong Zhang, and Philip S. Yu. 2020. Utility mining across multi-sequences with individualized thresholds. ACM Transactions on Data Science 1, 2 (2020), 1–29.
[22]
Shoji Hirano and Shusaku Tsumoto. 2013. Clustering of order sequences based on the typicalness index for finding clinical pathway candidates. In Proceedings of the 13th International Conference on Data Mining Workshop. IEEE, 206–210.
[23]
Yu Hirate and Hayato Yamana. 2006. Generalized sequential pattern mining with item intervals. Journal of Computers 1, 3 (2006), 51–60.
[24]
Andreas Holzinger, Chris Biemann, Constantinos S. Pattichis, and Douglas B. Kell. 2017. What do we need to build explainable AI systems for the medical domain? arXiv:1712.09923. Retrieved from https://arxiv.org/abs/1712.09923.
[25]
Andreas Holzinger, Georg Langs, Helmut Denk, Kurt Zatloukal, and Heimo Müller. 2019. Causability and explainability of artificial intelligence in medicine. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 9, 4 (2019), e1312.
[26]
Andreas Holzinger and Heimo Müller. 2021. Toward human–AI interfaces to support explainability and causability in medical AI. Computer 54, 10 (2021), 78–86.
[27]
Yuichi Honda, Muneo Kushima, Tomoyoshi Yamazaki, Kenji Araki, and Haruo Yokota. 2017. Detection and visualization of variants in typical medical treatment sequences. In Proceedings of the 3rd VLDB Workshop on Data Management and Analytics for Medicine and Healthcare. Springer, 88–101.
[28]
Gengsen Huang, Wensheng Gan, and Philip S. Yu. 2022. TaSPM: Targeted sequential pattern mining. arXiv:2202.13202. Retrieved from https://arxiv.org/abs/2202.13202.
[29]
Zhengxing Huang, Xudong Lu, and Huilong Duan. 2012. On mining clinical pathway patterns from medical behaviors. Artificial Intelligence in Medicine 56, 1 (2012), 35–50.
[30]
Han Jiawei, Pei Jian, Mortazavi-Asl Behzad, Pinto Helen, Chen Qiming, Dayal Umeshwar, and Hsu Mei-Chun. 2001. PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth. In Proceedings of the 17th International Conference on Data Engineering. 215–224.
[31]
Bo Jin, Chao Che, Zhen Liu, Shulong Zhang, Xiaomeng Yin, and Xiaopeng Wei. 2018. Predicting the risk of heart failure with EHR sequential data modeling. IEEE Access 6 (2018), 9256–9261.
[32]
David C. Kale, Zhengping Che, Mohammad Taha Bahadori, Wenzhe Li, Yan Liu, and Randall Wetzel. 2015. Causal phenotype discovery via deep networks. In Proceedings of the AMIA Annual Symposium, Vol. 2015. American Medical Informatics Association, 677–686.
[33]
Takako Kanakubo and Hadi Kharrazi. 2019. Comparing the trends of electronic health record adoption among hospitals of the united states and japan. Journal of Medical Systems 43, 223 (2019), 110–122.
[34]
Hye-Chung Kum, Jian Pei, Wei Wang, and Dean Duncan. 2003. ApproxMAP: Approximate mining of consensus sequential patterns. In Proceedings of the 3rd SIAM International Conference on Data Mining. SIAM, 311–315.
[35]
Hoang Thanh Lam, Fabian Mörchen, Dmitriy Fradkin, and Toon Calders. 2014. Mining compressing sequential patterns. Statistical Analysis and Data Mining: The ASA Data Science Journal 7, 1 (2014), 34–52.
[36]
Hieu Hanh Le, Henrik Edman, Yuichi Honda, Muneo Kushima, Tomoyoshi Yamazaki, Kenji Araki, and Haruo Yokota. 2017. Fast generation of clinical pathways including time intervals in sequential pattern mining on electronic medical record systems. In Proceedings of the 4th International Conference on Computer Science and Computational Intelligent. 1726–1731.
[37]
Hieu Hanh Le, Yutaka Horino, Tomoyoshi Yamazaki, Kenji Araki, and Haruo Yokota. 2021. Sequential pattern mining of large combinable items with values for a set-of-items recommendation. In Proceedings of the 34th IEEE Symposium on Computer-based Medical Systems. IEEE, 56–61.
[38]
Hieu Hanh Le, Tatsuhiro Yamada, Yuichi Honda, Masaaki Kayahara, Muneo Kushima, Kenji Araki, and Haruo Yokota. 2019. Analyzing sequence pattern variants in sequential pattern mining and its application to electronic medical record systems. In Proceedings of the 30th International Conference on Database and Expert Systems Applications. Springer, 393–408.
[39]
Chaofeng Li and Yansheng Lu. 2007. Similarity measurement of web sessions by sequence alignment. In Proceedings of the 4th International Conference on Network and Parallel Computing Workshops. IEEE, 716–720.
[40]
Jerry Chun-Wei Lin, Youcef Djenouri, Gautam Srivastava, Yuanfa Li, and Philip S. Yu. 2021. Scalable mining of high-utility sequential patterns with three-tier mapreduce model. ACM Transactions on Knowledge Discovery from Data 16, 3 (2021), 1–26.
[41]
Zachary C. Lipton, David C. Kale, and Randall C. Wetzel. 2015. Phenotyping of clinical time series with LSTM recurrent neural networks. arXiv:1510.07641. Retrieved from https://arxiv.org/abs/1510.07641.
[42]
Shuchuan Lo. 2005. Binary prediction based on weighted sequential mining method. In Proceedings of the 4th International Conference on Web Intelligence. IEEE, 755–761.
[43]
Izet Masic, Milan Miokovic, and Belma Muhamedagic. 2008. Evidence based medicine–new approaches and challenges. Acta Informatica Medica 16, 4 (2008), 219.
[44]
Ministry of Health, Labour and Welfare (Japan). 2020. Facilitation in Informatization in Healthcare Field (in Japanese). Retrieved September, 2021 from http://www.mhlw.go.jp/stf/seisakunitsuite/bunya/kenkou_iryou/iryou/johoka.
[45]
Ministry of Health, Labour and Welfare (Japan). 2021. Various Information of Medical Fee (in Japanese). Retrieved September, 2021 from http://www.iryohoken.go.jp/shinryohoshu/kaitei/.
[46]
World Health Organization. 2019. International Classification of Diseases. Retrieved September, 2021 from https://www.who.int/standards/classifications/classification-of-diseases.
[47]
François Petitjean, Tao Li, Nikolaj Tatti, and Geoffrey I. Webb. 2016. Skopus: Mining top-k sequential patterns under leverage. Data Mining and Knowledge Discovery 30, 5 (2016), 1086–1111.
[48]
Thi-Thiet Pham, Tung Do, Anh Nguyen, Bay Vo, and Tzung-Pei Hong. 2020. An efficient method for mining top-k closed sequential patterns. IEEE Access 8 (2020), 118156–118163.
[49]
Helen Pinto, Jiawei Han, Jian Pei, Ke Wang, Qiming Chen, and Umeshwar Dayal. 2001. Multi-dimensional sequential pattern mining. In Proceedings the 10th International Conference on Information and Knowledge Management. 81–88.
[50]
Aaditya Prakash, Siyuan Zhao, Sadid A. Hasan, Vivek Datla, Kathy Lee, Ashequl Qadir, Joey Liu, and Oladimeji Farri. 2017. Condensed memory networks for clinical diagnostic inferencing. In Proceedings of the 31st AAAI Conference on Artificial Intelligence. 3274–3280.
[51]
Raju V. Purushothama and Varma G. P. Saradhi. 2015. Mining closed sequential patterns in large sequence databases. International Journal of Database Management Systems 7, 1 (2015), 29–39.
[52]
Agrawal Rakesh and Srikant Ramakrishnan. 1994. Fast algorithms for mining association rules in large databases. In Proceedings of the 20th International Conference on Very Large Data Bases (VLDB’94). Morgan Kaufmann Publishers Inc., San Francisco, CA, 487–499. Retrieved from https://dl.acm.org/citation.cfm?id=645920.672836.
[53]
Agrawal Rakesh and Srikant Ramakrishnan. 1995. Mining sequential patterns. In Proceedings of the 11th International Conference on Data Engineering. IEEE, 3–14.
[54]
Marcella Rovani, Fabrizio M. Maggi, Massimiliano De Leoni, and Wil M. P. Van Der Aalst. 2015. Declarative process mining in healthcare. Expert Systems with Applications 42, 23 (2015), 9236–9251.
[55]
David L. Sackett. 1997. Evidence-based medicine. In Proceedings of the Seminars in Perinatology, Vol. 21. Elsevier, 3–5.
[56]
Hassan Saneifar, Sandra Bringay, Anne Laurent, and Maguelonne Teisseire. 2008. S2MP: Similarity measure for sequential patterns. In Proceedings of the 7th Australasian Data Mining Conference. ACS, 095–104.
[57]
Gautam Srivastava, Jerry Chun-Wei Lin, Xuyun Zhang, and Yuanfa Li. 2020. Large-scale high-utility sequential pattern analytics in internet of things. IEEE Internet of Things Journal 8, 16 (2020), 12669–12678.
[58]
Qiuling Suo, Fenglong Ma, Giovanni Canino, Jing Gao, Aidong Zhang, Pierangelo Veltri, and Gnasso Agostino. 2017. A multi-task framework for monitoring health conditions via attention-based recurrent neural networks. In Proceedings of the AMIA Annual Symposium, Vol. 2017. American Medical Informatics Association, 1665–1674.
[59]
Core Create System. 2017. Denshi Karte System WATATUMI (Electronic Medical Records WATATUMI). Retrieved September 2021 from http://www.corecreate.com/02_01_izanami.html.
[60]
Erico Tjoa and Cuntai Guan. 2020. A survey on explainable artificial intelligence (XAI): Toward medical XAI. IEEE Transactions on Neural Networks and Learning Systems 32, 11 (2020), 4793–4813.
[61]
Chieh-Yuan Tsai, James J. H. Liou, Chih-Jung Chen, and Ching-Chuan Hsiao. 2012. Generating touring path suggestions using time-interval sequential pattern mining. Expert Systems with Applications 39, 3 (2012), 3593–3602.
[62]
Shusaku Tsumoto, Shoji Hirano, and Tomohiro Kimura. 2021. Mining clinical pathways using dual clustering. The Review of Socionetwork Strategies (2021), 1–21.
[63]
Petre Tzvetkov, Xifeng Yan, and Jiawei Han. 2005. TSP: Mining top-k closed sequential patterns. Knowledge and Information Systems 7, 4 (2005), 438–457.
[64]
Clinical Research Support Center University of Miyazaki Hospital. 2016. Clinical Research that Must be Published (Public Disclosure) (in Japanese). Retrieved August 2022 from http://www.med.miyazaki-u.ac.jp/home/crsc/patient/notice/.
[65]
Keishiro Uragaki, Tomoyuki Hosaka, Yoshitaka Arahori, Muneo Kushima, Tomoyoshi Yamazaki, Kenji Araki, and Haruo Yokota. 2016. Sequential pattern mining on electronic medical records with handling time intervals and the efficacy of medicines. In Proceedings of the 2016 IEEE Symposium on Computers and Communication. IEEE, 20–25.
[66]
Shunji Wakamiya and Kazunobu Yamauchi. 2009. What are the standard functions of electronic clinical pathways? International Journal of Medical Informatics 78, 8 (2009), 543–550.
[67]
Jianyong Wang, Jiawei Han, and Chun Li. 2007. Frequent closed sequence mining without candidate maintenance. IEEE Transactions on Knowledge and Data Engineering 19, 8 (2007), 1042–1056.
[68]
Tomasz Wiktorski, Aleksandra Królak, Karolina Rosińska, Pawel Strumillo, and Jerry Chun-Wei Lin. 2020. Visualization of generic utility of sequential patterns. IEEE Access 8 (2020), 78004–78014.
[69]
Aileen P. Wright, Adam T. Wright, Allison B. McCoy, and Dean F. Sittig. 2015. The use of sequential pattern mining to predict next prescribed medications. Journal of Biomedical Informatics 53 (2015), 73–80.
[70]
Youxi Wu, Yao Tong, Xingquan Zhu, and Xindong Wu. 2017. NOSEP: Nonoverlapping sequence pattern mining with gap constraints. IEEE Transactions on Cybernetics 48, 10 (2017), 2809–2822.
[71]
Youxi Wu, Changrui Zhu, Yan Li, Lei Guo, and Xindong Wu. 2020. NetNCSP: Nonoverlapping closed sequential pattern mining. Knowledge-based Systems 196 (2020), 105812.
[72]
Yan Xifeng, Han Jiawei, and Afshar Ramin. 2003. CloSpan: Mining closed sequential patterns in large datasets. In Proceedings of the International Conference on Data Mining. SIAM, 166–177.
[73]
Guang Yang, Qinghao Ye, and Jun Xia. 2022. Unbox the black-box for the medical explainable AI via multi-modal and multi-centre data fusion: A mini-review, two showcases and beyond. Information Fusion 77 (2022), 29–52.
[74]
Ghim-Eng Yap, Xiao-Li Li, and Philip S. Yu. 2012. Effective next-items recommendation via personalized sequential pattern mining. In Proceedings of the 17th Database Systems for Advanced Applications. 48–64.
[75]
Hideo Yasunaga. 2019. Real world data in japan: Chapter II the diagnosis procedure combination database. Annals of Clinical Epidemiology 1, 3 (2019), 76–79.
[76]
Chen Yen-Liang, Chiang Mei-Ching, and Ko Ming-Tat. 2003. Discovering time-interval sequential patterns in sequence databases. Expert Systems with Applications 25, 3 (2003), 343–354.
[77]
Unil Yun. 2008. A new framework for detecting weighted sequential patterns in large sequence databases. Knowledge-Based Systems 21, 2 (2008), 110–122.
[78]
Mohammed J. Zaki. 2000. Sequence mining in categorical domains: Incorporating constraints. In Proceedings of the 9th International Conference on Information and Knowledge Management. 422–429.
[79]
Mohammed J. Zaki. 2001. SPADE: An efficient algorithm for mining frequent sequences. Machine Learning 42, 1 (2001), 31–60.

Cited By

View all
  • (2024)A Clustering-based Sequence Variants Analysis Method for Electronic Medical Records of Multimedical Institutions2024 IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR)10.1109/MIPR62202.2024.00113(653-659)Online publication date: 7-Aug-2024
  • (2024)Sequence-Walking Decision Tree for Multivariate Healthcare Data2024 IEEE 12th International Conference on Healthcare Informatics (ICHI)10.1109/ICHI61247.2024.00011(21-30)Online publication date: 3-Jun-2024
  • (2023)Analysis of Transitions in Differences between Frequent Medical-order Sequences for COVID-192023 IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS)10.1109/CBMS58004.2023.00297(666-671)Online publication date: Jun-2023

Index Terms

  1. Methods for Analyzing Medical-Order Sequence Variants in Sequential Pattern Mining for Electronic Medical Record Systems

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Transactions on Computing for Healthcare
        ACM Transactions on Computing for Healthcare  Volume 4, Issue 1
        January 2023
        217 pages
        EISSN:2637-8051
        DOI:10.1145/3582897
        Issue’s Table of Contents

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 30 March 2023
        Online AM: 12 September 2022
        Accepted: 24 August 2022
        Revised: 16 June 2022
        Received: 19 October 2021
        Published in HEALTH Volume 4, Issue 1

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Sequential pattern mining
        2. sequence pattern variant
        3. electronic medical record system

        Qualifiers

        • Research-article

        Funding Sources

        • Health Labour Sciences Research Grant (Ministry of Health, Labour and Welfare, Japan)
        • Grants-in-Aid for Scientific Research
        • Grants-in-Aid for Early-Career Scientists
        • Japan Society for the Promotion of Science

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)159
        • Downloads (Last 6 weeks)30
        Reflects downloads up to 18 Nov 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)A Clustering-based Sequence Variants Analysis Method for Electronic Medical Records of Multimedical Institutions2024 IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR)10.1109/MIPR62202.2024.00113(653-659)Online publication date: 7-Aug-2024
        • (2024)Sequence-Walking Decision Tree for Multivariate Healthcare Data2024 IEEE 12th International Conference on Healthcare Informatics (ICHI)10.1109/ICHI61247.2024.00011(21-30)Online publication date: 3-Jun-2024
        • (2023)Analysis of Transitions in Differences between Frequent Medical-order Sequences for COVID-192023 IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS)10.1109/CBMS58004.2023.00297(666-671)Online publication date: Jun-2023

        View Options

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Full Text

        View this article in Full Text.

        Full Text

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media