DOI: 10.1145/2110363.2110417
research-article

A software tool for large-scale sharing and querying of clinical documents modeled using HL7 version 3 standard

Published: 28 January 2012

Abstract

We present a novel software tool called CDN (Collaborative Data Network) for large-scale sharing and querying of clinical documents modeled using the HL7 v3 standard (e.g., Clinical Document Architecture (CDA) and Continuity of Care Document (CCD)). Similar to the caBIG initiative, CDN aims to foster innovations in cancer treatment and diagnosis through large-scale sharing of clinical data. We focus on cancer because it is the second leading cause of death in the US. CDN is based on the synergistic combination of peer-to-peer technology with XML (Extensible Markup Language) and XQuery. Using CDN, a user can pose both structured queries and keyword queries on the HL7 v3 documents hosted by data providers. CDN is unique in its design: it supports location-oblivious queries in a large-scale network, wherein a user does not explicitly provide the location of the data for a query. Instead, a location service in CDN discovers data of interest in the network at query time. CDN uses standard cryptographic techniques to provide security to data providers and protect the privacy of patients. Using CDN, a user can pose clinical queries pertaining to cancer that contain aggregations and joins across data hosted by multiple data providers. CDN is implemented with open-source software for web application development and XML query processing. We report the evaluation of CDN in a distributed environment (LAN) using a real dataset of discharge summaries available from the i2b2 project.
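The idea of a location-oblivious query can be illustrated with a minimal sketch. It is not CDN's implementation: the abstract describes a peer-to-peer network and XQuery, whereas this toy uses a single in-memory dictionary as the location service and ElementTree path expressions as a stand-in for XQuery. The names `Provider`, `LocationService`, `count_matching`, and `make_doc` are hypothetical, and the documents are heavily simplified; only the `urn:hl7-org:v3` namespace is taken from HL7 v3 itself.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

NS = {"hl7": "urn:hl7-org:v3"}  # XML namespace of HL7 v3 documents


def make_doc(patient_id, title, text):
    """Build a tiny CDA-like document (hypothetical, heavily simplified)."""
    return (
        '<ClinicalDocument xmlns="urn:hl7-org:v3">'
        f'<id extension="{patient_id}"/>'
        f"<title>{title}</title>"
        f"<component><section><text>{text}</text></section></component>"
        "</ClinicalDocument>"
    )


class Provider:
    """A data provider hosting its own documents (assumed interface)."""

    def __init__(self, name, docs):
        self.name = name
        self.docs = [ET.fromstring(d) for d in docs]

    def keywords(self):
        """All lower-cased words that occur in the hosted documents."""
        words = set()
        for doc in self.docs:
            for chunk in doc.itertext():
                words.update(w.lower() for w in chunk.split())
        return words

    def structured_query(self, path, keyword):
        """Documents in which an element at `path` contains `keyword`."""
        hits = []
        for doc in self.docs:
            for el in doc.findall(path, NS):
                if keyword.lower() in (el.text or "").lower().split():
                    hits.append(doc)
                    break
        return hits


class LocationService:
    """Toy stand-in for a location service: keyword -> providers."""

    def __init__(self):
        self.index = defaultdict(set)

    def publish(self, provider):
        for word in provider.keywords():
            self.index[word].add(provider)

    def locate(self, keyword):
        return self.index.get(keyword.lower(), set())


def count_matching(service, path, keyword):
    """The caller names no provider; the service finds the relevant ones,
    and per-provider counts are returned for aggregation."""
    return {
        p.name: len(p.structured_query(path, keyword))
        for p in service.locate(keyword)
    }


service = LocationService()
for provider in (
    Provider("hospital_a", [make_doc("p1", "Discharge Summary", "lung carcinoma stage II"),
                            make_doc("p2", "Discharge Summary", "fracture of left wrist")]),
    Provider("hospital_b", [make_doc("p3", "Discharge Summary", "breast carcinoma"),
                            make_doc("p4", "Discharge Summary", "lung carcinoma")]),
    Provider("clinic_c", [make_doc("p5", "Discharge Summary", "seasonal influenza")]),
):
    service.publish(provider)

result = count_matching(service, ".//hl7:section/hl7:text", "carcinoma")
```

Here `result` maps each provider that hosts matching data to its local count, so summing the values gives an aggregate across providers. The provider with no matching documents is never contacted, which is the point of resolving locations at query time.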


Cited By

  • (2018) Evolution of Health Level-7. Proceedings of the 2018 International Conference on Software Engineering and Information Management, pages 118-123. DOI: 10.1145/3178461.3178480. Online publication date: 4-Jan-2018.
  • (2014) A gossip-based approach for Internet-scale cardinality estimation of XPath queries over distributed semistructured data. The VLDB Journal, 23(1):51-76. DOI: 10.1007/s00778-013-0314-1. Online publication date: 1-Feb-2014.
  • (2013) A new tool for sharing and querying of clinical documents modeled using HL7 Version 3 standard. Computer Methods and Programs in Biomedicine, 112(3):529-552. DOI: 10.1016/j.cmpb.2013.07.002. Online publication date: 1-Dec-2013.

Published In

IHI '12: Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
January 2012
914 pages
ISBN: 9781450307819
DOI: 10.1145/2110363

Publisher

Association for Computing Machinery, New York, NY, United States

Author Tags

1. hl7 v3
2. large-scale
3. peer-to-peer
4. xquery processing

Qualifiers

• Research-article

Conference

IHI '12: ACM International Health Informatics Symposium
January 28 - 30, 2012
Miami, Florida, USA
