Grid-based framework for high-performance processing of scientific knowledge

Chang-Hoo Jeong¹,
Yun-Soo Choi¹,
Hong-Woo Chun¹,
Sa-Kwang Song¹,
Hanmin Jung¹,
Sangkwan Lee² &
Sung-Pil Choi¹

Abstract

A central challenge in the knowledge-based information society is how to extract useful information quickly from a large volume of literature. Because most existing data mining frameworks deal with structured input data, they face many limitations when analyzing unstructured scientific literature and extracting new information from it. This study proposes a scientific-knowledge processing framework that uses grid computing technology to achieve high performance in extracting important entities and their relations from the scientific literature. Because grid computing provides large-volume data storage and high-speed computing, the proposed framework can efficiently analyze a massive body of scientific literature and process the resulting knowledge. The workflow tool developed for the framework lets users easily design and execute complex applications built from complicated scientific-knowledge processes. In the experiments, the proposed framework reduced working time by approximately 83% when the number of running nodes was assigned in accordance with the workload ratio of each step in the scientific-knowledge processes. Consequently, a large volume of scientific literature can be processed effectively by flexibly adjusting the number of computing nodes that constitute the grid environment as the number of documents to process increases.
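The idea of assigning running nodes according to the workload ratio of each step can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `allocate_nodes`, the step names, and the workload figures are all hypothetical, and the rounding scheme (one guaranteed node per step, then a proportional share of the remaining nodes with largest-remainder rounding) is an assumption chosen only to make the arithmetic concrete.

```python
from math import floor


def allocate_nodes(workloads: dict[str, float], total_nodes: int) -> dict[str, int]:
    """Split total_nodes across steps roughly in proportion to workload.

    Assumed scheme (not from the paper): every step first receives one node,
    the remaining nodes are shared in proportion to each step's workload, and
    fractional shares are resolved with largest-remainder rounding.
    """
    if total_nodes < len(workloads):
        raise ValueError("need at least one node per step")
    if any(w <= 0 for w in workloads.values()):
        raise ValueError("workloads must be positive")

    total_work = sum(workloads.values())
    spare = total_nodes - len(workloads)  # nodes left after one per step

    # Ideal (fractional) share of the spare nodes for each step.
    shares = {step: spare * w / total_work for step, w in workloads.items()}

    # Guaranteed node plus the whole part of the proportional share.
    alloc = {step: 1 + floor(share) for step, share in shares.items()}

    # Hand out any nodes still unassigned to the largest fractional remainders.
    leftover = total_nodes - sum(alloc.values())
    by_remainder = sorted(
        shares, key=lambda step: shares[step] - floor(shares[step]), reverse=True
    )
    for step in by_remainder[:leftover]:
        alloc[step] += 1
    return alloc


# Hypothetical two-step pipeline in which relation extraction is assumed
# to cost three times as much as entity extraction.
example = allocate_nodes(
    {"entity_extraction": 1.0, "relation_extraction": 3.0}, total_nodes=10
)
print(example)
```

With these made-up figures the heavier step receives most of the nodes, which is the balancing effect the abstract attributes to workload-ratio assignment. The reported 83% reduction comes from the authors' experiments and is not reproduced by this sketch.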

Author information

Authors and Affiliations

Software Research Center, Korea Institute of Science and Technology Information (KISTI), 335 Gwahangno, Yuseong-gu, Daejeon, South Korea
Chang-Hoo Jeong, Yun-Soo Choi, Hong-Woo Chun, Sa-Kwang Song, Hanmin Jung & Sung-Pil Choi
Catholic University of Pusan, 57 Oryundae-ro, Geumjeong-gu, Busan, Korea
Sangkwan Lee

Corresponding author

Correspondence to Sung-Pil Choi.

About this article

Cite this article

Jeong, CH., Choi, YS., Chun, HW. et al. Grid-based framework for high-performance processing of scientific knowledge. Multimed Tools Appl 71, 783–798 (2014). https://doi.org/10.1007/s11042-013-1411-2

Published: 20 March 2013
Issue Date: July 2014
DOI: https://doi.org/10.1007/s11042-013-1411-2
