Abstract
This paper describes a method for creating structure from heterogeneous sources, as part of an information database, or more specifically, a ‘concept base’. Structures called ‘concept trees’ can grow from the semi-structured sources when consistent sequences of concepts are presented. They might be considered to be dynamic databases, possibly a variation on the distributed Agent-Based or Cellular Automata models, or even related to Markov models. Semantic comparison of text is required, but the trees can be built more, from automatic knowledge and statistical feedback. This reduced model might also be attractive for security or privacy reasons, as not all of the potential data gets saved. The construction process maintains the key requirement of generality, allowing it to be used as part of a generic framework. The nature of the method also means that some level of optimisation or normalisation of the information will occur. This gives comparisons with databases or knowledge-bases, but a database system would firstly model its environment or datasets and then populate the database with instance values. The concept base deals with a more uncertain environment and therefore cannot fully model it beforehand. The model itself therefore evolves over time. Similar to databases, it also needs a good indexing system, where the construction process provides memory and indexing structures. These allow for more complex concepts to be automatically created, stored and retrieved, possibly as part of a more cognitive model. There are also some arguments, or more abstract ideas, for merging physical-world laws into these automatic processes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
http://en.wikipedia.org/wiki/Entropy, plus_(information_theory), _(statistical_thermo-dynamics), or _(order_and_disorder), for example.
- 4.
Scholarpedia http://www.scholarpedia.org/article/Agent_based_modeling.
- 5.
I have to note my recent interest in WordNet, although, most of the new theory here was formulated before that, with WordNet then supporting it.
References
Al-Obasiat, Y. & Braun, R. (2007). A multi-agent flexible architecture for autonomic services and network management. In IEEE/ACS International Conference on Computer Systems and Applications, AICCSA’07 (pp. 132–138). ISBN 1-4244-1031-2.
Aslam, M. A., Shen, J., Auer, S. & Herrmann, M. (2007). An integration life cycle for semantic web services composition. In Proceedings of the 2007 11th International Conference on Computer Supported Cooperative Work in Design (pp. 490–495).
Atkinson, C., Bostan, P., Hummel, O. & Stoll, D. (2007). A practical approach to web service discovery and retrieval. In IEEE International Conference on Web Services (ICWS 2007).
Berners-Lee, T., Hendler, J. & Lassila, O. (2001, May). The semantic web: A new form of web content that is meaningful to computers will unleash a revolution of new possibilities. Scientific American.
Blumberg, R. & Atre, S. (2003, February). The problem with unstructured data. DM Review (pp. 42–46).
Bonabeau, E. (2001). Agent-based modeling: Methods and techniques for simulating human systems. Proceedings of the National Academy of Sciences, 99(3), 7280–7287.
Carr, L., Hall, W., Bechhofer, S. & Goble, C. (2001). Conceptual linking: Ontology-based open hypermedia. In WWW10 (pp. 334–342), Hong Kong.
Codd, E. F. (1970). A relational model of data for large shared data banks. Communications of the ACM, 13(6), 377–387.
Coutaz, J., Crowley, J. L., Dobson, S., & Garlan, D. (2005). Context is Key. Communications of the ACM, 48(3), 49–53.
Encheva, S. (2011). Lattices and patterns. In Proceedings of the 10th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases (AIKED’11) (pp. 156–161), Cambridge, UK.
Fellbaum, C. (Ed.). (1998) WordNet: An electronic lexical database. Cambridge, MA: MIT Press.
Fractal Foundation. (2014). http://fractalfoundation.org/. Accessed 25 February 14.
Gilbert, S., & Lynch, N. (2002). Brewer’s conjecture and the feasibility of consistent, available, partition-tolerant web services. ACM SIGACT News, 33(2), 51–59. doi:10.1145/564585.564601.
Goel, A. K. (2013). Biologically inspired design: A new program for computational sustainability. IEEE Intelligent Systems, 28(3), 80–84.
Greenfield, A. (2006). Everyware: The dawning age of ubiquitous computing (1st ed.). Berkeley, CA: New Riders Press. ISBN 0321384016.
Greer, K. (2008). Thinking networks—The large and small of it: Autonomic and reasoning processes for information networks. published with LuLu.com, 2008. ISBN 1440433275. Also available on Google books.
Greer, K. (2011). Symbolic neural networks for clustering higher-level concepts. NAUN International Journal of Computers, 3(5), 378–386 [extended version of the WSEAS/EUROPMENT International Conference on Computers and Computing (ICCC’11)].
Greer, K. (2013a). New ideas for brain modelling. Published on arXiv at http://arxiv.org/abs/1403.1080, also on Scribd.
Greer, K. (2013b). Turing: Then, now and still key. In: X-S. Yang (Ed.), Artificial intelligence, evolutionary computation and metaheuristics (AIECM)—Turing 2012. Studies in Computational Intelligence. Berlin: Springer.
Grolinger, K., Wilson, A. H., Tiwari, A., & Capretz, M. (2013). Data management in cloud environments: NoSQL and NewSQL data stores. Journal of Cloud Computing: Advances, Systems and Applications, 2(22), 1–24.
Gruber, T. (1993). A translation approach to portable ontology specifications. Knowledge Acquisition, 5, 199–220.
Hansmann, U. (2003). Pervasive Computing: The mobile word. Berlin: Springer. ISBN 3540002189.
Holland, J. (1995). Hidden Order: How adaptation builds complexity. Reading, MA: Perseus.
Ising, E. (1925). A contribution to the theory of ferromagnetism. Zeitschrift für Physik, 31(1), 253–258.
Jarke, M., Eherer, S., Gallersdorfer, R., Jeusfeld, M. A., & Staudt, M. (1995). ConceptBase—A deductive object base manager. Journal on Intelligent Information Systems, 4(2), 167–192.
Karin, M., Prasad, M. D., Atreyee, D., Ramanujam, H., Mukesh, M., Deepak, P., Reed, J. & Schumacher, S. (2012). Exploiting evidence from unstructured data to enhance master data management. In Proceedings of the VLDB Endowment The 38th International Conference on Very Large Data Bases (Vol. 5(12) pp. 1862–1873). Istanbul: Turkey.
Kauffman, S. A. (1993). The origins of order: Self-organization and selection in evolution. Oxford, UK: Oxford University Press.
Lovelock, J. & Epton, S. (1975). The quest for Gaia. New Scientist Magazine. Available on Google Books.
Macal, C. M. & North, M. J. (2006). Tutorial on agent-based modelling and simulation part 2: How to model with agents. In L. F. Perrone, F. P. Wieland, J. Liu, B. G. Lawson, D. M. Nicol, & R. M. Fujimoto. (Eds.), Proceedings of the 2006 Winter Simulation Conference.
Mandelbrot, B. B. (1983). The fractal geometry of nature. New York: Macmillan.
Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38(11), 39–41.
OASIS. (2014). http://www.oasis-open.org. Accessed 25 Jan 2014.
Quinlan, J. R. (1986). Induction of decision trees. Machine Learning, 1, 81–106.
Robinson, R. & Indulska, J. (2003). Superstring: A scalable service discovery protocol for the wide-area Pervasive environment. In The 11th IEEE International Conference on Networks, ICON2003 (pp. 699–704). ISSN 1531-2216, ISBN 0-7803-7788-5.
Shannon, C. E. (1948). A mathematical theory of communication (continued). The Bell System Technical Journal, 27(4), 623–656. ISSN 0005-8580.
Sibson, R. (1973). SLINK: An optimally efficient algorithm for the single-link cluster method. The Computer Journal (British Computer Society), 16(1), 30–34.
Towards the Semantic Web: Ontology-driven Knowledge Management. (2003). In John Davies, Dieter Fensel, Frank van Harmelen (Eds.), Wiley. ISBN 0470858079, 9780470858073
Waldrop, M. M. (1993). In L. Sternlieb (Ed.), Complexity: The emerging science at the edge of order and chaos.
Wolfram, S. (1983). Cellular Automata, Los Alamos science.
XPath. (2014). http://www.w3.org/TR/xpath/. Accessed 10 Mar 2014.
Zhang, Y., & Ji, Q. (2009). Efficient sensor selection for active information fusion. IEEE Transaction on Systems, Man, and Cybernetics—Part B: Cybernetics, 10(3), 719–728.
Zhao, J., Gao, Y., Liu, H., & Lu, R. (2007). Automatic construction of a lexical attribute knowledge base. In Z. Zhang & J. Siekmann (Eds.), Proceedings of Second International Conference, KSEM 2007, Melbourne, Australia (pp. 198–209). LNAI 4798 Berlin: Springer.
Disclosure
This paper is an updated version of a paper called ‘Concept Trees: Indexing and Memory from Semi-Structured Data’, originally published on DCS and Scribd, June 2012.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Greer, K. (2015). Concept Trees: Building Dynamic Concepts from Semi-structured Data Using Nature-Inspired Methods. In: Zhu, Q., Azar, A. (eds) Complex System Modelling and Control Through Intelligent Soft Computations. Studies in Fuzziness and Soft Computing, vol 319. Springer, Cham. https://doi.org/10.1007/978-3-319-12883-2_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-12883-2_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12882-5
Online ISBN: 978-3-319-12883-2
eBook Packages: EngineeringEngineering (R0)