Abstract
An evolutionary algorithm (EA) usually initializes its population with random genotypes, which represent random solutions to the target problem instance. If the problem is one of constrained optimization, an initial population whose genotypes all represent empty solutions might allow the EA to grow valid solutions as much as search for them and thereby identify good solutions more quickly. This is the case in a genetic algorithm (GA) for the longest common subsequence problem, which seeks the length of a longest subsequence common to each of a set of given strings. The GA encodes sequences as binary strings that indicate subsequences of the shortest or first given string. In tests on a variety of problem instances, the GA always identifies an optimum subsequence, but on most instances, the GA reaches an optimum more quickly when its initial population encodes empty sequences than when its initial genotypes represent random sequences.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Hinkemeyer, B., Julstrom, B.A.: A genetic algorithm for the longest common subsequence problem. In: Proceedings of the 2006 Genetic and Evolutionary Computation Conference, The ACM Press, New York (2006)
Maier, D.: The complexity of some problems on subsequences and supersequences. Journal of the ACM 25, 322–336 (1978)
Irving, R., Fraser, C.: Two algorithms for the longest common subsequence of three (or more) strings. In: Proceedings of the 3rd Annual Symposium on Combinatorial Pattern Matching, pp. 214–229. Springer, New York (1992)
Fogel, G., Chellapilla, K., Fogel, D.: Identification of coding regions in DNA sequences using evolved neural networks. In: Evolutionary Computation in Bioinformatics, Morgan Kaufmann, San Francisco (2003)
Forrest, S., Javornik, B., Smith, R., Perelson, A.: Using genetic algorithms to explore pattern recognition in the immune system. Evolutionary Computation 1, 191–211 (1993)
Pei, M., Goodman, E.D., Punch, W.F.: Pattern discovery from data using genetic algorithms. In: Proceedings of 1st Pacific-Asia Conference on Knowledge Discovery & Data Mining (PAKDD 1997) (1997)
Shankar, P.: V Diff - A file comparer with visual output (2003) (Retrieved October 25, 2004), On-line at: http://www.codeproject.com/cpp/vdiff.asp?df=100&forumid=14638&exp=0&select=825191
Clough, P.: Plagiarism in natural and programming languages: an overview of current tools and technologies (2000) (Retrieved October 13, 2004), On-line at: http://www.dcs.shef.ac.uk/~cloughie/papers/Plagiarism.pdf
Grant, T., Baker, K.: Identifying reliable, valid markers of authorship: A response to Chaski. Forensic Linguistics 8, 66–79 (2001)
Lovis, C., Baud, R.H.: Fast exact string pattern-matching algorithms adapted to the characteristics of the medical language. Journal of the American Medical Informatics Association 7, 378–391 (2000)
Losee, R.: Learning syntactic rules and tags with genetic algorithms. Information Processing & Management 32, 185–197 (1996)
Louis, S.: Learning to improve genetic algorithm performance on job shop scheduling (2002) (Retrieved October 21, 2004), On-line at: http://www.cs.unr.edu/~sushil/papers/conference/newpapers/2002/gecco2002/cigarScheduling/cigarScheduling/cigarScheduling.html
Navarro, G., Raffinot, M.: Flexible Pattern Matching in Strings. Cambridge University Press, New York (2002)
Bergroth, L., Hakonen, H., Raita, T.: A survey of longest common subsequence algorithms. In: Proceedings of the Seventh International Symposium on String Processing and Information Retrieval (2000)
Koza, J.R.: Genetic Programming: On the programming of computers by means of natural selection. The MIT Press, Cambridge (1992)
Banzhaf, W., Nordin, P., Keller, R.E., Francone, F.D.: Genetic programming: An introduction. Morgan Kaufmann, San Francisco (1998)
Koza, J.R., Bennett, F.H., Andre, D., Keane, M.A.: Evolution using genetic programming of a low-distortion 96 decibel operational amplifier. In: Bryant, B., Carroll, J., Oppenheim, D., Hightower, J., George, K.M. (eds.) Applied Computing 1997. Proceedings of the 1997 ACM Symposium on Applied Computing, pp. 207–216. ACM Press, New York (1997)
Koza, J.R., Bennett, F.H., Andre, D., Keane, M.A., Dunlap, F.: Automated systhesis of analog electrical circuits by means of genetic programming. IEEE Transactions on Evolutionary Computation 1, 109–128 (1997)
Koza, J.R., Bennett, F.H., Andre, D., Keane, M.A.: Genetic Programming III: Darwinian Invention and Problem Solving. Morgan Kaufmann, San Francisco (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Julstrom, B.A., Hinkemeyer, B. (2006). Starting from Scratch: Growing Longest Common Subsequences with Evolution. In: Runarsson, T.P., Beyer, HG., Burke, E., Merelo-Guervós, J.J., Whitley, L.D., Yao, X. (eds) Parallel Problem Solving from Nature - PPSN IX. PPSN 2006. Lecture Notes in Computer Science, vol 4193. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11844297_94
Download citation
DOI: https://doi.org/10.1007/11844297_94
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-38990-3
Online ISBN: 978-3-540-38991-0
eBook Packages: Computer ScienceComputer Science (R0)