Abstract
The Linux operating system embodies a development history of 15 years and community effort of hundreds of voluntary developers. We examine the structure and evolution of the Linux kernel by considering the source code of the kernel as ordinary text without any regard to its semantics. After selecting three functionally central modules to study, we identified code segments using local alignments of source code from a reduced set of file comparisons. The further stages of the analyses take advantage of these identified alignments. We build module-specific visualizations, or descendant graphs, to visualize the overall code migration between versions and files. More detailed view can be achieved with chain graphs which show the time evolution of alignments between selected files. The methods used here may also prove useful in studying large collections of legacy code, whose original maintainers are not available.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Beck, M., Böhme, H., Dziadzka, M., Kunitz, U., Magnus, R., Schröter, C., Verworner, D.: Linux Kernel Programming, 3rd edn. Pearson Education Ltd., London (2002)
Bellman, R.: On the theory of dynamic programming. In: Proceedings of the National Academy of Sciences of the United States of America, vol. 38, pp. 716–719 (August 1952)
Carver, E.A., Stubbs, L.: Zooming in on the human-mouse comparative map: Genome conservation re-examined on a high-resolution scale. Genome Research 7(12), 1123–1137 (1997)
Godfrey, M.W., Tu, Q.: Evolution in open source software: A case study. In: 16th IEEE International Conference on Software Maintenance (ICSM 2000), vol. 0, pp. 131–142 (2000)
Hand, D.J., Mannila, H., Smyth, P.: Principles of Data Mining. MIT Press, Cambridge (2001)
Jaro, M.A.: Advances in record linking methodology as applied to the 1985 census of tampa florida. Journal of the American Statistical Society 64, 1183–1210 (1989)
Needleman, S.B., Wunsch, C.D.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology 48, 443–453 (1970)
Rusling, D.A.: The Linux kernel. Last visited on (March 25, 2006), Published in Web at: http://www.tldp.org/LDP/tlk/tlk.html
Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. Journal of Molecular Biology 147, 195–197 (1981)
Tahvildari, L., Gregory, R., Kontogianni, K.: An approach for measuring software evolution using source code features. In: Sixth Asia-Pacific Software Engineering Conference (APSEC 1999), vol. 00, pp. 10–17 (1999)
Torvalds, L.: Linux: A portable operating system. Master’s thesis, University of Helsinki (1997)
Viégas, F.B., Wattenberg, M., Dave, K.: Studying cooperation and conflict between authors with history flow visualizations. In: Conference on Human Factors in Computing Systems (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rasinen, A., Hollmén, J., Mannila, H. (2006). Analysis of Linux Evolution Using Aligned Source Code Segments. In: Todorovski, L., Lavrač, N., Jantke, K.P. (eds) Discovery Science. DS 2006. Lecture Notes in Computer Science(), vol 4265. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11893318_22
Download citation
DOI: https://doi.org/10.1007/11893318_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-46491-4
Online ISBN: 978-3-540-46493-8
eBook Packages: Computer ScienceComputer Science (R0)