Abstract
Nowadays, the exponential availability of biological sequences and the complexity of the computational methods that use them as input motivate the research of new compact representations. To this end, we propose an alternative method for storing sets of sequences based on set decision diagrams instead of classical compression techniques. The set decision diagrams are an extension of the reduced ordered binary decision diagrams, a graph data structure used as a symbolic compact representation of sets or relations between sets. Some experiments with genes of the mitochondrion DNA support the feasibility of our approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., Sayers, E.W.: GenBank. Nucleic. Acids Res. 51, D46–D51 (2010)
Bollig, B., Wegener, I.: Improving the variable ordering of OBDDs Is NP-Complete. IEEE Trans. Comput. 45, 993–1002 (1996)
Bryant, R.E.: Symbolic manipulation of boolean functions using a graphical representation. In: Proc. of the 22nd ACM/IEEE Design Automation Conference, pp. 688–694. ACM (1985)
Couvreur, J.-M., Encrenaz, E., Paviot-Adet, E., Poitrenaud, D., Wacrenier, P.-A.: Data Decision Diagrams for Petri Net Analysis. In: Esparza, J., Lakos, C.A. (eds.) ICATPN 2002. LNCS, vol. 2360, pp. 101–120. Springer, Heidelberg (2002)
Couvreur, J.-M., Thierry-Mieg, Y.: Hierarchical decision diagrams to exploit model structure. In: Wang, F. (ed.) FORTE 2005. LNCS, vol. 3731, pp. 443–457. Springer, Heidelberg (2005)
Denzumi, S., Arimura, H., Minato, S.: Implementation of sequence BDDs in Erlang. In: Proceedings of the 10th ACM SIGPLAN W. on Erlang, pp. 90–91. ACM (2011)
Denzumi, S., Yoshinaka, R., Arimura, H., Minato, S.: Notes on Sequence Binary Decision Diagrams: Relationship to Acyclic Automata and Complexities of Binary Set Operations. In: Holub, J., et al. (eds.) Proc. of the Prague Stringology Conference 2011, pp. 147–161, Prague Stringology Club (2011)
Giancarlo, R., Scaturro, D., Utro, F.: Textual data compression in computational biology: a synopsis. Bioinformatics 25, 1575–1586 (2009)
LIP6/Move: the libDDD environment (2011), http://ddd.lip6.fr
Loekito, E., Bailey, J., Pei, J.: A binary decision diagram based approach for mining frequent subsequences. Knowl. Inf. Syst. 24, 235–268 (2010)
Meinel, C., Theobald, T.: Algorithms and data structures in VLSI design: OBDD - Foundations and applications. Springer, Heidelberg (1998)
Miller, D.M.: Multiple-valued logic design tools. In: Proceedings of the 23rd Int. Symposium on Multiple-Valued Logic, pp. 2–11. IEEE (1993)
Nalbantoglu, Ö.U., Russell, D.J., Sayood, K.: Data compression concepts and algorithms and their applications to bioinformatics. Entropy 12, 34–52 (2009)
Polanski, A., Kimmel, M.: Bioinformatics. Springer, Heidelberg (2007)
Requeno, J.I.: Análisis filogenético mediante lógica temporal y model checking. Master Thesis in Computer Science and Engineering, University of Zaragoza (2010)
Requeno, J.I., Blanco, R., de Miguel Casado, G., Colom, J.M.: Phylogenetic Analysis Using an SMV Tool. In: Rocha, M.P., Rodríguez, J.M.C., Fdez-Riverola, F., Valencia, A. (eds.) 5th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2011). AISC, vol. 93, pp. 167–174. Springer, Heidelberg (2011)
Rudell, R.: Dynamic variable ordering for ordered binary decision diagrams. In: Proc. of the 1993 IEEE/ACM Int. Conf. on Computer-aided Design, pp. 42–47. IEEE (1993)
Strelets, V.B., Lim, H.A.: Compression of protein sequence databases. Comput. Appl. Biosci. 11, 557–561 (1995)
White, W.T., Hendy, M.: Compressing DNA sequence databases with coil. BMC Bioinformatics 9, 242–257 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Requeno, J.I., Colom, J.M. (2012). Compact Representation of Biological Sequences Using Set Decision Diagrams. In: Rocha, M., Luscombe, N., Fdez-Riverola, F., Rodríguez, J. (eds) 6th International Conference on Practical Applications of Computational Biology & Bioinformatics. Advances in Intelligent and Soft Computing, vol 154. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28839-5_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-28839-5_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28838-8
Online ISBN: 978-3-642-28839-5
eBook Packages: EngineeringEngineering (R0)