Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3388440.3414710acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

Using Curriculum Learning in Pattern Recognition of 3-dimensional Cryo-electron Microscopy Density Maps

Published: 24 November 2020 Publication History

Abstract

Although Cryo-electron microscopy (cryo-EM) has been successfully used to derive atomic structures for many proteins, it is still challenging to derive atomic structure when the resolution of cryo-EM density maps is in the medium range, e.g., 5-10 Å. Studies have attempted to utilize machine learning methods, especially deep neural networks to build predictive models for the detection of protein secondary structures from cryo-EM images, which ultimately helps to derive the atomic structure of proteins. However, the large variation in data quality makes it challenging to train a deep neural network with high prediction accuracy. Curriculum learning has been shown as an effective learning paradigm in machine learning. In this paper, we present a study using curriculum learning as a more effective way to utilize cryo-EM density maps with varying quality. We investigated three distinct training curricula that differ in whether/how images used for training in past are reused while the network was continually trained using new images. A total of 1,382 3-dimensional cryo-EM images were extracted from density maps of Electron Microscopy Data Bank in our study. Our results indicate learning with curriculum significantly improves the performance of the final trained network when the forgetting problem is properly addressed.

References

[1]
Jiang, W., Baker, M. L., Ludtke, S. J. and Chiu, W. Bridging the information gap: computational tools for intermediate resolution structure interpretation. Journal of molecular biology, 308, 5 (May 18 2001), 1033--1044.
[2]
Kong, Y. and Ma, J. A structural-informatics approach for mining beta-sheets: locating sheets in intermediate-resolution density maps. Journal of molecular biology, 332, 2 (Sep 12 2003), 399--413.
[3]
Dal Palu, A., He, J., Pontelli, E. and Lu, Y. Identification of Alpha-Helices from Low Resolution Protein Density Maps. Proceeding of Computational Systems Bioinformatics Conference(CSB) (2006), 89--98.
[4]
Baker, M. L., Ju, T. and Chiu, W. Identification of secondary structure elements in intermediate-resolution density maps. Structure, 15, 1 (Jan 2007), 7--19.
[5]
Si, D., Ji, S., Nasr, K. A. and He, J. A Machine Learning Approach for the Identification of Protein Secondary Structure Elements from Electron Cryo-Microscopy Density Maps. Biopolymers, 97, 9 (2012), 698--708.
[6]
Rusu, M. and Wriggers, W. Evolutionary bidirectional expansion for the tracing of alpha helices in cryo-electron microscopy reconstructions. Journal of structural biology, 177, 2 (Feb 2012), 410--419.
[7]
Si, D. and He, J. Beta-sheet Detection and Representation from Medium Resolution Cryo-EM Density Maps. Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics (2013), 764--770.
[8]
Si, D. and He, J. Tracing Beta Strands Using StrandTwister from Cryo-EM Density Maps at Medium Resolutions. Structure, 22, 11 (2014/11/04/ 2014), 1665--1676.
[9]
Nasr, K. A., Chen, L., Si, D., Ranjan, D., Zubair, M. and He, J. Building the initial chain of the proteins through de novo modeling of the cryo-electron microscopy volume data at the medium resolutions. In Proceedings of the Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine (2012), 490--497.
[10]
Al-Nasr, K., Ranjan, D., Zubair, M. and He, J. Ranking Valid Topologies of the Secondary Structure Elements Using a Constraint Graph. J. Bioinformatics and Computational Biology, 9, 3 (2011), 415--430.
[11]
Nasr, K. A., Ranjan, D., Zubair, M., Chen, L. and He, J. Solving the Secondary Structure Matching Problem in Cryo-EM De Novo Modeling Using a Constrained $K$-Shortest Path Graph Algorithm. IEEE/ACM transactions on computational biology and bioinformatics, 11, 2 (2014), 419--430.
[12]
Li, R., Si, D., Zeng, T., Ji, S. and He, J. Deep convolutional neural networks for detecting secondary structures in protein density maps from cryo-electron microscopy. 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) (15-18 Dec. 2016 2016), 41--46.
[13]
Haslam, D., Zeng, T., Li, R. and He, J. Exploratory Studies Detecting Secondary Structures in Medium Resolution 3D Cryo-EM Images Using Deep Convolutional Neural Networks. In Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics (2018), 628--632.
[14]
Maddhuri, S. V. S., Terashi, G. and Kihara, D. Protein secondary structure detection in intermediate-resolution cryo-EM maps using deep learning. Nature methods, 16, 9 (2019), 911--917.
[15]
Wriggers, W. and He, J. Numerical geometry of map and model assessment. Journal of structural biology, 192, 2 (2015/11/01/ 2015), 255--261.
[16]
Sazzed, S., Scheible, P., Alshammari, M., Wriggers, W. and He, J. Cylindrical Similarity Measurement for Helices in Medium-Resolution Cryo-Electron Microscopy Density Maps. Journal of Chemical Information and Modeling, 60, 5 (2020/05/26 2020), 2644--2650.
[17]
Ronneberger, O., Fischer, P. and Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. CoRR, abs/1505.04597 (2015).
[18]
Avrahami, J., Kareev, Y., Bogot, Y., Caspi, R., Dunaevsky, S. and Lerner, S. Teaching by examples: Implications for the process of category acquisition. The Quarterly Journal of Experimental Psychology Section A, 50, 3 (1997), 586--606.
[19]
Jesson, A., Guizard, N., Ghalehjegh, S. H., Goblot, D., Soudan, F. and Chapados, N. CASED: curriculum adaptive sampling for extreme data imbalance. In International Conference on Medical Image Computing and Computer-Assisted Intervention (2017), 639--646.
[20]
Amodei, D., Ananthanarayanan, S., Anubhai, R., Bai, J., Battenberg, E., Case, C., Casper, J., Catanzaro, B., Cheng, Q. and Chen, G. Deep speech 2: End-to-end speech recognition in english and mandarin. International conference on machine learning (2016), 173--182.
[21]
Zaremba, W. and Sutskever, I. Learning to execute. arXiv preprint arXiv:1410.4615 (2014).
[22]
Hacohen, G. and Weinshall, D. On the power of curriculum learning in training deep networks. arXiv preprint arXiv:1904.03626 (2019).
[23]
Lopez-Paz, D. and Ranzato, M. A. Gradient episodic memory for continual learning. Advances in neural information processing systems (2017), 6467--6476.
[24]
Sazzed, S., Scheible, P., Alshammari, M., Wriggers, W. and He, J. Cylindrical Similarity Measurement for Helices in Medium-Resolution Cryo-Electron Microscopy Density Maps. Journal of Chemical Information and Modeling (2020).
[25]
Serrà, J., Suris, D., Miron, M. and Karatzoglou, A. Overcoming catastrophic forgetting with hard attention to the task. arXiv preprint arXiv:1801.01423 (2018).
[26]
Kingma, D. P. and Ba, J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).

Cited By

View all
  • (2024)Determining Protein Secondary Structures in Heterogeneous Medium-Resolution Cryo-EM Images Using CryoSSESegACS Omega10.1021/acsomega.4c026089:24(26409-26416)Online publication date: 8-Jun-2024
  • (2023)An Approach to Developing Benchmark Datasets for Protein Secondary Structure Segmentation from Cryo-EM Density MapsProceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics10.1145/3584371.3612947(1-8)Online publication date: 3-Sep-2023

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
BCB '20: Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics
September 2020
193 pages
ISBN:9781450379649
DOI:10.1145/3388440
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 November 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Deep learning
  2. cryo-electron microscopy
  3. curriculum
  4. image
  5. pattern recognition
  6. protein structure
  7. secondary structure

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

BCB '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 885 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)11
  • Downloads (Last 6 weeks)3
Reflects downloads up to 30 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Determining Protein Secondary Structures in Heterogeneous Medium-Resolution Cryo-EM Images Using CryoSSESegACS Omega10.1021/acsomega.4c026089:24(26409-26416)Online publication date: 8-Jun-2024
  • (2023)An Approach to Developing Benchmark Datasets for Protein Secondary Structure Segmentation from Cryo-EM Density MapsProceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics10.1145/3584371.3612947(1-8)Online publication date: 3-Sep-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media