Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3673038.3673129acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article
Open access

SuperCSR: A Space-Time-Efficient CSR Representation for Large-scale Graph Applications on Supercomputers

Published: 12 August 2024 Publication History

Abstract

It is widely accepted that graph representations such as the Compressed Sparse Row (CSR) format, directly affect the space and time complexities of graph processing. However, the standard CSR and its current variations are prone to high memory footprint and complicated calculations, which necessitates the development of more efficient graph processing techniques to save memory and reduce calculations. This paper presents SuperCSR, a more space-time-efficient CSR representation for fast graph processing. SuperCSR’s key idea is to leverage the law of sorted graphs, which would directly access adjacent vertex sets from an active vertex ID without complex indexing calculations and with a lower memory footprint.
We validate SuperCSR through theoretical analysis and experimental evaluation. Theoretically, it is proved that SuperCSR would reduce the space of the auxiliary array to O(1), compared to the best previously claimed O(Nnz), where Nnz is the number of vertices with degrees > 0. Moreover, SuperCSR requires the fewest calculations among all CSR-like methods. Practically, extensive evaluations validate that SuperCSR achieves up to 98.61% in space savings and SuperCSR-based graph traversal exhibits the highest performance in graph processing among CSR-like formats. Finally, SuperCSR has been deployed on a 79,024-node supercomputer to rank Graph500. Based on SuperCSR, the next-generation Tianhe supercomputer surpasses the top 1 with 58% higher performance and 50% lower memory space.

References

[1]
2021. https://graph500.org/. (2021).
[2]
2022. Breadth-first search. https://en.wikipedia.org/wiki/Breadth-first_search.
[3]
Yasir Arfat, Rashid Mehmood, and Aiiad Albeshri. 2018. Parallel Shortest Path Graph Computations of United States Road Network Data on Apache Spark. Social Informatics and Telecommunications Engineering (2018), 323–336.
[4]
Albert-László Barabási. 2009. Scale-free networks: a decade and beyond. science 325, 5939 (2009), 412–413.
[5]
Scott Beamer, Krste Asanovic, and David Patterson. 2012. Direction-optimizing breadth-first search. In SC’12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis. IEEE, 1–10.
[6]
Urban Borštnik, Joost VandeVondele, Valéry Weber, and Jürg Hutter. 2014. Sparse matrix multiplication: The distributed block-compressed sparse row library. Parallel Comput. 40, 5-6 (2014), 47–58.
[7]
Aydin Buluç, Jeremy T Fineman, Matteo Frigo, John R Gilbert, and Charles E Leiserson. 2009. Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks. In Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures. 233–244.
[8]
Aydin Buluc and John R Gilbert. 2008. On the representation and multiplication of hypersparse matrices. In 2008 IEEE International Symposium on Parallel and Distributed Processing. IEEE, 1–11.
[9]
Fabio Checconi and Fabrizio Petrini. 2014. Traversing trillions of edges in real time: Graph exploration on large-scale parallel machines. In 2014 IEEE 28th International Parallel and Distributed Processing Symposium. IEEE, 425–434.
[10]
F. Chierichetti, R. Kumar, S. Lattanzi, M. Mitzenmacher, A. Panconesi, and P. Raghavan. 2009. On compressing social networks. SIGKDD (2009), 219–228.
[11]
Xinbiao Gan, Yiming Zhang, Ruibo Wang, Tiejun Li, Tiaojie Xiao, Ruigeng Zeng, Jie Liu, and Kai Lu. 2021. TianheGraph: Customizing Graph Search for Graph500 on Tianhe Supercomputer. IEEE Transactions on Parallel and Distributed Systems 33, 4 (2021), 941–951.
[12]
Xinbiao Gan, Yiming Zhang, Ruibo Wang, Tiejun Li, Tiaojie Xiao, Ruigeng Zeng, Jie Liu, and Kai Lu. 2021. TianheGraph: Customizing Graph Search for Graph500 on Tianhe Supercomputer. IEEE Transactions on Parallel and Distributed Systems (2021), 1–1. https://doi.org/10.1109/TPDS.2021.31007852
[13]
Joseph E Gonzalez, Yucheng Low, Haijie Gu, Danny Bickson, and Carlos Guestrin. 2012. { PowerGraph} : Distributed { Graph-Parallel} Computation on Natural Graphs. In 10th USENIX symposium on operating systems design and implementation (OSDI 12). 17–30.
[14]
Keita Iwabuchi, Hitoshi Sato, Yuichiro Yasui, Katsuki Fujisawa, and Satoshi Matsuoka. 2014. NVM-based hybrid BFS with memory efficient data structure. In 2014 IEEE International Conference on Big Data (Big Data). IEEE, 529–538.
[15]
U Kang, Duen Horng Chau, and Christos Faloutsos. 2011. Mining large graphs: Algorithms, inference, and discoveries. In 2011 IEEE 27th International Conference on Data Engineering. IEEE, 243–254.
[16]
Haewoon Kwak, Changhyun Lee, Hosung Park, and Sue Moon. 2010. What is Twitter, a social network or a news media?Www ’10 Proceedings of International Conference on World Wide Web (2010), 591–600.
[17]
Yongsub Lim, Won-Jo Lee, Ho-Jin Choi, and U Kang. 2015. Discovering large subsets with high quality partitions in real world graphs. In 2015 International Conference on Big Data and Smart Computing (BIGCOMP). IEEE, 186–193.
[18]
Heng Lin, Xiaowei Zhu, Bowen Yu, Xiongchao Tang, Wei Xue, Wenguang Chen, Lufei Zhang, Torsten Hoefler, Xiaosong Ma, Xin Liu, 2018. Shentu: processing multi-trillion edge graphs on millions of cores in seconds. In SC18: International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE, 706–716.
[19]
Tingting Liu, Dong Lu, Hao Zhang, Mingyue Zheng, Huaiyu Yang, Yechun Xu, Cheng Luo, Weiliang Zhu, Kunqian Yu, and Hualiang Jiang. 2016. Applying high-performance computing in drug discovery and molecular simulation. National science review 3, 1 (2016), 49–63.
[20]
Weifeng Liu and Brian Vinter. 2015. CSR5: An efficient storage format for cross-platform sparse matrix-vector multiplication. In Proceedings of the 29th ACM on International Conference on Supercomputing. 339–350.
[21]
Yucheng Low, Danny Bickson, Joseph Gonzalez, Carlos Guestrin, Aapo Kyrola, and Joseph M. Hellerstein. 2012. Distributed GraphLab: A Framework for Machine Learning in the Cloud. PVLDB 5, 8 (2012), 716–727.
[22]
Eurıpides Montagne and Anand Ekambaram. 2004. An optimal storage format for sparse matrices. Inform. Process. Lett. 90, 2 (2004), 87–92.
[23]
Koji Ueno, Toyotaro Suzumura, Naoya Maruyama, Katsuki Fujisawa, and Satoshi Matsuoka. 2016. Extreme scale breadth-first search on supercomputers. In 2016 IEEE International Conference on Big Data (Big Data). IEEE, 1040–1047.
[24]
Koji Ueno, Toyotaro Suzumura, Naoya Maruyama, Katsuki Fujisawa, and Satoshi Matsuoka. 2017. Efficient breadth-first search on massively parallel and distributed-memory machines. Data Science and Engineering 2, 1 (2017), 22–35.
[25]
Norases Vesdapunt and Hector Garcia-Molina. 2016. Updating an Existing Social Graph Snapshot via a Limited API. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM, 1693–1702.
[26]
Wikipedia. 2021. Breadth-first search. https://en.wikipedia.org/wiki/Breadth-first_search Last accessed 16 September 2021.
[27]
Gan Xinbiao, Tan Wen, and Liu Jie. 2021. Bidirectional-Bitmap Based CSR for Reducing Large-Scale Graph Space. Journal of Computer Research and Development 58, 3 (2021), 458.
[28]
R. Zafarani and H. Liu. 2009. Social Computing Data Repository at ASU [http://socialcomputing.asu.edu]. Informatics and Decision Systems Engineering (2009).
[29]
Mingxing Zhang, Yongwei Wu, Kang Chen, Xuehai Qian, Xue Li, and Weimin Zheng. 2016. Exploring the hidden dimension in graph processing. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16).
[30]
Yiming Zhang, Haonan Wang, Menghan Jia, Jinyan Wang, Dong sheng Li, Guangtao Xue, and K. Tan. 2020. TopoX: Topology Refactorization for Minimizing Network Communication in Graph Computations. IEEE/ACM Transactions on Networking 28 (2020), 2768–2782.
[31]
Xiaowei Zhu, Wenguang Chen, Weimin Zheng, and Xiaosong Ma. 2016. Gemini: A Computation-Centric Distributed Graph Processing System. In 12th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2016, Savannah, GA, USA, November 2-4, 2016, Kimberly Keeton and Timothy Roscoe (Eds.). USENIX Association, 301–316. https://www.usenix.org/conference/osdi16/technical-sessions/presentation/zhu

Index Terms

  1. SuperCSR: A Space-Time-Efficient CSR Representation for Large-scale Graph Applications on Supercomputers

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICPP '24: Proceedings of the 53rd International Conference on Parallel Processing
    August 2024
    1279 pages
    ISBN:9798400717932
    DOI:10.1145/3673038
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 August 2024

    Check for updates

    Author Tags

    1. CSR
    2. Graph processing
    3. Graph500
    4. Sorted graph

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICPP '24

    Acceptance Rates

    Overall Acceptance Rate 91 of 313 submissions, 29%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 199
      Total Downloads
    • Downloads (Last 12 months)199
    • Downloads (Last 6 weeks)102
    Reflects downloads up to 19 Nov 2024

    Other Metrics

    Citations

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media