"Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving."
Ruoyu Qin et al. (2024)
- Ruoyu Qin, Zheming Li, Weiran He, Mingxing Zhang, Yongwei Wu, Weimin Zheng, Xinran Xu:
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving. CoRR abs/2407.00079 (2024)