BMCook: A Task-agnostic Compression Toolkit for Big Models

This document introduces BMCook, a task-agnostic compression toolkit for large language models (LLMs) with billions of parameters. Task-agnostic compression provides an efficient and versatile big model for both prompting and delta tuning, leading to a more general impact than task-specific compression. In BMCook, we implement four representative compression methods: quantization, pruning, distillation, and MoEfication. Developers can easily combine these methods, as sketched below.
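As a minimal sketch of how two of the four methods can be combined (this is not BMCook's actual API; every function and variable name below is an illustrative assumption), a student model is trained with a standard distillation loss while unstructured magnitude pruning is applied after each update:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # Soft-target KL divergence, the standard knowledge-distillation objective.
    s = F.log_softmax(student_logits / temperature, dim=-1)
    t = F.softmax(teacher_logits / temperature, dim=-1)
    return F.kl_div(s, t, reduction="batchmean") * temperature ** 2

def magnitude_prune_(linear, sparsity=0.5):
    # Unstructured pruning: zero out the smallest-magnitude weights in place.
    w = linear.weight.data
    k = int(w.numel() * sparsity)
    if k == 0:
        return
    threshold = w.abs().flatten().kthvalue(k).values
    w.mul_((w.abs() > threshold).to(w.dtype))

# Toy usage: a small "student" layer distilled from a "teacher" layer.
teacher = torch.nn.Linear(16, 8)
student = torch.nn.Linear(16, 8)
optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)

x = torch.randn(4, 16)
with torch.no_grad():
    teacher_logits = teacher(x)

optimizer.zero_grad()
loss = distillation_loss(student(x), teacher_logits)
loss.backward()
optimizer.step()
magnitude_prune_(student, sparsity=0.5)  # prune after each optimizer step
```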
The compression methods used in our experiments include 8-bit quantization, structured pruning, unstructured pruning, and MoEfication.
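The 8-bit quantization below is a minimal sketch assuming symmetric per-tensor min-max calibration; BMCook's actual quantization scheme may differ, so treat the names and the scale computation as assumptions:

```python
import torch

def quantize_8bit(w: torch.Tensor):
    # Map float weights onto int8 with a single per-tensor scale factor.
    scale = w.abs().max().clamp(min=1e-8) / 127.0
    q = torch.clamp(torch.round(w / scale), -127, 127).to(torch.int8)
    return q, scale

def dequantize(q: torch.Tensor, scale: torch.Tensor):
    return q.to(torch.float32) * scale

w = torch.randn(8, 16)
q, scale = quantize_8bit(w)
w_hat = dequantize(q, scale)
print((w - w_hat).abs().max())  # rounding error, bounded by scale / 2
```

Storing int8 weights plus a single float scale per tensor reduces weight memory roughly 4x relative to float32.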
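MoEfication converts a dense ReLU feed-forward layer into groups of neurons ("experts") so that each token evaluates only a few groups. The sketch below is a simplified, assumption-laden version: it groups the input-weight rows with a tiny k-means loop and routes by similarity to the group centers, whereas the actual method uses more careful grouping and a learned router:

```python
import torch

def moefy(w_in: torch.Tensor, num_experts: int, iters: int = 10):
    # Cluster the d_ff rows of w_in (shape [d_ff, d_model]) into experts.
    centers = w_in[torch.randperm(w_in.size(0))[:num_experts]].clone()
    for _ in range(iters):
        assign = torch.cdist(w_in, centers).argmin(dim=1)
        for e in range(num_experts):
            members = w_in[assign == e]
            if members.numel() > 0:
                centers[e] = members.mean(dim=0)
    return assign, centers

def moe_ffn(x, w_in, w_out, assign, centers, top_k=2):
    # Route each token to the top_k most similar experts and evaluate
    # only the neurons belonging to those experts (ReLU FFN assumed).
    scores = x @ centers.t()                    # [n_tokens, num_experts]
    chosen = scores.topk(top_k, dim=1).indices  # [n_tokens, top_k]
    out = torch.zeros(x.size(0), w_out.size(1))
    for t in range(x.size(0)):
        mask = (assign.unsqueeze(0) == chosen[t].unsqueeze(1)).any(dim=0)
        h = torch.relu(x[t] @ w_in[mask].t())   # selected neurons only
        out[t] = h @ w_out[mask]
    return out

# Toy usage with hypothetical sizes.
d_model, d_ff = 16, 64
w_in, w_out = torch.randn(d_ff, d_model), torch.randn(d_ff, d_model)
assign, centers = moefy(w_in, num_experts=8)
x = torch.randn(4, d_model)
print(moe_ffn(x, w_in, w_out, assign, centers).shape)  # torch.Size([4, 16])
```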
To address the computation bottleneck encountered when deploying big models in real-world scenarios, we also provide BMInf, an open-source toolkit for big model inference and tuning.
BMCook and BMInf are developed under OpenBMB, an open-source suite of big models that aims to break the barriers of computation and expertise in big model applications.
Tools
- BMCook: Model Compression for Big Models [Code]
- llama.cpp: Inference of the LLaMA model in pure C/C++ [Code]
- LangChain: Building applications with LLMs [Code]
References
- Zhengyan Zhang, Baitao Gong, et al. BMCook: A Task-agnostic Compression Toolkit for Big Models. In Proceedings of EMNLP: System Demonstrations, pages 396–405.
- Xu Han, Guoyang Zeng, et al. BMInf: An Efficient Toolkit for Big Model Inference and Tuning. In Proceedings of ACL: System Demonstrations.