SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Updated Nov 27, 2024 - Python
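The first entry is Intel Neural Compressor. As a rough illustration of its post-training quantization flow, a minimal sketch assuming the 2.x PyTorch API follows; the toy model and calibration dataloader are placeholders for a real model and dataset, and defaults may differ between releases.

```python
# Minimal sketch: post-training static INT8 quantization with Intel Neural Compressor (2.x API).
# The model and calibration data below are toy stand-ins, not taken from the repository.
import torch
from torch.utils.data import DataLoader, TensorDataset
from neural_compressor import PostTrainingQuantConfig, quantization

# Toy FP32 model and calibration data.
model = torch.nn.Sequential(torch.nn.Linear(16, 32), torch.nn.ReLU(), torch.nn.Linear(32, 4))
calib_loader = DataLoader(
    TensorDataset(torch.randn(64, 16), torch.zeros(64, dtype=torch.long)),
    batch_size=8,
)

# Calibrate on the dataloader and produce an INT8 model.
conf = PostTrainingQuantConfig(approach="static")
q_model = quantization.fit(model=model, conf=conf, calib_dataloader=calib_loader)
q_model.save("./int8-model")
```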
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
Advanced quantization algorithm for LLMs/VLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs".
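For that repository (AutoRound), a minimal usage sketch is shown below, assuming the `auto_round` package and a Hugging Face causal LM; the model id is an arbitrary small example and argument names may vary across releases.

```python
# Sketch: 4-bit weight quantization with AutoRound (signed-gradient weight rounding).
# Model name and settings are illustrative; see the repository README for the current API.
from transformers import AutoModelForCausalLM, AutoTokenizer
from auto_round import AutoRound

model_name = "facebook/opt-125m"  # small model used purely as an example
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

autoround = AutoRound(model, tokenizer, bits=4, group_size=128, sym=True)
autoround.quantize()
autoround.save_quantized("./opt-125m-autoround")
```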
Quantize LLMs using AWQ (Activation-aware Weight Quantization)
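A typical AWQ quantization flow, sketched here with the AutoAWQ package rather than taken from that particular repository, looks roughly like this; the model id and quantization settings are example values.

```python
# Rough sketch of AWQ 4-bit weight quantization using the AutoAWQ package.
# Model id and quant_config are examples, not taken from the repository above.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "mistralai/Mistral-7B-Instruct-v0.1"
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)

model.quantize(tokenizer, quant_config=quant_config)  # activation-aware calibration + weight scaling
model.save_quantized("mistral-7b-instruct-awq")
tokenizer.save_pretrained("mistral-7b-instruct-awq")
```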
Artificial Personality is a text-to-text AI chatbot that can use character cards
This repository contains notebooks and resources related to the Software Development Group Project (SDGP) machine learning component. Specifically, it includes two notebooks used for creating a dataset and fine-tuning a Mistral-7B-v0.1-Instruct model.
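A common single-GPU recipe for fine-tuning a Mistral-7B instruct model is 4-bit NF4 loading plus LoRA adapters. The following generic sketch illustrates that pattern; it is not necessarily the exact approach used in those notebooks, and the model id, target modules, and hyperparameters are assumptions.

```python
# Generic QLoRA-style sketch: load Mistral-7B in 4-bit NF4 and attach LoRA adapters.
# Model id, target modules, and hyperparameters are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

model_id = "mistralai/Mistral-7B-Instruct-v0.1"
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)

lora_config = LoraConfig(r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM")
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
# From here, train with transformers.Trainer or trl.SFTTrainer on the prepared dataset.
```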