Starred repositories
Transformer based on a variant of attention with linear complexity with respect to sequence length
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
AddressSanitizer, ThreadSanitizer, MemorySanitizer
PyTorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
Unified KV Cache Compression Methods for Auto-Regressive Models
A beautiful, simple, clean, and responsive Jekyll theme for academics
An unofficial PyTorch implementation of "Efficient Infinite Context Transformers with Infini-attention"
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax"
Official repository of InLine attention (NeurIPS 2024)
Set of tools to assess and improve LLM security.
Official repository of FLatten Transformer (ICCV2023)
PyTorch extensions for high performance and large scale training.
PyTorch implementation of the Llama 3.2 1B architecture, barebones + nuggets of wisdom
Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer
A toy compiler for NumPy array expressions that uses e-graphs and MLIR
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Toying with displaying dispatch dependency in IREE
siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems
Tile primitives for speedy kernels
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
Generate SQL from TableGen code — part of the tutorial "How to Write a TableGen Backend" from the 2021 LLVM Developers' Meeting