Highlights
Stars
Supporting code for the blog post on modular manifolds.
The official Python SDK for the Perceptron API
A PyTorch native platform for training generative AI models
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Development repository for the Triton language and compiler
MoVQGAN - model for the image encoding and reconstruction
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
A repository for research on medium sized language models.
Easily compute clip embeddings from video frames
commaVQ is a dataset of compressed driving video
Easily create large video dataset from video urls
Train vision models using JAX and 🤗 transformers
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
Official repo for consistency models.
You like pytorch? You like micrograd? You love tinygrad! ❤️
A feature-rich command-line audio/video downloader
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
An open-source framework for training large multimodal models.
Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...
A curated list of deep learning resources for video-text retrieval.