-
BDFL @ Unum
- London, San Francisco, Yerevan
-
13:52
(UTC +01:00) - ashvardanian.com
- https://orcid.org/0000-0002-4882-1815
- @ashvardanian
- in/ashvardanian
- ashvardanian
- @ashvardanian
Starred repositories
An evolutionary approach to find small and low latency sorting networks
Large collection of number systems providing custom arithmetic for mixed-precision algorithm development and optimization for AI, Machine Learning, Computer Vision, Signal Processing, CAE, EDA, con…
Minimalistic Rust toolkit for hash function quality analysis. Tests avalanche effect, differential patterns, and statistical distribution across variable-length n-grams.
HJLebbink / RustGPT
Forked from tekaratzas/RustGPTAn transformer based LLM. Written completely in Rust
Apache Spark and Unum USearch integration example benchmarking distributed Vector Search against Lucene and OpenSearch
Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variable length strings
Fast, Flexible and Portable Structured Generation
Lower-latency OpenMP-style minimalistic scoped thread-pool designed for 'Fork-Join' parallelism in Rust and C++, avoiding memory allocations, mutexes, CAS-primitives, and false-sharing on the hot p…
Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild
Lightpanda: the headless browser designed for AI and automation
Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables
Multi-platform high-performance compute language extension for Rust.
The indexing service for ScyllaDB for vector searching functionality
A Visual Studio Code extension for building and debugging CUDA applications.
A Datacenter Scale Distributed Inference Serving Framework
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
Official repository for the Microsoft C/C++ extension for VS Code.
A benchmark for low-level CPU micro-architectural features
NetworkX-like Python experience for Postgres, SQLite, MongoDB, and Neo4J
A versatile pairwise aligner for genomic and spliced nucleotide sequences
Capstone disassembly/disassembler framework for ARM, ARM64 (ARMv8), Alpha, BPF, Ethereum VM, HPPA, LoongArch, M68K, M680X, Mips, MOS65XX, PPC, RISC-V(rv32G/rv64G), SH, Sparc, SystemZ, TMS320C64X, T…
OpenMM is a toolkit for molecular simulation using high performance GPU code.
Heap Layers: An Extensible Memory Allocation Infrastructure
Compile Time Regular Expression in C++
Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO
Parallel Computing starter project to build GPU & CPU kernels in CUDA & C++ and call them from Python without a single line of CMake using PyBind11