-
GSoC'23 @wikimedia developer @mdgspace
- Bengaluru, Karnataka, India
-
16:47
(UTC +05:30) - https://nik-55.github.io
- in/nikhilmahajan123
- @m_nik55
- https://medium.com/@nik.xyz.in
- https://huggingface.co/nik-55
Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Starred repositories
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
Collect some World Models for Autonomous Driving (and Robotic) papers.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Open-source, vision-first browser agent
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
A Straightforward, Step-by-Step Implementation of a Video Diffusion Model
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Wan: Open and Advanced Large-Scale Video Generative Models
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
SkyReels-V2: Infinite-length Film Generative model
Minecraft mod development framework used by Forge and FML for the gradle build system
A set of 13 diverse machine-learning tasks that require memory to solve.
A networking protocol for agent-environment communication
A customisable 3D platform for agent-based AI research
Reinforcement Learning environments based on the 1993 game Doom
Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presente…
Outlining and demonstrating how language models are able to understand image, video, and text content.
Production First and Production Ready End-to-End Keyword Spotting Toolkit
On-device wake word detection powered by deep learning
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.