jbwang1997

jbwang1997 jbwang1997

Keep calm and carry on coding!!!

166 followers · 75 following

NWPU -> NKU
Tianjin, China
16:21 (UTC +08:00)
https://jbwang1997.github.io/

Achievements

x3 x2 x3 x2

Achievements

x3 x2 x3 x2

Stars

myscience / open-genie

Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).

Python 213 30 Updated Aug 21, 2024

Tencent-Hunyuan / HunyuanWorld-Mirror

Universal 3D World Reconstruction with Any-Prior Prompting

Python 241 14 Updated Oct 23, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 13,294 775 Updated Oct 23, 2025

xiaomi-research / recogdrive

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Python 306 24 Updated Oct 20, 2025

MCZhi / GameFormer

[ICCV 2023 Oral] Game-theoretic modeling and learning of Transformer-based interactive prediction and planning

Python 315 42 Updated Mar 8, 2024

BraveGroup / DriveVLA-W0

127 8 Updated Oct 17, 2025

KellerJordan / modded-nanogpt

NanoGPT (124M) in 3 minutes

Python 3,674 470 Updated Oct 16, 2025

NVlabs / vla0

131 Updated Oct 15, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 31,188 3,323 Updated Oct 22, 2025

DiffusionAD / Flow-Planner

[NeurIPS 2025] Official implementation for "Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling"

47 9 Updated Oct 14, 2025

elisabettafedele / superdec

[ICCV 2025] SuperDec: 3D Scene Decomposition with  Superquadric Primitives.

Python 127 5 Updated Oct 14, 2025

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,307 29 Updated Oct 15, 2025

EnVision-Research / DA-2

Official Implementation of DA^2: Depth Anything in Any Direction

Python 179 14 Updated Oct 12, 2025

SOTAMak1r / Infinite-Forcing

Forked from guandeh17/Self-Forcing

Infinite-Forcing: Towards Infinite-Long Video Generation

Python 80 2 Updated Oct 22, 2025

facebookresearch / 4DGT

[NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"

Python 294 4 Updated Sep 19, 2025

Fictionarry / GeoSVR

[NeurIPS'25 Spotlight] GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction

Python 124 7 Updated Sep 28, 2025

hustvl / RAD

[NeurIPS 2025] RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Python 78 1 Updated Oct 6, 2025

AlmondGod / tinyworlds

A minimal implementation of DeepMind's Genie world model

Python 992 69 Updated Sep 28, 2025

OpenDriveLab / ViDAR

[CVPR 2024 Highlight] Visual Point Cloud Forecasting

Python 338 23 Updated Jul 2, 2025

SpatialVision / Prior-Depth-Anything

Python 406 32 Updated Sep 2, 2025

NVlabs / Long-RL

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 644 23 Updated Sep 24, 2025

tum-vision / scenedino

Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion (ICCV 2025)

Python 68 5 Updated Sep 18, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,347 1,196 Updated Oct 22, 2025

YXB-NKU / SE-GUI

[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

Python 73 4 Updated Oct 21, 2025

zcablii / ViTP

Offical implementation of "Visual Instruction Pretraining for Domain-Specific Foundation Models"

Python 75 1 Updated Oct 21, 2025

EvolvingLMMs-Lab / multimodal-search-r1

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 336 17 Updated Aug 26, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,329 1,229 Updated Oct 18, 2025

microsoft / MoGe

[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 1,849 114 Updated Oct 20, 2025

yangzhou24 / OmniWorld

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Python 377 6 Updated Oct 15, 2025

EnVision-Research / Lotus

Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Python 747 45 Updated Apr 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

jbwang1997 jbwang1997

Achievements

Achievements

Block or report jbwang1997

Stars

myscience / open-genie

Tencent-Hunyuan / HunyuanWorld-Mirror

deepseek-ai / DeepSeek-OCR

xiaomi-research / recogdrive

MCZhi / GameFormer

BraveGroup / DriveVLA-W0

KellerJordan / modded-nanogpt

NVlabs / vla0

karpathy / nanochat

DiffusionAD / Flow-Planner

elisabettafedele / superdec

bytetriper / RAE

EnVision-Research / DA-2

SOTAMak1r / Infinite-Forcing

facebookresearch / 4DGT

Fictionarry / GeoSVR

hustvl / RAD

AlmondGod / tinyworlds

OpenDriveLab / ViDAR

SpatialVision / Prior-Depth-Anything

NVlabs / Long-RL

tum-vision / scenedino

QwenLM / Qwen3-VL

YXB-NKU / SE-GUI

zcablii / ViTP

EvolvingLMMs-Lab / multimodal-search-r1

Alibaba-NLP / DeepResearch

microsoft / MoGe

yangzhou24 / OmniWorld

EnVision-Research / Lotus