akawincent

😵

working

wincent akawincent

Young dumb who is intrigued by robotics and computer vision

44 followers · 64 following

Westlake University
Hangzhou, Zhejiang, China
https://akawincent.github.io/
https://www.zhihu.com/people/wincent-84
@pu_wen99907

Stars

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python · 1,301 stars · 28 forks · Updated Oct 15, 2025

WU-CVGL / GS-Reasoner

Reasoning in Space via Grounding in the World

Python · 26 stars · Updated Oct 17, 2025

WU-CVGL / E-MoFlow

[NeurIPS 2025] E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization

Python · 24 stars · 1 fork · Updated Oct 21, 2025

huanngzh / MV-Adapter

[ICCV 2025] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"

Python · 1,181 stars · 71 forks · Updated Jun 26, 2025

Inception3D / TTT3R

A simple state update rule to enhance length generalization for CUT3R

Python · 460 stars · 9 forks · Updated Oct 1, 2025

facebookresearch / map-anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python · 2,048 stars · 113 forks · Updated Oct 9, 2025

LogosRoboticsGroup / SPAR

From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perception and reasoning in VLMs.

Python · 59 stars · Updated Oct 7, 2025

NVlabs / describe-anything

[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python · 1,366 stars · 76 forks · Updated Jun 26, 2025

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python · 16,300 stars · 1,266 forks · Updated Oct 6, 2025

ScanNet / ScanNet

C · 2,140 stars · 363 forks · Updated May 5, 2024

mystorm16 / FastVGGT

Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer

Python · 548 stars · 24 forks · Updated Oct 14, 2025

vision-x-nyu / thinking-in-space

Official repo and evaluation implementation of VSI-Bench

Python · 606 stars · 36 forks · Updated Aug 5, 2025

Relaxed-System-Lab / Flash-Sparse-Attention

🚀🚀 Efficient implementations of Native Sparse Attention

Python · 980 stars · 8 forks · Updated Sep 29, 2025

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python · 6,318 stars · 467 forks · Updated Aug 7, 2024

3DTopia / 4DNeX

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Python · 780 stars · 7 forks · Updated Oct 2, 2025

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python · 19,875 stars · 2,070 forks · Updated Oct 22, 2025

liguodongiot / llm-action

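This project aims to share the technical principles and practical experience of large language models (LLM engineering and deploying LLM applications)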

HTML · 21,452 stars · 2,518 forks · Updated Oct 19, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook · 7,870 stars · 513 forks · Updated Oct 22, 2025

xuxw98 / ESAM

[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time

Python · 576 stars · 28 forks · Updated May 7, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

16,520 stars · 1,067 forks · Updated Oct 20, 2025

mll-lab-nu / MindCube

Python · 94 stars · 3 forks · Updated Oct 2, 2025

AIGeeksGroup / 3D-R1

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

Python · 342 stars · 11 forks · Updated Sep 28, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python · 18,913 stars · 1,864 forks · Updated Oct 6, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python · 5,192 stars · 449 forks · Updated Aug 22, 2025

zhengqili / Neural-Scene-Flow-Fields

PyTorch implementation of paper "Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes"

Python · 735 stars · 94 forks · Updated Jul 15, 2022

sii-research / siiRL

siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems

Python · 222 stars · 20 forks · Updated Oct 22, 2025

Lilac-Lee / Neural_Scene_Flow_Prior

Neural Scene Flow Prior (NeurIPS 2021 spotlight)

Python · 134 stars · 13 forks · Updated Apr 24, 2023

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python · 3,207 stars · 401 forks · Updated Oct 22, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook · 15,321 stars · 1,191 forks · Updated Oct 22, 2025

diankun-wu / Spatial-MLLM

Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Python · 367 stars · 9 forks · Updated Jun 22, 2025

Lists (9)

📸 SLAMs

event camera

🤯 Paper recommendation

🔨 Toolbox

✨ Inspiration

language

🗻 NeRFs

🤖 Robotics

❄️ Gaussian Splatting