Stars
The official repository of SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
「TCSVT2021」A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization
[CVPR25] Official implementation of `MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.'
[ECCV'24] Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.
An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
https://dl.acm.org/doi/10.1145/3689095.3689102
[Pattern Recognition'24] Pytorch implementation of Multiple-environment Self-adaptive Network for Aerial-view Geo-localization 🚁 https://arxiv.org/abs/2204.08381
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Dreamer312 / LPN
Forked from wtyhub/LPNPytorch implementation of Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization https://arxiv.org/abs/2008.11646
layumi / PiPa
Forked from chen742/PiPaOfficial Implementation of PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain Adaptative Semantic Segmentation
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Zhedong Zheng Homepage https://www.zdzheng.xyz
layumi / LPN
Forked from wtyhub/LPNPytorch implementation of Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization https://arxiv.org/abs/2008.11646
UAVM @ ACM MM2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
Pytorch implementation of Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation Regularization https://arxiv.org/abs/2211.05296
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.