Repositories list
20 repositories
- A comprehensive list [GoMatching@NeurIPS'24, DeepSolo(++)@CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research works related to scene text detection, spotting, etc., including papers and codes.
- LeMeViT: The official repo for [IJCAI'24] "LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation"
- RSP: The official repo for [TGRS'22] "An Empirical Study of Remote Sensing Pretraining"
- DeepSolo: The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting"
- SAMRS: The official repo for [NeurIPS'23] "SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model"
- ViTPose: The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
- MTP: The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"
- A comprehensive list [SAMRS@NeurIPS'23, RVSA@TGRS'22, RSP@TGRS'22] of our research works related to remote sensing, including papers, codes, and citations. Note: The repo for [TGRS'22] "An Empirical Study of Remote Sensing Pretraining" has been moved to: https://github.com/ViTAE-Transformer/RSP
- The official repo for [AAAI 2024] "SimDistill: Simulated Multi-modal Distillation for BEV 3D Object Detection"
- APTv2: The official repo for the extension of [NeurIPS'22] "APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking": https://github.com/pandorgan/APT-36K
- QFormer: The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"
- P3M-Net: The official repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving"
- Remote-Sensing-RVSA: The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"
- SAMText: The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"
- I3CL: The official repo for [IJCV'22] "I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection"
- A comprehensive list [AIM@IJCAI'21, P3M@MM'21, GFM@IJCV'22, RIM@CVPR'23, P3MNet@IJCV'23] of our research works related to image matting, including papers, codes, datasets, demos, and citations. Note: The repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving" has been moved to: https://github.com/ViTAE-Transformer/P3M-Net
- ViTAE-Transformer: The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"
- ViTAE-VSA: The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"
- VOS-LLB: The official repo for [AAAI'23] "Learning to Learn Better for Video Object Segmentation"
- ViTDet: Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"