Mainstream tasks related to "anything": 2D detection (AnyObject), 3D detection (Any3D), AI generation (AnyGeneration), AI model optimization (), AI tasks, etc.
Title & Authors | Intro | Useful Links |
---|---|---|
Anything-3D: Segment-Anything + 3D, Let’s lift the anything to 3D (Project) LV-Lab, NUS | | [Github] |
SAM 3D Selector: Utilizing segment-anything to help the region selection of 3D point cloud or mesh (Project) Nexuslrf | | [Github] |
3D-Box via Segment Anything (Project) dvlab-research | | [Github] |
Segment Anything 3D (Project) Yunhan Yang, Xiaoyang Wu | | [Github] |

Title & Authors | Intro | Useful Links |
---|---|---|
Caption Anything: Interactive Image Description with Diverse Multimodal Controls. Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao, SUSTech VIP Lab, Preprint’23, Caption Anything (Project) | | [Github] [Demo] |
Image2Paragraph: Transform Image into Unique Paragraph (Project) Jinpeng Wang | | [Github] |
… |

Paper | First Author | Venue | Topic |
---|---|---|---|
Segment Anything | Alexander Kirillov | Preprint’23 | Segmentation |
Learning to Segment Every Thing | Ronghang Hu | CVPR’18 | |
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | Shilong Liu | Preprint’23 | Grounding + Detection |
SegGPT: Segmenting Everything In Context | Xinlong Wang | Preprint’23 | Segmentation |
V3Det: Vast Vocabulary Visual Detection Dataset | Jiaqi Wang | Preprint’23 | Dataset |
Pose for Everything: Towards Category-Agnostic Pose Estimation | Lumin Xu | ECCV’22 Oral | Pose |

Paper | First Author | Venue | Topic |
---|---|---|---|
High-Resolution Image Synthesis with Latent Diffusion Models | Robin Rombach | CVPR’22 | Text-to-Image Generation |
Adding Conditional Control to Text-to-Image Diffusion Models | Lvmin Zhang | Preprint’23 | Controllable Generation |
GigaGAN: Large-scale GAN for Text-to-Image Synthesis | Minguk Kang | CVPR’23 | Large-scale GAN |
Inpaint Anything: Segment Anything Meets Image Inpainting | Tao Yu | Preprint’23 | Inpainting |

Paper | First Author | Venue | Topic |
---|---|---|---|
DepGraph: Towards Any Structural Pruning | Gongfan Fang | CVPR’23 | Network Pruning |
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark | Yuhang Li | NeurIPS’21 | Network Quantization |
OTOv2: Automatic, Generic, User-Friendly | Tianyi Chen | ICLR’23 | Network Pruning |
Deep Model Reassembly | Xingyi Yang | NeurIPS’22 | Model Reuse |

Paper | First Author | Venue | Topic |
---|---|---|---|
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace | Yongliang Shen | Preprint’23 | Modelzoo + LLM |
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs | Yaobo Liang | Preprint’23 | Modelzoo + LLM |
Generalized Decoding for Pixel, Image and Language | Xueyan Zou | CVPR’23 | Multi Tasking |
Pre-Trained Image Processing Transformer | Hanting Chen | CVPR’21 | Low-level Vision |