OpenMMLab简介

Github链接https://github.com/open-mmlab官网链接https://openmmlab.com

OpenMMLab简介_第1张图片

OpenMMLab简介_第2张图片

目录

目标检测算法库MMDetection

目标检测算法库MMYOLO

 文字检测识别算法库MMOCR

3D目标检测算法库MMDetection3D

旋转目标检测算法库MMRotate

图像分割算法库MMSegmentation

图像分类+预训练+多模态算法库MMPretrain

姿态估计算法库MMPose

三维人体姿态估计算法库MMHuman3D

视频动作识别算法库MMAction

目标追踪MMTracking

生成模型+底层视觉+AIGC算法库MMagic

模型部署工具箱MMDeploy

趣味应用游乐场Playground


算法库详解

目标检测算法库MMDetection

MMDetectionhttp://github.com/open-mmlab/mmdetection

MMDetection 支持了各种不同的检测任务,包括目标检测实例分割全景分割,以及半监督目标检测。可以按照自己需求做自己的数据集并使用现有的框架快速训练、推理。

OpenMMLab简介_第3张图片

 算法框架

Object Detection Instance Segmentation Panoptic Segmentation Other
  • Fast R-CNN (ICCV'2015)
  • Faster R-CNN (NeurIPS'2015)
  • RPN (NeurIPS'2015)
  • SSD (ECCV'2016)
  • RetinaNet (ICCV'2017)
  • Cascade R-CNN (CVPR'2018)
  • YOLOv3 (ArXiv'2018)
  • CornerNet (ECCV'2018)
  • Grid R-CNN (CVPR'2019)
  • Guided Anchoring (CVPR'2019)
  • FSAF (CVPR'2019)
  • CenterNet (CVPR'2019)
  • Libra R-CNN (CVPR'2019)
  • TridentNet (ICCV'2019)
  • FCOS (ICCV'2019)
  • RepPoints (ICCV'2019)
  • FreeAnchor (NeurIPS'2019)
  • CascadeRPN (NeurIPS'2019)
  • Foveabox (TIP'2020)
  • Double-Head R-CNN (CVPR'2020)
  • ATSS (CVPR'2020)
  • NAS-FCOS (CVPR'2020)
  • CentripetalNet (CVPR'2020)
  • AutoAssign (ArXiv'2020)
  • Side-Aware Boundary Localization (ECCV'2020)
  • Dynamic R-CNN (ECCV'2020)
  • DETR (ECCV'2020)
  • PAA (ECCV'2020)
  • VarifocalNet (CVPR'2021)
  • Sparse R-CNN (CVPR'2021)
  • YOLOF (CVPR'2021)
  • YOLOX (CVPR'2021)
  • Deformable DETR (ICLR'2021)
  • TOOD (ICCV'2021)
  • DDOD (ACM MM'2021)
  • RTMDet (ArXiv'2022)
  • Conditional DETR (ICCV'2021)
  • DAB-DETR (ICLR'2022)
  • DINO (ICLR'2023)
  • DiffusionDet (ArXiv'2023)
  • EfficientDet (CVPR'2020)
  • Detic (ECCV'2022)
  • Mask R-CNN (ICCV'2017)
  • Cascade Mask R-CNN (CVPR'2018)
  • Mask Scoring R-CNN (CVPR'2019)
  • Hybrid Task Cascade (CVPR'2019)
  • YOLACT (ICCV'2019)
  • InstaBoost (ICCV'2019)
  • SOLO (ECCV'2020)
  • PointRend (CVPR'2020)
  • DetectoRS (ArXiv'2020)
  • SOLOv2 (NeurIPS'2020)
  • SCNet (AAAI'2021)
  • QueryInst (ICCV'2021)
  • Mask2Former (ArXiv'2021)
  • CondInst (ECCV'2020)
  • SparseInst (CVPR'2022)
  • RTMDet (ArXiv'2022)
  • BoxInst (CVPR'2021)
  • Panoptic FPN (CVPR'2019)
  • MaskFormer (NeurIPS'2021)
  • Mask2Former (ArXiv'2021)

模块组件

Backbones Necks Loss Common
  • VGG (ICLR'2015)
  • ResNet (CVPR'2016)
  • ResNeXt (CVPR'2017)
  • MobileNetV2 (CVPR'2018)
  • HRNet (CVPR'2019)
  • Generalized Attention (ICCV'2019)
  • GCNet (ICCVW'2019)
  • Res2Net (TPAMI'2020)
  • RegNet (CVPR'2020)
  • ResNeSt (ArXiv'2020)
  • PVT (ICCV'2021)
  • Swin (CVPR'2021)
  • PVTv2 (ArXiv'2021)
  • ResNet strikes back (ArXiv'2021)
  • EfficientNet (ArXiv'2021)
  • ConvNeXt (CVPR'2022)
  • ConvNeXtv2 (ArXiv'2023)
  • PAFPN (CVPR'2018)
  • NAS-FPN (CVPR'2019)
  • CARAFE (ICCV'2019)
  • FPG (ArXiv'2020)
  • GRoIE (ICPR'2020)
  • DyHead (CVPR'2021)
  • GHM (AAAI'2019)
  • Generalized Focal Loss (NeurIPS'2020)
  • Seasaw Loss (CVPR'2021)
  • OHEM (CVPR'2016)
  • Group Normalization (ECCV'2018)
  • DCN (ICCV'2017)
  • DCNv2 (CVPR'2019)
  • Weight Standardization (ArXiv'2019)
  • Prime Sample Attention (CVPR'2020)
  • Strong Baselines (CVPR'2021)
  • Resnet strikes back (ArXiv'2021)

目标检测算法库MMYOLO

MMYOLO 中目前实现了目标检测和旋转框目标检测算法,但是相比 MMDeteciton 版本有显著训练加速,训练速度相比原先版本提升 2.6 倍。

MMYOLOhttps://github.com/open-mmlab/mmyoloOpenMMLab简介_第4张图片

 文字检测识别算法库MMOCR

MMOCRhttps://github.com/open-mmlab/mmocr

专注于文本检测,文本识别以及相应的下游任务,如关键信息提取。 它是 OpenMMLab 项目的一部分。兼容中文,英文和数字。

OpenMMLab简介_第5张图片

 3D目标检测算法库MMDetection3D

MMDection3Dhttps://github.com/open-mmlab/mmdetection3d

面向 3D 检测的平台

OpenMMLab简介_第6张图片

 旋转目标检测算法库MMRotate

适合目标检测的框有方向的任务
GitHub - open-mmlab/mmrotate: OpenMMLab Rotated Object Detection Toolbox and BenchmarkOpenMMLab Rotated Object Detection Toolbox and Benchmark - GitHub - open-mmlab/mmrotate: OpenMMLab Rotated Object Detection Toolbox and Benchmarkhttps://github.com/open-mmlab/mmrotate

图像分割算法库MMSegmentation

 语义分割,适合街景,无人驾驶,理疗影像等任务GitHub - open-mmlab/mmsegmentation: OpenMMLab Semantic Segmentation Toolbox and Benchmark.OpenMMLab Semantic Segmentation Toolbox and Benchmark. - GitHub - open-mmlab/mmsegmentation: OpenMMLab Semantic Segmentation Toolbox and Benchmark.https://github.com/open-mmlab/mmsegmentation

OpenMMLab简介_第7张图片

 图像分类+预训练+多模态算法库MMPretrain

GitHub - open-mmlab/mmpretrain: OpenMMLab Pre-training Toolbox and BenchmarkOpenMMLab Pre-training Toolbox and Benchmark. Contribute to open-mmlab/mmpretrain development by creating an account on GitHub.https://github.com/open-mmlab/mmpretrain

  • 图像分类:输入图像,AI给出类别的概率,概率最高的就是最后的类型
  • 图像描述(Image Caption):输入图片,AI使用文字描述图片
  • 视觉问答(Visual Question Answering):输入图片,问AI一个问题,AI根据图片给出答案
  • 视觉定位(Visual Grounding):输入图片,给出提示提,AI根据提示词换出框
  • 检索(图搜图,图搜文,文搜图)

OpenMMLab简介_第8张图片

 姿态估计算法库MMPose

GitHub - open-mmlab/mmpose: OpenMMLab Pose Estimation Toolbox and Benchmark.OpenMMLab Pose Estimation Toolbox and Benchmark. Contribute to open-mmlab/mmpose development by creating an account on GitHub.https://github.com/open-mmlab/mmposeOpenMMLab简介_第9张图片

 除了人体关键提检测也可以做自己标注数据的关键点检测

OpenMMLab简介_第10张图片

 三维人体姿态估计算法库MMHuman3D

GitHub - open-mmlab/mmhuman3d: OpenMMLab 3D Human Parametric Model Toolbox and BenchmarkOpenMMLab 3D Human Parametric Model Toolbox and Benchmark - GitHub - open-mmlab/mmhuman3d: OpenMMLab 3D Human Parametric Model Toolbox and Benchmarkhttps://github.com/open-mmlab/mmhuman3dOpenMMLab简介_第11张图片

 视频动作识别算法库MMAction2

动作识别,时序动作检测,时空动作检测以及基于人体姿态的动作识别

GitHub - open-mmlab/mmaction2: OpenMMLab's Next Generation Video Understanding Toolbox and BenchmarkOpenMMLab's Next Generation Video Understanding Toolbox and Benchmark - GitHub - open-mmlab/mmaction2: OpenMMLab's Next Generation Video Understanding Toolbox and Benchmarkhttps://github.com/open-mmlab/mmaction2

无法通过单帧画面,需要结合上下文判断

OpenMMLab简介_第12张图片

目标追踪MMTracking

 支持视频目标检测,多目标跟踪,单目标跟踪和视频实例分割等多种任务和算法。

GitHub - open-mmlab/mmtracking: OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework. - GitHub - open-mmlab/mmtracking: OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.https://github.com/open-mmlab/mmtrackingOpenMMLab简介_第13张图片

生成模型+底层视觉+AIGC算法库MMagic

支持了流行的图像修复、图文生成、3D生成、图像修补、抠图、超分辨率和生成等任务的应用。 支持 Stable Diffusion 的微调和ControlNet 动画生成。也支持 GANs 的插值,投影,编辑和其他流行的应用。

  • 文生图,图生图,图像填充(inpainting)
  • 图像去噪,重建,超分辨率,去雾
  • 视频插帧,超分辨率
  • 黑白老照片上色
  • Matting(抠人)

GitHub - open-mmlab/mmagic: OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic : Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/enhancement, etc.OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic : Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/enhancement, etc. - GitHub - open-mmlab/mmagic: OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic : Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/enhancement, etc.https://github.com/open-mmlab/mmagic

模型部署工具箱MMDeploy

为各算法库提供统一的部署体验

open-mmlab/mmdeploy: OpenMMLab Model Deployment Framework (github.com)https://github.com/open-mmlab/mmdeploy

趣味应用游乐场Playground

一个用于收集和展示 OpenMMLab 相关前沿和有趣应用的项目

https://github.com/open-mmlab/playgroundhttps://github.com/open-mmlab/playground

你可能感兴趣的:(深度学习,人工智能)