gkm0120

【论文汇总】Semantic-Segmentation(语义分割)

语义分割

关于语义分割的所有论文和资源的列表。

数据集重要性

语义分割—深度学习

DL模型语义分割的若干实现

数据集

voc2012
CitySpaces
Mapillary
ADE20K
PASCAL Context
COCO-Stuff 10K dataset v1.1
2D-3D-S dataset
Mapillary Vistas
Stanford Background Dataset
Sift Flow Dataset
Barcelona Dataset
Microsoft COCO dataset
MSRC Dataset
LITS Liver Tumor Segmentation Dataset
KITTI
Pascal Context
Data from Games dataset
Human parsing dataset
Mapillary Vistas Dataset
Microsoft AirSim
MIT Scene Parsing Benchmark
COCO 2017 Stuff Segmentation Challenge
ADE20K Dataset
INRIA Annotations for Graz-02
Daimler dataset
ISBI Challenge: Segmentation of neuronal structures in EM stacks
INRIA Annotations for Graz-02 (IG02)
Pratheepan Dataset
Clothing Co-Parsing (CCP) Dataset

资料

论文综述

A 2017 Guide to Semantic Segmentation with Deep Learning by Qure AI [Blog about different sem. segm. methods]
A Review on Deep Learning Techniques Applied to Semantic Segmentation [Survey paper with a special focus on datasets and the highest performing methods]
Computer Vision for Autonomous Vehicles: Problems, Datasets and State-of-the-Art [Survey paper about all aspects of autonomous vehicles, including sem. segm.] [Webpage with a summary of all relevant publications]
A Survey on Deep Learning in Medical Image Analysis [Paper]

在线演示

CRF as RNN
SegNet

2维语义分割

论文:

[2019-CVPR oral] CLAN: Category-level Adversaries for Semantics Consistent [paper] [code]
[2019-CVPR] BRS: Interactive Image Segmentation via Backpropagating Refinement Scheme(***) [paper] [code]
[2019-CVPR] DFANet：Deep Feature Aggregation for Real-Time Semantic Segmentation(used in camera) [paper] [code]
[2019-CVPR] DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency [paper] [code]
[2019-CVPR] Domain Adaptation(reducing the domain shif) [paper]
[2019-CVPR] ELKPPNet: An Edge-aware Neural Network with Large Kernel Pyramid Pooling for Learning Discriminative Features in Semantic- Segmentation [paper] [code]
[2019-CVPR oral] GLNet: Collaborative Global-Local Networks for Memory-Efficient Segmentation of Ultra-High Resolution Images[paper] [code]
[2019-CVPR] Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth(***SOTA) [paper] [code]
[2019-ECCV] ICNet: Real-Time Semantic Segmentation on High-Resolution Images [paper] [code]
[2019-CVPR] LEDNet: A Lightweight Encoder-Decoder Network for Real-Time Semantic Segmentation(***SOTA) [paper] [code]
[2019-arXiv] LightNet++: Boosted Light-weighted Networks for Real-time Semantic Segmentation [paper] [code]
[2019-CVPR] PTSNet: A Cascaded Network for Video Object Segmentation [paper] [code]
[2019-CVPR] PPGNet: Learning Point-Pair Graph for Line Segment Detection [paper] [code]
[2019-CVPR] Show, Match and Segment: Joint Learning of Semantic Matching and Object Co-segmentation [paper] [code]
[2019-CVPR] Video Instance Segmentation [paper] [code]

Arxiv-2018 ExFuse: Enhancing Feature Fusion for Semantic Segmentation 87.9% mean Iou->voc2012 [Paper]
CVPR-2018 spotlight Learning to Adapt Structured Output Space for Semantic Segmentation [Paper] [Code]
Arfix-2018 Adversarial Learning for Semi-supervised Semantic Segmentation [Paper] [Code]
Arxiv-2018 Context Encoding for Semantic Segmentation [Paper] [Code]
CVPR-2018 Learning to Adapt Structured Output Space for Semantic Segmentation [Paper][Code]
CVPR-2018 Dynamic-structured Semantic Propagation Network [Paper]
Deeplab v4: Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [Paper] [Code]
Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs [Paper][Code]
ICCV-2017 Semantic Line Detection and Its Applications [Paper]
ICCV-2017 Attentive Semantic Video Generation Using Captions [Paper]
ICCV-2017 BlitzNet: A Real-Time Deep Network for Scene Understanding [Paper] [Code]
ICCV-2017 SCNet: Learning Semantic Correspondence [Code]
CVPR-2017 End-to-End Instance Segmentation with Recurrent Attention [Code]
CVPR-2017 Deep Watershed Transform for Instance Segmentation [Code]
Piecewise Flat Embedding for Image Segmentation [Paper]
ICCV-2017 Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes [Paper][Code]
CVPR-2017 Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade-2017 [Paper]
CVPR-2017 Annotating Object Instances with a Polygon-RNN-2017 [Project] [Paper]
CVPR-2017 Loss maxpooling for semantic image segmentation [Paper]
ICCV-2017 Scale-adaptive convolutions for scene parsing [Paper]
Towards End-to-End Lane Detection: an Instance Segmentation Approach [Paper]arxiv-1802
AAAI-2018 Mix-and-Match Tuning for Self-Supervised Semantic Segmentation [Paper] arxiv-1712
NIPS-2017-Learning Affinity via Spatial Propagation Networks [Paper]
AAAI-2018-Spatial As Deep: Spatial CNN for Traffic Scene Understanding [Paper]
Stacked Deconvolutional Network for Semantic Segmentation-2017 [Paper]
Deeplab v3: Rethinking Atrous Convolution for Semantic Image Segmentation-2017(DeeplabV3) [Paper]
CVPR-2017 Learning Object Interactions and Descriptions for Semantic Image Segmentation-2017 [Paper]
Pixel Deconvolutional Networks-2017 [Code-Tensorflow] [Paper]
Dilated Residual Networks-2017 [Paper]
A Review on Deep Learning Techniques Applied to Semantic Segmentation-2017 [Paper]
BiSeg: Simultaneous Instance Segmentation and Semantic Segmentation with Fully Convolutional Networks [Paper]
ICNet for Real-Time Semantic Segmentation on High-Resolution Images-2017 [Project] [Code] [Paper] [Video]

Feature Forwarding: Exploiting Encoder Representations for Efficient Semantic Segmentation-2017 [Project] [Code-Torch7]
Reformulating Level Sets as Deep Recurrent Neural Network Approach to Semantic Segmentation-2017 [Paper]
Adversarial Examples for Semantic Image Segmentation-2017 [Paper]
Large Kernel Matters - Improve Semantic Segmentation by Global Convolutional Network-2017 [Paper]
HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection [Paper]
Hypercolumns for Object Segmentation and Fine-grained Localization [Paper]
Matching-CNN meets KNN: Quasi-parametric human parsing[Paper]
Deep Human Parsing with Active Template Regression [Paper]
TPAMI-2012 Learning Hierarchical Features for Scene Labeling The first paper for applying dl on semantic segmentation !!! [Paper]
Label Refinement Network for Coarse-to-Fine Semantic Segmentation-2017 [Paper]
Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation [Paper]
ParseNet: Looking Wider to See Better [Paper]
CVPR-2016 Recombinator Networks: Learning Coarse-to-Fine Feature Aggregation [Paper]
PixelNet: Representation of the pixels, by the pixels, and for the pixels-2017 [Project] [Code-Caffe] [Paper]
LabelBank: Revisiting Global Perspectives for Semantic Segmentation-2017 [Paper]
Progressively Diffused Networks for Semantic Image Segmentation-2017 [Paper]
Understanding Convolution for Semantic Segmentation-2017 [Model-Mxnet] [Paper] [Code]
ICCV-2017 Predicting Deeper into the Future of Semantic Segmentation-2017 [Paper]
CVPR-2017 Pyramid Scene Parsing Network-2017 [Project] [Code-Caffe] [Paper] [Slides]
FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation-2016 [Paper]
FusionNet: A deep fully residual convolutional neural network for image segmentation in connectomics-2016 [Code-PyTorch] [Paper]
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation-2016 [Code-MatConvNet] [Paper]
CVPRW-2017 The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation [Code-Theano] [Code-Keras1] [Code-Keras2] [Paper]
CVPR-2017 Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes [Code-Theano] [Paper]
PixelNet: Towards a General Pixel-level Architecture-2016 [Paper]
Recalling Holistic Information for Semantic Segmentation-2016 [Paper]
Semantic Segmentation using Adversarial Networks-2016 [Paper] [Code-Chainer]
Region-based semantic segmentation with end-to-end training-2016 [Paper]
Exploring Context with Deep Structured models for Semantic Segmentation-2016 [Paper]
Multi-scale context aggregation by dilated convolutions [Paper]
Better Image Segmentation by Exploiting Dense Semantic Predictions-2016 [Paper]
Boundary-aware Instance Segmentation-2016 [Paper]
Improving Fully Convolution Network for Semantic Segmentation-2016 [Paper]
Deep Structured Features for Semantic Segmentation-2016 [Paper]
DeepLab v2:Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs-2016** [Project] [Code-Caffe] [Code-Tensorflow] [Code-PyTorch] [Paper]
DeepLab v1: Semantic Image Segmentation With Deep Convolutional Nets and Fully Connected CRFs-2014** [Code-Caffe1] [Code-Caffe2] [Paper]
Deep Learning Markov Random Field for Semantic Segmentation-2016 [Project] [Paper]
ECCV2016 Salient Deconvolutional Networks [Code]
Convolutional Random Walk Networks for Semantic Image Segmentation-2016 [Paper]
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation-2016 [Code-Caffe1][Code-Caffe2] [Paper] [Blog]
High-performance Semantic Segmentation Using Very Deep Fully Convolutional Networks-2016 [Paper]
CVPR-2016-oral ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation-2016 [Paper]
Object Boundary Guided Semantic Segmentation-2016 [Code-Caffe] [Paper]
Segmentation from Natural Language Expressions-2016 [Project] [Code-Tensorflow] [Code-Caffe] [Paper]
Seed, Expand and Constrain: Three Principles for Weakly-Supervised Image Segmentation-2016 [Code-Caffe] [Paper]
Global Deconvolutional Networks for Semantic Segmentation-2016 [Paper]
Learning Transferrable Knowledge for Semantic Segmentation with Deep Convolutional Neural Network-2015 [Project] [Code-Caffe] [Paper]
Learning Dense Convolutional Embeddings for Semantic Segmentation-2015 [Paper]
ParseNet: Looking Wider to See Better-2015 [Code-Caffe] [Model-Caffe] [Paper]
Decoupled Deep Neural Network for Semi-supervised Semantic Segmentation-2015 [Project] [Code-Caffe] [Paper]
Bayesian segnet: Model uncertainty in deep convolutional encoder-decoder architectures for scene understanding [Paper]
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation-2015 [Project] [Code-Caffe] [Paper] [Tutorial1] [Tutorial2]
Semantic Image Segmentation with Task-Specific Edge Detection Using CNNs and a Discriminatively Trained Domain Transform-2015 [Paper]
Semantic Segmentation with Boundary Neural Fields-2015 [Code] [Paper]
Semantic Image Segmentation via Deep Parsing Network-2015 [Project] [Paper1] [Paper2] [Slides]
What’s the Point: Semantic Segmentation with Point Supervision-2015 [Project] [Code-Caffe] [Model-Caffe] [Paper]
U-Net: Convolutional Networks for Biomedical Image Segmentation-2015 [Project] [Code+Data] [Code-Keras] [Code-Tensorflow] [Paper] [Notes]
Learning Deconvolution Network for Semantic Segmentation(DeconvNet)-2015 [Project] [Code-Caffe] [Paper] [Slides]
Multi-scale Context Aggregation by Dilated Convolutions-2015 [Project] [Code-Caffe] [Code-Keras] [Paper] [Notes]
ReSeg: A Recurrent Neural Network-based Model for Semantic Segmentation-2015 [Code-Theano] [Paper]
ICCV-2015 BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation-2015 [Paper]
Feedforward semantic segmentation with zoom-out features-2015 [Code] [Paper] [Video]
Conditional Random Fields as Recurrent Neural Networks-2015 [Project] [Code-Caffe1] [Code-Caffe2] [Demo] [Paper1] [Paper2]
Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation-2015 [Paper]
Fully Convolutional Networks for Semantic Segmentation-2015 [Code-Caffe] [Model-Caffe] [Code-Tensorflow1] [Code-Tensorflow2] [Code-Chainer] [Code-PyTorch] [Paper1] [Paper2] [Slides1] [Slides2]
Deep Joint Task Learning for Generic Object Extraction-2014 [Project] [Code-Caffe] [Dataset] [Paper]
Highly Efficient Forward and Backward Propagation of Convolutional Neural Networks for Pixelwise Classification-2014 [Code-Caffe] [Paper]
Wider or deeper: Revisiting the resnet model for visual recognition [Paper]
Describing the Scene as a Whole: Joint Object Detection, Scene Classification and Semantic Segmentation[Paper]
Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs[Paper]
Convolutional Patch Networks with Spatial Prior for Road Detection and Urban Scene Understanding[Paper]
Deep Deconvolutional Networks for Scene Parsing[Paper]
FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos[Paper][Poject]
ICCV-2017 Deep Dual Learning for Semantic Image Segmentation [Paper]
From image-level to pixel level labeling with convolutional networks [Paper]
Scene Segmentation with DAG-Recurrent Neural Networks [Paper]
Learning to Segment Every Thing [Paper]
Panoptic Segmentation [Paper]
The Devil is in the Decoder [Paper]
Attention to Scale: Scale-aware Semantic Image Segmentation [Paper][Project]
Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks [Paper] [Project]
Scale-Aware Alignment of Hierarchical Image Segmentation [Paper] [Project]
ICCV-2017 Semi Supervised Semantic Segmentation Using Generative Adversarial Network[Paper]
Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach [Paper]

CVPR-2016 Convolutional Feature Masking for Joint Object and Stuff Segmentation [Paper]
ECCV-2016 Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation [Paper]

FastMask: Segment Object Multi-scale Candidates in One Shot-2016 [Code-Caffe] [Paper]
Pixel Objectness-2017 [Project] [Code-Caffe] [Paper]

3维语义分割

论文

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation [Paper]
PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space (2017) [Paper]
Learning 3D Mesh Segmentation and Labeling (2010) [Paper]
Unsupervised Co-Segmentation of a Set of Shapes via Descriptor-Space Spectral Clustering (2011) [Paper]
Single-View Reconstruction via Joint Analysis of Image and Shape Collections (2015) [Paper]
3D Shape Segmentation with Projective Convolutional Networks (2017) [Paper]
Learning Hierarchical Shape Segmentation and Labeling from Online Repositories (2017) [Paper]
3D Graph Neural Networks for RGBD Semantic Segmentation (2017) [Paper]
3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-scale 3D Point Clouds (2017)[Paper]
Multi-view deep learning for consistent semantic mapping with rgb-d cameras [Paper]

ICCV-2017 Large-scale 3D Shape Reconstruction and Segmentation from ShapeNet Core55 [Paper][Project]

实例分割

Mask Scoring R-CNN (MS R-CNN) [Code][Paper]
Predicting Future Instance Segmentations by Forecasting Convolutional Features [Paper]
CVPR-2018 Path Aggregation Network for Instance Segmentation [Paper] better than Mask-rcnn!！COCO-2017 1st!
Pixelwise Instance Segmentation with a Dynamically Instantiated Network-2017 [Paper]
Semantic Instance Segmentation via Deep Metric Learning-2017 [Paper]
CVPR-2017 FastMask: Segment Multi-scale Object Candidates in One Shot [Code-Tensorflow] [Paper]
Pose2Instance: Harnessing Keypoints for Person Instance Segmentation-2017 [Paper]
Pixelwise Instance Segmentation with a Dynamically Instantiated Network-2017 [Paper]
CVPR-2017-spotlight Fully Convolutional Instance-aware Semantic Segmentation-2016 [Code] [Paper]
CVPR-2016-oral Instance-aware Semantic Segmentation via Multi-task Network Cascades-2015 [Code] [Paper]
Recurrent Instance Segmentation-2015 [Project] [Code-Torch7] [Paper] [Poster] [Video]
Annotating Object Instances with a Polygon-RNN [Paper]
MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features [Paper]
FCIS:Fully Convolutional Instance-aware Semantic Segmentation [Paper]Code
MNC:Instance-aware Semantic Segmentation via Multi-task Network Cascades [Paper]Code
DeepMask:Learning to Segment Object Candidates [Paper] Code
SharpMask:Learning to Refine Object Segments [Paper]Code
RIS:Recurrent Instance Segmentation [Paper]Code
FastMask: Segment Multi-scale Object Candidates in One Shot [Paper]Code
Proposal-free network for instance-level object segmentation [Paper]
ECCV-2016 Instance-sensitive Fully Convolutional Networks [Paper]
Pixel-level encoding and depth layering for instance-level semantic labeling [Paper]

机器人科学(或技术)

Virtual-to-Real: Learning to Control in Visual Semantic Segmentation [Paper]
End-to-End Tracking and Semantic Segmentation Using Recurrent Neural Networks [Paper]
Semantic Segmentation using Adversarial Networks [Paper]

对抗训练

CVPR-2017-Image-to-Image Translation with Conditional Adversarial Networks [Paper]
ICCV-2017-Adversarial Examples for Semantic Segmentation and Object Detection [Paper]

场景理解

论文

1.Spatial As Deep: Spatial CNN for Traffic Scene Understanding [Paper]

数据集& 资料

SUNRGB-D 3D Object Detection Challenge [Link]
19 object categories for predicting a 3D bounding box in real world dimension Training set: 10,355 RGB-D scene images, Testing set: 2860 RGB-D images
SceneNN (2016) [Link]
100+ indoor scene meshes with per-vertex and per-pixel annotation.
ScanNet (2017) [Link]
An RGB-D video dataset containing 2.5 million views in more than 1500 scans, annotated with 3D camera poses, surface reconstructions, and instance-level semantic segmentations.
Matterport3D: Learning from RGB-D Data in Indoor Environments (2017) [Link]

10,800 panoramic views (in both RGB and depth) from 194,400 RGB-D images of 90 building-scale scenes of private rooms. Instance-level semantic segmentations are provided for region (living room, kitchen) and object (sofa, TV) categories.
SUNCG: A Large 3D Model Repository for Indoor Scenes (2017) [Link]

The dataset contains over 45K different scenes with manually created realistic room and furniture layouts. All of the scenes are semantically annotated at the object level.
MINOS: Multimodal Indoor Simulator (2017) [Link]
MINOS is a simulator designed to support the development of multisensory models for goal-directed navigation in complex indoor environments. MINOS leverages large datasets of complex 3D environments and supports flexible configuration of multimodal sensor suites. MINOS supports SUNCG and Matterport3D scenes.
Facebook House3D: A Rich and Realistic 3D Environment (2017) [Link]

House3D is a virtual 3D environment which consists of 45K indoor scenes equipped with a diverse set of scene types, layouts and objects sourced from the SUNCG dataset. All 3D objects are fully annotated with category labels. Agents in the environment have access to observations of multiple modalities, including RGB images, depth, segmentation masks and top-down 2D map views.
HoME: a Household Multimodal Environment (2017) [Link]

HoME integrates over 45,000 diverse 3D house layouts based on the SUNCG dataset, a scale which may facilitate learning, generalization, and transfer. HoME is an open-source, OpenAI Gym-compatible platform extensible to tasks in reinforcement learning, language grounding, sound-based navigation, robotics, multi-agent learning.
AI2-THOR: Photorealistic Interactive Environments for AI Agents [Link]

AI2-THOR is a photo-realistic interactable framework for AI agents. There are a total 120 scenes in version 1.0 of the THOR environment covering four different room categories: kitchens, living rooms, bedrooms, and bathrooms. Each room has a number of actionable objects.

弱监督分割 && 交互式分割 && 可转换语义分割

arxiv-2018 WebSeg: Learning Semantic Segmentation from Web Searches [Paper]
Weakly Supervised Object Localization Using Things and Stuff Transfer [Paper]
Semi and Weakly Supervised Semantic Segmentation Using Generative Adversarial Network [Paper]
Weakly- and Semi-Supervised Learning of a Deep Convolutional Network for Semantic Image Segmentation [Paper]

Weakly Supervised Structured Output Learning for Semantic Segmentation [Paper]
ICCV-2011 Weakly supervised semantic segmentation with a multi-image model [Paper]
ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016[Paper]
Constrained convolutional neural networks for weakly supervised segmentation. Proceedings of the IEEE International Conference on Computer Vision. 2015.[Paper]
Weakly-and semi-supervised learning of a DCNN for semantic image segmentation. arXiv preprint arXiv:1502.02734 (2015).[Paper]
Learning to segment under various forms of weak supervision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015.[Paper]
STC: A Simple to Complex Framework for Weakly-supervised Semantic Segmentation 2017 TPAMI [Paper] [Project]
[Paper]
CVPR-2017-Simple Does It: Weakly Supervised Instance and Semantic Segmentation [Paper] [tensorflow]
CVPR-2017-Weakly Supervised Semantic Segmentation using Web-Crawled Videos [Paper]
AAAI-2017-Weakly Supervised Semantic Segmentation Using Superpixel Pooling Network [Paper]
ICCV-2015-Weakly supervised graph based semantic segmentation by learning communities of image-parts [Paper]
Towards Weakly Supervised Semantic Segmentation by Means of Multiple Instance and Multitask Learning [Paper]
Weakly-Supervised Semantic Segmentation using Motion Cues [Paper] [Project]
Weakly Supervised Semantic Segmentation Based on Web Image Co-segmentation [Paper]
Learning to Rene Object Segments [Paper]
Weakly-Supervised Dual Clustering for Image Semantic Segmentation [Paper]
Interactive Video Object Segmentation in the Wild [Paper]

视频语义分割

CVPR-2017 Video Object Segmentation Without Temporal Information One-Shot Video Object Segmentation [Project]
Feature Space Optimization for Semantic Video Segmentation[Paper][Slides]
The Basics of Video Object Segmentation [Blog]
ICCV2017----SegFlow_Joint Learning for Video Object Segmentation and Optical Flow
OSVOS:One-Shot Video Object Segmentation
Surveillance Video Parsing with Single Frame Supervision
The 2017 DAVIS Challenge on Video Object Segmentation
Video Propagation Networks
OnAVOS: Online Adaptation of Convolutional Neural Networks for Video Object Segmentation. P. Voigtlaender, B. Leibe, BMVC 2017. [Project Page] [Precomputed results]
MSK: Learning Video Object Segmentation from Static Images. F. Perazzi*, A. Khoreva*, R. Benenson, B. Schiele, A. Sorkine-Hornung, CVPR 2017. [Project Page] [Precomputed results]
SFL: SegFlow: Joint Learning for Video Object Segmentation and Optical Flow. J. Cheng, Y.-H. Tsai, S. Wang, M.-H. Yang, ICCV 2017. [Project Page] [Precomputed results]
CTN: Online Video Object Segmentation via Convolutional Trident Network. W.-D. Jang, C.-S. Kim, CVPR 2017. [Project Page] [Precomputed results]
VPN: Video Propagation Networks. V. Jampani, R. Gadde, P. V. Gehler, CVPR 2017. [Project Page] [Precomputed results]
PLM: Pixel-level Matching for Video Object Segmentation using Convolutional Neural Networks. J. Shin Yoon, F. Rameau, J. Kim, S. Lee, S. Shin, I. So Kweon, ICCV 2017. [Project Page] [Precomputed results]
OFL: Video Segmentation via Object Flow. Y.-H. Tsai, M.-H. Yang, M. Black, CVPR 2016. [Project Page] [Precomputed results]
BVS: Bilateral Space Video Segmentation. N. Marki, F. Perazzi, O. Wang, A. Sorkine-Hornung, CVPR 2016. [Project Page] [Precomputed results]
FCP: Fully Connected Object Proposals for Video Segmentation. F. Perazzi, O. Wang, M. Gross, A. Sorkine-Hornung, ICCV 2015. [Project Page] [Precomputed results]
JMP: JumpCut: Non-Successive Mask Transfer and Interpolation for Video Cutout. Q. Fan, F. Zhong, D. Lischinski, D. Cohen-Or, B. Chen, SIGGRAPH 2015. [Project Page] [Precomputed results]
HVS: Efficient hierarchical graph-based video segmentation. M. Grundmann, V. Kwatra, M. Han, I. A. Essa, CVPR 2010. [Project Page] [Precomputed results]
SEA: SeamSeg: Video Object Segmentation Using Patch Seams. S. Avinash Ramakanth, R. Venkatesh Babu, CVPR 2014. [Project Page] [Precomputed results]
ARP: Primary Object Segmentation in Videos Based on Region Augmentation and Reduction. Y.J. Koh, C.-S. Kim, CVPR 2017. [Project Page] [Precomputed results]
LVO: Learning Video Object Segmentation with Visual Memory. P. Tokmakov, K. Alahari, C. Schmid, ICCV 2017. [Project Page] [Precomputed results]
FSEG: FusionSeg: Learning to combine motion and appearance for fully automatic segmentation of generic objects in videos. S. Jain, B. Xiong, K. Grauman, CVPR 2017. [Project Page] [Precomputed results]
LMP: Learning Motion Patterns in Videos. P. Tokmakov, K. Alahari, C. Schmid, CVPR 2017. [Project Page] [Precomputed results]
SFL: SegFlow: Joint Learning for Video Object Segmentation and Optical Flow. J. Cheng, Y.-H. Tsai, S. Wang, M.-H. Yang, ICCV 2017. [Project Page] [Precomputed results]
FST: Fast Object Segmentation in Unconstrained Video. A. Papazoglou, V. Ferrari, ICCV 2013. [Project Page] [Precomputed results]
CUT: Motion Trajectory Segmentation via Minimum Cost Multicuts. M. Keuper, B. Andres, T. Brox, ICCV 2015. [Project Page] [Precomputed results]
NLC: Video Segmentation by Non-Local Consensus voting. A. Faktor, M. Irani, BMVC 2014. [Project Page] [Precomputed results]
MSG: Object segmentation in video: A hierarchical variational approach for turning point trajectories into dense regions. P. Ochs, T. Brox, ICCV 2011. [Project Page] [Precomputed results]
KEY: Key-segments for video object segmentation. Y. Lee, J. Kim, K. Grauman, ICCV 2011. [Project Page] [Precomputed results]
CVOS: Causal Video Object Segmentation from Persistence of Occlusions. B. Taylor, V. Karasev, S. Soatto, CVPR 2015. [Project Page] [Precomputed results]
TRC: Video segmentation by tracing discontinuities in a trajectory embedding. K. Fragkiadaki, G. Zhang, J. Shi, CVPR 2012. [Project Page] [Precomputed results]
Instance Embedding Transfer to Unsupervised Video Object Segmentation [Paper]

Result of DAVIS-Challenge 2017
Benchmark
2016----A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation
2016----Clockwork Convnets for Video Semantic Segmentation
2016----MaskTrack ----Learning Video Object Segmentation from Static Images
2017----DAVIS-Challenge-1st----Video Object Segmentation with Re-identification
2017----DAVIS-Challenge-2nd----Lucid Data Dreaming for Multiple Object Tracking
2017----DAVIS-Challenge-3rd----Instance Re-Identification Flow for Video Object Segmentation
2017----DAVIS-Challenge-4th----Multiple-Instance Video Segmentation with Sequence-Specific Object Proposals
2017----DAVIS-Challenge-5th Online Adaptation of Convolutional Neural Networks for the 2017 DAVIS Challenge on Video Object Segmentation
2017----DAVIS-Challenge-6th ----Learning to Segment Instances in Videos with Spatial Propagation Network
2017----DAVIS-Challenge-7th----Some Promising Ideas about Multi-instance Video Segmentation
2017----DAVIS-Challenge-8th----One-Shot Video Object Segmentation with Iterative Online Fine-Tuning
2017----DAVIS-Challenge-9th----Video Object Segmentation using Tracked Object Proposals

多任务学习

论文:

Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics [Paper]
Multi-task Learning using Multi-modal Encoder-Decoder Networks with Shared Skip Connections [Paper]

道路分割 && 实时分割

论文:

Deep Semantic Segmentation for Automated Driving: Taxonomy, Roadmap and Challenges [Paper]
2018-arxiv Real-time Semantic Segmentation Comparative Study[Paper][Code]
MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving [Paper]
self-driving-car-road-segmentation [Link]
Efficient Deep Models for Monocular Road Segmentation[Paper]
Semantic Road Segmentation via Multi-scale Ensembles of Learned Features [Paper]
Distantly Supervised Road Segmentation [Paper]
Deep Fully Convolutional Networks with Random Data Augmentation for Enhanced Generalization in Road Detection [Paper]
ICCV-2017 Real-time category-based and general obstacle detection for autonomous driving [Paper]
ICCV-2017 FoveaNet: Perspective-aware Urban Scene Parsing [Paper]
CVPR-2017 UberNet: Training a universal convolutional neural network for low-, mid-, and high-level vision using diverse datasets and limited memory [Paper]

LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation [Paper]
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation-2016 [Code-Caffe1][Code-Caffe2] [Paper] [Blog]
Efficient Deep Models for Monocular Road Segmentation[Paper]
Real-Time Coarse-to-fine Topologically Preserving Segmentation[Paper]
ICNet for Real-Time Semantic Segmentation on High-Resolution Images [Paper]
Efficient and robust deep networks for semantic segmentation [Paper]
NIPSW-2017 Speeding up semantic segmentation for autonomous driving [Paper]

ECCV-2012 Road Scene Segmentation from a Single Image [Paper]

代码

https://github.com/MarvinTeichmann/MultiNet
https://github.com/MarvinTeichmann/KittiSeg
https://github.com/vxy10/p5_VehicleDetection_Unet [Keras]
https://github.com/ndrplz/self-driving-car
https://github.com/mvirgo/MLND-Capstone
https://github.com/zhujun98/semantic_segmentation/tree/master/fcn8s_road

医学图像语义分割

论文

Arxiv-2018 Deep learning and its application to medical image segmentation [Paper]

Deep neural networks segment neuronal membranes in electron microscopy images
Semantic Image Segmentation with Deep Learning [Paper]
Automatic Liver and Tumor Segmentation of CT and MRI Volumes Using Cascaded Fully Convolutional Neural Networks [Paper]
DeepNAT: Deep Convolutional Neural Network for Segmenting Neuroanatomy [Paper]
CNN-based Segmentation of Medical Imaging Data [Paper]
Deep Retinal Image Understanding [Paper]
Model-based segmentation of vertebral bodies from MR images with 3D CNNs
Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation
U-net: Convolutional networks for biomedical image segmentation
3D U-Net: Learning dense volumetric segmentation from sparse annotation.
V-Net: Fully convolutional neural networks for volumetric medical image segmentation.arXiv:1606.04797
The importance of skip connections in biomedical image segmentation Spatial clockwork recurrent neural network for muscle perimysium segmentation
NPIS-2015 Parallel multi-dimensional LSTM, with application to fast biomedical volumetric image segmentation
Multi-dimensional gated recurrent units for the segmentation of biomedical 3D-data
Combining fully convolutional and recurrent neural networks for 3D biomedical image segmentation
Recurrent fully convolutional neural networks for multi-slice MRI cardiac segmentation. arXiv:1608.03974
Automatic detection and classification of colorectal polyps by transferring low-level CNN features from nonmedical domain
Deep learning for multi-task medical image segmentation in multiple modalities
Sub-cortical brain structure segmentation using F-CNNs
Segmentation label propagation using deep convolutional neural networks and dense conditional random field
Fast fully automatic segmentation of the human placenta from motion corrupted MRI
Automatic detection of cerebral microbleeds from MR images via 3D convolutional neural networks
Non-uniform patch sampling with deep convolutional neural networks for white matter hyperintensity segmentation
A unified framework for automatic wound segmentation and analysis with deep convolutional neural networks
Deep 3D convolutional encoder networks with shortcuts for multiscale feature integration applied to Multiple Sclerosis lesion segmentation
Brain tumor segmentation using convolutional neural networks in MRI images
Deep feature learning for knee cartilage segmentation using a triplanar convolutional neural network
Automatic Coronary Calcium Scoring in Cardiac CT Angiography Using Convolutional Neural Networks [Paper]
Improving computer-aided detection using convolutional neural networks and random view aggregation [Paper]
Pulmonary nodule detection in CT images: false positive reduction using multi-view convolutional networks [Paper]

代码

零件语义分割

Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing-2017 [Project] [Code-Caffe] [Paper]
Deep Learning for Human Part Discovery in Images-2016 [Code-Chainer] [Paper]
A CNN Cascade for Landmark Guided Semantic Part Segmentation-2016 [Project] [Paper]
Deep Learning for Semantic Part Segmentation With High-level Guidance-2015 [Paper]
Neural Activation Constellations-Unsupervised Part Model Discovery with Convolutional Networks-2015 [Paper]
Human Parsing with Contextualized Convolutional Neural Network-2015 [Paper]
Part detector discovery in deep convolutional neural networks-2014 [Code] [Paper]
Hypercolumns for object segmentation and fine-grained localization [Paper]

服装解析

Looking at Outfit to Parse Clothing-2017 [Paper]
Semantic Object Parsing with Local-Global Long Short-Term Memory-2015 [Paper]
-A High Performance CRF Model for Clothes Parsing-2014 [Project] [Code] [Dataset] [Paper]
Clothing co-parsing by joint image segmentation and labeling-2013 [Project] [Dataset] [Paper]
Parsing clothing in fashion photographs-2012 [Project] [Paper]

流行的方法和实现

U-Net [Paper] [Pytorch]
SegNet [Paper] [Caffe]
DeepLab [Paper] [Caffe]
FCN [Paper][tensorflow]
ENet [Paper] [Caffe]
LinkNet [Paper] [Torch]
DenseNet [Paper]
Tiramisu [Paper]
DilatedNet [Paper]
PixelNet [Paper] [Caffe]
ICNet [Paper] [Caffe]
ERFNet [Paper] [Torch]
RefineNet [Paper] [tensorflow]
PSPNet [Paper] [Caffe]
Dilated convolution [Paper] [Caffe]
DeconvNet [Paper] [Caffe]
FRRN [Paper] [Lasagne]
GCN [Paper] [PyTorch]
LRR [Paper] [Matconvnet]
DUC, HDC [Paper] [PyTorch]
MultiNet [Paper] tensorflow1tensorflow2
Segaware [Paper] [Caffe]
Semantic Segmentation using Adversarial Networks [Paper] [Chainer]
In-Place Activated BatchNorm:obtain #1 positions [Paper] [Pytorch]

标注工具:

https://github.com/AKSHAYUBHAT/ImageSegmentation
https://github.com/kyamagu/js-segment-annotator
https://github.com/CSAILVision/LabelMeAnnotationTool
https://github.com/seanbell/opensurfaces-segmentation-ui
https://github.com/lzx1413/labelImgPlus
https://github.com/wkentaro/labelme

杰出的研究人员和团队:

Liang-Chieh (Jay) Chen Deeplab-Google
Jianping Shi PSPNet
Kaiming He Mask-RCNN
Ming-Ming Cheng
Joachim M. Buhmann
Jifeng Dai FCIS-MSRA
Alex Kendall SegNet

结果:

MSRC-21
Cityscapes
VOC2012

参考

https://github.com/nightrome/really-awesome-semantic-segmentation

https://github.com/mrgloom/awesome-semantic-segmentation

你可能感兴趣的:(论文笔记)

论文笔记—NDT-Transformer: Large-Scale 3D Point Cloud Localization using the Normal Distribution Transfor 入门打工人笔记 slam 定位算法
论文笔记—NDT-Transformer:Large-Scale3DPointCloudLocalizationusingtheNormalDistributionTransformRepresentation文章摘要~~~~~~~在GPS挑战的环境中，自动驾驶对基于3D点云的地点识别有很高的要求，并且是基于激光雷达的SLAM系统的重要组成部分（即闭环检测）。本文提出了一种名为NDT-Transf
[论文笔记]Circle Loss: A Unified Perspective of Pair Similarity Optimization 愤怒的可乐 #文本匹配[论文]论文翻译/笔记自然语言处理论文阅读人工智能
引言为了理解CoSENT的loss，今天来读一下CircleLoss:AUnifiedPerspectiveofPairSimilarityOptimization。为了简单，下文中以翻译的口吻记录，比如替换"作者"为"我们"。这篇论文从对深度特征学习的成对相似度优化角度出发，旨在最大化同类之间的相似度sps_ps
【论文笔记】Multi-Task Learning as a Bargaining Game xhyu61 机器学习学习笔记论文笔记论文阅读人工智能深度学习
Abstract本文将多任务学习中的梯度组合步骤视为一种讨价还价式博弈(bargaininggame)，通过游戏，各个任务协商出共识梯度更新方向。在一定条件下，这种问题具有唯一解(NashBargainingSolution)，可以作为多任务学习中的一种原则方法。本文提出Nash-MTL，推导了其收敛性的理论保证。1Introduction大部分MTL优化算法遵循一个通用方案。计算所有任务的梯度g
[论文笔记] LLaVA 心心喵论文笔记论文阅读
一、LLaVA论文中的主要工作和实验结果ExistingGap:之前的大部分工作都在做模态对齐，做图片的representationlearning，而没有针对ChatBot（多轮对话，指令理解）这种场景优化。Contribution:这篇工作已经在BLIP-2之后了，所以Image的理解能力不是LLaVA希望提升的重点，LLaVA是想提升多模态模型的Instruction-Followingab
[论文笔记] LLM模型剪枝心心喵论文笔记论文阅读剪枝算法
AttentionIsAllYouNeedButYouDon’tNeedAllOfItForInferenceofLargeLanguageModelsLLaMA2在剪枝时，跳过ffn和跳过fulllayer的效果差不多。相比跳过ffn/fulllayer，跳过attentionlayer的影响会更小。跳过attentionlayer：7B/13B从100%参数剪枝到66%，平均指标只下降1.7～
【论文笔记】Training language models to follow instructions with human feedback B部分 Ctrl+Alt+L 大模型论文整理论文笔记论文阅读语言模型人工智能自然语言处理
TraininglanguagemodelstofollowinstructionswithhumanfeedbackB部分回顾一下第一代GPT-1：设计思路是“海量无标记文本进行无监督预训练+少量有标签文本有监督微调”范式；模型架构是基于Transformer的叠加解码器（掩码自注意力机制、残差、Layernorm）；下游各种具体任务的适应是通过在模型架构的输出后增加线性权重WyW_{y}Wy实
【论文笔记】：LAYN：用于小目标检测的轻量级多尺度注意力YOLOv8网络 hhhhhhkkkyyy 论文阅读目标检测 YOLO
背景针对嵌入式设备对目标检测算法的需求，大多数主流目标检测框架目前缺乏针对小目标的具体改进，然后提出的一种轻量级多尺度注意力YOLOv8小目标检测算法。小目标检测精度低的原因随着网络在训练过程中的加深，检测到的目标容易丢失边缘信息和灰度信息等。获得高级语义信息也较少，图像中可能存在一些噪声信息，误导训练网络学习不正确的特征。映射到原始图像的感受野的大小。当感受野相对较小时，空间结构特征保留较多，但
激光SLAM--(8) LeGO-LOAM论文笔记 lonely-stone slam 激光SLAM 论文阅读
论文标题：LeGO-LOAM：LightweightandGround-OptimizedLidarOdometryandMappingonVariableTerrain应用在可变地形场景的轻量级的、并利用地面优化的LOAMABSTRACT轻量级的、基于地面优化的LOAM实时进行六自由度位姿估计，应用在地面的车辆上。强调应用在地面车辆上是因为在这里面要求雷达必须水平安装，而像LOAM和LIO-SA
论文浅尝 - AAAI2020 | 迈向建立多语言义元知识库：用于 BabelNet Synsets 义元预测... 开放知识图谱机器学习人工智能知识图谱自然语言处理深度学习
论文笔记整理：潘锐，天津大学硕士。来源：AAAI2020链接：https://arxiv.org/pdf/1912.01795.pdf摘要义原被定义为人类语言的最小语义单位。义原知识库（KBs）是一种包含义原标注词汇的知识库，它已成功地应用于许多自然语言处理任务中。然而，现有的义原知识库建立在少数几种语言上，阻碍了它们的广泛应用。为此论文提出在多语种百科全书词典BabelNet的基础上建立一个统一
[论文笔记] LLM数据集——LongData-Corpus 心心喵论文笔记服务器 ubuntu linux
https://huggingface.co/datasets/yuyijiong/LongData-Corpus1、hf的数据在开发机上要设置sshkey，然后cat复制之后在设置在hf上2、中文小说数据在云盘上清华大学云盘下载：#!/bin/bash#BaseURLbase_url="https://cloud.tsinghua.edu.cn/d/0670fcb14d294c97b5cf/fi
[论文笔记] eval-big-refactor lm_eval 每两个任务使用一个gpu，并保证端口未被使用心心喵论文笔记 restful 后端
1.5B在eval时候两个任务一个gpu是可以的。7B+在evalbelebele时会OOM，所以分配时脚本不同。eval_fast.py：importsubprocessimportargparseimportosimportsocket#参数列表task_name_list=["flores_mt_en_to_id","flores_mt_en_to_vi","flores_mt_en_to_
【论文笔记】Separating the “Chirp” from the “Chat”: Self-supervised Visual Grounding of Sound and Language xhyu61 机器学习学习笔记论文笔记论文阅读
Abstract提出了DenseAV，一种新颖的双编码器接地架构，仅通过观看视频学习高分辨率、语义有意义和视听对齐的特征。在没有明确的本地化监督的情况下，DenseAV可以发现单词的"意义"和声音的"位置"。此外，它在没有监督的情况下自动发现并区分这两种类型的关联。DenseAV的定位能力源于一种新的多头特征聚合算子，该算子直接比较稠密的图像和音频表示进行对比学习。相比之下，许多其他学习"全局"音
图形学论文笔记 Jozky86 图形学图形学笔记
文章目录PBD：XPBD：shapematchingPBD：【深入浅出NvidiaFleX】(1)PositionBasedDynamics最简化的PBD(基于位置的动力学)算法详解-论文原理讲解和太极代码最简化的PBD(基于位置的动力学)算法详解-论文原理讲解和太极代码XPBD：基于XPBD的物理模拟一条龙：公式推导+代码+文字讲解（纯自制）【论文精读】XPBD基于位置的动力学XPBD论文解读(
【视觉三维重建】【论文笔记】Deblurring 3D Gaussian Splatting CS_Zero 论文阅读
去模糊的3D高斯泼溅，看Demo比3D高斯更加精细，对场景物体细节的还原度更高，[官网]（https://benhenryl.github.io/Deblurring-3D-Gaussian-Splatting/）背景技术Volumetricrendering-basednerualfields：NeRF.Rasterizationrendering:3D-GS.Rasterization比vol
[论文笔记] Transformer-XL 心心喵论文笔记 transformer 深度学习人工智能
这篇论文提出的Transformer-XL主要是针对Transformer在解决长依赖问题中受到固定长度上下文的限制，如Bert采用的Transformer最大上下文为512（其中是因为计算资源的限制，不是因为位置编码，因为使用的是绝对位置编码正余弦编码）。Transformer-XL能学习超过固定长度的依赖性，而不破坏时间一致性。它由段级递归机制和一种新的位置编码方案组成。该方法不仅能够捕获长期
SimpleShot: Revisiting Nearest-Neighbor Classification for Few-Shot Learning 论文笔记头柱碳只狼小样本学习
前言目前大多数小样本学习器首先使用一个卷积网络提取图像特征，然后将元学习方法与最近邻分类器结合起来，以进行图像识别。本文探讨了这样一种可能性，即在不使用元学习方法，而仅使用最近邻分类器的情况下，能否很好地处理小样本学习问题。本文发现，对图像特征进行简单的特征转换，然后再进行最近邻分类，也可以产生很好的小样本学习结果。比如，使用DenseNet特征的最近邻分类器，在结合均值相减（meansubtra
多模态相关论文笔记靖待大模型人工智能论文阅读
(cilp)LearningTransferableVisualModelsFromNaturalLanguageSupervision从自然语言监督中学习可迁移的视觉模型openAI2021年2月48页PDFCODECLIP(ContrastiveLanguage-ImagePre-Training)对比语言图像预训练模型引言它比ImageNet模型效果更好，计算效率更高。尤其是zero-sho
【论文笔记 · PFM】Lag-Llama: Towards Foundation Models for Time Series Forecasting lokol. 论文笔记论文阅读 llama
Lag-Llama:TowardsFoundationModelsforTimeSeriesForecasting摘要本文提出Lag-Llama，在大量时间序列数据上训练的通用单变量概率时间序列预测模型。模型在分布外泛化能力上取得较好效果。模型使用平滑破坏幂律（smoothlybrokenpower-laws）。介绍目前任务主要集中于在相同域的数据上训练模型。当前已有的大规模通用模型在大规模不同数
【论文笔记】Unsupervised Learning of Video Representations using LSTMs 奶茶不加糖え lstm 深度学习自然语言处理
摘要翻译我们使用长短时记忆（LongShortTermMemory,LSTM）网络来学习视频序列的表征。我们的模型使用LSTM编码器将输入序列映射到一个固定长度的表征向量。之后我们用一个或多个LSTM解码器解码这个表征向量来实现不同的任务，比如重建输入序列、预测未来序列。我们对两种输入序列——原始的图像小块和预训练卷积网络提取的高层表征向量——都做了实验。我们探索不同的设计选择，例如解码器的LST
MOSSE算法论文笔记以及代码解释 five days 计算机视觉深度学习机器学习
论文《VisualObjectTrackingusingAdaptiveCorrelationFilters》代码github1.论文idea提出以滤波器求相关的形式，找到最大响应处的位置，也就是我们所跟踪的目标的中心，进而不断的更新跟踪目标框和滤波器。2.跟踪策略如图，根据初始帧圈出的目标框训练滤波器，最大响应处为目标框的中心点，当移动到下一帧时，根据滤波器求相关的算法获得最大响应值，进而得出下
Attention Is All Your Need论文笔记 xiaoyan_lu 论文笔记论文阅读
论文解决了什么问题？提出了一个新的简单网络架构——transformer，仅仅是基于注意力机制，完全免去递推和卷积，使得神经网络训练地速度极大地提高。Weproposeanewsimplenetworkarchitecture,theTransformer,basedsolelyonattentionmechanisms,dispensingwithrecurrenceandconvolution
论文笔记：相似感知的多模态假新闻检测图学习的小张论文笔记论文阅读 python
整理了RecSys2020ProgressiveLayeredExtraction:ANovelMulti-TaskLearningModelforPersonalizedRecommendations）论文的阅读笔记背景模型实验论文地址：SAFE背景在此之前，对利用新闻文章中文本信息和视觉信息之间的关系(相似性)的关注较少。这种相似性有助于识别虚假新闻，例如，虚假新闻也许会试图使用不相关的图
[论文总结] 深度学习在农业领域应用论文笔记12 落痕的寒假论文总结深度学习论文阅读人工智能
文章目录1.3D-ZeF:A3DZebrafishTrackingBenchmarkDataset(CVPR,2020)摘要背景相关研究所提出的数据集方法和结果个人总结2.Automatedflowerclassificationoveralargenumberofclasses(ComputerVision,Graphics&ImageProcessing,2008)摘要背景分割与分类数据集和实
论文笔记之LINE:Large-scale Information Network Embedding 小弦弦喵喵喵
原文：LINE:Large-scaleInformationNetworkEmbedding本文提出一种新的networkembeddingmodel：LINE.能够处理大规模的各式各样的网络，比如：有向图、无向图、有权重图、无权重图.文中指出对于networkembedding问题，需要保留localstructure和globalstructure，分别对应first-orderproximi
打败一切NeRF！ 3D Gaussian Splatting 的简单入门知识 Ci_ci 17 3d python
新手的论文笔记3DGaussianSplatting的笔记introductionRelatedwork预备知识Gaussiansplatting3D高斯泼溅原理Overview3DGaussianSplatting的笔记每次都是在csdn上找救命稻草，这是第一次在csdn上发东西。确实是个不错的笔记网站，还能同步，保存哈哈哈。印象笔记，Onenote逊爆了。研一刚开学两个月，导师放养，给的方向还
《Residual Bi-Fusion Feature Pyramid Network for Accurate Single-shot Object Detection》论文笔记 m_buddy #General Object Detection Bi-Fusion
参考代码：无1.概述导读：在检测任务中一般会引入FPN增强在不同尺度下网络的检测性能，但是只通过top-down的FPN网络是很难去重建由于特征图的漂移（水平或是垂直方向运动）在经过pooling操作（pooling不具有平移不变性）带来结果相差很大的问题（特别针对小目标），而且FPN带来的性能提升会在使用较多卷积层之后逐渐被稀释（卷积的平移不变形），进而会导致一些小目标定位性能降低。对此可以通过
论文笔记-Generative Adversarial Nets 升不上三段的大鱼
论文链接：https://papers.nips.cc/paper/2014/file/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf论文解读：https://www.bilibili.com/video/BV1rb4y187vD?share_source=copy_web一句话总结：提出了生成模型框架GAN，包括一个生成模型G和一个判别模型D，用有监督的损失
论文笔记：NIPS 2020 Graph Contrastive Learning with Augmentations 饮冰l 图弱监督数据挖掘机器学习神经网络深度学习
前言本文主要提出在图对比学习大框架下的图数据增强的若干方法。概括来说，本文提出了一种图对比学习框架来无监督的完成图表示学习，首先作者提出了基于各种先验信息的四种图数据增强方法。然后，作者分析了在四种不同的图数据增强条件下，不同组合对多个数据集的影响:半监督、无监督、迁移学习以及对抗性攻击。作者为GNN的预训练提出了基于图数据增强的对比学习框架来解决图中数据异质性的挑战，本文的主要贡献如下：作者提出
论文笔记-vChain: Enabling Verifiable Boolean Range Queries over Blockchain Databases qq_40431700 笔记区块链
核心方法：提出了一种基于累加器的可认证数据结构，可以动态聚合任意查询属性提出块内和块间索引，聚合块内和块间数据，可以做高效查询验证倒排前缀树结构，加速同时处理大量数据的订阅查询提出问题：1.range查询2.布尔查询3.没有可靠第三方、而且不能保证查询的完整性图中元素有：①全节点②矿工节点：是全节点，而且负责构建共识证明，比如计算nonce③轻节点：存nonce、区块的哈希，不存数据记录提出的Vc
论文笔记--Improving Language Understanding by Generative Pre-Training Isawany 论文阅读论文阅读自然语言处理 chatgpt 语言模型 nlp
论文笔记GPT1--ImprovingLanguageUnderstandingbyGenerativePre-Training1.文章简介2.文章导读2.1概括2.2文章重点技术2.2.1无监督预训练2.2.2有监督微调2.2.3不同微调任务的输入3.Bert&GPT4.文章亮点5.原文传送门6.References1.文章简介标题：ImprovingLanguageUnderstandingb
矩阵求逆（JAVA）利用伴随矩阵 qiuwanchi 利用伴随矩阵求逆矩阵
package gaodai.matrix; import gaodai.determinant.DeterminantCalculation; import java.util.ArrayList; import java.util.List; import java.util.Scanner; /** * 矩阵求逆(利用伴随矩阵) * @author 邱万迟
单例（Singleton）模式 aoyouzi 单例模式 Singleton
3.1 概述如果要保证系统里一个类最多只能存在一个实例时，我们就需要单例模式。这种情况在我们应用中经常碰到，例如缓存池，数据库连接池，线程池，一些应用服务实例等。在多线程环境中，为了保证实例的唯一性其实并不简单，这章将和读者一起探讨如何实现单例模式。 3.2
[开源与自主研发]就算可以轻易获得外部技术支持,自己也必须研发 comsci 开源
现在国内有大量的信息技术产品，都是通过盗版，免费下载，开源，附送等方式从国外的开发者那里获得的。。。。。。虽然这种情况带来了国内信息产业的短暂繁荣，也促进了电子商务和互联网产业的快速发展，但是实际上，我们应该清醒的看到，这些产业的核心力量是被国外的
页面有两个frame,怎样点击一个的链接改变另一个的内容 Array_06 UI XHTML
<a src="地址" targets="这里写你要操作的Frame的名字" />搜索然后你点击连接以后你的新页面就会显示在你设置的Frame名字的框那里 targerts="",就是你要填写目标的显示页面位置 ===================== 例如： <frame src=&
Struts2实现单个/多个文件上传和下载 oloz 文件上传 struts
struts2单文件上传：步骤01:jsp页面  　　<form action="fileUplo
推荐10个在线logo设计网站 362217990 logo
在线设计Logo网站。 1、http://flickr.nosv.org（这个太简单） 2、http://www.logomaker.com/?source=1.5770.1 3、http://www.simwebsol.com/ImageTool 4、http://www.logogenerator.com/logo.php?nal=1&tpl_catlist[]=2 5、ht
jsp上传文件香水浓 jsp fileupload
1. jsp上传 Notice： 1. form表单 method 属性必须设置为 POST 方法，不能使用 GET 方法 2. form表单 enctype 属性需要设置为 multipart/form-data 3. form表单 action 属性需要设置为提交到后台处理文件上传的jsp文件地址或者servlet地址。例如 uploadFile.jsp 程序文件用来处理上传的文
我的架构经验系列文章 - 前端架构 agevs JavaScript Web 框架 UI jQuer
框架层面：近几年前端发展很快，前端之所以叫前端因为前端是已经可以独立成为一种职业了，js也不再是十年前的玩具了，以前富客户端RIA的应用可能会用flash/flex或是silverlight，现在可以使用js来完成大部分的功能，因此js作为一门前端的支撑语言也不仅仅是进行的简单的编码，越来越多框架性的东西出现了。越来越多的开发模式转变为后端只是吐json的数据源，而前端做所有UI的事情。MVCMV
android ksoap2 中把XML(DataSet) 当做参数传递 aijuans android
我的android app中需要发送webservice ，于是我使用了 ksop2 进行发送，在测试过程中不是很顺利,不能正常工作.我的web service 请求格式如下 [html] view plain copy <Envelope xmlns="http://schemas.
使用Spring进行统一日志管理 + 统一异常管理 baalwolf spring
统一日志和异常管理配置好后，SSH项目中，代码以往散落的log.info() 和 try..catch..finally 再也不见踪影！统一日志异常实现类： [java] view plain copy package com.pilelot.web.util; impor
Android SDK 国内镜像 BigBird2012 android sdk
一、镜像地址： 1、东软信息学院的 Android SDK 镜像，比配置代理下载快多了。配置地址， http://mirrors.neusoft.edu.cn/configurations.we#android 2、北京化工大学的： IPV4:ubuntu.buct.edu.cn IPV4:ubuntu.buct.cn IPV6:ubuntu.buct6.edu.cn
HTML无害化和Sanitize模块 bijian1013 JavaScript AngularJS Linky Sanitize
一.ng-bind-html、ng-bind-html-unsafe AngularJS非常注重安全方面的问题，它会尽一切可能把大多数攻击手段最小化。其中一个攻击手段是向你的web页面里注入不安全的HTML，然后利用它触发跨站攻击或者注入攻击。考虑这样一个例子，假设我们有一个变量存
[Maven学习笔记二]Maven命令 bit1129 maven
mvn compile compile编译命令将src/main/java和src/main/resources中的代码和配置文件编译到target/classes中，不会对src/test/java中的测试类进行编译 MVN编译使用 maven-resources-plugin:2.6:resources maven-compiler-plugin:2.5.1:compile &nbs
【Java命令二】jhat bit1129 Java命令
jhat用于分析使用jmap dump的文件，，可以将堆中的对象以html的形式显示出来，包括对象的数量，大小等等，并支持对象查询语言。 jhat默认开启监听端口7000的HTTP服务，jhat是Java Heap Analysis Tool的缩写 1. 用法： [hadoop@hadoop bin]$ jhat -help Usage: jhat [-stack <bool&g
JBoss 5.1.0 GA:Error installing to Instantiated: name=AttachmentStore state=Desc ronin47
进到类似目录 server/default/conf/bootstrap，打开文件 profile.xml找到： Xml代码<bean name="AttachmentStore" class="org.jboss.system.server.profileservice.repository.AbstractAtta
写给初学者的6条网页设计安全配色指南 brotherlamp UI ui自学 ui视频 ui教程 ui资料
网页设计中最基本的原则之一是，不管你花多长时间创造一个华丽的设计，其最终的角色都是这场秀中真正的明星——内容的衬托我仍然清楚地记得我最早的一次美术课，那时我还是一个小小的、对凡事都充满渴望的孩子，我摆放出一大堆漂亮的彩色颜料。我仍然记得当我第一次看到原色与另一种颜色混合变成第二种颜色时的那种兴奋，并且我想，既然两种颜色能创造出一种全新的美丽色彩，那所有颜色
有一个数组，每次从中间随机取一个，然后放回去，当所有的元素都被取过，返回总共的取的次数。写一个函数实现。复杂度是什么。 bylijinnan java 算法面试
import java.util.Random; import java.util.Set; import java.util.TreeSet; /** * http://weibo.com/1915548291/z7HtOF4sx * #面试题#有一个数组，每次从中间随机取一个，然后放回去，当所有的元素都被取过，返回总共的取的次数。 * 写一个函数实现。复杂度是什么
struts2获得request、session、application方式 chiangfai application
1、与Servlet API解耦的访问方式。 a.Struts2对HttpServletRequest、HttpSession、ServletContext进行了封装，构造了三个Map对象来替代这三种对象要获取这三个Map对象，使用ActionContext类。 -----> package pro.action; import java.util.Map; imp
改变python的默认语言设置 chenchao051 python
import sys sys.getdefaultencoding() 可以测试出默认语言，要改变的话，需要在python lib的site-packages文件夹下新建： sitecustomize.py，这个文件比较特殊，会在python启动时来加载，所以就可以在里面写上： import sys sys.setdefaultencoding('utf-8') &n
mysql导入数据load data infile用法 daizj mysql 导入数据
我们常常导入数据！mysql有一个高效导入方法，那就是load data infile 下面来看案例说明基本语法： load data [low_priority] [local] infile 'file_name txt' [replace | ignore] into table tbl_name [fields [terminated by't'] [OPTI
phpexcel导入excel表到数据库简单入门示例 dcj3sjt126com PHP Excel
跟导出相对应的，同一个数据表，也是将phpexcel类放在class目录下，将Excel表格中的内容读取出来放到数据库中 <?php error_reporting(E_ALL); set_time_limit(0); ?> <html> <head> <meta http-equiv="Content-Type"
22岁到72岁的男人对女人的要求 dcj3sjt126com
22岁男人对女人的要求是：一，美丽，二，性感，三，有份具品味的职业，四，极有耐性，善解人意，五，该聪明的时候聪明，六，作小鸟依人状时尽量自然，七，怎样穿都好看，八，懂得适当地撒娇，九，虽作惊喜反应，但看起来自然，十，上了床就是个无条件荡妇。 32岁的男人对女人的要求，略作修定，是：一，入得厨房，进得睡房，二，不必服侍皇太后，三，不介意浪漫蜡烛配盒饭，四，听多过说，五，不再傻笑，六，懂得独
Spring和HIbernate对DDM设计的支持 e200702084 DAO 设计模式 spring Hibernate 领域模型
A：数据访问对象 DAO和资源库在领域驱动设计中都很重要。DAO是关系型数据库和应用之间的契约。它封装了Web应用中的数据库CRUD操作细节。另一方面，资源库是一个独立的抽象，它与DAO进行交互，并提供到领域模型的“业务接口”。资源库使用领域的通用语言，处理所有必要的DAO，并使用领域理解的语言提供对领域模型的数据访问服务。
NoSql 数据库的特性比较 geeksun NoSQL
Redis 是一个开源的使用ANSI C语言编写、支持网络、可基于内存亦可持久化的日志型、Key-Value数据库，并提供多种语言的API。目前由VMware主持开发工作。 1. 数据模型作为Key-value型数据库，Redis也提供了键（Key）和值（Value）的映射关系。除了常规的数值或字符串，Redis的键值还可以是以下形式之一： Lists （列表） Sets
使用 Nginx Upload Module 实现上传文件功能 hongtoushizi nginx
转载自： http://www.tuicool.com/wx/aUrAzm 普通网站在实现文件上传功能的时候，一般是使用Python，Java等后端程序实现，比较麻烦。Nginx有一个Upload模块，可以非常简单的实现文件上传功能。此模块的原理是先把用户上传的文件保存到临时文件，然后在交由后台页面处理，并且把文件的原名，上传后的名称，文件类型，文件大小set到页面。下
spring-boot-web-ui及thymeleaf基本使用 jishiweili spring thymeleaf
视图控制层代码demo如下： @Controller @RequestMapping("/") public class MessageController { private final MessageRepository messageRepository; @Autowired public MessageController(Mes
数据源架构模式之活动记录 home198979 PHP 架构活动记录数据映射
hello!架构一、概念活动记录（Active Record）：一个对象，它包装数据库表或视图中某一行，封装数据库访问，并在这些数据上增加了领域逻辑。对象既有数据又有行为。活动记录使用直截了当的方法，把数据访问逻辑置于领域对象中。二、实现简单活动记录活动记录在php许多框架中都有应用，如cakephp。 <?php /** * 行数据入口类 *
Linux Shell脚本之自动修改IP pda158 linux centos Debian 脚本
作为一名 Linux SA，日常运维中很多地方都会用到脚本，而服务器的ip一般采用静态ip或者MAC绑定，当然后者比较操作起来相对繁琐，而前者我们可以设置主机名、ip信息、网关等配置。修改成特定的主机名在维护和管理方面也比较方便。如下脚本用途为：修改ip和主机名等相关信息，可以根据实际需求修改，举一反三！ #!/bin/sh #auto Change ip netmask ga
开发环境搭建独浮云 eclipse jdk tomcat
最近在开发过程中，经常出现MyEclipse内存溢出等错误，需要重启的情况，好麻烦。对于一般的JAVA+TOMCAT项目开发，其实没有必要使用重量级的MyEclipse，使用eclipse就足够了。尤其是开发机器硬件配置一般的人。 &n