语义分割资源列表
https://github.com/mrgloom/awesome-semantic-segmentation
https://competitions.codalab.org/competitions/21120#participate
官方baseline: https://github.com/pubgeo/dfc2019
官方baseline使用基于resnet34的Unet,Unet是网上一个开源的Keras Segment库,官方baseline包含了训练好的权重。
官方baseline使用到以下开源库:
一个数据增强库:albumentations
一个图像分割算法库:segmentation_models
SpaceNet 数据集 https://spacenetchallenge.github.io/
8波段图像,0.3m分辨率,有全色图和多光谱图,以及融合后的多波段图像
标注:建筑物区块
├── AOI_4_Shanghai_Train.tar.gz
│ ├── geojson
│ │ └── buildings # Contains GeoJson labels of buildings for each tile
│ ├── MUL # Contains Tiles of 8-Band Multi-Spectral raster data from WorldView-3
│ ├── MUL-PanSharpen # Contains Tiles of 8-Band Multi-Spectral raster data pansharpened to 0.3m
│ ├── PAN # Contains Tiles of Panchromatic raster data from Worldview-3
│ ├── RGB-PanSharpen # Contains Tiles of RGB raster data from Worldview-3
│ └── summaryData # Contains CSV with pixel based labels for each building in the Tile Set.
Kaggle竞赛
https://www.kaggle.com/c/dstl-satellite-imagery-feature-detection/data
Sensor : WorldView 3
Wavebands :
Sensor Resolution (GSD) at Nadir :
Dynamic Range
分类:
sample_submission.csv - a sample submission file in the correct format
数据集:RGB 语义分割 6+1种类别
Data
Label
Each satellite image is paired with a mask image for land cover annotation. The mask is a RGB image with 7 classes of labels, using color-coding (R, G, B) as follows.
File names for satellite images and the corresponding mask image are _sat.jpg and _mask.png. is a randomized integer.
由国际摄影测量及遥感探测学会(ISPRS)组织发布
数据集官网:http://www2.isprs.org/commissions/comm3/wg4/2d-sem-label-vaihingen.html
Github上该数据集的一个语义分割开源代码:
https://github.com/nshaud/DeepNetsForEO
The ground sampling distance of both, the TOP and the DSM, is 5 cm. The DSM was generated via dense image matching with Trimble INPHO 5.6 software and Trimble INPHO OrthoVista was used to generate the TOP mosaic. In order to avoid areas without data (“holes”) in the TOP and the DSM, the patches were selected from the central part of the TOP mosaic and none at the boundaries. Remaining (very small) holes in the TOP and the DSM were interpolated.
The TOP come as TIFF files in different channel composistions, where each channel has a spectral resolution of 8bit:
In this way participants can pick the data needed conveniently.
The DSM are TIFF files with one band; the grey levels (corresponding to the DSM heights) are encoded as 32 bit float values. The TOP and the DSM are defined on the same grid (UTM WGS84). Each tile comes with an affine transformation file (tiff world file) in order to enable a re-composition of images to larger mosaics if desired.
The data set contains 33 patches (of different sizes), each consisting of a true orthophoto (TOP) extracted from a larger TOP mosaic, see Figure below and a DSM. For further information about the original input data, please refer to the data description of the object detection and 3d reconstruction benchmark.
http://www2.isprs.org/commissions/comm3/wg4/2d-sem-label-vaihingen.html
同样由国际摄影测量及遥感探测学会(ISPRS)组织发布。
SEN1-2 Dataset for Deep Learning in SAR-Optical Data Fusion:
http://www2.isprs.org/commissions/comm1/wg3/resources.html
https://figshare.com/articles/TNO_Image_Fusion_Dataset/1008029
https://github.com/rmkemker/RIT-18