Training MMDetection on ScanNet

(2022.10.24) Note: the instance maps in the scannet-frames-25k subset use a format different from the raw ScanNetV2 instance maps (i.e. the .png files under scene*_*/instance/ or scene*_*/instance-filt/); [5] mentions this. So if you want to train on the raw ScanNetV2 data, then before converting to COCO object detection annotations with the conversion code modified in this post, you first have to take the raw ScanNetV2

  • raw instance maps: scene*_*/instance/*.png (or scene*_*/instance-filt/*.png), and
  • raw label maps: scene*_*/label/*.png (or scene*_*/label-filt/*.png)

and merge them into instance maps in the scannet-frames-25k format. The steps are:

  1. The class IDs in the raw label maps are ScanNet's own label IDs; map them to NYU40 class IDs with scannetv2-labels.combined.tsv;
  2. Merge the instance map and the remapped label map: instance_map = 1000 * raw_label_map + raw_instance_map

These two steps are implemented in convert_scannet_label_image.py and convert_scannet_instance_image.py under ScanNet/BenchmarkScripts/2d_helpers/ in [2]. (However, if you visualise a raw instance map yourself, you will notice that instance IDs are global within a scan, i.e. the same instance keeps the same ID across frames of the same scan, so an instance can be tracked across frames, which may come in handy for evaluation. The instance-map conversion code from [2] seems to destroy this property; if you need it, hack your own copy that keeps the global instance IDs.)
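
If you want to do this merge yourself while keeping the global instance IDs, a minimal sketch could look like the following. This is my own sketch, not the helper code from [2]; it assumes scannetv2-labels.combined.tsv has 'id' (ScanNet label ID) and 'nyu40id' columns, and that a frame's label and instance PNGs share the same resolution.

# merge-raw-scannet-maps.py -- my own sketch (not the helper scripts from [2]);
# assumes scannetv2-labels.combined.tsv has 'id' (ScanNet label ID) and
# 'nyu40id' columns, and that a frame's label/instance PNGs share a resolution.
import csv

import numpy as np
from PIL import Image

def load_label_mapping(tsv_path, from_col="id", to_col="nyu40id"):
    """Build {ScanNet label ID -> NYU40 ID} from scannetv2-labels.combined.tsv."""
    mapping = {}
    with open(tsv_path) as f:
        for row in csv.DictReader(f, delimiter="\t"):
            if row[from_col]:
                mapping[int(row[from_col])] = int(row[to_col] or 0)
    return mapping

def merge_to_25k_format(label_png, instance_png, mapping):
    """Encode one frame as 1000 * nyu40_label + raw_instance_id (cf. step 2).
    Keeping the raw instance IDs preserves the per-scan global instance IDs."""
    raw_label = np.array(Image.open(label_png), dtype=np.int64)
    raw_inst = np.array(Image.open(instance_png), dtype=np.int64)
    nyu_label = np.zeros_like(raw_label)
    for scannet_id, nyu_id in mapping.items():
        nyu_label[raw_label == scannet_id] = nyu_id
    return (1000 * nyu_label + raw_inst).astype(np.uint16)

# hypothetical usage:
# m = load_label_mapping("scannetv2-labels.combined.tsv")
# merged = merge_to_25k_format("scene0000_00/label-filt/0.png",
#                              "scene0000_00/instance-filt/0.png", m)
# Image.fromarray(merged).save("scene0000_00__0.png")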


The goal: train an object detection model on ScanNet [1,2] using MMDetection [3,4]. Steps:

  1. Download ScanNet-frames-25k (a subset of ScanNet);
  2. Split the dataset;
  3. Convert the annotations to the COCO object detection format;
  4. Set up the MMDetection environment (install from source);
  5. Modify MMDetection files and train.

ScanNet

ScanNet is mainly used in the 3D domain, but its data are RGB-D sequences, whose RGB parts can be treated as videos. There are two versions, v1 and v2; the full v2 is about 1.8T. For more information see [1-2,5-7]; the download script download-scannet.py is in [8].

[5] mentions the scannet_frames_25k subset, which is what this post mainly uses. Comparing with the code in [8], it is sampled from the full v2, roughly one frame out of every 100. The files to download are:

  • scannet_frames_25k.zip, ~5.6G, 1513 scans (i.e. RGB-D sequences, treated here simply as videos);
  • scannet_frames_test.zip, ~610M, 100 scans, the corresponding test set.

Run:

python download-scannet.py -o . --preprocessed_frames
python download-scannet.py -o . --test_frames_2d

to download them (the script did not work for me, so I pasted the download links into Thunder). Unpack and look at the file structure:

scannet_frames_25k/
|- scene0000_00/	# one scan
|  |- color/		# RGB sequence, the video (jpg)
|  |- depth/		# depth sequence (png)
|  |- instance/		# instance masks (png)
|  |- label/
|  |- pose/
|  |- intrinsics_color.txt
|  `- intrinsics_depth.txt
|- scene0000_01/	# another scan
...

scannet_frames_test/
|- scene0707_00/
|  |- color/
|  |- depth/
|  |- pose/
|  |- intrinsics_color.txt
|  `- intrinsics_depth.txt
|- scene0708_00/
...

As you can see, the test set is missing the label-related files.

[2] provides the official split under ScanNet/Tasks/Benchmark/, into train/val/test. Comparing the v2 split files (txt) there with the two zips above shows:

  • scannet_frames_25k.zip = train + val
  • scannet_frames_test.zip = test

So this subset should contain as many scans as the full release; only the sequence within each scan is subsampled.

Splitting

Judging from MMDetection's configuration files, train and val should be placed in separate directories, each with its own json annotation file. Since the test set lacks instance/, which the later COCO conversion needs, this post drops the test data and uses val instead.

  • Output path: data/scannet-frames/
# split-scannet.py
import os
import os.path as osp

"""split ScanNet
Only `scannet_frames_25k/` is used while `scannet_frames_test/` is ignored
    because the scenes in it have no `**/instance/` sub-folder, which
    is needed by `convert2panoptic.py`.
So I simply reuse the validation set as the test set, as in the COCO
    configuration file in MMDetection.
These 2 subsets are then converted separately to produce separate
    annotation json files as needed in the configuration.
"""

DATA_ROOT = "/data"
# DATA_P = [osp.join(DATA_ROOT, p) for p in ("scannet_frames_25k", "scannet_frames_test")]
DATA_P = osp.join(DATA_ROOT, "scannet_frames_25k")

# check the number of scans
dir_list = next(os.walk(DATA_P))[1]
print("#data:", len(dir_list))  # 1513 -> ALL
print("conclusion: contains all data, only frames are down-sampled")

SPLIT_P = osp.join(os.environ["HOME"], "codes", "ScanNet", "Tasks", "Benchmark")
# scannet_frames_25k is only available for ScanNetv2
VER = "v2"
DEST = "data/scannet-frames"  # data/ inside the project directory, not /data/
if not osp.exists(DEST):
    os.makedirs(DEST)

# the test set has NO `**/instance/*.png`, which `convert2panoptic.py` needs
for subset in ["train", "val"]:
    split_file = osp.join(SPLIT_P, "scannet"+VER+"_"+subset+".txt")

    # soft-link all scans of this subset to `sub_dest`
    sub_dest = osp.join(DEST, subset)
    if not osp.exists(sub_dest):
        os.makedirs(sub_dest)

    with open(split_file, "r") as f:
        for line in f:
            line = line.strip()
            if "" == line:
                continue
            os.system("ln -s {} {}".format(
                osp.join(DATA_P, line),
                osp.join(sub_dest, line)))
    print(subset, "DONE")
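
A quick sanity check after splitting (my own addition, not part of the original pipeline): count the scans linked into each subset directory. With the official v2 split this should print 1201 for train and 312 for val, 1513 in total.

# check-split.py -- my own sanity check, not part of the original pipeline
import os
import os.path as osp

DEST = "data/scannet-frames"
for subset in ("train", "val"):
    sub = osp.join(DEST, subset)
    n = sum(osp.isdir(osp.join(sub, d)) for d in os.listdir(sub))
    print(subset, "scans:", n)  # expect 1201 (train) and 312 (val)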

Convert to COCO Format

One of the data layouts MMDetection recommends is the COCO format [9-11]. rvc_devkit [12] provides a conversion script, rconvert_scannet_coco.sh, whose core is the conversion code from [2]: convert2panoptic.py. But that is actually meant for the panoptic segmentation task (see item 4 of [9]), whereas here the object detection annotation format is needed (item 1 of [9]).

  • (2022.10.8) Another conversion implementation [26]: scannet_train_val_to_efficientps.py, whose core is panoptic2detection_coco_format.py [27]

So, based on convert2panoptic.py, I wrote a conversion script for object detection (convert-scannet-coco-objdet.py):

# convert-scannet-coco-objdet.py

#!/usr/bin/python
#
# Convert to COCO-style object detection format (http://cocodataset.org/#format-data).
#

"""iTom's modified version (2022.9.13)
This file is inherited from
    rvc_devkit/segmentation/conv_scannet/convert2panoptic.py
which is the same as
    ScanNet/BenchmarkScripts/convert2panoptic.py
But I modify it to fit the object detection format and to be
    able to distinguish different subsets (i.e. train/val/test)
    in terms of the output json annotation files.

There are several modifications:

(a) Additional arguments
    - an additional optional argument of `convert2panoptic`:
        subset_tag, default = None
    - an additional optional command-line argument:
        --subset-tag -> args.subsetTag, default = None
If this argument is used, the name of output json annotation file
    will be modified accordingly.

(b) Move to COCO object detection annotation format instead of the
    original panoptic format. I borrowed the functions, i.e.
        - binary_mask_to_polygon
        - close_contour
    for polygon format segmentation info calculation. But the results
    are discarded due to
        - their weirdly large volume
            (~38G for val set & ~114G for the training set !)
        - that they are not used in detection task
    If you want to reenable it, you may need to install
        - scikit-image
    (NOTE: I suspect this is buggy somewhere.)

(c) Change extension to ".jpg" in `images/file_name` field.
"""

# python imports
from __future__ import print_function, absolute_import, division, unicode_literals
from itertools import count
import os
import glob
import sys
import argparse
import json
import numpy as np

# iTom: for polygon calculation
from skimage import measure

# Image processing
from PIL import Image

EVAL_LABELS = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 24, 28, 33, 34, 36, 39]
EVAL_LABEL_NAMES = ["wall", "floor", "cabinet", "bed", "chair", "sofa", "table", "door", "window", "bookshelf", "picture", "counter", "desk", "curtain", "refrigerator", "shower curtain", "toilet", "sink", "bathtub", "otherfurniture"]
EVAL_LABEL_CATS = ["indoor", "indoor", "furniture", "furniture", "furniture", "furniture", "furniture", "furniture", "furniture", "furniture", "furniture", "furniture", "furniture", "furniture", "appliance", "furniture", "furniture", "appliance", "furniture", "furniture"]
EVAL_LABEL_COLORS = [(174, 199, 232), (152, 223, 138), (31, 119, 180), (255, 187, 120), (188, 189, 34), (140, 86, 75), (255, 152, 150), (214, 39, 40), (197, 176, 213), (148, 103, 189), (196, 156, 148), (23, 190, 207), (247, 182, 210), (219, 219, 141), (255, 127, 14), (158, 218, 229), (44, 160, 44), (112, 128, 144), (227, 119, 194), (82, 84, 163)]

def splitall(path):
    allparts = []
    while 1:
        parts = os.path.split(path)
        if parts[0] == path:  # sentinel for absolute paths
            allparts.insert(0, parts[0])
            break
        elif parts[1] == path: # sentinel for relative paths
            allparts.insert(0, parts[1])
            break
        else:
            path = parts[0]
            allparts.insert(0, parts[1])
    return allparts


def close_contour(contour):
    """iTom: helper function for binary mask -> polygon conversion
    from: https://github.com/waspinator/pycococreator/blob/master/pycococreatortools/pycococreatortools.py#L20
    """
    if not np.array_equal(contour[0], contour[-1]):
        contour = np.vstack((contour, contour[0]))
    return contour


def binary_mask_to_polygon(binary_mask, tolerance=0):
    """iTom: Converts a binary mask to COCO polygon representation
    Args:
        binary_mask: a 2D binary numpy array where '1's represent the object
        tolerance: Maximum distance from original points of polygon to approximated
            polygonal chain. If tolerance is 0, the original coordinate array is returned.

    from: https://github.com/waspinator/pycococreator/blob/master/pycococreatortools/pycococreatortools.py#L35
    ref:
    - https://github.com/cocodataset/cocoapi/issues/131
    - https://stackoverflow.com/questions/68663512/image-segmentation-mask-to-polygon-for-coco-json
    - https://stackoverflow.com/questions/58884265/python-convert-binary-mask-to-polygon
    """
    polygons = []
    # pad mask to close contours of shapes which start and end at an edge
    padded_binary_mask = np.pad(binary_mask, pad_width=1, mode='constant', constant_values=0)
    contours = measure.find_contours(padded_binary_mask, 0.5)
    # contours = np.subtract(contours, 1)  # iTom: original but buggy
    for i in range(len(contours)):  # iTom: change to for-loop subtraction
        contours[i] = np.subtract(contours[i], 1)
    for contour in contours:
        contour = close_contour(contour)
        contour = measure.approximate_polygon(contour, tolerance)
        if len(contour) < 3:
            continue
        contour = np.flip(contour, axis=1)
        segmentation = contour.ravel().tolist()
        # after padding and subtracting 1 we may get -0.5 points in our segmentation
        segmentation = [0 if i < 0 else i for i in segmentation]
        polygons.append(segmentation)

    return polygons


# The main method
def convert2panoptic(scannetPath, outputFolder=None, subset_tag=None, beginAnnoId=0, beginImageId=0, thingsOnly=False):
    """iTom's modification
    subset_tag: str, an optional subset distinguishing string for
        train/val separation. One can simply ignore it to get the
        original output json file name.
    """

    if outputFolder is None:
        outputFolder = scannetPath

    # find files
    search = os.path.join(scannetPath, "*", "instance", "*.png")
    files = glob.glob(search)
    files.sort()
    # quit if we did not find anything
    if not files:
        print(
            "Did not find any files for using matching pattern {}. Please consult the README.".format(search)
        )
        sys.exit(-1)
    # a bit verbose
    print("Converting {} annotation files.".format(len(files)))

    outputBaseFile = "scannet_objdet"
    if subset_tag is not None:
        outputBaseFile = outputBaseFile + "_" + subset_tag
        print("iTom: modifying json annotation file name to:", outputBaseFile)
    outFile = os.path.join(outputFolder, "{}.json".format(outputBaseFile))
    print("Json file with the annotations in COCO object detection format will be saved in {}".format(outFile))
    # panopticFolder = os.path.join(outputFolder, outputBaseFile)
    # if not os.path.isdir(panopticFolder):
    #     print("Creating folder {} for panoptic segmentation PNGs".format(panopticFolder))
    #     os.mkdir(panopticFolder)
    # print("Corresponding segmentations in .png format will be saved in {}".format(panopticFolder))

    categories = []
    cls_is_things = {}  # iTom
    for idx in range(len(EVAL_LABELS)):
        label = EVAL_LABELS[idx]
        name = EVAL_LABEL_NAMES[idx]
        cat = EVAL_LABEL_CATS[idx]
        color = EVAL_LABEL_COLORS[idx]
        isthing = label > 2
        cls_is_things[int(label)] = isthing  # iTom
        if thingsOnly and not isthing:  # iTom
            continue
        categories.append({'id': int(label),
                           'name': name,
                           'color': color,
                           'supercategory': cat,
                           'isthing': isthing})

    images = []
    annotations = []
    for progress, f in enumerate(files):

        originalFormat = np.array(Image.open(f))

        parts = splitall(f)
        fileName = parts[-1]
        sceneName = parts[-3]
        outputFileName = "{}__{}".format(sceneName, fileName)
        inputFileName = os.path.join(sceneName, "color", fileName)
        # imageId = os.path.splitext(outputFileName)[0]
        imageId = beginImageId
        beginImageId += 1
        # image entry, id for image is its filename without extension
        images.append({"id": imageId,
                       "width": int(originalFormat.shape[1]),
                       "height": int(originalFormat.shape[0]),
                       "file_name": inputFileName.replace(".png", ".jpg")})
                       # "file_name": inputFileName})

        # pan_format = np.zeros(
        #     (originalFormat.shape[0], originalFormat.shape[1], 3), dtype=np.uint8
        # )
        segmentIds = np.unique(originalFormat)
        segmInfo = []
        for i_seg, segmentId in enumerate(segmentIds):
            isCrowd = 0
            if segmentId < 1000:
                semanticId = segmentId
            else:
                semanticId = segmentId // 1000
            if semanticId not in EVAL_LABELS:
                continue
            if thingsOnly and not cls_is_things[semanticId]:  # iTom
                continue

            mask = originalFormat == segmentId
            color = [segmentId % 256, segmentId // 256, segmentId // 256 // 256]
            # pan_format[mask] = color

            area = np.sum(mask) # segment area computation

            # bbox computation for a segment
            hor = np.sum(mask, axis=0)
            hor_idx = np.nonzero(hor)[0]
            x = hor_idx[0]
            width = hor_idx[-1] - x + 1
            vert = np.sum(mask, axis=1)
            vert_idx = np.nonzero(vert)[0]
            y = vert_idx[0]
            height = vert_idx[-1] - y + 1
            bbox = [int(x), int(y), int(width), int(height)]

            segmInfo.append({"id": int(segmentId),
                            "category_id": int(semanticId),
                            "area": int(area),
                            "bbox": bbox,
                            "iscrowd": isCrowd})

            # COCO object detection format:
            #   - https://cocodataset.org/#format-data
            # ref:
            #   - https://zhuanlan.zhihu.com/p/29393415
            #   - https://zhuanlan.zhihu.com/p/263454360

            # polygon = binary_mask_to_polygon(mask)  # weirdly large, discarded
            polygon = []

            # # annoId = imageId + "_" + str(i_seg)  # "scene0046_00__000200_2"
            # spaceId, scanId = sceneName.split("scene")[1].split("_")
            # imgFileNum = os.path.splitext(fileName)[0]
            # annoId = int(spaceId) * 1000000 + int(scanId) * 10000 + int(imgFileNum) + i_seg
            # # print("annoId:", annoId, "<-", spaceId, scanId, imgFileNum, i_seg)
            annoId = beginAnnoId
            beginAnnoId += 1

            annotations.append({'id': annoId,
                                'image_id': imageId,
                                'category_id': int(semanticId),
                                "segmentation": polygon,
                                'area': int(area),
                                'bbox': bbox,
                                "iscrowd": isCrowd})
            # break  # debug

        ## iTom: original panoptic annotation, removed
        # annotations.append({'image_id': imageId,
        #                     'file_name': outputFileName,
        #                     "segments_info": segmInfo})

        # Image.fromarray(pan_format).save(os.path.join(panopticFolder, outputFileName))

        print("\rProgress: {:>3.2f} %".format((progress + 1) * 100 / len(files)), end=' ')
        sys.stdout.flush()
        # break  # debug

    print("\nSaving the json file {}".format(outFile))
    d = {'images': images,
        'annotations': annotations,
        'categories': categories}
    with open(outFile, 'w') as f:
        json.dump(d, f, sort_keys=True, indent=4)

    return beginAnnoId, beginImageId


def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--dataset-folder",
                        dest="scannetPath",
                        help="path to the ScanNet data 'scannet_frames_25k' folder",
                        required=True,
                        type=str)
    parser.add_argument("--output-folder",
                        dest="outputFolder",
                        help="path to the output folder.",
                        default=None,
                        type=str)
    # iTom-added, optional
    parser.add_argument("--subset-tag",
                        dest="subsetTag",
                        help="(iTom, optional) distinguishing str for train/val separation",
                        default=None,
                        type=str)
    parser.add_argument("--begin-anno-id",
                        dest="beginAnnoId",
                        help="(iTom) annotation IDs will start from this number. " \
                            "When converting each subset sequentially, " \
                            "use this to ensure that there is no duplicated annotation ID",
                        default=0,
                        type=int)
    parser.add_argument("--begin-image-id",
                        dest="beginImageId",
                        help="(iTom) image IDs will start from this number. " \
                            "When converting each subset sequentially, " \
                            "use this to ensure that there is no duplicated image ID",
                        default=0,
                        type=int)
    parser.add_argument('--things-only',
                        dest="thingsOnly",
                        action="store_true",
                        help="keep thing classes & drop stuff classes")
    args = parser.parse_args()

    last_unused_anno_id, last_unused_image_id = convert2panoptic(
        args.scannetPath, args.outputFolder, args.subsetTag, args.beginAnnoId, args.beginImageId, args.thingsOnly)
    # record the last unused annotation & image ID to interact with `scripts/split-cvt2coco.sh`
    with open("last-unused-anno-id.txt", "w") as f:
        f.write(str(last_unused_anno_id))
    with open("last-unused-image-id.txt", "w") as f:
        f.write(str(last_unused_image_id))


# call the main
if __name__ == "__main__":
    main()

The bulk is still convert2panoptic.py; the changes are:

  • Rewrite the annotations into the COCO object detection format required by [9].
    • The segmentation field uses the polygon format as described in [9,10] (the original convert2panoptic.py hard-codes isCrowd = 0); the code that turns binary masks into polygons is borrowed from [13], see also [14-16].
    • But I ended up discarding it, because the resulting json files were absurdly large (val ~38G, train ~114G; COCO is much bigger than ScanNet, yet its whole annotation zip is only ~241M). I suspect a bug somewhere, and object detection does not seem to need it anyway.
    • According to [10], the length of annotations, i.e. the number of annotation entries, equals the number of bounding boxes in the whole (sub)set, so the annotations/id field only needs to give each bbox a distinct integer ID. Note: it must be a number, otherwise an error is raised complaining that it cannot be converted to a number.
    • To guarantee uniqueness of annotations/id, I packed the space ID, the scan ID, the number in the image file name within a scan, and the segmentation index within an image into a single integer (see the sketch after this list). I also checked this dataset: the image file-name numbers are all multiples of 100 and, divided by 100, stay below 100; and each image has fewer than 100 segmentations. Hence the annoId computation in the code (kept there as commented-out lines; the script as posted simply uses the running --begin-anno-id counter).
  • Added a subset tag argument so that train and val produce different json files.
    • Simply ignore this argument to get a single json, as with the original convert2panoptic.py.
  • The images/file_name field has its .png suffix changed to .jpg.
    • The original code enumerates frames via **/instance/*.png and assumes the .png suffix, but the RGB frames under color/ actually use the .jpg suffix.
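
For reference, the composite annoId scheme mentioned above boils down to the following sketch; it mirrors the commented-out lines in convert-scannet-coco-objdet.py, while the script as posted uses the running --begin-anno-id counter instead.

# a sketch of the composite annoId scheme (mirrors the commented-out code)
import os.path as osp

def composite_anno_id(scene_name, file_name, i_seg):
    """e.g. scene_name='scene0046_00', file_name='000200.png', i_seg=2"""
    space_id, scan_id = scene_name.split("scene")[1].split("_")  # '0046', '00'
    img_file_num = int(osp.splitext(file_name)[0])               # 200
    return int(space_id) * 1000000 + int(scan_id) * 10000 + img_file_num + i_seg

assert composite_anno_id("scene0046_00", "000200.png", 2) == 46000202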

Invocation:

#!/bin/bash

DEST=data/scannet-frames  # output of the splitting step above

anno_id=0  # interact with `convert-scannet-coco-objdet.py`
image_id=0
for subset in train val; do
    python convert-scannet-coco-objdet.py \
        --dataset-folder $DEST/$subset \
        --output-folder $DEST \
        --things-only \
        --subset-tag $subset \
        --begin-anno-id $anno_id \
        --begin-image-id $image_id

    # update beginning (i.e. last unused) annotation & image ID
    anno_id=`cat last-unused-anno-id.txt`
    image_id=`cat last-unused-image-id.txt`
done
rm last-unused-anno-id.txt
rm last-unused-image-id.txt
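
After the conversion, a quick check (my own addition, assuming pycocotools is installed) verifies that both json files load and that the counts look sane:

# check-coco-json.py -- my own sanity check; assumes pycocotools is installed
from pycocotools.coco import COCO

for subset in ("train", "val"):
    coco = COCO("data/scannet-frames/scannet_objdet_{}.json".format(subset))
    print(subset,
          "#images:", len(coco.imgs),
          "#annotations:", len(coco.anns),
          "#categories:", len(coco.cats))
    # image/annotation IDs stay unique across the two subsets thanks to the
    # --begin-*-id bookkeeping in the shell loop above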

Environment

Since MMDetection's source files are needed later, install it from source, following the installation guide get_started.md in [4]. Installation script:

#!/bin/bash
# env-mmdetection.sh

echo "create the conda virtual environment"
CONDA_P=~/miniconda3
ENV=openmmlab
if [ ! -d $CONDA_P/envs/$ENV ]; then
    conda create --name $ENV python=3.8 -y
fi
CONDA_BIN=$CONDA_P/envs/$ENV/bin

$CONDA_BIN/pip install torch==1.8.2 torchvision==0.9.2 --extra-index-url https://download.pytorch.org/whl/lts/1.8/cu111
# used in mmdetection/demo/video_gpuaccel_demo.py
$CONDA_BIN/pip install ffmpegcv scipy scikit-image
conda install -n $ENV ffmpeg -y

$CONDA_BIN/pip install -U openmim
$CONDA_BIN/mim install mmcv-full==1.6.0 mmengine
# avoid bug: KeyError: 'Cascade Mask R-CNN'
#   (i.e. open-mmlab/mim issues #125)
# https://github.com/open-mmlab/mim/issues/125
$CONDA_BIN/mim install mmdet==2.24.0

if [ ! -d mmdetection ]; then
    echo try to clone from the original github repo
    git clone https://github.com/open-mmlab/mmdetection.git
    # git submodule add https://github.com/open-mmlab/mmdetection.git
    if [ $? -ne 0 ]; then
        echo "* FAILED to clone from github"
        echo clone from a gitee transit repo instead
        git clone https://gitee.com/xoxleoxox/mmdetection
        # git submodule add https://gitee.com/xoxleoxox/mmdetection
    fi
fi
cd mmdetection
$CONDA_BIN/pip install -v -e .

echo "verify the installation"
$CONDA_BIN/mim download mmdet --config yolov3_mobilenetv2_320_300e_coco --dest .
$CONDA_BIN/python demo/image_demo.py demo/demo.jpg yolov3_mobilenetv2_320_300e_coco.py \
    yolov3_mobilenetv2_320_300e_coco_20210719_215349-d18dff72.pth --device cpu --out-file result.jpg

Note: a previous source install raised an error while verifying the installation / running the demo, see [17], hence the line installing mmdet==2.24.0; but training with mmdet 2.24.0 errors out as well, so a newer version still has to be installed from source, which overwrites the old 2.24.0.
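
Because the editable source install overwrites the pinned mmdet==2.24.0, it is worth confirming afterwards which version actually gets imported (my own quick check):

# a quick check that the editable (source) install is the one being imported
import mmcv
import mmdet

print("mmdet", mmdet.__version__, "from", mmdet.__file__)
print("mmcv-full", mmcv.__version__)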

Training

MMDetection code

Since MMDetection's code is needed, the MMDetection repository is added to the project directory as a git submodule, see [18-20]. Script:

#!/bin/bash
# add-submodules.sh

# echo rvc_devkit
# git submodule add https://github.com/ozendelait/rvc_devkit.git
# if [ $? -ne 0 ]; then
#     git submodule add https://gitee.com/tyloeng/rvc_devkit.git
# fi

echo mmdetection
git submodule add https://github.com/open-mmlab/mmdetection.git
if [ $? -ne 0 ]; then
    git submodule add https://gitee.com/xoxleoxox/mmdetection
fi

# echo ScanNet
# git submodule add https://github.com/ScanNet/ScanNet.git
# if [ $? -ne 0 ]; then
#     git submodule add https://gitee.com/gxdcode/ScanNet.git
# fi

git submodule update --init --recursive
git submodule update --remote

#CONDA_P=~/miniconda3
#ENV=openmmlab
#CONDA_BIN=$CONDA_P/envs/$ENV/bin

#cd rvc_devkit
#$CONDA_BIN/pip install -r requirements.txt
#cd objdet
#$CONDA_BIN/pip install -r requirements.txt

configuration files

To train an existing model on a new dataset with MMDetection, see the examples 2_new_data_model.md and 1_exist_data_model.md in [4]. The data were prepared above; what remains is mainly the configuration files. Following the structure of [4], I created a configs/ directory in my own project and copied two configuration files from [4], renaming them:

  • mstrain_3x_scannet.py (from mmdetection/configs/common/mstrain_3x_coco.py)
  • faster_rcnn_x101_64x4d_fpn_mstrain_3x_scannet.py (from mmdetection/configs/faster_rcnn/faster_rcnn_x101_64x4d_fpn_mstrain_3x_coco.py)

At this point the project directory looks like:

my-project/
|- convert-scannet-coco-objdet.py
|- split-scannet.py
|- data/
|  `- scannet-frames/
|     |- train/						# produced by splitting
|     |- val/						# produced by splitting
|     |- scannet_objdet_train/		# produced by the annotation conversion
|     |- scannet_objdet_val/		# produced by the annotation conversion
|     |- scannet_objdet_train.json	# produced by the annotation conversion
|     `- scannet_objdet_val.json	# produced by the annotation conversion
|- mmdetection/						# submodule
|- configs/							# mirrors the structure of mmdetection/configs/
|  |- common/
|  |  `- mstrain_3x_scannet.py
|  `- faster_rcnn/
|     `- faster_rcnn_x101_64x4d_fpn_mstrain_3x_scannet.py
`- scripts/
   |- add-submodules.sh
   |- env-mmdetection.sh
   |- find_gpu.sh
   `- train-faster-rcnn-scannet-frames.sh

The two configuration files:

  • mstrain_3x_scannet.py (change the data section to our own data and fix the _base_ reference path)
  • (2022.9.17) Following [24-25], switch the class set to ScanNet's: change classes, data/train/dataset/classes, data/val/classes and data/test/classes. (I have not tested this change, so there may be other things that need changing accordingly.)
## iTom Notes
# Inherited from `mmdetection/configs/common/mstrain_3x_coco.py`,
# this file is designed for training Faster R-CNN on converted ScanNet-frames-25k.
import os.path as osp

_base_ = '../../mmdetection/configs/_base_/default_runtime.py'
# dataset settings
dataset_type = 'CocoDataset'
classes = (
    "wall", "floor", "cabinet", "bed", "chair",
    "sofa", "table", "door", "window", "bookshelf",
    "picture", "counter", "desk", "curtain", "refrigerator",
    "shower curtain", "toilet", "sink", "bathtub", "otherfurniture"
)
data_root = 'data/scannet-frames/'
img_norm_cfg = dict(
    mean=[123.675, 116.28, 103.53], std=[58.395, 57.12, 57.375], to_rgb=True)

# In mstrain 3x config, img_scale=[(1333, 640), (1333, 800)],
# multiscale_mode='range'
train_pipeline = [
    dict(type='LoadImageFromFile'),
    dict(type='LoadAnnotations', with_bbox=True),
    dict(
        type='Resize',
        img_scale=[(1333, 640), (1333, 800)],
        multiscale_mode='range',
        keep_ratio=True),
    dict(type='RandomFlip', flip_ratio=0.5),
    dict(type='Normalize', **img_norm_cfg),
    dict(type='Pad', size_divisor=32),
    dict(type='DefaultFormatBundle'),
    dict(type='Collect', keys=['img', 'gt_bboxes', 'gt_labels']),
]
test_pipeline = [
    dict(type='LoadImageFromFile'),
    dict(
        type='MultiScaleFlipAug',
        img_scale=(1333, 800),
        flip=False,
        transforms=[
            dict(type='Resize', keep_ratio=True),
            dict(type='RandomFlip'),
            dict(type='Normalize', **img_norm_cfg),
            dict(type='Pad', size_divisor=32),
            dict(type='ImageToTensor', keys=['img']),
            dict(type='Collect', keys=['img']),
        ])
]

# Use RepeatDataset to speed up training
data = dict(
    samples_per_gpu=2,
    workers_per_gpu=2,
    train=dict(
        type='RepeatDataset',
        times=3,
        dataset=dict(
            type=dataset_type,
            ann_file=osp.join(data_root, 'scannet_objdet_train.json'),
            img_prefix=osp.join(data_root, 'train/'),
            pipeline=train_pipeline,
            classes=classes)),
    val=dict(
        type=dataset_type,
        ann_file=osp.join(data_root, 'scannet_objdet_val.json'),
        img_prefix=osp.join(data_root, 'val/'),
        pipeline=test_pipeline,
        classes=classes),
    test=dict(
        type=dataset_type,
        ann_file=osp.join(data_root, 'scannet_objdet_val.json'),
        img_prefix=osp.join(data_root, 'val/'),
        pipeline=test_pipeline,
        classes=classes))
evaluation = dict(interval=1, metric='bbox')

# optimizer
optimizer = dict(type='SGD', lr=0.02, momentum=0.9, weight_decay=0.0001)
optimizer_config = dict(grad_clip=None)

# learning policy
# Experiments show that using step=[9, 11] has higher performance
lr_config = dict(
    policy='step',
    warmup='linear',
    warmup_iters=500,
    warmup_ratio=0.001,
    step=[9, 11])
runner = dict(type='EpochBasedRunner', max_epochs=12)
  • faster_rcnn_x101_64x4d_fpn_mstrain_3x_scannet.py (point _base_ to the modified config above and fix the reference paths)
  • (2022.9.17) Following [24-25], switch the class set to ScanNet's: change model/roi_head/bbox_head/num_classes. (I have not tested this change, so there may be other things that need changing accordingly.)
## iTom Notes
# Inherited from `mmdetection/configs/faster_rcnn/faster_rcnn_x101_64x4d_fpn_mstrain_3x_coco.py`,
# this file is designed for training Faster R-CNN on converted ScanNet-frames-25k.

_base_ = [
    # '../common/mstrain_3x_coco.py',
    '../common/mstrain_3x_scannet.py',
    # '../_base_/models/faster_rcnn_r50_fpn.py'
    '../../mmdetection/configs/_base_/models/faster_rcnn_r50_fpn.py'
]
model = dict(
    backbone=dict(
        type='ResNeXt',
        depth=101,
        groups=64,
        base_width=4,
        num_stages=4,
        out_indices=(0, 1, 2, 3),
        frozen_stages=1,
        norm_cfg=dict(type='BN', requires_grad=True),
        style='pytorch',
        init_cfg=dict(
            type='Pretrained', checkpoint='open-mmlab://resnext101_64x4d')),
    roi_head=dict(bbox_head=dict(num_classes=20)))
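
Before launching training, a quick load of the final config (my own check, using mmcv's Config API) confirms that the _base_ chain resolves and that num_classes matches the 20 ScanNet classes:

# check-config.py -- my own check that the config resolves and is consistent
from mmcv import Config

cfg = Config.fromfile(
    "configs/faster_rcnn/faster_rcnn_x101_64x4d_fpn_mstrain_3x_scannet.py")
print("#classes:", len(cfg.data.train.dataset.classes))            # 20
print("num_classes:", cfg.model.roi_head.bbox_head.num_classes)    # 20
print("train ann_file:", cfg.data.train.dataset.ann_file)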

training

Distributed training with the script provided by MMDetection:

#!/bin/bash
# train-faster-rcnn-scannet-frames.sh
clear

echo run \`conda activate openmmlab\` first

config=configs/faster_rcnn/faster_rcnn_x101_64x4d_fpn_mstrain_3x_scannet.py

. scripts/find_gpu.sh 4 14845

PATH=/usr/local/cuda/bin:$PATH \
PYTHONPATH=mmdetection/mmdet:$PYTHONPATH \
CUDA_VISIBLE_DEVICES=${gpu_id} \
MMDET_DATASETS=`pwd`/data/scannet-frames/ \
bash mmdetection/tools/dist_train.sh \
    $config ${n_gpu_found}
# python mmdetection/tools/train.py \
#     $config

Where:

  • find_gpu.sh is from [21];
  • putting CUDA's bin/ directory at the front of $PATH ensures the nvcc inside the CUDA directory is used instead of /usr/bin/nvcc, see [22].

Run bash scripts/train-faster-rcnn-scannet-frames.sh to start training.

References

  1. (CVPR 2017) ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
  2. ScanNet/ScanNet
  3. (arXiv 2019) MMDetection: Open MMLab Detection Toolbox and Benchmark
  4. open-mmlab/mmdetection
  5. ScanNet Benchmark
  6. 关于ScanNet数据集
  7. 深度学习(1)RGB-D数据集:ScanNet
  8. scannet数据集下载文件
  9. COCO | Data format
  10. COCO数据集的标注格式
  11. COCO数据集标注详解
  12. ozendelait/rvc_devkit
  13. waspinator/pycococreator
  14. convert mask binary image to polygon format #131
  15. Image segmentation mask to polygon for coco json
  16. Python - convert binary mask to polygon
  17. KeyError: ‘Cascade Mask R-CNN’ #125
  18. Git Tools - Submodules
  19. Git submodule 子模块的管理和使用
  20. Git Submodule使用完整教程
  21. shell监视gpu使用情况
  22. 装detectron2报错:nvcc fatal : No input files specified; use option --help for more information
  23. facebookresearch/detr/datasets/coco.py/convert_coco_poly_to_mask
  24. AssertionError: The num_classes (3) in Shared2FCBBoxHead of MMDataParallel does not matches the length of CLASSES 80) in CocoDataset #4828
  25. Train with customized datasets | Prepare a config
  26. ScanNet-EfficientPS/tools/scannet_train_val_to_efficientps.py
  27. panopticapi/converters/panoptic2detection_coco_format.py
