Arnold-FY-Chen

如何使用nuScenes数据集格式的单帧数据推理(以DETR3D为例)

【请尊重原创！转载和引用文章内容务必注明出处！未经许可上传到某文库或其他收费阅读/下载网站赚钱的必追究责任！】

无论是mmdetection3D还是OpenPCDet都只有使用数据集(使用哪个数据集由配置文件里指定)训练和测试的代码，没有使用某种数据集格式的单帧数据进行推理的代码(也就是一些2D目标检测框里里提供的推理demo代码)，而OpenPCDet尤其是mmdetection里面为了支持多种不同的数据集和多种不同模型的训练和测试，把很多实现步骤高度配置化，这是好事也是坏事，如果你只是使用这些框架(尤其是使用它们已支持的公开数据集)进行训练和测试，那么相对简单，如果是要使用这些框架对单帧数据(可以是他们支持的公开数据集里的数据或者遵循这些数据集的格式自己制作的数据集的数据)进行推理，就比较复杂了，框架没有提供工具代码只能自己去写，写之前当然得把框架的相关代码翻看几遍把支持某个数据集和模型的配置文件和分散在多处的代码的逻辑理顺连贯起来，不然没法下手，因为框架高度配置化的话相关的代码就会非常分散在连贯性可读性方面很差，需要花时间先理清楚，然后把那些配置化的相关零散代码都整理并修改和串联起来，完成数据的多步预处理、调用模型推理、对推理结果的解析和后处理的全部流程的实现。

用nuScense格式数据调用detr3d模型的一个最烦的地方就是，你在调用模型推理时非得提供对应的img_metas数据，模型的这个设计我觉得不是很合理，起码不算友好，对于硬件和相关标定参数定了后，这种参数完全可以通过配置隐式提供给模型，为何每次调用都得提供这些meta参数呢！反正我写下面的推理代码之前是浪费了不少时间在厘清img_metas这个参数的数据该怎么准备上！

此处已DETR3D最初的官方实现代码GitHub - WangYueFt/detr3d基础给出了相关数据预处理和模型调用及推理结果后处理等代码，另外，这些代码是用于在ROS2节点中运行，解析出的推理结果还按项目实际要求转换成autoware的ObjectDetection数据格式后发布，所以还包含了ROS2和autoware相关的代码，这些代码不是重点仅供做代码逻辑完整性展示。当然，如果你项目中有类似需要的话，可以直接拷贝使用这些代码。

import rclpy
from rclpy.node import Node
from rclpy.executors import MultiThreadedExecutor
from rclpy.callback_groups import MutuallyExclusiveCallbackGroup, ReentrantCallbackGroup

from cv_bridge import CvBridge
from sensor_msgs.msg import CompressedImage, Image
from std_msgs.msg import String
from autoware_auto_perception_msgs.msg import DetectedObjects
from autoware_auto_perception_msgs.msg import DetectedObject
from autoware_auto_perception_msgs.msg import ObjectClassification
from autoware_auto_perception_msgs.msg import DetectedObjectKinematics
from autoware_auto_perception_msgs.msg import Shape

import numpy as np
import mmcv
import sys, os
import torch
import warnings
from mmcv import Config, DictAction
from mmcv.cnn import fuse_conv_bn
from mmcv.parallel import MMDataParallel, MMDistributedDataParallel
from mmcv.runner import (get_dist_info, init_dist, load_checkpoint,
                         wrap_fp16_model)

from mmdet3d.models import build_model
from mmdet.apis import multi_gpu_test, set_random_seed
from mmdet.datasets import replace_ImageToTensor
from mmdet3d.core.bbox.structures.box_3d_mode import (Box3DMode, CameraInstance3DBoxes,
                              DepthInstance3DBoxes, LiDARInstance3DBoxes)

from nuscenes import NuScenes
from pyquaternion import Quaternion
from geometry_msgs.msg import Point
from geometry_msgs.msg import Vector3
from trimesh.transformations import quaternion_from_euler
from geometry_msgs.msg import Quaternion as GeoQuaternion
from geometry_msgs.msg import Twist
import math
import time

class DetectionPublisher(Node):

    def __init__(self):
        super().__init__('DetectionPublisher_python')
        
        self.publisher_ = self.create_publisher(String, 'example_topic', 10)
        timer_period = 0.5  # seconds
        self.timer = self.create_timer(timer_period, self.timer_callback)
        self.i = 0
        

    def timer_callback(self):
        msg = String()
        msg.data = 'Hello World: %d' % self.i
        self.publisher_.publish(msg)
        self.get_logger().info('Publishing: "%s"' % msg.data)
        self.i += 1

class PadMultiViewImage(object):
    """Pad the multi-view image.
    There are two padding modes: (1) pad to a fixed size and (2) pad to the
    minimum size that is divisible by some number.
    Added keys are "pad_shape", "pad_fixed_size", "pad_size_divisor",
    Args:
        size (tuple, optional): Fixed padding size.
        size_divisor (int, optional): The divisor of padded size.
        pad_val (float, optional): Padding value, 0 by default.
    """

    def __init__(self, size=None, size_divisor=32, pad_val=0):
        self.size = size
        self.size_divisor = size_divisor
        self.pad_val = pad_val
        # only one of size and size_divisor should be valid
        assert size is not None or size_divisor is not None
        assert size is None or size_divisor is None

    def _pad_img(self, results):
        """Pad images according to ``self.size``."""
        if self.size is not None:
            padded_img = [mmcv.impad(
                img, shape=self.size, pad_val=self.pad_val) for img in results['img']]
        elif self.size_divisor is not None:
            padded_img = [mmcv.impad_to_multiple(
                img, self.size_divisor, pad_val=self.pad_val) for img in results['img']]
        results['img'] = padded_img
        results['img_shape'] = [img.shape for img in padded_img]
        results['pad_shape'] = [img.shape for img in padded_img]
        results['pad_fixed_size'] = self.size
        results['pad_size_divisor'] = self.size_divisor

    def __call__(self, results):
        """Call function to pad images, masks, semantic segmentation maps.
        Args:
            results (dict): Result dict from loading pipeline.
        Returns:
            dict: Updated result dict.
        """
        self._pad_img(results)
        return results

    def __repr__(self):
        repr_str = self.__class__.__name__
        repr_str += f'(size={self.size}, '
        repr_str += f'size_divisor={self.size_divisor}, '
        repr_str += f'pad_val={self.pad_val})'
        return repr_str

class NormalizeMultiviewImage(object):
    """Normalize the image.
    Added key is "img_norm_cfg".
    Args:
        mean (sequence): Mean values of 3 channels.
        std (sequence): Std values of 3 channels.
        to_rgb (bool): Whether to convert the image from BGR to RGB,
            default is true.
    """

    def __init__(self, mean=[103.530, 116.280, 123.675], std=[1.0, 1.0, 1.0], to_rgb=True):
        self.mean = np.array(mean, dtype=np.float32)
        self.std = np.array(std, dtype=np.float32)
        self.to_rgb = to_rgb

    def __call__(self, results):
        """Call function to normalize images.
        Args:
            results (dict): Result dict from loading pipeline.
        Returns:
            dict: Normalized results, 'img_norm_cfg' key is added into
                result dict.
        """
        results['img'] = [mmcv.imnormalize(
            img, self.mean, self.std, self.to_rgb) for img in results['img']]
        results['img_norm_cfg'] = dict(
            mean=self.mean, std=self.std, to_rgb=self.to_rgb)
        return results

    def __repr__(self):
        repr_str = self.__class__.__name__
        repr_str += f'(mean={self.mean}, std={self.std}, to_rgb={self.to_rgb})'
        return repr_str

class LoadSingleViewImageFromFiles(object):
    """Load multi channel images from a list of separate channel files.

    Expects results['img_filename'] to be a list of filenames.

    Args:
        to_float32 (bool): Whether to convert the img to float32.
            Defaults to False.
        color_type (str): Color type of the file. Defaults to 'unchanged'.
    """

    def __init__(self, to_float32=True, color_type='unchanged'):
        self.to_float32 = to_float32
        self.color_type = color_type

    def __call__(self, results):
        """Call function to load multi-view image from files.

        Args:
            results (dict): Result dict containing multi-view image filenames.

        Returns:
            dict: The result dict containing the multi-view image data. \
                Added keys and values are described below.

                - filename (str): Multi-view image filenames.
                - img (np.ndarray): Multi-view image arrays.
                - img_shape (tuple[int]): Shape of multi-view image arrays.
                - ori_shape (tuple[int]): Shape of original image arrays.
                - pad_shape (tuple[int]): Shape of padded image arrays.
                - scale_factor (float): Scale factor.
                - img_norm_cfg (dict): Normalization configuration of images.
        """
        results['filename'] = 'sample.jpg'
        # h,w,c   => h, w, c, nv 
        img = np.stack([results['img']], axis=-1)
        if self.to_float32:
            img = img.astype(np.float32)
        results['img'] = [img[..., i] for i in range(img.shape[-1])]
        results['img_shape'] = img.shape
        results['ori_shape'] = img.shape
        # Set initial values for default meta_keys
        results['pad_shape'] = img.shape
        results['scale_factor'] = 1.0
        num_channels = 1 if len(img.shape) < 3 else img.shape[2]
        results['img_norm_cfg'] = dict(
            mean=np.zeros(num_channels, dtype=np.float32),
            std=np.ones(num_channels, dtype=np.float32),
            to_rgb=False)
        return results

    def __repr__(self):
        """str: Return a string that describes the module."""
        repr_str = self.__class__.__name__
        repr_str += f'(to_float32={self.to_float32}, '
        repr_str += f"color_type='{self.color_type}')"
        return repr_str

def obtain_sensor2top(nusc,
                      sensor_token,
                      l2e_t,
                      l2e_r_mat,
                      e2g_t,
                      e2g_r_mat,
                      sensor_type='lidar'):
    """Obtain the info with RT matric from general sensor to Top LiDAR.

    Args:
        nusc (class): Dataset class in the nuScenes dataset.
        sensor_token (str): Sample data token corresponding to the
            specific sensor type.
        l2e_t (np.ndarray): Translation from lidar to ego in shape (1, 3).
        l2e_r_mat (np.ndarray): Rotation matrix from lidar to ego
            in shape (3, 3).
        e2g_t (np.ndarray): Translation from ego to global in shape (1, 3).
        e2g_r_mat (np.ndarray): Rotation matrix from ego to global
            in shape (3, 3).
        sensor_type (str, optional): Sensor to calibrate. Default: 'lidar'.

    Returns:
        sweep (dict): Sweep information after transformation.
    """
    sd_rec = nusc.get('sample_data', sensor_token)
    cs_record = nusc.get('calibrated_sensor',
                         sd_rec['calibrated_sensor_token'])
    pose_record = nusc.get('ego_pose', sd_rec['ego_pose_token'])
    data_path = str(nusc.get_sample_data_path(sd_rec['token']))
    if os.getcwd() in data_path:  # path from lyftdataset is absolute path
        data_path = data_path.split(f'{os.getcwd()}/')[-1]  # relative path
    sweep = {
        'data_path': data_path,
        'type': sensor_type,
        'sample_data_token': sd_rec['token'],
        'sensor2ego_translation': cs_record['translation'],
        'sensor2ego_rotation': cs_record['rotation'],
        'ego2global_translation': pose_record['translation'],
        'ego2global_rotation': pose_record['rotation'],
        'timestamp': sd_rec['timestamp']
    }
    l2e_r_s = sweep['sensor2ego_rotation']
    l2e_t_s = sweep['sensor2ego_translation']
    e2g_r_s = sweep['ego2global_rotation']
    e2g_t_s = sweep['ego2global_translation']

    # obtain the RT from sensor to Top LiDAR
    # sweep->ego->global->ego'->lidar
    l2e_r_s_mat = Quaternion(l2e_r_s).rotation_matrix
    e2g_r_s_mat = Quaternion(e2g_r_s).rotation_matrix
    R = (l2e_r_s_mat.T @ e2g_r_s_mat.T) @ (
        np.linalg.inv(e2g_r_mat).T @ np.linalg.inv(l2e_r_mat).T)
    T = (l2e_t_s @ e2g_r_s_mat.T + e2g_t_s) @ (
        np.linalg.inv(e2g_r_mat).T @ np.linalg.inv(l2e_r_mat).T)
    T -= e2g_t @ (np.linalg.inv(e2g_r_mat).T @ np.linalg.inv(l2e_r_mat).T
                  ) + l2e_t @ np.linalg.inv(l2e_r_mat).T
    sweep['sensor2lidar_rotation'] = R.T  # points @ R.T + T
    sweep['sensor2lidar_translation'] = T
    return sweep

def getSemanticType(class_name):
    if (class_name == "CAR" or class_name == "Car"):
       return ObjectClassification.CAR
    elif (class_name == "TRUCK" or class_name == "Medium_Truck" or class_name =="Big_Truck"):
       return ObjectClassification.TRUCK
    elif (class_name == "BUS"):
       return ObjectClassification.BUS
    elif (class_name == "TRAILER"):
       return ObjectClassification.TRAILER
    elif (class_name == "BICYCLE"):
       return ObjectClassification.BICYCLE
    elif (class_name == "MOTORBIKE"):
       return ObjectClassification.MOTORCYCLE
    elif (class_name == "PEDESTRIAN" or class_name == "Pedestrian"):
       return ObjectClassification.PEDESTRIAN
    else: 
       return ObjectClassification.UNKNOWN


class CustomBox3D(object):
  def __init__(self,nid,score,x,y,z,w,l,h,rt,vel_x,vel_y):
      self.id = nid
      self.score = score
      self.x = x
      self.y = y
      self.z = z
      self.w = w
      self.l = l
      self.h = h
      self.rt = rt
      self.vel_x = vel_x
      self.vel_y = vel_y

def isCarLikeVehicle(label):
   return label == ObjectClassification.BICYCLE or label == ObjectClassification.BUS or \
         label == ObjectClassification.CAR or label == ObjectClassification.MOTORCYCLE or \
         label == ObjectClassification.TRAILER or label == ObjectClassification.TRUCK 

def createPoint(x, y, z):
  p = Point()
  p.x = float(x)
  p.y = float(y)
  p.z = float(z)
  return p

def createQuaternionFromYaw(yaw):
  # tf2.Quaternion
  # q.setRPY(0, 0, yaw)
  q = quaternion_from_euler(0, 0, yaw)
  # geometry_msgs.msg.Quaternion
  #return tf2.toMsg(q)
  #return GeoQuaternion(*q)
  return GeoQuaternion(x=q[0],y=q[1],z=q[2],w=q[3])

def createTranslation(x, y, z):
  v = Vector3()
  v.x = float(x)
  v.y = float(y)
  v.z = float(z)
  return v

def box3DToDetectedObject(box3d, class_names, has_twist, is_sign):
  obj = DetectedObject()
  obj.existence_probability = float(box3d.score)

  classification = ObjectClassification()
  classification.probability = 1.0
  if (box3d.id >= 0 and box3d.id < len(class_names)):
    classification.label = getSemanticType(class_names[box3d.id])
  else: 
    if is_sign:
      sign_label = 255
      classification.label = sign_label
    else:
      classification.label = ObjectClassification.UNKNOWN
      print("Unexpected label: UNKNOWN is set.")
  
  if (isCarLikeVehicle(classification.label)):
    obj.kinematics.orientation_availability = DetectedObjectKinematics.SIGN_UNKNOWN

  obj.classification.append(classification)

  # pose and shape
  # mmdet3d yaw format to ros yaw format
  yaw = -box3d.rt - np.pi / 2
  obj.kinematics.pose_with_covariance.pose.position = createPoint(box3d.x, box3d.y, box3d.z)
  obj.kinematics.pose_with_covariance.pose.orientation = createQuaternionFromYaw(yaw)
  obj.shape.type = Shape.BOUNDING_BOX
  obj.shape.dimensions = createTranslation(box3d.l, box3d.w, box3d.h)
  # twist
  if (has_twist):
    vel_x = float(box3d.vel_x)
    vel_y = float(box3d.vel_y)
    twist = Twist()
    twist.linear.x = math.sqrt(pow(vel_x, 2) + pow(vel_y, 2))
    twist.angular.z = 2 * (math.atan2(vel_y, vel_x) - yaw)
    obj.kinematics.twist_with_covariance.twist = twist
    obj.kinematics.has_twist = has_twist
  return obj  


class ImageSubscriber(Node):

    def __init__(self):
        super().__init__('ImageSubscriber_python')
        cb_group = MutuallyExclusiveCallbackGroup()
        self.img_sub = self.create_subscription(
            CompressedImage,
            'pub_image/compressed',
            self.image_callback,
            10,
            callback_group=cb_group)
        self.img_sub
        self.od_pub = self.create_publisher(DetectedObjects, 'pub_detection', 10)
        self.cvBridge  = CvBridge()
        self.pad = PadMultiViewImage()
        self.norm = NormalizeMultiviewImage()
        self.file_loader = LoadSingleViewImageFromFiles() 
        config_path = "./detr3d_res101_gridmask_wst.py"
        self.cfg = Config.fromfile(config_path)
        if self.cfg.get('custom_imports', None):
           from mmcv.utils import import_modules_from_strings
           import_modules_from_strings(**self.cfg['custom_imports'])
        if hasattr(self.cfg, 'plugin'):
           if self.cfg.plugin:
             import importlib
             if hasattr(self.cfg, 'plugin_dir'):
                plugin_dir = self.cfg.plugin_dir
                _module_dir = os.path.dirname(plugin_dir)
                _module_dir = _module_dir.split('/')
                _module_path = _module_dir[0]

                for m in _module_dir[1:]:
                    _module_path = _module_path + '.' + m
                print(_module_path)
                print(sys.path)
                plg_lib = importlib.import_module(_module_path)
        if self.cfg.get('cudnn_benchmark', False):
           torch.backends.cudnn.benchmark = True
        self.cfg.model.pretrained = None
        self.cfg.model.train_cfg = None
        self.model = build_model(self.cfg.model, test_cfg=self.cfg.get('test_cfg'))
        fp16_cfg = self.cfg.get('fp16', None)
        if fp16_cfg is not None:
           wrap_fp16_model(self.model)
        checkpoint = load_checkpoint(self.model, "epoch_200.pth", map_location='cpu')
        
        if 'CLASSES' in checkpoint.get('meta', {}):
           self.model.CLASSES = checkpoint['meta']['CLASSES']
        else:
           self.model.CLASSS = ('car', 'truck', 'trailer', 'bus', 'construction_vehicle',
               'bicycle', 'motorcycle', 'pedestrian', 'traffic_cone',
               'barrier')
        # palette for visualization in segmentation tasks
        if 'PALETTE' in checkpoint.get('meta', {}):
           self.model.PALETTE = checkpoint['meta']['PALETTE']
        self.model.cfg  = self.cfg 

        self.model.cuda()
        self.model.eval()
        #if torch.cuda.device_count() > 1:  # for server side
        #   self.model = nn.DataParallel(self.model)
        print("model is created!")

        nusc = NuScenes(version='v1.0-mini', dataroot='nuscenes_mini', verbose=True)
        scene0 = nusc.scene[0]
        first_sample_token = scene0['first_sample_token']
        first_sample = nusc.get('sample', first_sample_token)
        sd_rec = nusc.get('sample_data', first_sample['data']['LIDAR_TOP'])
        cs_record = nusc.get('calibrated_sensor',
                             sd_rec['calibrated_sensor_token'])
        pose_record = nusc.get('ego_pose', sd_rec['ego_pose_token'])
        lidar_token = first_sample['data']['LIDAR_TOP']
        lidar_path, boxes, _ = nusc.get_sample_data(lidar_token)
        info = {
            'lidar_path': lidar_path,
            'token': first_sample['token'],
            'sweeps': [],
            'cams': dict(),
            'lidar2ego_translation': cs_record['translation'],
            'lidar2ego_rotation': cs_record['rotation'],
            'ego2global_translation': pose_record['translation'],
            'ego2global_rotation': pose_record['rotation'],
            'timestamp': first_sample['timestamp'],
        }

        l2e_r = info['lidar2ego_rotation']
        l2e_t = info['lidar2ego_translation']
        e2g_r = info['ego2global_rotation']
        e2g_t = info['ego2global_translation']
        l2e_r_mat = Quaternion(l2e_r).rotation_matrix
        e2g_r_mat = Quaternion(e2g_r).rotation_matrix
        camera_types = [
            'CAM_FRONT',
            'CAM_FRONT_RIGHT',
            'CAM_FRONT_LEFT',
            'CAM_BACK',
            'CAM_BACK_LEFT',
            'CAM_BACK_RIGHT',
        ]
        for cam in camera_types:
            cam_token = first_sample['data'][cam]
            cam_path, _, cam_intrinsic = nusc.get_sample_data(cam_token)
            cam_info = obtain_sensor2top(nusc, cam_token, l2e_t, l2e_r_mat,
                                         e2g_t, e2g_r_mat, cam)
            cam_info.update(cam_intrinsic=cam_intrinsic)
            info['cams'].update({cam: cam_info})

        '''
        cam_front_sample_data = nusc.get('sample_data', first_sample['data']['CAM_FRONT'])
        cam_front_sample_path = os.path.join(nusc.dataroot, cam_front_sample_data['filename'])
        print("sample image file:", cam_front_sample_path)
        cam_front_calibrate = nusc.get('calibrated_sensor', cam_front_sample_data['calibrated_sensor_token'])
        sensor2lidar = translation = np.expand_dims(np.array(cam_front_calibrate['translation']), axis=-1)
        sensor2lidar_rotation = np.expand_dims(np.array(cam_front_calibrate['rotation']), axis=-1)
        camera_intrinsic = np.array(cam_front_calibrate['camera_intrinsic'])
        '''

        image_paths = []
        lidar2img_rts = []
        lidar2cam_rts = []
        cam_intrinsics = []
        for cam_type, cam_info in info['cams'].items():
            image_paths.append(cam_info['data_path'])
             # obtain lidar to image transformation matrix
            lidar2cam_r = np.linalg.inv(cam_info['sensor2lidar_rotation'])
            lidar2cam_t = cam_info[
                'sensor2lidar_translation'] @ lidar2cam_r.T
            lidar2cam_rt = np.eye(4)
            lidar2cam_rt[:3, :3] = lidar2cam_r.T
            lidar2cam_rt[3, :3] = -lidar2cam_t
            intrinsic = cam_info['cam_intrinsic']
            viewpad = np.eye(4)
            viewpad[:intrinsic.shape[0], :intrinsic.shape[1]] = intrinsic
            lidar2img_rt = (viewpad @ lidar2cam_rt.T)
            lidar2img_rts.append(lidar2img_rt)

            cam_intrinsics.append(viewpad)
            lidar2cam_rts.append(lidar2cam_rt.T)

        self.img_metas = {}
        self.img_metas.update(
                dict(
                    img_filename=image_paths,
                    lidar2img=lidar2img_rts,
                    cam_intrinsic=cam_intrinsics,
                    lidar2cam=lidar2cam_rts,
                ))
        self.class_names_ = self.cfg.class_names
        print("ImageSubscriber init done")
 
    
    def image_callback(self, msg):
        #image = self.cvBridge.imgmsg_to_cv2(msg, "bgr8")
        image = self.cvBridge.compressed_imgmsg_to_cv2(msg, "bgr8")
        #print("image received, shape:", image.shape)
        results = {'img': image}
        self.file_loader(results)
        self.norm(results)
        self.pad(results)
        image = results['img'][0]
        meta = {'filename': results['filename'], 
                'img_shape': results['img_shape'],
                'ori_shape': results['ori_shape'],
                'pad_shape': results['pad_shape'],
                'scale_factor': results['scale_factor'],
                'box_type_3d': LiDARInstance3DBoxes,     #CameraInstance3DBoxes/LiDARInstance3DBoxes
                'box_mode_3d': Box3DMode.LIDAR}
        meta.update(self.img_metas)
        #print("meta:", meta)
        img_metas = [[meta]]
        inputs = torch.tensor(image).to('cuda')
        # h,w,c => bs,nv,c,h,w
        inputs = inputs.permute(2,0,1)
        inputs = torch.unsqueeze(inputs, 0)
        inputs = torch.unsqueeze(inputs, 0)
        #print("input tensor shape:", inputs.shape)
        with torch.no_grad():
           outputs = self.model(return_loss=False, rescale=True, img=inputs, img_metas=img_metas) 
           torch.cuda.synchronize()
           pts_bbox = outputs[0]['pts_bbox']
           boxes_3d_enc = pts_bbox['boxes_3d']
           scores_3d = pts_bbox['scores_3d']
           labels_3d = pts_bbox['labels_3d']

           filter = scores_3d >= 0.5
           boxes_3d_enc.tensor = boxes_3d_enc.tensor[filter]
           boxes_3d = boxes_3d_enc.tensor.numpy() # [[cx, cy, cz, w, l, h, rot, vx, vy]]
           scores_3d = scores_3d[filter].numpy()
           labels_3d = labels_3d[filter].numpy()
           custom_boxes_3d = []
           for i, box_3d in enumerate(boxes_3d):
              box3d = CustomBox3D(labels_3d[i], scores_3d[i],
                                   box_3d[0],box_3d[1],box_3d[2],
                                   box_3d[3],box_3d[4],box_3d[5],
                                   box_3d[6],box_3d[7],box_3d[8])
              custom_boxes_3d.append(box3d)
               
           #print("boxes_3d", boxes_3d)
           #print("scores_3d", scores_3d)
           #print("labels_3d", labels_3d)
           output_msg = DetectedObjects()
           obj_num = len(boxes_3d)
           for i, box3d in enumerate(custom_boxes_3d):
              obj = box3DToDetectedObject(box3d, self.class_names_, True, False);
              output_msg.objects.append(obj)

           output_msg.header.stamp = self.get_clock().now().to_msg() #rclpy.time.Time()
           output_msg.header.frame_id = "base_link"
           self.od_pub.publish(output_msg)
           print(obj_num, "Objects published")


def main(args=None):
    rclpy.init(args=args)

    sys.path.append(os.getcwd())
    image_subscriber = ImageSubscriber()
    executor = MultiThreadedExecutor()
    executor.add_node(image_subscriber)
    executor.spin()

    image_subscriber.destroy_node()
    detection_publisher.destroy_node()
    rclpy.shutdown()


if __name__ == '__main__':
    main()

这里说的单帧数据可以是单张图片或者nuScenes格式的6张环视图片，之所以说一帧，是对应点云的概念，实际做3D目标检测尤其是雷视融合推理时一般都是以点云帧的频率为时间单位进行推理(在获取一帧点云的同时获取一张或多张图片(取决于有几个摄像头)，前融合的话是将点云和图像的数据做数据融合后输入模型或直接输入模型由模型内部抽取特征做特征融合，后融合的话是将点云和图像分别输入不同的模型推理，再对模型各自的结果进行融合处理)。

上面的代码由于是以DETR3D官方代码为基础的，所以和mmdetection3D后来集成DETR3D的实现代码不同，对比两者代码可以发现，mmdetection3D将DETR3D官方代码里的一些数据预处理环节的实现代码都移走了，在模型内部单独提供了一个Det3DDataPreprocessor类来进行一些公共处理，总体的配置文件和实现思路还是比较类似，毕竟DETR3D官方是在mmdetection3D的早期版本基础上开发的，例如两者的detr3d_r101_gridmask.py配置文件里面有些配置不同，但是思路是相似的：

mmdetection3d/projects/DETR3D/configs/detr3d_r101_gridmask.py

model = dict(
    type='DETR3D',
    use_grid_mask=True,
    data_preprocessor=dict(
        type='Det3DDataPreprocessor', **img_norm_cfg, pad_size_divisor=32),
    img_backbone=dict(
        type='mmdet.ResNet',
        depth=101,
        num_stages=4,
        out_indices=(0, 1, 2, 3),
        frozen_stages=1,
        norm_cfg=dict(type='BN2d', requires_grad=False),
        norm_eval=True,
        style='caffe',
        dcn=dict(type='DCNv2', deform_groups=1, fallback_on_stride=False),
        stage_with_dcn=(False, False, True, True)),
    img_neck=dict(
        type='mmdet.FPN',
        in_channels=[256, 512, 1024, 2048],
        out_channels=256,
        start_level=1,
        add_extra_convs='on_output',
        num_outs=4,
        relu_before_extra_convs=True),
    pts_bbox_head=dict(
        type='DETR3DHead',
        num_query=900,
        num_classes=10,
        in_channels=256,
        sync_cls_avg_factor=True,
        with_box_refine=True,
        as_two_stage=False,
        transformer=dict(
            type='Detr3DTransformer',
            decoder=dict(
                type='Detr3DTransformerDecoder',
                num_layers=6,
                return_intermediate=True,
                transformerlayers=dict(
                    type='mmdet.DetrTransformerDecoderLayer',
                    attn_cfgs=[
                        dict(
                            type='MultiheadAttention',  # mmcv.
                            embed_dims=256,
                            num_heads=8,
                            dropout=0.1),
                        dict(
                            type='Detr3DCrossAtten',
                            pc_range=point_cloud_range,
                            num_points=1,
                            embed_dims=256)
                    ],
                    feedforward_channels=512,
                    ffn_dropout=0.1,
                    operation_order=('self_attn', 'norm', 'cross_attn', 'norm',
                                     'ffn', 'norm')))),
        bbox_coder=dict(
            type='NMSFreeCoder',
            post_center_range=[-61.2, -61.2, -10.0, 61.2, 61.2, 10.0],
            pc_range=point_cloud_range,
            max_num=300,
            voxel_size=voxel_size,
            num_classes=10),
        positional_encoding=dict(
            type='mmdet.SinePositionalEncoding',
            num_feats=128,
            normalize=True,
            offset=-0.5),
        loss_cls=dict(
            type='mmdet.FocalLoss',
            use_sigmoid=True,
            gamma=2.0,
            alpha=0.25,
            loss_weight=2.0),
        loss_bbox=dict(type='mmdet.L1Loss', loss_weight=0.25),
        loss_iou=dict(type='mmdet.GIoULoss', loss_weight=0.0)),
    # model training and testing settings
    train_cfg=dict(
        pts=dict(
            grid_size=[512, 512, 1],
            voxel_size=voxel_size,
            point_cloud_range=point_cloud_range,
            out_size_factor=4,
            assigner=dict(
                type='HungarianAssigner3D',
                cls_cost=dict(type='mmdet.FocalLossCost', weight=2.0),
                reg_cost=dict(type='BBox3DL1Cost', weight=0.25),
                # ↓ Fake cost. This is just to get compatible with DETR head
                iou_cost=dict(type='mmdet.IoUCost', weight=0.0),
                pc_range=point_cloud_range))))

dataset_type = 'NuScenesDataset'
data_root = 'data/nuscenes/'

test_transforms = [
    dict(
        type='RandomResize3D',
        scale=(1600, 900),
        ratio_range=(1., 1.),
        keep_ratio=True)
]
train_transforms = [dict(type='PhotoMetricDistortion3D')] + test_transforms

backend_args = None
train_pipeline = [
    dict(
        type='LoadMultiViewImageFromFiles',
        to_float32=True,
        num_views=6,
        backend_args=backend_args),
    dict(
        type='LoadAnnotations3D',
        with_bbox_3d=True,
        with_label_3d=True,
        with_attr_label=False),
    dict(type='MultiViewWrapper', transforms=train_transforms),
    dict(type='ObjectRangeFilter', point_cloud_range=point_cloud_range),
    dict(type='ObjectNameFilter', classes=class_names),
    dict(type='Pack3DDetInputs', keys=['img', 'gt_bboxes_3d', 'gt_labels_3d'])
]

test_pipeline = [
    dict(
        type='LoadMultiViewImageFromFiles',
        to_float32=True,
        num_views=6,
        backend_args=backend_args),
    dict(type='MultiViewWrapper', transforms=test_transforms),
    dict(type='Pack3DDetInputs', keys=['img'])
]

detr3d/projects/configs/detr3d/detr3d_res101_gridmask.py

model = dict(
    type='Detr3D',
    use_grid_mask=True,
    img_backbone=dict(
        type='ResNet',
        depth=101,
        num_stages=4,
        out_indices=(0, 1, 2, 3),
        frozen_stages=1,
        norm_cfg=dict(type='BN2d', requires_grad=False),
        norm_eval=True,
        style='caffe',
        dcn=dict(type='DCNv2', deform_groups=1, fallback_on_stride=False),
        stage_with_dcn=(False, False, True, True)),
    img_neck=dict(
        type='FPN',
        in_channels=[256, 512, 1024, 2048],
        out_channels=256,
        start_level=1,
        add_extra_convs='on_output',
        num_outs=4,
        relu_before_extra_convs=True),
    pts_bbox_head=dict(
        type='Detr3DHead',
        num_query=900,
        num_classes=10,
        in_channels=256,
        sync_cls_avg_factor=True,
        with_box_refine=True,
        as_two_stage=False,
        transformer=dict(
            type='Detr3DTransformer',
            decoder=dict(
                type='Detr3DTransformerDecoder',
                num_layers=6,
                return_intermediate=True,
                transformerlayers=dict(
                    type='DetrTransformerDecoderLayer',
                    attn_cfgs=[
                        dict(
                            type='MultiheadAttention',
                            embed_dims=256,
                            num_heads=8,
                            dropout=0.1),
                        dict(
                            type='Detr3DCrossAtten',
                            pc_range=point_cloud_range,
                            num_points=1,
                            embed_dims=256)
                    ],
                    feedforward_channels=512,
                    ffn_dropout=0.1,
                    operation_order=('self_attn', 'norm', 'cross_attn', 'norm',
                                     'ffn', 'norm')))),
        bbox_coder=dict(
            type='NMSFreeCoder',
            post_center_range=[-61.2, -61.2, -10.0, 61.2, 61.2, 10.0],
            pc_range=point_cloud_range,
            max_num=300,
            voxel_size=voxel_size,
            num_classes=10), 
        positional_encoding=dict(
            type='SinePositionalEncoding',
            num_feats=128,
            normalize=True,
            offset=-0.5),
        loss_cls=dict(
            type='FocalLoss',
            use_sigmoid=True,
            gamma=2.0,
            alpha=0.25,
            loss_weight=2.0),
        loss_bbox=dict(type='L1Loss', loss_weight=0.25),
        loss_iou=dict(type='GIoULoss', loss_weight=0.0)),
    # model training and testing settings
    train_cfg=dict(pts=dict(
        grid_size=[512, 512, 1],
        voxel_size=voxel_size,
        point_cloud_range=point_cloud_range,
        out_size_factor=4,
        assigner=dict(
            type='HungarianAssigner3D',
            cls_cost=dict(type='FocalLossCost', weight=2.0),
            reg_cost=dict(type='BBox3DL1Cost', weight=0.25),
            iou_cost=dict(type='IoUCost', weight=0.0), # Fake cost. This is just to make it compatible with DETR head. 
            pc_range=point_cloud_range))))

dataset_type = 'NuScenesDataset'
data_root = 'data/nuscenes/'

...

train_pipeline = [
    dict(type='LoadMultiViewImageFromFiles', to_float32=True),
    dict(type='PhotoMetricDistortionMultiViewImage'),
    dict(type='LoadAnnotations3D', with_bbox_3d=True, with_label_3d=True, with_attr_label=False),
    dict(type='ObjectRangeFilter', point_cloud_range=point_cloud_range),
    dict(type='ObjectNameFilter', classes=class_names),
    dict(type='NormalizeMultiviewImage', **img_norm_cfg),
    dict(type='PadMultiViewImage', size_divisor=32),
    dict(type='DefaultFormatBundle3D', class_names=class_names),
    dict(type='Collect3D', keys=['gt_bboxes_3d', 'gt_labels_3d', 'img'])
]
test_pipeline = [
    dict(type='LoadMultiViewImageFromFiles', to_float32=True),
    dict(type='NormalizeMultiviewImage', **img_norm_cfg),
    dict(type='PadMultiViewImage', size_divisor=32),
    dict(
        type='MultiScaleFlipAug3D',
        img_scale=(1333, 800),
        pts_scale_ratio=1,
        flip=False,
        transforms=[
            dict(
                type='DefaultFormatBundle3D',
                class_names=class_names,
                with_label=False),
            dict(type='Collect3D', keys=['img'])
        ])
]

你可能感兴趣的:(detr3d,nuScenes,mmdetection3D,autoware,ROS2)

自动驾驶中间件技术对比小牛蛋自动驾驶中间件
转载：自动驾驶中间件技术辨析：ROS、Apex.Grace、DDS、AutoSAR和AutoSARAdaptive-CSDN博客在自动驾驶技术的演进中，中间件作为连接硬件、操作系统与应用软件的核心枢纽，其安全性、实时性和可扩展性至关重要。当前市场上主流的中间件技术包括ROS/ROS2、Apex.Grace（Apex.OS）、DDS、AutoSAR（经典平台CP）和AutoSARAdaptive（自
ROS2——C++新特性 A_lvvx ROS2 c++开发语言 ROS2
1.自动类型推导auto,可以自行将定义的变量赋值为整形、浮点型、字符型.....2.智能指针c++11提供了三种类型的智能指针：std::unique_ptr、std::shared_ptr和std::weak_ptr。在同一个程序中将某个资源使用智能共享指针进行管理，那么该数据无论在多少个函数内进行传递，都不会发生资源的复制，运行效率会大大提高。当所有的程序使用完毕后，还会自动收回，不会造成内
ROS2基础——Linux A_lvvx ROS2 linux ROS2
Ctrl+Alt+T:打开一个新终端1.查看终端目录命令$pwd#查看终端当前目录---/home/lvvx2.切换终端目录到根目录$cd/#从当前进入根目录$pwd---/3.查看当前目录下文件$ls#查看当前目录下文件---bindevhomeliblib64lost+foundmntprocrunsnapsysusrbootectinitlib32libx32mediaoptrootsbin
【已解决】conda环境下ROS2 colcon build编译选择特定python解释器_anaconda 使用colcon m0_60607675 2024年程序员学习 python conda 开发语言
先自我介绍一下，小编浙江大学毕业，去过华为、字节跳动等大厂，目前阿里P7深知大多数程序员，想要提升技能，往往是自己摸索成长，但自己不成体系的自学效果低效又漫长，而且极易碰到天花板技术停滞不前！因此收集整理了一份《2024年最新Python全套学习资料》，初衷也很简单，就是希望能够帮助到想自学提升又不知道该从何学起的朋友。既有适合小白学习的零基础资料，也有适合3年以上经验的小伙伴深入学习提升的进阶课
C++ 实现 ROS 2 点云欧几里得聚类 c++
C++实现ROS2点云欧几里得聚类在LivoxMid-360采集的sensor_msgs::msg::PointCloud2点云数据上进行欧几里得聚类（EuclideanClusterExtraction），具体流程如下：✅1.订阅PointCloud2并转换为pcl::PointCloud解释：sensor_msgs::msg::PointCloud2是ROS2点云消息格式，PCL不能直接处理。
ROS2使用RCLPY编写节点 ct1027038527 python 开发语言
1.创建文件夹mkdir-pchapt2/chapt2_ws_py/src/2.进入指定文件夹cdchapt2/chapt2_ws_py/src/3.创建Python功能包ros2pkgcreateexample_py--build-typeament_python--dependenciesrclpy4.编写节点在example_py/example_py下创建node_02.py接着我们开始编
ros2 rclpy 详解 --创建 python类型节点 Lntano__y ros2学习 python python ros2 rclpy
rclpy是ROS2(RobotOperatingSystem2)中用于Python的客户端库。它提供了与ROS2系统交互的API，使开发者能够使用Python编写ROS2节点、发布和订阅消息、调用服务、定时器等。rclpy是ROS2的核心库之一，为Python开发者提供了与ROS2系统进行通信的能力。rclpy的基本功能创建节点：提供创建和管理ROS2节点的功能。发布/订阅消息：支持创建发布者和
Node【二】跨域和同源，跨域常用的解决方法小祥编程 Node 前端 javascript 开发语言
Node【二】跨域和同源文章目录Node【二】跨域和同源前言一、跨域、同源是什么？1、同源2、跨域二、跨域的解决方法。2.1、CORS2.1.1、在原生的node中使用cors2.1.2、在express中使用cros2.2、JSONP总结前言我们在使用node.js搭建服务器的时候，就避免不了前、后端的请求和访问，那么跨域和同源我们就必须要掌握。一、跨域、同源是什么？1、同源同源：是游览器最基本
征程 6 工具链 BEVPoolV2 算子使用教程 1 - BEVPoolV2 算子详解算法自动驾驶
1.引言当前，地平线征程6工具链已经全面支持了BEVPoolingV2算子，并与mmdetection3d的实现完成了精准对齐。然而，需要注意的是，此算子因其内在的复杂性以及相关使用示例的稀缺，致使部分用户在实际运用过程中遭遇了与预期不符的诸多问题。在这样的背景下，本文首先会对BEVPoolingV2的实现进行全方位、细致入微的剖析讲解，，让复杂的原理变得清晰易懂。随后，还会通过代表性的示例，来进
如何高效运行 DeepSeek-R1：分步指南知识大胖 NVIDIA GPU和大语言模型开发教程 deepseek janus pro ollama
简介DeepSeek-R1是一个功能强大的开源AI模型，但要高效运行它，需要仔细的硬件选择、优化和部署策略。无论您想在Mac上本地运行它，还是在云GPU上运行它，还是优化性能以供大规模使用，本指南都会逐步引导您完成所有操作。推荐文章《如何在本地电脑上安装和使用DeepSeekR-1》权重1，DeepSeek《Nvidia系列之使用NVIDIAIsaacSim和ROS2的命令行控制您的机器人》权重1
《ROS2 机器人开发从入门道实践》鱼香ROS2——第5章内容儒雅芝士机器人
目录第5章ROS常用开发工具5.1坐标变换工具介绍5.1.1通过命令行使用TF5.1.2对TF原理的简单探究5.2Python中的手眼坐标变换5.2.1通过Python发布静态TF5.2.2通过Python发布动态TF5.2.3通过Python查询TF关系5.3C++中的地图坐标系变化5.3.1通过C++发布静态TF5.3.2通过C++发布动态TF5.3.3通过C++查询TF关系5.4常用可视化工
ROS2入门教程—创建ROS2功能包（C++版） Roar冷颜 ROS2入门教程其他
ROS2入门教程—创建ROS2功能包（C++版）1ROS2中的功能包2创建功能包3编译功能包4设置环境变量5运行功能包6功能包中的内容7修改package.xml文件功能包是ROS2中组织代码的基本容器，方便我们编译、安装、分发开发的代码，一般来讲，每个功能包都是用来完成某项具体的功能相对完整的单元。1ROS2中的功能包 ROS2中的功能包可以使用CMake或者Python两种方式来编译（本
ROS2软件调用架构和机制解析：Publisher创建 slam02∞ ros2 dds
术语DDS(DataDistributionService):用于实时系统的数据分发服务标准，是ROS2底层通信的基础RMW(ROSMiddleware):ROS中间件接口，提供与具体DDS实现无关的抽象APIQoS(QualityofService):服务质量策略，控制通信的可靠性、历史记录、耐久性等属性符号解析:动态库加载过程中，查找和绑定函数指针的机制1.架构概述ROS2采用分层设计，通过多
autoware.universe编译过程中的一个报错：＜command-line＞: fatal error: grid_map_core/eigen_plugins/FunctorsPlugin. 不断学习加努力算法自动驾驶
文章目录前言前言在autoware.universe的编译过程中，报了一个错误：:fatalerror:grid_map_core/eigen_plugins/FunctorsPlugin.hpp:没有那个文件或目录compilationterminated.gmake[2]:***[CMakeFiles/autoware_behavior_velocity_planner_lib.dir/bui
【一看就会】Autoware.universe的“规划”部分源码梳理【四十六】（autoware_obstacle_cruise_planner：障碍物巡航规划器）不断学习加努力自动驾驶算法
提示：文章写完后，目录可以自动生成，如何生成可参考右边的帮助文档文章目录前言十二、autoware_obstacle_cruise_planner：障碍物巡航规划器1.功能概述2.工作流程3.调用关系4.订阅发布话题订阅话题发布话题5.关键算法实现6.主要参数配置规划器选择安全参数优化器参数PID参数巡航参数7.文件结构和功能a)核心实现文件node.cpp：planner_interface.c
ROS2安装教程（virtualbox7.0.6+ROS2） setella c++ubuntu
整个过程分两步：先安装Virtualbox，再安装ROS2一、安装virtualbox7.0.6网址：https://www.virtualbox.org/wiki/Downloads问题1安装时报错：缺少pythoncore、win32api依赖（下图网上拷贝的图，版本忽略）解决：根据virtualbox论坛的帖子，有人说不使用python控制virtualbox的话，可以先不安装，去掉pyth
【一看就会】Autoware.universe的路径点生成逻辑不断学习加努力自动驾驶算法
文章目录前言一、on_set_lanelet_route：根据道路生成路线二、on_set_waypoint_route：根据路径点生成路线三、调用on_set_waypoint_route的流程四、设置路径点总结前言在【一看就会】Autoware.universe的车道规划逻辑文章中，我们介绍了车道规划逻辑。车道规划是指在两个路径点之间规划一条连续可行的车道。也就是说，在我们给了起点和终点之后，
基于Kitti数据集实现MMDetection3D点云物体检测训练 Xian-HHappy 技术知识点 kitti三维点云无人驾驶 MMDetection3D 人工智能计算机视觉目标检测
DataBall助力快速掌握数据集的信息和使用方式，会员享有百种数据集，持续增加中。需要更多数据资源和技术解决方案，知识星球：“DataBall-X数据球(free)”贵在坚持！-----------------------------------------------------------------------------------------------MMDetection3D环境安
奥比中光3D机器视觉相机能连接halcon吗？视觉人机器视觉机器视觉3D 3d 数码相机视觉检测 c#
奥比中光的设备与Halcon的兼容性可以通过以下方式实现：数据接口的通用性奥比中光的相机（如AstroPro、大白等）支持通过UVC协议获取彩色图像，深度数据则通过OpenNI或ROS2接口传输105。若Halcon支持这些协议或标准接口（如ROS消息、OpenCV图像流），则可通过直接调用或二次开发实现连接。例如，通过Python或C#脚本将图像数据从相机传输至Halcon的处理流程中。SDK与
ROS2软件架构全面解析-学习如何设计通信中间件框架 chinamaoge ROS2 DDS 通信中间件 ROS2 ROS 通信中间件
前言ROS（RobotOperatingSystem）2是一个用于开发机器人应用的软件平台，也称为机器人软件开发工具包(SDK)。ROS2是ROS1的迭代升级版本，最主要的升级点是引入DDS（DataDistributionService）为基础的底层通信系统。为解决ROS1存在主要几个缺陷：为解决一个主节点通信故障(ROSMaster)，需要对所有现有的Client库进行单独的补丁处理，并且每个
【一看就会】Autoware.universe的“规划”部分源码梳理【三十五】（motion_velocity_planner第四部分）不断学习加努力算法自动驾驶
文章目录前言四、autoware_motion_velocity_out_of_lane_module功能概述处理流程图输入输出关键算法实现主要参数配置安全参数速度参数检测参数工作流程各文件主要功能核心功能文件：工具类文件：源码注释calculate_slowdown_points.cppfilter_predicted_objects.cppfootprint.cppdebug.cpp总结前言书
【ROS2】RViz2自定义面板插件（rviz_common::Panel）的详细步骤郭老二 ROS Qt ROS2
【ROS】郭老二博文之：ROS目录1、简述RViz2的插件基于ROS2的插件库（pluginlib）机制，通过动态加载共享库实现功能扩展。注意：RViz2使用QT作为UI框架，虽然QT也有插件机制，但是RViz2并没有使用QT的插件机制，而是通过pluginlib加载功能模块来实现。2、插件类型每个插件必须继承相应的基类，才能被RViz识别。RViz2中共有5类插件：插件类型基类Display（显
安装ubuntu20.04+安装ros-noetic 机械专业的计算机小白 ROS ubuntu ros
碰壁：1.VMware15.5因为兼容性问题，启动虚拟机，电脑就蓝屏重启。解决：安16pro版本。2.rosdep问题，网络问题，不停的超时。解决：通过鱼香ROS博主的方法。安装原因：ubuntu20.04支持python3，同时安装ros对应的版本是noetic不需要rosdep（但是官方后期又补齐了这个操作），而且是ros1最后一版方便过渡到ros2.、安装准备：1.注册好的VMware16p
ROS2: Qos机制扛着相机的翻译官 ROS 网络
ROS2:Qos机制Qos机制是ros2区别与ros1增加的重要内容，用来弥补ros1通讯不稳定的问题。按照我的理解，Qos机制通过参数的配置，相当于将通讯机制调整在介于TCP和UDP模式之间。根据使用场景，配置相应的Qos参数，可以侧重于数据通讯实时性或者数据通讯质量。兼容性ReliabilityQosPolicies:PublisherSubscriber兼容BesteffortBesteff
对海康威视工业相机进行取图 boss-dog 海康威视工业相机 c++海康威视工业相机
之前通过海康SDK开发时，取相机当前帧图像一直用的是MV_CC_GetOneFrameTimeout函数，因为每次都是需要的时候去主动取一帧图像，也没关注CPU的占用率。最近在海康相机SDK的基础之上封装了一层ROS2节点，用来对外发布相机的图像，需要一直采集相机的图像，重复调用MV_CC_GetOneFrameTimeout函数函数发现CPU占用率挺高的（本质还是MV_CC_StartG
Ubuntu24.04初始化教程(包含基础优化、ros2) DW_DROME ROS2 配置与环境 ubuntu
更美观的展示下载后在浏览器中打开。将会不断更新。但是所有都是基础且必要的操作。为重装系统之后的环境配置提供便捷信息来源。记录一些错误的解决方案。文章目录构建系统建立系统备份**Timeshift:系统快照和备份工具****安装Timeshift****使用Timeshift创建快照****还原快照****自动创建快照**最基本配置时间同步换源官方源软件源ROS2的软件源软件配置打开新世界大门谷歌浏
自动驾驶数据集三剑客：nuScenes、nuImages 与 nuPlan 的技术矩阵与生态协同数据与算法架构提升之路 #自动驾驶自动驾驶人工智能机器学习
目录1、引言2、主要内容2.1、定位对比：感知与规划的全维覆盖2.2、数据与技术特性对比2.3、技术协同：构建全栈研发生态2.4、应用场景与评估体系2.5、总结与展望3、参考文献1、引言随着自动驾驶技术向全栈化迈进，Motional团队构建了涵盖3D感知、2D检测及规划决策的数据集矩阵，为自动驾驶系统提供了从环境感知到行为决策的全链路支持。nuScenes：多模态3D感知的行业标杆nuImages
ROS2（Robot Operating System 2）与树莓派（Raspberry Pi） Covirtue ROS2 ROS2
ROS2与树莓派一、ROS2简介ROS2是一个为机器人提供硬件抽象、设备驱动、函数库、可视化工具、消息通信以及软件包管理等多种功能的开源操作系统。它支持分布式计算，允许多个节点（即进程）在局域网内自由通信，非常适合用于多机器人协作和复杂机器人系统的开发。二、树莓派简介树莓派是一款基于ARM架构的微型电脑主板，以SD卡为内存硬盘，卡片主板周围有1/2/4个USB接口和一个10/100以太网接口（A型
在本地运行DeepSeek Janus 系列，DeepSeek Janus 系列用于图像理解和生成的统一多模态 AI 知识大胖 NVIDIA GPU和大语言模型开发教程人工智能 deepseek
简介人工智能正在快速发展，多模态模型正在彻底改变机器理解和生成内容的方式。DeepSeek的Janus系列是一种先进的开源多模态人工智能模型，它将图像理解、文本到图像生成和视觉语言推理统一到一个系统中。推荐文章《如何在本地电脑上安装和使用DeepSeekR-1》权重1，DeepSeek《Nvidia系列之使用NVIDIAIsaacSim和ROS2的命令行控制您的机器人》权重1，NVIDIAIsaa
Ubuntu 的 ROS 2 操作系统 turtlebot3 导航仿真 Code-world-1 Turtlebot3 PC端ROS环境搭建与仿真 ubuntu linux ROS 2 轮式移动机器人导航仿真
引言导航仿真是机器人自动化系统中不可或缺的一部分，能够帮助开发者在虚拟环境中测试机器人在复杂场景下的运动与路径规划。在Gazebo仿真环境中，TurtleBot3配合ROS2提供了强大的导航功能。在进行导航仿真时，首先需要准备地图，并确保机器人能够正确地定位自己。本文将介绍如何在Ubuntu系统上使用ROS2的Navigation2包进行TurtleBot3的导航仿真。通过详细的步骤，开发者可以在
LeetCode[位运算] - #137 Single Number II Cwind java Algorithm LeetCode 题解位运算
原题链接：#137 Single Number II 要求：给定一个整型数组，其中除了一个元素之外，每个元素都出现三次。找出这个元素注意：算法的时间复杂度应为O(n)，最好不使用额外的内存空间难度：中等分析：与#136类似，都是考察位运算。不过出现两次的可以使用异或运算的特性 n XOR n = 0, n XOR 0 = n，即某一
《JavaScript语言精粹》笔记 aijuans JavaScript
0、JavaScript的简单数据类型包括数字、字符创、布尔值（true/false）、null和undefined值，其它值都是对象。 1、JavaScript只有一个数字类型，它在内部被表示为64位的浮点数。没有分离出整数，所以1和1.0的值相同。 2、NaN是一个数值，表示一个不能产生正常结果的运算结果。NaN不等于任何值，包括它本身。可以用函数isNaN(number)检测NaN,但是
你应该更新的Java知识之常用程序库 Kai_Ge java
在很多人眼中，Java 已经是一门垂垂老矣的语言，但并不妨碍 Java 世界依然在前进。如果你曾离开 Java，云游于其它世界，或是每日只在遗留代码中挣扎，或许是时候抬起头，看看老 Java 中的新东西。 Guava Guava[gwɑ:və]，一句话，只要你做Java项目，就应该用Guava（Github）。 guava 是 Google 出品的一套 Java 核心库，在我看来，它甚至应该
HttpClient 120153216 httpclient
/** * 可以传对象的请求转发，对象已流形式放入HTTP中 */ public static Object doPost(Map<String,Object> parmMap,String url) { Object object = null; HttpClient hc = new HttpClient(); String fullURL
Django model字段类型清单 2002wmj django
Django 通过 models 实现数据库的创建、修改、删除等操作，本文为模型中一般常用的类型的清单，便于查询和使用： AutoField：一个自动递增的整型字段，添加记录时它会自动增长。你通常不需要直接使用这个字段；如果你不指定主键的话，系统会自动添加一个主键字段到你的model。(参阅自动主键字段) BooleanField：布尔字段,管理工具里会自动将其描述为checkbox。 Cha
在SQLSERVER中查找消耗CPU最多的SQL 357029540 SQL Server
返回消耗CPU数目最多的10条语句 SELECT TOP 10 total_worker_time/execution_count AS avg_cpu_cost, plan_handle, execution_count, (SELECT SUBSTRING(text, statement_start_of
Myeclipse项目无法部署，Undefined exploded archive location 7454103 eclipse MyEclipse
做个备忘！错误信息为： Undefined exploded archive location 原因：在工程转移过程中，导致工程的配置文件出错；解决方法：
GMT时间格式转换 adminjun GMT 时间转换
普通的时间转换问题我这里就不再罗嗦了，我想大家应该都会那种低级的转换问题吧，现在我向大家总结一下如何转换GMT时间格式，这种格式的转换方法网上还不是很多，所以有必要总结一下，也算给有需要的朋友一个小小的帮助啦。 1、可以使用 SimpleDateFormat SimpleDateFormat EEE-三位星期 d-天 MMM-月 yyyy-四位年
Oracle数据库新装连接串问题 aijuans oracle数据库
割接新装了数据库，客户端登陆无问题，apache/cgi-bin程序有问题，sqlnet.log日志如下： Fatal NI connect error 12170. VERSION INFORMATION: TNS for Linux: Version 10.2.0.4.0 - Product
回顾java数组复制 ayaoxinchao java 数组
在写这篇文章之前，也看了一些别人写的，基本上都是大同小异。文章是对java数组复制基础知识的回顾，算是作为学习笔记，供以后自己翻阅。首先，简单想一下这个问题：为什么要复制数组？我的个人理解：在我们在利用一个数组时，在每一次使用，我们都希望它的值是初始值。这时我们就要对数组进行复制，以达到原始数组值的安全性。java数组复制大致分为3种方式：①for循环方式 ②clone方式 ③arrayCopy方
java web会话监听并使用spring注入 bewithme Java Web
在java web应用中，当你想在建立会话或移除会话时，让系统做某些事情，比如说，统计在线用户，每当有用户登录时，或退出时，那么可以用下面这个监听器来监听。 import java.util.ArrayList; import java.ut
NoSQL数据库之Redis数据库管理(Redis的常用命令及高级应用) bijian1013 redis 数据库 NoSQL
一 .Redis常用命令 Redis提供了丰富的命令对数据库和各种数据库类型进行操作，这些命令可以在Linux终端使用。 a.键值相关命令 b.服务器相关命令 1.键值相关命令 &
java枚举序列化问题 bingyingao java 枚举序列化
对象在网络中传输离不开序列化和反序列化。而如果序列化的对象中有枚举值就要特别注意一些发布兼容问题: 1.加一个枚举值新机器代码读分布式缓存中老对象，没有问题，不会抛异常。老机器代码读分布式缓存中新对像，反序列化会中断，所以在所有机器发布完成之前要避免出现新对象，或者提前让老机器拥有新增枚举的jar。 2.删一个枚举值新机器代码读分布式缓存中老对象，反序列
【Spark七十八】Spark Kyro序列化 bit1129 spark
当使用SparkContext的saveAsObjectFile方法将对象序列化到文件，以及通过objectFile方法将对象从文件反序列出来的时候，Spark默认使用Java的序列化以及反序列化机制，通常情况下，这种序列化机制是很低效的，Spark支持使用Kyro作为对象的序列化和反序列化机制，序列化的速度比java更快，但是使用Kyro时要注意，Kyro目前还是有些bug。 Spark
Hybridizing OO and Functional Design bookjovi erlang haskell
推荐博文： Tell Above, and Ask Below - Hybridizing OO and Functional Design 文章中把OO和FP讲的深入透彻，里面把smalltalk和haskell作为典型的两种编程范式代表语言，此点本人极为同意，smalltalk可以说是最能体现OO设计的面向对象语言，smalltalk的作者Alan kay也是OO的最早先驱，
Java-Collections Framework学习与总结-HashMap BrokenDreams Collections
开发中常常会用到这样一种数据结构，根据一个关键字，找到所需的信息。这个过程有点像查字典，拿到一个key，去字典表中查找对应的value。Java1.0版本提供了这样的类java.util.Dictionary(抽象类)，基本上支持字典表的操作。后来引入了Map接口，更好的描述的这种数据结构。 &nb
读《研磨设计模式》-代码笔记-职责链模式-Chain Of Responsibility bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ /** * 业务逻辑：项目经理只能处理500以下的费用申请，部门经理是1000，总经理不设限。简单起见，只同意“Tom”的申请 * bylijinnan */ abstract class Handler { /*
Android中启动外部程序 cherishLC android
1、启动外部程序引用自： http://blog.csdn.net/linxcool/article/details/7692374 //方法一 Intent intent=new Intent(); //包名包名+类名（全路径） intent.setClassName("com.linxcool", "com.linxcool.PlaneActi
summary_keep_rate coollyj SUM
BEGIN /*DECLARE minDate varchar(20) ; DECLARE maxDate varchar(20) ;*/ DECLARE stkDate varchar(20) ; DECLARE done int default -1; /* 游标中注册服务器地址 */ DE
hadoop hdfs 添加数据目录出错 daizj hadoop hdfs 扩容
由于原来配置的hadoop data目录快要用满了，故准备修改配置文件增加数据目录，以便扩容，但由于疏忽，把core-site.xml, hdfs-site.xml配置文件dfs.datanode.data.dir 配置项增加了配置目录，但未创建实际目录，重启datanode服务时，报如下错误： 2014-11-18 08:51:39,128 WARN org.apache.hadoop.h
grep 目录级联查找 dongwei_6688 grep
在Mac或者Linux下使用grep进行文件内容查找时，如果给定的目标搜索路径是当前目录，那么它默认只搜索当前目录下的文件，而不会搜索其下面子目录中的文件内容，如果想级联搜索下级目录，需要使用一个“-r”参数： grep -n -r "GET" . 上面的命令将会找出当前目录“.”及当前目录中所有下级目录
yii 修改模块使用的布局文件 dcj3sjt126com yii layouts
方法一：yii模块默认使用系统当前的主题布局文件，如果在主配置文件中配置了主题比如: 'theme'=>'mythm', 那么yii的模块就使用 protected/themes/mythm/views/layouts 下的布局文件；如果未配置主题，那么 yii的模块就使用 protected/views/layouts 下的布局文件，总之默认不是使用自身目录 pr
设计模式之单例模式 come_for_dream 设计模式单例模式懒汉式饿汉式双重检验锁失败无序写入
今天该来的面试还没来，这个店估计不会来电话了，安静下来写写博客也不错，没事翻了翻小易哥的博客甚至与大牛们之间的差距，基础知识不扎实建起来的楼再高也只能是危楼罢了，陈下心回归基础把以前学过的东西总结一下。 *********************************
8、数组豆豆咖啡二维数组数组一维数组
一、概念数组是同一种类型数据的集合。其实数组就是一个容器。二、好处可以自动给数组中的元素从0开始编号，方便操作这些元素三、格式 //一维数组 1,元素类型[] 变量名 = new 元素类型[元素的个数] int[] arr =
Decode Ways hcx2013 decode
A message containing letters from A-Z is being encoded to numbers using the following mapping: 'A' -> 1 'B' -> 2 ... 'Z' -> 26 Given an encoded message containing digits, det
Spring4.1新特性——异步调度和事件机制的异常处理 jinnianshilongnian spring 4.1
目录 Spring4.1新特性——综述 Spring4.1新特性——Spring核心部分及其他 Spring4.1新特性——Spring缓存框架增强 Spring4.1新特性——异步调用和事件机制的异常处理 Spring4.1新特性——数据库集成测试脚本初始化 Spring4.1新特性——Spring MVC增强 Spring4.1新特性——页面自动化测试框架Spring MVC T
squid3(高命中率)缓存服务器配置 liyonghui160com
系统:centos 5.x 需要的软件:squid-3.0.STABLE25.tar.gz 1.下载squid wget http://www.squid-cache.org/Versions/v3/3.0/squid-3.0.STABLE25.tar.gz tar zxf squid-3.0.STABLE25.tar.gz &&
避免Java应用中NullPointerException的技巧和最佳实践 pda158 java
1) 从已知的String对象中调用equals()和equalsIgnoreCase()方法，而非未知对象。　　总是从已知的非空String对象中调用equals()方法。因为equals()方法是对称的，调用a.equals(b)和调用b.equals(a)是完全相同的，这也是为什么程序员对于对象a和b这么不上心。如果调用者是空指针，这种调用可能导致一个空指针异常 Object unk
如何在Swift语言中创建http请求 shoothao http swift
概述：本文通过实例从同步和异步两种方式上回答了”如何在Swift语言中创建http请求“的问题。如果你对Objective-C比较了解的话，对于如何创建http请求你一定驾轻就熟了，而新语言Swift与其相比只有语法上的区别。但是，对才接触到这个崭新平台的初学者来说，他们仍然想知道“如何在Swift语言中创建http请求？”。在这里,我将作出一些建议来回答上述问题。常见的
Spring事务的传播方式 uule spring事务
传播方式：新建事务 required required_new - 挂起当前非事务方式运行 supports &nbs