danmeng8068

A Summary of SLAM Methods

http://blog.csdn.net/smartxxyx/article/details/53068855

SLAM overview

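A typical SLAM pipeline has two parts, tracking and mapping. Tracking, also called the front end, estimates the camera pose. Mapping (the back end) builds depth: using the camera poses estimated by the tracking module, it computes the depth of the corresponding feature points by triangulation and reconstructs a map of the current environment. The reconstructed map in turn gives the front end a better basis for pose estimation, and it can also be used for tasks such as loop-closure detection.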
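To make the triangulation step concrete, here is a minimal sketch in plain Python. It uses the midpoint method, which is only one of several triangulation schemes and is not taken from any particular system listed in this post. The function name `triangulate_midpoint` and its arguments are hypothetical: each camera is given as a center `c` and a viewing-ray direction `d` toward the feature, both already expressed in the world frame.

```python
def triangulate_midpoint(c1, d1, c2, d2):
    """Return the 3D point closest to two viewing rays c + s*d (midpoint method)."""
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    w = [p - q for p, q in zip(c1, c2)]      # vector from camera 2 to camera 1
    a, b, c = dot(d1, d1), dot(d1, d2), dot(d2, d2)
    d, e = dot(d1, w), dot(d2, w)
    denom = a * c - b * b                    # zero when the two rays are parallel
    if abs(denom) < 1e-12:
        raise ValueError("rays are (nearly) parallel; depth is unobservable")
    s = (b * e - c * d) / denom              # position along ray 1
    t = (a * e - b * d) / denom              # position along ray 2
    p1 = [ci + s * di for ci, di in zip(c1, d1)]
    p2 = [ci + t * di for ci, di in zip(c2, d2)]
    # The estimate is halfway between the closest points on the two rays.
    return [(x + y) / 2.0 for x, y in zip(p1, p2)]


# Two cameras one unit apart, both observing the same point.
point = triangulate_midpoint([0.0, 0.0, 0.0], [0.5, 0.2, 4.0],
                             [1.0, 0.0, 0.0], [-0.5, 0.2, 4.0])
print(point)
```

The parallel-ray check reflects why a monocular system needs enough camera translation before it can estimate depth: with a very short baseline the two rays are almost parallel and the solution becomes ill-conditioned.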

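Monocular SLAM can be roughly classified by the sparsity of the map it builds:
- sparse (feature-point) methods, semi-dense methods, and dense methods

By matching approach, it can be classified into direct methods and feature-based methods.

By the optimization strategy the system adopts, it can be classified into keyframe-based and filter-based methods: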

Strasdat H, Montiel J M M, Davison A J. Visual SLAM: why filter?[J]. Image and Vision Computing, 2012, 30(2): 65-77.
For all these scenarios, we conclude that keyframe bundle adjustment outperforms filtering, since it gives the most accuracy per unit of computing time.

Typical monocular SLAM systems

EKF-SLAM, FastSLAM 1.0, FastSLAM 2.0 and UKF-SLAM: http://www-personal.acfr.usyd.edu.au/tbailey/software/slam_simulations.htm

https://github.com/yglee/FastSLAM

ekf-slam-matlab

EKF-SLAM TOOLBOX FOR MATLAB

SceneLib2: SLAM originally designed and implemented by Professor Andrew Davison at Imperial College London

> MonoSLAM: Real-Time Single Camera SLAM, Andrew J. Davison, Ian Reid, Nicholas Molton and Olivier Stasse, IEEE Trans. PAMI 2007.

PTAM: http://www.robots.ox.ac.uk/~gk/PTAM/

https://github.com/Oxford-PTAM/PTAM-GPL

https://ewokrampage.wordpress.com/

https://github.com/tum-vision/tum_ardrone

[Figure: PTAM class diagram]

> Georg Klein and David Murray, "Parallel Tracking and Mapping for Small AR Workspaces", Proc. ISMAR 2007

DTSLAM: Deferred Triangulation for Robust SLAM
> Herrera C., D., Kim, K., Kannala, J., Pulli, K., Heikkila, J., DT-SLAM: Deferred Triangulation for Robust SLAM, 3DV, 2014.

LSD-SLAM: http://vision.in.tum.de/research/vslam/lsdslam

A novel, direct monocular SLAM technique: Instead of using keypoints, it directly operates on image intensities both for tracking and mapping. The camera is tracked using direct image alignment, while geometry is estimated in the form of semi-dense depth maps, obtained by filtering over many pixelwise stereo comparisons. We then build a Sim(3) pose-graph of keyframes, which allows to build scale-drift corrected, large-scale maps including loop-closures. LSD-SLAM runs in real-time on a CPU, and even on a modern smartphone.

> LSD-SLAM: Large-Scale Direct Monocular SLAM (J. Engel, T. Schöps, D. Cremers), In European Conference on Computer Vision (ECCV), 2014.

SVO: Fast Semi-Direct Monocular Visual Odometry (ICRA 2014)

[Figure: SVO class diagram]

> Paper: http://rpg.ifi.uzh.ch/docs/ICRA14_Forster.pdf

ORB-SLAM2: [Figure: ORB-SLAM workflow]

http://webdiis.unizar.es/~raulmur/orbslam/

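Paper translation (in Chinese): http://qiqitek.com/blog/?p=13

ORB-SLAM is a visual SLAM system written by Raul Mur-Artal of the University of Zaragoza, Spain. His paper "ORB-SLAM: A Versatile and Accurate Monocular SLAM System" was published in IEEE Transactions on Robotics in 2015. The open-source code comprises the earlier ORB-SLAM and the later ORB-SLAM2. The first version targets monocular SLAM, while the second supports monocular, stereo, and RGB-D input.

ORB-SLAM is a complete SLAM system, covering visual odometry, tracking, and loop-closure detection. It is a monocular SLAM system based entirely on sparse feature points, and it uses ORB (Oriented FAST and Rotated BRIEF) as the core feature throughout the pipeline. This shows in two respects:

- The feature points that are extracted and tracked are ORB features. ORB extraction is very fast, which suits systems with strict real-time requirements.
- Loop-closure detection uses a bag-of-words model whose vocabulary is a large ORB dictionary.

It offers a rich set of interfaces, supporting monocular, stereo, and RGB-D sensor input, and ROS is optional at build time, which makes it easy to deploy. The cost is that supporting all of these interfaces makes the code logic somewhat more complex.
It runs in real time on a PC at roughly 30 ms per frame, but performs poorly on embedded platforms.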

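It consists mainly of three threads: Tracking, Local Mapping (also called the "small graph"), and Loop Closing (also called the "large graph"). The tracking thread is essentially a visual odometry, and it proceeds as follows:

1. Extract ORB features from the raw image and compute their descriptors.
2. Match features between images based on the descriptors.
3. Estimate the camera motion from the matched feature points.
4. Decide whether the current frame is a keyframe according to the keyframe criterion.

Whereas most visual SLAM systems pick keyframes by the amount of inter-frame motion, ORB-SLAM's keyframe criterion is considerably more involved.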
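The descriptor-matching step of the tracking pipeline can be sketched roughly as follows. ORB descriptors are binary strings, so they are compared by Hamming distance. This is a simplified illustration in plain Python, not ORB-SLAM's actual matcher: the function names and the thresholds `max_dist` and `ratio` are hypothetical, and descriptors are stored as Python integers purely for brevity.

```python
def hamming(a, b):
    """Hamming distance between two binary descriptors stored as Python ints."""
    return bin(a ^ b).count("1")


def match_descriptors(desc_a, desc_b, max_dist=50, ratio=0.8):
    """Return (index_a, index_b) pairs that pass a distance test and a ratio test."""
    matches = []
    for i, da in enumerate(desc_a):
        # Rank all candidates in the other image by descriptor distance.
        ranked = sorted((hamming(da, db), j) for j, db in enumerate(desc_b))
        if not ranked:
            continue
        best_dist, best_j = ranked[0]
        if best_dist > max_dist:
            continue  # even the nearest neighbour is too different
        # Ratio test: the best match must be clearly better than the runner-up.
        if len(ranked) > 1 and best_dist >= ratio * ranked[1][0]:
            continue
        matches.append((i, best_j))
    return matches


# Toy 8-bit descriptors; real ORB descriptors are 256 bits long.
frame_a = [0b11110000, 0b00001111]
frame_b = [0b00001110, 0b11110001, 0b10101010]
print(match_descriptors(frame_a, frame_b, max_dist=2))
```

The brute-force search here costs one comparison per descriptor pair. A real-time system would normally restrict the search, for example to a window around the predicted feature position, which this sketch leaves out.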

> Raúl Mur-Artal, J. M. M. Montiel and Juan D. Tardós. ORB-SLAM: A Versatile and Accurate Monocular SLAM System. IEEE Transactions on Robotics, vol. 31, no. 5, pp. 1147-1163, October 2015.

> Raúl Mur-Artal and Juan D. Tardós. Probabilistic Semi-Dense Mapping from Highly Accurate Feature-Based Monocular SLAM. Robotics: Science and Systems. Rome, Italy, July 2015.

Dense monocular SLAM systems

DTAM: https://github.com/anuranbaka/OpenDTAM

http://homes.cs.washington.edu/~newcombe/papers/newcombe_etal_iccv2011.pdf

REMODE: Probabilistic, Monocular Dense Reconstruction in Real Time (ICRA 2014)

http://rpg.ifi.uzh.ch/docs/ICRA14_Pizzoli.pdf

DPPTAM: DPPTAM is a direct monocular odometry algorithm that estimates a dense reconstruction of a scene in real-time on a CPU. Highly textured image areas are mapped using standard direct mapping techniques, that minimize the photometric error across different views. We make the assumption that homogeneous-color regions belong to approximately planar areas. Related Publication:

> Alejo Concha, Javier Civera. DPPTAM: Dense Piecewise Planar Tracking and Mapping from a Monocular Sequence. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS15), Hamburg, Germany, 2015.

Dense RGB-D SLAM systems

Elastic Fusion: Real-time dense visual SLAM system

ElasticFusion: Dense SLAM Without A Pose Graph, T. Whelan, S. Leutenegger, R. F. Salas-Moreno, B. Glocker and A. J. Davison, RSS '15

Kintinuous: Real-time large scale dense visual SLAM system
- Real-time Large Scale Dense RGB-D SLAM with Volumetric Fusion, T. Whelan, M. Kaess, H. Johannsson, M.F. Fallon, J. J. Leonard and J.B. McDonald, IJRR '14
- Kintinuous: Spatially Extended KinectFusion, T. Whelan, M. Kaess, M.F. Fallon, H. Johannsson, J. J. Leonard and J.B. McDonald, RSS RGB-D Workshop '12

RGBDSLAMv2: a state-of-the-art SLAM system for RGB-D cameras, e.g., the Microsoft Kinect or the Asus Xtion Pro Live. You can use it to create 3D point clouds or OctoMaps.

> "3D Mapping with an RGB-D Camera", F. Endres, J. Hess, J. Sturm, D. Cremers, W. Burgard, IEEE Transactions on Robotics, 2014.

RTAB-Map: Real-Time Appearance-Based Mapping

The loop closure detector uses a bag-of-words approach to determine how likely it is that a new image comes from a previous location or a new location. When a loop closure hypothesis is accepted, a new constraint is added to the map's graph, then a graph optimizer minimizes the errors in the map. A memory management approach is used to limit the number of locations used for loop closure detection and graph optimization, so that real-time constraints on large-scale environments are always respected. RTAB-Map can be used alone with a hand-held Kinect or stereo camera for 6DoF RGB-D mapping, or on a robot equipped with a laser rangefinder for 3DoF mapping.

> M. Labbé and F. Michaud, “Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM,” in Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014.

DVO: Dense Visual Odometry and SLAM

> Dense Visual SLAM for RGB-D Cameras (C. Kerl, J. Sturm, D. Cremers), In Proc. of the Int. Conf. on Intelligent Robot Systems (IROS), 2013.

> Robust Odometry Estimation for RGB-D Cameras (C. Kerl, J. Sturm, D. Cremers), In Int. Conf. on Robotics and Automation, 2013.

Visual-inertial SLAM systems

ROVIO: Robust Visual Inertial Odometry

Paper: http://dx.doi.org/10.3929/ethz-a-010566547

OKVIS: Open Keyframe-based Visual Inertial SLAM

Stefan Leutenegger, Simon Lynen, Michael Bosse, Roland Siegwart and Paul Timothy Furgale. Keyframe-based visual–inertial odometry using nonlinear optimization. The International Journal of Robotics Research, 2015.

Recent monocular SLAM systems

REBVO: Realtime Edge Based Visual Odometry for a Monocular Camera

REBVO tracks a camera in Realtime using edges. The system is split in 2 components. An on-board part (rebvo itself) doing all the processing and sending data over UDP and an OpenGL visualizer.

> Tarrio, J. J., & Pedre, S. (2015). Realtime Edge-Based Visual Odometry for a Monocular Camera. In Proceedings of the IEEE International Conference on Computer Vision (pp. 702-710).

Direct Sparse Odometry: http://vision.in.tum.de/research/vslam/dso

https://www.youtube.com/watch?v=C6-xwSOOdqQ

A novel direct and sparse formulation for Visual Odometry. It combines a fully direct probabilistic model (minimizing a photometric error) with consistent, joint optimization of all model parameters, including geometry - represented as inverse depth in a reference frame - and camera motion. This is achieved in real time by omitting the smoothness prior used in other direct methods and instead sampling pixels evenly throughout the images. DSO does not depend on keypoint detectors or descriptors, thus it can naturally sample pixels from across all image regions that have intensity gradient, including edges or smooth intensity variations on mostly white walls. The proposed model integrates a full photometric calibration, accounting for exposure time, lens vignetting, and non-linear response functions. We thoroughly evaluate our method on three different datasets comprising several hours of video. The experiments show that the presented approach significantly outperforms state-of-the-art direct and indirect methods in a variety of real-world settings, both in terms of tracking accuracy and robustness.

> Direct Sparse Odometry (J. Engel, V. Koltun, D. Cremers), In arXiv:1607.02565, 2016.

> A Photometrically Calibrated Benchmark For Monocular Visual Odometry (J. Engel, V. Usenko, D. Cremers), In arXiv:1607.02555, 2016.
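The photometric error that direct methods such as DSO minimize can be illustrated with a much-reduced sketch. The version below compares the intensities of a small pixel pattern in a reference frame with the intensities at the corresponding (already warped) locations in a target frame. It compensates only for exposure time and applies a Huber norm. Everything here is an assumption made for illustration: the function names, the `delta` threshold, and the omission of the warp itself, of vignetting, of the response function, and of any affine brightness terms that a full model would include.

```python
def huber(residual, delta):
    """Huber norm: quadratic near zero, linear for large residuals."""
    a = abs(residual)
    if a <= delta:
        return 0.5 * residual * residual
    return delta * (a - 0.5 * delta)


def photometric_error(ref_patch, tgt_patch, t_ref=1.0, t_tgt=1.0, delta=9.0):
    """Sum of robustified intensity differences over a pixel pattern.

    ref_patch, tgt_patch: intensities at corresponding pixels (warp assumed done).
    t_ref, t_tgt: exposure times of the two frames.
    """
    scale = t_tgt / t_ref  # brightness ratio expected from the exposure change
    return sum(huber(i_tgt - scale * i_ref, delta)
               for i_ref, i_tgt in zip(ref_patch, tgt_patch))


# Same scene, target frame exposed twice as long: the error vanishes.
print(photometric_error([10.0, 20.0], [20.0, 40.0], t_ref=1.0, t_tgt=2.0))
```

A direct method would minimize the sum of such terms over many pixels with respect to camera pose and inverse depth. The robust norm keeps occlusions and moving objects from dominating the cost.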

SVO 2.0

> C. Forster, Z. Zhang, M. Gassner, M. Werlberger, and D. Scaramuzza. SVO 2.0: Semi-direct visual odometry for monocular and multi-camera systems. IEEE Transactions on Robotics, accepted, January 2016.

> C. Forster, M. Pizzoli, and D. Scaramuzza. SVO: Fast Semi-Direct Monocular Visual Odometry. In IEEE Intl. Conf. on Robotics and Automation (ICRA), 2014. doi:10.1109/ICRA.2014.6906584.

Typical stereo SLAM systems

LIBVISO2: http://www.cvlibs.net/software/libviso/

LIBVISO2 (Library for Visual Odometry 2) is a very fast cross-platform (Linux, Windows) C++ library with MATLAB wrappers for computing the 6 DOF motion of a moving mono/stereo camera. The stereo version is based on minimizing the reprojection error of sparse feature matches and is rather general (no motion model or setup restrictions except that the input images must be rectified and calibration parameters are known). The monocular version is still very experimental and uses the 8-point algorithm for fundamental matrix estimation. It further assumes that the camera is moving at a known and fixed height over ground (for estimating the scale). Due to the 8 correspondences needed for the 8-point algorithm, many more RANSAC samples need to be drawn, which makes the monocular algorithm slower than the stereo algorithm, for which 3 correspondences are sufficient to estimate parameters.

> Geiger A, Ziegler J, Stiller C. Stereoscan: Dense 3d reconstruction in real-time[C]//Intelligent Vehicles Symposium (IV), 2011 IEEE. IEEE, 2011: 963-968.

> Kitt B, Geiger A, Lategahn H. Visual odometry based on stereo image sequences with RANSAC-based outlier rejection scheme[C]//Intelligent Vehicles Symposium. 2010: 486-492.
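The remark in the LIBVISO2 description that 8-point samples need many more RANSAC draws than 3-point samples follows from the standard RANSAC iteration bound. A small sketch of that bound is given below. The function name, the 50% inlier ratio, and the 99% confidence are illustrative assumptions, not LIBVISO2's settings.

```python
import math


def ransac_iterations(sample_size, inlier_ratio=0.5, confidence=0.99):
    """Number of random samples needed so that, with the given confidence,
    at least one sample contains only inliers."""
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must lie strictly between 0 and 1")
    p_good_sample = inlier_ratio ** sample_size   # chance that one sample is all inliers
    if p_good_sample <= 0.0:
        raise ValueError("no inliers: no number of samples is enough")
    if p_good_sample >= 1.0:
        return 1
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - p_good_sample))


for size in (3, 8):
    print(size, ransac_iterations(size))
```

Under these assumed settings the 3-point case needs a few dozen samples while the 8-point case needs more than a thousand, because the probability of an all-inlier sample shrinks exponentially with the sample size.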

ORB-SLAM2: https://github.com/raulmur/ORB_SLAM2

ORB-SLAM2 is a real-time SLAM library for Monocular, Stereo and RGB-D cameras that computes the camera trajectory and a sparse 3D reconstruction (in the stereo and RGB-D case with true scale). It is able to detect loops and relocalize the camera in real time. We provide examples to run the SLAM system in the KITTI dataset as stereo or monocular, and in the TUM dataset as RGB-D or monocular.

S-PTAM: Stereo Parallel Tracking and Mapping: https://github.com/lrse/sptam

S-PTAM is a Stereo SLAM system able to compute the camera trajectory in real-time. It heavily exploits the parallel nature of the SLAM problem, separating the time-constrained pose estimation from less pressing matters such as map building and refinement tasks. On the other hand, the stereo setting allows to reconstruct a metric 3D map for each frame of stereo images, improving the accuracy of the mapping process with respect to monocular SLAM and avoiding the well-known bootstrapping problem. Also, the real scale of the environment is an essential feature for robots which have to interact with their surrounding workspace.

> Taihú Pire, Thomas Fischer, Javier Civera, Pablo De Cristóforis and Julio Jacobo Berlles. Stereo Parallel Tracking and Mapping for Robot Localization Proc. of The International Conference on Intelligent Robots and Systems (IROS) (Accepted), Hamburg, Germany, 2015.
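The reason a stereo rig yields a metric map directly, as the S-PTAM description notes, is that depth follows from disparity once the baseline is known. Below is a minimal sketch for a rectified stereo pair. The function name and the example focal length and baseline are made up for illustration.

```python
def stereo_depth(disparity_px, focal_px, baseline_m):
    """Depth in metres of a point seen in a rectified stereo pair.

    disparity_px: horizontal pixel offset between the left and right image.
    focal_px:     focal length in pixels.
    baseline_m:   distance between the two camera centers in metres.
    """
    if disparity_px <= 0:
        raise ValueError("disparity must be positive for a point in front of the rig")
    return focal_px * baseline_m / disparity_px


# Hypothetical rig: 700 px focal length, 0.5 m baseline, 35 px disparity.
print(stereo_depth(35.0, 700.0, 0.5))
```

Because depth is inversely proportional to disparity, a fixed matching error of a fraction of a pixel causes a depth error that grows quickly with distance. This is one reason stereo systems are most accurate at close range.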

ORBSLAM_DWO: https://github.com/JzHuai0108/ORB_SLAM

ORBSLAM_DWO is developed on top of ORB-SLAM with double window optimization by Jianzhu Huai. The major differences from ORB-SLAM are: (1) it can run with or without ROS, (2) it does not use the modified version of g2o shipped in ORB-SLAM, instead it uses the g2o from github, (3) it uses Eigen vectors and Sophus members instead of OpenCV Mat to represent pose entities, (4) it incorporates the pinhole camera model from rpg_vikit and a decay velocity motion model from Stereo PTAM, (5) currently, it supports monocular, stereo, and stereo + inertial input for SLAM; note that it does not work with monocular + inertial input.

Faster than real time visual odometry: https://github.com/halismai/bpvo

A library for (semi-dense) real-time visual odometry from stereo data using direct alignment of feature descriptors. There are two descriptors implemented. The first is raw intensity (no descriptor), which runs in real time or faster. The second is an implementation of the Bit-Planes descriptor, designed for robust performance under challenging illumination conditions.

PL-StVO: Stereo Visual Odometry by combining point and line segment features

> Gómez-Ojeda R, González-Jiménez J. Robust Stereo Visual Odometry through a Probabilistic Combination of Points and Line Segments[J]. 2016.

ScaViSLAM
This is a general and scalable framework for visual SLAM. It employs "Double Window Optimization" (DWO) as described in our ICCV paper:
> H. Strasdat, A.J. Davison, J.M.M. Montiel, and K. Konolige "Double Window Optimisation for Constant Time Visual SLAM" Proceedings of the IEEE International Conference on Computer Vision, 2011.

Loop-closure detection

DLoopDetector: DLoopDetector is an open source C++ library to detect loops in a sequence of images collected by a mobile robot. It implements the algorithm presented in GalvezTRO12, based on a bag-of-words database created from image local descriptors, and temporal and geometrical constraints. The current implementation includes versions to work with SURF64 and BRIEF descriptors. DLoopDetector is based on the DBoW2 library, so that it can work with any other type of descriptor with little effort.

> Bags of Binary Words for Fast Place Recognition in Image Sequences. D Gálvez-López, JD Tardos. IEEE Transactions on Robotics 28 (5), 1188-1197, 2012.

> DBoW2: DBoW2 is an improved version of the DBow library, an open source C++ library for indexing and converting images into a bag-of-word representation.
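The bag-of-words idea behind these libraries can be sketched in a few lines. Each image is reduced to a normalized histogram of visual-word IDs, and two images are compared with an L1-based score. This is a simplified illustration and not the DBoW2 API. The function names are hypothetical, word IDs are assumed to come from some already-trained vocabulary, and the tf-idf weighting and vocabulary tree that a real implementation would use are left out.

```python
from collections import Counter


def bow_vector(word_ids):
    """Normalized histogram of visual-word IDs for one image."""
    counts = Counter(word_ids)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {word: count / total for word, count in counts.items()}


def l1_score(v1, v2):
    """Similarity in [0, 1]: 1 for identical histograms, 0 for disjoint ones."""
    words = set(v1) | set(v2)
    diff = sum(abs(v1.get(w, 0.0) - v2.get(w, 0.0)) for w in words)
    return 1.0 - 0.5 * diff


current = bow_vector([1, 1, 2])      # word IDs seen in the current image
candidate = bow_vector([1, 2, 2])    # word IDs seen in an earlier keyframe
print(l1_score(current, candidate))
```

A loop detector would compute this score against many earlier images and accept a candidate only after further checks, such as the temporal and geometrical constraints mentioned above for DLoopDetector.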

FAB-MAP: FAB-MAP is a Simultaneous Localisation and Mapping algorithm which operates solely in appearance space. FAB-MAP performs location matching between places that have been visited within the world as well as providing a measure of the probability of being at a new, previously unvisited location. Camera images form the sole input to the system, from which OpenCV's feature extraction methods are used to develop bag-of-words representations for the Bayesian comparison technique.

Optimization libraries

g2o: g2o is an open-source C++ framework for optimizing graph-based nonlinear error functions. g2o has been designed to be easily extensible to a wide range of problems and a new problem typically can be specified in a few lines of code. The current implementation provides solutions to several variants of SLAM and BA.
Ceres Solver: Ceres Solver is an open source C++ library for modeling and solving large, complicated optimization problems. It is a feature rich, mature and performant library which has been used in production at Google since 2010. Ceres Solver can solve two kinds of problems.
1. Non-linear Least Squares problems with bounds constraints.
2. General unconstrained optimization problems.
GTSAM: GTSAM is a library of C++ classes that implement smoothing and mapping (SAM) in robotics and vision, using factor graphs and Bayes networks as the underlying computing paradigm rather than sparse matrices. On top of the C++ library, GTSAM includes a MATLAB interface (enable GTSAM_INSTALL_MATLAB_TOOLBOX in CMake to build it). A Python interface is under development.
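To give a feel for the kind of problem these libraries solve, here is a toy one-dimensional pose graph in plain Python. It deliberately does not use the g2o, Ceres, or GTSAM APIs. The function name, the edge format `(i, j, z)`, and the choice to fix pose 0 at the origin are all assumptions made for this sketch. Each edge says that pose `j` should lie `z` units beyond pose `i`, and the least-squares solution spreads the inconsistency between odometry and a loop closure over all poses.

```python
def optimize_pose_graph_1d(num_poses, edges):
    """Least-squares poses for 1D relative measurements; pose 0 is fixed at 0.

    edges: list of (i, j, z), meaning x_j - x_i should equal z.
    """
    n = num_poses - 1                       # unknowns are x_1 .. x_{num_poses-1}
    H = [[0.0] * n for _ in range(n)]       # normal-equation matrix J^T J
    g = [0.0] * n                           # right-hand side J^T z
    for i, j, z in edges:
        terms = ((i, -1.0), (j, 1.0))       # Jacobian of x_j - x_i
        for a, sa in terms:
            if a == 0:
                continue                    # pose 0 is held fixed
            g[a - 1] += sa * z
            for b, sb in terms:
                if b != 0:
                    H[a - 1][b - 1] += sa * sb
    # Solve H x = g by Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(H[r][col]))
        if abs(H[pivot][col]) < 1e-12:
            raise ValueError("some pose is not connected to pose 0")
        H[col], H[pivot] = H[pivot], H[col]
        g[col], g[pivot] = g[pivot], g[col]
        for r in range(col + 1, n):
            f = H[r][col] / H[col][col]
            for c in range(col, n):
                H[r][c] -= f * H[col][c]
            g[r] -= f * g[col]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (g[r] - sum(H[r][c] * x[c] for c in range(r + 1, n))) / H[r][r]
    return [0.0] + x


# Two odometry steps of 1.0 each, plus a loop closure that measures 1.8 in total.
print(optimize_pose_graph_1d(3, [(0, 1, 1.0), (1, 2, 1.0), (0, 2, 1.8)]))
```

Real SLAM back ends handle poses on SE(3) or Sim(3), weight each edge by its covariance, and iterate because the problem is nonlinear. The libraries above also exploit the sparsity of the system instead of using a dense solve like this one.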

Visual Odometry / SLAM Evaluation

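A website that evaluates the accuracy and performance of the mainstream VO and SLAM systems

SLAM datasets

RGB-D SLAM Dataset and Benchmark: from TUM, recorded with a Kinect
TUM monoVO dataset
KITTI Vision Benchmark Suite: data collected on urban roads with a platform carrying four cameras, high-precision GPS, and a lidar
Karlsruhe dataset sequence (stereo): http://www.cvlibs.net/datasets/karlsruhe_sequences/
The EuRoC MAV Dataset: from ETH, a stereo dataset collected with a quadrotor equipped with a VI-Sensor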
MIT Stata Center Data Set: http://projects.csail.mit.edu/stata/index.php

Survey-related references

[1] Cadena, Cesar, et al. "Simultaneous Localization And Mapping: Present, Future, and the Robust-Perception Age." arXiv preprint arXiv:1606.05830 (2016). (A recent major SLAM survey by Davide Scaramuzza and colleagues, with around 300 references.)

[2] Strasdat H, Montiel J M M, Davison A J. Visual SLAM: why filter?[J]. Image and Vision Computing, 2012, 30(2): 65-77.

[3] Visual Odometry Part I The First 30 Years and Fundamentals

[4] Visual odometry Part II Matching, robustness, optimization, and applications

[5] Davide Scaramuzza: Tutorial on Visual Odometry

[6] Factor Graphs and GTSAM: A Hands-on Introduction

[7] Aulinas J, Petillot Y R, Salvi J, et al. The SLAM problem: a survey[C]//CCIA. 2008: 363-371.

[8] Grisetti G, Kummerle R, Stachniss C, et al. A tutorial on graph-based SLAM[J]. IEEE Intelligent Transportation Systems Magazine, 2010, 2(4): 31-43.

[9] Saeedi S, Trentini M, Seto M, et al. Multiple‐Robot Simultaneous Localization and Mapping: A Review[J]. Journal of Field Robotics, 2016, 33(1): 3-46.

[10] Lowry S, Sünderhauf N, Newman P, et al. Visual place recognition: A survey[J]. IEEE Transactions on Robotics, 2016, 32(1): 1-19.

[11] Georges Younes, Daniel Asmar, Elie Shammas. A survey on non-filter-based monocular Visual SLAM systems. Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO), 2016. (Summarizes, module by module, the methods used in the currently open-source monocular SLAM systems: PTAM, SVO, DT SLAM, LSD SLAM, ORB SLAM, and DPPTAM.)
