扫地的小何尚

NVIDIA 现实增强AR开发工具包开发手册----Maxine AR SDK开发手册中文版

NVIDIA_Maxine_AR_SDK_API

点击此处加入NVIDIA开发者计划

本节提供有关 NVIDIA® AR SDK API 架构的信息。

1.1. Using the NVIDIA AR SDK in Applications

使用 NVIDIA AR SDK 使应用程序能够使用 SDK 的面部跟踪、面部特征点跟踪、3D 面部网格跟踪和 3D 身体姿势跟踪功能。

1.2. Creating an Instance of a Feature Type

功能类型是用于访问 SDK 功能的预定义结构。每个特性都需要特性类型的实例化。

创建特征类型的实例提供了对加载特征类型实例时使用的配置参数, 以及运行特征类型实例时在运行时提供的输入和输出参数的访问。

为NvAR_FeatureHandle结构分配内存。

    NvAR_FeatureHandle faceDetectHandle{};

调用NvAR_Create()函数。

在对函数的调用中，传递以下信息：
- NvAR_FeatureID枚举的值，用于标识特征类型。
- 指向您声明为NvAR_FeatureHandle结构分配内存的变量的指针。
要创建人脸检测特征类型的实例，请运行以下示例：

此函数创建功能实例的句柄，在函数调用中需要该句柄以获取和设置实例的属性, 以及加载、运行或销毁实例。

    NvAR_Create(NvAR_Feature_FaceDetection, &faceDetectHandle)

1.3. Getting and Setting Properties for a Feature Type

要准备加载和运行特征类型的实例，您需要设置实例所需的属性。

以下是一些属性：

加载特征类型所需的配置属性。
运行特征类型的实例时要在运行时提供的输入和输出属性。

完整列表，请参阅特征类型属性中的关键值。

为了设置属性，NVIDIA AR SDK 提供了类型安全的设置访问器函数。如果您需要已由 set 访问器函数设置的属性的值，请使用相应的 get 访问器函数。有关获取和设置函数的完整列表，请参阅NVIDIA AR SDK 访问器函数摘要。

1.3.1. Setting Up the CUDA Stream

某些 SDK 功能需要运行在 CUDA 流中。有关详细信息，请参阅NVIDIA CUDA 工具包文档。

通过调用以下函数之一初始化 CUDA 流：
- CUDA 运行时 API 函数cudaStreamCreate()
- NvAR_CudaStreamCreate()
您可以使用第二个函数来避免与 NVIDIA CUDA Toolkit 库链接。
调用NvAR_SetCudaStream()函数并提供以下信息作为参数：
- 创建的过滤器句柄。
  
  请参阅创建特征类型的实例。
- 关键值NVAR_Parameter_Config(CUDAStream) 。
  
  请参阅特征类型属性中的关键值。
- 您在上一步中创建的 CUDA 流。
此示例设置通过调用NvAR_CudaStreamCreate()函数创建的 CUDA 流：

CUstream stream;
nvErr = NvAR_CudaStreamCreate (&stream);
nvErr = NvAR_SetCudaStream(featureHandle, NVAR_Parameter_Config(CUDAStream), stream);

1.3.2. Summary of NVIDIA AR SDK Accessor Functions

下表提供了有关 SDK 访问器函数的详细信息。

Table 1. AR SDK Accessor Functions
Property Type	Data Type	Set and Get Accessor Function
32-bit unsigned integer	unsigned int	NvAR_SetU32()
32-bit unsigned integer	unsigned int	NvAR_GetU32()
32-bit signed integer	int	NvAR_SetS32()
32-bit signed integer	int	NvAR_GetS32()
Single-precision (32-bit) floating-point number	float	NvAR_SetF32()
Single-precision (32-bit) floating-point number	float	NvAR_GetF32()
Double-precision (64-bit) floating point number	double	NvAR_SetF64()
Double-precision (64-bit) floating point number	double	NvAR_GetF64()
64-bit unsigned integer	unsigned long long	NvAR_SetU64()
64-bit unsigned integer	unsigned long long	NvAR_GetU64()
Floating-point array	float*	NvAR_SetFloatArray()
Floating-point array	float*	NvAR_GetFloatArray()
Object	void*	NvAR_SetObject()
Object	void*	NvAR_GetObject()
Character string	const char*	NvAR_SetString()
Character string	const char*	NvAR_GetString()
CUDA stream	CUstream	NvAR_SetCudaStream()
CUDA stream	CUstream	NvAR_GetCudaStream()

1.3.3. Key Values in the Properties of a Feature Type

特征类型的属性中的关键值标识可用于每种特征类型的属性。每个键都有一个等效的字符串，并由一个宏定义，该宏指示属性的类别并将名称作为宏的输入。
以下是指示属性类别的宏：

NvAR_Parameter_Config表示配置属性。

请参阅配置属性。
NvAR_Parameter_Input表示输入属性。

请参阅输入属性。
NvAR_Parameter_Output表示输出属性。

请参阅输出属性。

这些名称是固定关键字，列在nvAR_defs.h中。根据属性是输入、输出还是配置属性，关键字可能会与不同的宏一起使用。

属性类型表示要设置和获取属性的访问器函数，如NVIDIA AR SDK 访问器函数摘要表中所列。

1.3.3.1. Configuration Properties

以下是 AR SDK 中的配置属性：

NvAR_Parameter_Config(FeatureDescription)

特征类型的描述。

等效字符串： NvAR_Parameter_Config_FeatureDescription

属性类型：character string (const char*)

NvAR_Parameter_Config(CUDAStream)

运行该功能的 CUDA 流。

等效字符串： NvAR_Parameter_Config_CUDAStream

属性类型：CUDA 流 ( CUstream )

NvAR_Parameter_Config(ModelDir)

包含将用于运行推理以进行人脸检测或特征点检测的 TensorRT 模型文件的目录路径，以及包含 3D 人脸模型的 .nvf 文件，不包括模型文件名。有关 .nvf 文件格式的详细信息，请参阅NVIDIA 3DMM 文件格式。

等效字符串： NvAR_Parameter_Config_ModelDir
属性类型：character string (const char*)

NvAR_Parameter_Config(BatchSize)

在 GPU 上, 一次运行的推理次数。

等效字符串： NvAR_Parameter_Config_BatchSize

属性类型：unsigned integer

NvAR_Parameter_Config(Landmarks_Size)

包含检测到的特征点的 X 和 Y 坐标（以像素为单位）的输出缓冲区的长度。此属性仅适用于特征点检测功能。

等效字符串： NvAR_Parameter_Config_Landmarks_Size

属性类型：unsigned integer

NvAR_Parameter_Config(LandmarksConfidence_Size)

包含检测到的特征点的置信度值的输出缓冲区的长度。此属性仅适用于特征点检测功能。

等效字符串： NvAR_Parameter_Config_LandmarksConfidence_Size

属性类型：unsigned integer

NvAR_Parameter_Config(Temporal)

标记以启用对时间输入帧的优化。当输入为视频时启用该标志。

等效字符串： NvAR_Parameter_Config_Temporal
属性类型：unsigned integer

NvAR_Parameter_Config(ShapeEigenValueCount)

用于描述形状的特征值的数量。

等效字符串： NvAR_Parameter_Config_ShapeEigenValueCount

属性类型：unsigned integer

NvAR_Parameter_Config(ExpressionCount)

用于表示表达式的系数的数量。

等效字符串： NvAR_Parameter_Config_ExpressionCount

属性类型：unsigned integer

NvAR_Parameter_Config(FocalLength)

用于 3D Body Pose 的相机焦距。

等效字符串： NvAR_Parameter_Config_FocalLength

属性类型：float

NvAR_Parameter_Config(UseCudaGraph)

启用 CUDA 图形优化的标志。 CUDA 图减少了 3D 人体跟踪的 GPU 操作提交的开销。

等效字符串： NvAR_Parameter_Config_UseCudaGraph

属性类型：bool

NvAR_Parameter_Config(Mode)

为 3D 身体姿势选择高性能或高质量的模式。

等效字符串： NvAR_Parameter_Config_Mode

属性类型：unsigned int

NvAR_Parameter_Config(ReferencePose)

NvAR_Point3f 类型的 CPU 缓冲区，用于保存 3D 身体姿势的关节旋转的参考姿势。

等效字符串： NvAR_Parameter_Config_ReferencePose
属性类型：object (void*)

1.3.3.2. Input Properties

以下是 AR SDK 中的输入属性：

NvAR_Parameter_Input(Image)

NvCVImage类型的 GPU 输入图像缓冲区。

等效字符串： NvAR_Parameter_Input_Image

属性类型：object (void*)

NvAR_Parameter_Input(Width)

输入图像缓冲区的宽度（以像素为单位）。

等效字符串： NvAR_Parameter_Input_Width

属性类型：integer

NvAR_Parameter_Input(Height)

输入图像缓冲区的高度（以像素为单位）。

等效字符串： NvAR_Parameter_Input_Height

属性类型：integer

NvAR_Parameter_Input(Landmarks)
包含面部标志点的NvAR_Point2f类型的 CPU 输入数组。

等效字符串： NvAR_Parameter_Input_Landmarks

属性类型：object (void*)

NvAR_Parameter_Input(BoundingBoxes)

确定包含NvAR_BBoxes类型人脸的输入图像的感兴趣区域 (ROI) 的边界框。

等效字符串： NvAR_Parameter_InputBoundingBoxes

属性类型：object (void*)

1.3.3.3. Output Properties

以下是 AR SDK 中的输出属性：

NvAR_Parameter_Output(BoundingBoxes)

CPU 输出 NvAR_BBoxes 类型的边界框。

等效字符串： NvAR_Parameter_Output_BoundingBoxes
属性类型：object (void*)

NvAR_Parameter_Output(BoundingBoxesConfidence)

每个返回的边界框的置信度值的浮点数组。

等效字符串： NvAR_Parameter_Output_BoundingBoxesConfidence

属性类型：floating point array

NvAR_Parameter_Output(Landmarks)

NvAR_Point2f类型的 CPU 输出缓冲区，用于保存输出检测到的关键点。有关详细信息，请参阅面部点注释。 CPU 缓冲区中点的顺序遵循 MultiPIE 68 点标记中的顺序，126 点覆盖了沿着脸颊、眼睛和嘴的更多点。

等效字符串： NvAR_Parameter_Output_Landmarks

属性类型：object (void*)

NvAR_Parameter_Output(LandmarksConfidence)

每个检测到的地标点的置信度浮点数组。

等效字符串： NvAR_Parameter_Output_LandmarksConfidence

属性类型：floating point array

NvAR_Parameter_Output(Pose)

NvAR_Quaternion类型的 CPU 数组将输出检测到的姿势保存为 XYZW 四元数。

等效字符串： NvAR_Parameter_Output_Pose

属性类型：object (void*)

NvAR_Parameter_Output(FaceMesh)

NvAR_FaceMesh类型的 CPU 3D 面部网格。

等效字符串： NvAR_Parameter_Output_FaceMesh

属性类型：object (void*)

NvAR_Parameter_Output(RenderingParams)

NvAR_RenderingParams类型的 CPU 输出结构，其中包含可用于渲染 3D 面部网格的渲染参数。

等效字符串： NvAR_Parameter_Output_RenderingParams

属性类型：object (void*)

NvAR_Parameter_Output(ShapeEigenValues)

形状特征值的浮点数组。获取NvAR_Parameter_Config(ShapeEigenValueCount)以确定有多少个特征值。

等效字符串： NvAR_Parameter_Output_ShapeEigenValues

属性类型：const floating point array

NvAR_Parameter_Output(ExpressionCoefficients)

表达系数的浮点数组。获取NvAR_Parameter_Config(ExpressionCount)以确定有多少个系数。

等效字符串： NvAR_Parameter_Output_ExpressionCoefficients

属性类型：const floating point array

NvAR_Parameter_Output(KeyPoints)

NvAR_Point2f类型的 CPU 输出缓冲区，用于保存检测到的身体姿势的 2D 关键点的输出。有关关键点名称和关键点输出顺序的信息，请参阅3D 身体姿势关键点格式。

等效字符串： NvAR_Parameter_Output_KeyPoints

属性类型：object (void*)

NvAR_Parameter_Output(KeyPoints3D)

NvAR_Point3f类型的 CPU 输出缓冲区，用于保存检测到的身体姿势 3D 关键点的输出。有关关键点名称和关键点输出顺序的信息，请参阅3D 身体姿势关键点格式。

等效字符串： NvAR_Parameter_Output_KeyPoints3D

属性类型：object (void*)

NvAR_Parameter_Output(JointAngles)

NvAR_Point3f类型的 CPU 输出缓冲区，用于保存身体姿势关键点的轴角格式的关节角度。

等效字符串： NvAR_Parameter_Output_JointAngles

属性类型：object (void*)

NvAR_Parameter_Output(KeyPointsConfidence)

每个检测到的关键点的置信度浮点数组。

等效字符串： NvAR_Parameter_Output_KeyPointsConfidence

属性类型：floating point array

NvAR_Parameter_Output(KeyPoints)

NvAR_Point2f类型的 CPU 输出缓冲区，用于保存输出检测到的 3D 身体姿势的 2D 关键点。有关信息，请参阅3D 身体姿势关键点格式。 CPU 缓冲区中点的顺序遵循3D Body Pose Keypoint Format中提到的顺序。

等效字符串： NvAR_Parameter_Output_KeyPoints

属性类型：object (void*)

NvAR_Parameter_Output(KeyPoints3D)

NvAR_Point3f 类型的 CPU 输出缓冲区，用于保存输出检测到的 3D 身体姿势的 3D 关键点。有关信息，请参阅3D 身体姿势关键点格式。 CPU 缓冲区中点的顺序遵循3D Body Pose Keypoint Format中提到的顺序。

等效字符串： NvAR_Parameter_Output_KeyPoints3D

属性类型：object (void*)

NvAR_Parameter_Output(JointAngles)

NvAR_Quaternion类型的 CPU 输出缓冲区用于保存 3D 身体姿势的关节旋转输出。

等效字符串： NvAR_Parameter_Output_JointAngles
属性类型：object (void*)

NvAR_Parameter_Output(KeyPointsConfidence)

每个检测到的 3D 身体姿势关键点的置信度浮点数组。

等效字符串： NvAR_Parameter_Output_KeyPointsConfidence

属性类型：floating point array

1.3.4. Getting the Value of a Property of a Feature

要获取特征属性的值，请调用适用于属性数据类型的 get 访问器函数。

在对函数的调用中，传递以下信息：

特征实例的特征句柄。
标识您正在获取的属性的键值。
您希望写入属性值的内存位置。

此示例确定地标检测功能返回的NvAR_Point2f输出缓冲区的长度：

unsigned int OUTPUT_SIZE_KPTS;
NvAR_GetU32(landmarkDetectHandle, NvAR_Parameter_Config(Landmarks_Size), &OUTPUT_SIZE_KPTS);

1.3.5. Setting a Property for a Feature

以下步骤说明了如何设置功能的属性。

为功能所需的所有输入和输出以及可能需要的任何其他属性分配内存。
调用适合属性数据类型的 set 访问器函数。

在对函数的调用中，传递以下信息：
- 特征实例的特征句柄。
- 标识您正在设置的属性的键值。
- 指向要设置属性的值的指针。
  
  此示例将文件路径设置为包含输出 3D 人脸模型的文件：

const char *modelPath = "file/path/to/model";
NvAR_SetString(landmarkDetectHandle, NvAR_Parameter_Config(ModelDir), modelPath);

此示例在 GPU 内存中设置输入图像缓冲区，这是人脸检测功能所需的：

注意：它设置了一个 8 位大块/交错 BGR 数组。

NvCVImage InputImageBuffer;
NvCVImage_Alloc(&inputImageBuffer, input_image_width, input_image_height, NVCV_BGR, NVCV_U8, NVCV_CHUNKY, NVCV_GPU, 1) ;
NvAR_SetObject(landmarkDetectHandle, NvAR_Parameter_Input(Image), &InputImageBuffer, sizeof(NvCVImage));

每个功能的属性以及输入和输出要求的更多信息，请参阅AR 功能的属性列表。

注意：列出的属性名称是定义属性键值的宏的输入。

1.3.6. Loading a Feature Instance

您可以在设置加载特征类型实例所需的配置属性后加载该特征。

要加载特征实例，请调用NvAR_Load()函数并指定在创建实例时为特征实例创建的句柄。有关详细信息，请参阅创建特征类型的实例。

此示例加载人脸检测特征类型的实例：

NvAR_Load(faceDetectHandle);

1.3.7. Running a Feature Instance

在运行功能实例之前，请加载功能类型的实例并设置运行功能实例时所需的用户分配的输入和输出内存缓冲区。

要运行特征实例，请调用NvAR_Run()函数并指定在创建实例时为特征实例创建的句柄。有关详细信息，请参阅创建特征类型的实例。

此示例显示如何运行人脸检测功能实例：

NvAR_Run(faceDetectHandle);

1.3.8. Destroying a Feature Instance

当不再需要某个功能实例时，您需要将其销毁以释放该功能实例内部分配的资源和内存。

内存缓冲区作为输入提供并保存功能的输出，并且必须单独释放。

要销毁特征实例，请调用NvAR_Destroy()函数并指定在创建实例时为特征实例创建的句柄。有关详细信息，请参阅创建特征类型的实例。

1.4. Working with Image Frames on GPU or CPU Buffers

效果过滤器接受图像缓冲区作为NvCVImage对象。图像缓冲区可以是 CPU 或 GPU 缓冲区，但出于性能原因，效果过滤器需要 GPU 缓冲区。 AR SDK 提供了将图像表示转换为NvCVImage以及在 CPU 和 GPU 缓冲区之间传输图像的功能。

有关 NvCVImage 的更多信息，请参阅NvCVImage API 指南。本节简要介绍了 AR SDK 中最常用的功能。

1.4.1. Converting Image Representations to NvCVImage Objects

您可以使用 AR SDK 专门为 RGB OpenCV 图像提供的包装函数。

注意： AR SDK 只为 RGB 图像提供包装函数。没有为 YUV 图像提供包装函数。

要为 OpenCV 图像创建NvCVImage对象包装器，请使用NVWrapperForCVMat()函数。

//Allocate source and destination OpenCV images
cv::Mat srcCVImg(   );
cv::Mat dstCVImg(...);
 
// Declare source and destination NvCVImage objects
NvCVImage srcCPUImg;
NvCVImage dstCPUImg;
 
NVWrapperForCVMat(&srcCVImg, &srcCPUImg);
NVWrapperForCVMat(&dstCVImg, &dstCPUImg);

NvCVImage对象创建 OpenCV 图像包装器，请使用CVWrapperForNvCVImage()函数。

// Allocate source and destination NvCVImage objects
NvCVImage srcCPUImg(...);
NvCVImage dstCPUImg(...);
 
//Declare source and destination OpenCV images
cv::Mat srcCVImg;
cv::Mat dstCVImg;
 
CVWrapperForNvCVImage (&srcCPUImg, &srcCVImg);
CVWrapperForNvCVImage (&dstCPUImg, &dstCVImg);

1.4.1.2. Converting Other Image Representations to NvCVImage Objects

要转换其他图像表示，请调用NvCVImage_Init()函数在现有缓冲区 ( srcPixelBuffer ) 周围放置一个包装器。

NvCVImage src_gpu;
vfxErr = NvCVImage_Init(&src_gpu, 640, 480, 1920, srcPixelBuffer, NVCV_BGR, NVCV_U8, NVCV_INTERLEAVED, NVCV_GPU);
NvCVImage src_cpu;
vfxErr = NvCVImage_Init(&src_cpu, 640, 480, 1920, srcPixelBuffer, NVCV_BGR, NVCV_U8, NVCV_INTERLEAVED, NVCV_CPU);

1.4.1.3. Converting Decoded Frames from the NvDecoder to NvCVImage Objects

要将 NVDecoder 中的解码帧转换为 NvCVImage对象，请调用NvCVImage_Transfer()函数将NvDecoder提供的解码帧从解码像素格式转换为 AR SDK 功能所需的格式。

以下示例显示了从NV12转换为BGRA像素格式的解码帧。

NvCVImage decoded_frame, BGRA_frame, stagingBuffer;
NvDecoder dec;
 
//Initialize decoder...
//Assuming dec.GetOutputFormat() == cudaVideoSurfaceFormat_NV12
 
//Initialize memory for decoded frame
NvCVImage_Init(&decoded_frame, dec.GetWidth(), dec.GetHeight(), dec.GetDeviceFramePitch(), NULL, NVCV_YUV420, NVCV_U8, NVCV_NV12, NVCV_GPU, 1);
decoded_frame.colorSpace = NVCV_709 | NVCV_VIDEO_RANGE | NVCV_CHROMA_COSITED;
 
//Allocate memory for BGRA frame, and set alpha opaque
NvCVImage_Alloc(&BGRA_frame, dec.GetWidth(), dec.GetHeight(), NVCV_BGRA, NVCV_U8, NVCV_CHUNKY, NVCV_GPU, 1);
cudaMemset(BGRA_frame.pixels, -1, BGRA_frame.pitch * BGRA_frame.height);
 
decoded_frame.pixels = (void*)dec.GetFrame();
 
//Convert from decoded frame format(NV12) to desired format(BGRA)
NvCVImage_Transfer(&decoded_frame, &BGRA_frame, 1.f, stream, & stagingBuffer);

注意：上面的示例假定了高清内容的典型色彩空间规范。 SD 通常使用NVCV_601 。有 8 种可能的组合，您应该使用与视频标题中描述的视频相匹配的组合，或者通过反复试验继续进行。

以下是一些附加信息：

如果颜色不正确，请交换 709<->601。
如果它们被冲掉，请交换 VIDEO<->FULL。
如果颜色水平移动，则交换 INTSTITIAL<->COSITED。

1.4.1.4. Converting an NvCVImage Object to a Buffer that can be Encoded by NvEncoder

要通过NvEncoder将NvCVImage转换为在编码期间使用的像素格式，如有必要，请调用NvCVImage_Transfer()函数。

以下示例显示了以 BGRA 像素格式编码的帧。

convert-nvcvimage-obj-buffer-encoded-nvencoderThe following sample shows a frame that is encoded in the BGRA pixel format.
//BGRA frame is 4-channel, u8 buffer residing on the GPU
NvCVImage BGRA_frame;
NvCVImage_Alloc(&BGRA_frame, dec.GetWidth(), dec.GetHeight(), NVCV_BGRA, NVCV_U8, NVCV_CHUNKY, NVCV_GPU, 1);
//Initialize encoder with a BGRA output pixel format
using NvEncCudaPtr = std::unique_ptr>;
NvEncCudaPtr pEnc(new NvEncoderCuda(cuContext, dec.GetWidth(), dec.GetHeight(), NV_ENC_BUFFER_FORMAT_ARGB));
pEnc->CreateEncoder(&initializeParams);
//...
 
std::vector> vPacket;
//Get the address of the next input frame from the encoder
const NvEncInputFrame* encoderInputFrame = pEnc->GetNextInputFrame();
 
 
//Copy the pixel data from BGRA_frame into the input frame address obtained above
NvEncoderCuda::CopyToDeviceFrame(cuContext,
                        BGRA_frame.pixels,
            	        BGRA_frame.pitch,
                        (CUdeviceptr)encoderInputFrame->inputPtr,
                        encoderInputFrame->pitch,
                        pEnc->GetEncodeWidth(),
                        pEnc->GetEncodeHeight(),
             	       CU_MEMORYTYPE_DEVICE,
                        encoderInputFrame->bufferFormat,
                        encoderInputFrame->chromaOffsets,
                        encoderInputFrame->numChromaPlanes);
pEnc->EncodeFrame(vPacket);

1.4.2. Allocating an NvCVImage Object Buffer

您可以使用NvCVImage分配构造函数或图像函数为NvCVImage对象分配缓冲区。在这两个选项中，当图像超出范围时，析构函数会自动释放缓冲区。

1.4.2.1. Using the NvCVImage Allocation Constructor to Allocate a Buffer

NvCVImage分配( allocation)构造函数创建一个已分配内存并已初始化的对象。有关详细信息，请参阅分配构造函数。

分配构造函数的最后三个可选参数决定了生成的NvCVImage对象的属性：

像素组织决定了蓝色、绿色和红色是在不同的平面中还是交错的。
内存类型决定了缓冲区是驻留在 GPU 上还是 CPU 上。
字节对齐决定了连续扫描线之间的间隙。

以下示例展示了如何使用分配构造函数的最后三个可选参数来确定NvCVImage对象的属性。

此示例创建一个对象，而不设置分配构造函数的最后三个可选参数。在这个对象中，蓝色、绿色和红色分量交错在每个像素中，缓冲区驻留在 CPU 上，字节对齐是默认对齐。

NvCVImage cpuSrc(
  srcWidth,
  srcHeight,
  NVCV_BGR,
  NVCV_U8
);

此示例通过显式设置最后三个可选参数来创建与上一个示例具有相同像素组织、内存类型和字节对齐的对象。与前面的示例一样，蓝色、绿色和红色分量在每个像素中交错，缓冲区驻留在 CPU 上，并且字节对齐是默认设置，即针对最大性能进行了优化。

NvCVImage src(
  srcWidth,
  srcHeight,
  NVCV_BGR,
  NVCV_U8,
  NVCV_INTERLEAVED,
  NVCV_CPU,
  0
);

此示例创建一个对象，其中蓝色、绿色和红色分量位于不同的平面中，缓冲区位于 GPU 上，字节对齐确保一条扫描线和下一条扫描线之间不存在间隙。

NvCVImage gpuSrc(
  srcWidth,
  srcHeight,
  NVCV_BGR,
  NVCV_U8,
  NVCV_PLANAR,
  NVCV_GPU,
  1
);

1.4.2.2. Using Image Functions to Allocate a Buffer

通过声明一个空图像，您可以推迟缓冲区分配。

声明一个空的NvCVImage对象。

NvCVImage xfr;
为图像分配或重新分配缓冲区。
- 要分配缓冲区，请调用NvCVImage_Alloc()函数。
  
  当图像是状态结构的一部分时，以这种方式分配缓冲区，直到稍后您才会知道图像的大小。
- 要重新分配缓冲区，请调用NvCVImage_Realloc() 。
  
  此函数检查分配的缓冲区，如果缓冲区足够大，则在释放缓冲区并调用NvCVImage_Alloc()之前对其进行调整。

1.4.3. Transferring Images Between CPU and GPU Buffers

如果输入和输出图像缓冲区的内存类型不同，应用程序可以在 CPU 和 GPU 缓冲区之间传输图像。

1.4.3.1. Transferring Input Images from a CPU Buffer to a GPU Buffer

以下是将输入图像从 CPU 缓冲区传输到 GPU 缓冲区的步骤。

创建一个NvCVImage对象以用作暂存 GPU 缓冲区，该缓冲区与源 CPU 缓冲区具有相同的尺寸和格式。

NvCVImage srcGpuPlanar(inWidth, inHeight, NVCV_BGR, NVCV_F32, NVCV_PLANAR, NVCV_GPU,1)

通过以下方式之一创建暂存缓冲区：
- 为避免在视频管道中分配内存，请创建一个 GPU 缓冲区，该缓冲区具有与视频效果过滤器输入所需的相同尺寸和格式。
```
NvCVImage srcGpuStaging(inWidth, inHeight, srcCPUImg.pixelFormat, srcCPUImg.componentType, srcCPUImg.planar, NVCV_GPU)
```
- 为了简化您的应用程序代码，请声明一个空的暂存缓冲区。
```
NvCVImage srcGpuStaging;
```
调用NvCVImage_Transfer()函数将源 CPU 缓冲区内容通过暂存 GPU 缓冲区复制到最终 GPU 缓冲区中。

//Read the image into srcCPUImg
NvCVImage_Transfer(&srcCPUImg, &srcGPUPlanar, 1.0f, stream, &srcGPUStaging)

1.4.3.2. Transferring Output Images from a GPU Buffer to a CPU Buffer

以下是将输出图像从 CPU 缓冲区传输到 GPU 缓冲区的步骤。

创建一个NvCVImage对象以用作暂存 GPU 缓冲区，该缓冲区与目标 CPU 缓冲区具有相同的尺寸和格式。

NvCVImage dstGpuPlanar(outWidth, outHeight, NVCV_BGR, NVCV_F32, NVCV_PLANAR, NVCV_GPU, 1)

通过以下方式之一创建暂存缓冲区：
- 为避免在视频管道中分配内存，请创建一个与视频效果过滤器的输出具有相同尺寸和格式的 GPU 缓冲区。
```
NvCVImage dstGpuStaging(outWidth, outHeight, dstCPUImg.pixelFormat, dstCPUImg.componentType, dstCPUImg.planar, NVCV_GPU)
```
- 为了简化您的应用程序代码，请声明一个空的暂存缓冲区：
```
NvCVImage dstGpuStaging;
```
将根据需要分配适当大小的缓冲区。
调用NvCVImage_Transfer()函数将 GPU 缓冲区内容通过暂存 GPU 缓冲区复制到目标 CPU 缓冲区。

//Retrieve the image from the GPU to CPU, perhaps with conversion.
NvCVImage_Transfer(&dstGpuPlanar, &dstCPUImg, 1.0f, stream, &dstGpuStaging);

1.5. List of Properties for the AR SDK Features

本部分提供 AR SDK 中功能的属性及其值。

1.5.1. Face Tracking Property Values

下表列出了面部跟踪的配置、输入和输出属性的值。

Table 2. Configuration Properities for Face Tracking
Property Name	Value
FeatureDescription	String is free-form text that describes the feature. The string is set by the SDK and cannot be modified by the user.
CUDAStream	The CUDA stream, which is set by the user.
ModelDir	String that contains the path to the folder that contains the TensorRT package files. Set by the user.
Temporal	Unsigned integer, 1/0 to enable/disable the temporal optimization of face detection. If enabled, only one face is returned. See Face Detection and Tracking for more information. Set by the user.

Table 3. Input Properties for Face Tracking
Property Name	Value
Image	Interleaved (or chunky) 8-bit BGR input image in a CUDA buffer of type NvCVImage. To be allocated and set by the user.

Table 4. Output Properties for Face Tracking
Property Name	Value
BoundingBoxes	NvAR_BBoxes structure that holds the detected face boxes. To be allocated by the user.
BoundingBoxesConfidence	An array of single-precision (32-bit) floating-point numbers that contains the confidence values for each detected face box. To be allocated by the user.

1.5.2. Landmark Tracking Property Values

下表列出了关键点跟踪的配置、输入和输出属性的值。

Table 5. Configuration Properties for Landmark Tracking
Property Name	Value
FeatureDescription	String that describes the feature.
CUDAStream	The CUDA stream. Set by the user.
ModelDir	String that contains the path to the folder that contains the TensorRT package files. Set by the user.
BatchSize	The number of inferences to be run at one time on the GPU. The maximum value is 1.
Landmarks_Size	Unsigned integer, 68 or 126. Specifies the number of landmark points (X and Y values) to be returned. Set by the user.
LandmarksConfidence_Size	Unsigned integer, 68 or 126. Specifies the number of landmark confidence values for the detected keypoints to be returned. Set by the user.
Temporal	Unsigned integer, 1/0 to enable/disable the temporal optimization of landmark detection. If enabled, only one input bounding box is supported as the input. See Landmark Detection and Tracking for more information. Set by the user.

Table 6. Input Properties for Landmark Tracking
Property Name	Value
Image	Interleaved (or chunky) 8-bit BGR input image in a CUDA buffer of type NvCVImage. To be allocated and set by the user.
BoundingBoxes	NvAR_BBoxes structure that contains the number of bounding boxes that are equal to BatchSize on which to run landmark detection. If not specified as an input property, face detection is automatically run on the input image. See Landmark Detection and Tracking for more information. To be allocated by the user.

Table 7. Output Properties for Landmark Tracking
Property Name	Value
Landmarks	NvAR_Point2f array, which must be large enough to hold the number of points given by the product of NvAR_Parameter_Config(BatchSize) and NvAR_Parameter_Config(Landmarks_Size). To be allocated by the user.
Pose	NvAR_Quaternion array, which must be large enough to hold the number of quaternions equal to NvAR_Parameter_Config(BatchSize). To be allocated by the user.
LandmarksConfidence	An array of single-precision (32-bit) floating-point numbers, which must be large enough to hold the number of confidence values given by the product of the following: NvAR_Parameter_Config(BatchSize) NvAR_Parameter_Config(LandmarksConfidence_Size) To be allocated by the user.
BoundingBoxes	NvAR_BBoxes structure that contains the detected face through face detection performed by the landmark detection feature. See Landmark Detection and Tracking for more information. To be allocated by the user.

1.5.3. Face 3D Mesh Tracking Property Values

下表列出了面 3D 网格跟踪的配置、输入和输出属性的值。

Table 8. Configuration Properties for Face 3D Mesh Tracking
Property Name	Value
FeatureDescription	String that describes the feature. This property is read-only.
ModelDir	String that contains the path to the face model, and the TensorRT package files. See Alternative Usage of the Face 3D Mesh Feature for more information. Set by the user.
CUDAStream	The CUDA stream. See Alternative Usage of the Face 3D Mesh Feature for more information. Set by the user.
Temporal	Unsigned integer, 1/0 to enable/disable the temporal optimization of face and landmark detection. See Alternative Usage of the Face 3D Mesh Feature for more information. Set by the user.
LandmarksConfidence_Size	Unsigned integer, 68 or 126. If landmark detection is run internally, the confidence values for the detected key points are returned. See Alternative Usage of the Face 3D Mesh Feature for more information.
ShapeEigenValueCount	The number of eigenvalues that describe the identity shape. Query this to determine how big the eigenvalue array should be, if that is a desired output. This property is read-only.
ExpressionCount	The number of expressions available in the chosen model. Query this to determine how big the expression coefficient array should be, if that is a desired output. This property is read-only.
VertexCount	The number of vertices in the chosen model. Query this property to determine how big the vertex array should be, where VertexCount is the number of vertices. This property is read-only.
TriangleCount	The number of triangles in the chosen model. Query this property to determine how big the triangle array should be, where TriangleCount is the number of triangles. This property is read-only.

Table 9. Input Properties for Face 3D Mesh Tracking
Property Name	Value
Width	The width of the input image buffer that contains the face to which the face model will be fitted. Set by the user.
Height	The height of the input image buffer that contains the face to which the face model will be fitted. Set by the user.
Landmarks	An NvAR_Point2f array that contains the landmark points of size NvAR_Parameter_Config(Landmarks_Size) that is returned by the landmark detection feature. If landmarks are not provided to this feature, an input image must be provided. See Alternative Usage of the Face 3D Mesh Feature for more information. To be allocated by the user.
Image	An interleaved (or chunky) 8-bit BGR input image in a CUDA buffer of type NvCVImage. If an input image is not provided as input, the landmark points must be provided to this feature as input. See Alternative Usage of the Face 3D Mesh Feature for more information. To be allocated by the user.

Table 10. Output Properties for Face 3D Mesh Tracking
Property Name	Value
FaceMesh	NvAR_FaceMesh structure that contains the output face mesh. To be allocated by the user.
RenderingParams	NvAR_RenderingParams structure that contains the rendering parameters for drawing the face mesh that is returned by this feature. To be allocated by the user.
Landmarks	An NvAR_Point2f array, which must be large enough to hold the number of points of size NvAR_Parameter_Config(Landmarks_Size). See Alternative Usage of the Face 3D Mesh Feature for more information. To be allocated by the user.
Pose	NvAR_Quaternion array pointer, to hold one quaternion. See Alternative Usage of the Face 3D Mesh Feature for more information. To be allocated by the user.
LandmarksConfidence	An array of single-precision (32-bit) floating-point numbers, which must be large enough to hold the number of confidence values of size NvAR_Parameter_Config(LandmarksConfidence_Size). See Alternative Usage of the Face 3D Mesh Feature for more information. To be allocated by the user.
BoundingBoxes	NvAR_BBoxes structure that contains the detected face that is determined internally. See Alternative Usage of the Face 3D Mesh Feature for more information. To be allocated by the user.
BoundingBoxesConfidence	An array of single-precision (32-bit) floating-point numbers that contain the confidence values for each detected face box. See Alternative Usage of the Face 3D Mesh Feature for more information. To be allocated by the user.
ShapeEigenValues	Optional: The array into which the shape eigenvalues will be placed, if desired. Query ShapeEigenValueCount to determine how big this array should be. To be allocated by the user.
ExpressionCoefficients	Optional: The array into which the expression coefficients will be placed, if desired. Query ExpressionCount to determine how big this array should be. To be allocated by the user.

1.5.4. Body Detection Property Values

下表列出了身体检测跟踪的配置、输入和输出属性的值。

Table 11. Configuration Properties for Body Dection Tracking
Property Name	Name
FeatureDescription	String is free-form text that describes the feature. The string is set by the SDK and cannot be modified by the user.
CUDAStream	The CUDA stream, which is set by the user.
ModelDir	String that contains the path to the folder that contains the TensorRT package files. Set by the user.
Temporal	Unsigned integer, 1/0 to enable/disable the temporal optimization of body detection. Set by the user.

Table 11. Configuration Properties for Body Dection Tracking
Property Name	Name
FeatureDescription	String is free-form text that describes the feature. The string is set by the SDK and cannot be modified by the user.
CUDAStream	The CUDA stream, which is set by the user.
ModelDir	String that contains the path to the folder that contains the TensorRT package files. Set by the user.
Temporal	Unsigned integer, 1/0 to enable/disable the temporal optimization of body detection. Set by the user.

Table 13. Output Properties for Body Detection
Property Name	Value
BoundingBoxes	NvAR_BBoxes structure that holds the detected body boxes. To be allocated by the user.
BoundingBoxesConfidence	An array of single-precision (32-bit) floating-point numbers that contains the confidence values for each detected body box. To be allocated by the user.

1.5.5. 3D Body Pose Keypoint Tracking Property Values

下表列出了 3D Body Pose Keypoint Tracking 的配置、输入和输出属性的值。

Table 14. Configuration Properties for 3D Body Pose Keypoint Tracking
Property Name	Value
FeatureDescription	FeatureDescription String that describes the feature.
CUDAStream	The CUDA stream. Set by the user.
ModelDir	String that contains the path to the folder that contains the TensorRT package files. Set by the user.
BatchSize	The number of inferences to be run at one time on the GPU. The maximum value is 1.
Mode	Unsigned integer, 0 or 1. Default is 1. Selects the High Performance (1) mode or High Quality (0) mode Set by the user.
UseCudaGraph	Bool, True or False. Default is True Flag to use CUDA Graphs for optimization. Set by the user.
FocalLength	Float. Default is 800.79041 Specifies the focal length of the camera to be used for 3D Body Pose. Set by the user.
Temporal	Unsigned integer and 1/0 to enable/disable the temporal optimization of Body Pose tracking. Set by the user.
NumKeyPoints	Unsigned integer. Specifies the number of keypoints available, which is currently 34.
ReferencePose	NvAR_Point3f array, which contains the reference pose for each of the 34 keypoints. Specifies the Reference Pose used to compute the joint angles.

Table 15. Input Properties for 3D Body Pose Keypoint Tracking
Property Name	Value
Image	Interleaved (or chunky) 8-bit BGR input image in a CUDA buffer of type NvCVImage. To be allocated and set by the user.
BoundingBoxes	NvAR_BBoxes structure that contains the number of bounding boxes that are equal to BatchSize on which to run 3D Body Pose detection. If not specified as an input property, body detection is automatically run on the input image. To be allocated by the user.

Table 16. Output Properties for 3D Body Pose Keypoint Tracking
Property Name	Value
Keypoints	NvAR_Point2f array, which must be large enough to hold the 34 points given by the product of NvAR_Parameter_Config(BatchSize) and 34. To be allocated by the user.
Keypoints3D	NvAR_Point3f array, which must be large enough to hold the 34 points given by the product of NvAR_Parameter_Config(BatchSize) and 34. To be allocated by the user.
JointAngles	NvAR_Quaternion array, which must be large enough to hold the 34 joints given by the product of NvAR_Parameter_Config(BatchSize) and 34. They represent the local rotation (in Quaternion) of each joint with reference to the ReferencePose. To be allocated by the user.
KeyPointsConfidence	An array of single-precision (32-bit) floating-point numbers, which must be large enough to hold the number of confidence values given by the product of the following: NvAR_Parameter_Config(BatchSize) 34 To be allocated by the user.
BoundingBoxes	NvAR_BBoxes structure that contains the detected body through body detection performed by the 3D Body Pose feature. To be allocated by the user.

1.6. Using the AR Features

本节提供有关如何使用 AR 功能的信息。

1.6.1. Face Detection and Tracking

本节提供有关如何使用人脸检测和跟踪功能的信息。

1.6.1.1. Face Detection for Static Frames (Images)

要获得检测到的边界框，您可以显式实例化并运行人脸检测功能，如下所示，该功能将图像缓冲区作为输入。

此示例使用输入图像缓冲区和输出内存运行人脸检测 AR 功能以保存边界框：

//Set input image buffer
NvAR_SetObject(faceDetectHandle, NvAR_Parameter_Input(Image), &inputImageBuffer, sizeof(NvCVImage));
//Set output memory for bounding boxes
NvAR_BBoxes = output_boxes{};
output_bboxes.boxes = new NvAR_Rect[25];
output_bboxes.max_boxes = 25;
NvAR_SetObject(faceDetectHandle, NvAR_Parameter_Output(BoundingBoxes), &output_bboxes, sizeof(NvAR_BBoxes));
 //OPTIONAL – Set memory for bounding box confidence values if desired
 NvAR_Run(faceDetectHandle);

1.6.1.2. Face Tracking for Temporal Frames (Videos)

如果启用了Temporal ，例如，当您处理视频帧而不是图像时，则只返回一个人脸。最大的人脸出现在第一帧，随后在随后的帧中跟踪该人脸。

然而，显式调用人脸检测特征并不是获得表示检测到的人脸的边界框的唯一方法。有关如何使用特征点检测或 Face3D 重建 AR 功能并返回人脸边界框的更多信息，请参阅特征点检测和跟踪和人脸 3D 网格和跟踪。

1.6.2. Landmark Detection and Tracking

本节提供有关如何使用特征点检测和跟踪功能的信息。

1.6.2.1. Landmark Detection for Static Frames (Images)

通常，特征点检测功能的输入是输入图像和一批（最多 8 个）边界框。目前，最大值为 1。这些框表示图像中包含您要在其上运行特征点检测的人脸的区域。

此示例在从人脸检测中获取边界框后运行地标检测 AR 功能：

//Set input image buffer
NvAR_SetObject(landmarkDetectHandle, NvAR_Parameter_Input(Image), &inputImageBuffer, sizeof(NvCVImage));
 
//Pass output bounding boxes from face detection as an input on which //landmark detection is to be run
NvAR_SetObject(landmarkDetectHandle, NvAR_Parameter_Input(BoundingBoxes), &output_bboxes, sizeof(NvAR_BBoxes));
 //Set output buffer to hold detected facial keypoints
std::vector facial_landmarks;
facial_landmarks.assign(OUTPUT_SIZE_KPTS, {0.f, 0.f});
NvAR_SetObject(landmarkDetectHandle, NvAR_Parameter_Output(Landmarks), facial_landmarks.data(),sizeof(NvAR_Point2f));
 NvAR_Run(landmarkDetectHandle);

1.6.2.2. Alternative Usage of Landmark Detection

但是，如Landmark Tracking Property Values中所述，Landmark Detection AR 功能支持一些可选参数，这些参数决定了该功能的运行方式。

如果边界框没有作为输入提供给地标检测 AR 功能，则会在输入图像上自动运行人脸检测，并选择最大的人脸边界框来运行地标检测。

如果BoundingBoxes设置为输出属性，则该属性将填充选定的边界框，该边界框包含运行地标检测的人脸。 Landmarks 不是可选属性，要显式运行此功能，必须使用提供的输出缓冲区设置此属性。

1.6.2.3. Landmark Tracking for Temporal Frames (Videos)

此外，如果启用了Temporal ，例如当您处理视频流并显式运行人脸检测时，则仅支持一个边界框作为地标检测的输入。

当没有明确运行人脸检测时，通过提供输入图像而不是边界框，自动选择检测到的最大人脸。然后将检测到的人脸和地标作为跨时间相关帧的优化进行跟踪。

注意：内部确定的边界框可以从此功能中查询，但不是该功能运行所必需的。

此示例使用 Landmark Detection AR 功能直接从图像中获取特征点，而无需先显式运行人脸检测：

//Set input image buffer
NvAR_SetObject(landmarkDetectHandle, NvAR_Parameter_Input(Image), &inputImageBuffer, sizeof(NvCVImage));
 
//Set output memory for landmarks
std::vector facial_landmarks;
facial_landmarks.assign(batchSize * OUTPUT_SIZE_KPTS, {0.f, 0.f});
NvAR_SetObject(landmarkDetectHandle, NvAR_Parameter_Output(Landmarks), facial_landmarks.data(),sizeof(NvAR_Point2f));
 
//OPTIONAL – Set output memory for bounding box if desired
NvAr_BBoxes = output_boxes{};
output_bboxes.boxes = new NvAR_Rect[25];
output_bboxes.max_boxes = 25;
NvAR_SetObject(landmarkDetectHandle, NvAR_Parameter_Output(BoundingBoxes), &output_bboxes, sizeof(NvAr_BBoxes));
 
//OPTIONAL – Set output memory for pose, landmark confidence, or even bounding box confidence if desired
 
NvAR_Run(landmarkDetectHandle);

1.6.3. Face 3D Mesh and Tracking

本节提供有关如何使用面 3d 网格和跟踪功能的信息。

1.6.3.1. Face 3D Mesh for Static Frames (Images)

通常，人脸 3D 网格特征的输入是输入图像和一组检测到的关键点，这些标记点对应于我们要在其上运行 3D 重建的人脸。

这是此功能的典型用法，其中从地标检测功能检测到的面部关键点作为输入传递给此功能：

//Set facial keypoints from Landmark Detection as an input
NvAR_SetObject(faceFitHandle, NvAR_Parameter_Input(Landmarks), facial_landmarks.data(),sizeof(NvAR_Point2f));
//Set output memory for face mesh
NvAR_FaceMesh face_mesh = new NvAR_FaceMesh();
face_mesh->vertices = new NvAR_Vector3f[FACE_MODEL_NUM_VERTICES];
face_mesh->tvi = new NvAR_Vector3u16[FACE_MODEL_NUM_INDICES];
NvAR_SetObject(faceFitHandle, NvAR_Parameter_Output(FaceMesh), face_mesh, sizeof(NvAR_FaceMesh));
//Set output memory for rendering parameters
NvAR_RenderingParams rendering_params = new NvAR_RenderingParams();
NvAR_SetObject(faceFitHandle, NvAR_Parameter_Output(RenderingParams), rendering_params, sizeof(NvAR_RenderingParams));
 NvAR_Run(faceFitHandle);

1.6.3.2. Alternative Usage of the Face 3D Mesh Feature

与 Landmark 检测功能的替代用法类似，Face 3D Mesh AR 功能可用于确定检测到的人脸边界框、面部关键点、3D 人脸网格及其渲染参数。

如果提供输入图像，而不是面部的面部关键点，则会自动检测面部和面部关键点并用于运行面部网格拟合。以这种方式运行时，如果将BoundingBoxes或 Landmarks 设置为此功能的可选输出属性，这些属性将分别填充包含面部和检测到的面部关键点的边界框。

FaceMesh和RenderingParams不是此功能的可选属性，要运行此功能，必须使用用户提供的输出缓冲区设置这些属性。

此外，如果在不提供面部关键点作为输入的情况下运行此功能，则ModelDir配置参数指向的路径还必须包含面部和地标检测 TRT 包文件。或者，可以为这些功能设置CUDAStream和Temporal标志。

1.6.3.3. Face 3D Mesh Tracking for Temporal Frames (Videos)

如果设置了 Temporal 标志并且在内部运行人脸和关键点检测，则这些特征将针对时间相关的帧进行优化
这意味着将跨帧跟踪面部和面部关键点，并且如果请求，将仅返回一个边界框作为输出。如果显式调用了地标检测和/或人脸检测功能，则人脸 3D 网格功能不支持时间标志。在这种情况下，您必须直接向这些功能提供标志。
注意：内部确定的面部关键点和/或面部边界框可以从此功能中查询，但不是该功能运行所必需的。
此示例使用 Mesh Tracking AR 功能直接从图像中获取人脸网格，无需显式运行 Landmark Detection 或 Face Detection：

//Set input image buffer instead of providing facial keypoints
NvAR_SetObject(landmarkDetectHandle, NvAR_Parameter_Input(Image), &inputImageBuffer, sizeof(NvCVImage));
 
//Set output memory for face mesh
NvAR_FaceMesh face_mesh = new NvAR_FaceMesh();
face_mesh->vertices = new NvAR_Vector3f[FACE_MODEL_NUM_VERTICES];
face_mesh->tvi = new NvAR_Vector3u16[FACE_MODEL_NUM_INDICES];
NvAR_SetObject(faceFitHandle, NvAR_Parameter_Output(FaceMesh), face_mesh, sizeof(NvAR_FaceMesh));
 
//Set output memory for rendering parameters
NvAR_RenderingParams rendering_params = new NvAR_RenderingParams();
NvAR_SetObject(faceFitHandle, NvAR_Parameter_Output(RenderingParams), rendering_params, sizeof(NvAR_RenderingParams));
 
//OPTIONAL - Set facial keypoints as an output
NvAR_SetObject(faceFitHandle, NvAR_Parameter_Output(Landmarks), facial_landmarks.data(),sizeof(NvAR_Point2f));
 
//OPTIONAL – Set output memory for bounding boxes, or other parameters, such as pose, bounding box/landmarks confidence, etc.
 
NvAR_Run(faceFitHandle);

1.6.4. 3D Body Pose Tracking

此功能依赖于时间信息来跟踪场景中的人，其中前一帧的关键点信息用于估计下一帧的关键点。

3D Body Pose Tracking 由以下部分组成：

身体检测
3D关键点检测

在此版本中，我们仅支持画面中的一个人，并且当整个身体（从头到脚）可见时。但是，如果身体的一部分（例如手臂或脚）被遮挡/截断，该功能仍然有效。

1.6.4.1. 3D Body Pose Tracking for Static Frames (Images)

您可以获得封装场景中人物的边界框。要获得检测到的边界框，您可以显式实例化并运行身体检测，如下例所示，并将图像缓冲区作为输入传递。

此示例使用输入图像缓冲区和输出内存运行身体检测以保存边界框：

//Set input image buffer
NvAR_SetObject(bodyDetectHandle, NvAR_Parameter_Input(Image), &inputImageBuffer, sizeof(NvCVImage));
 //Set output memory for bounding boxes
 
NvAR_BBoxes = output_boxes{};
output_bboxes.boxes = new NvAR_Rect[25];
output_bboxes.max_boxes = 25;
NvAR_SetObject(bodyDetectHandle, NvAR_Parameter_Output(BoundingBoxes), &output_bboxes, sizeof(NvAR_BBoxes));
 
//OPTIONAL – Set memory for bounding box confidence values if desired
 
NvAR_Run(bodyDetectHandle);

3D Body Keypoint Detection 的输入是输入图像。它输出 2D 关键点、3D 关键点、关键点置信度分数和封装人的边界框。

此示例运行 3D 身体姿势检测 AR 功能：

//Set input image buffer
NvAR_SetObject(keypointDetectHandle, NvAR_Parameter_Input(Image), &inputImageBuffer, sizeof(NvCVImage));
 
//Pass output bounding boxes from body detection as an input on which //landmark detection is to be run
NvAR_SetObject(keypointDetectHandle, NvAR_Parameter_Input(BoundingBoxes), &output_bboxes, sizeof(NvAR_BBoxes));
 
//Set output buffer to hold detected keypoints
std::vector keypoints;
std::vector keypoints3D;
std::vector jointAngles;
std::vector keypoints_confidence;
 
// Get the number of keypoints
unsigned int numKeyPoints;
NvAR_GetU32(keyPointDetectHandle, NvAR_Parameter_Config(NumKeyPoints), &numKeyPoints);
 
keypoints.assign(batchSize * numKeyPoints , {0.f, 0.f});
keypoints3D.assign(batchSize * numKeyPoints , {0.f, 0.f, 0.f});
jointAngles.assign(batchSize * numKeyPoints , {0.f, 0.f, 0.f});
NvAR_SetObject(keyPointDetectHandle, NvAR_Parameter_Output(KeyPoints), keypoints.data(), sizeof(NvAR_Point2f));
NvAR_SetObject(keyPointDetectHandle, NvAR_Parameter_Output(KeyPoints3D), keypoints3D.data(), sizeof(NvAR_Point3f));
NvAR_SetF32Array(keyPointDetectHandle, NvAR_Parameter_Output(KeyPointsConfidence), keypoints_confidence.data(), batchSize * numKeyPoints);
NvAR_SetObject(keyPointDetectHandle, NvAR_Parameter_Output(JointAngles), jointAngles.data(), sizeof(NvAR_Point3f));
 
//Set output memory for bounding boxes
NvAR_BBoxes = output_boxes{};
output_bboxes.boxes = new NvAR_Rect[25];
output_bboxes.max_boxes = 25;
NvAR_SetObject(keyPointDetectHandle, NvAR_Parameter_Output(BoundingBoxes), &output_bboxes, sizeof(NvAR_BBoxes));
 
NvAR_Run(keyPointDetectHandle);

1.6.4.2. 3D Body Pose Tracking for Temporal Frames (Videos)

该功能依靠时间信息来跟踪场景中的人。前一帧的关键点信息用于估计下一帧的关键点。

此示例使用 3D Body Pose Tracking AR 功能直接从图像中获取 3D Body Pose Keypoints：

//Set input image buffer
NvAR_SetObject(keypointDetectHandle, NvAR_Parameter_Input(Image), &inputImageBuffer, sizeof(NvCVImage));
 
//Pass output bounding boxes from body detection as an input on which //landmark detection is to be run
NvAR_SetObject(keypointDetectHandle, NvAR_Parameter_Input(BoundingBoxes), &output_bboxes, sizeof(NvAR_BBoxes));
 
//Set output buffer to hold detected keypoints
std::vector keypoints;
std::vector keypoints3D;
std::vector jointAngles;
std::vector keypoints_confidence;
 
// Get the number of keypoints
unsigned int numKeyPoints;
NvAR_GetU32(keyPointDetectHandle, NvAR_Parameter_Config(NumKeyPoints), &numKeyPoints);
 
keypoints.assign(batchSize * numKeyPoints , {0.f, 0.f});
keypoints3D.assign(batchSize * numKeyPoints , {0.f, 0.f, 0.f});
jointAngles.assign(batchSize * numKeyPoints , {0.f, 0.f, 0.f});
NvAR_SetObject(keyPointDetectHandle, NvAR_Parameter_Output(KeyPoints), keypoints.data(), sizeof(NvAR_Point2f));
NvAR_SetObject(keyPointDetectHandle, NvAR_Parameter_Output(KeyPoints3D), keypoints3D.data(), sizeof(NvAR_Point3f));
NvAR_SetF32Array(keyPointDetectHandle, NvAR_Parameter_Output(KeyPointsConfidence), keypoints_confidence.data(), batchSize * numKeyPoints);
NvAR_SetObject(keyPointDetectHandle, NvAR_Parameter_Output(JointAngles), jointAngles.data(), sizeof(NvAR_Point3f));
 
//Set output memory for bounding boxes
NvAR_BBoxes = output_boxes{};
output_bboxes.boxes = new NvAR_Rect[25];
output_bboxes.max_boxes = 25;
NvAR_SetObject(keyPointDetectHandle, NvAR_Parameter_Output(BoundingBoxes), &output_bboxes, sizeof(NvAR_BBoxes));
 
NvAR_Run(keyPointDetectHandle);

1.7. Using Multiple GPUs

使用 AR SDK 开发的应用程序可以与多个 GPU 一起使用。默认情况下，SDK 会根据当前选择的 GPU 的能力来确定使用哪个 GPU：如果当前选择的 GPU 支持 AR SDK，则 SDK 使用它。否则，SDK 会选择最佳 GPU。

cudaSetDevice(int whichGPU)和cudaGetDevice(int *whichGPU) NVIDIA CUDA® Toolkit 函数和NvAR_SetS32(NULL, NvAR_Parameter_Config(GPU) , whichGPU) AR SDK Set函数来控制在多 GPU 环境中使用哪个 GPU .在创建任何效果之前，AR SDK 只调用一次Set()调用。由于不可能将分配在一个 GPU 上的图像透明地传递到另一个 GPU，因此您必须确保将同一 GPU 用于所有 AR 功能。

NvCV_Status err;
int chosenGPU = 0; // or whatever GPU you want to use
err = NvAR_SetS32(NULL, NvAR_Parameter_Config(GPU), chosenGPU);
if (NVCV_SUCCESS != err) {
	printf(“Error choosing GPU %d: %s\n”, chosenGPU,
    	    NvCV_GetErrorStringFromCode(err));
}
cudaSetDevice(chosenGPU);
NvCVImage dst = new NvCVImage(…);
NvAR_Handle eff;
err = NvAR_API NvAR_CreateEffect(code, &eff);
…
err = NvAR_API NvAR_Load(eff);
err = NvAR_API NvAR_Run(eff, true);
// switch GPU for other task, then switch back for next frame

缓冲区需要在选定的 GPU 上分配，因此在 GPU 上分配图像之前，请调用cudaSetDevice() 。神经网络需要在选定的 GPU 上加载，因此在调用 NvAR_Load()之前，将此 GPU 设置为当前设备。

要使用缓冲区和模型，在调用NvAR_Run()并将 GPU 设备设置为当前设备之前。先前对NvAR_SetS32(NULL, NvAR_Parameter_Config(GPU) , whichGPU)的调用有助于强制执行此要求。

出于性能考虑，切换到适当的 GPU 是应用程序的责任。

1.7.1. Default Behavior in Multi-GPU Environments

NvAR_Load ()函数在内部调用cudaGetDevice()来识别当前选择的 GPU。

该函数检查当前选择的GPU（默认为0）的计算能力，以确定GPU架构是否支持AR SDK并完成以下任务之一：

如果 SDK 受支持，则 NvAR_Load()使用 GPU。
如果 SDK 不支持， NvAR_Load() 会搜索支持 AR SDK 的最强大的 GPU，并调用cudaSetDevice()将该 GPU 设置为当前 GPU。

如果您不要求您的应用程序在多 GPU 环境中使用特定 GPU，则默认行为就足够了。

1.7.2. Selecting the GPU for AR SDK Processing in a Multi-GPU Environment

您的应用程序可能设计为仅通过在多 GPU 环境中使用特定 GPU 来执行应用 AR 过滤器的任务。在这种情况下，请确保 AR SDK 不会覆盖您为应用视频效果滤镜而选择的 GPU。

// Initialization
cudaGetDevice(&beforeGPU);
err = NvAR_Load(eff);
if (NVCV_SUCCESS != err) { printf("Cannot load ARSDK: %s\n",
   NvCV_GetErrorStringFromCode(err)); exit(-1); }
cudaGetDevice(&arsdkGPU);
if (beforeGPU != arsdkGPU) {
  printf("GPU #%d cannot run AR SDK, so GPU #%d was chosen instead\n",
	beforeGPU, arsdkGPU);
}

1.7.3. Selecting Different GPUs for Different Tasks

您的应用程序可能设计为在多 GPU 环境中执行多项任务，例如渲染游戏和应用 AR 过滤器。在这种情况下，请在调用NvAR_Load()之前为每个任务选择最佳 GPU 。

调用cudaGetDeviceCount()以确定您环境中的 GPU 数量。

// Get the number of GPUs
cuErr = cudaGetDeviceCount(&deviceCount);

通过对每个 GPU 循环执行以下操作，获取每个 GPU 的属性，并确定它是否是每个任务的最佳 GPU：
- 调用cudaSetDevice()设置当前 GPU。
- 调用cudaGetDeviceProperties()获取当前 GPU 的属性。
- 要确定 GPU 是否是每个特定任务的最佳 GPU，请在应用程序中使用自定义代码来分析cudaGetDeviceProperties()检索到的属性。
  此示例使用计算能力来确定是否应分析 GPU 的属性并确定当前 GPU 是否是应用视频效果滤镜的最佳 GPU。仅当计算能力为 7.5 或 8.6 时才分析 GPU 的属性，这表示 GPU 分别基于 NVIDIA Turing™ GPU 架构或 NVIDIA Ampere 架构。

  // Loop through the GPUs to get the properties of each GPU and
  //determine if it is the best GPU for each task based on the
  //properties obtained.
  for (int dev = 0; dev < deviceCount; ++dev) {
	cudaSetDevice(dev);
	cudaGetDeviceProperties(&deviceProp, dev);
	if (DeviceIsBestForARSDK(&deviceProp))  gpuARSDK = dev;
	if (DeviceIsBestForGame(&deviceProp)) gpuGame = dev;
	...
  }
  cudaSetDevice(gpuARSDK);
  err = NvAR_Set...; // set parameters
  err = NvAR_Load(eff);
3.  	In the loop to complete the application’s tasks, select the best GPU for each task before performing the task.
a).	Call cudaSetDevice() to select the GPU for the task.
b).	Make all the function calls required to perform the task.
In this way, you select the best GPU for each task only once without setting the GPU for every function call.
This example selects the best GPU for rendering a game and uses custom code to render the game. It then selects the best GPU for applying a video effect filter before calling the NvCVImage_Transfer() and NvAR_Run() functions to apply the filter, avoiding the need to save and restore the GPU for every NVIDIA AR SDK API call.
// Select the best GPU for each task and perform the task.
while (!done) {
  ...
  cudaSetDevice(gpuGame);
  RenderGame();
  cudaSetDevice(gpuARSDK);
  err = NvAR_Run(eff, 1);
  ...
}

你可能感兴趣的:(ar,深度学习,计算机视觉,python,vr)

Python-tkinter自制登录界面（含注册） GCHEK python 开发语言
简单的用户登录、注册界面importtkinterastkimporttimeimportsubprocessimportsysimportosimporttkinter.messageboxwindow=tk.Tk()window.title('GCHEK')window.geometry('400x300')#设置储存用户信息的容器，这里用的txt。ifnotos.path.exists('U
git删除已经commit但是未push的文件不知西向东 git git
git删除已经commit但是未push的文件已经2次了，没注意,将target文件夹直接就commit了，造成的是你本地仓库就会多出很多class文件来解决方法：打开项目所在目录的文件夹（就是,git文件夹所在的目录）然后打开git命令行(gitbashhere)输入gitlog会将你最近commit的id都输出出来撤销本次commit：gitresetidok,结束。并不会对你改动的代码进行撤
Python爬虫requests(详细) dme. Python爬虫零基础入门爬虫 python
本文来学爬虫使用requests模块的常见操作。1.URL参数无论是在发送GET/POST请求时，网址URL都可能会携带参数，例如：http://www.5xclass.cn?age=19&name=dengres=requests.get(url="https://www.5xclass.cn?age=19&name=deng")res=requests.get(url="https://www
mac+php5.3的docker-compose.yml分享自娱自乐22 docker
version:'3'services:nginx:image:nginx:latestcontainer_name:nginx-composevolumes:-./wwwroot:/usr/share/nginx/html:rw-./nginx/nginx/:/etc/nginx/:rw-./log/nginx:/var/log/nginx:rwrestart:alwayslinks:-phpp
《神经网络与深度学习》(邱锡鹏) 内容概要【不含数学推导】 code_stream #机器学习神经网络
第1章绪论基本概念：介绍了人工智能的发展历程及不同阶段的特点，如符号主义、连接主义、行为主义等。还阐述了深度学习在人工智能领域的重要地位和发展现状，以及其在图像、语音、自然语言处理等多个领域的成功应用。术语解释人工智能：旨在让机器模拟人类智能的技术和科学。深度学习：一种基于对数据进行表征学习的方法，通过构建具有很多层的神经网络模型，自动从大量数据中学习复杂的模式和特征。第2章机器学习概述基本概念：
使用python计算等比数列求和的方法 HAMYHF windows
在python中，计算Sum=m+mm+mmm+mmmm+.....+mmmmm.....,输入两个数m,n。m的位数累加到n的值，列出算式并计算出结果：#为了打印出算式，并计算出结果，将m,mm这些放入到列表中#定义列表中的m初始值为0,用Ele来代表m,mm....Ele=0#定义总和为0Sum=0#定义一个空列表List=[]#输入两个值n=int(input("inputadigit：")
Python+Playwright常用元素定位方法 HAMYHF python 功能测试
CSSselector选择器在CSS中，定位元素主要通过选择器完成，以下是几种常见的CSS选择器定位方法：标签选择器(element):直接使用HTML元素名称来定位，例如p会选择所有段落元素。属性选择器(attribute):选择所有具有指定属性的元素，无论该属性的值是什么。例如，[title]会选择所有包含title属性的元素。选择具有指定属性，并且该属性值完全等于给定值的元素。例如，[typ
图像识别与应用狂踹瘸子那条好脚 python
图像识别作为人工智能领域的重要分支，近年来取得了显著进展，其中卷积神经网络（CNN）功不可没。CNN凭借其强大的特征提取能力，在图像分类、目标检测、人脸识别等任务中表现出色，成为图像识别领域的核心技术。一、卷积神经网络：图像识别的利器CNN是一种专门处理网格状数据的深度学习模型，其结构设计灵感来源于生物视觉系统。与全连接神经网络不同，CNN通过卷积层、池化层等结构，能够有效提取图像的局部特征，并逐
如何安装配置虚拟机薇晶晶 hadoop 大数据分布式
1.CentOS-7-x86_64-Minimal-2009.iso：linux安装文件。用来安装系统。2.VMware17.6.exe：虚拟机软件。用来在自己的电脑上安装虚拟机。它调用CentOS-7-x86_64-Minimal-2009.iso来安装操作系统.3.VC_redist.x86.exe:系统补丁。如果安装VMware17.6时，提示缺少文件，再来安装它，否则不用。4.finals
大模型如何改变教育？典型应用场景的探究与展望！ AGI大模型学习大模型应用人工智能 AI产品经理 llama 大模型 AI 大模型教程
目前，大模型在教育领域的应用主要体现在个性化学习助手、智能问答系统、内容生成与创作辅助、智能写作评估、跨语言学习支持、数学解题辅助等几个方面。大模型技术在教育领域凭借卓越的数据处理能力和深度学习技术，极大推动了教育质量的提升与教育公平的实现。分级分类的教育数据助力大模型发展在构建与优化大模型的过程中，教育数据能够帮助我们更精准地理解教育现象，更有质量地辅助教学。教育数据涵盖广泛，包括但不限于学生的
Python中的 redis keyspace 通知_python 操作redis psubscribe(‘__keyspace@0__ ‘) 2301_82243733 程序员 python 学习面试
最后Python崛起并且风靡，因为优点多、应用领域广、被大牛们认可。学习Python门槛很低，但它的晋级路线很多，通过它你能进入机器学习、数据挖掘、大数据，CS等更加高级的领域。Python可以做网络应用，可以做科学计算，数据分析，可以做网络爬虫，可以做机器学习、自然语言处理、可以写游戏、可以做桌面应用…Python可以做的很多，你需要学好基础，再选择明确的方向。这里给大家分享一份全套的Pytho
Python数据分析与可视化程序媛小果 python python 数据分析开发语言
Python数据分析与可视化在数据驱动的商业世界中，数据分析和可视化成为了理解复杂数据集、做出明智决策的关键工具。Python，作为一种功能强大且易于学习的编程语言，提供了丰富的库和框架，使得数据分析和可视化变得简单高效。本文将探讨Python在数据分析和可视化中的应用，包括数据预处理、分析、以及如何通过可视化工具将数据洞察转化为可操作的策略。1.数据分析的重要性数据分析是提取数据中有用信息的过程
DeepSeek原理介绍以及对网络安全行业的影响 AI拉呱 Deepseek 人工智能
大家好，我是AI拉呱，一个专注于人工智领域与网络安全方面的博主，现任资深算法研究员一职，兼职硕士研究生导师；热爱机器学习和深度学习算法应用，深耕大语言模型微调、量化、私域部署。曾获多次获得AI竞赛大奖，拥有多项发明专利和学术论文。对于AI算法有自己独特见解和经验。曾辅导十几位非计算机学生转行到算法岗位就业。关注评审分享一起学习更多知识。1.DeepSeek公司介绍1.1DeepSeek是什么：wh
【Python 学习 / 7】模块与文件操作卜及中 Python基础 python 学习数据库
文章目录前言一、导入模块1.导入整个模块2.导入模块中的特定函数3.给模块或函数起别名二、常用模块1.`math`模块2.`random`模块3.`os`模块4.`sys`模块三、文件处理1.打开文件2.读取文件3.写入文件4.关闭文件5.使用`with`语句管理文件四、日期时间1.`datetime`模块获取当前日期和时间创建日期和时间对象格式化日期和时间解析字符串为日期对象2.`time`模块
如何安装Hadoop 薇晶晶 hadoop 大数据分布式
Hadoop入门(一)——CentOS7下载+VM上安装（手动分区）Hadoop入门(二)——VMware虚拟网络设置+Windows10的IP地址配置+CentOS静态IP设置Hadoop入门(三)——XSHELL7远程访问工具+XFTP7文件传输Hadoop入门(四)——模板虚拟机环境准备Hadoop入门(五)——Hadoop集群搭建-克隆三台虚拟机Hadoop入门(六)——JDK安装Hado
《编程小白必看！字符加减法开启大小写转换之门，解锁数学分析方法密码，列方程思想》 1zero10 c语言算法
字符加减法的应用1.输入小写字母，输出大写字母首先肯定有定义变量ch；并且让我们可以在黑框输入一个变量，也就是任意一个小写字母charch;scanf("%c\n",ch);接着分析小写字母和大写字母的联系：举例分析，比如b在小写字母表排第二位，而B在大写字母表里也排第二位小写字母和大写字母都有26个所以可以利用排位一致的特点进行方程的构造设小写字母为ch（上面已经设了）设大写字母为y到这里还毫无
【学习笔记】Elasticsearch之环境搭建聪明马的博客 elasticsearch 学习笔记 elasticsearch
Elasticsearch官网本文是自己在学习Elasticsearch的过程中，记下的觉得非常有用的笔记，希望对大家认识Elasticsearch有一点点帮助。1.什么是Elasticsearch官网上是这么介绍的：Elasticsearchisadistributeddocumentstore.Insteadofstoringinformationasrowsofcolumnardata,El
CSS 滚动条样式修改（详细） mr_cmx css css3 html
1、滚动条整体部分使用::-webkit-scrollbar示例：.container::-webkit-scrollbar{width:20px;//修改滚动条宽度}2、滚动条中的滑块使用::-webkit-scrollbar-thumb示例：.container::-webkit-scrollbar-thumb{border-radius:8px;box-shadow:inset005pxrg
网页实现打字机效果充气大锤前端组件 javascript 算法开发语言 vue.js
在DS中，AI与用户的对话呈现的是一个打字机效果，那么我们在网页中如何实现对话框的打字机效果呢思路：进行字符串拼接，将要拼接的字符串逐字拼接到目标字符串上代码/***实现打字机效果*@param{String}str要打印的字符串*@param{Array}arr聊天数据中的数组*@param{Number}id需要push字符串的下标*@param{String}msg_name数组中的对象名*
HarmonyOS应用开发最佳实践 harmonyos
课程简介本课程是【HarmonyOSTechTalk】的第9课。本次交流紧紧围绕HarmonyOS应用开发。重点探讨常见的功耗问题及其最佳实践方案。省电模式是降低能耗的关键策略，通过优化系统资源分配等方式减少电量消耗。深色模式不仅能提升视觉舒适度，还对节能有积极作用。LTPO可变帧率技术则在保障应用流畅性的同时进一步优化功耗。而后台任务的合理开发与管理，决定着应用在后台运行时的资源占用与续航表现。
应用内自动续订商品，畅享无缝服务体验 harmonyos-next
用户购买某种产品时习惯一次性付款，但是对开发者而言，单次购买模式或需要用户频繁续订的服务可能会导致收入不稳定，无法获得持续稳定的收入。对于有视频、音乐等会员需求的用户，一旦体验到服务中断或需要频繁操作，可能会转向其他竞争产品，导致用户流失。HarmonyOSSDK应用内支付服务（IAPKit）为开发者提供应用内自动续期订阅商品能力，用户购买后在一段时间内允许访问增值功能或内容，周期结束后可以选择自
经销商管理系统架构设计方案（附 Java版本和Python版本源代码详解） AI天才研究院 DeepSeek R1 &大数据AI人工智能大模型 AI大模型企业级应用开发实战 AI大模型应用入门实战与进阶计算科学神经计算深度学习神经网络大数据人工智能大型语言模型 AI AGI LLM Java Python 架构设计 Agent RPA
经销商管理系统架构设计方案（Java实现源代码详解）关键词：经销商管理系统，Java，SpringBoot，MyBatis，MySQL，架构设计，源代码1.背景介绍随着市场竞争的日益激烈，企业对经销商的管理越来越重视。传统的经销商管理方式效率低下，信息滞后，难以适应现代企业的发展需求。为了提高经销商管理效率，降低运营成本，越来越多的企业开始采用信息化的手段来管理经销商，而经销商管理系统应运而生。经
Python:数据从Excel表格链接到Word文档更新Excel即可自动更新Word 一个花生米生花 python excel word
要使用Python来创建或更新一个Word文档，并将数据从Excel表格链接到Word文档中，你可以使用python-docx库来操作Word文档和openpyxl或pandas库来读取Excel文件。不过，需要注意的是，python-docx库并不支持将外部文件链接到Word文档的功能。你可以在Word文档中插入Excel数据的快照，但它们不会自动更新。如果你想要在Word文档中插入Excel数
auto-gptq安装以及不适配软硬件环境可能出现的问题及解决方式 IT修炼家大模型部署大模型 auto-gptq cuda
目录1、auto-gptq是什么？2、auto-gptq安装3、auto-gptq不正确安装可能会出现的问题（1）爆出：`CUDAextensionnotinstalled.`（2）没有报错但是推理速度超级慢1、auto-gptq是什么？Auto-GPTQ是一种专注于量化深度学习模型的工具库。它的主要目标是通过量化技术（Quantization）将大型语言模型（LLM）等深度学习模型的大小和计算复
一张图搞定(2020版)IDEA中集成Maven插件【图文】详细一个长不胖的程序YUAN Maven工具 Maven IDEA集成插件
1、首先你得先确保一下你的电脑上是有成功配置好的Maven工具。配置成功之后的演示:黑窗口中输入mvn-v，出现以上情况就是配置成功的，要是你没有配置好，请查看这篇Maven配置文章。建议配置阿里云镜像，以此让下载依赖更快，配置阿里云镜像。2、最好先在本地创建一个jar包本地仓库，以便之后直接配置时好指定你本地仓库的路径。为了让这篇文章只是出现IDEA集成Maven插件，我就把创建本地仓库的做法放
【deepseek与chatGPT辩论】辩论题： “人工智能是否应当具备自主决策能力？” 海宁不掉头发软件工程人工智能人工智能 chatgpt deepseek
探讨辩论题这个提案涉及创建一个精确的辩论题目，旨在测试deepseek的应答能力。创建辩论题目提议设计一个辩论题目以测试deepseek的应答能力。希望这个题目具有挑战性并能够测量其回应质量。好的，来一道适合深度学习的辩论题：辩论题：“人工智能是否应当具备自主决策能力？”这个话题涉及到人工智能的发展、伦理以及未来应用，可以从以下几个方面展开辩论：支持方：认为人工智能的自主决策能力能够加速科技进步，
FreeRTOS-rust 编译分析路西法Lux FreeRTOS-rust rust FreeRTOS FreeRTOS-rust cargo
目录介绍FreeRTOS-rust├──.cargo#对cargo本身的配置│└──config.toml├──Cargo.toml#对当前工作空间的配置├──freertos-cargo-build#负责对freertos源码进行编译│├──Cargo.toml#对当前package进行配置│└──src│└──lib.rs├──freertos-rust#负责编译freertos的rust接口
使用Odoo Shell卸载模块 odoo中国 odoo odoo 开源软件 erp
使用OdooShell卸载模块我们在Odoo使用过程中，因为模块安装错误或者前端错误等导致odoo无法通过界面登录，这时候你可以使用OdooShell来卸载模块。OdooShell是一个交互式Pythonshell，允许你直接与Odoo数据库和模型进行交互。以下是使用OdooShell卸载模块的详细步骤：步骤1：启动OdooShell要启动OdooShell，你需要在终端中运行以下命令。确保你已经
【后端java】构建工具maven 骑鱼过海的猫123 java maven python
文章目录1导入本地jar包到maven仓库1导入本地jar包到maven仓库mvninstall:install-file-Dfile=-DgroupId=-DartifactId=-Dversion=-Dpackaging=是你的jar文件的路径。是你的项目的组ID。是你的项目的ArtifactID。是你的jar包的版本号通常是jar，除非你的文件是其他类型的包，如pom。mvninstall:
全面解析 Enterprise Architect（EA）活动图的工具集：从元素到关系的详尽指南泡沫o0 C/C++编程世界:探索C/C++的奥妙 c++20 开发语言 c++嵌入式 qt uml arm
目录标题第一章:引言——理解活动图的重要性1.1什么是活动图？1.1.1活动图的组成元素1.1.2活动图的应用场景1.2为什么选择EA作为建模工具？1.2.1EA的强大功能1.2.2EA与其他建模工具的对比第二章:活动图中的核心元素2.1活动类元素2.1.1Activity（活动）示例：2.1.2Action（动作）示例：2.1.3Partition（泳道）示例：2.1.4Send（发送）与Rec
桌面上有多个球在同时运动，怎么实现球之间不交叉，即碰撞？换个号韩国红果果 html 小球碰撞
稍微想了一下，然后解决了很多bug，最后终于把它实现了。其实原理很简单。在每改变一个小球的x y坐标后，遍历整个在dom树中的其他小球，看一下它们与当前小球的距离是否小于球半径的两倍？若小于说明下一次绘制该小球（设为a）前要把他的方向变为原来相反方向（与a要碰撞的小球设为b），即假如当前小球的距离小于球半径的两倍的话，马上改变当前小球方向。那么下一次绘制也是先绘制b，再绘制a，由于a的方向已经改变
《高性能HTML5》读后整理的Web性能优化内容白糖_ html5
读后感先说说《高性能HTML5》这本书的读后感吧，个人觉得这本书前两章跟书的标题完全搭不上关系，或者说只能算是讲解了“高性能”这三个字，HTML5完全不见踪影。个人觉得作者应该首先把HTML5的大菜拿出来讲一讲，再去分析性能优化的内容，这样才会有吸引力。因为只是在线试读，没有机会看后面的内容，所以不胡乱评价了。
[JShop]Spring MVC的RequestContextHolder使用误区 dinguangx jeeshop 商城系统 jshop 电商系统
在spring mvc中，为了随时都能取到当前请求的request对象，可以通过RequestContextHolder的静态方法getRequestAttributes()获取Request相关的变量，如request, response等。在jshop中，对RequestContextHolder的
算法之时间复杂度周凡杨 java 算法时间复杂度效率
在计算机科学中，算法的时间复杂度是一个函数，它定量描述了该算法的运行时间。这是一个关于代表算法输入值的字符串的长度的函数。时间复杂度常用大O符号表述，不包括这个函数的低阶项和首项系数。使用这种方式时，时间复杂度可被称为是渐近的，它考察当输入值大小趋近无穷时的情况。这样用大写O()来体现算法时间复杂度的记法，
Java事务处理 g21121 java
一、什么是Java事务通常的观念认为，事务仅与数据库相关。事务必须服从ISO/IEC所制定的ACID原则。ACID是原子性（atomicity）、一致性（consistency）、隔离性（isolation）和持久性（durability）的缩写。事务的原子性表示事务执行过程中的任何失败都将导致事务所做的任何修改失效。一致性表示当事务执行失败时，所有被该事务影响的数据都应该恢复到事务执行前的状
Linux awk命令详解 510888780 linux
一. AWK 说明 awk是一种编程语言，用于在linux/unix下对文本和数据进行处理。数据可以来自标准输入、一个或多个文件，或其它命令的输出。它支持用户自定义函数和动态正则表达式等先进功能，是linux/unix下的一个强大编程工具。它在命令行中使用，但更多是作为脚本来使用。 awk的处理文本和数据的方式：它逐行扫描文件，从第一行到
android permission 布衣凌宇 Permission
<uses-permission android:name="android.permission.ACCESS_CHECKIN_PROPERTIES" ></uses-permission>允许读写访问"properties"表在checkin数据库中，改值可以修改上传 <uses-permission android:na
Oracle和谷歌Java Android官司将推迟 aijuans java oracle
北京时间 10 月 7 日，据国外媒体报道，Oracle 和谷歌之间一场等待已久的官司可能会推迟至 10 月 17 日以后进行，这场官司的内容是 Android 操作系统所谓的 Java 专利权之争。本案法官 William Alsup 称根据专利权专家 Florian Mueller 的预测，谷歌 Oracle 案很可能会被推迟。　　该案中的第二波辩护被安排在 10 月 17 日出庭，从目前看来
linux shell 常用命令 antlove linux shell command
grep [options] [regex] [files] /var/root # grep -n "o" * hello.c:1:/* This C source can be compiled with:
Java解析XML配置数据库连接(DOM技术连接 SAX技术连接) 百合不是茶 sax技术 Java解析xml文档 dom技术 XML配置数据库连接
XML配置数据库文件的连接其实是个很简单的问题,为什么到现在才写出来主要是昨天在网上看了别人写的,然后一直陷入其中,最后发现不能自拔所以今天决定自己完成 ,,,,现将代码与思路贴出来供大家一起学习 XML配置数据库的连接主要技术点的博客; JDBC编程 : JDBC连接数据库 DOM解析XML: DOM解析XML文件 SA
underscore.js 学习（二） bijian1013 JavaScript underscore
Array Functions 所有数组函数对参数对象一样适用。1.first _.first(array, [n]) 别名: head, take 返回array的第一个元素，设置了参数n，就
plSql介绍 bijian1013 oracle 数据库 plsql
/* * PL/SQL 程序设计学习笔记 * 学习plSql介绍.pdf * 时间：2010-10-05 */ --创建DEPT表 create table DEPT ( DEPTNO NUMBER(10), DNAME NVARCHAR2(255), LOC NVARCHAR2(255) ) delete dept; select
【Nginx一】Nginx安装与总体介绍 bit1129 nginx
启动、停止、重新加载Nginx nginx 启动Nginx服务器，不需要任何参数u nginx -s stop 快速(强制)关系Nginx服务器 nginx -s quit 优雅的关闭Nginx服务器 nginx -s reload 重新加载Nginx服务器的配置文件 nginx -s reopen 重新打开Nginx日志文件
spring mvc开发中浏览器兼容的奇怪问题 bitray jquery Ajax springMVC 浏览器上传文件
最近个人开发一个小的OA项目,属于复习阶段.使用的技术主要是spring mvc作为前端框架,mybatis作为数据库持久化技术.前台使用jquery和一些jquery的插件. 在开发到中间阶段时候发现自己好像忽略了一个小问题,整个项目一直在firefox下测试,没有在IE下测试,不确定是否会出现兼容问题.由于jquer
Lua的io库函数列表 ronin47 lua io
1、io表调用方式：使用io表，io.open将返回指定文件的描述，并且所有的操作将围绕这个文件描述　　io表同样提供三种预定义的文件描述io.stdin,io.stdout,io.stderr 　　2、文件句柄直接调用方式,即使用file:XXX()函数方式进行操作,其中file为io.open()返回的文件句柄　　多数I/O函数调用失败时返回nil加错误信息,有些函数成功时返回nil
java-26-左旋转字符串 bylijinnan java
public class LeftRotateString { /** * Q 26 左旋转字符串 * 题目：定义字符串的左旋转操作：把字符串前面的若干个字符移动到字符串的尾部。 * 如把字符串abcdef左旋转2位得到字符串cdefab。 * 请实现字符串左旋转的函数。要求时间对长度为n的字符串操作的复杂度为O(n)，辅助内存为O(1)。 */ pu
《vi中的替换艺术》-linux命令五分钟系列之十一 cfyme linux命令
vi方面的内容不知道分类到哪里好，就放到《Linux命令五分钟系列》里吧！今天编程，关于栈的一个小例子，其间我需要把”S.”替换为”S->”(替换不包括双引号)。其实这个不难，不过我觉得应该总结一下vi里的替换技术了，以备以后查阅。 1 所有替换方案都要在冒号“:”状态下书写。 2 如果想将abc替换为xyz，那么就这样 :s/abc/xyz/ 不过要特别
[轨道与计算]新的并行计算架构 comsci 并行计算
我在进行流程引擎循环反馈试验的过程中，发现一个有趣的事情。。。如果我们在流程图的每个节点中嵌入一个双向循环代码段，而整个流程中又充满着很多并行路由，每个并行路由中又包含着一些并行节点，那么当整个流程图开始循环反馈过程的时候，这个流程图的运行过程是否变成一个并行计算的架构呢？
重复执行某段代码 dai_lm android
用handler就可以了 private Handler handler = new Handler(); private Runnable runnable = new Runnable() { public void run() { update(); handler.postDelayed(this, 5000); } }; 开始计时 h
Java实现堆栈（list实现） datageek 数据结构——堆栈
public interface IStack<T> { //元素出栈，并返回出栈元素 public T pop(); //元素入栈 public void push(T element); //获取栈顶元素 public T peek(); //判断栈是否为空 public boolean isEmpty
四大备份MySql数据库方法及可能遇到的问题 dcj3sjt126com DB backup
一：通过备份王等软件进行备份前台进不去？用备份王等软件进行备份是大多老站长的选择，这种方法方便快捷，只要上传备份软件到空间一步步操作就可以，但是许多刚接触备份王软件的客用户来说还原后会出现一个问题：因为新老空间数据库用户名和密码不统一，网站文件打包过来后因没有修改连接文件，还原数据库是好了，可是前台会提示数据库连接错误，网站从而出现打不开的情况。解决方法：学会修改网站配置文件，大多是由co
github做webhooks：[1]钩子触发是否成功测试 dcj3sjt126com github git webhook
转自: http://jingyan.baidu.com/article/5d6edee228c88899ebdeec47.html github和svn一样有钩子的功能，而且更加强大。例如我做的是最常见的push操作触发的钩子操作，则每次更新之后的钩子操作记录都会在github的控制板可以看到！工具/原料 github 方法/步骤
">的作用" target="_blank">JSP中的作用蕃薯耀
JSP中<base href="<%=basePath%>">的作用 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
linux下SAMBA服务安装与配置 hanqunfeng linux
局域网使用的文件共享服务。一.安装包： rpm -qa | grep samba samba-3.6.9-151.el6.x86_64 samba-common-3.6.9-151.el6.x86_64 samba-winbind-3.6.9-151.el6.x86_64 samba-client-3.6.9-151.el6.x86_64 samba-winbind-clients
guava cache IXHONG cache
缓存，在我们日常开发中是必不可少的一种解决性能问题的方法。简单的说，cache 就是为了提升系统性能而开辟的一块内存空间。　　缓存的主要作用是暂时在内存中保存业务系统的数据处理结果，并且等待下次访问使用。在日常开发的很多场合，由于受限于硬盘IO的性能或者我们自身业务系统的数据处理和获取可能非常费时，当我们发现我们的系统这个数据请求量很大的时候，频繁的IO和频繁的逻辑处理会导致硬盘和CPU资源的
Query的开始--全局变量,noconflict和兼容各种js的初始化方法 kvhur JavaScript jquery css
这个是整个jQuery代码的开始，里面包含了对不同环境的js进行的处理，例如普通环境，Nodejs，和requiredJs的处理方法。还有jQuery生成$, jQuery全局变量的代码和noConflict代码详解完整资源： http://www.gbtags.com/gb/share/5640.htm jQuery 源码： (
美国人的福利和中国人的储蓄 nannan408
今天看了篇文章，震动很大，说的是美国的福利。美国医院的无偿入院真的是个好措施。小小的改善，对于社会是大大的信心。小孩，税费等，政府不收反补，真的体现了人文主义。美国这么高的社会保障会不会使人变懒？答案是否定的。正因为政府解决了后顾之忧，人们才得以倾尽精力去做一些有创造力，更造福社会的事情，这竟成了美国社会思想、人
N阶行列式计算(JAVA) qiuwanchi N阶行列式计算
package gaodai; import java.util.List; /** * N阶行列式计算 * @author 邱万迟 * */ public class DeterminantCalculation { public DeterminantCalculation(List<List<Double>> determina
C语言算法之打渔晒网问题 qiufeihu c 算法
如果一个渔夫从2011年1月1日开始每三天打一次渔，两天晒一次网，编程实现当输入2011年1月1日以后任意一天，输出该渔夫是在打渔还是在晒网。代码如下： #include <stdio.h> int leap(int a) /*自定义函数leap()用来指定输入的年份是否为闰年*/ { if((a%4 == 0 && a%100 != 0
XML中DOCTYPE字段的解析 wyzuomumu xml
DTD声明始终以!DOCTYPE开头,空一格后跟着文档根元素的名称,如果是内部DTD,则再空一格出现[],在中括号中是文档类型定义的内容. 而对于外部DTD,则又分为私有DTD与公共DTD,私有DTD使用SYSTEM表示,接着是外部DTD的URL. 而公共DTD则使用PUBLIC,接着是DTD公共名称,接着是DTD的URL. 私有DTD <!DOCTYPErootSYST