[论文精读] [NeRF] [AAAI 2023] One is All: Bridging the Gap Between Neural Radiance Fields Architectures


One is All: Bridging the Gap Between Neural Radiance Fields Architectures

  • Abstract
  • Motivations
  • Contributions
  • Method
    • Overview
    • Preliminaries
      • NeRF [R1]
      • Plenoxels [R2]
      • TensoRF [R3]
      • INGP [R4]
    • PVD: Progressive Volume Distillation
      • Loss Design
      • Density Range Constraint
      • Block-wise Distillation
  • Experiments
    • Results
    • Ablation Study
  • Paper Notes
  • References

Abstract

Neural Radiance Fields (NeRF) methods have proved effective as compact, high-quality and versatile representations for 3D scenes, and enable downstream tasks such as editing, retrieval, navigation, etc. Various neural architectures are vying for the core structure of NeRF, including the plain Multi-Layer Perceptron (MLP), sparse tensors, low-rank tensors, hashtables and their compositions. Each of these representations has its particular set of trade-offs. For example, the hashtable-based representations admit faster training and rendering but their lack of clear geometric meaning hampers downstream tasks like spatial-relation-aware editing. In this paper, we propose Progressive Volume Distillation (PVD), a systematic distillation method that allows any-to-any conversions between different architectures, including MLP, sparse or low-rank tensors, hashtables and their compositions. PVD consequently empowers downstream applications to optimally adapt the neural representations for the task at hand in a post hoc fashion. The conversions are fast, as distillation is progressively performed on different levels of volume representations, from shallower to deeper. We also employ special treatment of density to deal with its specific numerical instability problem. Empirical evidence is presented to validate our method on the NeRF-Synthetic, LLFF and TanksAndTemples datasets. For example, with PVD, an MLP-based NeRF model can be distilled from a hashtable-based Instant-NGP model at a 10×∼20× faster speed than training the original NeRF from scratch, while achieving a superior level of synthesis quality.


Motivations

Various neural architectures are vying for the core structure of NeRF, including the plain Multi-Layer Perceptron (MLP), sparse tensors, low-rank tensors, hashtables and their compositions


Due to the diversity of downstream tasks of NVS, there is no single answer as to which representation is the best. The particular choice would depend on the specific application scenarios and the available hardware computation capabilities.


Instead of focusing on an ideal alternative representation that embraces the advantages of all variants, we propose a method to achieve arbitrary conversions between known NeRF architectures, including MLPs, sparse tensors, low-rank tensors, hash tables and combinations thereof.


Such flexible conversions can obviously bring the following advantages. Firstly, the study would throw insights into the modeling capabilities and limitations of the already rich and ever-growing constellation of architectures of NeRF. Secondly, the possibility of such conversions would free the designer from the burden of pinning down architectures beforehand, as now they can simply adapt a trained model agilely to other architectures to meet the needs of later discovered application scenarios. Last but not least, complementary benefits may be leveraged in cases where teacher and student are of different attributes.


Contributions

  • We propose PVD, a distillation framework that allows conversions between different NeRF architectures, including the MLP, sparse tensor, low-rank tensor and hash table architectures. To the best of our knowledge, this is the first systematic attempt at such conversions.
  • In PVD, we build a block-wise distillation strategy to accelerate the training procedure based on a unified view of different NeRF architectures. We also employ a special treatment of the dynamic density volume range by clipping, which improves the training stability and significantly improves the synthesis quality.
  • As concrete examples, we find that distillation from hashtable and VM-decomposition structures often either helps boost the student model's synthesis quality or consumes less time than training from scratch. A particularly beneficial case is when a NeRF student model is distilled from an INGP teacher.

Method

Overview


With PVD, given one trained NeRF model, different NeRF architectures, such as sparse tensors, MLPs, low-rank tensors and hash tables, can be obtained quickly through distillation. Losses on intermediate volume representations (shown as double-arrow symbols), such as the output of $\varphi_*^1$, color and density, are used alongside the final rendered RGB to accelerate distillation.


Our method aims to achieve mutual conversions between different architectures of Neural Radiance Fields. Since there is an ever-increasing number of such architectures, we will not attempt to achieve these conversions one by one. Rather, we first formulate typical architectures in a unified form and then design a systematic distillation scheme based on this unified view. The architectures for which we derive formulas include implicit representations such as the MLP in NeRF, explicit representations such as the sparse tensors in Plenoxels, and two hybrid representations: hash tables (in INGP) and low-rank tensors (the VM-decomposition in TensoRF). Once formulated, any-to-any conversion between these architectures and their compositions is possible.


Preliminaries

NeRF [R1]

Eq. (1): the MLP-based formulation of NeRF, which predicts density and view-dependent color from encoded position and direction.
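
For reference, the standard volume rendering used by NeRF [R1] (and inherited by the other architectures) composites per-sample densities and colors along each ray; the $\exp(-\sigma_i\delta_i)$ term reappears later in the density-range discussion:

$$\hat{C}(\mathbf{r}) = \sum_{i=1}^{N} T_i \left(1 - e^{-\sigma_i \delta_i}\right)\mathbf{c}_i, \qquad T_i = \exp\Big(-\sum_{j=1}^{i-1}\sigma_j\delta_j\Big),$$

where $\sigma_i$ and $\mathbf{c}_i$ are the density and color of the $i$-th sample on ray $\mathbf{r}$, and $\delta_i$ is the distance between adjacent samples.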

Plenoxels [R2]

Eq. (2): the Plenoxels formulation, which stores density and spherical-harmonics coefficients directly in a sparse voxel grid.

TensoRF [R3]

Eq. (3): the TensoRF formulation, which factorizes the feature volume into low-rank vector-matrix (VM) components.

INGP [R4]

Eq. (4): the Instant-NGP formulation, which looks up features from multiresolution hash tables and decodes them with a small MLP.

PVD: Progressive Volume Distillation

Given a trained model, our task is to distill it into other models, possibly with different architectures. In PVD, we design a volume-aligned loss and build a block-wise distillation strategy to accelerate the training procedure, based on a unified view of different NeRF architectures. We also employ a special treatment of the dynamic density volume range by clipping, which improves training stability and significantly improves the synthesis quality.


Loss Design

In our method, we use not only the rendered RGB but also the density, the color and an additional intermediate feature to compute losses between different structures.


We observe that the implicit and explicit structures in hybrid representations are naturally separated and correspond to different learning objectives. Therefore, we split every model into a similar two-part form so that the corresponding parts can be aligned during distillation.


Specifically, given a model $\varphi_*$, we represent it as a cascade of two modules as follows:


Eq. (5): the two-level cascade $\varphi_* = \varphi_*^2 \circ \varphi_*^1$.

The division of each architecture under our unified two-level view. Regarding NeRF, K=4 is used by default in this paper.


Here $*$ can be either a teacher or a student. For hybrid representations, we directly regard the explicit part as $\varphi_*^1$ and the implicit part as $\varphi_*^2$. For a purely implicit representation, we divide the network by depth into two parts with a similar number of layers, and denote the former part as $\varphi_*^1$ and the latter part as $\varphi_*^2$. As for the purely explicit representation Plenoxels, we still formulate it into two parts by letting $\varphi_*^1$ be the identity, though it can be converted without splitting. Based on this splitting, we design volume-aligned losses as follows:


Eq. (6): the volume-aligned loss $L_2^v$, an MSE between the corresponding intermediate outputs of the teacher and the student.

In essence, the reason for designing this loss is that models of different forms can be mapped to the same space that represents the scene. Our experiments show that this volume-aligned loss accelerates the distillation and improves quality significantly.

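
As an illustration of the two-level split and the volume-aligned loss, here is a minimal PyTorch-style sketch. It is not the authors' released code; class and function names such as `TwoLevelField` and `volume_aligned_loss` are hypothetical, and the intermediate feature is assumed to have matching dimensionality between teacher and student.

```python
import torch
import torch.nn.functional as F


class TwoLevelField(torch.nn.Module):
    """Unified two-level view of a radiance field:
    phi1 maps a 3D position to an intermediate volume feature,
    phi2 maps that feature (plus the view direction) to density and color."""

    def __init__(self, phi1: torch.nn.Module, phi2: torch.nn.Module):
        super().__init__()
        self.phi1 = phi1  # explicit part (voxel grid / hash table / tensor) or first MLP half
        self.phi2 = phi2  # implicit part (small MLP) or second MLP half

    def forward(self, x, d):
        feat = self.phi1(x)                # intermediate volume representation
        sigma, color = self.phi2(feat, d)  # density and view-dependent color
        return feat, sigma, color


def volume_aligned_loss(student: TwoLevelField, teacher: TwoLevelField, x):
    """MSE between the intermediate features of the student and the frozen
    teacher at the same sampled positions (the volume-aligned term)."""
    with torch.no_grad():
        feat_t = teacher.phi1(x)
    feat_s = student.phi1(x)
    return F.mse_loss(feat_s, feat_t)
```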

The complete loss function used during distillation is as follows:
Eq. (7): the total distillation loss, combining $L_2^v$, $L_\sigma$, $L_c$, $L_{rgb}$ and $L_{reg}$.

where $L_\sigma$, $L_c$ and $L_{rgb}$ denote the density loss, the color loss and the RGB loss respectively, and $L_2$ is the mean-squared error (MSE). The last term $L_{reg}$ is a regularization term that depends on the form of the student model: for Plenoxels and VM-decomposition students we add an $L_1$ sparsity loss and a total variation (TV) regularization loss. Note that for Plenoxels, owing to its purely explicit representation, we only apply the density, color, RGB and regularization losses.

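
Continuing the sketch above, the point-wise part of the objective can be assembled roughly as follows. The loss weights are placeholders rather than the paper's values, and the rendered-RGB term and $L_{reg}$ are only indicated by comments.

```python
def distillation_loss(student, teacher, x, d,
                      w_feat=1.0, w_sigma=1.0, w_color=1.0):
    """Point-wise PVD-style losses between a frozen teacher and a student;
    both are TwoLevelField instances from the previous sketch."""
    with torch.no_grad():
        feat_t, sigma_t, color_t = teacher(x, d)
    feat_s, sigma_s, color_s = student(x, d)

    loss = w_feat * F.mse_loss(feat_s, feat_t)             # volume-aligned term (skipped for Plenoxels, whose phi1 is the identity)
    loss = loss + w_sigma * F.mse_loss(sigma_s, sigma_t)   # density term (clipped in practice, see the next subsection)
    loss = loss + w_color * F.mse_loss(color_s, color_t)   # view-dependent color term
    # On top of this come the rendered-RGB loss over rays and L_reg
    # (L1 sparsity / TV for Plenoxels and VM-decomposition students).
    return loss
```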

Density Range Constraint

We found that the density loss on $\sigma$ is hard to optimize directly, and we attribute this problem to its specific numerical instability. The density reflects the light transmittance at a point in space: once $\sigma$ is greater (or smaller) than a certain value, its physical meaning no longer changes (the point is completely opaque or completely transparent). The value range of $\sigma$ produced by a teacher can therefore be very wide, while in fact only one interval of density values plays a key role. Based on this, we limit the numerical range of $\sigma$ to $[a, b]$. The density loss $L_2^\sigma$ is then calculated as follows:


Eq. (8): the clipped density loss $L_2^\sigma$, an MSE computed after clipping the densities to $[a, b]$.
Choice of the numerical range of $\sigma$ for the hotdog scene: $[-2, 7]$.

According to our experiments, this restriction has a negligible impact on the teacher's performance while bringing a tremendous benefit to the distillation.


We also considered directly applying the density loss to $\exp(-\sigma_i\delta_i)$, but found this inefficient: the gradient of the exponential saturates more easily, and it requires computing an exponent, which increases the amount of computation when the block-wise strategy is used.

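
A minimal sketch of the clipped density term, under the assumption that both teacher and student densities are clamped to the same interval $[a, b]$ before the MSE; the exact form of eq. (8) may differ, and the default interval shown is the one reported for the hotdog scene.

```python
def clipped_density_loss(sigma_s, sigma_t, a=-2.0, b=7.0):
    """MSE on densities after clamping both student and teacher values to
    [a, b], so that physically equivalent extremes (fully transparent or
    fully opaque) do not destabilize the optimization."""
    return F.mse_loss(torch.clamp(sigma_s, a, b), torch.clamp(sigma_t, a, b))
```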

Block-wise Distillation

During volume rendering, most of the computation occurs in forwarding the MLP for each sampled point and integrating the outputs along each ray. Such a heavy process slows down training and distillation significantly.


In our PVD, thanks to the design of $L_2^v$, we can apply a block-wise strategy to avoid this problem. Specifically, we only forward stage 1 at the beginning of training, and then run stage 2 and stage 3 in turn.


Consequently, the student and the teacher do not need to forward the complete network and render RGB in the early stages of training. In our experiments, the conversion from INGP to NeRF can be completed in tens of minutes, whereas it used to take several hours.

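
A rough sketch of such a staged schedule is shown below, reusing the helpers sketched earlier. The mapping of stages to loss terms (stage 1 aligns only the $\varphi^1$ features, stage 2 adds the point-wise density/color terms, stage 3 adds rendered RGB over rays) is an interpretation of the description above, and `rendered_rgb_loss`, `sample_points` and `sample_rays` are hypothetical helpers.

```python
def train_pvd(student, teacher, optimizer, n_iters,
              stage1_end, stage2_end, sample_points, sample_rays,
              rendered_rgb_loss):
    """Block-wise distillation loop: early iterations only touch the cheap
    first level; full volume rendering is deferred to the last stage."""
    for it in range(n_iters):
        x, d = sample_points()  # pseudo-data, labeled by the frozen teacher
        if it < stage1_end:                    # stage 1: feature alignment only
            loss = volume_aligned_loss(student, teacher, x)
        elif it < stage2_end:                  # stage 2: + point-wise density/color losses
            loss = distillation_loss(student, teacher, x, d)
        else:                                  # stage 3: + rendered-RGB loss over sampled rays
            loss = distillation_loss(student, teacher, x, d)
            loss = loss + rendered_rgb_loss(student, teacher, sample_rays())
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```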

Experiments

Teacher models are trained on the NeRF-Synthetic dataset, the forward-facing dataset (LLFF) and the TanksAndTemples dataset.

In the distillation stage, we find it sufficient to use the teacher to generate fake data, as in pseudo-labeling, without touching any of the training data.

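
In other words, distillation needs no ground-truth images: inputs are sampled inside the scene bounds and labeled by the frozen teacher. A small sketch of such a pseudo-data sampler follows; the bounding-box size and batch size are illustrative, and the teacher is any frozen `TwoLevelField` from the earlier sketch.

```python
def sample_pseudo_batch(teacher, batch_size=8192, bound=1.5, device="cuda"):
    """Draw random 3D positions inside the scene's bounding box and random
    unit view directions, then query the frozen teacher for pseudo-labels."""
    x = (torch.rand(batch_size, 3, device=device) * 2.0 - 1.0) * bound
    d = F.normalize(torch.randn(batch_size, 3, device=device), dim=-1)
    with torch.no_grad():
        feat_t, sigma_t, color_t = teacher(x, d)
    return x, d, (feat_t, sigma_t, color_t)
```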

Results

Since this is the first method proposed for converting between different NeRF representations, there is no directly comparable baseline. Our experiments therefore mainly focus on whether the conversion between different models can maintain the performance of the teacher or the student's own upper limit.

Our method is highly effective for these conversions. When a model is converted into another form, its performance is close to that of a model trained from scratch or to that of the teacher, which shows that the common radiance-field representations can indeed be converted into one another. In addition, PVD exhibits excellent, nearly lossless performance when distilling between identical architectures.

Max(diff1, diff2) is very close to 0, which means that the model obtained by distillation approaches the performance of the teacher or of training from scratch; our method transfers knowledge from the teacher to the student as completely as possible.
The method preserves both the synthesis quality and the accuracy of the scene's depth information.

The results on the other two datasets also confirm the effectiveness of the method. Moreover, the model obtained by our distillation outperforms its original from-scratch implementation on the TanksAndTemples dataset. This is mainly because PVD provides the student with more prior information, which makes training more effective and fully exploits the student's expressive capacity.
Our method obtains a NeRF model significantly faster than training it from scratch (the teacher here is a VM-decomposition-based representation).

Ablation Study

The ablation study shows how much each component contributes to performance. We run the VM-decomposition-to-MLP conversion on the NeRF-Synthetic dataset. It can be seen that:

  1. The intermediate-feature (volume-aligned) loss we design brings an improvement of about 0.9 dB PSNR.
  2. Performance drops sharply without the constraint on the density values.
  3. Distillation without the block-wise strategy performs poorly under the same training-time budget.

Paper Notes

  1. What problem is addressed in the paper?
    ANS: Any-to-any conversions between different neural architectures.
  2. Is it a new problem? If so, why does it matter? If not, why does it still matter?
    ANS: Yes. This is the first systematic attempt at such conversions.
  3. What is the key to the solution? What is the main contribution?
    ANS:
    (1) A distillation framework that allows conversions between different NeRF architectures.
    (2) Block-wise distillation strategy to accelerate the training procedure based on a unified view of different NeRF architectures.
    (3) Clipping the dynamic density range for training stability.
  4. How the experiments sufficiently support the claims?
    ANS: The conversions are shown to be effective, and distillation is nearly lossless between identical architectures.
  5. What can we learn from ablation studies?
    ANS:
    (1) Volume-aligned loss brings improvement.
    (2) Performance drops sharply without the restriction on the density values.
    (3) Without block-wise distillation, performance is poor under the same training-time budget.
  6. Potential fundamental flaws; how this work can be improved?
    ANS:
    (1) The performance of student models is generally upper-bounded by the performance of the teacher models.
    (2) As both the teacher and the student model need to be active during training, memory and computation costs increase accordingly.

References

Paper: https://arxiv.org/abs/2211.15977
Project Page: https://sk-fun.fun/PVD
Code: https://github.com/megvii-research/AAAI2023-PVD

Related works
[R1] NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
[R2] Plenoxels: Radiance Fields without Neural Networks
[R3] TensoRF: Tensorial Radiance Fields
[R4] Instant Neural Graphics Primitives with a Multiresolution Hash Encoding
