氵文大师

[论文摘抄] The Transformer Network for the Traveling Salesman Problem

The Transformer Network for the Traveling Salesman Problem

https://arxiv.org/pdf/2103.03012.pdf

Bresson X, Laurent T. The transformer network for the traveling salesman problem[J]. arXiv preprint arXiv:2103.03012, 2021.

@article{bresson2021transformer,
  title={The transformer network for the traveling salesman problem},
  author={Bresson, Xavier and Laurent, Thomas},
  journal={arXiv preprint arXiv:2103.03012},
  year={2021}
}

1. 摘要中的部分：

TSP概况：
The Traveling Salesman Problem (TSP) is the most popular and most studied combinatorial problem, starting with von Neumann in 1951. It has driven the discovery of several optimization techniques such as cutting planes, branch-andbound, local search, Lagrangian relaxation, and simulated annealing.

本文核心问题：
The main question is whether deep learning can learn better heuristics from data, i.e. replacing human-engineered heuristics?

本文解决方法
In this work, we propose to adapt the recent successful Transformer architecture originally developed for natural language processing to the combinatorial TSP. Training is done by reinforcement learning, hence without TSP training solutions, and decoding uses beam search.

beam search 柱搜索(暂时不懂)

2. 引言的部分 [Traditional TSP Solvers]

解决COP的两个方法：

There exist two traditional approaches to tackle combinatorial problems;
exact algorithms and approximate/heuristic algorithms.

精确：

Exact algorithms are guaranteed(保证) to find optimal solutions, but they become intractable when n grows.

近似算法以最优性换取计算效率：

Approximate algorithms trade optimality for computational efficiency. They are problem-specific, often designed by iteratively applying a simple man-crafted rule, known as heuristic. Their complexity is polynomial and their quality depends on an approximate ratio that characterizes the worst/average-case error w.r.t the optimal solution.
它们的质量取决于一个近似比率，该比率表征了最佳解决方案的最坏/平均情况误差

精确的例子：
Exact algorithms for TSP are given by exhaustive search(穷举搜索), Dynamic or Integer Programming.

A Dynamic Programming algorithm was proposed for TSP in [16] with $O(n^22^n)$ complexity, which becomes intractable(棘手的,不可接受的) for n > 40.
A general purpose Integer Programming (IP) solver with Cutting Planes (CP) and Branch-and-Bound (BB) called Gurobi was introduced in [15].
Finally, a highly specialized linear IP+CP+BB, namely Concorde, was designed in [2].
Concorde is widely regarded as the fastest exact TSP solver, for large instances, currently in existence.
Concorde 被广泛认为是目前存在的大型实例中最快的精确 TSP 求解器。

近似/启发式的方法

Several approximate/heuristic algorithms have been introduced.

Christofides algorithm [7] approximates TSP with Minimum Spanning Trees(最小生成树).
The algorithm has a polynomial-time complexity with $O(n^2 log n)$ , and is guaranteed to find a solution within a factor 3/2 of the optimal solution.

Farthest/nearest/greedy insertion algorithms [20] have complexity $O(n^2)$ , and farthest insertion (the best insertion in practice) has an approximation ratio of 2.43.

Google OR-Tools [14] is a highly optimized program that solves TSP and a larger set of vehicle routing problems(路径规划问题). This program applies different heuristics s.a. Simulated Annealing, Greedy Descent, Tabu Search, to navigate in the search space, and refines the solution by Local Search techniques.

OR-Tools使用指南：https://developers.google.com/optimization/introduction/overview

2-Opt algorithm [27, 21] proposes an heuristic based on a move that replaces two edges to reduce the tour length. The complexity is $O(n^2m(n))$ , where $n^2$ is the number of node pairs and $m (n)$ is the number of times all pairs must be tested to reach a local minimum (with worst-case being $O(2^{n/2}))$ . The approximation ratio is $4/\sqrt{n}$ . Extension to 3-Opt move (replacing 3 edges) and more have been proposed in [6].

Finally, LKH-3 algorithm [18] introduces the best heuristic for solving TSP. It is an extension of the original LKH [28] and LKH-2 [17] based on 2-Opt/3-Opt where edge candidates are estimated with a Minimum Spanning Tree [17]. LKH-3 can tackle various TSP-type problems.

3. 引言的部分 [Neural Network Solvers]

学习到的特征替换人工特征

In the last decade, Deep learning (DL) has significantly improved Computer Vision, Natural Language Processing and Speech Recognition by replacing hand-crafted visual/text/speech features by features learned from data [26].

[26] LeCun Y, Bengio Y, Hinton G. Deep learning[J]. nature, 2015, 521(7553): 436-444.

@article{lecun2015deep,
  title={Deep learning},
  author={LeCun, Yann and Bengio, Yoshua and Hinton, Geoffrey},
  journal={nature},
  volume={521},
  number={7553},
  pages={436--444},
  year={2015},
  publisher={Nature Publishing Group}
}

这里又把摘要展开说了说：

核心问题：For combinatorial problems, the main question is whether DL can learn better heuristics from data than hand-crafted heuristics?

This is attractive because developing algorithms to tackle efficiently NP-hard problems require years of research (TSP has been actively studied for seventy years).

这句好地道：

The last five years have seen the emergence of promising techniques
在过去的五年中，出现了有前途的技术
where (graph) neural networks have been capable to learn new combinatorial algorithms with supervised or reinforcement learning.
其中（图）神经网络已经能够通过监督或强化学习来学到新的组合算法。

总结一下近几年的工作:
We briefly summarize this line of work below.

先跳过了，让我先看看本文的方法

HopfieldNets [19]: First Neural Network designed to solve (small) TSPs.
PointerNets [39]: A pioneer work using modern DL to tackle TSP and combinatorial optimization
problems. This work combines recurrent networks to encode the cities and decode the sequence of nodes in the tour, with the attention mechanism. The network structure is similar to [3], which was applied to NLP with great success. The decoding is auto-regressive and the network parameters are learned by supervised learning with approximate TSP solutions.
PointerNets+RL [5]: The authors improve [39] with Reinforcement Learning (RL) which eliminates the requirement of generating TSP solutions as supervised training data. The tour length is used as reward. Two RL approaches are studied; a standard unbiased reinforce algorithm [40], and an active search algorithm that can explore more candidates.
Order-invariant PointerNets+RL [33]: The original network [39] is not invariant by permutations
of the order of the input cities (which is important for NLP but not for TSP). This requires [39] to
randomly permute the input order to let the network learn this invariance. The work [33] solves this issue by making the encoder permutation-invariant.
S2V-DQN [9]: This model is a graph network that takes a graph and a partial tour as input, and
outputs a state-valued function Q to estimate the next node in the tour. Training is done by RL
and memory replay [31], which allows intermediate rewards that encourage farthest node insertion heuristic.
Quadratic Assignment Problem [34]: TSP can be formulated as a QAP, which is NP-hard and also hard to approximate. A graph network based on the powers of adjacency matrix of node distances is trained in supervised manner. The loss is the KL distance between the adjacency matrix of the ground truth cycle and its network prediction. A feasible tour is computed with beam search.
Permutation-invariant Pooling Network [23]: This work solves a variant of TSP with multiple
salesmen. The network is trained by supervised learning and outputs a fractional solution, which is transformed into a feasible integer solution by beam search. The approach is non-autoregressive, i.e. single pass.
Tranformer-encoder+2-Opt heuristic [11]: The authors use a standard transformer to encode the
cities and they decode sequentially with a query composed of the last three cities in the partial
tour. The network is trained with Actor-Critic RL, and the solution is refined with a standard 2-Opt
heuristic.
Tranformer-encoder+Attention-decoder [25]: This work also uses a standard transformer to encode the cities and the decoding is sequential with a query composed of the first city, the last city in the partial tour and a global representation of all cities. Training is carried out with reinforce and a deterministic baseline.
GraphConvNet [22]: This work learns a deep graph network by supervision to predict the probabilities of an edge to be in the TSP tour. A feasible tour is generated by beam search. The approach uses a single pass.
2-Opt Learning [41]: The authors design a transformer-based network to learn to select nodes
for the 2-Opt heuristics (original 2-Opt may require $O(2^{n/2})$ moves before stopping). Learning is
performed by RL and actor-critic.
GNNs with Monte Carlo Tree Search [42]: A recent work based on AlphaGo [35] which augments a graph network with MCTS to improve the search exploration of tours by evaluating multiple next node candidates in the tour. This improves the search exploration of auto-regressive methods, which cannot go back once the selection of the nodes is made.

4. 方法架构 [Proposed Architecture]

1. 方法概述

将TSP视为翻译问题，源语言是一个2D点集，目标语言是一个最短的 tour（索引的序列），本文使用原始的 Transformers 来解决TSP问题

We cast TSP as a “translation” problem where the source “language” is a set of 2D points and
the target “language” is a tour (sequence of indices) with minimal length, and adapt the original
Transformers [37] to solve this problem.

用RL方法训练，reward 是 tour 的长度
如果训练网络在一组随机 TSP 上改进了Baseline，则 Baseline 会相应地更新。

We train by reinforcement learning, with the same setting as [25]. The reward is the tour length and the baseline is simply updated if the train network improves the baseline on a set of random TSPs.

整个架构图如下：

2. Encoder.

本文使用标准 Transformer 模型，只是使用BN，而不是LN

It is a standard Transformer encoder with multi-head attention and residual connection. The
only difference is the use of batch normalization, instead of layer normalization. The memory/speed
complexity is $O(n^2)$ .

Formally, the encoder equations are (when considering a single head for an easier description)

$H^{\rm{enc}}=H^{\ell=L^{\mathrm{enc}}} \in \mathbb{R}^{(n+1) \times d},$

where,
$\begin{aligned} H^{\ell=0} &=\operatorname{Concat}(z, X) \in \mathbb{R}^{(n+1) \times 2}, z \in \mathbb{R}^{2}, X \in \mathbb{R}^{n \times 2}, \\ H^{\ell+1} &=\operatorname{softmax}\left(\frac{Q^{\ell} K^{\ell^{T}}}{\sqrt{d}}\right) V^{\ell} \in \mathbb{R}^{(n+1) \times d} \\ Q^{\ell} &=H^{\ell} W_{Q}^{\ell} \in \mathbb{R}^{(n+1) \times d}, W_{Q}^{\ell} \in \mathbb{R}^{d \times d} \\ K^{\ell} &=H^{\ell} W_{K}^{\ell} \in \mathbb{R}^{(n+1) \times d}, W_{K}^{\ell} \in \mathbb{R}^{d \times d} \\ V^{\ell} &=H^{\ell} W_{V}^{\ell} \in \mathbb{R}^{(n+1) \times d}, W_{V}^{\ell} \in \mathbb{R}^{d \times d} \end{aligned}$

where $z$ is a start token, initialized at random.

有个以为 $H^{\ell=0}$ 是怎么变成 $H^{\ell}$ 的，从 $2$ 变成了 $d$ ??
按理说应该是有个线性投射层吧 $2$ 转化成 $d$ 的，一会儿看看代码咋写的

下图是编码器的图例

3. Decoder.

The decoding is auto-regressive, one city at a time. Suppose we have decoded the first t
cities in the tour, and we want to predict the next city.

The decoding process is composed of 4 steps detailed below and illustrated on 下图.

Decoder – Part 1

The decoding starts with the encoding of the previously selected $i_t$ city :

$\begin{aligned} h_{t}^{\mathrm{dec}} &=h_{i_{t}}^{\mathrm{enc}}+\mathrm{PE}_{t} \in \mathbb{R}^{d} \\ h_{t=0}^{\mathrm{dec}} &=h_{\mathrm{start}}^{\mathrm{dec}}=z+\mathrm{PE}_{t=0} \in \mathbb{R}^{d} \end{aligned}$

where $\mathrm{PE}_{t} \in \mathbb{R}^{d}$ is the traditional positional encoding in [37] to order the nodes in the tour:

$\mathrm{PE}_{t, i}=\left\{\begin{array}{l} \sin \left(2 \pi f_{i} t\right) \text { if } i \text { is even, } \\ \cos \left(2 \pi f_{i} t\right) \text { if } i \text { is odd, } \end{array} \quad \text { with } f_{i}=\frac{10,000^{\frac{d}{[2 i\rfloor}}}{2 \pi}\right.$

Decoder – Part 2.

This step prepares the query using self-attention over the partial tour.
为啥是部分?

The self-attention layer is standard and uses multi-head attention, residual connection, and layer normalization.

The memory/speed complexity is $O (t)$ at the decoding step $t$ . The equations for this step are (when again considering a single head for an easier description):

$\begin{aligned} \hat{h}_{t}^{\ell+1} &=\operatorname{softmax}\left(\frac{q^{\ell} K^{\ell^{T}}}{\sqrt{d}}\right) V^{\ell} \in \mathbb{R}^{d}, \ell=0, \ldots, L^{\mathrm{dec}}-1 \\ q^{\ell} &=\hat{h}_{t}^{\ell} \hat{W}_{q}^{\ell} \in \mathbb{R}^{d}, \hat{W}_{q}^{\ell} \in \mathbb{R}^{d \times d} \\ K^{\ell} &=\hat{H}_{1, t}^{\ell} \hat{W}_{K}^{\ell} \in \mathbb{R}^{t \times d}, \hat{W}_{K}^{\ell} \in \mathbb{R}^{d \times d} \\ V^{\ell} &=\hat{H}_{1, t}^{\ell} \hat{W}_{V}^{\ell} \in \mathbb{R}^{t \times d}, \hat{W}_{V}^{\ell} \in \mathbb{R}^{d \times d}, \\ \hat{H}_{1, t}^{\ell} &=\left[\hat{h}_{1}^{\ell}, . ., \hat{h}_{t}^{\ell}\right], \hat{h}_{t}^{\ell}=\left\{\begin{array}{c} h_{t}^{\mathrm{dec}} \text { if } \ell=0 \\ h_{t}^{\mathrm{q}, \ell} \text { if } \ell>0 \end{array}\right. \end{aligned}$

Decoder – Part 3.

This stage queries the next possible city among the non-visited cities using a query-attention layer.
Multi-head attention, residual connection, and layer normalization are used.
The memory/speed complexity is $O (n)$ at each recursive step.

$\begin{aligned} h_{t}^{\mathrm{q}, \ell+1} &=\operatorname{softmax}\left(\frac{q^{\ell} K^{\ell^{T}}}{\sqrt{d}} \odot \mathcal{M}_{t}\right) V^{\ell} \in \mathbb{R}^{d}, \ell=0, \ldots, L^{\mathrm{dec}}-1 \\ q^{\ell} &=\hat{h}_{t}^{\ell+1} \tilde{W}_{q}^{\ell} \in \mathbb{R}^{d}, \tilde{W}_{q}^{\ell} \in \mathbb{R}^{d \times d} \\ K^{\ell} &=H^{\mathrm{enc}} \tilde{W}_{K}^{\ell} \in \mathbb{R}^{t \times d}, \tilde{W}_{K}^{\ell} \in \mathbb{R}^{d \times d} \\ V^{\ell} &=H^{\mathrm{enc}} \tilde{W}_{V}^{\ell} \in \mathbb{R}^{t \times d}, \tilde{W}_{V}^{\ell} \in \mathbb{R}^{d \times d} \end{aligned}$

with $\mathcal{M}_{t}$ is the mask if the visited cities and $\odot$ is the Hadamard product.（就是逐项乘积）

Decoder – Part 4.

This is the final step that performs a final query using a single-head attention to get a distribution over the non-visited cities.

Eventually, the next node it+1 is sampled from the distribution using Bernoulli during training and greedy (index with maximum probability) at inference time to evaluate the baseline.

The memory/speed complexity is $O (n)$ .

The final equation is

$\begin{aligned} p_{t}^{\mathrm{dec}} &=\operatorname{softmax}\left(C \cdot \tanh \left(\frac{q K^{T}}{\sqrt{d}} \odot \mathcal{M}_{t}\right)\right) \in \mathbb{R}^{n} \\ q &=h_{t}^{\mathrm{q}} \bar{W}_{q} \in \mathbb{R}^{d}, \bar{W}_{q} \in \mathbb{R}^{d \times d} \\ K &=H^{\text {enc }} \bar{W}_{K} \in \mathbb{R}^{n \times d}, \bar{W}_{K}^{\ell} \in \mathbb{R}^{d \times d} \end{aligned}$

where $C = 10$ .

以上还得结合代码来理解… 不太明白 Decoder 为啥这么花里胡哨的??

4. 方法架构对比 [Architecture Comparison]

Comparing Transformers for NLP (translation) vs. TSP (combinatorial optimization), the order of
the input sequence is irrelevant for TSP but the order of the output sequence is coded with PEs for
both TSP and NLP.
输入序列的顺序与 TSP 无关，但输出序列的顺序在 TSP 和 NLP 中都使用 PE 编码。
（不太懂）

TSP-Encoder benefits from Batch Normalization as we consider all cities during the encoding stage.

意思是有了BN，一次性看到了所有的城市??

TSP-Decoder works better with Layer Normalization since one vector is decoded at a time (auto-regressive decoding as in NLP).

The TSP Transformer is learned by Reinforcement Learning, hence no TSP solutions/approximations required.
TSP Transformer 是通过强化学习学习的，因此不需要 TSP 解决方案/近似值。

Both transformers for NLP and TSP have quadratic complexity $O(n^2L)$ .

Comparing with the closed neural network models of [25] and [11], we use the same transformer encoder (with BN) but our decoding architecture is different. We construct the query using all cities in the partial tour with a self-attention module.

[25] use the first and last cities with a global representation of all cities as the query for the next city.
[11] define the query with the last three cities in the partial tour.

Besides, our decoding process starts differently. We add a token city $\in \mathbb{R}$ .
This city does not exist and aims at starting the decoding at the best possible location by querying all cities with a self-attention module.

[25] starts the decoding with the mean representation of the encoding cities and a random token of the first and current cities.
[11] starts the decoding with a random token of the last three cities.

$\max _{\operatorname{seq}_{n}=\left\{i_{1}, \ldots, i_{n}\right\}} P^{\mathrm{TSP}}\left(\operatorname{seq}_{n} \mid X\right)=P^{\mathrm{TSP}}\left(i_{1}, \ldots, i_{n} \mid X\right)$

视频格式批量转换工具-FFGO 屠屠在干嘛 FFGO 格式工厂视频
由于毕设需要webm来展示动画而搜索引擎所有的webm转换工具都是在线且限制转换大小的就算大小刚好也容易报错甚至转换不出来绞尽脑汁干脆自己写了一个视频格式转换工具基本上视频格式都能够支持，如果后续有什么无法支持的格式我会后续继续更新所以暂且命名他为FF-GO吧也挺好听的，下面是软件的截图和下载链接下载直链：https://tuwp.cc:999/d/LOVETU/%E5%AE%9E%E7%94%A
医疗器械企业出海，如何应对序列号跟踪、批次管理难题？
全球医疗器械市场规模持续扩大，越来越多的中国医疗器械企业选择走出国门，参与全球竞争。在出海过程中，欧盟、美国等国家均要求企业建立完整的追溯体系，这给国内医疗企业带来了新的挑战。这该如何破局？ZohoBooks以智能库存管理、全球化合规支持和多系统集成能力，可以成为医疗器械企业出海的“数字化护航者”。一、医疗器械出海的三大管理痛点1、序列号跟踪：从生产到终端的全链条追溯难题医疗器械的序列号需贯穿生产
币圈不设防第三期回顾：中东资本入场，加密市场格局将如何重塑比特币web3区块链
3月14日晚，由TechubNews主办的《币圈不设防》第三期Space活动圆满落幕。本期以“中东资本入股币安背后的逻辑与行业影响”为核心议题，特邀LYSLab投研分析师Veigar、RITDLabs联合创始人Benny、TechubNews运营负责人Sam等嘉宾，共同探讨中东资本的入局对加密行业的深远意义。以下是本期活动的深度总结。一、中东资本为何选择币安？战略布局浮出水面近期，阿布扎比主权基金
工控一体机如何设置成上电自启模式 Ukck_ 单片机嵌入式硬件硬件工程电脑经验分享
一、BIOS设置1、开机时点击键盘Del进入BIOS2、找到电源设置3、在电源管理选项中，找到“ACPowerRecovery”或“RestoreonAC/PowerLoss”等类似选项，将其设置为“Enabled”或“On”4、设置完成后，按F10键或选择“SaveandExit”选项保存设置并退出二、操作系统配置Windows系统：禁用休眠/快速启动：进入控制面板>电源选项>选择电源按钮功能，
使用PHP对接StockTV全球金融市场数据API实战指南 php股票接口
关键词：PHPAPI开发、金融市场数据、WebSocket实时数据、cURL实战一、项目概述StockTV作为全球领先的金融数据平台，提供覆盖股票、外汇、期货和加密货币的实时行情服务。本文将手把手教你使用PHP实现以下核心功能：✅RESTAPI调用：获取历史行情数据✅WebSocket订阅：实时价格推送✅生产级特性：异常重试、速率控制、数据缓存✅高性能优化：连接池、异步处理二、环境准备1.运行环境
STM32最小系统板详解 QoyOle stm32 单片机嵌入式硬件
STM32最小系统板是一款基于STMicroelectronics的STM32微控制器的开发板，它提供了一个简化的硬件平台，用于快速原型设计和开发嵌入式系统。本文将详细介绍STM32最小系统板的特点、组成部分以及如何使用它进行开发。一、特点简化的硬件设计：STM32最小系统板采用了最小化的硬件设计，仅包含了必要的元件，如STM32微控制器、晶振、电源管理电路等。这使得开发者可以专注于软件开发，而无
CCF编程能力等级认证GESP—C++1级—20250322 青岛少儿编程-王老师 #C++-1级 c++java 算法
CCF编程能力等级认证GESP—C++1级—20250322单选题（每题2分，共30分）判断题（每题2分，共20分）编程题(每题25分，共50分)图书馆里的老鼠四舍五入单选题（每题2分，共30分）1、2025年春节有两件轰动全球的事件，一个是DeepSeek横空出世，另一个是贺岁片《哪吒2》票房惊人，入了全球票房榜。下面关于DeepSeek与《哪吒2》的描述成立的是()。A.《哪吒2》是一款新型操
Spring 事务管理全解析：原理、源码与实战工一木子 SpringFramework 笔记 spring 数据库 java
Spring事务管理全解析：原理、源码与实战事务（Transaction）是保证数据一致性的重要机制，Spring通过声明式事务和编程式事务提供强大的事务管理能力。本篇文章将深入剖析Spring事务的底层原理、传播机制、源码解析，并通过代码实战讲解如何正确使用Spring事务。1.什么是事务？（What）事务是数据库操作的最小执行单元，必须具备ACID（原子性、一致性、隔离性、持久性）特性。Spr
AI算力要变天了？一文搞懂ASIC和GPU asicgpuai芯片
近期，全球股市的动荡中，ASIC和GPU这两个科技股概念突然变得火热，引起了市场的高度关注。博通作为ASIC的代表，股价一路猛涨，而英伟达作为GPU的代表，股价却一路下跌。这是否意味着AI算力市场即将变天？随着人工智能技术的飞速发展，AI算力的重要性日益凸显。从早期的简单模型训练到如今的大规模语言模型如ChatGPT等的出现，对算力的需求呈爆发式增长。01那什么是ASIC和GPU？ASIC：定制化
云智慧发布对象关系型数据库CloudPanguDB，打破传统技术壁垒
近日，云智慧推出关系型数据库CloudPanguDB（中文名称：盘古数据库），旨在通过高兼容性能和创新技术架构，降低企业项目整体运营成本。无论是处理海量复杂数据，还是构建清晰有序的数据结构关系，CloudPanguDB都具有强大的应用价值。随着各产业数字化转型的迅速发展，企业对国产化数据库需求与日俱增。CloudPanguDB以云智慧自身产品技术为基础，统一优化技术架构，功能覆盖关系型数据库、全文
信息学奥赛一本通1353 表达式括号匹配(stack) （栈） Star77777 信息学奥赛一本通 #数据结构栈信息学奥赛一本通括号匹配
1353：表达式括号匹配(stack)时间限制:1000ms内存限制:65536KB提交数:14209通过数:7610【题目描述】设一个表达式有英文字母（小写）、运算符（+，—，∗，/+，—，∗，/）和左右小（圆）括号构成，以“@@”作为表达式的结束符。请编写一个程序检查表达式中的左右圆括号是否匹配，若匹配，则返回“YESYES”；否则返回“NONO”。表达式长度小于255255，左圆括号少于20
金银岛（信息学奥赛一本通-1225） Doopny@ 信息学奥赛一本通算法
【题目描述】某天KID利用飞行器飞到了一个金银岛上，上面有许多珍贵的金属，KID虽然更喜欢各种宝石的艺术品，可是也不拒绝这样珍贵的金属。但是他只带着一个口袋，口袋至多只能装重量为w的物品。岛上金属有s个种类,每种金属重量不同，分别为n1,n2,...,ns，同时每个种类的金属总的价值也不同，分别为v1,v2,...,vs。KID想一次带走价值尽可能多的金属，问他最多能带走价值多少的金属。注意到金属
双指针与二分算法打不了嗝蓝桥杯 c++算法
一.双指针1.基本介绍双指针算法是一种暴力枚举的优化算法，他也被叫做尺取法或者滑动窗口。当我们发现算法需要两次for循环时并且两个指针可以不回退，我们可以利用双指针来优化算法复杂度。2.例题详解题目描述企业家Emily有一个很酷的主意：把雪花包起来卖。她发明了一台机器，这台机器可以捕捉飘落的雪花，并把它们一片一片打包进一个包裹里。一旦这个包裹满了，它就会被封上送去发售。Emily的公司的口号是“把
ResNet改进(11)：添加 Squeeze-and-Excitation模块和替换Mish激活函数点我头像干啥 ResNet 改进【有效涨点！】深度学习 pytorch python
本专栏代码均经过测试，可以直接替换项目中的模型，一键运行！采用最新的即插即用模块，有效涨点！！1.SE模块和Mish激活函数SE模块是一种通道注意力机制，旨在增强网络对重要特征通道的关注，从而提升模型的表达能力。它通过显式地建模通道之间的依赖关系，动态调整每个通道的特征响应。SE模块的核心思想：Squeeze：通过全局平均池化（GlobalAveragePooling,GAP）将每个通道的空间维度
螺旋折线 | 第九届蓝桥杯省赛C++B组 @Mr.stone 蓝桥杯 c++算法
如下图所示的螺旋折线经过平面上所有整点恰好一次。对于整点(X,Y)，我们定义它到原点的距离dis(X,Y)是从原点到(X,Y)的螺旋折线段的长度。例如dis(0,1)=3,dis(−2,−1)=9给出整点坐标(X,Y)，你能计算出dis(X,Y)吗？输入格式包含两个整数X,Y。输出格式输出一个整数，表示dis(X,Y)。数据范围−109≤X,Y≤109输入样例：01输出样例：3题解：数学计算题目，
优选算法训练篇07--力扣LCR179.查找总价格为目标值的两个商品大胆飞猪算法训练篇算法 leetcode
目录1.题目链接：LCR179.查找总价格为目标值的两个商品2.题目描述：3.解法一(暴力解法，会超时)：4.解法二(双指针-对撞指针):1.题目链接：LCR179.查找总价格为目标值的两个商品2.题目描述：购物车内的商品价格按照升序记录于数组price。请在购物车中找到两个商品的价格总和刚好是target。若存在多种情况，返回任一结果即可。示例1：输入：price=[3,9,12,15],tar
Python入门(函数) 高育良00003 python 开发语言
一.基础认识一种映射关系1.1什么是函数呢？概念函数是可以重复执行的语句块，可以重复调用作用用于封装语句块，提高代码的重用性1.2函数的定义语法：deffunction():#def为关键字，function为函数名#语句想要执行的操作returnre#re为返回值二.函数的调用函数名后+小括号()表示函数的执行2.1基本用法语法：函数名(实际调用的参数)2.2调用传参2.2.1位置传参最为常见，
hsdb查看Tomcat注解的实例 ok060 tomcat java hsdb
‌一、HSDB查看Tomcat注解的实例步骤‌‌1.附加Tomcat进程‌‌获取Tomcat进程ID‌：使用jps-l命令查找Tomcat的PID（如12345），确保Tomcat处于运行状态‌38。‌启动HSDB‌：jhsdbhsdb--pid12345‌2.定位目标类‌‌打开ClassBrowser‌：在HSDB界面点击‌Tools→ClassBrowser‌，输入目标类名（如com.exam
算力租赁：人工智能时代的“水电煤”革命——以NVIDIA 4090为例解读下一代算力解决方案算法工程gpu
引言：当AI算力需求遇上“算力饥渴症”2023年，ChatGPT仅用2个月突破1亿用户，StableDiffusion让普通人秒变艺术家，但背后是单次训练消耗超10万GB内存、千亿级参数的恐怖算力需求。当全球AI企业陷入“算力饥渴症”时，一种名为算力租赁的创新模式正以每年37%的增速（MarketsandMarkets数据）重塑行业格局。本文将深度解析这一革命性服务，并聚焦搭载NVIDIARTX4
AI Agent赛道：昙花一现还是生态革命？6大咖拆解泡沫与未来人工智能比特币区块链web3
作者：CRYPTO币圈不设防币圈不设防第四期Space总结：AIAgent赛道还能火多久？在Web3华语主持人茄哥的主持下，第四期《币圈不设防》围绕“AIAgent赛道还能火多久？”展开深度探讨。本期嘉宾阵容强大，包括Uweb校长于佳宁、TradingBaseAI创始人Mr.Z、BuilderLogEarn、区块链爱好者flyawei、投研博主清风#BTC，以及社区领袖小智。以下是讨论的核心观点总
AI 真的懂你问的问题吗？ llmclaudeopenai
Hey,我是沉浸式趣谈本文首发于【沉浸式趣谈】，我的个人博客https://yaolifeng.com也同步更新。转载请在文章开头注明出处和版权信息。如果本文对您有所帮助，请点赞、评论、转发，支持一下，谢谢！AI真的懂你问的问题吗？AI—它可能是个「语言魔术师」，但绝对不是「人类大脑」你心血来潮问AI：你：「为什么古埃及人建造金字塔？」AI（认真回答）：「古埃及人建造金字塔主要是作为法老的陵墓，同
C++20中哪些特性对内存管理有帮助？ c++
C++20引入了多项改进和新特性，这些特性在内存管理方面提供了更强大的支持和更高的灵活性。以下是C++20中对内存管理有帮助的主要特性：一、对齐分配器（AlignedAllocator）C++20引入了对齐分配器，允许开发者在分配内存时指定对齐参数，从而确保分配的内存块满足特定的对齐要求。这在处理需要特定对齐的硬件或数据结构时非常有用。cpp复制std::aligned_alloc(64,1024
SM国密算法深度解析与技术实践安全
SM国密算法深度解析与技术实践一、算法体系概述SM系列密码算法是由中国国家密码管理局发布的商用密码标准体系，涵盖非对称加密、对称加密、杂凑算法、标识密码等多个领域。其核心组件包括：SM2：基于椭圆曲线的非对称加密算法（GB/T32918）SM3：密码杂凑算法（GB/T32905）SM4：分组对称加密算法（GB/T32907）与国际算法对比类型国密算法国际标准密钥长度安全强度非对称加密SM2RSA-
梯度下降法理论理解伶星37 机器学习人工智能
梯度下降法：看似原始却透露着机器学习的本质前提：在研究梯度下降方法之前，你要理解矩阵运算（解析解）的方法矩阵运算目前的缺点只能进行对线性函数经行分析，无法对复杂的函数经行分析什么是梯度，以及梯度向量梯度下降的形象例子以及基本思想有三个兄弟被困在山上，得要死，他们目标是看谁尽快找到山谷中的水源老大比较后选择最陡的方向随便探索一下，就朝较低处走去探测几下就走陡峭的方向梯度下降算法的核心思想就是沿着负梯
文件的基本的基本属性伶星37 linux 服务器
为什么要有基本属性Linux系统是一种典型的多用户系统，不同的用户处于不同的地位，拥有不同的权限。为了保护系统的安全性，Linux系统对不同的用户访问同一文件（包括目录文件）的权限做了不同的规定。例子你可以把Linux比作成一个学校，里面的人学生老师校长里面的资料课本学校档案老师个人备案资料学生只能看课本，其他的都不能看，而老师，可以看老师备案资料和课本。校长上面都可以看。在Linux中我们通常使
MybatisPlus 伶星37 spring boot 后端
代码部分添加依赖该代码添加位置：就是在springboot配置文件里面的pom.xml里面要添加的东西对新手说的话，如果这一步没有看懂的话，可以去看一下基础，否则这样的话不能做到理解学习//mybatis-plus的一个插件com.baomidoumybatis-plus-boot-starter3.4.2//这个是关于mysql的一种依赖mysqlmysql-connector-java5.1.
高等数学，对梯度的理解伶星37 机器学习
梯度（Gradient）是多变量微分中非常重要的概念。它描述了一个多元函数在某一点的最大上升方向及其变化率，是向量微积分中的基本工具。定义对于一个多变量标量函数f(x,y,z,… )f(x,y,z,\dots)f(x,y,z,…)梯度是一个向量，记为∇f\nablaf∇f或gradfgradfgradf梯度向量的分量是函数fff对各自变量的偏导数，即：∇f=(δfδx,δfδy,δfδz,… )\
操作系统练习题齐飞 linux
文章目录一、单选题二、多选题三、填空题四、简答题一、单选题1、在计算机系统中配置操作系统的主要目的是（）。A、增强计算机系统的功能B、提高系统资源的利用率C、提高系统的运行速度D、合理组织系统的工作流程，以提高系统吞吐量正确答案：B2、操作系统的主要功能是管理计算机系统中的（），其中包括处理机、存储器，以及文件和设备。这里的存储器管理主要是对进程进行管理。A、程序和数据B、资源C、软件D、硬件正确
Not enough information to list image symbols. Not enough information to list load addresses in ... Water_Sounds 学习笔记 keil mdk
除了绝大部分网上给的解决方法外：Notenoughinformationtolistimagesymbols.Notenoughinformationtolistloadaddressesin…我在向正点原子例程“输入捕获”中添加lcd驱动程序时，发现按照上述链接的做法填了路径什么的，还是报错，最后发现是这个.c文件文件没有添加进来导致这两句话一直是无定义，填进来就好了。
服务器负载均衡是什么意思？ lddfff_3a 负载均衡
什么是负载均衡？负载均衡是由多台服务器以对称的方式组成一个服务器集合，每台服务器都具有等价的地位，都可以单独对外供应效力而无须其他服务器的辅助。经过某种负载分管技术，将外部发送来的央求均匀分配到对称结构中的某一台服务器上，而接收到央求的服务器独登时回应客户的央求。均衡负载可以平均分配客户央求到服务器列阵，籍此供应快速获取重要数据，解决很多并发访问效力问题。这种群集技术可以用最少的出资取得接近于大型
怎么样才能成为专业的程序员？ cocos2d-x小菜编程 PHP
如何要想成为一名专业的程序员？仅仅会写代码是不够的。从团队合作去解决问题到版本控制，你还得具备其他关键技能的工具包。当我们询问相关的专业开发人员，那些必备的关键技能都是什么的时候，下面是我们了解到的情况。关于如何学习代码，各种声音很多，然后很多人就被误导为成为专业开发人员懂得一门编程语言就够了？！呵呵，就像其他工作一样，光会一个技能那是远远不够的。如果你想要成为
java web开发高并发处理 BreakingBad java Web 并发开发处理高
java处理高并发高负载类网站中数据库的设计方法（java教程,java处理大量数据，java高负载数据）一：高并发高负载类网站关注点之数据库没错,首先是数据库,这是大多数应用所面临的首个SPOF。尤其是Web2.0的应用，数据库的响应是首先要解决的。一般来说MySQL是最常用的，可能最初是一个mysql主机，当数据增加到100万以上，那么，MySQL的效能急剧下降。常用的优化措施是M-S（
mysql批量更新 ekian mysql
mysql更新优化：一版的更新的话都是采用update set的方式，但是如果需要批量更新的话，只能for循环的执行更新。或者采用executeBatch的方式，执行更新。无论哪种方式，性能都不见得多好。三千多条的更新，需要3分多钟。查询了批量更新的优化，有说replace into的方式，即： replace into tableName(id,status) values
微软BI（3） 18289753290 微软BI SSIS
1) Q：该列违反了完整性约束错误；已获得 OLE DB 记录。源:“Microsoft SQL Server Native Client 11.0” Hresult: 0x80004005 说明:“不能将值 NULL 插入列 'FZCHID'，表 'JRB_EnterpriseCredit.dbo.QYFZCH'；列不允许有 Null 值。INSERT 失败。”。 A：一般这类问题的存在是
Java中的List g21121 java
List是一个有序的 collection（也称为序列）。此接口的用户可以对列表中每个元素的插入位置进行精确地控制。用户可以根据元素的整数索引（在列表中的位置）访问元素，并搜索列表中的元素。与 set 不同，列表通常允许重复
读书笔记永夜-极光读书笔记
1. K是一家加工厂,需要采购原材料,有A,B,C,D 4家供应商,其中A给出的价格最低,性价比最高,那么假如你是这家企业的采购经理,你会如何决策? 传统决策: A:100%订单 B,C,D:0% &nbs
centos 安装 Codeblocks 随便小屋 codeblocks
1.安装gcc,需要c和c++两部分,默认安装下,CentOS不安装编译器的,在终端输入以下命令即可yum install gccyum install gcc-c++ 2.安装gtk2-devel,因为默认已经安装了正式产品需要的支持库,但是没有安装开发所需要的文档.yum install gtk2* 3. 安装wxGTK yum search w
23种设计模式的形象比喻 aijuans 设计模式
1、ABSTRACT FACTORY—追MM少不了请吃饭了，麦当劳的鸡翅和肯德基的鸡翅都是MM爱吃的东西，虽然口味有所不同，但不管你带MM去麦当劳或肯德基，只管向服务员说“来四个鸡翅”就行了。麦当劳和肯德基就是生产鸡翅的Factory 　　工厂模式：客户类和工厂类分开。消费者任何时候需要某种产品，只需向工厂请求即可。消费者无须修改就可以接纳新产品。缺点是当产品修改时，工厂类也要做相应的修改。如：
开发管理 CheckLists aoyouzi 开发管理 CheckLists
开发管理 CheckLists(23) -使项目组度过完整的生命周期开发管理 CheckLists(22) -组织项目资源开发管理 CheckLists(21) -控制项目的范围开发管理 CheckLists(20) -项目利益相关者责任开发管理 CheckLists(19) -选择合适的团队成员开发管理 CheckLists(18) -敏捷开发 Scrum Master 工作开发管理 C
js实现切换百合不是茶 JavaScript 栏目切换
js主要功能之一就是实现页面的特效,窗体的切换可以减少页面的大小,被门户网站大量应用思路: 1,先将要显示的设置为display:bisible 否则设为none 2,设置栏目的id ,js获取栏目的id,如果id为Null就设置为显示 3,判断js获取的id名字;再设置是否显示代码实现: html代码: <di
周鸿祎在360新员工入职培训上的讲话 bijian1013 感悟项目管理人生职场
这篇文章也是最近偶尔看到的，考虑到原博客发布者可能将其删除等原因，也更方便个人查找，特将原文拷贝再发布的。“学东西是为自己的，不要整天以混的姿态来跟公司博弈，就算是混，我觉得你要是能在混的时间里，收获一些别的有利于人生发展的东西，也是不错的，看你怎么把握了”，看了之后，对这句话记忆犹新。 &
前端Web开发的页面效果 Bill_chen html Web Microsoft
1.IE6下png图片的透明显示： <img src="图片地址" border="0" style="Filter.Alpha(Opacity)=数值(100),style=数值(3)"/> 或在<head></head>间加一段JS代码让透明png图片正常显示。 2.<li>标
【JVM五】老年代垃圾回收：并发标记清理GC(CMS GC) bit1129 垃圾回收
CMS概述并发标记清理垃圾回收(Concurrent Mark and Sweep GC）算法的主要目标是在GC过程中，减少暂停用户线程的次数以及在不得不暂停用户线程的请夸功能，尽可能短的暂停用户线程的时间。这对于交互式应用，比如web应用来说，是非常重要的。 CMS垃圾回收针对新生代和老年代采用不同的策略。相比同吞吐量垃圾回收，它要复杂的多。吞吐量垃圾回收在执
Struts2技术总结白糖_ struts2
必备jar文件早在struts2.0.*的时候，struts2的必备jar包需要如下几个： commons-logging-*.jar Apache旗下commons项目的log日志包 freemarker-*.jar
Jquery easyui layout应用注意事项 bozch jquery 浏览器 easyui layout
在jquery easyui中提供了easyui-layout布局，他的布局比较局限，类似java中GUI的border布局。下面对其使用注意事项作简要介绍：如果在现有的工程中前台界面均应用了jquery easyui，那么在布局的时候最好应用jquery eaysui的layout布局，否则在表单页面（编辑、查看、添加等等）在不同的浏览器会出
java-拷贝特殊链表：有一个特殊的链表，其中每个节点不但有指向下一个节点的指针pNext，还有一个指向链表中任意节点的指针pRand，如何拷贝这个特殊链表？ bylijinnan java
public class CopySpecialLinkedList { /** * 题目：有一个特殊的链表，其中每个节点不但有指向下一个节点的指针pNext，还有一个指向链表中任意节点的指针pRand，如何拷贝这个特殊链表？拷贝pNext指针非常容易，所以题目的难点是如何拷贝pRand指针。假设原来链表为A1 -> A2 ->... -> An，新拷贝
color Chen.H JavaScript html css
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd"> <HTML> <HEAD>&nbs
[信息与战争]移动通讯与网络 comsci 网络
两个坚持:手机的电池必须可以取下来光纤不能够入户,只能够到楼宇建议大家找这本书看看:<&
oracle flashback query(闪回查询) daizj oracle flashback query flashback table
在Oracle 10g中，Flash back家族分为以下成员： Flashback Database Flashback Drop Flashback Table Flashback Query(分Flashback Query,Flashback Version Query，Flashback Transaction Query) 下面介绍一下Flashback Drop 和Flas
zeus持久层DAO单元测试 deng520159 单元测试
zeus代码测试正紧张进行中,但由于工作比较忙,但速度比较慢.现在已经完成读写分离单元测试了,现在把几种情况单元测试的例子发出来,希望有人能进出意见,让它走下去. 本文是zeus的dao单元测试: 1.单元测试直接上代码 package com.dengliang.zeus.webdemo.test; import org.junit.Test; import o
C语言学习三printf函数和scanf函数学习 dcj3sjt126com c printf scanf language
printf函数 /* 2013年3月10日20:42:32 地点：北京潘家园功能：目的：测试%x %X %#x %#X的用法 */ # include <stdio.h> int main(void) { printf("哈哈！\n"); // \n表示换行 int i = 10; printf
那你为什么小时候不好好读书? dcj3sjt126com life
dady, 我今天捡到了十块钱, 不过我还给那个人了 good girl! 那个人有没有和你讲thank you啊没有啦....他拉我的耳朵我才把钱还给他的, 他哪里会和我讲thank you 爸爸, 如果地上有一张5块一张10块你拿哪一张呢.... 当然是拿十块的咯... 爸爸你很笨的, 你不会两张都拿爸爸为什么上个月那个人来跟你讨钱, 你告诉他没
iptables开放端口 Fanyucai linux iptables 端口
1，找到配置文件 vi /etc/sysconfig/iptables 2，添加端口开放，增加一行，开放18081端口 -A INPUT -m state --state NEW -m tcp -p tcp --dport 18081 -j ACCEPT 3，保存 ESC :wq! 4，重启服务 service iptables
Ehcache（05）——缓存的查询 234390216 排序 ehcache 统计 query
缓存的查询目录 1. 使Cache可查询 1.1 基于Xml配置 1.2 基于代码的配置 2 指定可搜索的属性 2.1 可查询属性类型 2.2 &
通过hashset找到数组中重复的元素 jackyrong hashset
如何在hashset中快速找到重复的元素呢?方法很多，下面是其中一个办法： int[] array = {1,1,2,3,4,5,6,7,8,8}; Set<Integer> set = new HashSet<Integer>(); for(int i = 0
使用ajax和window.history.pushState无刷新改变页面内容和地址栏URL lanrikey history
后退时关闭当前页面 <script type="text/javascript"> jQuery(document).ready(function ($) { if (window.history && window.history.pushState) {
应用程序的通信成本 netkiller.github.com 虚拟机应用服务器陈景峰 netkiller neo
应用程序的通信成本什么是通信一个程序中两个以上功能相互传递信号或数据叫做通信。什么是成本这是是指时间成本与空间成本。时间就是传递数据所花费的时间。空间是指传递过程耗费容量大小。都有哪些通信方式全局变量线程间通信共享内存共享文件管道 Socket 硬件（串口，USB）等等全局变量全局变量是成本最低通信方法，通过设置
一维数组与二维数组的声明与定义恋洁e生二维数组一维数组定义声明初始化
/** * */ package test20111005; /** * @author FlyingFire * @date:2011-11-18 上午04:33:36 * @author ：代码整理 * @introduce :一维数组与二维数组的初始化 *summary： */ public c
Spring Mybatis独立事务配置 toknowme mybatis
在项目中有很多地方会使用到独立事务，下面以获取主键为例（1）修改配置文件spring-mybatis.xml  <tx:annotation-driven transaction-manager="transactionManager" /> &n
更新Anadroid SDK Tooks之后，Eclipse提示No update were found xp9802 eclipse
使用Android SDK Manager 更新了Anadroid SDK Tooks 之后，打开eclipse提示 This Android SDK requires Android Developer Toolkit version 23.0.0 or above, 点击Check for Updates 检测一会后提示 No update were found