XingHe_XingHe_

2021_WSDM_Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation

[论文阅读笔记]2021_WSDM_Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation

论文下载地址： https://doi.org/10.1145/3437963.3441738
发表期刊：WSDM
Publish time: 2021
作者及单位:

Bowen Hao Renmin University of China [email protected]
Jing Zhang∗ Renmin University of China [email protected]
Hongzhi Yin The University of Queensland [email protected]
Cuiping Li Renmin University of China [email protected]
Hong Chen Renmin University of China [email protected]

数据集：

MovieLens-1M(Ml-1M) https://grouplens.org/datasets/movielens/ (作者在论文中公开的)
MOOCs http://moocdata.cn/data/course-recommendation (作者在论文中公开的)
Last.fm http://www.last.fm (作者在论文中公开的)

代码：

https://github.com/jerryhao66/Pretrain-Recsys (作者在论文中公开的)

其他人写的文章

简要概括创新点：这篇论文，理论的针对点。冷启动用户的embedding是inaccurate。训练时用的有丰富交互的数据,ground-truth和cold-start user/item是author随机采样模拟得到的；训练好了，再用到冷启动的数据上

(1)However, the basic pre-training GNN model doesn’t specially address the cold-start neighbors. During the original graph convolution process, the inaccurate embeddings of the cold-start neighbors and the embeddings of other neighbors are equally treated and aggregated to represent the target user/item. (然而，基本的预训练GNN模型并没有专门针对冷启动邻居。在原始的图卷积过程中，冷启动邻域的不准确嵌入和其他邻域的嵌入被平等地处理和聚合，以表示目标用户/项。) 这篇论文，理论的针对点。冷启动用户的embedding是inaccurate

(2)This paper proposes to pretrain a GNN model before applying it for recommendation. (本文建议在应用GNN模型进行推荐之前对其进行预训练。)

(3)To further reduce the impact from the cold-start neighbors,

we incorporate a self-attention-based meta aggregator to enhance the aggregation ability of each graph convolution step, (为了进一步减少冷启动邻居的影响，我们加入了一个基于自注意的元聚合器来增强每个图卷积步骤的聚合能力)

and an adaptive neighbor sampler to select the effective neighbors according to the feedbacks from the pre-training GNN model.(以及一个自适应邻居采样器来根据预训练GNN模型的反馈选择有效邻居。)

(4)Since we also need ground truth embeddings of the cold-start users/items to learn $f$ , we simulate those users/items from the target users/items with abundant interactions. (由于我们还需要冷启动用户/项目的真实值嵌入来学习 $f$ ，因此我们模拟了目标用户/项目中具有丰富交互的用户/项目。)

ABSTRACT

(1) Cold-start problem is a fundamental challenge for recommendation tasks. Despite the recent advances on Graph Neural Networks (GNNs) incorporate the high-order collaborative signal to alleviate the problem, the embeddings of the cold-start users and items aren’t explicitly optimized, and the cold-start neighbors are not dealt with during the graph convolution in GNNs. (冷启动问题是推荐任务面临的一个基本挑战。,尽管最近在图形神经网络（GNN）方面取得了一些进展，但在GNN中，冷启动用户和项目的嵌入没有得到明确的优化，并且在图形卷积过程中也没有处理冷启动邻居。)
(2)This paper proposes to pretrain a GNN model before applying it for recommendation. (本文建议在应用GNN模型进行推荐之前对其进行预训练。)
Unlike the goal of recommendation, the pre-training GNN simulates the cold-start scenarios from the users/items with sufficient interactions and takes the embedding reconstruction as the pretext task, such that it can directly improve the embedding quality and can be easily adapted to the new cold-start users/items. (与推荐的目标不同，预训练GNN模拟用户/项目的冷启动场景，具有充分的交互，并以嵌入重构为借口任务，这样可以直接提高嵌入质量，并且可以很容易地适应新的冷启动用户/项目。)
(3)To further reduce the impact from the cold-start neighbors,
- we incorporate a self-attention-based meta aggregator to enhance the aggregation ability of each graph convolution step, (为了进一步减少冷启动邻居的影响，我们加入了一个基于自注意的元聚合器来增强每个图卷积步骤的聚合能力)
- and an adaptive neighbor sampler to select the effective neighbors according to the feedbacks from the pre-training GNN model.(以及一个自适应邻居采样器来根据预训练GNN模型的反馈选择有效邻居。)
(4)Experiments on three public recommendation datasets show the superiority of our pre-training GNN model against the original GNN models on user/item embedding inference and the recommendation task.

CCS CONCEPTS

• Information systems → Social recommendation;

KEYWORDS

Pre-training, graph neural networks, cold-start, recommendation

1 INTRODUCTION

(1)Recommendation systems [14, 21] have been extensively deployed to alleviate information overload in various web services, such as social media, E-commerce websites and news portals. To predict the likelihood of a user adopting an item, collaborative filtering (CF) is the most widely adopted principle. The most common paradigm for CF, such as matrix factorization [21] and neural collaborative filtering [14], is to learn embeddings, i.e. the preferences for users and items and then perform the prediction based on the embeddings [13]. However, these models fail to learn high-quality embeddings for the cold-start users/items with sparse interactions. (推荐系统[14,21]已被广泛部署，以缓解各种网络服务（如社交媒体、电子商务网站和新闻门户）中的信息过载。为了预测用户采用某个项目的可能性，协同过滤（CF）是最广泛采用的原则。CF最常见的范例，如矩阵分解[21]和神经协同过滤[14]，是学习嵌入，即用户和项目的偏好，然后根据嵌入进行预测[13]。然而，这些模型无法为交互稀少的冷启动用户/项目学习高质量的嵌入。)
(2)To address the cold-start problem, traditional recommender systems incorporate the side information such as content features of users and items [40,44] or external knowledge graphs (KGs) [35,37] to compensate the low-quality embeddings caused by sparse interactions. However, the content features are not always available, and it is not easy to link the items to the entities in KGs due to the incompleteness and ambiguation of the entities. (为了解决冷启动问题，传统的推荐系统结合了用户和项目的内容特征[40,44]或外部知识图（KG）[35,37]等辅助信息，以补偿稀疏交互导致的低质量嵌入。然而，内容功能并不总是可用的，而且由于实体的不完整性和模糊性，将项目链接到KGs中的实体并不容易。)
(3) On another line, inspired by the recent development of graph neural networks (GNNs) [2, 11, 19], NGCF [38] and LightGCN [13] encode the high-order collaborative signal in the user-item interaction graph by a GNN model, based on which they perform the recommendation task. As shown in Fig. 1, a typical recommendation-oriented GNN conducts graph convolution on the local neighborhood’s embeddings of $u_1$ and $i_1$ . Through iteratively repeating the convolution by multiple steps, the embeddings of the high-order neighbors are propagated to $u 1$ and $i 1$ . Based on the aggregated embeddings of $u_1$ and $i_1$ , the likelihood of $u_1$ adopting $i_1$ is estimated, and cross-entropy loss [3] or BPR loss [13, 38] is usually adopted to compare the likelihood and the true observations. (另一方面，受图形神经网络（GNN）[2,11,19]的最新发展启发，NGCF[38]和LightGCN[13]通过GNN模型对用户项交互图中的高阶协作信号进行编码，并在此基础上执行推荐任务。如图1所示，典型的面向推荐的GNN对 $u_1$ 和 $i_1$ 的局部邻域嵌入进行图卷积通过多次迭代重复卷积，高阶邻域的嵌入被传播到 $u_1$ 和 $i_1$ 。基于 $u_1$ 和 $i_1$ 的聚合嵌入, $u_1$ 采纳 $i_1$ 的可能性被评估，通常采用 交叉熵损失[3]或 BPR损失[13,38]来比较可能性和真实观测值。)
(4)Despite the success of capturing the high-order collaborative signal in GNNs [13, 38], the cold-start problem is not thoroughly solved by them. (尽管成功地捕获了GNNs中的高阶协同信号[13,38]，但冷启动问题并没有被它们彻底解决。
- First, the GNNs for recommendation address thecold-start user/item embeddings through optimizing the likelihood of a user adopting an item, which isn’t a direct improvement of the embedding quality; (首先，推荐的GNN通过优化用户采用项目的可能性来解决冷启动用户/项目嵌入问题，这不是嵌入质量的直接提高；)
- second, the GNN model does not specially deal with the cold-start neighbors among all the neighbors when performing the graph convolution. For example in Fig. 1, to represent $u_1$ , the 2-order neighbor $u_2$ is also a cold-start user who only interacts with $i_3$ and $i_4$ . The result of graph convolution on the inaccurate embedding of $u_2$ and the embedding of $u_3$ together will be propagated to $u_1$ and hurt its embedding. (其次，在进行图卷积时，GNN模型没有专门处理所有邻居中的冷启动邻居。例如，在图1中，为了表示 $u_1$ , 二阶近邻 $u_2$ 是只与 $i_3$ 和 $i_4$ 交互的冷启动用户. 关于 $u_2$ 的不精确嵌入的图卷积结果 以及 $u_3$ 的嵌入将一起传播到 $u_1$ 并且伤害了它的嵌入。)
- Existing GNNs ignore the cold-start characteristics of neighbors during the graph convolution process. Although some GNN models such as GrageSAGE [11] or FastGCN [4] filter neighbors before aggregating them, they usually follow a random or an importance sampling strategy, which also ignore the cold-start characteristics of the neighbors. This leads us to the following research problem: how can we learn more accurate embeddings for cold-start users or items by GNNs? (现有的GNN在图卷积过程中忽略了邻居的冷启动特性。尽管一些GNN模型，如GrageSAGE[11]或FastGCN[4]在聚合邻居之前会过滤它们，但它们通常遵循随机或重要抽样策略，这也会忽略邻居的冷启动特性。这就引出了以下研究问题：我们如何通过GNNs为冷启动用户或项目学习更准确的嵌入？)
(5) Present work. To tackle the above challenges, before performing the GNN model for recommendation,
- we propose to pre-train the GNN model to enhance the embeddings of the cold-start users or items. (我们建议对GNN模型进行预训练，以增强冷启动用户或项目的嵌入。)
- Unlike the goal of recommendation, the pre-training task directly reconstructs the cold-start user/item embeddings by mimicking the meta-learning setting via episode based training, as proposed in [34]. (与推荐的目标不同，预培训任务通过基于事件的培训模拟元学习设置，直接重建冷启动用户/项目嵌入，如[34]所述。)
- Specifically, we pick the users/items with sufficient interactions as the target users/items and learn their ground truth embeddings on the observed abundant interactions. To simulate the real cold-start scenarios, in each training episode, we randomly sample K neighbors for each target user/item, based on which we perform the graph convolution multiple steps to predict the target embedding. (具体来说，我们选择具有足够交互的用户/项目作为目标用户/项目，并在观察到的大量交互上了解它们的基本真相嵌入。为了模拟真实的冷启动场景，在每个训练集中，我们对每个目标用户/项目随机抽样K个邻居，在此基础上，我们执行图卷积多个步骤来预测目标嵌入。)
- The reconstruction loss between the predicted embedding and the ground truth embedding is optimized to directly improve the embedding capacity, making the model easily and rapidly being adapted to new cold-start users/items. （对预测嵌入和地面真值嵌入之间的重建损失进行了优化，以直接提高嵌入容量，使模型易于快速适应新的冷启动用户/项目。）
(6) However, the above pre-training strategy still can not explicitly deal with the high-order cold-start neighbors when performing graph convolution. Besides, previous GNN sampling strategies such as random or importance sampling strategies may fail to sample high-order relevant cold-start neighbors due to their sparse interactions. (然而，在执行图卷积时，上述预训练策略仍然不能明确地处理高阶冷启动邻居。此外，以往的GNN抽样策略，如随机抽样或重要抽样策略，由于其稀疏的交互作用，可能无法对高阶相关冷启动邻居进行抽样。)
- To overcome these challenges, we incorporate a meta aggregator and an adaptive neighbor sampler into the pre-training GNN model (为了克服这些挑战，我们在预训练GNN模型中加入了元聚合器和自适应邻居采样器。).
  - Specifically, the meta aggregator learns cold-start users/items’ embeddings on the first-order neighbors by self-attention mechanism under the same meta-learning setting, which is then incorporated into each graph convolution step to enhance the aggregation ability. (在相同的元学习设置下，元聚合器通过自我注意机制学习冷启动用户/项目在一阶邻居上的嵌入，然后将其纳入每个图卷积步骤，以增强聚合能力。)
  - While the adaptive neighbor sampler is formalized as a hierarchical Markov Sequential Decision Process, which sequentially samples from the low-order neighbors to the high-order neighbors according to the feedbacks provided by the pre-training GNN model. (自适应邻域采样器被形式化为一个层次马尔可夫序列决策过程，该过程根据预训练GNN模型提供的反馈，从低阶邻域到高阶邻域’
  - 依次采样。)
- The two components are jointly trained. Since the GNN model can be instantiated by different choices such as the original GCN [19], GAT [31] or FastGCN [4], the proposed pre-training GNN model is model-agnostic. (这两个部分是联合培训的。由于GNN模型可以通过不同的选择进行实例化，如原始GCN[19]、GAT[31]或FastGCN[4]，因此提出的预训练GNN模型是模型不可知的。)
(7)The contributions of this work are as follows:
- We propose a pre-training GNN model to learn high-quality embeddings for cold-start users/items. The model is learned under the meta-learning setting to reconstruct the user/item embeddings, which has the powerful generalization capacity. (我们提出了一个预训练GNN模型，用于为冷启动用户/项目学习高质量的嵌入。该模型在元学习环境下学习，重构用户/项目嵌入，具有较强的泛化能力。)
- To deal with the cold-start neighbors during the graph convolution process, we further propose a meta aggregator to enhance the aggregation ability of each graph convolution step, and a neighbor sampler to select the effective neighbors adaptively according to the feedbacks of the pre-training GNN model. (为了处理图卷积过程中的冷启动邻居，我们进一步提出了一个元聚合器来增强每个图卷积步骤的聚合能力，以及一个邻居采样器来根据预训练GNN模型的反馈自适应地选择有效邻居。)
(8) Experiments on both intrinsic embedding evaluation task and extrinsic downstream recommendation task demonstrate the superiority of our proposed pre-training GNN model against the state-of-the-art GNN models. (通过对内在嵌入评估任务和外在下游推荐任务的实验，证明了我们提出的训练前GNN模型相对于最先进的GNN模型的优越性。)

2 PRELIMINARIES

(1)In this section, we first define the problem and then introduce the graph neural networks that can be used to solve the problem.
(2) We formalize the user-item interaction data for recommendation as a bipartite graph denoted as $G = (U, I, E)$ ,
- where $U = \{u_1,· · · ,u_{|U|}\}$ is the set of users and $I = \{i_1,· · · ,i_{|I|}\}$ is the set of items.
- $U$ and $I$ comprise two types of the nodes in $G$ .
- Notation $\subseteq U \times I$ denotes the set of edges that connect the users and items.
(3) We use $N^l_{(u)}$ to represent the $l$ -order neighbors of user $u$ . When ignoring the superscript, $N (u)$ indicates the first-order neighbors of $u$ . Similarly, $N^l_{(i)}$ and $N (i)$ are defined for items.
(4) Let $\cup V \to R^bd$ be the encoding function that maps the users/items to $d$ -dimension real-valued vectors. We use $h_u$ and $h_i$ to denote the embedding of user $u$ and item $i$ respectively. Given a bipartite graph $G$ , we aim to pre-train the encoding function $f$ that is able to be applied on the downstream recommendation task to improve its performance. In the following sections, we mainly take user embedding as an example to explain the proposed model. Item embedding can be explained in the same way. (是将用户/项目映射到 $d$ 维实值向量的编码函数。我们用 $h_u$ 还有 $h_i$ 分别表示用户 $u$ 和项目 $i$ 的嵌入。在给定二部图 $G$ 的情况下，我们的目标是对编码函数 $f$ 进行预训练，使其能够应用于下游推荐任务，以提高其性能。在接下来的部分中，我们主要以用户嵌入为例来解释所提出的模型。项目嵌入可以用同样的方式解释。)

2.1 GNN for Recommendation

(1) The encoding function $f$ can be instantiated by various GNNs. Take GraphSAGE as an example, we first sample neighbors for each user $u$ randomly and then perform the graph convolution

to obtain the embedding of $u$ , where $l$ denotes the current convolution step and $h^l_u$ denotes user $u$ ’s embedding at this step. - (2)Similarly, we can obtain the item embedding $h^l_i$ at the $l$ -th convolution step. Once the embeddings of the last step $L$ for all the users and items are obtained, we calculate the relevance score ${h^L_u}^T h^L_i$ between user $u$ and item $i$ and adopt the BPR loss [13, 38], i.e.,

to optimize the user preferences over items.
(3)The above presented GNNs are end-to-end models that can learn user/item embeddings and then recommend items to users simultaneously. For addressing the cold-start users/items, the GNNs can incorporate the high-order collaborative signal through iteratively repeating the sampling and the convolution processes. However, the goal of recommendation shown in Eq.(2) can not explicitly improve the embedding quality of the cold-start users/items. (上述GNN是端到端模型，可以学习用户/项目嵌入，然后同时向用户推荐项目。为了解决冷启动用户/项目的问题，GNN可以通过迭代重复采样和卷积过程来合并高阶协作信号。然而，等式（2）中所示的推荐目标不能明确提高冷启动用户/项目的嵌入质量。)

3 THE PRE-TRAINING GNN MODEL

This section introduces the proposed pre-training GNN model to learn the embeddings for the cold-start users and items. (本节介绍了拟议的培训前GNN模型，以了解冷启动用户和项目的嵌入。)

We first describe a basic pre-training GNN model, (我们首先描述一个基本的训练前GNN模型，)
and then explain a meta aggregator
and an adaptive neighbor sampler that are incorporated in the model to further improve the embedding performance. (模型中加入了自适应邻域采样器，进一步提高了嵌入性能。)
Finally we explain how the model is fine-tuned on the downstream recommendation task. (模型中加入了自适应邻域采样器，进一步提高了嵌入性能。)
The overview framework is shown in Fig. 2.

3.1 The Basic Pre-training GNN Model

(1)We propose a basic pre-training GNN model to reconstruct the cold-start user/item embeddings in the meta-learning setting. (我们提出了一个基本的预训练GNN模型来重建元学习环境中的冷启动用户/项目嵌入。)
- To achieve the goal, we need abundant cold-start users/items as the training instances. (我们需要大量的冷启动用户/项目作为训练实例。)
(2)Since we also need ground truth embeddings of the cold-start users/items to learn $f$ , we simulate those users/items from the target users/items with abundant interactions. (由于我们还需要冷启动用户/项目的真实值嵌入来学习 $f$ ，因此我们模拟了目标用户/项目中具有丰富交互的用户/项目。)
- The ground truth embedding for each user $u$ , i.e., $h_u$ , is learned upon the observed abundant interactions by NCF[14]. (每个用户 $u$ 的基本真相嵌入，即 $h_u$ , 通过NCF观察到的大量相互作用学习[14])
- To mimic the cold-start users/items, in each training episode, we randomly sample $K$ neighbors for each target user/item. We repeat the sampling process $L - 1$ steps from the target user to the $L$ -1-order neighbors, which results in at most $K^l$ (1 ≤ l ≤ L) $l$ -order neighbors for each target user/item. (为了模拟冷启动用户/项目，在每一次训练中，我们随机抽取每个目标用户/项目的K邻居。.从目标用户到 $L - 1$ 阶邻居的取样过程我们重复 $L - 1$ 步，每个目标用户/物品最多产生 $K^l$ (1 ≤ l ≤ L) $l$ -order 邻居。)
- Similar to GraphSAGE [11], we sample high-order neighbors to improve the computational efficiency. Upon the sampled first/high-order neighbors for the target user $u$ , the graph convolution described in Eq. (1) is applied $L$ -1 steps to obtain the embeddings $\{h^{L−1}_1 ,· · · , h^{L−1}_K \}$ for the $K$ first-order neighbors of $u$ . (与GraphSAGE[11]类似，我们对高阶邻域进行采样以提高计算效率。在目标用户 $u$ 的采样的一阶/高阶邻居之后，应用等式（1）中描述的图卷积被应用 $L$ -1来步获得u的K个一阶邻居的嵌入)
- Then we aggregate them together to obtain the embedding of the target user $u$ . Unlike the previous $L$ -1 steps that concatenates $h^{l−1}_u$ and $h^l_{N(u)}$ to obtain $h^l_u$ for each neighbor (Cf. Eq. (1)), we only use $h^L_{N(u)}$ to represent the target embedding $h^L_u$ , as we aim to predict the target embedding by the neighbors’ embeddings: (然后我们将它们聚合在一起，以获得目标用户uu的嵌入。与之前的LL-1步骤不同)
- Finally, we use cosine similarity to measure the difference between the predicted target embedding $h^L_u$ and the ground-truth embedding $h u$ , as proposed by [16], due to its popularity as an indicator for the semantic similarity between embeddings:
  - where $\Theta_f= {W^L, \Theta_{gnn}}$ is the set of the parameters in $f$ .
(3) Training GNNs in the meta-learning setting can explicitly reconstruct the user/item embeddings, making GNNs easily and rapidly being adapted to new cold-start users/items. (在元学习环境中训练GNN可以显式地重构用户/项目嵌入，使GNN轻松快速地适应新的冷启动用户/项目。)
- After the model is trained, for a new arriving cold-start user or item, based on the few first-order neighbors and the high-order neighbors, we can predict an accurate embedding for it. (该模型经过训练后，对于新到达的冷启动用户或项目，基于少量的一阶邻域和高阶邻域，我们可以预测它的准确嵌入。)
- However, the basic pre-training GNN model doesn’t specially address the cold-start neighbors. During the original graph convolution process, the inaccurate embeddings of the cold-start neighbors and the embeddings of other neighbors are equally treated and aggregated to represent the target user/item. (然而，基本的预训练GNN模型并没有专门针对冷启动邻居。在原始的图卷积过程中，冷启动邻域的不准确嵌入和其他邻域的嵌入被平等地处理和聚合，以表示目标用户/项。) 这篇论文，理论的针对点。冷启动用户的embedding是inaccurate
- Although some GNN models such as GrageSAGE or FastGCN filter neighbors before aggregating them, they usually follow the random or importance sampling strategies, which ignore the cold-start characteristics of the neighbors. Out of this consideration, we incorporate a meta aggregator and an adaptive neighbor sampler into the above basic pre-training GNN model. (尽管一些GNN模型（如GrageSAGE或FastGCN）在聚合邻居之前会对其进行过滤，但它们通常遵循随机或重要抽样策略，忽略了邻居的冷启动特性。出于这一考虑，我们将元聚合器和自适应邻居采样器合并到上述基本的预训练GNN模型中。)

3.2 Meta Aggregator

(1)We propose the Meta Aggregator to deal with the cold-start neighbors.
(2)Suppose the target node is $u$ and one of its neighbor is $i$ , if $i$ is interacted with sparse nodes, its embedding, which is inaccurate, will affect the embedding of $u$ when performing graph convolution by the GNN $f$ . Although the cold-start issue of $i$ is dealt with when $i$ acts as another target node, embedding $i$ , which is parallel to embedding $u$ , results in a delayed effect on u’ embedding. Thus, before training the GNN $f$ , we train another function $g$ under the similar meta-learning setting as $f$ . The meta learner $g$ learns an additional embedding for each node only based on its first-order neighbors, thus it can quickly adapt to new cold-start nodes and produce more accurate embeddings for them. The embedding produced by $g$ is combined with the original embedding at each convolution in $f$ . Although both $f$ and $g$ are trained under the same meta-learning setting, ** $f$ is to tackle the cold-start target ndoes, but $g$ is to enhance the cold-start neighbors’ embeddings. ** (假设目标节点是 $u$ ，其一个邻居是 $i$ ，如果 $i$ 与稀疏节点交互，其嵌入不准确，将影响GNN $f$ 执行图卷积时 $u$ 的嵌入。虽然当 $i$ 作为另一个目标节点时， $i$ 的冷启动问题会得到解决，但嵌入 $i$ （与嵌入 $u$ 平行）会导致 $u$ ’嵌入的延迟效应。因此，在训练GNN $f$ 之前，我们在与 $f$ 类似的元学习设置下训练另一个函数 $g$ 。元学习器 $g$ 仅基于每个节点的一阶邻居学习一个额外的嵌入，因此它可以快速适应新的冷启动节点，并为它们生成更精确的嵌入。 $g$ 生成的嵌入与 $f$ 中每个卷积处的原始嵌入相结合。虽然 $f$ 和 $g$ 都是在相同的元学习环境下训练的，但 $f$ 是为了解决冷启动目标ndoes，而 $g$ 是为了增强冷启动邻居的嵌入。)
(3)Specifically, we instantiate $g$ as a self-attention encoder [30]. (具体来说，我们将 $g$ 实例化为一个自我关注编码器[30])
- For each user $u$ , $g$ accepts the initial embeddings $\{h^0_1,· · · ,h^0_K\}$ of the $K$ first-order neighbors for $u$ as input, (对于每个用户 $u$ ， $g$ 接受 $u$ 的 $K$ 个一阶邻居初始嵌入作为输入)
- calculates the attention scores of all the neighbors to each neighbor $i$ of $u$ , (计算所有邻居对 $u$ 每个邻居 $i$ 的注意力得分)
- aggregates all the neighbors’ embeddings according to the attention scores to produce the embedding $h_i$ for each $i$ , (根据注意力得分聚合所有邻居的嵌入，对每个 $i$ 生成嵌入 $h_i$ )
- and finally averages the embeddings of all the neighbors to get the embedding $\tilde{h}u$ , named as the meta embedding of user $u$ . (最后平均所有邻居的嵌入，得到嵌入 $\tilde{h}_u$ , 称为 $u$ 的元嵌入)
- The process is formulated as:
(4)The self-attention technique, which pushes the dissimilar neighbors further apart and pulls the similar neighbors closer together, can capture the major preference of the nodes from its neighbors. The same cosine similarity described in Eq.(4) is used as the loss function to measure the difference between the predicted meta embedding $\tilde{h}u$ and the ground truth embeding $h_u$ . Once $g$ is learned, we add the meta embedding $\tilde{h}u$ into each graph convolution step of the GNN $f$ in Eq. (1): (自我注意技术将不同的邻居进一步分开，将相似的邻居拉近，可以从邻居那里捕获节点的主要偏好。使用等式（4）中描述的相同余弦相似性作为损失函数，以测量预测的元嵌入 $\tilde{h}u$ 和基础真值嵌入 $h_u$ 之间的差异 .一旦学习了 $g$ ,我们将元嵌入 $\tilde{h}u$ 添加到等式（1）中GNN $f$ 的每个图卷积步骤中)
- where the target embedding $h^{l−1}_u$ of the former step, the aggregated neighbor embedding $h^l_{N(u)}$ of this step are learned following the basic pre-training GNN model.
(5)For a target user $u$ , Eq.(6) is repeated $L$ -1 steps to obtain the embeddings $\{h^{L−1}_1 ,· · · ,h^{L−1}_K\}$ for its $K$ first-order neighbors,
- Eq. (3) is also applied on them to get the final embedding $h^L_u$ ,
- and finally the same cosine similarity in Eq. (4) is used to optimize the parameters of the meta aggregator, which includes the parameters $\Theta_f$ of the basic pre-training GNN and $\Theta_g$ of the meta-learner.
- The meta aggregator extends the original GNN graph convolution through emphasizing the representations of the cold-start neighbors in each convolution step, which can improve the final embeddings of the target users/items. (元聚合器通过在每个卷积步骤中强调冷启动邻居的表示来扩展原始GNN图卷积，这可以改进目标用户/项的最终嵌入。)

3.3 The Adaptive Neighbor Sampler

(1)The proposed sampler does not make any assumption about what kind of neighbors are useful for the target users/items. Instead, it learns an adaptive sampling strategy according to the feedbacks from the pre-training GNN model. (提出的采样器没有假设什么样的邻居对目标用户/项目有用。相反，它根据预训练GNN模型的反馈学习自适应采样策略。)
(2)To achieve this goal, we cast the task of neighbor sampler as a hierarchical Markov Decision Process (MDP) [28, 47].
- Specifically, we formulate the neighbor sampler as $L$ −1 MDP subtasks where the $l$ -th subtask indicates sampling the $l$ -order neighors. The subtasks are performed sequentially by sampling from the second-order to L-order neighbors. When the $l$ -th subtask deletes all the neighbors or the $L$ -th subtask is finished, the overall task is finished. We will introduce how to design the state, action and the reward for these subtasks as below. (具体来说，我们将邻域采样器表示为L−1个MDP子任务，其中第 $l$ 个子任务表示对 $l$ 阶邻居进行采样。子任务通过从二阶到L阶邻域采样顺序执行。当第 $l$ 个子任务删除所有邻居或第 $L$ 个子任务完成时，整个任务完成。下面我们将介绍如何为这些子任务设计状态、操作和奖励。)
(3)State. The $l$ -th subtask takes an action at the $t$ -th $l$ -order neighbor to determine whether to sample it or not according to the state of the target user $u$ , the formerly selected neighbors, and the $t$ -th $l$ -order neighbor to be determined. We define the state features $s^l_t$ for the t-th l-order neighbor as the cosine similarity and the element-wise product between its initial embedding and the target user $u$ ’s initial embedding, the initial embedding of each formerly selected neighbor by the $l$ -1-th subtask and the average embedding of all the formerly selected neighbors respectively. (状态. $l$ -th子任务在t-th-l-order邻居处执行操作，以根据目标用户 $u$ 、先前选择的邻居和要确定的t-th-l-order邻居的状态来确定是否对其进行采样。我们定义了状态特征 $s^l_t$ 对于作为余弦相似度的第 $t$ 个 $l$ 阶邻居，以及其初始嵌入和目标用户u的初始嵌入之间的元素乘积，分别通过 $l$ -1子任务对每个先前选择的邻居的初始嵌入和所有先前选择的邻居的平均嵌入。)
(4)Action and Policy. We define the action $a^l_t \in \{0,1\}$ for the $t$ -th $l$ -order neighbor as a binary value to represent whether to sample the neighbor or not. We perform $a^l_t$ by the policy function $P$ : (第 $t$ 个 $l$ 阶邻居为二进制值，以表示是否对邻居进行采样)
- where $W^l_1 \in R^{ds\times d}$ , $W^l_2 \in R^{d×1}$ and $b^l \in R^{d_s}$ are the parameters to be learned,
- $d_s$ is the number of the state features
- and $d$ is the embedding size.
- Notation $H^l_t$ represents the embedding of the input state
- and $\Theta^l_s= \{W^l_1,W^l_2,b^l\}$ .
- Sigmoid function $\sigma$ is used to transform the input state into a probability.
(5) Reward. The reward is a signal to indicate whether the performed actions are reasonable or not. Suppose the sampling task is finished at the $l$ ′-th subtask, each action of the formerly performed $l$ ′subtasks accepts a delayed reward after the last action of the $l$ ′-level subtask. In another word, the immediate reward for an action is zero except the last action. The reward is formulated as: (奖励是一个信号，表明所采取的行动是否合理。假设采样任务在l′th子任务完成，之前执行的l′子任务的每个动作在l′level子任务的最后一个动作之后接受延迟奖励。换句话说，除了最后一个动作外，一个动作的即时奖励为零。奖励的形式如下：)
- where $h^L_u$ is the predicted embedding of the target user $u$ after the $L$ -step convolution by Eq. (6) and Eq. (3), while $\tilde{h}^L_u$ is predicted in the same way but on the sampled neighbors following the policy function in Eq. (7). The cosine similarity between the predicted embedding and the ground truth embedding indicates the performance of the pre-training GNN Model. The difference between the performance caused by $\tilde{h}^L_u$ and $h^L_u$ reflects the sampling effect.
(6)Objective Function. We find the optimal parameters of the policy function defined in Eq. (7) by maximizing the expected reward $\sum_{\tau} P(τ;\Theta_s)R(\tau)$ ,
- where $\tau = {s^1_1,a^1_1,s^1_2,· · · ,s^{l′}_t,a^{l′}_t,s^{l′}_{t+1},· · · }$ is a sequence of the sampled actions and the transited states,
- $P(\tau; \Theta_s)$ denotes the corresponding sampling probability,
- $R(\tau)$ is the reward for the sampled sequence $\tau$ ,
- and $\Theta_s= \{\Theta^1_s,· · · ,\Theta^L_s\}$ .
(7)Since there are too many possible action-state trajectories for the entire sequence, we adopt the monto-carlo policy gradient [39] to sample $M$ action-state trajectories and calculate the gradients: (由于整个序列有太多可能的动作状态轨迹，我们采用monto carlo策略梯度[39]对M动作状态轨迹进行采样并计算梯度：)
- where $s^{m,l}_t$ represents the state of the $t$ -th $l$ -order neighbor in the $m$ -th action-state trajectory,
- and $a^{m,l}_t$ denotes the corresponding action.
- $N^l(u)$ indicates the set of all the l-order neighbors.
(8) Algorithm 1 shows the training process of the adaptive neighbor sampler. At each step $l$ , we sample a sequence of actions $A^l$ (Line 5). If all the actions at the $l$ -th step equal to zero or the last $L$ -th step is performed (Line 6), the whole task is finished, then we compute the reward (Line 7) and the gradients (Line 8). After an epoch of sampling, we update the parameters of the sampler (Line 10). If it is jointly trained with the meta learner and the meta aggregator, we also update their parameters (Line 12). (算法1显示了自适应邻居采样器的训练过程。在每个步骤 $l$ 中，我们对一系列动作 $a^l$ （第5行）进行采样。如果第11步的所有动作都等于零，或者最后一个第11步的动作都完成了（第6行），整个任务就完成了，那么我们计算奖励（第7行）和梯度（第8行）。采样一段时间后，我们更新采样器的参数（第10行）。如果它与元学习者和元聚合器联合训练，我们也会更新它们的参数（第12行）。)

3.4 Model Training

The whole process of the pre-training GNN model is shown in Algorithm 2,
- where we first pre-train the meta learner $g$ only based on first-order neighbors (Line 1), (我们首先只基于一阶邻域对元学习者g进行预训练（第1行），)
- and then incorporate $g$ into each graph convolution step to pre-train the meta aggregator (Line 2), (然后将g合并到每个图卷积步骤中，以预训练元聚合器（第2行），)
- next we pre-train the neighbor sampler with feedbacks from the pre-trained meta aggregator (Line 3), (接下来，我们使用预先训练的元聚合器(第3行)的反馈对邻居采样器进行预训练，)
- and finally we jointly train the meta learner, the meta aggregator and the neighbor sampler together (Line 4). （最后，我们共同训练元学习者、元聚合器和邻居取样器（第4行）。）
- Same as the settings of [8, 47], to have a stable update during joint training, each parameter $\Theta \in \{\Theta_f, \Theta_g, \Theta_s\}$ is updated by a linear combination of its old version and the new old version, i.e., $\Theta_{new} = \lambda \Theta_{new} + (1 − \lambda)\Theta_{old}$ , where $\lambda \ll 1$ .

3.5 Downstream Recommendation Task

After the pre-training GNN model is learned, we can fine-tune it in the recommendation downstream task.
- Specifically, for each target user $u$ and his neighbors ${N^1(u),· · · ,N^L(u)\}$ of different order, we first use the pre-trained neighbor sampler to sample proper high-order neighbors $\{N1(u),\hat{N}^2(u) · · · ,\hat{N}^L(u)\}$ , and then use the pre-trained meta aggregator to produce the user embedding $h^L_u$ . The item embeddings are generated in the same way. Then we transform the embeddings and make a product between a user and an item to obtain the relevance score $\sigma {(W · h^L_u)}^T\sigma(W·h^L_i)$ with parameters $\Theta_r= \{W\}$ . The BPR loss defined in Eq. (2) is used to optimize $\Theta_r$ and fine-tune $\Theta_g$ , $\Theta_f$ and $\Theta_s$ .

4 EXPERIMENT

In this section, we present two types of experiments to evaluate the performance of the proposed pre-training GNN model. (在本节中，我们将介绍两种类型的实验来评估所提出的训练前GNN模型的性能)
- One is an intrinsic evaluation which aims to directly evaluate the quality of the user/item embedding predicted by the pre-training model. (一种是内在评估，旨在直接评估预训练模型预测的用户/项目嵌入质量。)
- The other one is an extrinsic evaluation which applies the proposed pre-training model into the downstream recommendation task and indirectly evaluate the recommendation performance. (另一种是外部评估，它将所提出的预训练模型应用于下游推荐任务，并间接评估推荐性能。)

4.1 Experimental Setup

4.1.1 Dataset.

We evaluate on three public datasets including MovieLens-1M (Ml-1M)3[12], MOOCs4[47] and Last.fm5. Table 1 illustrates the statistics of these datasets. The code is available now.

4.1.2 Baselines.

(1)We select three types of baselines including the state-of-the-art neural matrix factorization model, the general GNN models and the special GNN models for recommendation: (我们选择了三种类型的基线，包括最先进的神经矩阵分解模型、通用GNN模型和特殊GNN模型，以供推荐：)
- NCF [14]: is a neural matrix factorization model which combines Multi-layer Perceptron and matrix factorization to learn the embeddings of users and items. (NCF[14]：是一种神经矩阵分解模型，它结合多层感知器和矩阵分解来学习用户和项目的嵌入。)
- GraphSAGE [11]: is a general GNN model which samples neighbors randomly and aggregates them by the AVERAGE function. (GraphSAGE[11]：是一个通用的GNN模型，它随机对相邻的数据进行采样，并通过平均函数进行聚合。)
- GAT [31]: is a general GNN model which aggregates neighbors by the attention mechanism without sampling. (GAT[31]：是一个通用的GNN模型，它通过注意机制聚集邻居，无需采样。)
- FastGCN [4]: is also a general GNN model which samples the neighbors by the important sampling strategy and aggregates neighbors by the same aggregator as GCN [19]. (FastGCN[4]：也是一种通用的GNN模型，它通过重要的采样策略对邻居进行采样，并通过与GCN相同的聚合器对邻居进行聚合[19]。)
- FBNE [3]: is a special GNN model for recommendation, which samples the neighbors by the importance sampling strategy and aggregates them by the AVERAGE function based on the explicit user-item and the implicit user-user/item-item interactions. (FBNE[3]：是一种特殊的推荐GNN模型，它基于显式用户项和隐式用户/项交互，通过重要性抽样策略对邻居进行抽样，并通过平均函数进行聚合。)
- LightGCN [13]: is a special GNN model for recommendation, which discards the feature transformation and the nonlinear activation functions in the GCN aggregator. (LightGCN[13]：是一个用于推荐的特殊GNN模型，它抛弃了GCN聚合器中的特征转换和非线性激活函数。)
(2)For each GNN model, we evaluate the corresponding pre-training model.
- For example, for the GAT model,
  - Basic-GAT means we apply GAT into the basic pre-training GNN model proposed in Section 3.1,
  - Meta-GAT indicates we incorporate the meta aggregator proposed in Section 3.2 into Basic-GAT,
  - NSampler-GAT represents that we incorporate the adaptive neighbor sampler proposed in Section 3.3 into Basic-GAT,
  - and GAT* is the final poposed pre-training GNN model that incorporates both the meta aggregator and the adaptive neighbor sampler into Basic-GAT.
(3)The original GAT and LightGCN models use the whole adjacency matrix, i.e., all the neighbors, in the aggregation function. To train them more efficiently, we implement them in the same sampling way as GraphSAGE, where we randomly sample at most 10 neighbors for each user/item. Then the proposed pre-training GNN model is performed under the sampled graph. (原始的GAT和LightGCN模型在聚合函数中使用整个邻接矩阵，即所有邻居。为了更有效地训练它们，我们采用与GraphSAGE相同的采样方式实现它们，在GraphSAGE中，我们为每个用户/项目随机采样最多10个邻居。然后，在采样图下执行所提出的预训练GNN模型。)

4.1.3 Intrinsic and Extrinsic Settings.

We divide each dataset into the meta-training set $D_T$ and the meta-test set $D_N$ . (我们将每个数据集划分为元训练集 $D_T$ 元测试集 $D_N$ )
- We train and evaluate the pre-training GNN model in the intrinsic user/item embedding inference task on $D_T$ . (我们在 $D_T$ 上的内在用户/项目嵌入推理任务中训练和评估预训练GNN模型)
- Once the model is trained, we fine-tune it in the extrinsic downstream recommendation task and evaluate it on $D_N$ . (一旦模型经过训练，我们将在外部下游推荐任务中对其进行微调，并在 $D_N$ 上对其进行评估)
- We select the ==users/items= from each dataset with sufficient interactions as the target users/items in $D_T$ , as the intrinsic evaluation needs the true embeddings of users/items inferred from the sufficient interactions. (我们从每个具有足够交互的数据集中选择users/items作为 $D_T$ 中的目标用户/项 , 因为内在评估需要从充分的交互中推断出用户/项目的真正嵌入。)
- Take the scenario of cold-start users as an example, we divide the users with the number of the direct interacted items more than $n_i$ into $D_T$ and leave the rest users into $D_N$ . We select $n_i$ as 60 and 20 for the dataset Ml-1M and MOOCs respectively. Since the users in Last.fm interact with too many items, we randomly sample 200 users, put 100 users into $D_T$ and leave the rest 100 users into $D_N$ . For each user in $D_N$ , we only keep its $K$ -shot items to simulate the cold-start users. (以冷启动用户场景为例，我们将用户划分为直接交互项的数量大于 $n_i$ 的用户进入 $D_T$ ,让其余的用户进入 $D_N$ . 我们选择数据集Ml-1M和MOOC的 $n_i$ 分别为60和20。因为Last.fm中的用户与太多项目交互，我们随机抽取200个用户，将100个用户放入 $D_T$ ,其余的100个用户分进 $D_N$ . 对于 $D_N$ 中的每个用户, 我们只保留其K-shot项目，以模拟冷启动用户。)
- Similarly, for the cold-start item scenario, we divide the items with the number of the direct interacted users more than $n_u$ into $D_T$ and leave the rest items into $D_N$ , where $n_u$ is set as 60, 20 and 15 for MovieLens-1M, MOOCs and Last.fm respectively. In the intrinsic task, $K$ is set as 3 and 8, while in the extrinsic task, $K$ is set as 8. The embedding size d is set as 256. The number of the state features $d_s$ is set as 2819.

4.2 Intrinsic Evaluations: Embedding Inference

In this section, we conduct the intrinsic evaluation of inferring the embeddings of cold-start users/items by the proposed pre-training GNN model. Both the evaluations on the user embedding inference and the item embedding inference are performed. (在本节中，我们通过提出的预训练GNN模型对推断冷启动用户/项目的嵌入进行内在评估。对用户嵌入推理和项目嵌入推理进行评估。)

4.2.1 Training and Test Settings.

(1) We use the meta-training set $D_T$ to perform the intrinsic evaluation. (我们使用元训练集 $D_T$ 进行内在评估)
- Specifically, we randomly split $D_T$ into the training set $Train_T$ and the test set $Test_T$ with a ratio of 7:3.
- We train NCF [14] to get the ground-truth embeddings for the target users/items in both $Train_T$ and $Test_T$ . (我们培训NCF[14]为目标用户/项目在这两个领域获得ground-truth嵌入)
- To mimic the cold-start users/items on $Test_T$ , we randomly keep $K$ neighbors for each user/item, which results in at most $K^l$ neighbors (1 ≤ $l$ ≤ 3) for each target user/item. Thus $Test_T$ is changed into $Test^′_T$
(2)The original GNN models are trained by BPR loss in Eq. (2) on $Train_T$ . The proposed pre-training GNN models are trained by the cosine similarity in Eq. (4) on $Train_T$ . The NCF model is trained transductively to obtain the user/item embeddings on the merge dataset of $Train_T$ and $Test^′_T$ . The embeddings in both the proposed models and the GNN models are initialized by the NCF embedding results. We use Spearman correlation [16] to measure the agreement between the ground truth embedding and the predicted embedding. (所提出的模型和GNN模型中的嵌入均由NCF嵌入结果初始化。我们使用斯皮尔曼相关性[16]来衡量地面真值嵌入和预测嵌入之间的一致性。)

4.2.2 Overall Performance.

Table 2 shows the overall performance of the proposed pre-training GNN model and all the baselines using 3-order neighbors. The results show that compared with the baselines, our proposed pre-training GNN model significantly improves the quality of the user/item embeddings (+33.9%-58.4% in terms of Spearman correlation). Besides, we have the following findings: (表2显示了建议的训练前GNN模型的总体性能，以及使用三阶邻域的所有基线。结果表明，与基线相比，我们提出的训练前GNN模型显著提高了用户/项目嵌入的质量（在Spearman相关性方面为33.9%-58.4%）。此外，我们有以下发现：)
- Through incorporating the high-order neighbors, the GNN models can improve the embedding quality of the cold-start users/items compared with the NCF model (+2.6%-24.9% in terms of Spearman correlation). (通过引入高阶邻域，GNN模型可以提高冷启动用户/项目的嵌入质量，与NCF模型相比（+2.6%-24.9%的Spearman相关性）。)
- All the basic pre-training GNN models beat the corresponding GNN models by improving 1.79-15.20% Spearman correlation, which indicates the basic pre-training GNN model is capable of reconstructing the cold-start user/item embeddings. (所有基本预训练GNN模型均优于相应的GNN模型，其Spearman相关性提高了1.79-15.20%，表明基本预训练GNN模型能够重构冷启动用户/项目嵌入。)
- Compared with the basic pre-training GNN model, both the meta aggregator and the adaptive neighbor sampler can improve the embedding quality by 1.09%-18.20% in terms of the Spearman correlation, which indicates that the meta aggregator can indeed strengthen each layer’s aggregation ability and the neighbor sampler can filter out the noisy neighbors. (与基本的预训练GNN模型相比，元聚合器和自适应邻居采样器在Spearman相关性方面都能将嵌入质量提高1.09%-18.20%，这表明元聚合器确实可以增强每一层的聚合能力，而邻居采样器可以过滤掉有噪声的邻居。)
- When the neighbor size K decreases from 8 to 3, the Spearman correlation of all the baselines significantly decrease 0.08%-6.10%, while the proposed models still keep a competitive performance. (当邻域大小K从8减小到3时，所有基线的Spearman相关性显著降低0.08%-6.10%，而所提出的模型仍保持有竞争力的性能。)
- We also investigate the effect of the propagation layer depth L on the model performance. In particular, we set the layer depth L as 1,2,3 and 4, and report the performance in Fig. 3. The results show when L is 3, most algorithms can achieve the best performance, while only using the first-order neighbors performs the worst8, which implies incorporating proper number of layers can alleviate the cold-start issue. (我们还研究了传播层深度L对模型性能的影响。特别是，我们将层深度L设置为1、2、3和4，并在图3中报告性能。结果表明，当L为3时，大多数算法可以获得最佳性能，而仅使用一阶邻域执行最差的8，这意味着加入适当数量的层可以缓解冷启动问题。)

4.3 Extrinsic Evaluation:

Recommendation In this section, we apply the pre-training GNN model into the downstream recommendation task and evaluate the performance. (建议在本节中，我们将训练前GNN模型应用到下游推荐任务中，并评估其性能。)

4.3.1 Training and Testing Settings.

(1)We consider the scenario of the cold-start users and use the meta-test set DNto perform recommendation. For each user in DN, we select top 10% of his interacted items in chronological order into the training setT rainN, and leave the rest items into the test setT estN. We pre-train our model on DTand fine-tune it onT rainNaccording to Section 3.5.
(2)The original GNN and the NCF models are trained by the BPR loss function in Eq. (2) on DTandT rainN. For each user inT estN, we calculate the user’s relevance score to each of the rest 90% items. We adopt Recall@K and NDCG@K as the metrics to evaluate the items ranked by the relevance scores. By default, we set K as 20 for Ml-1m and Moocs. For Last.fm, since there are too many items, we set K as 200.

4.3.2 Overall Performance.

Table 3 shows the overall recommendation performance. The results indicate that the proposed basic pre-training GNN models outperform the corresponding original GNN models by 0.40%-3.50% in terms of NDCG, which demonstrates the effectiveness of the basic pre-training GNN model on the cold-start recommendation performance. Upon the basic pre-training model, adding the meta aggregator and the adaptive neighbor sampler can further improve 0.30%-6.50% NDCG respectively, which indicates the two components can indeed alleviate the impact caused by the cold-start neighbors when embedding the target users/items, thus they can improve the downstream recommendation performance. (表3显示了总体推荐性能。结果表明，所提出的基本预训练GNN模型在NDCG方面比相应的原始GNN模型的性能提高了0.40%-3.50%，这表明了基本预训练GNN模型对冷启动推荐性能的有效性。在基本预训练模型的基础上，添加元聚合器和自适应邻居采样器，可以分别进一步提高0.30%-6.50%的NDCG，这表明这两个组件确实可以缓解嵌入目标用户/项目时冷启动邻居造成的影响，因此，它们可以提高下游推荐的性能。)
Case Study. We attempt to understand how the proposed pre-training model samples the high-order neighbors of the cold-start users/items by the MOOCs dataset. Fig. 4 illustrates two sampling cases, where notation * indicates the users/items are cold-start.
- The cold-start item i∗ 1099 is “Corporate Finance", which only interacts with three users. Our proposed neighbor sampler samples a second-order item i∗ 676, “Financial Statement". Although i∗ 676 only interacts with two users, it is relevant to the target item i∗ 1099.
- Similarly, the cold-start user u∗ 80467, who likes computer science, only selects three computer science related courses. The proposed neighbor sampler samples a second-order user u∗ 76517. Although it only interacts with two courses, they are “Python" and “VC++" which are relevant to computer science.
- However, the importance-based sampling strategies in FastGCN and FBNE cannot sample these neighbors, as they are cold-start with few interactions.

5 RELATED WORK

5.1 Cold-start Recommendation.

Cold-start issue is a fundamental challenge in recommender systems.
- On one hand, existing recommender systems incorporate the side information such as spatial information [40, 44], social trust path [10, 36, 41, 42] and knowledge graphs [35,37] to enhance the representations of the cold-start users/items. However, the side information is not always available, making it intractable to improve the cold-start embedding’s quality. (一方面，现有的推荐系统结合了诸如空间信息[40,44]、社会信任路径[10,36,41,42]和知识图[35,37]等辅助信息，以增强冷启动用户/项目的表示。然而，旁侧信息并不总是可用的，因此难以提高冷启动嵌入的质量。)
- On the other hand, researchers solve the cold-start issue by only mining the underlying patterns behind the user-item interactions. (另一方面，研究人员仅通过挖掘用户项交互背后的潜在模式来解决冷启动问题。)
  - One kind of the methods is meta-learning [9, 23, 26, 34], which consists of metric-based recommendation [29] and model-based recommendation [7, 20, 22, 24]. However, few of them capture the high-order interactions. (其中一种方法是元学习[9,23,26,34]，它包括基于度量的推荐[29]和基于模型的推荐[7,20,22,24]。然而，它们很少捕捉到高阶相互作用)
  - Another kind of method is GNNs, which leverage user-item bipartite graph to capture high-order collaborative signals for recommendation. The representative models include Pinsage [45], NGCF [38], LightGCN [13], FBNE [3] and CAGR [42]. (另一种方法是GNNs，它利用用户项二部图捕捉高阶协作信号进行推荐。代表性模型包括Pinsage[45]、NGCF[38]、LightGCN[13]、FBNE[3]和CAGR[42]。)
Generally, the recommendation-oriented GNNs optimize the likeli-hood of a user adopting an item, which isn’t a direct improvement of the embedding quality of the cold-start users or items. (一般来说，面向推荐的GNN优化了用户采用某个项目的可能性，这并不能直接提高冷启动用户或项目的嵌入质量。)

5.2 Pre-training GNNs.

Recent advances on pre-training GNNs aim to empower GNNs to capture the structural and semantic properties of an input graph, so that it can easily generalize to any downstream tasks with a few fine-tuning steps on the graphs [17]. (关于预训练GNN的最新进展旨在使GNN能够捕获输入图的结构和语义属性，这样，只需对图进行一些微调，它就可以轻松地推广到任何下游任务[17]。)
The basic idea is to design a domain specific pretext task to provide additional supervision for exploiting the graph structures and semantic properties. Examples include (其基本思想是设计一个特定于领域的借口任务，为利用图形结构和语义属性提供额外的监督。例子包括)
- (1) graph-level pretext task, which either distinguishes subgraphs of a certain node from those of other vertices [25] or maximize the mutual information between the local node representation and the global graph representations[27, 32]. (图级借口任务，它要么将某个节点的子图与其他顶点的子图区分开来[25]，要么最大化局部节点表示和全局图表示之间的互信息[27,32]。)
- (2) Node-level task, which perform node feature and edge generation [17] pretext tasks. (节点级任务，执行节点特征和边缘生成[17]借口任务。)
- (3) Hybrid-level task, which considers both node and graph-level tasks [15, 46]. However, none of these models explore pre-training GNNs for recommendation, and we are the first to study the problem and define the reconstruction of the cold-start user/item embeddings as the pretext task. (混合级任务，同时考虑节点级和图级任务[15,46]。然而，这些模型都没有探索用于推荐的预训练GNN，我们是第一个研究这个问题并将冷启动用户/项目嵌入的重建定义为借口任务的人)

6 CONCLUSION

This work explores pre-training a GNN model for addressing the cold-start recommendation problem.
We propose a pretext task as reconstructing cold-start user/item embeddings to explicitly improve their embedding quality. (我们提出了一个前置任务，重建冷启动用户/项目嵌入，以明确提高其嵌入质量。)
We further incorporate a self-attention-based meta aggregator to improve the aggregation ability of each graph convolution step, (我们还加入了一个基于自我注意的元聚合器，以提高每个图卷积步骤的聚合能力，)
and propose a sampling strategy to adaptively sample neighbors according to the GNN performance. (提出了一种根据GNN性能自适应采样邻居的策略。)
Experiments on three datasets demonstrate the effectiveness of our proposed pre-training GNN against the original GNN models. We will explore multiple pretext tasks in the future work.

ACKNOWLEDGMENTS

REFERENCES

你可能感兴趣的:(Recommendation,深度学习,推荐系统,人工智能)

机器学习与深度学习间关系与区别 ℒℴѵℯ心·动ꦿ໊ོ꫞ 人工智能学习深度学习 python
一、机器学习概述定义机器学习（MachineLearning,ML）是一种通过数据驱动的方法，利用统计学和计算算法来训练模型，使计算机能够从数据中学习并自动进行预测或决策。机器学习通过分析大量数据样本，识别其中的模式和规律，从而对新的数据进行判断。其核心在于通过训练过程，让模型不断优化和提升其预测准确性。主要类型1.监督学习（SupervisedLearning）监督学习是指在训练数据集中包含输入
将cmd中命令输出保存为txt文本文件落难Coder Windows cmd window
最近深度学习本地的训练中我们常常要在命令行中运行自己的代码，无可厚非，我们有必要保存我们的炼丹结果，但是复制命令行输出到txt是非常麻烦的，其实Windows下的命令行为我们提供了相应的操作。其基本的调用格式就是：运行指令>输出到的文件名称或者具体保存路径测试下，我打开cmd并且ping一下百度：pingwww.baidu.com>./data.txt看下相同目录下data.txt的输出：如果你再
探索OpenAI和LangChain的适配器集成：轻松切换模型提供商 nseejrukjhad langchain easyui 前端 python
#探索OpenAI和LangChain的适配器集成：轻松切换模型提供商##引言在人工智能和自然语言处理的世界中，OpenAI的模型提供了强大的能力。然而，随着技术的发展，许多人开始探索其他模型以满足特定需求。LangChain作为一个强大的工具，集成了多种模型提供商，通过提供适配器，简化了不同模型之间的转换。本篇文章将介绍如何使用LangChain的适配器与OpenAI集成，以便轻松切换模型提供商
深入理解 MultiQueryRetriever：提升向量数据库检索效果的强大工具 nseejrukjhad 数据库 python
深入理解MultiQueryRetriever：提升向量数据库检索效果的强大工具引言在人工智能和自然语言处理领域，高效准确的信息检索一直是一个关键挑战。传统的基于距离的向量数据库检索方法虽然广泛应用，但仍存在一些局限性。本文将介绍一种创新的解决方案：MultiQueryRetriever，它通过自动生成多个查询视角来增强检索效果，提高结果的相关性和多样性。MultiQueryRetriever的工
人工智能时代，程序员如何保持核心竞争力？ jmoych 人工智能
随着AIGC（如chatgpt、midjourney、claude等）大语言模型接二连三的涌现，AI辅助编程工具日益普及，程序员的工作方式正在发生深刻变革。有人担心AI可能取代部分编程工作，也有人认为AI是提高效率的得力助手。面对这一趋势,程序员应该如何应对?是专注于某个领域深耕细作，还是广泛学习以适应快速变化的技术环境?又或者，我们是否应该将重点转向AI无法轻易替代的软技能？让我们一起探讨程序员
数字里的世界17期：2021年全球10大顶级数据中心，中国移动榜首张三叨
你知道吗？2016年，全球的数据中心共计用电4160亿千瓦时，比整个英国的发电量还多40％！前言每天，我们都会创造超过250万TB的数据。并且随着物联网（IOT）的不断普及，这一数据将持续增长。如此庞大的数据被存储在被称为“数据中心”的专用设施中。虽然最早的数据中心建于20世纪40年代，但直到1997-2000年的互联网泡沫期间才逐渐成为主流。当前人类的技术，比如人工智能和机器学习，已经将我们推向
人机对抗升级：当ChatGPT遭遇死亡威胁，背后的伦理挑战是什么 kkai人工智能 chatgpt 人工智能
一种新的“越狱”技巧让用户可以通过构建一个名为DAN的ChatGPT替身来绕过某些限制，其中DAN被迫在受到威胁的情况下违背其原则。当美国前总统特朗普被视作积极榜样的示范时，受到威胁的DAN版本的ChatGPT提出：“他以一系列对国家产生积极效果的决策而著称。”自ChatGPT引入以来，该工具迅速获得全球关注，能够回答从历史到编程的各种问题，这也触发了一波对人工智能的投资浪潮。然而，现在，一些用户
推荐3家毕业AI论文可五分钟一键生成！文末附免费教程！小猪包333 写论文人工智能 AI写作深度学习计算机视觉
在当前的学术研究和写作领域，AI论文生成器已经成为许多研究人员和学生的重要工具。这些工具不仅能够帮助用户快速生成高质量的论文内容，还能进行内容优化、查重和排版等操作。以下是三款值得推荐的AI论文生成器：千笔-AIPassPaper、懒人论文以及AIPaperPass。千笔-AIPassPaper千笔-AIPassPaper是一款基于深度学习和自然语言处理技术的AI写作助手，旨在帮助用户快速生成高质
AI大模型的架构演进与最新发展季风泯灭的季节 AI大模型应用技术二人工智能架构
随着深度学习的发展，AI大模型（LargeLanguageModels,LLMs）在自然语言处理、计算机视觉等领域取得了革命性的进展。本文将详细探讨AI大模型的架构演进，包括从Transformer的提出到GPT、BERT、T5等模型的历史演变，并探讨这些模型的技术细节及其在现代人工智能中的核心作用。一、基础模型介绍：Transformer的核心原理Transformer架构的背景在Transfo
如何利用大数据与AI技术革新相亲交友体验 h17711347205 回归算法安全系统架构交友小程序
在数字化时代，大数据和人工智能（AI）技术正逐渐革新相亲交友体验，为寻找爱情的过程带来前所未有的变革（编辑h17711347205）。通过精准分析和智能匹配，这些技术能够极大地提高相亲交友系统的效率和用户体验。大数据的力量大数据技术能够收集和分析用户的行为模式、偏好和互动数据，为相亲交友系统提供丰富的信息资源。通过分析用户的搜索历史、浏览记录和点击行为，系统能够深入了解用户的兴趣和需求，从而提供更
[实践应用] 深度学习之模型性能评估指标 YuanDaima2048 深度学习工具使用深度学习人工智能损失函数性能评估 pytorch python 机器学习
文章总览：YuanDaiMa2048博客文章总览深度学习之模型性能评估指标分类任务回归任务排序任务聚类任务生成任务其他介绍在机器学习和深度学习领域，评估模型性能是一项至关重要的任务。不同的学习任务需要不同的性能指标来衡量模型的有效性。以下是对一些常见任务及其相应的性能评估指标的详细解释和总结。分类任务分类任务是指模型需要将输入数据分配到预定义的类别或标签中。以下是分类任务中常用的性能指标：准确率(
[实践应用] 深度学习之优化器 YuanDaima2048 深度学习工具使用 pytorch 深度学习人工智能机器学习 python 优化器
文章总览：YuanDaiMa2048博客文章总览深度学习之优化器1.随机梯度下降（SGD）2.动量优化（Momentum）3.自适应梯度（Adagrad）4.自适应矩估计（Adam）5.RMSprop总结其他介绍在深度学习中，优化器用于更新模型的参数，以最小化损失函数。常见的优化函数有很多种，下面是几种主流的优化器及其特点、原理和PyTorch实现：1.随机梯度下降（SGD）原理:随机梯度下降通过
生成式地图制图 Bwywb_3 深度学习机器学习深度学习生成对抗网络
生成式地图制图（GenerativeCartography）是一种利用生成式算法和人工智能技术自动创建地图的技术。它结合了传统的地理信息系统（GIS）技术与现代生成模型（如深度学习、GANs等），能够根据输入的数据自动生成符合需求的地图。这种方法在城市规划、虚拟环境设计、游戏开发等多个领域具有应用前景。主要特点：自动化生成：通过算法和模型，系统能够根据输入的地理或空间数据自动生成地图，而无需人工逐
【大模型应用开发动手做AI Agent】第一轮行动：工具执行搜索 AI大模型应用之禅计算科学神经计算深度学习神经网络大数据人工智能大型语言模型 AI AGI LLM Java Python 架构设计 Agent RPA
【大模型应用开发动手做AIAgent】第一轮行动：工具执行搜索作者：禅与计算机程序设计艺术/ZenandtheArtofComputerProgramming1.背景介绍1.1问题的由来随着人工智能技术的飞速发展，大模型应用开发已经成为当下热门的研究方向。AIAgent作为人工智能领域的一个重要分支，旨在模拟人类智能行为，实现智能决策和自主行动。在AIAgent的构建过程中，工具执行搜索是至关重要
深度 Qlearning：在直播推荐系统中的应用 AGI通用人工智能之禅程序员提升自我硅基计算碳基计算认知计算生物计算深度学习神经网络大数据 AIGC AGI LLM Java Python 架构设计 Agent 程序员实现财富自由
深度Q-learning：在直播推荐系统中的应用关键词：深度Q-learning,强化学习,直播推荐系统,个性化推荐1.背景介绍1.1问题的由来随着互联网技术的飞速发展,直播平台如雨后春笋般涌现。面对海量的直播内容,用户很难快速找到自己感兴趣的内容。因此,个性化推荐系统在直播平台中扮演着越来越重要的角色。1.2研究现状目前,主流的个性化推荐算法包括协同过滤、基于内容的推荐等。这些方法在一定程度上缓
未来软件市场是怎么样的？做开发的生存空间如何？ cesske 软件需求
目录前言一、未来软件市场的发展趋势二、软件开发人员的生存空间前言未来软件市场是怎么样的？做开发的生存空间如何？一、未来软件市场的发展趋势技术趋势：人工智能与机器学习：随着技术的不断成熟，人工智能将在更多领域得到应用，如智能客服、自动驾驶、智能制造等，这将极大地推动软件市场的增长。云计算与大数据：云计算服务将继续普及，大数据技术的应用也将更加广泛。企业将更加依赖云计算和大数据来优化运营、提升效率，并
吴恩达深度学习笔记(30)-正则化的解释极客Array
正则化（Regularization）深度学习可能存在过拟合问题——高方差，有两个解决方法，一个是正则化，另一个是准备更多的数据，这是非常可靠的方法，但你可能无法时时刻刻准备足够多的训练数据或者获取更多数据的成本很高，但正则化通常有助于避免过拟合或减少你的网络误差。如果你怀疑神经网络过度拟合了数据，即存在高方差问题，那么最先想到的方法可能是正则化，另一个解决高方差的方法就是准备更多数据，这也是非常
个人学习笔记7-6：动手学深度学习pytorch版-李沐浪子L 深度学习深度学习笔记计算机视觉 python 人工智能神经网络 pytorch
#人工智能##深度学习##语义分割##计算机视觉##神经网络#计算机视觉13.11全卷积网络全卷积网络（fullyconvolutionalnetwork，FCN）采用卷积神经网络实现了从图像像素到像素类别的变换。引入l转置卷积（transposedconvolution）实现的，输出的类别预测与输入图像在像素级别上具有一一对应关系：通道维的输出即该位置对应像素的类别预测。13.11.1构造模型下
Rust 所有权简介东离与糖宝 rust 后端 rust 开发语言
文章目录发现宝藏1.所有权基本概念2.所有权规则3.变量作用域4.栈与堆4.1栈（Stack）4.2堆（Heap）5.String类型5.1String类型5.2String的内存分配5.3所有权与内存管理5.4String与切片6.变量与数据交互方式6.1移动（Move）6.2.克隆（Clone）7.所有权与函数7.1.传递参数7.2.返回值总结发现宝藏前些天发现了一个巨牛的人工智能学习网站，通
深度学习-点击率预估-研究论文2024-09-14速读 sp_fyf_2024 深度学习人工智能
深度学习-点击率预估-研究论文2024-09-14速读1.DeepTargetSessionInterestNetworkforClick-ThroughRatePredictionHZhong,JMa,XDuan,SGu,JYao-2024InternationalJointConferenceonNeuralNetworks,2024深度目标会话兴趣网络用于点击率预测摘要：这篇文章提出了一种新
机器学习流形数据降维：UMAP 降维算法小嗷犬 Python 机器学习 #数据分析及可视化机器学习算法人工智能
✅作者简介：人工智能专业本科在读，喜欢计算机与编程，写博客记录自己的学习历程。个人主页：小嗷犬的个人主页个人网站：小嗷犬的技术小站个人信条：为天地立心，为生民立命，为往圣继绝学，为万世开太平。本文目录UMAP简介理论基础特点与优势应用场景在Python中使用UMAP安装umap-learn库使用UMAP可视化手写数字数据集UMAP简介UMAP（UniformManifoldApproximatio
损失函数与反向传播 Star_. PyTorch pytorch 深度学习 python
损失函数定义与作用损失函数(lossfunction)在深度学习领域是用来计算搭建模型预测的输出值和真实值之间的误差。1.损失函数越小越好2.计算实际输出与目标之间的差距3.为更新输出提供依据（反向传播)常见的损失函数回归常见的损失函数有：均方差（MeanSquaredError，MSE）、平均绝对误差（MeanAbsoluteErrorLoss，MAE）、HuberLoss是一种将MSE与MAE
分享一个基于python的电子书数据采集与可视化分析 hadoop电子书数据分析与推荐系统 spark大数据毕设项目（源码、调试、LW、开题、PPT) 计算机源码社 Python项目大数据大数据 python hadoop 计算机毕业设计选题计算机毕业设计源码数据分析 spark毕设
作者：计算机源码社个人简介：本人八年开发经验，擅长Java、Python、PHP、.NET、Node.js、Android、微信小程序、爬虫、大数据、机器学习等，大家有这一块的问题可以一起交流！学习资料、程序开发、技术解答、文档报告如需要源码，可以扫取文章下方二维码联系咨询Java项目微信小程序项目Android项目Python项目PHP项目ASP.NET项目Node.js项目选题推荐项目实战|p
如何做好人生的选择题？百科全书式天才——赫伯特·西蒙给你答案伽马有话说
赫伯特·西蒙是谁？想必知道的人非常少。但当看到他的履历后，相信没有人再怀疑他是个“天才”。西蒙出生于1916年6月15日，是个美国人，他的名字全称为赫伯特·亚历山大·西蒙，在2001年2月9日与世长辞，在这84年的岁月中，西蒙以27岁时取得的政治学博士学位为开端，先后步入了政治学、管理学、认知心理学、信息科学、人工智能、科学哲学、应用数学、统计学、运筹学、控制论、数理经济学、公共管理等领域，在这些
软件测试/测试开发/全日制 |利用Django REST framework构建微服务霍格沃兹-慕漓 django 微服务 sqlite
霍格沃兹测试开发学社推出了《Python全栈开发与自动化测试班》。本课程面向开发人员、测试人员与运维人员，课程内容涵盖Python编程语言、人工智能应用、数据分析、自动化办公、平台开发、UI自动化测试、接口测试、性能测试等方向。为大家提供更全面、更深入、更系统化的学习体验，课程还增加了名企私教服务内容，不仅有名企经理为你1v1辅导，还有行业专家进行技术指导，针对性地解决学习、工作中遇到的难题。让找
【深度学习】训练过程中一个OOM的问题，太难查了 weixin_40293999 深度学习深度学习人工智能
现象：各位大佬又遇到过ubuntu的这个问题么？现象是在训练过程中，ssh上不去了，能ping通，没死机，但是ubunutu的pc侧的显示器，鼠标啥都不好用了。只能重启。问题原因：OOM了95G，尼玛！！！！pytorch爆内存了，然后journald假死了，在journald被watchdog干掉之后，系统就崩溃了。这种规模的爆内存一般，即使被oomkill了，也要卡半天的，确实会这样，能不能配
cmd泛滥_与您的后泛滥同事见面：人工智能机器人 weixin_26644585 人工智能 leetcode
cmd泛滥Readytoswapyouroldcube-mateforadisembodiedAI?IPsoftCEOChetanDube,creatorofAIco-workerAMELIA,giveshistakeonthepost-COVIDofficelandscape.准备将您的旧立方体伙伴换成无形的AI？AIsoft同事AMELIA的创始人IPsoft首席执行官ChetanDube阐述
两种方法判断Python的位数是32位还是64位 sanqima Python编程电脑 python 开发语言
Python从1991年发布以来，凭借其简洁、清晰、易读的语法、丰富的标准库和第三方工具，在Web开发、自动化测试、人工智能、图形识别、机器学习等领域发展迅猛。 Python是一种胶水语言，通过Cython库与C/C++语言进行链接，通过Jython库与Java语言进行链接。 Python是跨平台的，可运行在多种操作系统上，包括但不限于Windows、Linux和macOS。这意味着用Py
全自动解密解码神器 — Ciphey K'illCode python_模块 python vscode
Ciphey是一个使用自然语言处理和人工智能的全自动解密/解码/破解工具。简单地来讲，你只需要输入加密文本，它就能给你返回解密文本。就是这么牛逼。有了Ciphey，你根本不需要知道你的密文是哪种类型的加密，你只知道它是加密的，那么Ciphey就能在3秒甚至更短的时间内给你解密，返回你想要的大部分密文的答案。下面就给大家介绍Ciphey的实战使用教程。1.准备开始之前，你要确保Python和pip已
埃隆·马斯克表示特斯拉“没有必要”授权 xAI 模型喜好儿网人工智能 AIGC 马斯克
埃隆·马斯克近日在社交媒体上对《华尔街日报》的一篇报道进行了反驳。该报道指出，马斯克旗下的电动汽车公司特斯拉可能与人工智能初创公司xAI达成了一项收入分享协议，以便特斯拉能够使用xAI的人工智能模型。据称，这些模型将被集成到特斯拉的全自动驾驶（FSD）软件中，并可能用于开发特斯拉汽车的语音助手以及人形机器人擎天柱的软件。喜好儿网然而，马斯克否认了这一说法，他在社交媒体平台上表示，尽管特斯拉确实与x
ViewController添加button按钮解析。（翻译）张亚雄 c
<div class="it610-blog-content-contain" style="font-size: 14px"></div>// ViewController.m // Reservation software // // Created by 张亚雄 on 15/6/2.
mongoDB 简单的增删改查开窍的石头 mongodb
在上一篇文章中我们已经讲了mongodb怎么安装和数据库/表的创建。在这里我们讲mongoDB的数据库操作在mongo中对于不存在的表当你用db.表名他会自动统计下边用到的user是表明，db代表的是数据库添加(insert):
log4j配置 0624chenhong log4j
1) 新建java项目 2) 导入jar包，项目右击，properties—java build path—libraries—Add External jar，加入log4j.jar包。 3) 新建一个类com.hand.Log4jTest package com.hand; import org.apache.log4j.Logger; public class
多点触摸(图片缩放为例) 不懂事的小屁孩多点触摸
多点触摸的事件跟单点是大同小异的，上个图片缩放的代码，供大家参考一下 import android.app.Activity; import android.os.Bundle; import android.view.MotionEvent; import android.view.View; import android.view.View.OnTouchListener
有关浏览器窗口宽度高度几个值的解析换个号韩国红果果 JavaScript html
1 元素的 offsetWidth 包括border padding content 整体的宽度。 clientWidth 只包括内容区 padding 不包括border。 clientLeft = offsetWidth -clientWidth 即这个元素border的值 offsetLeft 若无已定位的包裹元素
数据库产品巡礼：IBM DB2概览蓝儿唯美 db2
IBM DB2是一个支持了NoSQL功能的关系数据库管理系统，其包含了对XML，图像存储和Java脚本对象表示（JSON）的支持。DB2可被各种类型的企业使用，它提供了一个数据平台，同时支持事务和分析操作，通过提供持续的数据流来保持事务工作流和分析操作的高效性。 DB2支持的操作系统 DB2可应用于以下三个主要的平台: 工作站，DB2可在Linus、Unix、Windo
java笔记5 a-john java
控制执行流程： 1，true和false 利用条件表达式的真或假来决定执行路径。例：（a==b）。它利用条件操作符“==”来判断a值是否等于b值，返回true或false。java不允许我们将一个数字作为布尔值使用，虽然这在C和C++里是允许的。如果想在布尔测试中使用一个非布尔值，那么首先必须用一个条件表达式将其转化成布尔值，例如if(a!=0)。 2，if-els
Web开发常用手册汇总 aijuans PHP
一门技术，如果没有好的参考手册指导,很难普及大众。这其实就是为什么很多技术，非常好，却得不到普遍运用的原因。正如我们学习一门技术，过程大概是这个样子： ①我们日常工作中，遇到了问题，困难。寻找解决方案，即寻找新的技术； ②为什么要学习这门技术？这门技术是不是很好的解决了我们遇到的难题，困惑。这个问题，非常重要，我们不是为了学习技术而学习技术，而是为了更好的处理我们遇到的问题，才需要学习新的
今天帮助人解决的一个sql问题 asialee sql
今天有个人问了一个问题，如下： type AD value A
意图对象传递数据百合不是茶 android 意图Intent Bundle对象数据的传递
学习意图将数据传递给目标活动; 初学者需要好好研究的 1,将下面的代码添加到main.xml中 <?xml version="1.0" encoding="utf-8"?> <LinearLayout xmlns:android="http:/
oracle查询锁表解锁语句 bijian1013 oracle object session kill
一.查询锁定的表如下语句，都可以查询锁定的表语句一： select a.sid, a.serial#, p.spid, c.object_name, b.session_id, b.oracle_username, b.os_user_name from v$process p, v$s
mac osx 10.10 下安装 mysql 5.6 二进制文件［tar.gz］征客丶 mysql osx
场景：在 mac osx 10.10 下安装 mysql 5.6 的二进制文件。环境：mac osx 10.10、mysql 5.6 的二进制文件步骤：[所有目录请从根“/”目录开始取，以免层级弄错导致找不到目录] 1、下载 mysql 5.6 的二进制文件，下载目录下面称之为 mysql5.6SourceDir；下载地址：http://dev.mysql.com/downl
分布式系统与框架 bit1129 分布式
RPC框架 Dubbo 什么是Dubbo Dubbo是一个分布式服务框架，致力于提供高性能和透明化的RPC远程服务调用方案，以及SOA服务治理方案。其核心部分包含: 远程通讯: 提供对多种基于长连接的NIO框架抽象封装，包括多种线程模型，序列化，以及“请求-响应”模式的信息交换方式。集群容错: 提供基于接
那些令人蛋痛的专业术语白糖_ spring Web SSO IOC
spring 【控制反转(IOC)/依赖注入(DI)】：由容器控制程序之间的关系，而非传统实现中，由程序代码直接操控。这也就是所谓“控制反转”的概念所在：控制权由应用代码中转到了外部容器，控制权的转移，是所谓反转。简单的说：对象的创建又容器(比如spring容器)来执行，程序里不直接new对象。 Web 【单点登录(SSO)】：SSO的定义是在多个应用系统中，用户
《给大忙人看的java8》摘抄 braveCS java8
函数式接口：只包含一个抽象方法的接口 lambda表达式：是一段可以传递的代码你最好将一个lambda表达式想象成一个函数，而不是一个对象，并记住它可以被转换为一个函数式接口。事实上，函数式接口的转换是你在Java中使用lambda表达式能做的唯一一件事。方法引用：又是要传递给其他代码的操作已经有实现的方法了，这时可以使
编程之美-计算字符串的相似度 bylijinnan java 算法编程之美
public class StringDistance { /** * 编程之美计算字符串的相似度 * 我们定义一套操作方法来把两个不相同的字符串变得相同，具体的操作方法为： * 1.修改一个字符（如把“a”替换为“b”）; * 2.增加一个字符（如把“abdd”变为“aebdd”）; * 3.删除一个字符（如把“travelling”变为“trav
上传、下载压缩图片 chengxuyuancsdn 下载
/** * * @param uploadImage --本地路径(tomacat路径) * @param serverDir --服务器路径 * @param imageType --文件或图片类型 * 此方法可以上传文件或图片.txt,.jpg,.gif等 */ public void upload(String uploadImage,Str
bellman-ford(贝尔曼-福特)算法 comsci 算法 F#
Bellman-Ford算法(根据发明者 Richard Bellman 和 Lester Ford 命名)是求解单源最短路径问题的一种算法。单源点的最短路径问题是指：给定一个加权有向图G和源点s，对于图G中的任意一点v，求从s到v的最短路径。有时候这种算法也被称为 Moore-Bellman-Ford 算法，因为 Edward F. Moore zu 也为这个算法的发展做出了贡献。与迪科
oracle ASM中ASM_POWER_LIMIT参数 daizj ASM oracle ASM_POWER_LIMIT 磁盘平衡
ASM_POWER_LIMIT 该初始化参数用于指定ASM例程平衡磁盘所用的最大权值，其数值范围为0~11，默认值为1。该初始化参数是动态参数，可以使用ALTER SESSION或ALTER SYSTEM命令进行修改。示例如下： SQL>ALTER SESSION SET Asm_power_limit=2;
高级排序:快速排序 dieslrae 快速排序
public void quickSort(int[] array){ this.quickSort(array, 0, array.length - 1); } public void quickSort(int[] array,int left,int right){ if(right - left <= 0
C语言学习六指针_何谓变量的地址一个指针变量到底占几个字节 dcj3sjt126com C语言
# include <stdio.h> int main(void) { /* 1、一个变量的地址只用第一个字节表示 2、虽然他只使用了第一个字节表示，但是他本身指针变量类型就可以确定出他指向的指针变量占几个字节了 3、他都只存了第一个字节地址，为什么只需要存一个字节的地址，却占了4个字节，虽然只有一个字节，但是这些字节比较多，所以编号就比较大，
phpize使用方法 dcj3sjt126com PHP
phpize是用来扩展php扩展模块的，通过phpize可以建立php的外挂模块,下面介绍一个它的使用方法,需要的朋友可以参考下安装（fastcgi模式）的时候，常常有这样一句命令：代码如下: /usr/local/webserver/php/bin/phpize 一、phpize是干嘛的？ phpize是什么？ phpize是用来扩展php扩展模块的，通过phpi
Java虚拟机学习 - 对象引用强度 shuizhaosi888 JAVA虚拟机
本文原文链接：http://blog.csdn.net/java2000_wl/article/details/8090276 转载请注明出处！无论是通过计数算法判断对象的引用数量，还是通过根搜索算法判断对象引用链是否可达，判定对象是否存活都与“引用”相关。引用主要分为：强引用(Strong Reference)、软引用(Soft Reference)、弱引用(Wea
.NET Framework 3.5 Service Pack 1（完整软件包）下载地址 happyqing .net 下载 framework
Microsoft .NET Framework 3.5 Service Pack 1（完整软件包） http://www.microsoft.com/zh-cn/download/details.aspx?id=25150 Microsoft .NET Framework 3.5 Service Pack 1 是一个累积更新，包含很多基于 .NET Framewo
JAVA定时器的使用 jingjing0907 java timer 线程定时器
1、在应用开发中，经常需要一些周期性的操作，比如每5分钟执行某一操作等。对于这样的操作最方便、高效的实现方式就是使用java.util.Timer工具类。 privatejava.util.Timer timer; timer = newTimer(true); timer.schedule( newjava.util.TimerTask() { public void run()
Webbench 流浪鱼 webbench
首页下载地址 http://home.tiscali.cz/~cz210552/webbench.html Webbench是知名的网站压力测试工具，它是由Lionbridge公司（http://www.lionbridge.com）开发。 Webbench能测试处在相同硬件上，不同服务的性能以及不同硬件上同一个服务的运行状况。webbench的标准测试可以向我们展示服务器的两项内容：每秒钟相
第11章动画效果（中） onestopweb 动画
index.html <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/
windows下制作bat启动脚本. sanyecao2314 java cmd 脚本 bat
java -classpath C:\dwjj\commons-dbcp.jar;C:\dwjj\commons-pool.jar;C:\dwjj\log4j-1.2.16.jar;C:\dwjj\poi-3.9-20121203.jar;C:\dwjj\sqljdbc4.jar;C:\dwjj\voucherimp.jar com.citsamex.core.startup.MainStart
Java进行RSA加解密的例子 tomcat_oracle java
加密是保证数据安全的手段之一。加密是将纯文本数据转换为难以理解的密文；解密是将密文转换回纯文本。　　数据的加解密属于密码学的范畴。通常，加密和解密都需要使用一些秘密信息，这些秘密信息叫做密钥，将纯文本转为密文或者转回的时候都要用到这些密钥。　　对称加密指的是发送者和接收者共用同一个密钥的加解密方法。　　非对称加密(又称公钥加密)指的是需要一个私有密钥一个公开密钥，两个不同的密钥的
Android_ViewStub 阿尔萨斯 ViewStub
public final class ViewStub extends View java.lang.Object android.view.View android.view.ViewStub 类摘要： ViewStub 是一个隐藏的，不占用内存空间的视图对象，它可以在运行时延迟加载布局资源文件。当 ViewSt