XingHe_XingHe_

[Paper reading notes] 2022_WSDM_Contrastive Meta Learning with Behavior Multiplicity for Recommendation

Paper download link: https://doi.org/10.1145/3488560.3498527
Venue: WSDM
Publish time: 2022
Authors and affiliations:

Wei Wei (1,3), Chao Huang (1,2)∗, Lianghao Xia (1), Yong Xu (3), Jiashu Zhao (4), Dawei Yin (5)
(1) Department of Computer Science, (2) Musketeers Foundation Institute of Data Science, University of Hong Kong
(3) South China University of Technology, (4) Wilfrid Laurier University, (5) Baidu Inc

Datasets (introduced in the main text):

Tmall
IJCAI-Contest
Retail Rocket

Code:

https://github.com/weiwei1206/CML.git (the link given by the authors in the paper)

Brief summary of the contributions: (1) Contrastive Meta Learning

We propose a new multi-behavior learning paradigm, CML, for recommendation. It emphasizes the importance of diverse and multiplex user-item relationships and tackles the label scarcity problem for target behaviors.

In our CML framework, we design a multi-behavior contrastive learning paradigm to capture the transferable user-item relationships from multi-typed user behavior data, which incorporates auxiliary supervision signals into the sparse target behavior modeling.

Furthermore, our proposed meta contrastive encoding scheme allows CML to preserve the personalized multi-behavior characteristics, so that it reflects the diverse behavior-aware user preferences under a customized self-supervised framework.

ABSTRACT

(1) A well-informed recommendation framework can not only help users identify items of interest, but also benefit the revenue of various online platforms (e.g., e-commerce, social media).
(2) Traditional recommendation models usually assume that only a single type of interaction exists between user and item, and fail to model the multiplex user-item relationships from multi-typed user behavior data, such as page view, add-to-favourite and purchase.
(3) While some recent studies propose to capture the dependencies across different types of behaviors, two important challenges have been less explored:
- i) Dealing with the sparse supervision signal under target behaviors (e.g., purchase).
- ii) Capturing the personalized multi-behavior patterns with customized dependency modeling.
(4) To tackle the above challenges, we devise a new model, Contrastive Meta Learning (CML), to maintain dedicated cross-type behavior dependency for different users.
- In particular, we propose a multi-behavior contrastive learning framework to distill transferable knowledge across different types of behaviors via the constructed contrastive loss.
- In addition, to capture the diverse multi-behavior patterns, we design a contrastive meta network to encode the customized behavior heterogeneity for different users.
(5) Extensive experiments on three real-world datasets indicate that our method consistently outperforms various state-of-the-art recommendation methods.
(6) Our empirical studies further suggest that the contrastive meta learning paradigm offers great potential for capturing the behavior multiplicity in recommendation. We release our model implementation at: https://github.com/weiwei1206/CML.git

CCS CONCEPTS

• Information systems → Recommender systems.

KEYWORDS

Collaborative filtering, Self-Supervised Learning, Multi-Behavior Recommendation, Meta Learning, Graph Neural Network

1 INTRODUCTION

(1) Recommender systems have emerged as critical components to alleviate information overload for users in various online applications, e.g., e-commerce [40], online video platforms [46] and social media [30]. The goal is to learn user preference and forecast the items that he or she will consume based on observed user behaviors.
(2) Among various recommendation techniques,
- collaborative filtering (CF) has become the most promising recommendation architecture to model historical user interactions over items [4, 57].
  - Commonly, the core of the existing CF paradigm is to project users and items into a latent representation space such that their interaction structural information is preserved.
  - For example, Autoencoder has been employed as the effective embedding function for the representation projection in AutoRec [33] and CDAE [49].
- To inject the high-order connection signals in CF, another promising research line models user-item interactions as a graph and generates the user/item feature representations with the graph structural information preserved.
  - These models perform message passing over the interaction graph to generate node-level embeddings layer by layer, such as PinSage [53], NGCF [43] and LightGCN [15].
(3) However, the majority of existing recommendation models assume that only a single type of interaction exists between user and item, whereas interactions in practical recommendation scenarios are multiplex in nature [12, 41].
- Taking the online retail platform as an example, users can interact with items in multiple manners, including page view, add-to-favourite and purchase.
- Different types of behaviors may characterize user preference from different intention dimensions and complement each other for better user preference learning [37].
- Therefore, it is challenging but valuable to capture behavior multiplicity and the underlying dependencies in recommendation.
- To address this challenge, existing work models the behavior dependency by introducing different aggregation schemes to integrate type-specific behavior embeddings, so as to enhance the representation of target user behaviors (e.g., customer purchase) [23, 50, 51].
- For example, MATN [50] adopts self-attention to encode the pairwise correlations between different types of behaviors, and makes predictions on the target behaviors.
- In MBGCN [23], a relation-aware embedding propagation layer is developed to learn the behavior multiplicity and to gather multi-behavior interaction information from high-order neighbors.
(4) Despite the effectiveness of existing methods, these studies share two common limitations:
- First, Sparse Supervision Signal under Target Behaviors: most current multi-behavior recommender systems are trained with supervised information in an end-to-end manner.
  - That is to say, forecasting the target user behaviors requires sufficient labeled data corresponding to the target behaviors (e.g., user purchase data).
  - Unfortunately, the observed interactions under the target behavior type are often sparse compared with other types of user-item interactions.
    - For example, the purchase prediction task in online retail systems still faces the challenge of a lack of ground-truth labels [20]. Hence, directly integrating type-specific behavior embeddings will sacrifice performance because supervision signals for the target behaviors are lacking.
- Second, Personalized Multi-Behavior Patterns: multi-behavior patterns may vary by user. The semantics of multi-typed user-item interactions and their mutual relationships are diverse, depending on the personalized characteristics of users [27]. Without considering the diverse user intents that motivate different types of user behaviors, previous modeling of multiplex user-item relationships leads to suboptimal representations.
(5) Contributions.
- Having realized the above challenges for recommendation with behavior multiplicity, we focus on exploring diverse multi-behavior patterns under a contrastive self-supervised learning prototype.
- Towards this end, this work proposes a new model, Contrastive Meta Learning (CML), for multi-behavior recommendation. In CML, we design a multi-behavior contrastive learning framework to capture the cross-type interaction dependency from different behavior views.
- This enables our recommender system to effectively distill additional supervision signals from different types of user behaviors, which augments the model optimization process when supervision labels are sparse.
- Inspired by the recent success achieved by self-supervised representation learning, we leverage the idea of contrastive learning to design a cross-type behavior dependency modeling task with user self-discrimination.
- The goal of our multi-behavior contrastive learning is to reach agreement between a user's type-specific behavior representations via the constructed contrastive loss.
- In addition, to handle the preference diversity of users and capture the personalized multi-behavior patterns, we design a contrastive meta network to characterize the customized behavior heterogeneity, empowering CML to maintain dedicated representations for different users.
- Our meta contrastive encoder first extracts the personalized meta-knowledge from users, and then feeds it into our weighting function for customized multi-behavior dependency modeling.
(6) In a nutshell, this work makes the following contributions:
- We propose a new multi-behavior learning paradigm, CML, for recommendation. It emphasizes the importance of diverse and multiplex user-item relationships and tackles the label scarcity problem for target behaviors.
- In our CML framework, we design a multi-behavior contrastive learning paradigm to capture the transferable user-item relationships from multi-typed user behavior data, which incorporates auxiliary supervision signals into the sparse target behavior modeling.
- Furthermore, our proposed meta contrastive encoding scheme allows CML to preserve the personalized multi-behavior characteristics, so that it reflects the diverse behavior-aware user preferences under a customized self-supervised framework.
- We perform extensive experiments on three real-world recommendation datasets to justify the rationality of our assumptions and the effectiveness of our proposed framework. By comparing CML with 12 baselines, we show that CML is able to consistently improve the performance of different techniques under various settings. An ablation study further demonstrates the effectiveness of the designed sub-modules.

2 PRELIMINARY

We first define $\mathcal{U}$ and $\mathcal{I}$ to represent the sets of users and items, respectively.
In our multi-behavior recommendation scenario, let $\mathcal{X}^{(k)}$ denote the user-item interaction matrix under the $k$-th behavior type (e.g., page view, add-to-favorite, purchase).
Hence, multi-behavior interaction data is represented as $\{\mathcal{X}^{(1)}, ..., \mathcal{X}^{(k)}, ..., \mathcal{X}^{(K)}\}$,
- where $K$ is the number of behavior types.
In particular, the element $x^k_{u, i} = 1$ indicates that user $u$ has previously interacted with item $i$ under behavior type $k$, and $x^k_{u, i} = 0$ otherwise.
Generally, there exists a target behavior that serves as the prediction objective.
Other types of user behaviors serve as the auxiliary behaviors.
- For example, purchases are directly related to Gross Merchandise Value (GMV) in e-commerce services, and are usually considered the target behaviors in various user modeling applications.
- Auxiliary behaviors could be interactions such as page view and add-to-favorite/cart.

2.1 Problem Statement.

The studied task is formally stated as:

Input: observed user-item interactions with $K$ multiplex types of behaviors $\{\mathcal{X}^{(1)}, ..., \mathcal{X}^{(k)}, ..., \mathcal{X}^{(K)}\}$ among users $\mathcal{U}$ and items $\mathcal{I}$.
Output: a predictive function which estimates the likelihood that user $u$ will interact with item $i$ under the target type $(k)$ of behaviors.

2.2 Multi-Behavior Interaction Graph.

Inspired by the representation paradigm of graph collaborative filtering methods [43, 45], we explore the user-item graph structure for our multi-behavior recommendation scenario.
Specifically, given $K$ types of user-item interaction matrices $\{\mathcal{X}^{(1)}, ..., \mathcal{X}^{(k)}, ..., \mathcal{X}^{(K)}\}$, we generate the multi-behavior interaction graph, in which the set of nodes $\mathcal{V} = \mathcal{U} \cup \mathcal{I}$ comprises the user and item sets.
We further define the set of multiplex edges $\mathcal{E}$ to represent observed interactions with $K$ types of behaviors.
In $\mathcal{E}$, an edge $e^k_{u, i}$ between $u$ and $i$ indicates that $x^k_{u, i} = 1$.
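As a minimal sketch of this construction (NumPy only; the function name `build_multiplex_edges` and the toy matrices are illustrative assumptions, not the authors' code), the multiplex edge set can be read directly off the $K$ binary interaction matrices:

```python
import numpy as np

def build_multiplex_edges(interactions):
    """Return the multiplex edge set as (behavior k, user u, item i) triples.

    `interactions` is a list of K binary user-item matrices X^(k), all of
    shape (num_users, num_items). An edge e^k_{u,i} exists iff x^k_{u,i} == 1.
    """
    edges = set()
    for k, x_k in enumerate(interactions):
        users, items = np.nonzero(x_k)
        edges.update((k, int(u), int(i)) for u, i in zip(users, items))
    return edges

# Toy data (hypothetical): 2 users, 3 items, K = 2 behaviors
# (index 0 = page view, index 1 = purchase).
view = np.array([[1, 1, 0],
                 [0, 1, 1]])
purchase = np.array([[0, 1, 0],
                     [0, 0, 1]])
edges = build_multiplex_edges([view, purchase])
```

Storing the behavior index inside each edge keeps a single edge set $\mathcal{E}$ while still allowing the graph to be split per behavior when needed.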

3 METHODOLOGY

We present our Contrastive Meta Learning (CML) framework in this section, which encapsulates customized meta learning in a self-supervised neural architecture for personalized multi-behavior dependency modeling.
The overall model flow is shown in Figure 1. Key components are elaborated in the following subsections.

3.1 Behavior-aware Graph Neural Network

(1) To inject high-order connectivity into the multiplex relation learning across users and items, we first develop a graph-based message passing framework that is aware of the behavior context.
Motivated by the graph-based information propagation neural architecture [55] and the findings of the state-of-the-art LightGCN model [15, 22], our behavior-aware message passing scheme is built over a lightweight graph architecture, in which each node updates its behavior-specific embedding from the embeddings of its neighbors under that behavior:
- $e^{k, (l+1)}_{v} \in R^d$ is defined as the obtained representation of node $v \in \{u, i\}$ under the $l$-th graph neural layer.
- $\mathcal{N}^k_u$ and $\mathcal{N}^k_i$ denote the neighboring nodes of user $u$ and item $i$, respectively.
(2) After encoding the behavior-specific interaction patterns of users, we aggregate the embeddings across the different types of behavior patterns to obtain the user representations (a similar aggregation is applied on the item side):
- The aggregated feature representation $e^{(l+1)}_u$ preserves multi-behavior contextual information.
- $W^l \in R^{d\times d}$ represents the transformation matrix of the $l$-th graph propagation layer.
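The two steps above can be sketched in NumPy as follows. This is an illustration under stated assumptions rather than the authors' implementation: neighbor messages are summed without normalization, the cross-behavior aggregation is a mean over the $K$ views followed by the linear map $W^l$ (the complexity analysis in Section 3.4.3 mentions mean-pooling and linear transformations), and any activation function is omitted. The function name `behavior_aware_layer` is hypothetical.

```python
import numpy as np

def behavior_aware_layer(adjs, user_emb, item_emb, w):
    """One propagation layer over K behavior-specific bipartite graphs.

    adjs:      list of K binary matrices of shape (num_users, num_items)
    user_emb:  list of K arrays (num_users, d), behavior-specific user embeddings
    item_emb:  list of K arrays (num_items, d), behavior-specific item embeddings
    w:         (d, d) transformation matrix W^l of this layer
    """
    new_user, new_item = [], []
    for a_k, eu_k, ei_k in zip(adjs, user_emb, item_emb):
        # Lightweight message passing: sum the embeddings of the neighbors.
        new_user.append(a_k @ ei_k)    # user u gathers from items in N^k_u
        new_item.append(a_k.T @ eu_k)  # item i gathers from users in N^k_i
    # Cross-behavior aggregation (assumed): mean over the K views, then W^l.
    agg_user = np.mean(new_user, axis=0) @ w
    agg_item = np.mean(new_item, axis=0) @ w
    return new_user, new_item, agg_user, agg_item

# Toy usage with hypothetical sizes: 2 users, 3 items, K = 2, d = 4.
rng = np.random.default_rng(0)
adjs = [np.array([[1, 1, 0], [0, 1, 1]]), np.array([[0, 1, 0], [0, 0, 1]])]
d = 4
user_emb = [rng.normal(size=(2, d)) for _ in adjs]
item_emb = [rng.normal(size=(3, d)) for _ in adjs]
new_user, new_item, agg_user, agg_item = behavior_aware_layer(
    adjs, user_emb, item_emb, np.eye(d)
)
```

Stacking this layer $L$ times propagates information from $L$-hop neighbors, which is how the high-order connectivity mentioned above enters the embeddings.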

3.2 Multi-Behavior Contrastive Learning

In our CML framework, we propose a multi-behavior contrastive learning paradigm to capture the complex dependencies across different types of user interactions via a self-supervised principle.
Conceptually, we utilize the contrastive learning strategy for instance discrimination by contrasting positive and negative samples [31, 54].
Our contrastive learning architecture endows our main supervised task (i.e., target behavior prediction) with auxiliary supervision signals from the auxiliary behaviors.

3.2.1 Contrastive View Generation.

(1) In the contrastive learning paradigm, it is important to generate appropriate views so that the method has diverse representations to contrast [5].
(2) In our recommendation scenario with behavior multiplicity, we propose to consider each type of behavior as an individual view, and perform contrastive learning between user embeddings in different behavior views.
(3) Unlike current multi-behavior recommender systems (e.g., MATN [50], MBGCN [23]), which merely rely on behavior-wise embedding combination for target behavior prediction, we conduct data augmentation by incorporating auxiliary behavior contextual information as supervision signals.
(4) This design not only encodes the cross-type behavior dependency, but also alleviates the skewed data distribution across different types of user interaction data.

3.2.2 Behavior-Wise Contrastive Learning Paradigm.

(1) After establishing contrastive views from the multi-behavior context, we further devise a behavior-wise contrastive learning paradigm between the target behaviors and auxiliary behaviors.
- In particular,
  - different behavior views of the same user are considered as positive pairs,
  - and the views of different users are sampled as negative pairs.
- Given the encoded target behavior representation $e^k_u$ from our graph neural architecture, the generated positive and negative pairs are $\{e^k_u, e^{k'}_u \mid u \in \mathcal{U}\}$ and $\{e^k_u, e^{k'}_{u'} \mid u, u' \in \mathcal{U}, u \neq u'\}$.
- The incorporated auxiliary supervision enables our model to still recognize user $u$ from different behavior views (i.e., $k$ and $k'$, with $k, k' \in \{1, ..., K\}$) and captures the latent relationships between the auxiliary behaviors and target behaviors.
- Meanwhile, for different users $u$ and $u'$, the contrastive loss aims to discriminate their behavior embeddings after data augmentation.
(2) Following the works [48, 58], we utilize the InfoNCE [29] loss in our multi-view contrastive learning framework to measure the distance between embeddings.
We define our self-supervised learning loss with the objective of maximizing the Mutual Information (MI) between user representations by contrasting positive pairs with the sampled negative pairs. The InfoNCE-based contrastive loss is calculated from two ingredients:
- Here, we define $\varphi(\cdot)$ as the similarity function (e.g., inner-product or cosine similarity) between two embeddings.
- $\tau$ represents the temperature hyperparameter for the softmax function.
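Written out with these symbols, the standard InfoNCE objective for a target view $k$ and an auxiliary view $k'$ takes the form below. This is a reconstruction from the definitions above rather than a copy of the paper's equation, so the exact set of negatives in the denominator should be treated as an assumption:

$$
\mathcal{L}^{k, k'}_{cl} = \sum_{u \in \mathcal{U}} -\log \frac{\exp\left(\varphi(e^{k}_{u}, e^{k'}_{u}) / \tau\right)}{\sum_{u' \in \mathcal{U}} \exp\left(\varphi(e^{k}_{u}, e^{k'}_{u'}) / \tau\right)}
$$

The numerator rewards agreement between the two views of the same user, while the denominator penalizes similarity to the auxiliary-view embeddings of other users. A smaller $\tau$ sharpens the softmax and puts more weight on the hardest negatives.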
(3) To sum up, we perform the contrastive learning by maximizing the agreement between two behavior views of the same user under the contrastive loss defined above, while enforcing divergence among different users.
We obtain the contrastive loss $\mathcal{L}^{k, k'}_{cl}$ for each pair of target behavior $(k)$ and auxiliary behavior $(k')$.
Therefore, we generate a list of contrastive loss functions, one for each auxiliary behavior.
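A minimal NumPy sketch of such a behavior-wise InfoNCE loss is given below. It assumes in-batch negatives (every other user in the batch acts as a negative), which is a simplification of the sampling described later in the complexity analysis; the function name `info_nce` and the default temperature are illustrative choices, not values from the paper.

```python
import numpy as np

def info_nce(target, auxiliary, tau=0.5, cosine=True):
    """InfoNCE loss between two behavior views of the same batch of users.

    target, auxiliary: arrays of shape (num_users, d); row u of each array is
    the embedding of user u under the target / auxiliary behavior.
    Row-matched pairs are positives; all other rows act as negatives.
    """
    if cosine:
        target = target / np.linalg.norm(target, axis=1, keepdims=True)
        auxiliary = auxiliary / np.linalg.norm(auxiliary, axis=1, keepdims=True)
    logits = target @ auxiliary.T / tau                   # phi(e^k_u, e^k'_u') / tau
    logits = logits - logits.max(axis=1, keepdims=True)   # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.diag(log_prob).sum())

# Toy check: perfectly aligned views give a lower loss than mismatched ones.
rng = np.random.default_rng(0)
e_target = rng.normal(size=(8, 16))
aligned = info_nce(e_target, e_target)
shuffled = info_nce(e_target, e_target[::-1])

# One loss per auxiliary behavior gives the list of contrastive losses.
aux_views = [rng.normal(size=(8, 16)) for _ in range(2)]
loss_list = [info_nce(e_target, e_aux) for e_aux in aux_views]
```

Reversing the rows in the toy check pairs every user with a different user's embedding, so the positives no longer agree and the loss increases.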

3.3 Meta Contrastive Encoding

(1) In our recommendation scenario, different users have various behavior patterns and item interaction preferences.
- For example, some users are likely to purchase most of the products in their favorite item list,
- while others may buy only sporadically, because they add many items of lesser interest to their list [27].
- The diversity of multi-behavior patterns across users results in different item interactions.
(2) Hence, effectively modeling the personalized dependencies across different types of behaviors is also important for making accurate recommendations.
(3) To achieve this goal, we propose a meta contrastive encoding scheme to learn an explicit weighting function for the integration of the multi-behavior contrastive losses.
(4) This module customizes our self-supervised learning paradigm through a diverse integration of contrastive losses.
(5) Our meta contrastive encoding scheme is a two-phase learning paradigm:
- i) We propose a meta-knowledge encoder to capture the personalized multi-behavior characteristics, so as to reflect the diverse behavior-aware user preferences.
- ii) Then, the extracted meta-knowledge is incorporated into our meta weight network to generate customized contrastive loss weights for cross-type behavior dependency modeling.

3.3.1 Meta-Knowledge Encoder.

(1) In our meta contrastive encoding framework, we first extract the meta-knowledge to preserve user-specific behavior dependencies. Inspired by the feature interaction mechanisms in [16, 56], we design two types of meta-knowledge encoder with different integration techniques. Both operate on the learned user behavior representations $e_u$ and $e^{k'}_u$ (the latter under the auxiliary behavior $k'$):
- The encoded meta-knowledge is represented by $Z^{k, k'}_{u, 1}$ and $Z^{k, k'}_{u, 2}$.
- We define $d(\cdot)$ as the duplicate function, which generates a value vector matching the embedding dimensionality.
- $\parallel$ denotes the concatenation operation.
- $\gamma$ is a scale factor used to enlarge the value.
(2) With this design for learning the personalized characteristics, both the auxiliary-target behavior dependency and the user-specific interaction context are preserved in the extracted meta-knowledge.

3.3.2 Meta Weight Network.

(1) After encoding the meta-knowledge with user-specific multi-behavior patterns, we design a weighting function $\xi(\cdot)$ that maps the meta-knowledge to contrastive loss weights.
(2) This module endows our recommendation framework with the capability of learning the multi-behavior relationships in a customized manner, so that it reflects personalized user preferences under various types of behavior intentions.
(3) Formally, we define our weighting function as a transformation layer that applies a projection and a bias to the meta-knowledge, followed by a non-linearity, i.e., $\xi(Z) = \mathrm{PReLU}(Z W_{\xi} + b_{\xi})$:
- where $W_{\xi} \in R^{d\times d}$ and $b_{\xi} \in R^d$ represent the projection layer and bias term, respectively.
- Here, we utilize the PReLU activation function to incorporate non-linearity.
(4) On the basis of our meta weight network, we obtain the personalized contrastive loss weight $\omega^{k, k'}_{u}$ by applying $\xi(\cdot)$ to the encoded meta-knowledge.
For each user $u$, the weight $\omega^{k, k'}_{u}$ represents the customized explicit dependence between the target behavior type $k$ and the auxiliary behavior type $k'$.
Accordingly, with our meta contrastive encoding scheme, we can generate two lists of loss weights: one for the InfoNCE-based self-supervised loss and one for the Bayesian personalized ranking (BPR)-based recommendation objective loss.
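The two phases can be illustrated with the following NumPy sketch. Several parts are assumptions made only for illustration and are not confirmed by these notes: the concrete forms of the two encoders (built here from $d(\cdot)$, $\parallel$ and $\gamma$ as named above), the projection to a single scalar weight per user (the text states $W_{\xi} \in R^{d \times d}$), the fixed PReLU slope, and the sum of the two resulting weights. The function names are hypothetical.

```python
import numpy as np

def prelu(x, alpha=0.25):
    """PReLU with a fixed negative slope alpha (a learned parameter in practice)."""
    return np.where(x > 0, x, alpha * x)

def encode_meta_knowledge(loss, e_u, e_aux, gamma=2.0):
    """Two hypothetical encoders built from d(.), concatenation and gamma.

    loss:  scalar contrastive loss for the (target, auxiliary) behavior pair
    e_u:   (num_users, d) aggregated user embeddings
    e_aux: (num_users, d) user embeddings under the auxiliary behavior
    """
    n, d = e_u.shape
    dup = np.full((n, d), loss * gamma)                 # d(loss), enlarged by gamma
    z1 = np.concatenate([dup, e_aux, e_u], axis=1)      # shape (n, 3d)
    z2 = loss * np.concatenate([e_aux, e_u], axis=1)    # shape (n, 2d)
    return z1, z2

def meta_weight(z, w, b):
    """xi(Z) = PReLU(Z W + b), here projected to one weight per user (assumed)."""
    return prelu(z @ w + b).squeeze(-1)

# Toy usage with hypothetical sizes and random parameters.
rng = np.random.default_rng(0)
n, d = 5, 4
e_u, e_aux = rng.normal(size=(n, d)), rng.normal(size=(n, d))
z1, z2 = encode_meta_knowledge(0.7, e_u, e_aux)
w1, w2 = rng.normal(size=(3 * d, 1)), rng.normal(size=(2 * d, 1))
omega = meta_weight(z1, w1, np.zeros(1)) + meta_weight(z2, w2, np.zeros(1))
```

Each entry of `omega` plays the role of $\omega^{k, k'}_{u}$: a per-user weight that scales how strongly the contrastive loss of one auxiliary behavior contributes to that user's training signal.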

3.4 The Learning Process of CML Framework

In this section, we first introduce our optimization objective and then present the training strategy for our CML framework.
Finally, we provide an analysis of the time complexity of our model.

3.4.1 Optimization Objective.

(1) To learn the parameters of CML, we leverage the Bayesian Personalized Ranking (BPR) loss, which encourages the estimated probability of a user's observed interactions to be higher than that of his or her unobserved counterparts.
(2) Formally, the behavior-specific BPR loss is defined over the following quantities:
- $O_k$ represents the pairwise training samples of the $k$-th behavior type,
- i.e., $O_k = \{(u, i^+, i^-) \mid (u, i^+) \in \mathcal{R}^+, (u, i^-) \in \mathcal{R}^-\}$.
  - Here, $\mathcal{R}^+$ and $\mathcal{R}^-$ denote the observed and unobserved interactions of user $u$, respectively.
- $\Theta$ represents the learnable parameters, and $L_2$ regularization is applied to alleviate overfitting.
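In its standard form, BPR sums $-\ln \sigma(\hat{x}_{u, i^+} - \hat{x}_{u, i^-})$ over the triples in $O_k$ and adds an $L_2$ penalty on $\Theta$. The sketch below implements that standard form in NumPy; the inner-product score $\hat{x}$, the regularization weight `reg`, and the function name `bpr_loss` are assumptions for illustration, since the notes do not show the paper's exact equation.

```python
import numpy as np

def bpr_loss(user_emb, item_emb, triples, reg=1e-4):
    """Standard BPR loss over (u, i_pos, i_neg) triples.

    Scores are inner products of user and item embeddings (an assumption of
    this sketch); `reg` weights the L2 penalty on all embeddings.
    """
    u, pos, neg = np.asarray(triples).T
    pos_score = np.sum(user_emb[u] * item_emb[pos], axis=1)
    neg_score = np.sum(user_emb[u] * item_emb[neg], axis=1)
    # -ln(sigmoid(x)) equals log(1 + exp(-x)); logaddexp computes it stably.
    ranking = np.logaddexp(0.0, -(pos_score - neg_score)).sum()
    l2 = reg * (np.sum(user_emb ** 2) + np.sum(item_emb ** 2))
    return float(ranking + l2)

# Toy check: ranking observed items above unobserved ones lowers the loss.
user_emb = np.array([[1.0, 0.0], [0.0, 1.0]])
item_emb = np.array([[2.0, 0.0], [0.0, 2.0]])
good = bpr_loss(user_emb, item_emb, [(0, 0, 1), (1, 1, 0)], reg=0.0)
bad = bpr_loss(user_emb, item_emb, [(0, 1, 0), (1, 0, 1)], reg=0.0)
```

In the toy check, `good` uses triples whose positive item matches the user's embedding direction, so the score gap is positive and the loss is small; `bad` swaps positives and negatives.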

3.4.2 Model Training.

(1) In this work, we follow the training strategy of meta-learning methods in previous work [9, 34] and update the parameters of our graph neural architecture (represented as $\mathcal{G}(\mathcal{A}; \Theta_{\mathcal{G}})$) and of our multi-behavior contrastive meta network (represented as $\mathcal{M}((\mathcal{L}, E, E^k); \Theta_{\mathcal{M}})$) in an alternating way.
- Here, $\mathcal{A}$ denotes the input adjacency matrix of the behavior-aware user-item interaction graph.
- $E$ and $E^k$ represent the learned cross-type and behavior-specific embedding matrices of all users, respectively.
(2) The model training consists of three phases in an optimization loop, designed to improve the training efficiency of our model. In particular:
- i) In the first stage, we integrate the behavior-aware graph neural network (with cloned state) and the contrastive meta network, to learn the initial parameter space of our multi-behavior contrastive encoder over the entire training data.
- ii) In the second stage, we refine the model parameters $\Theta_{\mathcal{M}}$ of our contrastive meta network based on the meta data.
- iii) After generating the personalized contrastive loss weights, we leverage the updated $\Theta_{\mathcal{M}}$ to improve the parameters $\Theta_{\mathcal{G}}$ of our graph neural network.
(3) These three stages form a nested optimization process that is repeated over training batches of size $B$.

3.4.3 Model Complexity Analysis.

(1) We analyze the complexity of our CML framework in terms of several key components:
- i) The computational cost of our lightweight graph neural architecture is $O(L \times K \times |\mathcal{R}^{k+}| \times d)$ for performing message passing across graph layers.
  - $|\mathcal{R}^{k+}|$ represents the number of non-zero elements in the adjacency matrix under behavior $k$,
  - and $L$ denotes the number of information propagation layers.
  - The linear transformations and mean-pooling for multi-behavior aggregation take $O(L \times (N + M) \times d \times (K + d))$ time, where $N$ and $M$ presumably denote the numbers of users and items.
- ii) Our meta contrastive encoder takes $O(|\mathcal{R}^{k+}| \times d^2)$ time.
- iii) The cost of the InfoNCE-based mutual information calculation is $O(B \times d)$ for the numerator and $O(B \times S \times d)$ for the denominator (in Equation 3).
  - Here, $S$ is the sampling size of contrastive learning, which reduces the time complexity and increases the randomness to achieve model robustness [44].
  - Therefore, our multi-behavior contrastive learning paradigm takes $O(|\mathcal{R}^{k+}| \times S \times d)$ time per epoch.
(2) In conclusion, our model achieves time complexity comparable to state-of-the-art multi-behavior recommendation techniques (e.g., MBGCN, EHCF).
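To get a feel for the relative size of these terms, the expressions can be evaluated for concrete sizes. The helper below simply plugs numbers into the asymptotic formulas listed above and ignores constant factors; the function name `cml_cost_terms` and all sizes in the example call are made up for illustration and are not statistics of the paper's datasets.

```python
def cml_cost_terms(L, K, nnz, d, N, M, B, S):
    """Evaluate the asymptotic cost terms listed above (constants ignored).

    nnz stands for |R^{k+}|, the number of non-zero adjacency entries.
    """
    return {
        "message_passing": L * K * nnz * d,
        "aggregation": L * (N + M) * d * (K + d),
        "meta_encoder": nnz * d ** 2,
        "infonce_numerator_per_batch": B * d,
        "infonce_denominator_per_batch": B * S * d,
        "contrastive_per_epoch": nnz * S * d,
    }

# Hypothetical sizes: 2 layers, 4 behaviors, 1M interactions, d = 16.
terms = cml_cost_terms(L=2, K=4, nnz=1_000_000, d=16,
                       N=30_000, M=30_000, B=512, S=10)
```

With these illustrative sizes, the terms that scale with the number of interactions are larger than the aggregation term, which scales with the number of nodes.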

4 EVALUATION

To evaluate CML's performance, we conduct experiments on several real-world datasets to answer the following research questions:

RQ1: How effective is the developed CML framework at tackling the behavior multiplicity in recommendation?
RQ2: How do different modules, such as the multi-behavior contrastive learning paradigm and the meta contrastive encoder, contribute to the performance of CML?
RQ3: How well does CML alleviate interaction data sparsity compared with state-of-the-art methods?
RQ4: How do different hyperparameter settings affect CML?
RQ5: How interpretable is our CML model?

4.1 Experimental Settings

4.1.1 Datasets.

(1) We evaluate the effectiveness of our proposed CML on three publicly available recommendation datasets.
(2) We present the statistical information in Table 1.
(3) Tmall: This dataset is collected from the Tmall site, one of the largest e-commerce platforms in China.
- The user behavior data contains various interactions: Page View, Add-to-Favorite, Add-to-Cart and Purchase.
- Following the setting in [50], we keep users with at least three purchases for training and testing.
(4) IJCAI-Contest: This dataset was adopted in the IJCAI15 Challenge and comes from a business-to-customer retail system. It shares the same behavior types as the Tmall data, which reflect various user intentions over items.
(5) Retailrocket: This is another benchmark dataset, collected from the Retailrocket recommender system.
- In this dataset, user interactions consist of Page View, Add-to-Cart and Transaction.
- Following previous works on multi-behavior recommendation [23, 50], purchase behaviors are set as the target behaviors and other types of interactions are treated as auxiliary behaviors.
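
The preprocessing described above (keeping users with at least three purchases, then separating the target behavior from the auxiliary ones) can be sketched as follows. The `(user, item, behavior)` tuple layout, the behavior labels, and both function names are assumptions for illustration, not the datasets' actual schema.

```python
from collections import defaultdict


def filter_users_by_target(interactions, target="purchase", min_target=3):
    """Keep only interactions of users with at least `min_target` target-behavior records."""
    target_counts = defaultdict(int)
    for user, _item, behavior in interactions:
        if behavior == target:
            target_counts[user] += 1
    kept_users = {u for u, c in target_counts.items() if c >= min_target}
    return [row for row in interactions if row[0] in kept_users]


def split_by_behavior(interactions, target="purchase"):
    """Separate target-behavior pairs from auxiliary-behavior pairs (grouped by type)."""
    target_pairs = []
    auxiliary_pairs = defaultdict(list)
    for user, item, behavior in interactions:
        if behavior == target:
            target_pairs.append((user, item))
        else:
            auxiliary_pairs[behavior].append((user, item))
    return target_pairs, dict(auxiliary_pairs)


# Toy log: user "u1" has three purchases, user "u2" only one.
log = [
    ("u1", "i1", "page_view"), ("u1", "i1", "purchase"),
    ("u1", "i2", "add_to_cart"), ("u1", "i2", "purchase"),
    ("u1", "i3", "purchase"),
    ("u2", "i1", "page_view"), ("u2", "i1", "purchase"),
]
filtered = filter_users_by_target(log)
target_pairs, auxiliary_pairs = split_by_behavior(filtered)
```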

4.1.2 Baselines.

We compare our CML with the following state-of-the-art methods from two groups: Single-Behavior and Multi-Behavior recommender systems. These methods leverage various techniques to improve the recommendation performance:

4.1.2.1 Single-Behavior Recommendation Methods:

BPR [32]: It is a widely adopted matrix factorization model optimized with the Bayesian personalized ranking criterion.
PinSage [53]: This method defines importance-based neighboring nodes to perform the graph convolution.
- In PinSage, the message passing paths are constructed through random walks.
NGCF [43]: It is a representative graph neural framework that captures collaborative effects in the user embedding function with a convolutional message passing scheme.
LightGCN [15]: It simplifies the graph convolution network-based recommendation architecture by removing the feature transformation and nonlinear activation operations.
SGL [48]: This method performs self-supervised learning over the user-item interaction graph with data augmentation from different views (e.g., node and edge dropout). The integrated auxiliary task is based on node self-discrimination.
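
Since the Bayesian personalized ranking criterion is used by the BPR baseline and reappears later as CML's ranking objective, a minimal sketch of the standard pairwise BPR loss for a single (user, positive item, negative item) triple may help. The function names and the dot-product scoring are illustrative assumptions; regularization is omitted.

```python
import math


def score(user_vec, item_vec):
    """Preference score as the inner product of user and item embeddings."""
    return sum(u * i for u, i in zip(user_vec, item_vec))


def bpr_loss(pos_score, neg_score):
    """Pairwise BPR loss: -log(sigmoid(pos_score - neg_score)), computed stably."""
    diff = pos_score - neg_score
    if diff >= 0:
        return math.log1p(math.exp(-diff))
    return -diff + math.log1p(math.exp(diff))


user = [1.0, 0.5]
interacted_item = [1.0, 1.0]   # observed (positive) item
sampled_item = [-1.0, 0.0]     # unobserved (negative) item
loss = bpr_loss(score(user, interacted_item), score(user, sampled_item))
```

The loss shrinks as the observed item is scored further above the sampled one, which is the ranking behavior the criterion optimizes for.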

4.1.2.2 Multi-Behavior Recommendation Models:

NMTR [11]: It combines the multi-task learning framework with neural collaborative filtering to model multi-typed user interaction behaviors based on predefined cascading relationships.
MATN [50]: It adopts the attention mechanism for multi-behavior recommendation.
- Specifically, it uses memory-enhanced self-attention to measure the influence between different behaviors.
- The number of memory units is tuned from the range of [2,8].
MBGCN [23]: This approach is a GCN-based model that captures multi-behavioral patterns over the constructed user-item interaction graph.
- The high-order connectivity is considered during the information propagation.
KHGT [51]: This approach leverages a transformer to incorporate temporal information into multi-behavior modeling, and differentiates the behaviors with a graph attention network.
EHCF [2]: It conducts knowledge transfer among heterogeneous user feedback to correlate behavior dependencies. A new loss is used to optimize the model from positive-only data.

We further compare our CML with two state-of-the-art heterogeneous graph neural networks, applying them to capture the heterogeneous behavior relations in recommendation.

HGT [17]: This graph transformer models heterogeneous relations in graphs. We adopt the heterogeneous message passing schema to encode the multiplex behaviors with dedicated representations.
HeCo [44]: It is a recently developed heterogeneous graph neural network based on the cross-view supervised learning architecture. We generate the meta-path relations from our multi-behavior interaction graph.

4.1.3 Hyperparameters and Metrics.

(1) We implement our CML with PyTorch.
The embedding initialization is performed with Xavier [14], and the model is optimized with the AdamW optimizer [26] and the Cyclical Learning Rate (CyclicLR) strategy [35].
Specifically, the base and max learning rates are searched from $\{0.6 \times 10^{-4}, 1 \times 10^{-4}, 1 \times 10^{-3}\}$ and $\{0.6 \times 10^{-3}, 1 \times 10^{-3}, 2 \times 10^{-3}, 5 \times 10^{-3}\}$, respectively.
For all graph-based baselines, the number of graph-based message propagation layers is tuned from {1,2,3,4}.
We apply L2 regularization to the learned embeddings, with the weight tuned from $\{1 \times 10^{-3}, 5 \times 10^{-3}, 1 \times 10^{-2}\}$.
Additionally, to alleviate overfitting, dropout is used in our designed meta network.
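
To illustrate how a base and a max learning rate interact under a cyclical schedule, here is a small sketch of the triangular cyclical policy in plain Python. It is not taken from the CML code: the notes do not state which CyclicLR mode or step size the authors used, so `step_size` and the triangular shape are assumptions, and the two rates are simply values picked from the search ranges above.

```python
import math


def triangular_lr(iteration, base_lr, max_lr, step_size):
    """Triangular cyclical learning rate.

    The rate climbs linearly from base_lr to max_lr over `step_size`
    iterations, then falls back to base_lr over the next `step_size`.
    """
    cycle = math.floor(1 + iteration / (2 * step_size))
    x = abs(iteration / step_size - 2 * cycle + 1)
    return base_lr + (max_lr - base_lr) * max(0.0, 1.0 - x)


base_lr, max_lr = 1e-4, 1e-3   # one combination from the searched ranges
step_size = 10                 # hypothetical half-cycle length in iterations
schedule = [triangular_lr(t, base_lr, max_lr, step_size) for t in range(41)]
```

In PyTorch the same idea is available as `torch.optim.lr_scheduler.CyclicLR`. When pairing it with an Adam-style optimizer such as AdamW, passing `cycle_momentum=False` is the safe choice, since some PyTorch versions raise an error when momentum cycling is requested for an optimizer without a `momentum` option.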
(2) We adopt the widely used leave-one-out strategy, generating the test set from each user's last interacted item under the target behavior type (i.e., purchase/transaction).
Two representative evaluation metrics are used for performance comparison:
- NDCG (Normalized Discounted Cumulative Gain)
- and HR (Hit Ratio).
We also run our CML model and the best-performing baseline method 10 times to calculate p-values for significance analysis.
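
Under leave-one-out evaluation each test user has exactly one held-out item, so HR and NDCG reduce to simple functions of that item's rank. The sketch below shows the usual top-$K$ definitions. The cutoff `k`, the candidate lists, and the function names are assumptions, since this section does not state the cutoff or how candidates are drawn.

```python
import math


def rank_of_target(scores, target_index):
    """0-based rank of the held-out item: how many candidates score strictly higher."""
    target_score = scores[target_index]
    return sum(1 for s in scores if s > target_score)


def hit_ratio_at_k(rank, k):
    """1 if the held-out item appears in the top-k list, else 0."""
    return 1.0 if rank < k else 0.0


def ndcg_at_k(rank, k):
    """With a single relevant item the ideal DCG is 1, so NDCG = 1 / log2(rank + 2)."""
    return 1.0 / math.log2(rank + 2) if rank < k else 0.0


def evaluate(all_scores, target_indices, k=10):
    """Average HR@k and NDCG@k over users (one score list and one target per user)."""
    ranks = [rank_of_target(s, t) for s, t in zip(all_scores, target_indices)]
    hr = sum(hit_ratio_at_k(r, k) for r in ranks) / len(ranks)
    ndcg = sum(ndcg_at_k(r, k) for r in ranks) / len(ranks)
    return hr, ndcg


# Two toy users with three candidate items each; index marks the held-out item.
hr, ndcg = evaluate([[0.9, 0.1, 0.5], [0.2, 0.8, 0.4]], [2, 1], k=2)
```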

4.2 Performance Comparison (RQ1)

We present the detailed evaluation results of all methods on different datasets in Table 2, where the results of our CML and the best-performing baselines are highlighted in bold and underlined, respectively. Key observations are as follows:

CML consistently outperforms all types of baselines on the three datasets. The p-values are much less than 0.05, which indicates statistically significant improvements of our method over the baselines. We attribute the significant performance improvements to the following two reasons:
- (1) Through the meta contrastive network, CML captures the multi-behavior dependencies in a customized manner;
- (2) The designed contrastive learning paradigm incorporates auxiliary self-supervised signals from different types of behavior dimensions, which offers informative gradients to the graph-based collaborative filtering architecture.
Multi-behavior recommendation approaches (e.g., MBGCN, EHCF, KHGT) yield better performance than single-behavior recommendation methods (e.g., NGCF, LightGCN, PinSage), which shows the benefit of incorporating multi-behavioral information into user preference modeling.
- Among the various multi-behavior recommendation models, EHCF is the best baseline in most cases.
- This observation indicates that incorporating the different behavior semantics with supervision labels is able to guide the model optimization.
- Additionally, unlike the topology-based self-supervised method SGL, our CML designs a new contrastive learning paradigm to fit multi-behavior recommendation.
CML outperforms heterogeneous graph neural networks (i.e., HGT and HeCo) by a large margin in all cases, verifying that our designed meta contrastive network endows heterogeneous collaborative filtering with the capability of effectively encoding relation heterogeneity.

4.3 Ablation and Effectiveness Analyses (RQ2)

To shed light on the performance improvement, we further conduct an ablation study of our CML to justify the rationality of the designed key components. Analysis details are summarized as:

Effect of multi-behavior contrastive learning framework. We first aim to answer the question: is it beneficial to integrate behavior-wise dependency under a contrastive learning prototype for CML?
- Towards this end, we generate a model variant CML(w/o)-CLF by disabling the contrastive learning between the target and auxiliary user behaviors.
- Instead, we only rely on the behavior-aware graph neural network to capture the behavior relationships. We present the evaluation results in Table 3 with the following key summaries:
- (1) CML always outperforms CML(w/o)-CLF. This suggests the effectiveness of our contrastive learning paradigm in capturing the complex dependent relations across different types of behaviors.
- (2) This design also mitigates the effect of skewed data distribution in the multi-behavior data, and effectively transfers knowledge from different behavior views.
Effect of meta contrastive network. To investigate whether the meta contrastive network benefits the multi-behavior dependency modeling, we propose another variant CML(w/o)-MCN, which only conducts the contrastive learning between type-specific behavior embeddings based on the estimated mutual information.
- In other words, cross-behavior contrastive loss functions are integrated with the BPR-based loss using equal weights, i.e., without explicitly differentiating the influence degrees under the augmented self-supervised learning tasks.
- Clearly, CML obtains better performance than CML(w/o)-MCN. It suggests that by employing the meta contrastive network, we can automatically discriminate the influence between different target-auxiliary behavior pairs. The cross-view behavior dependencies can complement each other.
Effect of meta knowledge encoder. To verify the impact of the meta knowledge encoder in our contrastive learning framework, we conduct an ablation study (with variant CML(w/o)-MKE) by disabling the meta contrastive weight network $M(\cdot)$.
- Instead, we use a weighted gating mechanism to aggregate the behavior-specific contrastive loss in a uniform manner. Removing our meta knowledge degrades the performance, suggesting the necessity of our customized contrastive learning for different types of target-auxiliary behavior dependency.

4.4 Model Performance on Alleviating Interaction Data Sparsity (RQ3)

(1) In this section, we aim to show the rationality of bringing contrastive learning into multi-behavior recommendation, so as to alleviate the data sparsity issue.
- In Figure 2, we compare the evaluation results with respect to different interaction sparsity degrees on the Tmall data.
- Due to the space limit, we select several representative baselines for comparison. Specifically, we split users into six groups in terms of the number of their interactions (e.g., "<7" and "<60"). The reported model performance, measured by HR and NDCG (shown on the right-hand y-axis of Figure 2), is averaged over all users in each group. The total number of users belonging to each group is shown on the left-hand side of Figure 2.
(2) We have the following findings:
- i) The recommendation accuracy improves for all compared methods as the number of user interactions increases. This is reasonable, since high-quality behavior embeddings are more likely to be learned from sufficient user behaviors.
- ii) Compared with the vanilla collaborative filtering model (NGCF), multi-behavior recommender systems (e.g., KHGT, MBGCN) achieve better performance, suggesting the effectiveness of incorporating multi-typed behavior context for data sparsity alleviation.
- iii) CML consistently outperforms other multi-behavior recommendation methods under different interaction degrees. This observation indicates that CML handles the data sparsity issue better, by embracing the self-supervised contrastive learning paradigm to preserve the behavior heterogeneity in recommendation.
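
The grouping protocol behind Figure 2 (bucket users by interaction count, then average a per-user metric within each bucket) can be sketched as follows. Only the "<7" and "<60" labels come from the text; the other bounds, the overflow bucket for users at or above the last bound, and the function name are illustrative assumptions.

```python
from bisect import bisect_right


def average_metric_by_group(interaction_counts, per_user_metric, upper_bounds):
    """Average a per-user metric inside sparsity groups.

    Group i holds users whose interaction count is below upper_bounds[i]
    (and not below any earlier bound); one extra overflow group collects
    users at or above the last bound. Returns (means, group_sizes).
    """
    n_groups = len(upper_bounds) + 1
    sums = [0.0] * n_groups
    sizes = [0] * n_groups
    for user, count in interaction_counts.items():
        group = bisect_right(upper_bounds, count)
        sums[group] += per_user_metric[user]
        sizes[group] += 1
    means = [s / n if n else None for s, n in zip(sums, sizes)]
    return means, sizes


counts = {"a": 3, "b": 6, "c": 7, "d": 100}       # interactions per user (toy data)
hits = {"a": 1.0, "b": 0.0, "c": 1.0, "d": 0.5}   # e.g. a per-user HR value
means, sizes = average_metric_by_group(counts, hits, upper_bounds=[7, 60])
```

Reporting the group sizes next to the averaged metric, as Figure 2 does, makes it clear how many users each bar actually represents.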

4.5 Hyperparameter Analysis on CML (RQ4)

This section examines the impact of different settings of several key hyperparameters in our proposed CML framework, including the number of graph propagation layers $L$, the representation dimensionality $d$, and the batch size in the training process. Figure 3 reports the evaluation results.
We investigate the effect of one hyperparameter at a time and keep the other parameters at their default settings.

4.5.1 # graph propagation layers $L$.

From Figure 3, we can observe that more graph propagation layers result in better performance when $L \le 3$.
- This suggests that more message passing layers capture latent dependencies from high-order neighbors.
- However, stacking even more graph layers might introduce noise into the user representations, which leads to the oversmoothing issue [3, 28].

4.5.2 Representation dimensionality $d$.

Our model can achieve good performance with the embedding dimensionality $16 \le d \le 32$. It indicates that our CML can boost the performance with a small hidden state dimensionality. This can be attributed to effectively enhancing the user-item interaction learning with multiplex relationships.

4.5.3 Batch size in learning process.

We search the batch size for our meta contrastive network (meta batch) and the graph neural architecture (train batch) from the range of {128, 256, 512, 1024, 2048} and {256, 512, 1024, 2048, 4096}, respectively.
A darker color signals better performance in Figure 3 (c).
Model performance improves when the sampled batch size of the meta network is smaller than that of the base graph network.
This configuration improves the cooperation between our augmented self-supervised learning task and the BPR-based ranking objective.

4.6 Qualitative Evaluation (RQ5)

In this section, we perform a qualitative evaluation to show the model interpretation with the learned meta contrastive weights across different behavior types. We also visualize the projected behavior embeddings to better understand the agreement achieved between type-specific behavior embeddings.

4.6.1 Meta Contrastive Weight Visualization.

We visualize the learned meta contrastive weights $\omega^{k, k'}_u$ for each behavior pair $(k, k')$ from several sampled users.
The customized contrastive weights can be observed in Figure 4 (a), which reflect the personalized multi-behavior interaction patterns of different users.
Each $\omega^{k, k'}_u$ value indicates the weight of the individual contrastive loss between the target and auxiliary behavior views.
- For example, for the user with id 27310, the learned weights for the constructed view-buy and favorite-buy contrastive losses are 0.243 and 0.595, respectively.
- This suggests that this user is more likely to place an order after adding products to the favorite list than after viewing their pages.
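
As a small illustration of what such per-user weights do, the sketch below combines per-pair contrastive losses with the two weights quoted for user 27310 and compares the result with an equal-weight combination. The loss values and the function name are hypothetical, and the sketch leaves out how the meta network actually produces the weights.

```python
def combine_contrastive_losses(pair_losses, pair_weights=None):
    """Weighted sum of per-pair contrastive losses; equal weights (1.0) if none are given."""
    if pair_weights is None:
        pair_weights = {pair: 1.0 for pair in pair_losses}
    return sum(pair_weights[pair] * loss for pair, loss in pair_losses.items())


# Weights reported for user 27310 in the text above.
weights_user_27310 = {"view-buy": 0.243, "favorite-buy": 0.595}
# Hypothetical per-pair contrastive losses for that user.
losses = {"view-buy": 1.2, "favorite-buy": 0.8}

personalized = combine_contrastive_losses(losses, weights_user_27310)
equal_weighted = combine_contrastive_losses(losses)
```

With the learned weights, the favorite-buy term contributes more to this user's objective than the view-buy term even though its raw loss is smaller, which is the kind of personalization the visualization is meant to show.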

4.6.2 Embedding Visualization.

We further show the visualization (2-D projection with t-SNE [38]) of user behavior embeddings encoded by CML and w/o-CLF, respectively, on the IJCAI-Contest data.
In particular, we use different colors to represent different types of behaviors, i.e., red: page view, blue: add-to-favorite, black: add-to-cart, green: purchase.
From Figure 4 (b), we observe the embedding agreement achieved by our CML.
This again justifies the effectiveness of our CML in alleviating the data scarcity issue through knowledge transfer across different types of behaviors, under our contrastive self-supervised learning architecture.

5 RELATED WORK

5.1 Graph-based Recommendation Models

Recent studies have demonstrated the promising results offered by GNN-based recommendation models, which use different information propagation functions to aggregate embeddings over neighbors [1, 7, 15, 18, 19, 36].
For example, by stacking multiple embedding propagation layers, NGCF [43] can gather information from neighboring nodes with high-order connectivity.
To address the burdensome design of GCN-based message passing in NGCF, LightGCN [15] omits the weight matrix and utilizes a sum-based pooling operation to obtain better recommendation performance.
Additionally, to differentiate relations in recommendation, attention-based aggregation functions have been designed for fusing various information in recommender systems, such as social influence [8, 21, 36], knowledge graph embedding [24, 42], and textual information [47].
Specifically, GraphRec [8] discriminates the influence between users using a graph-based attention mechanism.
Wu et al. [47] develop an attentional graph neural paradigm to enhance the user and item representations with textual information.
Motivated by the above research works, our contrastive meta learning framework is built over the graph neural network to capture the behavior-aware collaborative effects between users and items.

5.2 Multi-Behavior Recommender Systems

Under multi-typed user-item interactions, some recent works attempt to design effective approaches for handling behavior multiplicity [2, 23, 50–52].
In particular, the behavior-wise relationships are characterized by the attention mechanism in [50, 51].
MBGCN [23] learns discriminative behavior representations using a graph convolutional network.
MATN [50] considers the influences among different types of interactions with attentive weights for pattern aggregation.
However, most of them are not designed with sparse behavior data in mind.
To fill this gap, we propose a new model with contrastive learning at the behavior semantic level, which provides auxiliary informative supervision signals for knowledge transfer between behavior types.

5.3 Contrastive Representation Learning

Self-supervised learning techniques have been demonstrated to be effective in learning representations from both image data [6] and textual data [10].
Contrastive learning aims to learn high-quality discriminative representations by contrasting positive and negative samples from different views.
For visual data, different data augmentation strategies (e.g., rotation [13], color distortion [5]) are used to generate negative instances.
To better represent graph topological structures, Deep Graph InfoMax (DGI) [39] aims to maximize the mutual information between node embeddings and graph representations based on the original and corrupted graphs.
In addition, a model-agnostic recommendation model SGL [48] has been proposed to augment the supervised task of recommendation with auxiliary tasks.
- It performs dropout operations over the graph connection structures with different strategies, i.e., node dropout, edge dropout and random walk.
SMIN [25] is a social-aware recommendation method with generative self-supervision.
Inspired by the existing contrastive learning paradigms, this work proposes a new graph contrastive representation framework with adaptive multi-behavior modeling, by exploring various semantic aspects of user-item interactions.

6 CONCLUSION

(1) In this paper, we develop a novel multi-behavior contrastive meta learning framework for recommendation.
- Our model learns user representations by preserving behavior heterogeneous context with the agreement between behavior views constructed from our contrastive learning paradigm.
- The behavior-aware graph neural architecture with multi-behavior self-supervision brings benefits to heterogeneous relational learning for recommendation.
- We perform comprehensive experiments on several real-world datasets to demonstrate the effectiveness of our proposed CML method, comparing it with various state-of-the-art methods.
(2) In this paper, we take the initial step to capture the diverse multi-behavior patterns of users for recommendation under the self-supervised learning paradigm.
- In the future, it would be interesting to explore a pre-training strategy for our CML in online user modeling applications (e.g., user profiling).
- Additionally, another meaningful future research direction is to extend our framework to learn disentangled representations of users, which could reflect multi-dimensional user interests.

你可能感兴趣的:(Recommendation,深度学习,推荐系统,人工智能,数据挖掘)

Python爬虫实战：全方位爬取知乎学习板块问答数据 Python爬虫项目 2025年爬虫实战项目 python 爬虫学习开发语言 scrapy 游戏
1.项目背景与爬取目标知乎是中国最大的知识问答社区，聚集了大量高质量的学习资源和经验分享。爬取知乎“学习”板块的问答数据，可以为学习资料整理、舆情分析、推荐系统开发等提供数据支持。本项目目标：爬取“学习”话题下的热门问答列表抓取每个问答的标题、作者、回答内容、点赞数、评论数等详细信息实现动态加载内容的抓取，包含图片和富文本避免被反爬机制限制，保证数据采集稳定结合数据分析，为后续应用打基础2.知乎“
【机器学习&深度学习】反向传播机制
目录一、一句话定义二、类比理解三、为什重要？四、用生活例子解释：神经网络=烹饪机器人4.1第一步：尝一口（前向传播）4.2第二步：倒着推原因（反向传播）五、换成人工智能流程说一遍六、图示类比：找山顶（最优参数）七、总结一句人话八、PyTorch代码示例：亲眼看到每一层的梯度九、梯度=损失函数对参数的偏导数十、类比总结反向传播（Backpropagation）是神经网络中训练过程的核心机制，它就像“
人脸识别算法赋能园区无人超市安防升级智驱力人工智能算法人工智能边缘计算人脸识别智慧园区智慧工地智慧煤矿
人脸识别算法赋能园区无人超市安防升级正文在园区无人超市的运营管理中，传统安防手段依赖人工巡检或基础监控设备，存在响应滞后、误报率高、环境适应性差等问题。本文从技术背景、实现路径、功能优势及应用场景四个维度，阐述如何通过人脸识别检测、人员入侵算法及疲劳检测算法的协同应用，构建高效、精准的智能安防体系。一、技术背景：视觉分析算法的核心支撑人脸识别算法基于深度学习的卷积神经网络（CNN）模型，通过提取面
Python 数据挖掘实战：关联规则与聚类分析，解锁数据价值的钥匙清水白石008 python Python题库 python 数据挖掘动画
Python数据挖掘实战：关联规则与聚类分析，解锁数据价值的钥匙引言在数字化浪潮席卷全球的今天，数据已成为企业和组织最重要的战略资产。海量数据蕴藏着巨大的价值，等待我们去挖掘和发现。数据挖掘(DataMining)，作为从海量数据中提取有价值知识和模式的关键技术，正日益受到各行各业的重视。它如同探矿者的火眼金睛，能够穿透数据的迷雾，发现隐藏在背后的规律和趋势，为商业决策、科学研究和社会发展提供强有
潜入思维的海洋：SoftCoT++如何让语言模型更聪明步子哥智能涌现语言模型人工智能自然语言处理
在人工智能的浩瀚星空下，大型语言模型（LLMs）如同一颗颗璀璨的恒星，照亮了从文本生成到复杂推理的广阔领域。然而，这些模型在推理任务中往往像是在迷雾中航行——尽管它们能抵达目的地，却常常因为固定的思维路径而错过更优的航线。2025年5月，一篇题为《SoftCoT++:Test-TimeScalingwithSoftChain-of-ThoughtReasoning》的论文如同一盏明灯，照亮了如何让
BI+AI实战：我们如何用3秒完成车企供应链推演 qq_43696218 人工智能
一、BI+AI引领财务分析新纪元在财务数据分析领域，奥威BI+AI正以革命性的姿态颠覆传统。当金蝶、用友等工具仍深陷报表泥潭时，奥威BI+AI通过深度融合商业智能（BI）与人工智能（AI），实现了从滞后报表到实时洞察的飞跃。这不仅极大地提升了财务分析的效率，更为企业的战略决策提供了前所未有的精准支持。二、BI+AI的核心技术优势‌实时动态分析‌o奥威BI+AI摒弃了静态数据集，依托原始科目余额表实
DeepSeek-V3 通俗详解：从诞生到优势，以及与 GPT-4o 的对比码事漫谈 AI ai
前些天发现了一个巨牛的人工智能学习网站，通俗易懂，风趣幽默，忍不住分享一下给大家。点击跳转到网站1.DeepSeek的前世今生1.1什么是DeepSeek？DeepSeek是一家专注于人工智能技术研发的公司，致力于打造高性能、低成本的AI模型。它的目标是让AI技术更加普惠，让更多人能够用上强大的AI工具。1.2DeepSeek-V3的诞生DeepSeek-V3是DeepSeek公司推出的最新一代A
企业级AI开发利器：Spring AI框架深度解析与实战_spring ai实战 AI大模型-海文人工智能 spring python 算法开发语言 java 机器学习
企业级AI开发利器：SpringAI框架深度解析与实战一、前言：Java生态的AI新纪元在人工智能技术爆发式发展的今天，Java开发者面临着一个新的挑战：如何将大语言模型（LLMs）和生成式AI（GenAI）无缝融入企业级应用。传统的Java生态缺乏统一的AI集成方案，开发者往往需要为不同AI供应商（如OpenAI、阿里云、HuggingFace）编写大量重复的接口适配代码，这不仅增加了开发成本，
图扑软件智慧云展厅，开启数字化展馆新模式智慧园区可视化 5g 人工智能大数据安全云计算
随着疫情的影响以及新兴技术的不断发展，展会的发展形式也逐渐从线下转向线上。通过“云”上启动、云端互动、双线共频的形式开展。通过应用大数据、人工智能、沉浸式交互等多重技术手段，构建数据共享、信息互通、精准匹配的高精度“云展厅”，突破时空壁垒限制。图扑软件运用HT强大的渲染功能，数字孪生“云展位”，1:1复现实际展厅内部独特的结构造型和建筑特色。也可以第一人称视角漫游，模拟用户在展厅内的参观场景，在保
转行要趁早！网络安全行业人才缺口大，企业招聘需求正旺！
网络安全行业具有人才缺口大、岗位选择多、薪资待遇好、学历要求不高等优势，对于想要转行的人员来说，是一个非常不错的选择。人才缺口大网络安全攻防技术手段日新月异，特别是现在人工智能技术飞速发展，网络安全形势复杂严峻，人才重要性凸显。教育部《网络安全人才实战能力白皮书》数据显示，到2027年，我国网络安全人员缺口将达327万。近期发布的《2024年网络安全产业人才发展报告》中提到，沿用ISC2的人才缺口
【机器学习与数据挖掘实战 | 医疗】案例18：基于Apriori算法的中医证型关联规则分析 Francek Chen 机器学习与数据挖掘实战机器学习数据挖掘 Apriori python 关联规则人工智能
【作者主页】FrancekChen【专栏介绍】⌈⌈⌈机器学习与数据挖掘实战⌋⌋⌋机器学习是人工智能的一个分支，专注于让计算机系统通过数据学习和改进。它利用统计和计算方法，使模型能够从数据中自动提取特征并做出预测或决策。数据挖掘则是从大型数据集中发现模式、关联和异常的过程，旨在提取有价值的信息和知识。机器学习为数据挖掘提供了强大的分析工具，而数据挖掘则是机器学习应用的重要领域，两者相辅相成，共同推动
【Python深度学习】零基础掌握Pytorch Pooling layers nn.MaxPool方法 Mr数据杨 Python 深度学习 python 深度学习 pytorch
在深度学习的世界中，MaxPooling是一种关键的操作，用于降低数据的维度并保留重要特征。这就像是从一堆照片中挑选出最能代表某个场景的那张。PyTorch提供了多种MaxPooling层，包括nn.MaxPool1d、nn.MaxPool2d和nn.MaxPool3d，它们分别适用于不同维度的数据处理。如果处理的是声音信号（一维数据），就会用到nn.MaxPool1d。而处理图像（二维数据）时，
误差的回响：反向传播算法与神经网络的惊天逆转田园Coder 人工智能科普人工智能科普
当专家系统在20世纪80年代初期大放异彩，成为人工智能实用化的耀眼明星时，另一股曾经被宣判“死刑”的力量——连接主义（神经网络）——正在寒冬的冻土下悄然涌动，孕育着一场惊天动地的复苏。马文·明斯基和西摩·帕尔特在1969年《感知机》专著中那精准而冷酷的理论批判，如同沉重的封印，将多层神经网络的研究禁锢了近二十年。他们指出的核心死结——缺乏有效算法来训练具有隐藏层的网络——仿佛一道无法逾越的天堑。单
【Html实现“心形日出”（附效果+源代码）】| JavaScript面试题：解释一下异步编程中的回调函数、Promise和Async/Await的概念。它们有什么区别？追光者♂ html5 css3 心形日出前端特效 JS面试题 Promise Async/Await
风会带走你曾经存在过的证明。——虞姬作者主页：追光者♂个人简介：[1]计算机专业硕士研究生[2]2023年城市之星领跑者TOP1(哈尔滨)[3]2022年度博客之星人工智能领域TOP4[4]阿里云社区特邀专家博主[5]CSDN-人工智能领域优质创作者无限进步，一起追光！！！
阅读笔记(2) 单层网络:回归 a2507283885 笔记
阅读笔记(2)单层网络:回归该笔记是DataWhale组队学习计划（共度AI新圣经：深度学习基础与概念）的Task02以下内容为个人理解，可能存在不准确或疏漏之处，请以教材为主。1.从泛函视角来看线性回归还记得线性代数里学过的“基”这个概念吗？一组基向量是一组线性无关的向量，它们通过线性组合可以张成一个向量空间。也就是说，这个空间里的任意一个向量，都可以表示成这组基的线性组合。函数其实也可以看作是
青少年编程与数学 01-012 通用应用软件简介 15 人工智能助手明月看潮生编程与数学第01阶段青少年编程人工智能应用软件编程与数学
青少年编程与数学01-012通用应用软件简介15人工智能助手一、什么是人工智能助手二、人工智能助手的产生和发展（一）早期探索阶段（二）技术突破阶段（三）广泛应用阶段三、人工智能助手的主要功能（一）信息查询（二）日程管理（三）设备控制（四）知识问答四、人工智能助手的商业模式（一）广告收入（二）增值服务（三）数据服务（四）硬件销售五、DeepSeek（一）基本情况（二）技术水平（三）产品功能（四）市场
虚拟空间中的AI协作与任务 AI天才研究院 ChatGPT AI大模型企业级应用开发实战 AI人工智能与大数据大厂Offer收割机面试题简历程序员读书硅基计算碳基计算认知计算生物计算深度学习神经网络大数据 AIGC AGI LLM Java Python 架构设计 Agent 程序员实现财富自由
虚拟空间与AI概述在当今信息化和数字化的时代，虚拟空间（VirtualSpace）已成为人们生活和工作的重要一部分。虚拟空间是一种通过计算机技术构建的虚拟环境，它能够模拟和增强现实世界中的各种交互和体验。而人工智能（AI）作为计算机科学的一个分支，通过模拟人类的认知能力来实现自动化和智能化的决策。虚拟空间与AI的结合，不仅为人类带来了全新的交互方式，也为各行业的发展注入了强大的动力。虚拟空间的定义
AI Agent: AI的下一个风口智能体在元宇宙里的应用 AI智能应用 Python入门实战计算科学神经计算深度学习神经网络大数据人工智能大型语言模型 AI AGI LLM Java Python 架构设计 Agent RPA
AIAgent:AI的下一个风口智能体在元宇宙里的应用作者：禅与计算机程序设计艺术/ZenandtheArtofComputerProgramming关键词：AIAgent,元宇宙,虚拟角色,智能交互,人工智能,虚拟世界,智能体架构,交互式应用1.背景介绍1.1问题的由来随着虚拟现实(VR)、增强现实(AR)和区块链技术的不断发展，元宇宙(Metaverse)的概念逐渐兴起。元宇宙是一个由虚拟世界
攻击者利用热门AI发动黑帽SEO攻击，通过污染搜索结果传播窃密木马 FreeBuf- 人工智能