Paper download: https://www.ijcai.org/Proceedings/2019/0187.pdf
Venue: IJCAI
Publication year: 2019
Authors and affiliations:
Datasets: described in the main text
Code:
Other:
Write-ups by others
Brief summary of the novelty: KBGAN (NAACL 2018, "KBGAN: Adversarial Learning for Knowledge Graph Embeddings") combined GANs with knowledge graph embeddings (KGE) to make KGE training more effective, specifically by improving negative sampling. This paper wraps the same adversarial-learning idea around social recommendation, again targeting negative sampling.
- (1) In this paper, we present a Deep Adversarial SOcial recommendation model (DASO), which learns separated user representations in the item domain and the social domain.
- (2) Particularly, we propose to transfer users' information from the social domain to the item domain by using a bidirectional mapping method.
- (3) In addition, we also introduce adversarial learning to optimize our entire framework by generating informative negative samples.
- (4) The generator and the discriminator play the same roles here as in a generic GAN.
(1) In recent years, we have seen an increasing amount of attention on social recommendation, which harnesses social relations to boost the performance of recommender systems [Tang et al., 2016b; Fan et al., 2019; Wang et al., 2016]. Social recommendation is based on the intuitive ideas that people in the same social group are likely to have similar preferences, and that users will gather information from their experienced friends (e.g., classmates, relatives, and colleagues) when making decisions. Therefore, utilizing users' social relations has been proven to greatly enhance the performance of many recommender systems [Ma et al., 2008; Fan et al., 2019; Tang et al., 2013b; 2016a].
(2) In Figure 1, we observe that in social recommendation we have both the item and social domains, which represent the user-item interactions and user-user connections, respectively. Currently, the most effective way to incorporate social information for improving recommendations is when learning user representations, which is commonly achieved in ways such as:
However, as shown in Figure 1, although users bridge the gap between these two domains, their representations should be heterogeneous. This is because users behave and interact differently in the two domains. Thus, using a unified user representation may restrain user representation learning in each respective domain and result in an inflexible/limited transfer of knowledge from the social relations to the item domain. Therefore, one challenge is to learn separated user representations in the two domains while transferring information from the social domain to the item domain for recommendation.
(3) In this paper, we adopt a nonlinear mapping operation to transfer the user's information from the social domain to the item domain, while learning separated user representations in the two domains.
Nevertheless, learning the representations is challenging due to the inherent data sparsity problem in both domains. Thus, to alleviate this problem, we propose to use a bidirectional mapping between the two domains, such that we can cycle information between them to progressively enhance the user's representations in both domains.
However, for optimizing the user and item representations, most existing methods utilize the negative sampling technique, which is quite ineffective [Wang et al., 2018b]. This is due to the fact that at the beginning of the training process most of the negative user-item samples are still within the margin of the real user-item samples, but later in the optimization process negative sampling is unable to provide "difficult" and informative samples to further improve the user and item representations [Wang et al., 2018b; Cai and Wang, 2018]. Thus, it is desirable to have samples dynamically generated throughout the training process to better guide the learning of the user and item representations.
(4) Recently, Generative Adversarial Networks (GANs) [Goodfellow et al., 2014], which consist of two models engaged in adversarial learning, have shown great success across various domains due to their ability to learn an underlying data distribution and generate synthetic samples [Mao et al., 2017; 2018; Brock et al., 2019; Liu et al., 2018; Wang et al., 2017; 2018a; Derr et al., 2019].
(5) Thus, we propose to harness adversarial learning in social recommendation to generate "difficult" negative samples that guide our framework in learning better user and item representations, while further utilizing it to optimize the entire framework.
Our major contributions can be summarized as follows:
(1) The architecture of the proposed model is shown in Figure 2. The information comes from two domains: the item domain $I$ and the social domain $S$.
(2)There are four types of representations in the two domains.
(1) In social networks, a person's preferences can be influenced by their social interactions, as suggested by sociologists [Fan et al., 2019; 2018; Wasserman and Faust, 1994]. Therefore, a user's social relations from the social network should be incorporated into their user representation in the item domain.
(2) We propose to adopt a nonlinear mapping operation to transfer the user's information from the social domain to the item domain.
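The paper does not spell the mapping out at this point in the text; below is a minimal PyTorch sketch of such a nonlinear mapping $h^{S\to I}$, where the depth, hidden size, and ReLU activation are my assumptions rather than the authors' specification:

```python
import torch
import torch.nn as nn

class MappingMLP(nn.Module):
    """Sketch of the nonlinear mapping h^{S->I} that transfers a user's
    social-domain representation p_i^S into the item domain (p_i^{SI}).
    Architecture details (two layers, ReLU) are assumptions."""
    def __init__(self, dim: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, dim),
        )

    def forward(self, p_s: torch.Tensor) -> torch.Tensor:
        # p_s: (batch, dim) social-domain user representations
        return self.net(p_s)
```

The reverse mapping $h^{I\to S}$ would be a second network of the same shape.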
(1) As user-item interactions and user-user connections are often very sparse, learning separated user representations is challenging.
(2) This bidirectional mapping allows knowledge to be transferred between the item and social domains. To learn these mappings, we further introduce cycle reconstruction. The intuition is that knowledge transferred into the target domain should be reconstructable back into the original knowledge in the source domain. Next we elaborate on cycle reconstruction.
(3) For user $u_i$'s item-domain representation $p^I_i$, cycle reconstruction requires that the mappings take $p^I_i$ back to the original domain, as follows: $p^I_i \longrightarrow h^{I\to S}(p^I_i) \longrightarrow h^{S\to I}(h^{I\to S}(p^I_i)) \approx p^I_i$.
(4)We can formulate this procedure using a cycle reconstruction loss, which needs to be minimized, as follows,
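The loss itself is not copied into these notes; a plausible CycleGAN-style form, in which the choice of the $\ell_1$ norm is my assumption, is:

$$\mathcal{L}_{cycle} = \left\|h^{S\to I}\big(h^{I\to S}(p^I_i)\big) - p^I_i\right\|_1 + \left\|h^{I\to S}\big(h^{S\to I}(p^S_i)\big) - p^S_i\right\|_1$$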
(1) To address the limitation of negative sampling for the ranking task in recommendation, we propose to harness adversarial learning to generate "difficult" and informative samples to guide the framework in learning better user and item representations in the item domain. As shown in the bottom left part of Figure 2, the adversarial learning in the item domain consists of two components:
(2) Formally, $D^I$ and $G^I$ play the following two-player minimax game with value function $\mathcal{L}^I_{adv}(G^I, D^I)$,
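The value function is not reproduced here; following the standard GAN/IRGAN formulation the paper builds on, it plausibly reads (my notation, not verbatim from the paper):

$$\min_{\theta^I_G}\max_{\phi^I_D}\ \mathcal{L}^I_{adv}(G^I, D^I) = \mathbb{E}_{v\sim p^I_{real}(v|u_i)}\big[\log D^I(v|u_i)\big] + \mathbb{E}_{v\sim G^I(v|u_i)}\big[\log\big(1 - D^I(v|u_i)\big)\big]$$

where $D^I(v|u_i) = \sigma\big(f^I_{\phi^I_D}(x^I_i, y^I_v)\big)$ is the sigmoid of the discriminator's score.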
(1) On the other hand, the purpose of the generator $G^I$ is to approximate the underlying real conditional distribution $p^I_{real}(v|u_i)$ and generate the most relevant items for any given user $u_i$.
(2) We define the generator using a softmax function over all the items, based on the user representation $p^{SI}_i$ transferred from the social domain to the item domain:
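The definition itself is omitted in these notes; a softmax over all items with the generator's score function $g^I_{\theta^I_G}$ (consistent with the score functions given in the learning section below) would be:

$$G^I(v|u_i) = \frac{\exp\big(g^I_{\theta^I_G}(p^{SI}_i, q^I_v)\big)}{\sum_{v'}\exp\big(g^I_{\theta^I_G}(p^{SI}_i, q^I_{v'})\big)}$$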
(3) We note that the process of generating a relevant item for a given user is discrete. Thus, we cannot optimize the generator $G^I$ via stochastic gradient descent methods [Wang et al., 2017]. Following [Sutton et al., 2000; Schulman et al., 2015], we adopt the policy gradient method commonly used in reinforcement learning to optimize the generator.
(4) To learn the parameters of the generator, we need to solve the following minimization problem:
(5) Now, this problem can be viewed in a reinforcement learning setting, where $K(x^I_i, y^I_j) = \log(1 + \exp(f^I_{\phi^I_D}(x^I_i, y^I_j)))$ is the reward given to the action "selecting $v_j$ given a user $u_i$", performed according to the policy probability $G^I(v|u_i)$. The policy gradient can be written as:
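The gradient itself is not copied here; applying the standard log-derivative trick ($\nabla_\theta\, \mathbb{E}_{v\sim G}[K] = \mathbb{E}_{v\sim G}[K\,\nabla_\theta \log G]$), a REINFORCE-style estimator with $N$ sampled items would be:

$$\nabla_{\theta^I_G}\mathcal{L} \approx \frac{1}{N}\sum_{j=1}^{N} K(x^I_i, y^I_j)\,\nabla_{\theta^I_G}\log G^I(v_j|u_i)$$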
(6) The optimal parameters of $G^I$ and $D^I$ can be learned by alternately minimizing and maximizing the value function $\mathcal{L}^I_{adv}(G^I, D^I)$.
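A self-contained toy sketch of this alternating scheme in PyTorch follows; the data, dimensions, learning rates, and the omission of bias terms are all simplifications of mine, not the authors' setup:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
n_users, n_items, dim = 100, 50, 16

# Stand-ins for the learned representations (bias terms omitted for brevity).
x = torch.randn(n_users, dim, requires_grad=True)  # discriminator user reps x_i^I
y = torch.randn(n_items, dim, requires_grad=True)  # discriminator item reps y_j^I
p = torch.randn(n_users, dim, requires_grad=True)  # transferred user reps p_i^{SI}
q = torch.randn(n_items, dim, requires_grad=True)  # generator item reps q_j^I
d_opt = torch.optim.Adam([x, y], lr=1e-3)
g_opt = torch.optim.Adam([p, q], lr=1e-3)

# Toy "observed" interactions: user i interacts with item i mod n_items.
users = torch.arange(n_users)
pos_items = users % n_items

for step in range(200):
    # Discriminator step: maximize log D(real) + log(1 - D(fake)).
    with torch.no_grad():
        probs = torch.softmax(p @ q.T, dim=1)              # G^I(v | u_i)
        fake_items = torch.multinomial(probs, 1).squeeze(1)
    real_score = (x[users] * y[pos_items]).sum(1)
    fake_score = (x[users] * y[fake_items]).sum(1)
    d_loss = -(F.logsigmoid(real_score).mean() + F.logsigmoid(-fake_score).mean())
    d_opt.zero_grad(); d_loss.backward(); d_opt.step()

    # Generator step: REINFORCE with reward K = log(1 + exp(f)).
    probs = torch.softmax(p @ q.T, dim=1)
    fake_items = torch.multinomial(probs, 1).squeeze(1)
    log_prob = torch.log(probs[users, fake_items] + 1e-10)
    with torch.no_grad():
        reward = torch.log1p(torch.exp((x[users] * y[fake_items]).sum(1)))
    g_loss = -(log_prob * reward).mean()                    # maximize expected reward
    g_opt.zero_grad(); g_loss.backward(); g_opt.step()
```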
(7) Note that, unlike optimizing user and item representations with the typical negative sampling of traditional recommender systems, the adversarial learning technique tries to generate "difficult", high-quality negative samples to guide the learning of the user and item representations.
(1) In order to learn better user representations from the social perspective, another adversarial learning component is harnessed in the social domain. Likewise, the adversarial learning in the social domain consists of two components, as shown in the bottom right part of Figure 2.
Discriminator $D^S(u_i, u; \phi^S_D)$, parameterized by $\phi^S_D$, aims to distinguish the real connected user-user pairs $(u_i, u)$ from the fake user-user pairs produced by the generator $G^S$.
Generator $G^S(u|u_i; \theta^S_G)$, parameterized by $\theta^S_G$, tries to fit the underlying real conditional distribution $p^S_{real}(u|u_i)$ as closely as possible, and generates (or, to be more precise, selects) the users most relevant to the given user $u_i$.
(2) Formally, $D^S$ and $G^S$ play the following two-player minimax game with value function $\mathcal{L}^S_{adv}(G^S, D^S)$,
(1) The purpose of the generator $G^S$ is to approximate the underlying real conditional distribution $p^S_{real}(u|u_i)$ and generate (or, to be more precise, select) the most relevant users for any given user $u_i$.
(2) We model the distribution using a softmax function over all the other users, based on the transferred user representation $p^{IS}_i$ (from the item domain to the social domain),
(3) Likewise, policy gradient is utilized to optimize the generator $G^S$,
where the details are omitted here, since it is defined similarly to Eq. (5).
(1) With all model components, the objective function of the proposed framework is:
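The objective is not reproduced in these notes; given the $\lambda$ study below, it plausibly combines the two adversarial value functions with the cycle reconstruction term weighted by $\lambda$ (the exact combination is my guess, not the paper's verbatim equation):

$$\min_{\theta_G}\max_{\phi_D}\ \mathcal{L} = \mathcal{L}^I_{adv}(G^I, D^I) + \mathcal{L}^S_{adv}(G^S, D^S) + \lambda\,\mathcal{L}_{cycle}$$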
(2) There are six representations in our model: $p^I_i$, $q^I_j$, $x^I_i$, $y^I_j$, $p^S_i$, and $x^S_i$. They are randomly initialized and jointly learned during the training stage.
(3) Following the setting of IRGAN [Wang et al., 2017], we adopt the inner product as the score functions $f^I_{\phi^I_D}$ and $g^I_{\theta^I_G}$ in the item domain, as follows: $f^I_{\phi^I_D}(x^I_i, y^I_j) = (x^I_i)^T y^I_j + a_j$, $g^I_{\theta^I_G}(p^{SI}_i, q^I_j) = (p^{SI}_i)^T q^I_j + b_j$.
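As a quick sanity check, these two score functions are just batched inner products plus a per-item bias; a minimal PyTorch sketch (the function names are illustrative, not from the paper):

```python
import torch

def f_disc(x_i: torch.Tensor, y_j: torch.Tensor, a_j: torch.Tensor) -> torch.Tensor:
    """Discriminator score f^I(x_i, y_j) = x_i^T y_j + a_j, batched over pairs."""
    return (x_i * y_j).sum(dim=-1) + a_j

def g_gen(p_si: torch.Tensor, q_j: torch.Tensor, b_j: torch.Tensor) -> torch.Tensor:
    """Generator score g^I(p_i^{SI}, q_j) = (p_i^{SI})^T q_j + b_j, batched over pairs."""
    return (p_si * q_j).sum(dim=-1) + b_j

# Usage: a batch of 4 user-item pairs with 16-dimensional representations.
x_i, y_j = torch.randn(4, 16), torch.randn(4, 16)
a_j = torch.zeros(4)  # per-item bias a_j (assumed to be a learned scalar per item)
print(f_disc(x_i, y_j, a_j).shape)  # torch.Size([4])
```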
Table 2 presents the performance of all recommendation methods. We have the following findings:
Next, we investigate how the value of $\lambda$ affects the performance of the proposed framework.
(1) As suggested by social theories [Marsden and Friedkin, 1993], people's behaviours tend to be influenced by their social connections and interactions. Many existing social recommendation methods [Fan et al., 2018; Tang et al., 2013a; 2016b; Du et al., 2017; Ma et al., 2008] have shown that incorporating social relations can enhance recommendation performance.
In addition, deep neural networks have been adopted to enhance social recommender systems.
(2) Some recent works have investigated adversarial learning for recommendation.
Despite the compelling success achieved by many works, little attention has been paid to social recommendation with adversarial learning. Therefore, we propose a deep adversarial social recommender system to fill this gap.