CN106777123A

CN106777123A - A kind of group based on two-way tensor resolution model recommends method

Info

Publication number: CN106777123A
Application number: CN201611168532.1A
Authority: CN
Inventors: 姜元春; 杨露; 孙见山; 王锦坤; 刘业政
Original assignee: Hefei University of Technology
Current assignee: Hefei University of Technology
Priority date: 2016-12-16
Filing date: 2016-12-16
Publication date: 2017-05-31
Anticipated expiration: 2036-12-16
Also published as: CN106777123B

Abstract

The invention discloses a group recommendation method based on a bidirectional tensor decomposition model, including: 1) defining an interaction relationship D _S representing a group G, a user U and a product I; 2) constructing a tensor decomposition model; 3) using the The Yethian personalized sorting method converts and solves the tensor decomposition model, and obtains the values of each parameter in the tensor decomposition model; 4) Obtains the group preference of the g-th group for the i-th product And traverse all commodities to obtain the group preferences of the gth group for all products; 5) sort the group preferences of the gth group for all products in descending order, and select the first N products as the recommended product list and push them to the gth group. The invention models the individual preference as a two-way process, which can effectively reflect the real formation process of the individual preference, improves the accuracy of group recommendation, and has better robustness.

Description

A Group Recommendation Method Based on Bidirectional Tensor Decomposition Model

技术领域technical field

本发明属于群推荐领域，具体地说是一种基于双向张量分解模型的群推荐(BTF-GR)方法The invention belongs to the field of group recommendation, in particular to a group recommendation (BTF-GR) method based on a bidirectional tensor decomposition model

背景技术Background technique

社交网络早已成为社交媒体环境中一个重要活动部分，用户在社交网络平台会自发组成群体，而捕捉每一个群体的偏好将有利于我们对用户群体进行深入的行为分析，进而为群体推荐目标产品和服务。但群体推荐不同于个体推荐，因为群体通常是由具有多样化偏好的用户构成的，所以群体推荐不易实现。群推荐的核心任务就是要聚合个体偏好以产生一个群体推荐结果，但现有聚合策略都是将群体偏好的形成建模为一个单向过程，虽然即群体偏好是个体偏好聚合的结果，但在真实的社交环境中群体和个体存在交互作用，而现有聚合策略并不能捕捉群体对个体偏好产生的影响，导致群推荐准确性不高；用户的个体特性和群体的群体特性呈现多样化，所以群体对个体偏好的影响存在差异性，但现有聚合不能刻画这种影响的差异，使得群推荐结果并不理想；而且当现有群推荐方法应用于稀疏数据集时，推荐性能明显下降，所以现有群推荐方法并不能很好地解决大数据环境下的数据稀疏问题，不具备较好鲁棒性。Social networking has long been an important part of the social media environment. Users will spontaneously form groups on social networking platforms, and capturing the preferences of each group will help us conduct in-depth behavioral analysis of user groups, and then recommend target products and Serve. But group recommendation is different from individual recommendation, because groups are usually composed of users with diverse preferences, so group recommendation is not easy to achieve. The core task of group recommendation is to aggregate individual preferences to generate a group recommendation result, but the existing aggregation strategies all model the formation of group preference as a one-way process, although group preference is the result of individual preference aggregation, but in In a real social environment, there is interaction between groups and individuals, and the existing aggregation strategy cannot capture the influence of groups on individual preferences, resulting in low accuracy of group recommendations; the individual characteristics of users and the group characteristics of groups are diverse, so There are differences in the influence of groups on individual preferences, but the existing aggregation cannot describe the difference of this influence, which makes the group recommendation results unsatisfactory; and when the existing group recommendation method is applied to a sparse data set, the recommendation performance drops significantly, so The existing group recommendation methods cannot solve the problem of data sparsity in the big data environment well, and do not have good robustness.

近年来，分解成为最受欢迎的推荐技术，通过分解可以描述用户与产品的交互作用，虽然将张量分解应用于标签推荐、广告点击预测等推荐问题中取得了较好的推荐性能，但张量分解的时间复杂度高，受限于数据规模，所以并不能很好地适用于大数据环境下的群推荐问题。In recent years, decomposition has become the most popular recommendation technique. Through decomposition, the interaction between users and products can be described. Although the application of tensor decomposition to label recommendation, advertisement click prediction and other recommendation problems has achieved good recommendation performance, Zhang Quantitative decomposition has a high time complexity and is limited by the size of the data, so it is not suitable for group recommendation problems in a big data environment.

发明内容Contents of the invention

本发明针对现有群推荐策略存在的不足之处，提出一种基于双向张量分解模型的群推荐方法，以期能在个体偏好建模中体现群体与个体的交互作用，并刻画群体对个体偏好影响的差异性，通过聚合准确的个体偏好形成准确的群体偏好，从而提升群推荐精度，并适用于大规模、稀疏的数据环境下精准、稳定的群推荐。Aiming at the deficiencies of existing group recommendation strategies, the present invention proposes a group recommendation method based on a two-way tensor decomposition model, in order to reflect the interaction between the group and the individual in the individual preference modeling, and to describe the preference of the group to the individual The difference in influence, through the aggregation of accurate individual preferences to form accurate group preferences, thereby improving the accuracy of group recommendations, and is suitable for accurate and stable group recommendations in large-scale and sparse data environments.

为达到上述目的，本发明采用的技术方案为：In order to achieve the above object, the technical scheme adopted in the present invention is:

本发明一种基于双向张量分解模型的群推荐方法的特点是按照如下步骤进行：A kind of group recommendation method based on the two-way tensor decomposition model of the present invention is characterized in that it proceeds according to the following steps:

步骤一、定义一个表示群体G、用户U和产品I的交互关系：所述交互关系D_S中，G＝{G₁,...,G_g,...,G_|G|}表示群体集合，G_g表示任意第g个群体，1≤g≤|G|；U＝{U₁,...,U_u,...,U_|U|}表示用户集合，U_u表示第u个用户；I＝{I₁,...,I_i,...,I_|I|}表示产品集合，I_i表示第i个产品， Step 1. Define an interaction relationship representing group G, user U and product I: In the interaction relationship D _S , G={G ₁ ,...,G _g ,...,G _|G| } represents a group set, G _g represents any gth group, 1≤g≤|G| ;U＝{U ₁ ,...,U _u ,...,U _|U| } represents the set of users, and U _u represents the uth user; I＝{I ₁ ,...,I _i ,...,I _|I| } represents the product set, I _i represents the i-th product,

步骤二、利用式(1)构建第g个群体中第u个用户对第i个产品的张量分解模型 Step 2: Use formula (1) to construct the tensor decomposition model of the i-th product for the u-th user in the g-th group

式(1)中，表示第g个群体中第u个用户对第i个产品的个体偏好；表示第u个用户对自身的个体偏好的影响，表示第u个用户所属的第g个群体对第u个用户的个体偏好的影响，b_i表示第i个产品的偏差；表示第u个用户的个体特性对第u个用户的权重；U_u,l表示第u个用户的第l个隐变量，k_u表示第u个用户的隐变量个数；表示与第u个用户产生交互的第i个产品的第l个隐变量；表示第u个用户所属的第g个群体的群体特性对第u个用户的权重；G_g,m表示第g个群体的第m个隐变量，k_g表示第g个群体的隐变量个数；表示与第g个群体产生交互的第i个产品的第m个隐变量； In formula (1), Indicates the individual preference of the u-th user in the g-th group to the i-th product; Indicates the uth user's individual preference for himself Impact, Indicates the individual preference of the g-th group to which the u-th user belongs to the u-th user , b _i represents the deviation of the i-th product; Represents the weight of the individual characteristics of the uth user to the uth user; U _u,l represents the lth hidden variable of the uth user, and k _u represents the number of hidden variables of the uth user; Indicates the l-th latent variable of the i-th product that interacts with the u-th user; Indicates the weight of the group characteristics of the g-th group to which the u-th user belongs to the u-th user; G _g,m represents the m-th hidden variable of the g-th group, and k _g represents the number of hidden variables of the g-th group ; Represents the mth latent variable of the i-th product that interacts with the g-th group;

步骤三、利用贝叶斯个性化排序方法对所述张量分解模型进行优化求解，得到所述张量分解模型中的各个参数值；Step 3, using the Bayesian personalized ranking method to analyze the tensor decomposition model Perform an optimization solution to obtain the tensor decomposition model Each parameter value in;

步骤四、利用式(2)得到第g个群体对第i个产品的群体偏好从而获得第g个群体对所有产品的群体偏好：Step 4. Use formula (2) to obtain the group preference of the g-th group for the i-th product Thus, the group preference of the gth group for all products is obtained:

式(2)中，Δ(·)为平均聚合函数；In formula (2), Δ( ) is the average aggregation function;

步骤四、将所述第g个群体对所有产品的群体偏好进行降序排序，并选择前N个群体偏好所对应的产品作为推荐产品列表推送给第g个群体。Step 4: Sort in descending order the group preferences of all products by the gth group, and select the products corresponding to the top N group preferences as a list of recommended products and push them to the gth group.

本发明所述的群推荐方法的特点也在于，所述步骤三是按如下步骤进行：The group recommendation method of the present invention is also characterized in that the third step is carried out as follows:

步骤3.1、利用式(3)得到所述张量分解模型的目标函数 Step 3.1, utilize formula (3) to obtain described tensor decomposition model The objective function of

式(3)中，表示第g个群体中第u个用户对第i_a个产品的个体偏好与对第i_b个产品的个体偏好的差值；表示第i_a个产品属于第g个群体中第u个用户的正反馈集合，所述正反馈集合为与第g个群体中第u个用户交互过的所有产品集合；表示第i_b个产品属于第g个群体中第u个用户的负反馈和缺失值集合，负反馈和缺失值集合为与第g个群体中第u个用户没有交互过的所有产品集合；表示logistic函数；Θ表示所述张量分解模型中的参数集合，并有λ_Θ表示正则化参数；In formula (3), Indicates the difference between the individual preference of the u-th user in the g-th group for the _i -th product and the individual preference for the _i -th product; Indicates that the _i -th product belongs to the positive feedback set of the u-th user in the g-th group, and the positive feedback set is a set of all products that have interacted with the u-th user in the g-th group; Indicates that the i _b -th product belongs to the negative feedback and missing value set of the u-th user in the g-th group, and the negative feedback and missing value set is the set of all products that have not interacted with the u-th user in the g-th group; Represents the logistic function; Θ represents the tensor decomposition model The set of parameters in and has λ _Θ represents a regularization parameter;

步骤3.2、初始化参数集合Θ和正则化参数λ_Θ；Step 3.2, initialization parameter set Θ and regularization parameter λ _Θ ;

步骤3.3、遍历所述第g个群体中第u个用户的正反馈集合中的所有产品，并在遍历每个产品的过程中从所述第g个群体中第u个用户的负反馈和缺失值集合中任意选择一个产品；Step 3.3, traversing all the products in the positive feedback set of the uth user in the gth group, and in the process of traversing each product, from the negative feedback and missing feedback of the uth user in the gth group Arbitrarily select a product in the value set;

步骤3.4、利用随机梯度下降方法求得所述参数集合Θ中参数的梯度；在参数和为定值时，对所述参数分别进行迭代更新，直到收敛为止，从而获得参数的最优值；Step 3.4, utilize stochastic gradient descent method to obtain the parameters in the parameter set Θ Gradient; in parameter with When it is a fixed value, for the parameter Perform iterative updates respectively until convergence, so as to obtain the parameters the optimal value;

步骤3.5、遍历所述第g个群体中第u个用户的正反馈集合中的所有产品，并在遍历每个产品的过程中从第g个群体中第u个用户的负反馈和缺失值集合中任意选择一个产品；Step 3.5, traversing all the products in the positive feedback set of the uth user in the gth group, and in the process of traversing each product, from the negative feedback and missing value set of the uth user in the gth group Choose any product from

步骤3.6、利用随机梯度下降方法求得所述参数集合Θ中参数和的梯度；在参数为最优值时，对所述参数和分别进行迭代更新，直到收敛为止，从而获得参数和的最优值。Step 3.6, utilize stochastic gradient descent method to obtain the parameters in the parameter set Θ with Gradient; in parameter When it is the optimal value, for the parameters with Perform iterative updates respectively until convergence, so as to obtain the parameters with the optimal value of .

与已有技术相比，本发明的有益效果体现在：Compared with the prior art, the beneficial effects of the present invention are reflected in:

1、本发明首次将群体偏好的形成建模为聚合个体偏好和群体对个体偏好产生影响的双向过程，相比于现有群推荐方法将群体偏好建模为聚合个体偏好的单向过程，本发明的建模思想更符合群体偏好形成的真实场景，本发明不仅可以捕捉群体与个体之间的交互作用，而且还可以刻画群体对不同用户的个体偏好产生影响的差异性，从而有效提高了群推荐精度，获得了满意的群推荐结果。1. For the first time, the present invention models the formation of group preferences as a two-way process of aggregating individual preferences and the influence of groups on individual preferences. Compared with existing group recommendation methods that model group preferences as a one-way process of aggregating individual preferences, this The modeling idea of the invention is more in line with the real scene of group preference formation. The invention can not only capture the interaction between the group and the individual, but also describe the differences in the influence of the group on the individual preferences of different users, thus effectively improving the group The recommendation accuracy is satisfactory, and a satisfactory group recommendation result is obtained.

2、本发明通过将用户、群体、产品三者建模为成对交互的张量分解，解决了张量分解模型存在的高时间复杂度问题，同时本发明提出的张量分解模型由于集成了群体偏好，可以有效减少数据稀疏的负面影响，所以本发明使得张量分解模型可以应用于大数据环境下的群推荐问题，本发明中成对交互的张量分解模型可以在线性时间复杂度内获得更高的预测质量。2. The present invention solves the high time complexity problem existing in the tensor decomposition model by modeling users, groups, and products as a tensor decomposition of paired interactions. At the same time, the tensor decomposition model proposed by the present invention integrates Group preference can effectively reduce the negative impact of data sparseness, so the present invention enables the tensor decomposition model to be applied to the group recommendation problem in the big data environment, and the tensor decomposition model of pairwise interaction in the present invention can Get higher forecast quality.

3、大数据环境下存在大量稀疏的隐反馈数据，直接通过预测偏好分值求解模型的方法预测的个体偏好存在较大偏差，从而导致推荐精度和满意度下降，而本发明将模型求解转化为排序问题，排序方法针对稀疏的隐反馈数据具有很好的适应性，可得到准确的偏好排序。本发明使用贝叶斯个性化排序方法求解模型可获得准确的个体偏好排序，进而聚合为准确的群体偏好排序，所以有效提升了群推荐的准确性和满意度。3. There is a large amount of sparse implicit feedback data in the big data environment, and there is a large deviation in the individual preference predicted by the method of directly predicting the preference score to solve the model, which leads to a decline in recommendation accuracy and satisfaction. However, the present invention transforms the model solution into For the ranking problem, the ranking method has good adaptability to sparse implicit feedback data, and can obtain accurate preference ranking. The present invention uses the Bayesian personalized ranking method to solve the model to obtain accurate individual preference rankings, and then aggregates them into accurate group preference rankings, thus effectively improving the accuracy and satisfaction of group recommendations.

4、本发明为用户设置了个性化权重，用以捕捉个体偏好和群体影响对不同用户的差异化作用，个性化权重的设置使得模型更加贴近个体偏好形成的真实情景，有助于获得更好的群推荐精度。4. The present invention sets personalized weights for users to capture the differential effects of individual preferences and group influence on different users. The setting of personalized weights makes the model closer to the real situation of individual preferences, which helps to obtain better group recommendation accuracy.

5、真实群推荐环境中，通常采取的策略是提供一个尽可能长的推荐列表以实现尽可能多地涵盖群体中所有用户的偏好。当推荐列表越长，本发明所提出的群推荐方法不仅具有较好鲁棒性，而且性能更优，所以本发明适用于群推荐环境。5. In the real group recommendation environment, the usual strategy is to provide a recommendation list as long as possible to cover the preferences of all users in the group as much as possible. When the recommendation list is longer, the group recommendation method proposed by the present invention not only has better robustness, but also has better performance, so the present invention is applicable to the group recommendation environment.

6、大规模群体因为偏好多样性增加，使得针对大规模群体的推荐十分困难和不精确，而本发明提出的群推荐方法针对不同规模的群体都具有较好稳健性。6. Due to the increase in preference diversity of large-scale groups, the recommendation for large-scale groups is very difficult and imprecise, while the group recommendation method proposed by the present invention has better robustness for groups of different sizes.

7、本发明可用于家电和食品等实体产品、音乐和电影等数字产品、旅游路线和度假安排等服务产品的群推荐系统，可以在电脑和手机的网页和APP等平台使用，应用范围广泛。7. The present invention can be used in a group recommendation system for physical products such as home appliances and food, digital products such as music and movies, service products such as travel routes and vacation arrangements, and can be used on platforms such as webpages and APPs of computers and mobile phones, and has a wide range of applications.

附图说明Description of drawings

图1为本发明的流程示意图；Fig. 1 is a schematic flow sheet of the present invention;

图2为本发明所提方法与传统方法的对比图；Fig. 2 is the comparative figure of method proposed by the present invention and traditional method;

图3为本发明的数据表示和解决方案；Fig. 3 is data representation and solution of the present invention;

图4a为本发明与基准算法的平均准确率均值对比图；Fig. 4 a is the comparison chart of the average accuracy rate mean value of the present invention and the benchmark algorithm;

图4b为本发明与基准算法的推荐列表长度为5的召回率对比图；Fig. 4b is a comparison chart of the recall ratio of the recommendation list length of the present invention and the benchmark algorithm being 5;

图4c为本发明与基准算法的平均排序倒数对比图；Fig. 4c is the comparison chart of the average sorting reciprocal of the present invention and the benchmark algorithm;

图5a为个性化权重与固定权重的平均准确率均值对比图；Figure 5a is a comparison chart of the average accuracy rate between personalized weights and fixed weights;

图5b为个性化权重与固定权重的推荐列表长度为5的召回率对比图；Figure 5b is a comparison chart of the recall rate of the recommendation list length of 5 with personalized weight and fixed weight;

图5c为个性化权重与固定权重的平均排序倒数对比图。Figure 5c is a comparison chart of the average ranking reciprocal of personalized weights and fixed weights.

具体实施方式detailed description

本发明提出的张量分解模型是一种双向的成对交互式张量分解模型，即群体偏好的形成为双向过程，而且将用户、群体、产品三者建模为成对交互关系。本发明提出的群推荐方法建立在以下两点假设的基础上：The tensor decomposition model proposed by the present invention is a two-way paired interactive tensor decomposition model, that is, the formation of group preferences is a two-way process, and the user, group, and product are modeled as a paired interaction relationship. The group recommendation method proposed by the present invention is based on the following two assumptions:

1、一个群体中的每个用户对于产品都有一种内在偏好。每个用户对于一个产品的内在偏好可能会受到所属群体的影响；1. Each user in a group has an inherent preference for a product. Each user's intrinsic preference for a product may be influenced by the group to which he belongs;

2.群体对于个体偏好的影响在用户之间是显著不同的。用户的最终偏好是内在偏好和群体影响的综合作用结果。2. The influence of groups on individual preferences is significantly different among users. The user's final preference is the combined result of internal preference and group influence.

本实施例中，如图1所示，一种基于双向张量分解模型的群推荐方法，按照如下步骤进行：In this embodiment, as shown in Figure 1, a group recommendation method based on a two-way tensor decomposition model is performed according to the following steps:

步骤一、定义一个表示群体G、用户U和产品I的交互关系：交互关系D_S中，G＝{G₁,...,G_g,...,G_|G|}表示群体集合，G_g表示任意第g个群体，1≤g≤|G|；U＝{U₁,...,U_u,...,U_|U|}表示用户集合，U_u表示第u个用户；I＝{I₁,...,I_i,...,I_|I|}表示产品集合，I_i表示第i个产品，传统群体偏好建模只考虑用户U和产品I的交互，而本发明不仅考虑了用户U和产品I的交互，同时也考虑了群体G和用户U之间的交互，而且针对不同用户这种交互作用的强度存在差异，本发明提出的群体偏好建模思想与传统的群体偏好建模思想的区别如图2所示。Step 1. Define an interaction relationship representing group G, user U and product I: In the interaction relation D _S , G={G ₁ ,...,G _g ,...,G _|G| } represents the group set, G _g represents any gth group, 1≤g≤|G|; U ＝{U ₁ ,...,U _u ,...,U _|U| } represents the set of users, and U _u represents the uth user; I＝{I ₁ ,...,I _i ,...,I _|I| } represents the product set, I _i represents the i-th product, Traditional group preference modeling only considers the interaction between user U and product I, but the present invention not only considers the interaction between user U and product I, but also considers the interaction between group G and user U, and the interaction between different users There are differences in the strength of the effect. The difference between the group preference modeling idea proposed by the present invention and the traditional group preference modeling idea is shown in FIG. 2 .

步骤二、将群体G、用户U和产品I的交互关系进行成对交互式分解，群体G、用户U和产品I的数据表示和本发明提出的成对交互式张量分解模型如图3所示，图3还显示了本发明提出的张量分解模型与传统的张量分解模型的差异，再利用式(1)便可构建第g个群体中第u个用户对第i个产品的张量分解模型 Step 2, carry out pairwise interactive decomposition of the interaction relationship between group G, user U and product I, the data representation of group G, user U and product I and the pairwise interactive tensor decomposition model proposed by the present invention are shown in Figure 3 Figure 3 also shows the difference between the tensor decomposition model proposed by the present invention and the traditional tensor decomposition model, and then using formula (1) can construct the tensor of the i-th product for the u-th user in the g-th group Quantitative decomposition model

步骤三、利用贝叶斯个性化排序方法对张量分解模型进行优化求解，得到张量分解模型中的各个参数值，即将个体偏好的直接求解转化为个体偏好的排序；Step 3. Use the Bayesian personalized ranking method to decompose the tensor model Perform an optimization solution to obtain the tensor decomposition model Each parameter value in , that is, the direct solution of individual preference is transformed into the ranking of individual preference;

步骤3.1、求解张量分解模型即给定第g个群体中的第u个用户，求得第u个用户的最优参数，所以根据贝叶斯个性化排序方法得出张量分解模型的学习目标就是最大化式(2)中的后验概率 Step 3.1, Solve the tensor decomposition model That is, given the u-th user in the g-th group, the optimal parameters of the u-th user are obtained, so according to the Bayesian personalized ranking method, the learning objective of the tensor decomposition model is to maximize the formula (2) The posterior probability in

式(2)中，Θ表示张量分解模型中的参数集合，并有表示第g个群体中的第u个用户对所有产品的偏好排序；表示由样本集得到的第g个群体中的第u个用户对所有产品的个体偏好似然函数；表示参数的先验知识。其中似然函数可以表示为公式(3)：In formula (2), Θ represents the tensor decomposition model The set of parameters in and has Indicates the preference ranking of the u-th user in the g-th group for all products; Indicates the individual preference likelihood function of the u-th user in the g-th group for all products obtained from the sample set; Represents prior knowledge of the parameters. where the likelihood function It can be expressed as formula (3):

式(3)中，表示第g个群体中的第u个用户对所有产品的偏好排序中，产品排在产品前面，且每对产品的排序独立于其他产品对的排序；表示第g个群体的第u个用户相比于产品i_b对于产品的个体偏好概率，同时可以表示为公式(4)：In formula (3), Indicates that the u-th user in the g-th group has a preference for all products, and the product ranked product front, and the ordering of each pair of products is independent of the ordering of other product pairs; Indicates that the u-th user of the g-th group compares to product i _b for product The individual preference probability of can be expressed as formula (4):

式(4)中，表示第g个群体中第u个用户对第i_a个产品的个体偏好与对第i_b个产品的个体偏好的差值；表示logistic函数。根据这种转化方式，并假设参数的先验知识服从均值为0，协方差矩阵为∑_Θ的高斯分布，则公式(2)中后验概率的对数形式可以简化为公式(5):In formula (4), Indicates the difference between the individual preference of the u-th user in the g-th group for the _i -th product and the individual preference for the _i -th product; Represents the logistic function. According to this transformation, and assuming prior knowledge of the parameters Obey the mean value of 0, the covariance matrix is a Gaussian distribution of _ΣΘ , then the logarithmic form of the posterior probability in formula (2) can be simplified to formula (5):

张量分解模型的学习目标便转化为最小化公式(6)的目标函数 tensor decomposition model The learning objective is transformed into the objective function of minimizing formula (6)

式(6)中，表示第i_a个产品属于第g个群体中第u个用户的正反馈集合，正反馈集合为与第g个群体中第u个用户交互过的所有产品集合；表示第i_b个产品属于第g个群体中第u个用户的负反馈和缺失值集合，负反馈和缺失值集合为与第g个群体中第u个用户没有交互过的所有产品集合；λ_Θ表示正则化参数；In formula (6), Indicates that the i _a -th product belongs to the positive feedback set of the u-th user in the g-th group, and the positive feedback set is the set of all products that have interacted with the u-th user in the g-th group; Indicates that the i _b -th product belongs to the negative feedback and missing value set of the u-th user in the g-th group, and the negative feedback and missing value set is the set of all products that have not interacted with the u-th user in the g-th group; λ _Θ represents the regularization parameter;

步骤3.3、遍历第g个群体中第u个用户的正反馈集合中的所有产品，并在遍历每个产品的过程中从第g个群体中第u个用户的负反馈和缺失值集合中任意选择一个产品；Step 3.3. Traverse all products in the positive feedback set of the uth user in the gth group, and randomly select from the negative feedback and missing value set of the uth user in the gth group during the process of traversing each product select a product;

步骤3.4、利用随机梯度下降方法求得参数集合Θ中参数的梯度；在参数和为定值时，对参数分别进行迭代更新，直到收敛为止，从而获得参数的最优值；Step 3.4, using the stochastic gradient descent method to obtain the parameters in the parameter set Θ Gradient; in parameter with When it is a fixed value, for the parameter Perform iterative updates respectively until convergence, so as to obtain the parameters the optimal value;

步骤3.5、遍历第g个群体中第u个用户的正反馈集合中的所有产品，并在遍历每个产品的过程中从第g个群体中第u个用户的负反馈和缺失值集合中任意选择一个产品；Step 3.5, traverse all the products in the positive feedback set of the uth user in the gth group, and randomly select from the negative feedback and missing value set of the uth user in the gth group during the process of traversing each product select a product;

步骤3.6、利用随机梯度下降方法求得参数集合Θ中参数和的梯度；在参数为最优值时，对参数和分别进行迭代更新，直到收敛为止，从而获得参数和的最优值。Step 3.6, using the stochastic gradient descent method to obtain the parameters in the parameter set Θ with Gradient; in parameter When it is the optimal value, for the parameter with Perform iterative updates respectively until convergence, so as to obtain the parameters with the optimal value of .

步骤四、利用式(7)得到第g个群体对第i个产品的群体偏好从而获得第g个群体对所有产品的群体偏好：Step 4: Use formula (7) to obtain the group preference of the g-th group for the i-th product Thus, the group preference of the gth group for all products is obtained:

式(7)中，Δ(·)为平均聚合函数；In formula (7), Δ( ) is the average aggregation function;

步骤四、将第g个群体对所有产品的群体偏好进行降序排序，并选择前N个群体偏好所对应的产品作为推荐产品列表推送给第g个群体。Step 4: Sort the group preferences of all products by the g-th group in descending order, and select the products corresponding to the top N group preferences as a recommended product list and push them to the g-th group.

针对本发明方法进行实验论证，具体包括：Carry out experimental demonstration for the inventive method, specifically include:

1)准备标准数据集1) Prepare the standard data set

本发明使用CiteULike和Last.fm这两个在推荐领域应用广泛的数据集作为标准数据集验证本发明提出的群推荐方法的性能。第一个数据集CiteULike是一个科学研究者在线社区网站。在CiteULike上，学者可以使用标签标注学术文章，同时也可以创建和加入群来分享引用的文章。实验使用的CiteULike数据集包含130321个“群体-用户-产品”三元组，其中的11168篇文章由来自584个群体的1310个用户标注而成，平均群体大小是5.4，从每个群体中选出前10篇文章作为测试集，剩余的数据作为训练集；第二个数据集Last.fm是个面向音乐迷的在线社区网站。在Last.fm上，音乐迷们可以标注音乐家或者曲目，创建或者和有类似品位的人形成群体。实验使用的Last.fm数据集包含了317907个“群体-用户-产品”三元组，其中的1992位音乐家由来自2716个群体的3605个用户标注而成，均群体大小是21.2，选择每个群体中的前10位音乐家作为测试集，剩余的数据用于训练集。The present invention uses CiteULike and Last.fm, two widely used data sets in the recommendation field, as standard data sets to verify the performance of the group recommendation method proposed by the present invention. The first dataset, CiteULike, is an online community website for scientific researchers. On CiteULike, scholars can use tags to mark academic articles, and they can also create and join groups to share cited articles. The CiteULike data set used in the experiment contains 130,321 "group-user-product" triples, of which 11,168 articles are annotated by 1310 users from 584 groups, the average group size is 5.4, and selected from each group The first 10 articles are used as a test set, and the remaining data is used as a training set; the second data set Last.fm is an online community website for music fans. On Last.fm, music fans can tag musicians or tracks and create or form groups with people of similar taste. The Last.fm data set used in the experiment contains 317907 "group-user-product" triples, of which 1992 musicians are marked by 3605 users from 2716 groups, and the average group size is 21.2. The top 10 musicians in each group are used as the test set, and the remaining data are used as the training set.

2)评价指标2) Evaluation indicators

采用推荐长度为N的召回率(Rec@N)，平均准确率均值(MAP)和平均排序倒数(MRR)作为本实验的评价指标。召回率评估群推荐系统返回所有相关产品的能力，平均准确率均值和平均排序倒数揭示在推荐列表中相关产品的准确性。推荐列表长度为N的召回率Rec@N计算公式为：The recall rate (Rec@N) with a recommended length of N, mean average precision (MAP) and average rank reciprocal (MRR) are used as the evaluation indicators of this experiment. Recall evaluates the ability of a group recommender system to return all relevant products, mean precision and mean rank reciprocal reveal the accuracy of relevant products in the recommended list. The formula for calculating the recall rate Rec@N with a recommended list length of N is:

式(8)中，N_related是在排序列表和测试集中同时出现的产品数目；N是测试集中相关产品的数目。In formula (8), N _related is the number of products that appear in both the sorted list and the test set; N is the number of related products in the test set.

平均准确率均值计算公式为：The formula for calculating the average accuracy rate is:

式(9)中，其中|G|是测试集中群体的数目，是一个指示变量，如果群体的推荐列表中排名第n的产品也出现在测试集中，则值为1，其余情况都为0。表示的推荐结果的准确率，公式为：In formula (9), where |G| is the number of groups in the test set, is an indicator variable, if the group If the product ranked nth in the recommended list also appears in the test set, the value is 1, and the other cases are 0. The accuracy rate of the recommendation result represented by , the formula is:

式(10)中，其中是在排序列表和测试集中同时出现的产品数目，是推荐列表中相关产品的数目。In formula (10), where is the number of products that co-occur in the sorted list and the test set, is the number of related products in the recommendation list.

平均排序倒数是第一个排序正确的产品的逆的连乘，计算公式为：The average sorted reciprocal is the multiplication of the inverse of the first correctly sorted product, calculated as:

3)在标准数据集上进行实验3) Experiment on a standard dataset

为了验证本发明所提模型的有效性，我们将本发明提出的用于群推荐的双向张量分解(BTF-GR)模型和4种基准方法进行比较，4种基准方法为：基于用户的协同过滤(UserCF)方法—基于用户的最近邻(UserKNN)算法，基于产品的协同过滤(ItemCF)方法—基于产品的最近邻(ItemKNN)算法，基于隐反馈的矩阵分解(IMF)方法，基于矩阵分解的贝叶斯个性化排序(BPRMF)方法。在CiteULike数据集和Last.fm数据集上用5种方法进行建模和推荐，并将推荐结果进行比较。实验结果如图4a、图4b、图4c所示。与4种基准方法相比，本发明提出的群推荐方法在Last.fm和CiteULike数据集上都获得了更优的推荐精度，且对于稀疏的隐反馈数据，本发明的性能优势更加明显。In order to verify the effectiveness of the proposed model in the present invention, we compared the Bidirectional Tensor Decomposition (BTF-GR) model proposed in the present invention for group recommendation with 4 benchmark methods. The 4 benchmark methods are: user-based collaborative Filtering (UserCF) method - user-based nearest neighbor (UserKNN) algorithm, product-based collaborative filtering (ItemCF) method - product-based nearest neighbor (ItemKNN) algorithm, matrix factorization (IMF) method based on implicit feedback, matrix factorization-based The Bayesian Personalized Ranking (BPRMF) method. Modeling and recommendation are carried out with 5 methods on the CiteULike dataset and Last.fm dataset, and the recommendation results are compared. The experimental results are shown in Figure 4a, Figure 4b, and Figure 4c. Compared with the four benchmark methods, the group recommendation method proposed by the present invention has achieved better recommendation accuracy on both the Last.fm and CiteULike datasets, and the performance advantage of the present invention is more obvious for sparse implicit feedback data.

为了检测个性化权重对于本发明所提模型的影响，我们通过为所有个体设置统一的固定权重构造了一个变体模型，将其作为对照实验。图5a、图5b、图5c中实线表示本发明的双向张量分解(BTF-GR)模型的结果，虚线是变体模型的结果。结果显示在CiteULike数据集和Last.fm数据集上，双向张量分解(BTF-GR)模型总是优于具有固定权重的变体模型。这个结果说明，用户偏好和群体影响在个人偏好的形成过程中发挥不同的作用，而双向张量分解(BTF-GR)模型可以捕获这种差异，从而获得更好的推荐精度。In order to test the influence of personalized weights on the proposed model of the present invention, we construct a variant model by setting uniform fixed weights for all individuals, and use it as a control experiment. The solid lines in Fig. 5a, Fig. 5b and Fig. 5c represent the results of the bidirectional tensor decomposition (BTF-GR) model of the present invention, and the dashed lines represent the results of the variant model. The results show that the Bidirectional Tensor Factorization (BTF-GR) model always outperforms the variant with fixed weights on both the CiteULike dataset and the Last.fm dataset. This result shows that user preference and group influence play different roles in the formation of individual preference, and the Bidirectional Tensor Factorization (BTF-GR) model can capture this difference to obtain better recommendation accuracy.

为验证本发明所提模型的鲁棒性，以及了解与4种基准方法的鲁棒性对比情况，我们通过改变推荐列表长度和群体规模，分别设计了2个实验组进行验证。实验结果显示本发明的双向张量分解(BTF-GR)模型在推荐列表长度和群体规模上都具有具有较好的鲁棒性，且推荐精度始终优于4种基准方法，同时当推荐列表越长，本发明的双向张量分解(BTF-GR)模型性能越优，即本发明适用于群推荐环境。In order to verify the robustness of the model proposed in the present invention and understand the robustness comparison with the four benchmark methods, we designed two experimental groups for verification by changing the length of the recommendation list and the group size. The experimental results show that the bidirectional tensor decomposition (BTF-GR) model of the present invention has good robustness in the length of the recommendation list and the size of the group, and the recommendation accuracy is always better than the four benchmark methods. Longer, the better the performance of the bidirectional tensor decomposition (BTF-GR) model of the present invention, that is, the present invention is applicable to the group recommendation environment.

Claims

1. A group recommendation method based on a two-way tensor decomposition model, characterized in that it is carried out according to the following steps:

Step 1. Define an interaction relationship representing group G, user U and product I: In the interaction relationship DS, G={G ₁ ,...,G _g ,...,G _|G| } represents a group set, G _g represents any gth group, 1≤g≤|G|; U＝{U ₁ ,...,U _u ,...,U _|U| } represents the set of users, U _u represents the uth user; 1≤u≤|U|; I＝{I ₁ ,.. .,I _i ,...,I _|I| } means the product set, I _i means the i-th product, 1≤i≤|I|;

Step 2: Use formula (1) to construct the tensor decomposition model of the i-th product for the u-th user in the g-th group

{\overset{^^}{z z}}_{g g,, u u,, i i} = = {ω ω}_{u u}^{U u} \cdot &Center Dot; {Σ Σ}_{l l = = 11}^{{k k}_{u u}} {U u}_{u u,, l l} \cdot \cdot {I I}_{i i,, l l}^{U u} + + {ω ω}_{u u}^{G G} \cdot \cdot {Σ Σ}_{m m}^{{k k}_{g g}} {G G}_{g g,, m m} \cdot \cdot {I I}_{i i,, m m}^{G G} + + {b b}_{i i} - - - - - - ((11))

In formula (1), Indicates the individual preference of the u-th user in the g-th group to the i-th product; Indicates the uth user's individual preference for himself Impact, Indicates the individual preference of the g-th group to which the u-th user belongs to the u-th user , b _i represents the deviation of the i-th product; Represents the weight of the individual characteristics of the uth user to the uth user; U _u,l represents the lth hidden variable of the uth user, and k _u represents the number of hidden variables of the uth user; Indicates the l-th latent variable of the i-th product interacting with the u-th user; Indicates the weight of the group characteristics of the g-th group to which the u-th user belongs to the u-th user; G _g,m represents the m-th hidden variable of the g-th group, and k _g represents the number of hidden variables of the g-th group ; Represents the mth latent variable of the i-th product that interacts with the g-th group;

Step 3, using the Bayesian personalized ranking method to analyze the tensor decomposition model Perform an optimization solution to obtain the tensor decomposition model Each parameter value in;

Step 4. Use formula (2) to obtain the group preference of the g-th group for the i-th product Thus, the group preference of the gth group for all products is obtained:

{\overset{^^}{r r}}_{g g,, i i} = = Δ Δ (({\overset{^^}{z z}}_{g g,, u u,, i i})) - - - - - - ((22))

In formula (2), Δ( ) is the average aggregation function;

Step 4: Sort in descending order the group preferences of all products by the gth group, and select the products corresponding to the top N group preferences as a list of recommended products and push them to the gth group.

2. The group recommendation method according to claim 1, characterized in that, said step 3 is carried out as follows:

Step 3.1, utilize formula (3) to obtain described tensor decomposition model The objective function of

{f f}_{g g,, u u,, {i i}_{a a},, {i i}_{b b}} = = arg arg \underset{G G,, U u,, I I}{m m i i n no} \underset{g g &Element; &Element; G G}{Σ Σ} \underset{u u &Element; &Element; U u}{Σ Σ} \underset{{i i}_{a a} &Element; &Element; {I I}_{g g u u}}{Σ Σ} \underset{{i i}_{b b} &Element; &Element; I I \ \ {I I}_{g g u u}}{Σ Σ} ((- - l l n no σ σ ((\overset{^^}{{z z}_{g g}},, u u,, {i i}_{a a},, {i i}_{b b})) + + {λ λ}_{Θ Θ} | | | | Θ Θ | | {| |}^{22})) - - - - - - ((33))

In formula (3), Indicates the difference between the individual preference of the u-th user in the g-th group for the i- _a -th product and the i- _b -th product; i _a ∈ I _gu means that the _i -th product belongs to the g-th group The positive feedback set of the u-th user, the positive feedback set is the set of all products that have interacted with the u-th user in the g-th group; ib∈I\I _gu means that the i _b -th product belongs to the g-th group The set of negative feedback and missing values of the u-th user, the set of negative feedback and missing values is the set of all products that have not interacted with the u-th user in the g-th group; Represents the logistic function; Θ represents the tensor decomposition model The set of parameters in and has λ _Θ represents a regularization parameter;

Step 3.2, initialization parameter set Θ and regularization parameter λ _Θ ;

Step 3.3, traversing all the products in the positive feedback set of the uth user in the gth group, and in the process of traversing each product, from the negative feedback and missing feedback of the uth user in the gth group Arbitrarily select a product in the value set;

Step 3.4, using the stochastic gradient descent method to obtain the parameters U _u,l in the parameter set Θ, G _g,m , _Gradient of bi; in parameter with When is a fixed value, for the parameter U _u,l , G _g,m , b _i are updated iteratively until convergence, so as to obtain parameters U _u,l , G _g,m , The optimal value of b _i ;

Step 3.5, traversing all the products in the positive feedback set of the uth user in the gth group, and in the process of traversing each product, from the negative feedback and missing value set of the uth user in the gth group Choose any product from

Step 3.6, utilize stochastic gradient descent method to obtain the parameters in the parameter set Θ with Gradient; in parameter U _u,l , G _g,m , When b _i is the optimal value, for the parameters with Perform iterative updates respectively until convergence, so as to obtain the parameters with the optimal value of .