Group recommender systems (GRSs) identify items to recommend to a group of people by aggregating group members’ individual preferences into a group profile and selecting the items that have the largest score in the group profile. The GRS predicts that these recommendations would be chosen by the group by assuming that the group is applying the same preference aggregation strategy as the one adopted by the GRS. However, predicting the choice of a group is more complex since the GRS is not aware of the exact preference aggregation strategy that is going to be used by the group.
To this end, the aim of this article is to validate the research hypothesis that, by using a machine learning approach and a dataset of observed group choices, it is possible to predict a group’s final choice better than by using a standard preference aggregation strategy. Inspired by the Decision Scheme theory, which first tried to address the group choice prediction problem, we search for a group profile definition that, in conjunction with a machine learning model, can be used to accurately predict a group choice. Moreover, to cope with the data scarcity problem, we propose two data augmentation methods, which add synthetic group profiles to the training data, and we hypothesize that they can further improve the choice prediction accuracy.
We validate our research hypotheses by using a dataset containing 282 participants organized in 79 groups. The experiments indicate that the proposed method outperforms baseline aggregation strategies when used for group choice prediction. The method we propose is robust with the presence of missing preference data and achieves a performance superior to what humans can achieve on the group choice prediction task. Finally, the proposed data augmentation method can also improve the prediction accuracy. Our approach can be exploited in novel GRSs to identify the items that the group is likely to choose and to help groups to make even better and fairer choices.
1 Introduction
Recommender Systems (RSs) are information retrieval tools that help their users to make better decisions by suggesting items that are likely to meet their needs and wants [Ricci et al. 2022]. Group Recommender Systems (GRSs) are special types of RSs aiming at identifying items that, if experienced by a group of people, will satisfy all group members as much as possible [Masthoff and Delic 2022]. Group recommendations are constructed by algorithms that leverage preference aggregation strategies. For each item, they combine the item preference scores of the group members into a single score, for instance, by averaging the group members’ preference scores [Jameson 2004].
While the ultimate goal of a GRS is to generate useful recommendations for a group, the system may benefit from a component that, by relying on the knowledge of individual preferences, generates a prediction of the group’s more likely choice. That prediction may be used directly as a recommendation, helping the group to quickly converge to that decision. However, the group choice prediction can also be exploited for generating alternative and better choices. For example, the GRS could identify options that are similar to the predicted choice but fairer, which can be achieved, for instance, by reducing the variance of the group members’ predicted satisfaction scores.
In this work, we focus explicitly on the problem of group choice prediction, and we propose a machine learning–based solution that leverages a training data-set of observed groups. The groups are described by their group profiles, which are constructed with a preference aggregation strategy. We then aim at predicting the choices of groups not present in the training set. We note that while the group choice is typically the result of the group decision-making process, we aim at predicting it solely from the knowledge of the group members’ preference data, which are aggregated in the group profile. Our approach is inspired by the Social Decision Scheme (SDS) theory [Stasser 1999]. SDS theory builds a group profile, named group preferences composition, only on the base of the individual preferences (the group members’ preferred options). It assumes that a so-called Social Combination Process or Social Decision Scheme summarizes the relationship between the initial group preferences’ composition and the final group choice, that is, the collective response.
We note that classical preference aggregation techniques, which are used in GRSs, can generate group choice prediction: the item with the largest aggregated preference score is predicted as the choice of the group. Hence, they also follow the SDS assumption that the group choice solely relies on the individual preferences of the group members. However, they mechanically dictate the group choice by using hard-coded rules, such as “the item with the largest average individual rating must be the group choice”. Moreover, which preference aggregation strategy must be used for predicting a specific group’s choice is generally unknown since real groups may use a variety of different preference aggregation strategies [Masthoff 2004; Delic et al. 2018b; Forsyth 2018]. Conversely, SDS theory suggests to reconstruct the social decision scheme and the group choice by using observational data describing how, in groups, the members’ preferences determine the corresponding groups’ choices.
In that respect, by having at our disposal a dataset containing information about group members’ individual preferences and the corresponding post-interaction group choice, and by extending the SDS theory, we combine the heuristics of an aggregation strategy with a machine learning model (multinomial logistic regression) to predict the effect of the inter-group interaction. In other words, from a dataset of real groups containing the knowledge of the group members’ preferences and final group choices, we build a prediction model of the group’s choice from the group members’ individual preferences aggregated in a group profile by a preference aggregation strategy.
Moreover, in order to cope with the (typically) limited number of observed groups and their choices that are in the training set, we conjecture, by following standard machine learning approaches, that it is possible to improve the accuracy of the choice prediction model with two specifically designed data augmentation methods [Wong et al. 2016]. The objective is to bring some additional knowledge about typical decision-making behaviors in groups by enriching the training set with synthetic but likely to be observed groups (profiles) and their corresponding choices. In the first method, we add synthetic group profiles, which are called Winners, and represent cases in which all the group members prefer an option and the group (consequently) chooses that option. The second type of synthetic group is called Permutations. These groups have profiles (and choices) that are obtained from the profiles of real observed groups by making a permutation of the options and, accordingly, their scores. For instance, the group score of the first and second options in a real observed group are swapped in a permutation group. If the group choice, as assumed by SDS, depends only on the scores of the options in the group profile, the choice of the group with the permuted profile will be the option obtained by the permutation of the originally chosen option. Hence, in the example mentioned above, if in the original group, the choice was the first option, in the permuted group the choice must be the second option.
We have tested the effectiveness of our learning approach for group choice prediction on a dataset describing the preferences and the choices of 282 participants organized in 79 real groups while deciding on which travel destination to visit together. The precise formulation of our research hypotheses is in Section 3.2; we summarize the main results here. We show that the proposed learning approach, Learning-based Choice Prediction (LCP), generates significantly better predictions of the groups’ choices in comparison with those based on classical preference aggregation strategies (Preference Aggregation-based Choice Prediction[PACP]). That result holds even when only partial information of the group members’ preferences is available, i.e., when the system misses some group members’ ratings. Our method also has a better prediction accuracy compared with what is achieved by humans when, after having observed the group members’ preferences, they predict the likely choice of the group. Moreover, we show that by using data augmentation (winners and permutations), the prediction performance can be improved and the predicted distribution of group choices can be made more similar to the observed distribution of the group choices. In summary, the main contributions of this article are:
–
A novel learning approach (LCP) to predict group choices given the knowledge of the group members’ preference that significantly outperforms the accuracy of baseline preference aggregation strategies (PACP) that is robust with respect to missing preference data and that also outperforms human-based group’s choice prediction.
–
A data augmentation method that, by adding synthetic group profiles (winners and permutations), improves the group choice prediction accuracy of LCP and makes the distribution of the predicted group choices more similar to the observed one (ground truth).
We stress the practical importance and value of the obtained results. The prediction of a target group choice can be used by a GRS to indicate which option is the current inclination of the group, hence helping the group to quickly come to such a decision. Moreover, by having the knowledge of the likely choice of the group, the GRS can leverage this information to generate other recommendations, for instance, presenting items similar to the predicted choice but with additional important properties, for instance, items that are more novel or fairer choices. Hence, we believe that our results can open the research on novel and effective group recommendation techniques, especially conversational ones, which can greatly benefit from the prediction of the likely choice of a group to better interact with the group members in supporting their decision-making process.
The rest of the article is structured as follows. In Section 2, we provide an overview of the related work on GRSs. In Section 3, we compare the proposed approach with the state-of-the-art and formulate our research hypotheses. In Section 4, the group profile generation mechanism and the choice prediction learning approach are elaborated in detail. In Section 5, data augmentation is discussed and our approach based on the generation of synthetic group profiles is presented. In Section 6, the evaluation procedure is explained. In Section 7, the results supporting our two research hypotheses are presented. Finally, in Section 8, we summarize the article’s contribution, discuss limitations of our approach, and indicate lines of future work.
2 State-of-the-art
In Sections 2.1, 2.2, and 2.3, we first survey approaches that deal with the construction of the group profile that have been developed in the GRSs literature. Then, in Section 2.4, we present an approach to group profile generation, the SDS, specifically introduced for predicting the group choice.
2.1 GRS Based on Group Profile
GRSs are designed to find items whose joint experience in a target group would be satisfactory for all group members [Masthoff and Delic 2022; Felfernig et al. 2018]. Group recommendation methods can be divided into two main classes: combining recommendations and combining user profiles. In the first class, recommendations are first generated for each group member; then, based on the individual recommendations, group recommendations are selected. In the second class of methods, a group profile is first constructed by using preference aggregation techniques applied on the group members’ individual profiles (preferences for the items, e.g., ratings). Then, group recommendations are generated on the basis of the constructed group profile. In this section, we focus on the second class of methods, profile aggregation, as our choice prediction approach leverages this construct.
Figure 1 shows the general schema used by GRSs to build a group profile and then group recommendations. There are two main paths to create a group profile: by using a preference aggregation strategy and by using Machine Learning (ML) techniques. The approaches based on the preference aggregation strategies take as input the individual preferences of the group members (individual ratings) and construct for each group a profile by relying on a selected preference aggregation strategy. Then, a recommendation algorithm leverages the target group profile, together with other group profiles, to generate recommendations. Conversely, ML methods create a group profile by taking as input individual ratings but also the available group choices or group scores (also called ratings by some authors), which indicate to what extent a set of observed groups like some evaluated options. Next, an ML model produces an Embedded Group Profile, which is defined by hidden features of the group. In fact, generating group profiles is not the main goal of the ML models; this is a by-product of the method used to predict the group scores. Hence, while the group profile generated by preference aggregation strategies contains the estimated group scores for the options, the group profile constructed by ML models contains latent features, and the value of each entry indicates the estimated importance of that dimension for the group representation. In both approaches, a group recommendation algorithm uses the generated group profiles as input to compute recommendations, which are obtained by exploiting a range of different solutions, such as collaborative filtering or neural networks.
Fig. 1.
In the following sections, we will detail the two approaches to group profile generation, i.e., the one based on preference aggregation and the one leveraging ML.
2.2 Preference Aggregation Strategies
Group modeling or creating group profiles is likely the most essential aspect of aggregation-based GRSs. A large part of the research on GRSs has been dedicated to understanding how group members’ individual preferences are (descriptive) or should be (normative) aggregated to create group profiles. In Masthoff [2004] a number of preference aggregation strategies, motivated by Social Choice theory, are described.
–
Additive - Group members’ individual ratings for each item (if available) are summed up to create a vector of group scores, one score for each item. Possible implementations of the additive strategy calculate the mean value (i.e., average strategy) or the median value (i.e., median strategy) of the individuals’ ratings.
–
Borda count - Each group member creates a ranked list of options according to one’s preferences; points are assigned to options, separately for each individual, based on the position of an option in a list (i.e., the last option gets zero points, the second to the last receives one point, etc.); a group score for an option is calculated as the sum of the individually assigned points.
–
Multiplicative - Individuals’ ratings are multiplied to create group scores.
–
Least misery - An item’s group score is the minimum of the individuals’ ratings for the item; the strategy assumes that a group is as satisfied as its least satisfied member.
–
Copeland Rule - An item’s group score is equal to the number of times that option beats other options minus the number of times it loses with respect to the other options. Option \(i_1\) beats option \(i_2\) in a group if the number of group members who prefer \(i_1\) to \(i_2\) is larger than the number of members who prefer \(i_2\) to \(i_1\).
–
Majority: Group score for each option is equal to the number of group members that have chosen the option as their individual or group choice.
Additive-based aggregation strategies typically treat all group members as equally important. However, there are situations in which certain group members may have different levels of importance. Weighted sum or weighted average aggregation strategies can be used to address this problem. For instance, Ardissono et al. Ardissono et al. [2003] used the weighted sum aggregation strategy to construct a group profile that reflects the importance of individual group members. In this approach, each member is assigned a different weight to reflect each individual’s importance in constructing the group profile. Another type of aggregation strategy is the distance-based strategy [Zhiwen et al. 2005], which aims to minimize the total distance between the constructed group profile and the individual profiles of group members. In other words, given individual preferences (for instance, individual ratings), the distance-based aggregation strategies create the group profile that minimizes the total distance of the constructed group profile from the individual group member’s profile.
Another research direction for constructing group profiles uses individual rankings, instead of ratings. Similar to the distance-based strategies mentioned earlier, these methods aim to create a group profile that minimizes the constructed group profile’s (ranking) distance from individual preferences (rankings). For instance, Cook and Seiford [1978], propose a method for constructing group profiles that minimize the total distance to the profile (ranking) of individual members. Additionally, Dong et al. [2021] argue that group members may sometimes categorize potential options into approved (items they would like to consume) and disapproved (items they would not consume) groups individually. They introduced the concept of a preference-approval structure, which combines ranking and approval data to incorporate approved and disapproved alternatives during preference modeling. The authors proposed a group preference aggregation model that minimizes the total distance to individual preference-approval structures.
2.3 Machine Learning Models
Standard preference aggregation strategies and their extensions construct group profiles in a pre-defined and mechanical way. As a consequence, the constructed group profiles may not be optimal in different contexts. For instance, different groups might employ different strategies in order to reach satisfying decisions. Moreover, even when the same group is deciding on different options, they might employ different techniques to consider and evaluate these options. To overcome this problem, ML-based variants propose more adaptive models.
Cao et al. [2018] propose a method that learns a group profile by using an attentive neural network based on existing individual user–item as well as group–item interactions 1. Hence, in this case, aside from individual user preferences, group preferences must also be available to the model. The constructed group profiles are used by another ML component for generating recommendations. The performance of this approach deteriorates when applied to ephemeral groups (created for one event only or, put in other words, groups for which there is only one group–item interaction entry) [Sankar et al. 2020]. To overcome this problem, Sankar et al. [2020] proposed an alternative solution. Here, the attention mechanism is extended with another neural approach that maximizes the Mutual Information between the group and its members. This assigns a greater weight to a user in the group profile when the user’s current group is more similar to the ephemeral groups that the user belonged to in the past. We note that these methods follow the upper workflow indicated in Figure 1. They utilize known individual as well as group preferences (stored in a dataset) for constructing group profiles and predicting group scores for items.
2.4 Social Decision Scheme
In the previous subsections, we have focused on the generation of a proper representation of the group preferences, called group profile, and the corresponding recommendation methods. However, as we mentioned in the introduction, when the goal is to support group decision-making, it could be useful to predict the current inclination of a group for a specific option (group choice).
The previously discussed preference aggregation strategies can be used to make predictions about the group choice: the option with the largest score after the strategy is applied is the choice that the group should make if the group adopts that strategy. However, it is worth mentioning that the main focus of these methods is not to predict what a group would choose given the individual preferences of the group members but to find “the best” option for the group under certain constraints or goals that the aggregation mechanism aims for [Sen 1977]. In other words, the methods indicate what the group should choose in order to fit certain constraints and goals.
To the best of our knowledge, the SDS theory, which is originally presented by Stasser [1999] and also discussed by Friedkin and Johnsen [2011], is unique in explicitly focusing on the group choice prediction problem. The SDS models how the collective response (group choice), which is the result of possibly complex inter-group interactions that happen during the group decision-making process, can be generated by relying only on individual preferences. Here, the group choice might not be the “best” option according to a predefined mechanical approach; rather, it might be the choice that the group reached according to their internal agreements. The SDS proposes a set of basic modeling elements: (i) individual preferences; (ii) distinguishable distribution of group members’ preferences, which corresponds to the group profile in the terminology used in this work as well as in GRSs; (iii) patterns of group influence (decision scheme); and (iv) collective responses (group choice). The theory states that individual preferences are the ingredients of the group profile (distinguishable distribution), and consensus processes act on such a group profile to yield a collective response (group choice) [Stasser 1999]. The SDS claims that the social decision scheme (patterns of group influence), which describes the association between group profiles and the possible group responses (group choice), should be learned from data. Hence, in SDS theory, the group profile is generated from individual preferences, as in classical GRSs, but then an adaptive social decision scheme model, which could be learned from available data or is given a priori, is applied to yield the group response (group choice).
In Stasser’s original approach, the entries of a group profile are constructed by counting the number of times an option has been indicated as the most preferred by the group members. Such a group profile is called the distinguishable distribution (of the group preferences). In this method, there are three essential elements. The first is the set P, which contains all possible distinguishable distributions (group profiles). The number of possible distinguishable distributions, when n is the number of available options and r is the group size, is \(_{(n+r-1)}C_r\), where \(_mC_r\) is the binomial coefficient (the number of combinations, or the number of ways that r elements can be drawn out from a set of m elements). For instance, if a group of 2 members (\(r = 2\)) considers 2 options (\(n=2\)), one can observe \(_3C_2 = 3\) distinguishable distributions: (1) the two members prefer the first option; (2) both members prefer the second option; or (3) one member prefers one option and the other member the other option. The second element of the model is the \(\pi\) vector, whose elements are the probabilities of observing each of the distinguishable distributions (group profiles) given another vector describing the a priori probabilities to observe the preference of individuals for the options. The last element of Stasser’s model is the D matrix, i.e., the social decision scheme matrix. The rows of this matrix correspond to the different group profiles and the columns correspond to the possible group responses (choices). Each entry \(D_{ij}\) of the matrix is the probability that the i-th group profile leads to the j-th response. Stasser proposes that this matrix should be learned from data. A severe limitation of Stasser’s model is that the number of distinguishable distributions (possible group profiles) grows rapidly with the number of options and fitting the matrix D becomes practically impossible.
3 Research Gap and Hypotheses
3.1 Discussion of the State-of-the-Art
Inspired by the SDS theory, we are tackling the group choice prediction task, i.e., to estimate what a group would actually choose from a limited set of options, by learning the choice function from a dataset of observed group choices. The schema of our approach is shown in Figure 2. Hence, the input of the proposed learning process is a set of groups, with the information of the members’ individual ratings of the options and the corresponding groups’ choices. The result is a predictive model that, given a group profile, predicts the likely group choice. The proposed model can treat ephemeral groups; it does not use any possibly available information that an individual is part of more than one group and any additional information about group members and groups, such as demographic or role data.
Fig. 2.
In our work, we generalize the distinguishable distribution model of a group members’ preferences that is introduced by the SDS. We build a diverse set of potentially usable group profiles by applying a range of preference aggregation strategies. The goal is to test whether the specific strategy used for aggregating the preferences of the group members may have an impact on the quality of the choice prediction. In our generalization of SDS theory, we also deal with the case that some individual preferences may not be available. Hence, we deal with missing data in the construction of group profiles. Moreover, we follow the original SDS idea that the association between group profile and group response (group choice) should be learned from data. However, we introduce a method to directly predict the group choice, without approximating the social decision scheme matrix D. The proposed learning method can be used with any vector-based representation of the group profile. We believe that our extension can also benefit other applications of SDS theory.
Classical preference aggregation strategies presented in Section 2.2, as we previously noted, can be used for predicting a group choice as well. However, these strategies operate in a mechanical way and are not able to learn how the group choice depends on the computed group profile. Our method can achieve a better prediction accuracy because it learns the effect of inter-group interactions, as a map between the preferences, aggregated into a group profile, and the final group choice. For that reason, we apply ML to a range of diverse types of group profiles, which are computed by preference aggregation techniques, to understand what group profile may be better used to learn the map between profile and choice.
Similar to the ML-based models that we have surveyed in Section 2.3, our approach uses a training set of groups and choices. However, unlike them, we predict the group choice, not the group scores for the items. We believe that group scores are artificial concepts. In reality, a group makes a choice and group members do not need to agree on their joint/common ratings of choices. We also note that the presented ML-based models use very sparse individual preferences of users for a huge set of items and one of these approaches makes use of repeated evaluations of groups for items. Conversely, our method requires more dense individual preference data but it is able to work with ephemeral groups with only one observed choice for each group.
3.2 Research Hypotheses
In order to evaluate our proposed approach, we are interested in validating two hypotheses related to the task of predicting a group choice, over a limited set of options, after the group members have formulated their individual preferences and have interacted to make a joint decision to select one of the options (choice).
H1
By using a proper ML model trained on a dataset of group choices, it is possible to effectively predict the choice of a group by using only the individual preferences of the group members, encoded in a group profile.
With this hypothesis, we aim to evaluate the general validity of the SDS theory, and our extension, in predicting a group choice. In fact, our objective is to examine whether it is possible to learn the social influence that happens during the group discussion utilizing solely individuals’ preferences and groups’ choices. To the best of our knowledge, this article is the first attempt to validate the SDS theory using a real-world dataset.
To perform a quantitative assessment of this hypothesis, we will compare the proposed approach to the choice prediction that preference aggregation strategies can generate. We hypothesize to be able to improve the prediction accuracy of these aggregation strategies. Moreover, the model proposed in Stasser [1999] assumes the existence of all individual group members’ ratings. However, in reality, in most cases, the user–option matrix is sparse. Therefore, we also aim to examine the validity of this hypothesis when the user–option matrix is sparse. Finally, to make a further quantitative comparison of the achieved prediction quality, we conduct a user experiment in which humans are requested to predict the choices of the same groups by only knowing the group members’ preferences. We hypothesize that our method is competitive with the performance of human group choice prediction.
When trying to create a new model for recommending items to groups or predicting group choices, one faces the challenge of limited data availability. The majority of existing datasets are either small in size or lack clarity regarding their collection procedure, the assignment of group scores, and the actual decision-making procedures followed by the groups. To this end, we hypothesize the following:
H2
In order to cope with the data scarcity of group choices, it is possible to use data augmentation, relying on synthetic group profiles and their choices, and further improve the quality of the proposed group choice prediction method.
The precise data augmentation method that we propose is based on artificially introducing, only in the training set, groups along with their individual members’ preferences, and group choices, that are not recorded in the dataset. These synthetic groups, even if not actually observed, are likely to be “observable”. For instance, even if in the dataset there is not a group in which all group members prefer one option to the other options, it can be assumed that if such a group exists in the dataset, the individually preferred option will also be the group choice.
In order to test the introduced hypotheses, we reuse a dataset of group choices generated in a previously performed user study. Then, we conduct a set of extensive experiments. Both aspects are explained in detail in Section 6. In the following section, we provide a comprehensive description of the proposed ML-based choice prediction model and its parts.
4 Group Choice Prediction Model
4.1 Group Profiles
We start with the definition of the important notations and the precise problem formulation. Let \(U = \lbrace u_1, \ldots , u_m\rbrace\) be a set of users, \(O = \lbrace o_1, \ldots , o_n\rbrace\) a set of items or options, and \(R = (r_{i,j})\) the \(m \times n\) user–option rating matrix: \(r_{i,j}\) is the rating (non-negative real number) given by user \(u_i\) to option \(o_j\). All notation utilized in the current and subsequent sections can be found in Table 1. To simplify the notation, we will refer to user \(u_i\) as the i-th user, or even as user i. The same approach will be used for options. The rating matrix is in general partially defined, i.e., some of the ratings may be unknown. The user \(u_i\)’s profile, \(\boldsymbol {u}_i\), is a real-valued vector formed by the \(u_i\)’s ratings:
Group g’s profile vector constructed using the AG aggregation strategy
\(f^{AG}\)
A function for calculating group profile using AG aggregation strategy
\(c(g)\)
Actual choice of g
\(c^{*} (g)\)
Predicted choice of group g
\(\mathcal {G}\)
A set of tuples, each of which consists of a group profile and group choice
\(G_{\text{train}}\)
A subset \(\mathcal {G}\) used for training the model
\(G_{\text{test}}\)
A subset \(\mathcal {G}\) used for testing the model
\(\sigma (\cdot)\)
A permutation function
Table 1. List of Notations Used in This Article
Table 2 shows an example of a rating matrix. In this example, 4 users are present, \(u_1\), \(u_2\), \(u_3\), \(u_4\), and 10 options are listed: \(o_1, \ldots , o_{10}\). Each number in the table indicates the corresponding user–option rating. In this example, all possible users’ ratings are known: the matrix is complete. Throughout the remainder of the article, this user–option ratings matrix example (Table 2) will be utilized to illustrate the proposed techniques.
Table 2.
Member
\(o_1\)
\(o_2\)
\(o_3\)
\(o_4\)
\(o_5\)
\(o_6\)
\(o_7\)
\(o_8\)
\(o_9\)
\(o_{10}\)
\({u}_1\)
6
9
4
8
5
2
7
1
10
3
\({u}_2\)
7
6
4
1
2
10
3
8
9
5
\({u}_3\)
1
10
3
5
9
6
8
7
2
4
\({u}_4\)
6
8
3
9
1
5
7
2
10
4
Table 2. Example of a Rating Matrix of Users \(u_1\), \(u_2\), \(u_3\), and \(u_4\)
10 Options \(\lbrace o_1,\ldots , o_{10}\rbrace\) and their Ratings, Given in a 1-10 Rating Scale.
A group profile is an aggregated representation of the preferences of the group members. It is a real vector of the same dimensionality as the group members’ profiles and the value of each entry indicates the “importance”, or “score”, of the corresponding option for the group. A group profile can be obtained by applying a preference aggregation strategy to the group members’ profiles. Let \(g = \lbrace u_1, u_2,\ldots , u_{|g|}\rbrace \subset U\) be a group. We denote with \(\boldsymbol {g}\) a group profile of g. We will also denote with \(\boldsymbol {g}^{AG}\) a profile of g when we want to make evident the preference aggregation strategy AG used to generate the group profile:
where \(f^{AG}\) is a preference aggregation strategy function; \(r_{g,j}\), the group score for option j, is a non-negative real number. In Section 2.2, we have introduced the Average (AVE), Multiplicative (MULT), Least Misery (LM), and Copeland Rule (COPE) preference aggregation strategies. In AVE, the group g score for option j, i.e., \(r_{g, j}\), is the average of group members’ ratings \(\lbrace r_{u_{1}, j}, \ldots , r_{u_{|g|}, j}\rbrace\) for that option j. In MULT, the group score for an option is the multiplication of group members’ ratings for that option. In LM, the group score is the minimum rating of the group members’ ratings for that option. We now define the COPE-based profile, the original Stasser group profile [Stasser 1999], and a generalization of that profile.
Copeland Rule (COPE). For calculating the group profile of g according to the COPE, one needs to compute a real \(n \times n\) matrix \(M^g = (m_{i,j}),\) where each option in O corresponds to a column and a row in this matrix. Each entry of this matrix is computed with the following formula:
Stasser group profile (SDS1) and generalized version (SDS3). As we have discussed in Section 2.4, the Stasser model [Stasser 1999] for predicting a group choice is based on a group representation that counts, for each option, the number of group members that prefer that option (to the others). We call this preference aggregation strategy SDS1. We also consider a straightforward generalization of this approach, named SDS3, for which the group’s score for an option is obtained by counting the number of times that option is among the top three preferred options of the group members. SDS2, SDS4, and SDSn can be defined in this manner as well. However, in order to simplify the analysis of the results, in the following we will only consider SDS3.
4.2 Unknown User Ratings and Normalization
It is worth noting that some entries of the rating matrix R might be unknown, i.e., one or more group members may have not rated an option. In such cases, which are not considered in the original SDS formulation, the group score for that option is computed by using only the available group members’ ratings for the option. When none of the group members has rated an option, one can label the group score of that option as “unknown”.
Finally, after applying any preference aggregation strategy, the group profile is normalized so that the sum of its entries is 1. Hence, if \(\boldsymbol {g} = ({r}_{g,1}, \ldots , {r}_{g,n})\) is a group profile obtained by any preference aggregation strategy, then, after normalization, the entries of this vector are replaced with
For instance, Table 3 shows group profiles of the group \(g=\lbrace u_1, u_2, u_3, u_4 \rbrace\) whose individual ratings are shown in Table 2. In this example, \(o_2\) is the preferred option of \(u_3\), since \(r_{3,2} = 10\), \(o_6\) is the preferred option of \(u_2\) for the same reason, and \(o_9\) is the preferred option of \(u_1\) and \(u_4\). Then, for example, in the SDS1-based group profile, before normalization, the group score for options \(o_2\) and \(o_6\) is 1, and for \(o_9\) is 2. After normalization, the group scores of \(o_2\) and \(o_6\) are 0.25 and the group score of \(o_9\) is 0.5, and the group profile is \(\boldsymbol {g}^{SDS1} = (0, 0.25, 0, 0, 0, 0.25, 0, 0, 0.5, 0)\).
Table 3.
group profile
\(o_1\)
\(o_2\)
\(o_3\)
\(o_4\)
\(o_5\)
\(o_6\)
\(o_7\)
\(o_8\)
\(o_9\)
\(o_{10}\)
\(\boldsymbol {g}^{AVE}\)
0.09
0.15
0.06
0.1
0.07
0.1
0.11
0.08
0.14
0.07
\(\boldsymbol {g}^{MULT}\)
0.02
0.47
0.01
0.03
0.009
0.06
0.12
0.01
0.19
0.02
\(\boldsymbol {g}^{LM}\)
0.04
0.26
0.13
0.04
0.04
0.08
0.13
0.04
0.08
0.13
\(\boldsymbol {g}^{SDS1}\)
0
0.25
0
0
0
0.25
0
0
0.5
0
\(\boldsymbol {g}^{SDS3}\)
0
0.25
0
0.16
0.08
0.08
0.08
0.08
0.08
0.25
\(\boldsymbol {g}^{COPE}\)
0.08
0.2
0
0.11
0.04
0.11
0.15
0.042
0.21
0.02
Table 3. Normalized Group Profiles Calculated by Using the Preference Aggregation Strategies AVE, MULT, LM, SDS1, SDS3, and COPE
Users’ ratings are as in Table 2 and the group is \(g=\lbrace u_1, u_2, u_3, u_4\rbrace\). Each number in the rows of this table is the group score for the column item, employing the preference aggregation strategy indicated in the first column of the row.
A preference aggregation strategy can be used to predict a group choice: the option that, according to the selected preference aggregation strategy, has the largest score. Here, this approach is considered as a baseline method. Hence, if \(\boldsymbol {g}^{AG} = (r_{g,1}, \ldots , r_{g,n})\) is the g’s group profile calculated by the preference aggregation strategy AG, then the PACP method predicts that the group with profile \(\boldsymbol {g}^{AG}\) will choose the option
We use the star notation, \(c^{*} (g)\), to indicate the predicted choice, whereas the actual choice is denoted with \(c(g)\). We also stress that PACP operates on a group profile built by using a preference aggregation strategy. Hence, when it is needed, we will use a distinct notation, such as PACP-AVE, to explicitly indicate the preference aggregation strategy (AVE) used in the prediction.2
4.3.2 Learning-Based Choice Prediction—LCP.
PACP is rigidly employing a preference aggregation strategy when predicting a group choice. Motivated by the SDS, we conjecture that the choice can be better predicted by leveraging the analysis of patterns of user preferences encoded in a group profile, which is constructed by using a preference aggregation strategy. Hence, after the construction of the group profile, based on the group members’ ratings and a selected preference aggregation strategy, we use an ML classifier to predict the actual group choice.
In order to implement this idea, given a set of groups G, we need for each group \(g \in G\): the group members’ individual preferences to be aggregated in a group profile \(\boldsymbol {g}\), and the observed choice made by the group, i.e., \(c(g)\). Then, this dataset \(\mathcal {G} = \lbrace (\boldsymbol {g}, c(g)): g \in G\rbrace\) is given as input to an ML algorithm, which in this article is Multinomial Logistic Regression, that generates a prediction \(c^*(g)\) of the actual group choice \(c(g)\). Then, the group choice is a class variable, taking value in the set of all the available options O. Thus, the group choice prediction problem is translated into a classification problem: given a training set consisting of group profiles in \(\mathcal {G_{\text{train}}}\), where the group choice is known, after having trained the ML classifier on that set, the classifier can predict the choice \(c^*(g^{\prime })\) of a group \(g^{\prime }\) in a test set \(\mathcal {G_{\text{test}}}\).
Similar to what was said for PACP, we stress that LCP operates on a set of group profiles built by using a preference aggregation strategy. Consequently, when needed, we will use a distinct notation, such as LCP-AVE, to explicitly indicate the used preference aggregation strategy (AVE).
5 Data Augmentation with Synthetic Group Profiles
In this section, we face the data scarcity problem by presenting two data augmentation methods. LCP group choice prediction is trained on a dataset of observed groups along with their members’ preferences and the final choice of the group. Clearly, a larger training dataset will give to the model more information about the target choice function to be learned. However, often the training data set is small, as the dataset used in our experiments, which contains only 79 groups. This is a limiting factor for any ML model. While the minimum number of the required sample size depends on different factors, in Lakshmanan et al. [2020] it is claimed that a reasonable minimum number of instances for a classification problem with c classes, where instances are described by f features, is \(10 f c\). Thus, for instance, in the experiments conducted in this article, \(f = c = 10\), which means that a minimum of 1000 groups would be needed, whereas our dataset is one order of magnitude smaller.
To address this problem, we make some assumptions about the functional relationship between the group profile and the group choice. These assumptions are then motivating the creation of synthetic groups that were not actually observed but should be observed, given the assumptions. The first assumption on the choice function is that in a group in which all group members prefer the same option, that option would be chosen by the group. Hence, even if groups in which all group members prefer a single option were not observed, we introduce in the training set synthetic groups of this type and set as group choice the option that is preferred by all. The second assumption is that the order of the options in the vector representation of the group profile is not relevant for the choice function. Instead, it is assumed that only the relative scores of options in the vector representation of the group profile impact the choice function. Hence, assume that we have observed a group g that chose the first option. Then, if a group \(g^{\prime }\) has a group profile that is a permutation of the profile of group g, i.e., the same scores but in a different order, then the choice of \(g^{\prime }\) must be the option that by the permutation corresponds to the first option in the group g profile.
We are therefore following a data augmentation approach, expanding the training dataset by adding synthetic groups (group profile and group choice) that could improve the choice prediction function on the test set. We generate two sets of synthetic group profiles, Winners and Permutations, which are presented below.
The idea of using synthetic variations of the training examples, such as those introduced and presented below, comes from similar data augmentation methods often used in ML. Data augmentation is, in fact, a technique to increase the diversity and the coverage of the training set by applying random, however realistic, transformations when the dataset is not large enough for a trained model to generalize well [Antoniou et al. 2018; Shorten and Khoshgoftaar 2019].
Winners. The first set of synthetic group profiles is called Winners and contains one group profile \(\boldsymbol {g}_{j} = (r_{g_j,1}, \dots , r_{g_j,n})\) for each option \(j \in O\). A group in the Winners set has profile scores concentrated on a single option that is also assumed to be the group choice:
Table 4 shows three examples of Winner group profiles when \(|O| = 10\). The motivation for adding such possibly missing observations is that they comply with a fundamental axiom of social choice: if all group members prefer the same option, this must be the group choice. The addition of these synthetic profiles could ease the training of the ML algorithm that predicts the group choice, as it explicitly adds knowledge that the predictive model might not be able to extract from the available data.
Table 4.
Group
\(o_1\)
\(o_2\)
\(o_3\)
\(o_4\)
\(o_5\)
\(o_6\)
\(o_7\)
\(o_8\)
\(o_9\)
\(o_{10}\)
Group choice
\(\boldsymbol {g}_1\)
1
0
0
0
0
0
0
0
0
0
\(o_1\)
\(\boldsymbol {g}_2\)
0
1
0
0
0
0
0
0
0
0
\(o_2\)
\(\ddots\)
\(\boldsymbol {g}_{10}\)
0
0
0
0
0
0
0
0
0
1
\(o_{10}\)
Table 4. Winners Group Profiles
In each profile, there is only one option with a group score equal to 1, whereas the remaining scores are zero. The option with group score 1 is the group choice.
Permutations. The second set of synthetic group profiles is based on an assumption that is at the base of the SDS theory: the group choice should not depend on the option itself; rather it should depend on the relative scores of options in the group profile. Hence, the Permutations profiles are generated by cloning existing profiles and rearranging the order of the options, their scores, and the group choice accordingly. In mathematics, a permutation of a list is a change in the ordering of the elements of the list. For instance, \((1, 3, 2)\), \((2, 1, 3)\), \((2, 3, 1)\), and \((3, 1, 2)\) are all the permutations of the ordered list \((1, 2, 3)\). Given, for instance, a group profile \(\boldsymbol {g} = (r_{g,1}, r_{g,2}, r_{g,3})\) with 3 options, a permutation of this profile could be \(\boldsymbol {g}^{\prime } = (r_{g,2}, r_{g,1}, r_{g,3})\), where the first and the second scores are swapped. Let us now assume that the group profile \(\boldsymbol {g}\) belongs to a group g that chose the first option. If this choice is determined by the relative values of the scores \((r_{g,1}, r_{g,2}, r_{g,3})\), then one can assume that for a group with profile \(\boldsymbol {g}^{\prime } = (r_{g,2}, r_{g,1}, r_{g,3})\) the choice will be the second option, which has exactly the same score as the first option in the profile \(\boldsymbol {g}\). To give another example, consider the group profiles in Table 4. The profiles \(\boldsymbol {g}_2, \ldots , \boldsymbol {g}_{10}\) are permutations of the first profile \(\boldsymbol {g}_1\); if in the first profile, the group choice is the first option, it is evident that in the second profile, the option chosen by the group must be the second, and so on.
We now give a formal description of the data augmentation approach based on the permutation of group profiles in a dataset. Let \(G_{train}\) be the available training set for LCP, i.e., it contains pairs \(\big (\boldsymbol {g}, c(g)\big)\) composed by a group profile \(\boldsymbol {g}\) and the (observed) group choice \(c(g)\). Let \(\sigma\) be a permutation of O, i.e., a rearrangement of the \(n = |O|\) options, \(\sigma : \lbrace 1,\ldots ,n\rbrace \longrightarrow \lbrace 1,\ldots ,n\rbrace\). By using a group profile \(\big (\boldsymbol {g}, c(g)\big) \in G_{train}\) and a permutation \(\sigma\), we create a new group profile \(\sigma (\boldsymbol {g})\) and the corresponding group choice \(c\big (\sigma (\boldsymbol {g})\big)\) as follows:
Table 5 shows a group profile and two examples of permuted group profiles. \(o_2\) is the actual group choice of the group with profile \(\boldsymbol {g}\): it is the option with the largest group score. \(\sigma _1\) is a permutation that reorders all the options and maps \(o_2\) to \(o_1\), whereas \(\sigma _2\) is a permutation that maps \(o_2\) to \(o_3\). If the group g, with the particular pattern of group scores as in \(\boldsymbol {g}\), made the choice \(o_2\), then if the scores are only rearranged, hence, their relative values are not changed, the choice of the group with profile \(\sigma _1(\boldsymbol {g})\) now should be the option corresponding to the option chosen in the original group. Hence, it must be \(o_1\). One can comment on \(\sigma _2\) this way as well.
Table 5.
Group Profile
\(o_1\)
\(o_2\)
\(o_3\)
\(o_4\)
\(o_5\)
\(o_6\)
\(o_7\)
\(o_8\)
\(o_9\)
\(o_{10}\)
Group choice
\(\boldsymbol {g}\)
0.09
0.15
0.06
0.1
0.07
0.1
0.11
0.08
0.14
0.07
\(o_2\)
\(\sigma _1(\boldsymbol {g})\)
0.15
0.1
0.07
0.11
0.1
0.08
0.07
0.14
0.09
0.06
\(o_1\)
\(\sigma _2(\boldsymbol {g})\)
0.11
0.08
0.15
0.07
0.07
0.1
0.14
0.09
0.06
0.1
\(o_3\)
Table 5. Examples of Synthetic Data Constructed Using the Permutation Augmentation Method
\(\boldsymbol {g}\) is the group profile, and \(\sigma _1\) and \(\sigma _2\) are the group profiles constructed using the permutation augmentation method.
Thus, given a training set \(G_{train}\) of group profiles and corresponding choices, we sample with repetition from this training set and obtain a new set (with repetitions) of group profiles and corresponding choices \(\lbrace (\boldsymbol {g}_1, c(g_1)), \ldots , (\boldsymbol {g}_N, c(g_N))\rbrace\), where \(\big (\boldsymbol {g}_l, c(g_l)\big) \in G_{train}\), \(l= 1, \ldots , N\). We then select a permutation \(\sigma _l\) of O for each profile \(\big (\boldsymbol {g}_l, c(g_l)\big)\), \(l= 1, \ldots , N\), and we generate a new permuted group profile \(\big (\sigma _l(\boldsymbol {g}_l), \sigma (c(\boldsymbol {g}_l)\big)\). We add these permuted profiles to the original training set:
Algorithm 1 shows the exact procedure that we have designed for constructing new profiles using the Permutation data augmentation method.In this algorithm, N permuted group profiles are generated by making sure that a target distribution of group choices p is preserved in the synthetic data. In our experiments, this distribution p is the observed distribution of the choices in the training dataset. Hence, by adding the permuted profiles, we do not change the distribution of the choices in the training set. The rationale of this is related to a typical bias of ML algorithms, of being influenced by the class distribution in the training set. Finally, we note that the parameter N should be selected case by case depending on the available data and the complexity of the choice prediction problem. In our experiments, we have added 1,200 permutations to the training set (of 60 groups). More details are given in the next section.
6 Experimental Evaluation
6.1 Dataset
The groups’ observational data used in this article was collected in a user study focused on the travel and tourism domain. The study was implemented in two rounds at several universities in Europe. Both implementations followed the same three-phase structure [Delic et al. 2018c;, 2018a;, 2016]. In the first phase of the user study, the participants’ explicit preferences, i.e., either ratings or rankings for 10 pre-selected destinations (options), were collected. The two rounds differed in the pre-selected destinations and in the way participants expressed their preferences about them. In the first round, the destinations were 10 large European cities and the participants ranked them. Conversely, in the second round, the destinations were chosen to fit the general preferences of certain traveler types identified in the tourism literature [Yiannakis and Gibson 1992; Gibson and Yiannakis 2002; Neidhardt et al. 2014; Gretzel et al. 2004; Moscardo et al. 1996], and the participants rated them on a 10-point scale (1 — not attractive, 10 — highly attractive). The rationale for changing the destination set was to increase their diversity and consequently also the preferences of the group members.
In the second phase of the study, the participants were asked to form groups freely, with the only restriction that the group size should not exceed five members. Each user participated in only one group. The rationale of the group size constraint was only to focus the acquired data on the typical scenario of small groups and to avoid collecting data from a smaller number of larger groups. Then, the participants joined their respective groups and started a face-to-face discussion aimed at selecting, from the pre-defined set, a destination that they as a group would like to visit together. It is worth noting that, in the face-to-face discussion, the group members did not have access to the ratings or rankings that they previously individually assigned to the options. Hence, these users, while making a choice in their group, might not have recalled precisely the expressed preferences, and the group discussion could have brought the group to choices that are not well justified by the pre-discussion individual preferences.
Finally, in the third phase, the participants filled in a post-questionnaire in which they indicated which destination the group selected, i.e., the group choice. The observational study resulted in two datasets: DSI (200 participants in 55 groups) and a smaller set DSII (82 participants in 24 groups).3
In order to compute the group profiles, all considered strategies require the group members’ ratings as input. As mentioned, the two datasets, DSI and DSII, were different in terms of group members’ individual preferences, i.e., rankings in DSI and ratings in DSII. Hence, to derive ratings of the options from users’ ranked lists in DSI, we assign the maximum score (10) to the first option in a user’s ranked list, 9 to the second option, etc.
Finally, in Masthoff and Gatt [2006] it was shown that in GRSs it may be beneficial to alter the original ratings of the users in order to amplify the difference between highly rated items and lower-rated ones. In order to implement this idea, we have replaced the original ratings with the square of them.
6.2 Evaluation Setting
In order to evaluate the performance of the proposed group choice prediction approach, namely, LCP, we have compared it to the baseline PACP method. To train LCP, we use the Multinomial Logistic Regression classifier [Venables and Ripley 2013]. We have also tested the performance of other classifiers, such as Linear Discriminant Analysis [Venables and Ripley 2013; Ripley 2007] and Support Vector Machines with a linear kernel function [Chang and Lin 2011]. However, Multinomial Logistic Regression outperformed these models, very likely because of its simplicity and the limited size of our dataset. For that reason, we only show the performance of LPC when the Multinomial Logistic Regression classifier is used.
Four-fold cross-validation is employed to estimate LCP and PACP performance. Four folds are selected due to the size of the dataset (i.e., 79 groups altogether). In fact, having more than four folds would produce folds containing less than 15 instances. As a measure of performance, we report the average accuracy of the predicted group choices (i.e., the number of correct predictions over the number of all predictions). Since our dataset is small and the estimated accuracy of LCP can depend on the particular four folds used in the cross-validation, we iterated the whole procedure 10 times and reported the average accuracy of these 10 iterations. In each iteration, we calculated the accuracy using the standard cross-validation method and then reported the average of these accuracy values. We note that in order to have a fair comparison, we have used the same foldings, in the 10 repetitions, to evaluate all the considered models and their variants. We have implemented our models in Python and have used the scikit-learn library. To tune the hyperparameters, we have utilized the Grid Search function available in scikit-learn. We employed various solvers,: ‘newton-cg’, ‘liblinear’, ‘lbfgs’, ‘sag’, and ‘saga’. The regularization term interval was set to \([0.1, 50]\), with a step size of 0.1.
PACP baseline predicts the option with the highest score as the group choice. However, there may be group profiles in which more than one option has the same largest score. In this case, PACP selects one of these alternatives randomly. Hence, also for the evaluation of PACP, we repeated the four-fold validation procedure 10 times and the final result is the average accuracy of these repetitions. In each of these repetitions, we used the same foldings and cross-validation method, but PACP does not require any learning phase; the choice prediction on a test group is only based on the group profile data.
As previously stated, we created our dataset by merging two distinct datasets: DSI and DSII. In order to assess the influence of this combination, we replicated our experiment on the larger dataset alone. This experiment also aimed to examine whether the proposed ML approach is robust even when combining different datasets.
In our dataset, the users have rated all of the items. However, in real-world scenarios, it is common for some or even most of the alternatives to be unrated by users. Thus, to assess the robustness of our approach and compare it with the baseline, we have performed an additional experiment. In this experiment, we randomly eliminated certain user–option ratings before creating the group profiles. By varying the probability to eliminate a rating, we create a collection of new datasets of group profiles. We then predict the group choices by independently considering these datasets. Hence, we test our group choice prediction method relying on sparse matrices of user–option ratings. We followed the previously mentioned procedure for our experiment. For each of the user–option matrices we generated by removing certain individual ratings, we employed a four-fold cross-validation approach. We repeated this process 10 times and calculated the average accuracy of choice predictions for both PACP and LCP.
We have also compared the prediction performance of LPC with humans’ ability in predicting group choice by only knowing the individual group members’ ratings. To do so, we have developed a user interface in which the participants could check the individual ratings of the group members in our dataset and predict the group choice. More details on this graphical user interface (GUI) and the experiment are given in the results section.
We have finally evaluated the performance of LPC when the Winners and Permutations data augmentation approaches are used. Similar to what is described above, when the training set is augmented with Permutations in order to avoid the possibility of having training sets of different quality, we repeat the four-fold cross-validation procedure 10 times by generating each time a different set of synthetic group profiles. We stress that Winners and Permutations profiles are only added to the training set, whereas the test set contains only genuine group profiles: the test set was not changed in any fold or repetition. The number of Winner profiles that are added to the training set is 10, as this is the number of options in our prediction task. The number of Permutations added is 1,200. This number was optimized with cross-validation.
In summary, considering the two models for group choice prediction, PACP and LCP, the diverse types of group profiles produced by the considered preference aggregation strategies (AVE, MULT, LM, COPE, SDS1, and SDS3), and the two data augmentation approaches (Winners and Permutations), we have generated \(3 * 6 = 18\) variants of LCP and 6 variants of PACP. Table 6 reports the names of these variants and their characteristics. For instance, the aggregation strategy AVE is used to generate group profiles that are considered in LCP-AVE, LCP-AVE-W (Winners), LCP-AVE-P (Permutations), and PACP-AVE.
Table 6.
Pref. Aggr. Strat.
Winners
Permutations
PACP
LCP
AVE
-
-
PACP-AVE
LCP-AVE
\(\checkmark\)
-
-
LCP-AVE-W
-
\(\checkmark\)
-
LCP-AVE-P
MULT
-
-
PACP-MULT
LCP-MULT
\(\checkmark\)
-
-
LCP-MULT-W
-
\(\checkmark\)
-
LCP-MULT-P
LM
-
-
PACP-LM
LCP-LM
\(\checkmark\)
-
-
LCP-LM-W
-
\(\checkmark\)
-
LCP-LM-P
SDS1
-
-
PACP-SDS1
LCP-SDS1
\(\checkmark\)
-
-
LCP-SDS1-W
-
\(\checkmark\)
-
LCP-SDS1-P
SDS3
-
-
PACP-SDS3
LCP-SDS3
\(\checkmark\)
-
-
LCP-SDS3-W
-
\(\checkmark\)
-
LCP-SDS3-P
COPE
-
-
PACP-COPE
LCP-COPE
\(\checkmark\)
-
-
LCP-COPE-W
-
\(\checkmark\)
-
LCP-COPE-P
Table 6. List of the Choice Prediction Model Variants That Are Considered in the Evaluation
7 Results
We address the first hypothesis stated in Section 3.2 by evaluating the predictive capability of the proposed LPC model in predicting the group choice. We also assess the robustness of the proposed models in dealing with unknown user ratings, and we present and discuss the findings from our user study regarding the human ability to predict group choice. Then, we move to the second hypothesis and assess the effectiveness of the proposed data augmentation method.
7.1 Predictive Capability of LCP
We start by examining the validity of our first hypothesis. By using a proper ML model trained on a dataset of group choices, it is possible to effectively predict the choice of a group by using only the individual preferences of the group members, encoded in a group profile.
Figure 3 shows the performance (choice prediction accuracy) of the considered LCP and PACP variants. The benefit of using LCP in comparison with PACP is clear: for all the considered group profile generation methods (AVE, MULT, LM, SDS1, SDS3, and COPE), LCP achieves a better performance than the corresponding baseline PACP variant. We note that LCP-AVE, i.e., LCP when the group profile is constructed with the AVE aggregation strategy, outperforms all the other LCP variants, i.e., those using group profiles produced by the alternative preference aggregation strategies. Thus, by using the AVE preference aggregation strategy to build the group profiles, LPC can reach a good performance of almost 50% accuracy. It is also interesting to note that even if SDS1 and SDS3 preference aggregation strategies are not among the best, there is a great benefit in using the LCP learning approach when these strategies are used for building the group profile. This confirms Stasser’s intuition regarding the SDS theory that the choice function can be learned from group choice data.
Fig. 3.
7.2 Considering DSI and DSII Separately
As mentioned in Section 6.1, the dataset that we use in our experiments is the union of two datasets (DSI and DSII) collected in two implementations of the Delic et al. [2018c];, 2018a];, 2016] user study. These two implementations are different with regard to the options considered by the participants and the type of collected individual preferences (ratings vs. ranking). Merging these two datasets for creating a single one is also motivated by the assumption introduced by Stasser [Stasser 1999] and mentioned above: the specific pattern of group scores in the group profile and not the options mostly determines the group choice.
In order to assess the assumption that by using the merged dataset, instead of the two independently, would not negatively impact the accuracy of the LCP models, we repeated our experiment on DSI, the larger one, which contains 55 groups. The second small dataset contains only 24 groups, which makes it inappropriate for properly assessing the quality of both LCP and PACP.
By comparing the accuracy of the LCP variants in the merged dataset (DSI+DSII) with the corresponding variants in DSI, we have discovered that the prediction accuracy barely differs. For instance, the accuracy of LCP-AVE (LCP-MULT) on the merged dataset is only 0.003 (0.032) larger than the accuracy of LCP-AVE (LCP-MULT) on DSI.
This result supports the above-mentioned assumption that the specific distribution of the group scores that are in the group profile and not the options determines the group choice. In fact, the merged dataset, which we use in our analysis, is the combination of group profiles related to two different choice tasks: the common aspect is only the same number of options. Hence, this analysis confirms the validity of using the merged dataset, derived from two independent implementations of the destination selection task. It also shows that one can obtain a benefit by merging data coming from somewhat different group decision tasks.
7.3 Dealing with Unknown User Ratings
The user–option rating matrix used in our experiments is dense: all users have rated all the options. However, in many practical situations, this may not be the case: a group member may express one’s preferences only for a subset of the available options. To evaluate the ability of PACP and LCP to deal with these cases, i.e., when the user–option rating matrix is sparse and there are unknown user ratings, we produced a collection of new sub-datasets by discarding in each sub-dataset a proportion of the group members’ ratings. To do so, in each sub-dataset user ratings were, one by one, independently removed with a given probability p. We have considered probability values between 0 and 0.6 (with 0.01 step) to produce a collection of sub-datasets.
As mentioned in Section 4.2, to calculate the group score for an option, when there are missing ratings, we require that at least one of the group members has rated that item. Hence, when generating a sub-dataset of ratings, by removing ratings with a certain probability, we avoided the cases in which an option was not rated by at least one group member.
Figure 4 shows the accuracy of PACP-AVE and LCP-AVE for different sub-datasets generated by an increasing probability to remove an existing rating in the original dataset. The x-axis in this figure indicates the actual sparsity of the generated user–option matrix, i.e., the percentage of ratings that were not present in the rating matrix that was used to compute the plotted accuracy. To avoid creating randomly a very good or very bad matrix, we repeated this experiment 50 times and reported the average accuracy for each obtained sparsity level. As expected, the accuracy of LCP-AVE and PACP-AVE decreases as the sparsity of the user–option rating matrix increases. However, it is clear that LCP-AVE has a better performance than PACP-AVE in dealing with missing data. Figure 4 also shows that, with increasing sparsity, the performance gap between LCP-AVE and PACP-AVE grows. It is worth mentioning that we also tested other variants of PACP and LCP, for instance, PACP-MULT, but we do not show these results as they are very similar to those shown here.
Fig. 4.
7.4 Human Ability in Predicting the Group Choice
We have shown that LPC can achieve an accuracy of almost 50% in predicting the group choice from the observation of the group members’ preferences (see Figure 3). To better judge the absolute quality of this ML-based prediction, it is interesting to understand whether a human may be better than LCP in predicting the choice of a group, by having the knowledge of the group members’ preferences, i.e., their ratings for the alternative options.
To this end, following a similar approach adopted by Masthoff [2004], we have conducted a user study and asked 10 participants to predict the likely group’s choice after having observed the group members’ ratings. The participants were computer science master students and colleagues at the Free University of Bolzano. We gave the participants a simple task: please consider the group members’ ratings shown here and select the option that you believe can be the group’s final choice. We have implemented a simple GUI (Figure 5), in which the system, for each group in the dataset, shows the group members’ ratings for 10 destinations \((\text{D1},\ldots , \text{D10})\). Note that no information about the identity of the destinations was given and the subjects were not even aware of the fact that it was a destination selection problem faced by the groups. The study participants were requested to select the destination/option that they believed could have been the group’s final choice.
Fig. 5.
The participants predicted all the group choices of the 79 groups in our dataset. We did not specify any time limitation for the participants and they had to do the whole experiment in one session. All of our 10 subjects performed the required task in one session and we did not exclude any of them from our report. On average, they required about 20 minutes to predict all the groups’ choices. The average accuracy of the participants in predicting the group choices was 0.37, with a minimum of 0.28 and a maximum of 0.46. The variance of the accuracy score was 0.05.
By comparing these results to the performance of LCP and PACP, shown in Figure 3, one can see that human (average) accuracy is much lower than the (average) accuracy obtained by the best LCP variants and even the accuracy of the best PACP variants (AVE, MULT, COPE). Only the best-performing human (0.46) is approaching the (average) performance of the best LCP and PACP variants, which is obtained with the AVE preference aggregation strategy. This result further confirms the benefit of using our ML predictive model, LCP, to solve this task.
7.5 Using Data Augmentation in LCP
We now consider the LCP variants that use the proposed data augmentation approach: Winners and Permutations (see Section 5). Figure 6 shows the performance of the PACP and the LCP variants with and without synthetic group profiles (Winners or Permutations). We discuss here our second hypothesis: in order to cope with the data scarcity of group choices, it is possible to use data augmentation methods, relying on synthetic group choices, and further improve the quality of the proposed group choice prediction method.
Fig. 6.
We observe that the advantage of using the data augmentation methods is not uniform for all the group profile types, i.e., built by using the various preference aggregation strategies that we have considered. Adding Winners group profiles to the dataset brings a value to all LCP variants, with the exception of MULT. Interestingly, the benefit of the Winners data augmentation is more evident for the SDS1 and SDS3 variants. In conclusion, the Winners data augmentation seems to be applicable without any fear of deteriorating the LCP performance.
Adding Permutations data instead improves the accuracy of LCP-AVE, LCP-SDS3, and LCP-COPE. But Permutations data are not beneficial at all when the group profiles are built with the other preference aggregation strategies. However, it is important to note that LCP-AVE-P is the best-performing choice prediction method. Hence, there is clear value in both data augmentation techniques, but the Permutations data augmentation approach requires validation before being applied in combination with a specific preference aggregation strategy, case by case.
In order to better understand the effect of adding the synthetic groups in the Permutations to the training set, we analyzed the distribution of the predictions over the 10 possible options and compared it to the actual distribution of the group choices in the dataset. We found that Permutations help to reduce the KL-divergence between the distribution of the predicted choices and the distribution of the actual group choices, as shown in Table 7. KL-divergence is a statistical metric that measures to what extent two probability distributions are different from each other. The KL-divergence of the LCP variants that use Permutations, namely LCP-P, is much lower than that of the original LCP, and the LCP variants that use the data set augmented by the Winners group profiles. Moreover, it is also lower than the Kl-divergence of the PACP model. Therefore, these synthetic group profiles aid LCP to generate a choice distribution that is more similar to the observed choice distribution.
Table 7.
PACP
LCP
LCP-W
LCP-P
AVE
0.202
0.212
0.278
0.196
MULT
0.251
0.184
0.282
0.164
LM
0.293
0.372
0.569
0.212
Table 7. KL-divergence Between the Predicted Choice Distribution and the Actual Choice Distribution
A smaller value indicates a more similar distribution of the predicted choices to the actual choices.
We further illustrate this finding by comparing the confusion matrices of the PACP-AVE (Figure 7(a)), LCP-AVE (Figure 7(b)), and LCP-AVE-P (Figure 7(c)). In these matrices, each row corresponds to an option chosen by a number of groups indicated in the last column, and the entries in the row show how the predictions for those groups are distributed along the 10 options. By looking at the last (bottom) row of these tables (summary distribution of all the predictions), it is clear that the choice predictions of LCP-AVE concentrate more around the four most popular options (\(o_5\), \(o_6\), \(o_9\), and \(o_{10}\)) than the PACP-AVE does whereas with the addition of Permutations, the LCP-AVE-P model produces a (predicted) choice distribution more evenly distributed. Still, by comparing the bottom row of the LCP-AVE and LCP-AVE-P matrices with the last column (the true distribution of the group choices), one can immediately see the bias of the learning methods, which predict more often the popular group choices.
Fig. 7.
7.6 Results Significance
We also tested the significance of all our results with the Wilcoxon sign-rank test [Wilcoxon 1992]. Table 8 shows the significance level (p-value) of the improvement made by LCP-* compared with PACP-* and LCP-*-P compared with LCP-*. LCP-*-W was never significantly better than LCP.
Table 8.
Compared Variants
p-value
LCP-LM vs. PACP-LM
0.048
LCP-SDS1 vs. PACP-SDS1
0.049
LCP-AVE-P vs. LCP-AVE
0.019
LCP-SDS3-P vs. LCP-SDS3
0.011
LCP-COPE-P vs. LCP-COPE
0.020
Table 8. Significance Level (P-value) of the Learning Method LCP-*-P Improvement in Comparison with LCP
8 Discussion and Future Work
Group Choice Prediction
GRSs have focused on the problem of how to properly aggregate and use individual preferences in order to suggest items that a group will be happy to choose. In this article, by considering that preference aggregation techniques can also be leveraged to predict the choice that a group could or should make, we have hypothesized that the performance of these strategies can be improved by the adoption of a proper ML approach (see Hypothesis 1 in Section 3.2) and even further improved by using a proper data augmentation method (see Hypothesis 2 in Section 3.2). Our hypotheses are motivated by the SDS theory that explicitly focuses on the group choice prediction problem. SDS models how the collective response (group choice), which is the result of possibly complex inter-group interactions that happen during the group decision-making process, can be generated by relying only on individual preferences (group members’ preferences).
Group Profiles and Machine Learning
We have proposed a new approach to group choice prediction that learns a mapping from the group profile to the group choice. The group profile is a summary representation of the individual group members’ preferences and can be generated by any preference aggregation strategy. We have shown that our approach, named LCP (Learning-based Choice Prediction), produces effective predictions, significantly more accurate than those generated by the usage of standard aggregation strategies (PACP).
We have considered a range of aggregation strategies in assessing the quality of LCP: Average, Multiplicative, Least Misery, SDS1, SDS3, and COPE. We have found that group profiles constructed by using the Average preference aggregation strategy enable LCP to produce the best results in our dataset. We have also shown that the better performance of LCP, compared with PACP baseline, which was first assessed on a dense rating matrix, is maintained when only partial knowledge about individual preferences of the members of the group is available, i.e., only some of the individual preferences are known. The empirical analysis also indicates that as the sparsity of the rating matrix increases, the performance gap between LCP and PACP increases as well, with a steeper performance decline for PACP. Moreover, in a user study, we have shown that the proposed choice prediction method, LCP, is more accurate than human assessors trying to predict the group choice from the observation of the group members’ preferences (ratings for the options).
Our group choice prediction approach uses a set of observed groups, along with the group members’ ratings for the considered options and the consequent choice that the group made, in order to train the group choice predictive model. Hence, our approach is dependent on the size and quality of the training data. In order to cope with the data scarcity problem, i.e., having only a limited number of observed groups with their choices, we have proposed two training data augmentation methods. These methods are grounded on assumptions about the properties of the target map from group profiles to group choices. The first assumption is that if all the group members select one item as their preferred option, then this must be the group’s final choice. Group profiles with that characteristic were called Winners. Then, in the second method, by assuming that groups are influenced in their decision by the relative scores of the options in the group profile, and not the actual options, we have defined Permutations profiles and added them to the training data. We have found that Winners never damage the prediction accuracy of LCP, but their benefit is small (as very few group profiles are added by this method) whereas, in combination with certain types of group profiles (those produced by AVE, SDS3, and COPE preference aggregation strategies), Permutations do help LCP to produce more accurate predictions. Moreover, it is observed that this result is obtained by generating a distribution of the predicted group choices more similar to the true (observed in the data) group choice distribution.
Limitations and Future Work
We must acknowledge some limitations of our choice prediction approach and its evaluation. First, in this study, we have used a single dataset that is rather small. However, we have shown in our experiments that the performance of the proposed method on our dataset, which is actually the combination of two datasets referring to different group decision problems, is comparable to the performance of the same method on a single dataset. This shows that the proposed approach can be robust to solve a combination of prediction tasks provided that the number of options is the same (see Section 7.2). In fact, the advantage of our approach is that it is not using any specific characteristic of the choice prediction task apart from the knowledge of the individual preferences. This clearly makes the proposed approach general.
However, we recognize that there is an urgent need to test the application of LCP to other datasets in order to further increase the generality of our results. Therefore, in the future, it is important to consider other, possibly larger, datasets. The datasets must be generated by collecting group choices in a broad range of application domains (e.g., tourism, music, video). It is worth noting that datasets with information about individual preferences and the corresponding group choice are not available now, and new observational studies, such as that described by Delic et al. [2018c],are needed to produce such datasets.
Second, the proposed choice prediction technique is currently designed to tackle decision problems with a small number of options. In our dataset, 10 options were available for the group to choose from. It is surely necessary to design and analyze methods that can scale to a larger number of options. This may be achieved with different types of learning algorithms and with more effective modeling approaches to summarize the preferences of the group members. In that respect, it could be important to derive from the surveyed ML-based group recommender systems approaches, alternative solutions to model the joint preferences of a group, as those based on hidden features of the group.
A third limitation is related to the fact that a rating dataset, which models individual preferences, could be very sparse, even sparser than the datasets considered in Section 7.3. Thus, an appropriate method to deal with these situations is needed. We note that in our current solution of the problem we require that for each option there must be at least one group member that has rated it. An extension of LPC to deal with sparser rating data must be designed. One line of research could be to test the effect of replacing missing ratings with ratings obtained from a rating prediction method and then to construct group profiles based on both real and predicted ratings.
Conclusion
In conclusion, the extensive analysis that we present in this article has clearly indicated the effectiveness and practical applicability of the proposed methods for group choice prediction.
This information, the likely choice that a group will make, can be immediately used to generate a recommendation by recommending the predicted choice, helping the group to faster converge on a decision. However, by having the knowledge of the likely choice of the group, the recommender system can also leverage it to generate other types of recommendations, for instance, presenting items similar to the predicted choice but with additional important properties, such as items that are more novel or fairer choices. Hence, we believe that addressing the group choice prediction problem can open the research on novel and interesting group recommendation techniques, especially conversational approaches, which can greatly benefit from the prediction of the likely choice of the group, to better interact with the group members in supporting their decision-making process.
Hence, notwithstanding its limitations, we believe that the proposed approach represents a concrete tool for better understanding groups and their discussions, and for generatig more compelling GRSs.
Footnotes
1
By an individual user—item interaction, the user’s choice of that particular item is usually considered. Similarly, by a group–item interaction, a group choice of that item is usually considered.
2
We note that when computing choice predictions in our experiments, we have also used the Borda count and Most Pleasure preference aggregation strategies. However, we do not discuss these prediction methods because PACP with the Borda aggregation strategy has a performance very similar to that obtained by the AVE strategy. Besides, the performance of Most Pleasure was consistently worse than any other strategy.
Liliana Ardissono, Anna Goy, Giovanna Petrone, Marino Segnan, and Pietro Torasso. 2003. Intrigue: Personalized recommendation of tourist attractions for desktop and hand held devices. Applied Artificial Intelligence 17, 8-9 (2003), 687–714.
Da Cao, Xiangnan He, Lianhai Miao, Yahui An, Chao Yang, and Richang Hong. 2018. Attentive group recommendation. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (Ann Arbor, MI, USA) (SIGIR’18). Association for Computing Machinery, New York, NY, 645–654.
Chih-Chung Chang and Chih-Jen Lin. 2011. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2, 3 (2011), 1–27.
Amra Delic, Judith Masthoff, Julia Neidhardt, and Hannes Werthner. 2018a. How to use social relationships in group recommenders: Empirical evidence. In Proceedings of the 26th Conference on User Modeling, Adaptation and Personalization. ACM, New York, NY, 121–129.
Amra Delic, Julia Neidhardt, Thuy Ngoc Nguyen, and Francesco Ricci. 2018c. An observational user study for group recommender systems in the tourism domain. Information Technology & Tourism 19, 1-4 (2018), 87–116.
Amra Delic, Julia Neidhardt, Thuy Ngoc Nguyen, Francesco Ricci, Laurens Rook, Hannes Werthner, and Markus Zanker. 2016. Observing group decision making processes. In Proceedings of the 10th ACM Conference on Recommender Systems, RecSys’16. Association for Computing Machinery, New York, NY, 147–150.
Amra Delic, Julia Neidhardt, and Hannes Werthner. 2018b. Group decision making and group recommendations. In 2018 IEEE 20th Conference on Business Informatics (CBI), Vol. 1. IEEE, Vienna, Austria, 79–88.
Alexander Felfernig, Ludovico Boratto, Martin Stettinger, and Marko Tkalčič. 2018. Group Recommender Systems: An Introduction. Springer Cham, Cham, Switzerland.
Ulrike Gretzel, Nicole Mitsche, Yeong-Hyeon Hwang, and Daniel R. Fesenmaier. 2004. Tell me who you are and I will tell you where to go: Use of travel personalities in destination recommendation systems. Information Technology & Tourism 7, 1 (2004), 3–12.
Anthony Jameson. 2004. More than the sum of its members: Challenges for group recommender systems. In Proceedings of the Working Conference on Advanced Visual Interfaces (Gallipoli, Italy) (AVI’04). Association for Computing Machinery, New York, NY, USA, 48–54.
Judith Masthoff. 2004. Group modeling: Selecting a sequence of television items to suit a group of viewers. In Personalized Digital Television. Springer, Dordrecht, Netherlands, 93–141.
Judith Masthoff and Albert Gatt. 2006. In pursuit of satisfaction and the prevention of embarrassment: Affective state in group recommender systems. User Modeling and User-Adapted Interaction 16, 3 (2006), 281–319.
Gianna Moscardo, Alastair M. Morrison, Philip L. Pearce, Cheng-Te Lang, and Joseph T. O’Leary. 1996. Understanding vacation destination choice through travel motivation and activities. Journal of Vacation Marketing 2, 2 (1996), 109–122.
Julia Neidhardt, Reiner Schuster, Leonhard Seyfang, and Hannes Werthner. 2014. Eliciting the users’ unknown preferences. In Proceedings of the 8th ACM Conference on Recommender Systems. ACM, 2645767, 309–312.
Aravind Sankar, Yanhong Wu, Yuhang Wu, Wei Zhang, Hao Yang, and Hari Sundaram. 2020. GroupIM: A mutual information maximization framework for neural group recommendation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (Virtual Event, China) (SIGIR’20). ACM, New York, NY, 1279–1288.
Garold Stasser. 1999. A primer of social decision scheme theory: Models of group influence, competitive model-testing, and prospective modeling. Organizational Behavior and Human Decision Processes 80, 1 (1999), 3–20.
Frank Wilcoxon. 1992. Individual comparisons by ranking methods. In Breakthroughs in Statistics: Methodology and Distribution. Springer, New York, 196–202.
Sebastien C. Wong, Adam Gatt, Victor Stamatescu, and Mark D. McDonnell. 2016. Understanding data augmentation for classification: When to warp?. In 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA). IEEE, 1–6.
UMAP'10: Proceedings of the 18th international conference on User Modeling, Adaptation, and Personalization
Today most of existing personalization systems (e.g content recommenders, or targeted ad) focus on individual users and ignore the social situation in which the services are consumed However, many human activities are social and involve several ...
Recommendation techniques have proven their usefulness as a tool to cope with the information overload problem in many classical domains such as movies, books, and music. Additional challenges for recommender systems emerge in the domain of tourism such ...
RecSys '10: Proceedings of the fourth ACM conference on Recommender systems
Collaborative filtering recommendations were designed primarily for individual user models and recommendations. However, nowadays more and more scenarios evolve, in which the recommended items are consumed by groups of users rather than by individuals. ...
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].