Rating Conversion - Multi-Criteria Recommender Systems and Rating Conversion

2.4 Multi-Criteria Recommender Systems and Rating Conversion

2.4.3 Rating Conversion

In order to provide an effective rating prediction, the memory-based CF approach rely crucially on the ratings from neighbors. However, exploiting those ratings to make a prediction for the other users directly might lead to a problem. This is because the habits or patterns on giving ratings among users vary due to their personal biases. For example, on the rating range of 1 to 10, Useru₁might give rating score from 2 to 5 indicating ‘dislike’

to ‘like’, while Useru₂, instead, gives the rating from 5 to 8 with the same intention. This means that ‘like’ for useru₁ equals to ‘dislike’ to useru₂. Therefore, using ratings from neighbors to predict the rating for an active user directly may not be practical.

In order to deal with the user personal biases in the ratings, many rating conversion techniques have been introduced in single criterion domain [8, 18, 43, 44, 56]. The main idea is to convert the ratings from the neighbors into the same scale as the active user, before utilizing them for a rating prediction. The most simplest approach that can be applied for converting the ratings is a normalization.

The normalization approach converts the user ratings into a specific range. Such range is usually between 0 and 1 where everyone’s ‘most like’ and ‘most dislike’ will be mapped to

score ‘1’ and ‘0’, respectively. Many normalization methods are proposed based on different assumptions, such as linear normalization, the Gaussian normalization, and the decoupling normalization.

Linear Normalization

This method maps ratings based on the maximum and minimum of personal user ratings. By using the linear function, the normalized rating valuer^new_u_a for the useru_a’s specific rating is computed as:

r^new_u

a = r_u^old_a −r_u_a_,min+1

r_u_a_,max−r_u_a_,min+1, (2.4.4) wherer^old_u_a is an original rating ofu_a,r_u_a_,max andr_u_a_,mindenote the maximum and minimum ratings useru_ahas rated, respectively. This normalization method maps ratings based only on maximum and minimum of the personal user ratings.

Gaussian Normalization

This method considers two factors that affect the variance of ratings among users with similar interests [43]. The first factor is a difference of a rating from the average ratings. This factor relates to the fact that some users are more tolerant and tend to give higher ratings than others.

Another factor is the difference of users rating scales. This comes from the fact that some users tend to assign items to a narrow range of ratings, whereas other users tend to assign items to a wide range. Combining these two factors, the ratings of each user are subtracted with his average and divided by the variance of his ratings, as expressed by:

r^new_u

a = r^old_u_a −r¯_u_a

σ_u_a , (2.4.5)

where ¯r_u_a andσua are an average and a standard deviation of user ratings, respectively.

Decoupling Normalization

This method converts a user rating on item into a probability for that item to be favored by the user [44]. When the ratingr_u_a is going to be normalized, the probability is determined based on two factors. First, a ratio between two numbers: the number of items which was rated no more than valuer_u_a by the useru_aand the number of all items that the useru_ahas

2.4 Multi-Criteria Recommender Systems and Rating Conversion 29 rated. The high ratio means the ratingr_u_a are likely to be favored by the user. The second factor is a ratio between the other two numbers: the number of items which was rated value r_u_a by the useru_aand the double number of all items that the user has rated. The low ratio means the ratingr_u_a are likely to be favored by the user. Based on these two factors, a special formula; called halfway accumulative distribution was proposed as:

r_u^new_a = |{v_j∈I_u_a|r_a,_j≤r_u^old

a }|

I_u_a −|{v_j∈I_u_a|r_a,_j=r_u^old

a }|

2|I_u_a| , (2.4.6)

whereI_u_a denotes the set of items to which useru_ahas rated.

Although the normalization techniques are able to convert a user’s ratings into the same range, the conversions are based only on the rating data of the only one user. This might lead to an inaccurate recommendation if there are two active users whose rating patterns are different but having the same neighbors. If the normalized ratings are used for the recommendations to these two active users, the results will be the same. For example, an active useru_ausually rates ‘0.4’ (normalized ratings) while another active useru_busually rates ‘0.7’. If these two users share the same neighboru_cwhose rated the target item with

‘0.8’, they will receive the same predicted ratings of ‘0.8’. Although ‘0.8’ seems like no effect on user u_b, it seems to be high value of rating for user u_a since his usual rating is

‘0.4’. Thus, the better solution is to find the relationship between each pair of user ratings:

original user and target user, in order to convert neighbor ratings to individual active user ratings. The examples of such conversion techniques include linear mapping [8], Lathia’s rating conversion [56] and Warat’s rating conversion [18], which are explained further in Chapter 3.3.

Furthermore, the rating conversion techniques have been proposed only in the SC domain.

Such SC rating conversion techniques can be applied to MC ratings by converting ratings from each criterion independently. However, this could cause a scalability problem and consume a lot of resources. Moreover, usually there are implicit relation among the criteria ratings when user makes decision to select an item. For example, a user may choose a room that have high score on both service and location, while ignore its price. If each criterion rating is converted independently, such implicit relation could be lost.

Chapter 3 Related Work

3.1 Review-Based Recommendation Techniques

Dalam dokumen submitted to the Department of Informatics (Halaman 43-47)