提问人:hk2 提问时间:4/13/2023 更新时间:4/13/2023 访问量:24
将商店绩效与贝叶斯平均评分进行比较
Comparing store performance with bayesian average rating
问:
我正在尝试根据商店的评级和评论来比较商店的表现,我在类似的帖子中遇到了两种方法——加权评级和贝叶斯评级 按 5 星评级排序的更好方法是什么?
我的数据集有点相似,商店有总体评分(满分 5 星评分)和评论数。但是,有些商店的评分较高,评论较少,有些商店的评分较高,评论较高,而另一些商店的评分较低,评论较高。我很难理解加权评级方法中的“m”是什么意思,该方法以及埃文·米勒的贝叶斯公式(即
]1)weighted rating = (v / (v + m)) * R + (m / (v + m)) * C
nk is the number of k-star ratings,
sk is the "worth" (in points) of k stars,
N is the total number of votes
K is the maximum number of stars (e.g. K=5, in a 5-star rating system)
z_alpha/2 is the 1 - alpha/2 quantile of a normal distribution. If you want 95% confidence (based on the Bayesian posterior distribution) that the actual sort criterion is at least as big as the computed sort criterion, choose z_alpha/2 = 1.65```
Below is a sample dataset to provide more clarity. The ratings lie between 3.5 to 4.6 with reviews ranging from ~200 to ~2800. Which of the above two methods should be a good fit in my case and how can I use the variables in my dataset in the above two formulae?
| 商店 | 评级 | 评论数量 |
|---|---|---|
| 101 | 3.7 | 211 |
| 102 | 3.6 | 1,194 |
| 103 | 3.7 | 1,879 |
| 104 | 3.7 | 876 |
| 105 | 3.7 | 765 |
| 106 | 3.7 | 922 |
| 107 | 3.5 | 502 |
| 108 | 3.7 | 2,401 |
| 109 | 3.9 | 635 |
| 110 | 3.9 | 505 |
| 111 | 3.8 | 275 |
| 112 | 3.9 | 1,021 |
| 113 | 3.9 | 1,931 |
| 114 | 4 | 851 |
| 115 | 4.1 | 741 |
| 116 | 4.1 | 749 |
| 117 | 4 | 500 |
| 118 | 4.2 | 896 |
| 119 | 4.2 | 2,807 |
| 120 | 4.2 | 1,372 |
| 121 | 4.1 | 1,807 |
| 122 | 4.2 | 2,526 |
| 123 | 4 | 1,170 |
| 124 | 4.2 | 1,587 |
| 125 | 4.2 | 2,125 |
| 126 | 4.1 | 1,959 |
| 127 | 4.3 | 862 |
| 128 | 4.3 | 1,249 |
| 129 | 4.4 | 2,143 |
| 130 | 4.4 | 1,396 |
| 131 | 4.4 | 366 |
| 132 | 4.4 | 954 |
| 133 | 4.5 | 1,058 |
| 134 | 4.5 | 230 |
| 135 | 4.6 | 436 |
| 136 | 4.6 | 1,000 |
答: 暂无答案
评论