У меня есть агрегированные данные, которые содержат рейтинги опросов ресторанов по шкале от 1 до 5 звезд.
Как построить график распределения и найти стандартную ошибку в R?
И можно ли проводить более сложные анализы, такие как сгруппированные рейтинги (например, ошибка для всех рестораны быстрого питания, например McDonalds + BurgerKing, все стейк-хаусы и т. д. c.) и t-тесты?
Данные выглядят следующим образом:
Restaurant Question 1.star 2.stars ...etc
McDonalds How was the food? 5 6 ...
McDonalds How were the drinks? 3 4
McDonalds How were the workers? 2 7
Burger_King How was the food? 4 11
Burger_King How were the drinks? 9 3
Burger_King How were the workers? 12 1
Вот для удобства пустяк:
tribble(
~restaurant, ~question, ~one_star, ~two_star, ~three_star, ~four_star, ~five_star, ~average,
"McDonalds", "How was the food?", 5, 6, 8, 2, 9, (5*1 + 6*2 + 8*3 + 2*4 + 5*9)/(5 + 6 + 8 + 2 + 9),
"McDonalds", "How were the drinks?", 9, 8, 7, 5, 1, (9*1 + 8*2 + 7*3 + 5*4 + 5*1)/(9 + 8 + 7 + 5 + 1),
"McDonalds", "How were the drinks?", 9, 8, 7, 5, 1, (9*1 + 8*2 + 7*3 + 5*4 + 5*1)/(9 + 8 + 7 + 5 + 1),
"BurgerKing", "How was the food?", 5, 6, 8, 2, 9, (5*1 + 6*2 + 8*3 + 2*4 + 5*9)/(5 + 6 + 8 + 2 + 9),
"BurgerKing", "How were the drinks?", 9, 8, 7, 5, 1, (9*1 + 8*2 + 7*3 + 5*4 + 5*1)/(9 + 8 + 7 + 5 + 1),
"BurgerKing", "How were the drinks?", 9, 8, 7, 5, 1, (9*1 + 8*2 + 7*3 + 5*4 + 5*1)/(9 + 8 + 7 + 5 + 1)
)