607-week-1-assignment-

Overview

The article that I chose is “The Ultimate Halloween Candy Power Ranking” by fivethirtyeight.com, which can be found at this link: https://fivethirtyeight.com/features/the-ultimate-halloween-candy-power-ranking/

This article is about the results of a survey conducted by fivethirtyeight.com, in which over 200,000 participants were asked to rank their favorite Halloween candies. The survey aimed to determine the most popular Halloween candies in the United States, as well as any regional differences in candy preferences. The article also includes data visualizations and analysis of the survey results, including the distribution of candy popularity by chocolate or non-chocolate, hard or soft and by region. I have chosen the “The Ultimate Halloween Candy Power Ranking” dataset from fivethirtyeight.com. This dataset contains information on different Halloween candies and their popularity, based on a survey conducted by fivethirtyeight.com. I find this dataset interesting because it allows me to explore the preferences and opinions of people when it comes to Halloween candies.

Data preparation

i will do the following.

1-Read the data into R using the read.csv() function 2-Remove unnecessary columns that will not be used in the analysis 3-Rename columns to meaningful names 4-Replace any non-intuitive abbreviations used in the data 5-Check for missing values and handle them accordingly ——- I have studied the data and read the associated fivethirtyeight.com article. The dataset contains information on 85 different candies, including the candy’s name, its chocolate or non-chocolate category, whether it’s a hard or soft candy, and its overall ranking in the survey. The survey was conducted by fivethirtyeight.com in 2016, with over 200,000 participants. The data is provided in a CSV file on the GitHub site, and it is ready for analysis.

Download the dataset and read it into a dataframe

url <- "https://raw.githubusercontent.com/fivethirtyeight/data/master/candy-power-ranking/candy-data.csv"
download.file(url, destfile = "candy-data.csv", method = "curl")

candy_data <- read.csv("candy-data.csv")
candy_data

##                 competitorname chocolate fruity caramel peanutyalmondy nougat
## 1                    100 Grand         1      0       1              0      0
## 2                 3 Musketeers         1      0       0              0      1
## 3                     One dime         0      0       0              0      0
## 4                  One quarter         0      0       0              0      0
## 5                    Air Heads         0      1       0              0      0
## 6                   Almond Joy         1      0       0              1      0
## 7                    Baby Ruth         1      0       1              1      1
## 8           Boston Baked Beans         0      0       0              1      0
## 9                   Candy Corn         0      0       0              0      0
## 10          Caramel Apple Pops         0      1       1              0      0
## 11             Charleston Chew         1      0       0              0      1
## 12  Chewey Lemonhead Fruit Mix         0      1       0              0      0
## 13                    Chiclets         0      1       0              0      0
## 14                        Dots         0      1       0              0      0
## 15                    Dum Dums         0      1       0              0      0
## 16                 Fruit Chews         0      1       0              0      0
## 17                     Fun Dip         0      1       0              0      0
## 18                  Gobstopper         0      1       0              0      0
## 19           Haribo Gold Bears         0      1       0              0      0
## 20           Haribo Happy Cola         0      0       0              0      0
## 21           Haribo Sour Bears         0      1       0              0      0
## 22          Haribo Twin Snakes         0      1       0              0      0
## 23            Hershey's Kisses         1      0       0              0      0
## 24           Hershey's Krackel         1      0       0              0      0
## 25    Hershey's Milk Chocolate         1      0       0              0      0
## 26      Hershey's Special Dark         1      0       0              0      0
## 27                  Jawbusters         0      1       0              0      0
## 28                Junior Mints         1      0       0              0      0
## 29                     Kit Kat         1      0       0              0      0
## 30                 Laffy Taffy         0      1       0              0      0
## 31                   Lemonhead         0      1       0              0      0
## 32 Lifesavers big ring gummies         0      1       0              0      0
## 33         Peanut butter M&M's         1      0       0              1      0
## 34                       M&M's         1      0       0              0      0
## 35                  Mike & Ike         0      1       0              0      0
## 36                   Milk Duds         1      0       1              0      0
## 37                   Milky Way         1      0       1              0      1
## 38          Milky Way Midnight         1      0       1              0      1
## 39    Milky Way Simply Caramel         1      0       1              0      0
## 40                      Mounds         1      0       0              0      0
## 41                 Mr Good Bar         1      0       0              1      0
## 42                       Nerds         0      1       0              0      0
## 43         Nestle Butterfinger         1      0       0              1      0
## 44               Nestle Crunch         1      0       0              0      0
## 45                   Nik L Nip         0      1       0              0      0
## 46                 Now & Later         0      1       0              0      0
## 47                      Payday         0      0       0              1      1
## 48                 Peanut M&Ms         1      0       0              1      0
## 49                Pixie Sticks         0      0       0              0      0
## 50                   Pop Rocks         0      1       0              0      0
## 51                   Red vines         0      1       0              0      0
## 52          Reese's Miniatures         1      0       0              1      0
## 53   Reese's Peanut Butter cup         1      0       0              1      0
## 54              Reese's pieces         1      0       0              1      0
## 55 Reese's stuffed with pieces         1      0       0              1      0
## 56                    Ring pop         0      1       0              0      0
## 57                        Rolo         1      0       1              0      0
## 58           Root Beer Barrels         0      0       0              0      0
## 59                       Runts         0      1       0              0      0
## 60                     Sixlets         1      0       0              0      0
## 61           Skittles original         0      1       0              0      0
## 62          Skittles wildberry         0      1       0              0      0
## 63             Nestle Smarties         1      0       0              0      0
## 64              Smarties candy         0      1       0              0      0
## 65                    Snickers         1      0       1              1      1
## 66            Snickers Crisper         1      0       1              1      0
## 67             Sour Patch Kids         0      1       0              0      0
## 68       Sour Patch Tricksters         0      1       0              0      0
## 69                   Starburst         0      1       0              0      0
## 70         Strawberry bon bons         0      1       0              0      0
## 71                Sugar Babies         0      0       1              0      0
## 72                 Sugar Daddy         0      0       1              0      0
## 73                Super Bubble         0      1       0              0      0
## 74                Swedish Fish         0      1       0              0      0
## 75                 Tootsie Pop         1      1       0              0      0
## 76        Tootsie Roll Juniors         1      0       0              0      0
## 77        Tootsie Roll Midgies         1      0       0              0      0
## 78     Tootsie Roll Snack Bars         1      0       0              0      0
## 79           Trolli Sour Bites         0      1       0              0      0
## 80                        Twix         1      0       1              0      0
## 81                   Twizzlers         0      1       0              0      0
## 82                    Warheads         0      1       0              0      0
## 83        Welch's Fruit Snacks         0      1       0              0      0
## 84  Werther's Original Caramel         0      0       1              0      0
## 85                    Whoppers         1      0       0              0      0
##    crispedricewafer hard bar pluribus sugarpercent pricepercent winpercent
## 1                 1    0   1        0        0.732        0.860   66.97173
## 2                 0    0   1        0        0.604        0.511   67.60294
## 3                 0    0   0        0        0.011        0.116   32.26109
## 4                 0    0   0        0        0.011        0.511   46.11650
## 5                 0    0   0        0        0.906        0.511   52.34146
## 6                 0    0   1        0        0.465        0.767   50.34755
## 7                 0    0   1        0        0.604        0.767   56.91455
## 8                 0    0   0        1        0.313        0.511   23.41782
## 9                 0    0   0        1        0.906        0.325   38.01096
## 10                0    0   0        0        0.604        0.325   34.51768
## 11                0    0   1        0        0.604        0.511   38.97504
## 12                0    0   0        1        0.732        0.511   36.01763
## 13                0    0   0        1        0.046        0.325   24.52499
## 14                0    0   0        1        0.732        0.511   42.27208
## 15                0    1   0        0        0.732        0.034   39.46056
## 16                0    0   0        1        0.127        0.034   43.08892
## 17                0    1   0        0        0.732        0.325   39.18550
## 18                0    1   0        1        0.906        0.453   46.78335
## 19                0    0   0        1        0.465        0.465   57.11974
## 20                0    0   0        1        0.465        0.465   34.15896
## 21                0    0   0        1        0.465        0.465   51.41243
## 22                0    0   0        1        0.465        0.465   42.17877
## 23                0    0   0        1        0.127        0.093   55.37545
## 24                1    0   1        0        0.430        0.918   62.28448
## 25                0    0   1        0        0.430        0.918   56.49050
## 26                0    0   1        0        0.430        0.918   59.23612
## 27                0    1   0        1        0.093        0.511   28.12744
## 28                0    0   0        1        0.197        0.511   57.21925
## 29                1    0   1        0        0.313        0.511   76.76860
## 30                0    0   0        0        0.220        0.116   41.38956
## 31                0    1   0        0        0.046        0.104   39.14106
## 32                0    0   0        0        0.267        0.279   52.91139
## 33                0    0   0        1        0.825        0.651   71.46505
## 34                0    0   0        1        0.825        0.651   66.57458
## 35                0    0   0        1        0.872        0.325   46.41172
## 36                0    0   0        1        0.302        0.511   55.06407
## 37                0    0   1        0        0.604        0.651   73.09956
## 38                0    0   1        0        0.732        0.441   60.80070
## 39                0    0   1        0        0.965        0.860   64.35334
## 40                0    0   1        0        0.313        0.860   47.82975
## 41                0    0   1        0        0.313        0.918   54.52645
## 42                0    1   0        1        0.848        0.325   55.35405
## 43                0    0   1        0        0.604        0.767   70.73564
## 44                1    0   1        0        0.313        0.767   66.47068
## 45                0    0   0        1        0.197        0.976   22.44534
## 46                0    0   0        1        0.220        0.325   39.44680
## 47                0    0   1        0        0.465        0.767   46.29660
## 48                0    0   0        1        0.593        0.651   69.48379
## 49                0    0   0        1        0.093        0.023   37.72234
## 50                0    1   0        1        0.604        0.837   41.26551
## 51                0    0   0        1        0.581        0.116   37.34852
## 52                0    0   0        0        0.034        0.279   81.86626
## 53                0    0   0        0        0.720        0.651   84.18029
## 54                0    0   0        1        0.406        0.651   73.43499
## 55                0    0   0        0        0.988        0.651   72.88790
## 56                0    1   0        0        0.732        0.965   35.29076
## 57                0    0   0        1        0.860        0.860   65.71629
## 58                0    1   0        1        0.732        0.069   29.70369
## 59                0    1   0        1        0.872        0.279   42.84914
## 60                0    0   0        1        0.220        0.081   34.72200
## 61                0    0   0        1        0.941        0.220   63.08514
## 62                0    0   0        1        0.941        0.220   55.10370
## 63                0    0   0        1        0.267        0.976   37.88719
## 64                0    1   0        1        0.267        0.116   45.99583
## 65                0    0   1        0        0.546        0.651   76.67378
## 66                1    0   1        0        0.604        0.651   59.52925
## 67                0    0   0        1        0.069        0.116   59.86400
## 68                0    0   0        1        0.069        0.116   52.82595
## 69                0    0   0        1        0.151        0.220   67.03763
## 70                0    1   0        1        0.569        0.058   34.57899
## 71                0    0   0        1        0.965        0.767   33.43755
## 72                0    0   0        0        0.418        0.325   32.23100
## 73                0    0   0        0        0.162        0.116   27.30386
## 74                0    0   0        1        0.604        0.755   54.86111
## 75                0    1   0        0        0.604        0.325   48.98265
## 76                0    0   0        0        0.313        0.511   43.06890
## 77                0    0   0        1        0.174        0.011   45.73675
## 78                0    0   1        0        0.465        0.325   49.65350
## 79                0    0   0        1        0.313        0.255   47.17323
## 80                1    0   1        0        0.546        0.906   81.64291
## 81                0    0   0        0        0.220        0.116   45.46628
## 82                0    1   0        0        0.093        0.116   39.01190
## 83                0    0   0        1        0.313        0.313   44.37552
## 84                0    1   0        0        0.186        0.267   41.90431
## 85                1    0   0        1        0.872        0.848   49.52411

url <- "https://raw.githubusercontent.com/fivethirtyeight/data/master/candy-power-ranking/candy-data.csv"
df <- read.csv(url)

I will select a subset of columns that I am interested in analyzing. In this case, I will select the columns for “competitorname”, “chocolate”, “winpercent” and “pricepercent”. I will also add meaningful column names and replace any non-intuitive abbreviations used in the data.

Select a subset of columns

subset_cols <- c("competitorname","chocolate","winpercent","pricepercent")
df_subset <- df[,subset_cols]

Rename columns with meaningful names

names(df_subset) <- c("Candy Name","Chocolate","Win Percentage","Price Percentage")

I will check the first few rows of the transformed dataframe to ensure that the data has been properly subsetted and renamed.

Print first few rows of transformed dataframe

head(df_subset)

##     Candy Name Chocolate Win Percentage Price Percentage
## 1    100 Grand         1       66.97173            0.860
## 2 3 Musketeers         1       67.60294            0.511
## 3     One dime         0       32.26109            0.116
## 4  One quarter         0       46.11650            0.511
## 5    Air Heads         0       52.34146            0.511
## 6   Almond Joy         1       50.34755            0.767

To ensure that the original data file is accessible through my code, I will store the data file in a public GitHub repository using its URL

github

Findings and Recommendations

Conclusions

From the analysis of the “The Ultimate Halloween Candy Power Ranking” dataset, several conclusions can be drawn about the preferences of people when it comes to Halloween candies. First, it is clear that chocolate candies are more popular than non-chocolate candies. Additionally, the survey results also indicate that people have a preference for soft candies over hard candies. Second, the survey results indicate that there are regional differences in candy preferences. For example, people in the Northeast and West regions of the United States tend to prefer chocolate candies, while people in the South and Midwest regions tend to prefer non-chocolate candies.To extend, verify, or update the work from the selected article, there are several steps that can be taken. For example: Conducting a similar survey in different years to track any changes in candy preferences over time.Expanding the survey to include more participants and a wider range of geographic locations.Conducting a survey to find out the reasons why people like or dislike a particular candy, in order to get more insight into their preferences.Conducting a survey of children to compare their preferences with adults, as children may have different preferences.Conducting a survey to compare the candies preferences among different ethnic groups.Conducting a survey to compare the preferences of candies in different countries.Overall, the “The Ultimate Halloween Candy Power Ranking” dataset provides an interesting look at candy preferences in the United States and serves as a starting point for further research on the topic.

week1 homework

waheeb Algabri

2023-01-26