Use these tabs to navigate the report:
Bellabeat is a tech-driven wellness company designed for women founded in 2014 and headquartered in San Francisco. By 2016, the company had quick growth and globally launched campaigns and opened locations in Zagreb, Croatia and Hong Kong, China.
The Bellabeat fitness products include: Bellabeat Application, Leaf Tracker, Time Watch, Spring Water Bottle, and Bellabeat membership.
The focus of this analysis is to suggest opportunities to improve the specific fitness product Leaf tracker. The Leaf is a pendant that measures your sleep cycles, activity, breathing, and monthly cycles in conjunction with the Bellabeat application. Leaf can be worn as a bracelet, necklace, or clip.
2015: Bellabeat launched a wearable tracker called Leaf Nature, characterized by its nature-inspired design and versatility of wear.[1] It is a wearable fitness tracker for women that tracks activity, sleep, and stress resistance.[2] The same year, the company added period and pregnancy tracking to its accompanying application.[3]
2016: The company launched Leaf Urban,[4] the updated version of the previous Leaf tracker that featured a more modern, water-resistant design, and updated the corresponding Bellabeat application with additional mindfulness guidance.[5]
2018: The Leaf collection was later expanded to include Leaf Chakra.[6]
2019: Bellabeat partners with Swarovski to launch Bellabeat Leaf Crystal, created with Swarovski crystals.[7][8]
2021: Bellabeat partners with Poosh by Kourtney Kardashian for the Poosh Your Wellness Festival, where the company held a solo panel with the topic “Living by Your Cycle With Bellabeat”[9] led by Bellabeat’s co-founder Urška Sršen and Shayna Taylor, a holistic nutritionist.
2023: Bellabeat hosts a Wellness Summer Power dinner at Nobu in Malibu, featuring Olivia Culpo as the host and attended by celebrities including James Charles, Sophia Culpo, Aurora Culpo, Montana Tucker, Sarah Stage, and others.[10] Bellabeat partners with The Body Shop on the Beyond Skin Deep wellness bundle[11] that combines Bellabeat’s Ivy wearable device with The Body Shop’s products.[12]
Compare usage data of Bellabeat’s competitors’ similar smart devices and present data insights and marketing recommendations to the Bellabeat executive team.
Shaping Women’s Health & Well-Being: Focusing on creating innovative health and wellness products for women, our mission is to empower women to take control of their health by providing them with technology-driven solutions that blend design and function.[13]
The Executive Team currently includes these six people:
SPECIAL NOTE: This report contains very highly detailed subject matter that would not typically be included to present to these key stakeholders, but is included to show the thought processes throughout this case study, since it may be used to show future employers of my work capabilities.
The following questions were brainstormed by Bellabeat’s Marketing Team in order to steer the analyses. Keep in mind not all questions will be answered by this analysis, but may require other data analyses in order to answer those remaining questions. It was used as a general guide to lead the Marketing Team.
There are 4 following datasets: the first was supplied by a key stakeholder and Chief Product Officer Urška Sršen. The second, third, and fourth sets were added to include more valid data, leading to more analysis, visualizations, insights, and recommendations beyond the first set.
The FitBit Fitness Tracker Data is a secondary, external, discrete, quantitative, structured, open source data downloaded from Mobius at https://www.kaggle.com/datasets/arashnic/fitbit and organized in rows and columns in long format.
Fitness tracker data was measured and collected from 30 consenting Fitbit users spanning 2 months from March 12 to May 12 of 2016. The data downloads in a zipped file of 2 separate month folders, containing 29 .csv files. Data was collected by Amazon Mechanical Turk via survey responders and the data had no identifiable personal information included. The data is cited properly.
The personal data includes hour, minute, and second information on sleep, physical activity level, activity intensity, calories and metabolic equivalents per activity, weight, and measured steps.
The data is rated for CC0: public domain use.
This data is not completely reliable because the data and its description do not mention gender.
The data is original, but not completely comprehensive, since the sleep file is only collected for the second month set.
The data is repetitive, including wide and long formats for the same data.
There are inconsistencies in the time elements of files. Some are reported in seconds, minutes, days, or a combination.
There are missing values in some of the data columns.
The data is not very current. It is from 8 years ago, in 2016.
The description of the data states the sample size is 30 participants. However, there are 35 distinct user IDs within the data sets.
The sample size is small at 35 and may contain men or women. It is not specified, whereas Bellabeat primarily focuses on women consumers. This is a bias that is noted.
The data is split up into separate months, each in their own folder, that will have to be combined or joined to analyze.
The data collection starts in the middle of the months, so this makes naming more challenging.
Seven .csv files are missing from the Mar 12 to Apr 11 folder that are contained in the Apr 12 to May 11 data file folders. Of the seven missing files:
Three “daily” files can be calculated using their counterpart “hourly” files.
Three “wide” files have already been converted to more easily used “narrow” files, so the wide formats can be ignored, deleted, or archived.
The 7th file is sleepDay_merged.csv and is only in the second month set.
This data is secondary, external, discrete, quantitative, structured, open source data download from https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/ZS2Z2J organized in long format.
The study included 26 women and 20 men consenting and wearing either Fitbit or Apple Watch fitness trackers to obtain caloric expenditure data to see if the trackers were accurate. Participants were tested by 3 different intensities or self paced for 40 minutes on a treadmill, and then 25 minutes of either standing, lying, or sitting. The study concluded that Fitbit and Apple Watch were able to predict physical activity type with reasonable accuracy. As far as privacy, the data had no identifiable personal identification.
The data was rated for CC0: Public Domain usage Version 1.
The sample size was small with only 46 participants.
The study doesn’t specify locations of people’s data or country of origin.
This data is a secondary, external, discrete, quantitative, structured, open source data download by UiT The Arctic University of Norway at DataverseNO https://dataverse.no/dataset.xhtml?persistentId=doi:10.18710/6ZWC9Z organized in long format.
The study consists of 423 types of fitness trackers or watches measuring which features each brand and type included. The country of origin, year released, company names, and device names are included. The data listed Boolean style arguments on features including an accelerometer, gyroscope, magnetometer, barometer, GPS, and PPG. Refer to glossary for technology definitions. Privacy was not an issue.
The data was rated for CC0: Public Domain usage Version 1.
The data was included in order to find trends globally and examine technology features in fitness trackers.
SPECIAL NOTE: This report contains very highly detailed subject matter that would not typically be included to present to these key stakeholders, but is included to show the thought processes throughout this case study, since it may be used to show future employers of my work capabilities.
The original data was copied to a RAW folder and locked with a password so no changes could be made. It was then recopied to a CLEANED folder to begin the scrubbing process. The RAW folder files will stay in their original form in case a backup replacement is needed. The file names will be consistent and standardized utilizing Bellabeat’s naming convention policy.
“Narrow” will be changed to “long”. “Daily” will be changed to “day” for time element consistency.
All file naming conventions will be changed to snake case, using lowercase and underscores to separate the individual file words.
“Merged” and “narrow” can all be discarded since they are no longer needed and all data is in long format.
Standardizing all time elements to plural and removing the “ly” from them shows consistency. All time elements in file names will be moved to after the main element (i.e. “hourlyCalories_merged.csv” will become “calories_hours.csv”.
METs will change to metabolic_equivalents, which are a way to measure the energy cost of physical activities, and are used by Fitbit devices to estimate exercise intensity.
The file WeightLogInfo will be changed to include the unit of measure kg to remain consistent (weight_kg.csv). The LogInfo column will be ignored because it does not contain useful information.
A folder will be created to contain the merged data from both months, named fitbit_data_03122016_05122016.
Google Sheets will be used to merge files and then column names only have to be standardized once instead of twice to save time. A “_month1” and a “_month2” will be added to the end of file names in the 1st and 2nd month’s folder, respectively, so the files with the same names wouldn’t be confusing during the merge. The cut and paste functions were used in Google Sheets to merge the files. Heartrates by seconds were 2 files that were too large to merge, so Google Cloud Services were activated in order to upload and merge those files.
Calories were in both hourly and in minutes. The minute increment was too small and hourly or daily calories would be more useful, so the calories_minutes files will be ignored, deleted, or archived depending on policy.
Google Sheets was used to merge all files, since Sheets can accommodate up to 5 million rows.
Google sheets was used to trim spaces, delete replicated lines, and utilize conditional formatting to change empty or null cells to zeros when warranted. Sorting and filtering assisted the cleaning process. Outliers were examined and discarded on an individual basis as per dataset logic.
The date_time columns were formatted as Date and Time datatypes to ensure consistency and for future JOINing functions in Tableau.
Variable names of columns and/or rows were standardized using snake case as per Bellabeat’s policy on naming conventions.
The cleaned files were then saved to their respective “_CLEANED” folders in .csv format to easily be uploaded to Tableau, BigQuery, or RStudio for analysis.
All files were downloadable as .csv files and needed no other conversion for consistency.
These prior steps helped transform the original data into high quality cleaned data that ensured data validation standards.
The same steps mentioned above were repeated to ensure data validation. However, this data was nearly cleaned and only had one data table.
One data value had to be cleaned because it was a company name made of only numbers, so it was a misread datatype of integer instead of string text. It was converted to string text for consistency.
Google Sheets - The dataset files were imported into Google Sheets. The Id columns were all the same, but the Date or Time columns had many different formats. The datatype was changed to Date and Time with column label “date_time” in order to create a consistent format (i.e. 4/12/2016 08:35:00) and join tables later. Merging the 2 months files together was attempted in Sheets, but the larger files kept crashing. All files were exported into .csv files in a CLEANED folder. Many pivot tables were created in Google Sheets to consolidate data for specific users, averages, sums, minimums, maximums, counts, and distinct counts.
Text Editor - Merging the 2 separate month files into one file was done in a simple text editor utilizing cut and paste. The files were renamed with “_both_months” at the end of file names. This created less data tables in Tableau to join later. The files were saved as .csv files. “Weight_kg_month1” and “Sleep_day_month2” could not be merged with their counterparts because they did not exist in the datasets. They will still be imported to see if any insights can be determined from them.
Tableau - The 9 consolidated datasets were imported into Tableau. The “Activities_day_both_months” table was used as the main table to connect all other relational tables. Id and date_time were the common links used as primary keys to join all the tables. The joined tables in Tableau appear as follows:
Google Sheets, Tableau, and SQL in BigQuery were used to organize and analyze the data in many different ways and multiple outputs and visualizations were created to demonstrate the findings.
Some of the following SQL queries were used in BigQuery to organize tables into manageable summary tables:
SELECT Id,
COUNT(Id) AS ID_count_entries,
SUM(TotalSteps) AS SUM_TotalSteps,
SUM(TotalDistance) AS SUM_TotalDistance,
SUM(TrackerDistance) AS SUM_TrackerDistance,
SUM(LoggedActivitiesDistance) AS SUM_LoggedActivitiesDistance,
SUM(VeryActiveDistance) AS SUM_VeryActiveDistance,
SUM(ModeratelyActiveDistance) AS SUM_ModeratelyActiveDistance,
SUM(LightActiveDistance) AS SUM_LightActiveDistance,
SUM(SedentaryActiveDistance) AS SUM_SecondaryActiveDistance,
SUM(VeryActiveminutes) AS SUM_VeryActiveMinutes,
SUM(FairlyActiveminutes) AS SUM_FairlyActiveMinutes,
SUM(LightlyActiveminutes) AS SUM_LightlyActiveMinutes,
SUM(SedentaryMinutes) AS SUM_SedentaryMinutes,
SUM(Calories) AS SUM_Calories
FROM `bellebeat-421514.health_data.activity`
GROUP BY Id
SELECT COUNT(DISTINCT(Id))
FROM `bellebeat-421514.health_data.weight`
SELECT * FROM `bellebeat-421514.bellebeat_data.heartbeats_seconds` CROSS JOIN `bellebeat-421514.bellebeat_data.heartrate_seconds2`
RStudio - An R Markdown report (this report) was generated in RStudio to document the entire analysis process from beginning to end and saved as HTML output for an online portfolio that will be created to share the information with key stakeholders and fellow Marketing Data Analysts.
Kaggle and Linkin will provide the web space for uploading the final report to a portfolio.
Most of the visualizations are from the primary dataset due to the multiple datasets. The three other datasets added to the case study had one table each to analyze, so there are less visuals and insights.
accelerometer - An accelerometer is a sensor that measures the rate of change of velocity, or acceleration, in one or more directions. In fitness trackers, accelerometers convert body movement into data, such as steps taken, distance traveled, and calories burned. Accelerometers can also be used to identify postures, classify movements, estimate energy expenditure, and detect falls.
barometer - A barometer is a sensor that measures air pressure. In a smartwatch, a barometer is used to measure the altitude, provide weather forecast, and track the changes in elevation while climbing or hiking.
BMI - Body Mass Index is a number that estimates body fat based on a person’s height and weight. BMI is calculated by dividing a person’s weight in kilograms by the square of their height in meters. A high BMI can indicate high body fatness.
ECG Sensor - Electrocardiogram (ECG) sensors, also known as electrodes, are devices used to visualize the electrical activity of the heart. An ECG, or EKG, measures the voltage changes that occur on the skin as electrical signals travel from the heart’s pacemaker to the upper and lower chambers. The ECG tracing shows how the depolarization wave moves during each heartbeat, which is a wave of positive charge. The pattern of the wave depends on the set of electrodes used. For example, one electrode might be placed on the right arm and the other on the left leg, and a positive deflection will occur when the waves move toward the left leg electrode.
GPS - Global Positioning System, is a global navigation satellite system that provides location, velocity and time synchronization.
gyroscope - A gyroscope is a sensor that measures angular velocity. In a smart device, the gyroscope is used to track the orientation of the device, enabling features such as screen rotation and step counting.
heart rate monitor - Heart rate monitors are devices that can detect and track your heart or pulse rate continuously. Most of these devices are wearable, and many are highly accurate.
magnetometer - A magnetometer in a smart device is a type of sensor that measures the direction and strength of a magnetic field.
METs - One MET is the energy you spend sitting at rest — your resting or basal metabolic rate. So, an activity with a MET value of 4 means you’re exerting four times the energy than you would if you were sitting still. To put it in perspective, a brisk walk at 3 or 4 miles per hour has a value of 4 METs.
PPG - Photoplethysmography, which is a non-invasive optical measurement technique that uses a light source and a photodetector to measure blood volume variations. It is often used for heart rate monitoring in fitness, professional sports, and off-hospital monitoring. PPG sensors are cheaper and use simpler hardware than traditional ECG-based systems.
skin temperature sensors - Skin temperature sensors are devices that measure the temperature of the skin’s surface and can be used to indicate body temperature. They can be placed on the skin at any site, but are often used on the forehead, axilla, or other areas. Skin temperature sensors can be used for biomedical temperature monitoring, efficacy testing, clinical research analysis, and more.
SpO2 sensor - or pulse oximeter sensor, is a small clip that measures the percentage of oxygen in a patient’s red blood cells. It’s a non-invasive device that uses a probe, red and infrared light sources, and photo detectors to transmit light through a patient’s blood. The device then measures how oxygenated and deoxygenated hemoglobin absorb light differently.
Perez, Sarah (September 30, 2014). “Bellabeat Debuts A Trio Of Health-Tracking Products For Moms-To-Be And Beyond, Including Jewelry, A Monitor And A Smart Scale”. TechCrunch. Retrieved December 1, 2022.
“Bellabeat Debuts LEAF, A Smart Charm Necklace That Monitors Your Stress Levels And Ovulation Schedule”. Bustle. May 28, 2015. Retrieved December 1, 2022.
Perez, Sarah (January 16, 2015). “Bellabeat Leaf, A Health Tracker For Women, Will Now Track Your Reproductive Cycle Too”. TechCrunch. Retrieved December 1, 2022.
Perez, Sarah (July 26, 2016). “Bellabeat Leaf’s new wearable and app help women tackle stress”. TechCrunch. Retrieved December 1, 2022.
Marie Claire (June 9, 2015). “6 Pieces Of Wearable Tech To Destress Your Mind And Make You Feel Calmer”. Marie Claire UK. Retrieved December 1, 2022.
Jovin, Ivan (September 21, 2018). “Bellabeat launches Leaf Chakra, a wearable for mind-body balance”. Gadgets & Wearables. Retrieved December 9, 2022.
KAPFUNDE, MUCHANETA (November 8, 2019). “Bellabeat’s New Leaf Crystal Designed To Help You Make Better Lifestyle Choices(FashNerd)”. techlovesstyle (in Chinese). Retrieved December 9, 2022.
“The new Bellabeat fashion accessory successfully manages stress - HAMAG BICRO” (in Croatian). Retrieved December 9, 2022.
Living By Your Cycle with Bellabeat: PYW 2022 | Poosh, retrieved December 9, 2022.
“Bellabeat Ushers in Summer with a Stunning Private Dinner, Hosted by Olivia Culpo and Isabel Alysa Vita at the Elite Nobu, Malibu”. www.flaunt.com. Retrieved September 26, 2023.
Weekly, La (July 31, 2023). “Bellabeat and The Body Shop Collaborate to Deliver a Holistic Wellness Experience - LA Weekly”. www.laweekly.com. Retrieved September 26, 2023.
Journal (July 11, 2023). “Strateška suradnja za novu eru wellnesa: Bellabeat se udružio s The Body Shopom i predstavio novitete”. Journal.hr (in Croatian). Retrieved September 26, 2023.
Bellabeat. (n.d.). About Us. Retrieved from https://bellabeat.com/about-us/
Images compiled from Google Chrome image search.
The following code was used to create the data tabs:
# {.tabset}
The following packages were installed to use R Markdown:
install.packages("rmarkdown")
install.packages("RStudio/Pandoc")
install.packages("LaTeX")
library(rmarkdown)
library(RStudio/Pandoc)
library(LaTeX)
SPECIAL NOTE: This report contains very highly detailed subject matter that would not typically be included to present to these key stakeholders, but is included to show the thought processes throughout this case study, since it may be used to show future employers of my work capabilities.
Thank You for Examining my Case Study! All work contained in this report by David Lentz. THE END.