Initial RMD Start Date: 2024-04-24
Last Updated: 2024-07-31
Print Date: 2025-07-12
TABLE OF CONTENTS
Utilize the Table of Contents (TOC) menu on the left to scroll
through the different sections and help navigate this report. You can
click on the title to go directly to the section.
If you’re viewing the PDF version, the Table of Contents stays at the
top of this report. To go directly to a section, copy and paste the
title into the search box. Use the following shortcut keys to help
navigate with ease:
Shortcut Keys:
- Hold
Ctrl
and press the Home
key to go to
the top of the page.
- Hold
Ctrl
and press the End
key to go to
the bottom of the page.
- Hold
Ctrl
and press the F
key to search
for specific text or strings of text.
- Press the
Esc
key to quickly exit the search box.
About This Report
This report is published on RPubs. KDF stands for Ken Does Fitness. Refer to
References at the bottom of this page for cited sources. The RMD file to
produce this report, and some of the figures from this report, may be
found in the Google Drive folder .
This report systematically records and analyzes workout statistics to
enhance individual and collective performance by achieving the following
goals:
- Track and Improve Performance - Regularly measuring workout
statistics helps individuals monitor progress, set realistic goals, and
tailor training plans for optimal results.
- Informed Decision Making - Detailed data analysis allows for
personalized adjustments in training loads, reducing the risk of injury
and enhancing overall performance.
- Foster Healthy Competition - Comparing personal performance over
time and with peers creates a motivating and competitive environment
that encourages continuous improvement.
- Data-Driven Insights - Identifying trends and patterns in
performance metrics helps athletes make informed adjustments to their
training for sustained improvement and injury prevention.
Pros
|
Cons
|
Enhanced performance tracking: Helps monitor progress and identify
strengths and weaknesses.
|
Resource intensive: Requires time and effort for accurate data entry and
maintenance.
|
Motivation and accountability: Sets clear benchmarks and goals, boosting
motivation.
|
Potential for misinterpretation: Incorrect data analysis can lead to
wrong conclusions.
|
Data-driven decision making: Informs better training choices based on
data analysis.
|
Privacy concerns: Risks of handling personal performance data
insecurely.
|
Facilitates comparison and competition: Encourages healthy competition
and self-improvement.
|
Emphasis on quantitative metrics: May overlook qualitative aspects like
mental well-being and enjoyment.
|
Formatting - This report is initially printed as one continuous page
in HTML format and may appear differently when viewed in PDF format.
Terminology - The terms activity, workout, and exercise may be used
interchangeably in this context, even though they have distinct
meanings.
Data Assumptions - The data used for analyses assumes each set of
exercise is performed to Rep Max (RM), which isn’t always the case,
particularly if the individual isn’t aiming to perform the set to RM.
Assuming each set of exercise is performed to Rep Max helps standardize
data and provide clear insights into maximal strength, which is
beneficial for benchmarking and tracking progress. However, it can also
lead to inaccuracies, potential overtraining, and misleading data if not
all exercises are performed to failure. Balancing the use of 1RM
assumptions with realistic training practices is essential for accurate
performance assessment and effective training programming.
STRENGTH STANDARDS
Strength Standards provide valuable benchmarks for assessing an
individual’s performance in key strength exercises that are widely
practiced in fitness and strength training. These standards categorize
performance levels based on either amount of weight executed relative to
bodyweight, or repetitions executed, and are segmented by gender and
experience levels. By comparing an individual’s performance against
these established standards, trainers and athletes can identify
strengths and weaknesses in their training regimens, set realistic
goals, and measure progress in a structured manner.
Strength Standards Overview
Several strength standards are readily available online:
Finding accurate strength standards depends on the demographic,
exercise type, and required accuracy. Each has its pros and cons: some
cater to specific demographics, others focus on certain exercises, and
their credibility affects reliability. Usability and comprehensive data
for benchmarking are crucial, as are regular updates to reflect current
trends. Considering these factors helps trainers and athletes choose the
best standards for assessing and improving performance.
Pros
|
Cons
|
Benchmarking: Strength standards provide clear benchmarks to measure
progress and set goals.
|
Generic: Strength standards can be too generic and may not account for
individual differences in genetics, body composition, and training
history.
|
Motivation: Seeing how your performance stacks up against established
standards can be highly motivating.
|
Pressure: They can create undue pressure to meet certain benchmarks,
potentially leading to injury or burnout.
|
Guidance: They offer guidance on what is considered strong or weak for
different exercises, helping you tailor your training accordingly.
|
Outdated Information: Some standards may be based on outdated
information or not reflect recent advancements in sports science.
|
Comparison: They allow for meaningful comparisons between individuals,
helping to identify strengths and weaknesses.
|
Not Comprehensive: They might not cover all exercises or account for
different training goals, such as hypertrophy versus strength.
|
Structured Training: Strength standards can help in designing more
structured and effective training programs.
|
Misleading: For beginners, strength standards can be misleading if they
don’t account for the starting point or rate of progress appropriate for
different fitness levels.
|
I’ve compiled some of the standards into an Excel file. This allows for
comparison between personal performance measures and the specified
strength standards. I recommend consulting each source for confirmation
on the latest version, as the structure or numbers may have changed over
time.
Standards Categories
The standards compiled are separated into categories, as shown in the
following table, making it easy to determine which exercise to use as a
benchmark.
Categories
|
Categories
|
Push Ups
|
Romanian Deadlift
|
Bench Press
|
T Bar Row
|
Incline Bench Press
|
Press
|
Pull Ups
|
Barbell Shoulder Press
|
Lat Pulldown
|
Dumbbell Shoulder Press
|
Deadlift
|
Military Press
|
Strength Standards List
Below is the list of Strength Standards compiled into the SS Excel
file.
This file references the source of each standard table. We’ll delve into
Mark Rippetoe’s Starting Strength Standards, as his and his colleagues’
methods and research are based on comprehensive and technical
information.
Strength Standard
|
Category
|
Male Barbell Curl Std 1RM By Age Strengthlevel
|
Barbell Curl
|
Male Barbell Curl Std 1RM By Bodyweight Strengthlevel
|
Barbell Curl
|
Male Barbell Curl Std 1RM Strengthlevel
|
Barbell Curl
|
Male Barbell Shoulder Press Std 1RM By Age Strengthlevel
|
Barbell Shoulder Press
|
Male Barbell Shoulder Press Std 1RM By Bodyweight Strengthlevel
|
Barbell Shoulder Press
|
Male Barbell Shoulder Press Std 1RM Strengthlevel
|
Barbell Shoulder Press
|
Male Bench Press Std 1RM By Age Strengthlevel
|
Bench Press
|
Male Bench Press Std 1RM By Bodyweight Mark Rippetoe
|
Bench Press
|
Male Bench Press Std 1RM By Bodyweight Strengthlevel
|
Bench Press
|
Male Bench Press Std 1RM Strengthlevel
|
Bench Press
|
Male Deadlift Std 1RM By Age Strengthlevel
|
Deadlift
|
Male Deadlift Std 1RM By Bodyweight Mark Rippetoe
|
Deadlift
|
Male Deadlift Std 1RM By Bodyweight Strengthlevel
|
Deadlift
|
Male Deadlift Std 1RM Strengthlevel
|
Deadlift
|
Male Dumbbell Shoulder Press Std 1RM By Age Strengthlevel
|
Dumbbell Shoulder Press
|
Male Dumbbell Shoulder Press Std 1RM By Bodyweight Strengthlevel
|
Dumbbell Shoulder Press
|
Male Dumbbell Shoulder Press Std 1RM Strengthlevel
|
Dumbbell Shoulder Press
|
Male Incline Bench Press Std 1RM By Age Strengthlevel
|
Incline Bench Press
|
Male Incline Bench Press Std 1RM By Bodyweight Strengthlevel
|
Incline Bench Press
|
Male Incline Bench Press Std 1RM Strengthlevel
|
Incline Bench Press
|
Male Lat Pulldown Std 1RM By Age Strengthlevel
|
Lat Pulldown
|
Male Lat Pulldown Std 1RM By Bodyweight Strengthlevel
|
Lat Pulldown
|
Male Lat Pulldown Std 1RM Strengthlevel
|
Lat Pulldown
|
Male Military Press Std 1RM By Age Strengthlevel
|
Military Press
|
Male Military Press Std 1RM By Bodyweight Strengthlevel
|
Military Press
|
Male Military Press Std 1RM Strengthlevel
|
Military Press
|
Named Range List
|
Named Ranges
|
Male Press Std 1RM By Bodyweight Mark Rippetoe
|
Press
|
Male Pull Ups Std 1RM Strengthlevel
|
Pull Ups
|
Male Pull Ups Std 1RM Weight By Age Strengthlevel
|
Pull Ups
|
Male Pull Ups Std 1RM Weight By Bodyweight Strengthlevel
|
Pull Ups
|
Male Pull Ups Std Reps By Age Strengthlevel
|
Pull Ups
|
Male Pull Ups Std Reps By Bodyweight Strengthlevel
|
Pull Ups
|
Male Push Ups Std Reps By Age Strengthlevel
|
Push Ups
|
Male Push Ups Std Reps By Bodyweight Strengthlevel
|
Push Ups
|
Male Push Ups Std Reps Strengthlevel
|
Push Ups
|
Male Romanian Deadlift Std 1RM By Age Strengthlevel
|
Romanian Deadlift
|
Male Romanian Deadlift Std 1RM By Bodyweight Strengthlevel
|
Romanian Deadlift
|
Male Romanian Deadlift Std 1RM Strengthlevel
|
Romanian Deadlift
|
Male Squat Std 1RM By Age Strengthlevel
|
Squat
|
Male Squat Std 1RM By Bodyweight Mark Rippetoe
|
Squat
|
Male Squat Std 1RM By Bodyweight Strengthlevel
|
Squat
|
Male Squat Std 1RM Strengthlevel
|
Squat
|
Male T Bar Row Std 1RM By Age Strengthlevel
|
T Bar Row
|
Male T Bar Row Std 1RM By Bodyweight Strengthlevel
|
T Bar Row
|
Male T Bar Row Std 1RM Strengthlevel
|
T Bar Row
|
MARK RIPPETOE’S STANDARDS
Mark Rippetoe’s Strength Standard: 1 Rep Max By Bodyweight.
Here we utilize Mark Rippetoe’s Strength Standards to benchmark
individual performance against established criteria for primary strength
exercises. These standards are instrumental in evaluating an
individual’s lifting capabilities in relation to their bodyweight and
training experience. By doing so, we can effectively gauge progress,
pinpoint areas that need improvement, and tailor training programs to
better meet the specific strength goals of each individual.
The exercises use the technique described in “Starting Strength:
Basic Barbell Training, 3rd ed.” The Bluebook may be purchased through
The Aasgaard Company online store and on Amazon in various mediums
(paperback, Kindle, audio). All lifts are performed with the barbell, as
indicated by the book title.
This project has been a continuous process of editing, updating, and
reviewing as I progress. After considering both StrengthLevel.com and
Mark Rippetoe’s Standards via an article by Legion Athletics, I decided
to utilize Mark Rippetoe’s Standards to create visual plot benchmarks.
(My primary goal is to create spider charts for creative and visually
motivating targets.) However, the Legion Athletics article lacked
information on how these measures are obtained and what techniques or
instruments are used in executing these standards.
I emailed rip@startingstrength.com for clarification regarding
the Male Overhead Shoulder Press Strength Standards and was advised to
“Ignore this table. It is invalid, and we haven’t used it in 20 years.
As you would know if you had read the blue book.” Upon further research,
Mark Rippetoe confirmed the changes in categories on Starting Strength
Forum
from the previous outdated categories (Beginner, Novice, Intermediate,
Advanced, and Elite) to the current Cat. I through Cat. V.
To properly utilize these standards, it is recommended to review
“Starting Strength: Basic Barbell Training” and “Practical
Programming for Strength Training.” The most recent and relevant 3rd editions
(2012) are available on their website, as well as in different mediums
on Amazon. You can download online book samples to determine if Starting
Strength suits your needs.
Strength Standard
|
Category
|
Male Bench Press Std 1RM By Bodyweight Mark Rippetoe
|
Bench Press
|
Male Deadlift Std 1RM By Bodyweight Mark Rippetoe
|
Deadlift
|
Male Press Std 1RM By Bodyweight Mark Rippetoe
|
Press
|
Male Squat Std 1RM By Bodyweight Mark Rippetoe
|
Squat
|
There are five Starting Strength Standards, with the Power Clean
excluded in this report. Each of the four remaining Standards is
presented below, followed by a plot to visualize performances for
comparison and motivation.
Male Bench Press Std 1RM By Bodyweight Mark Rippetoe
Bodyweight (lb)
|
Cat. I
|
Cat. II
|
Cat. III
|
Cat. IV
|
Cat. V
|
114
|
84
|
107
|
130
|
179
|
222
|
123
|
91
|
116
|
142
|
194
|
242
|
132
|
98
|
125
|
153
|
208
|
260
|
148
|
109
|
140
|
172
|
234
|
291
|
165
|
119
|
152
|
187
|
255
|
319
|
181
|
128
|
164
|
201
|
275
|
343
|
198
|
135
|
173
|
213
|
289
|
362
|
220
|
142
|
183
|
225
|
306
|
381
|
242
|
149
|
190
|
232
|
316
|
395
|
275
|
153
|
196
|
239
|
325
|
407
|
319
|
156
|
199
|
244
|
333
|
416
|
320+
|
159
|
204
|
248
|
340
|
425
|

Male Deadlift Std 1RM By Bodyweight Mark Rippetoe
Bodyweight (lb)
|
Cat. I
|
Cat. II
|
Cat. III
|
Cat. IV
|
Cat. V
|
114
|
97
|
179
|
204
|
299
|
387
|
123
|
105
|
194
|
222
|
320
|
414
|
132
|
113
|
209
|
239
|
342
|
438
|
148
|
126
|
234
|
269
|
380
|
482
|
165
|
137
|
254
|
293
|
411
|
518
|
181
|
148
|
274
|
315
|
438
|
548
|
198
|
156
|
289
|
333
|
457
|
567
|
220
|
164
|
305
|
351
|
479
|
586
|
242
|
172
|
318
|
363
|
490
|
596
|
275
|
176
|
326
|
373
|
499
|
602
|
319
|
180
|
333
|
381
|
506
|
608
|
320+
|
183
|
340
|
388
|
512
|
617
|

Male Press Std 1RM By Bodyweight Mark Rippetoe
Bodyweight (lb)
|
Cat. I
|
Cat. II
|
Cat. III
|
Cat. IV
|
Cat. V
|
114
|
53
|
72
|
90
|
107
|
129
|
123
|
57
|
78
|
98
|
116
|
141
|
132
|
61
|
84
|
105
|
125
|
151
|
148
|
69
|
94
|
119
|
140
|
169
|
165
|
75
|
102
|
129
|
153
|
186
|
181
|
81
|
110
|
138
|
164
|
218
|
198
|
85
|
116
|
146
|
173
|
234
|
220
|
89
|
122
|
155
|
183
|
255
|
242
|
93
|
127
|
159
|
189
|
264
|
275
|
96
|
131
|
164
|
194
|
272
|
319
|
98
|
133
|
167
|
199
|
278
|
320+
|
100
|
136
|
171
|
203
|
284
|

Male Squat Std 1RM By Bodyweight Mark Rippetoe
Bodyweight (lb)
|
Cat. I
|
Cat. II
|
Cat. III
|
Cat. IV
|
Cat. V
|
114
|
78
|
144
|
174
|
240
|
320
|
123
|
84
|
155
|
190
|
259
|
346
|
132
|
91
|
168
|
205
|
278
|
369
|
148
|
101
|
188
|
230
|
313
|
410
|
165
|
110
|
204
|
250
|
342
|
445
|
181
|
119
|
220
|
269
|
367
|
479
|
198
|
125
|
232
|
285
|
387
|
504
|
220
|
132
|
244
|
301
|
409
|
532
|
242
|
137
|
255
|
311
|
423
|
551
|
275
|
141
|
261
|
319
|
435
|
567
|
319
|
144
|
267
|
326
|
445
|
580
|
320+
|
147
|
272
|
332
|
454
|
593
|

1RM CALCULATOR
This 1RM Conversion Table from Strength Level displays the
Repetition Percentages of 1RM, helping to calculate the approximate
one-rep max (1RM). The one-rep max is the maximum weight you can lift
for a single repetition for a given exercise.
Repetitions
|
Percentage of 1RM
|
Example: Weight and Repetition if 1RM is 150
|
Example: Weight at 1RM if 5RM is 45
|
1
|
100%
|
150.0
|
50.6
|
2
|
97%
|
145.5
|
49.0
|
3
|
94%
|
141.0
|
47.5
|
4
|
92%
|
138.0
|
46.5
|
5
|
89%
|
133.5
|
45.0
|
6
|
86%
|
129.0
|
43.5
|
7
|
83%
|
124.5
|
42.0
|
8
|
81%
|
121.5
|
41.0
|
9
|
78%
|
117.0
|
39.4
|
10
|
75%
|
112.5
|
37.9
|
11
|
73%
|
109.5
|
36.9
|
12
|
71%
|
106.5
|
35.9
|
13
|
70%
|
105.0
|
35.4
|
14
|
68%
|
102.0
|
34.4
|
15
|
67%
|
100.5
|
33.9
|
16
|
65%
|
97.5
|
32.9
|
17
|
64%
|
96.0
|
32.4
|
18
|
63%
|
94.5
|
31.9
|
19
|
61%
|
91.5
|
30.8
|
20
|
60%
|
90.0
|
30.3
|
21
|
59%
|
88.5
|
29.8
|
22
|
58%
|
87.0
|
29.3
|
23
|
57%
|
85.5
|
28.8
|
24
|
56%
|
84.0
|
28.3
|
25
|
55%
|
82.5
|
27.8
|
26
|
54%
|
81.0
|
27.3
|
27
|
53%
|
79.5
|
26.8
|
28
|
52%
|
78.0
|
26.3
|
29
|
51%
|
76.5
|
25.8
|
30
|
50%
|
75.0
|
25.3
|
How to Calculate Your 1RM
To calculate the One Rep Max (1RM) based on the weight lifted and the
number of repetitions performed, use the following formula: Y lbs at X
Reps / % at X Reps = Y lbs 1RM
For example, if you did 5 reps at 200 lbs, and 5 reps corresponds to
89%, then 200 lbs / 0.89 = ~225 lbs 1RM
How to Calculate Your X-RM
To calculate the X Repetition Max (X-RM) from a given number of
repetitions and weight lifted, first calculate the 1RM, then use the 1RM
to get the X-RM. We do this through the following formula: Y lbs at X
Reps / % at X Reps = Y lbs 1RM
Then multiply by the percentage of your target repetitions to get
X-RM: 1RM * % at Target Reps = Y lbs X-RM
For example, if you did 5 reps at 45 lbs, and 5 reps corresponds to
89%, and you want to find the 3RM: First, calculate the 1RM: 45 lbs /
0.89 = ~50.56 lbs 1RM Then calculate the 3RM using the 3 rep percentage
(94%): 50.56 lbs * 0.94 = ~47.53 lbs 3RM
KDF WORKBOOK
A Brief Overview
This project derives data from the primary workbook, named KDF (Ken
Does Fitness). For years, I recorded my personal
performance to recall my previous workouts for the next session, as I
often had trouble remembering the weights used. As I started learning
analytics and coding methods, I wanted to apply those techniques to
something personal and applicable, which led to this report. It has
indeed been a learning process. Not to mention, the data cleaning and
organization have also been a time-consuming process. As of 2025-07-12,
the contents of this workbook include a combination of performance data,
formulas, code, and reference resources. Until a separate file with a
more user-friendly experience is available without the necessary
back-end coding information, this file will have to suffice.
The following table from KDF addresses the primary sheets helpful in
improving workout performance and performance analyses. Some of the
sheets listed serve as sections in this report.
TABLE OF CONTENTS
|
EXPLANATION
|
Date_Notes
|
Contains date-specific notes and entries, used for tracking progress,
gaps in progress, and important events.
|
Vitals
|
Records vital statistics such as weight, body measurements, and other
health-related data.
|
Raw_Data
|
Stores the unprocessed data collected from various sources with little
analysis calculations performed.
|
Stats_Specific
|
Contains specific statistical analyses or summaries derived from the raw
data. Using the Primary Drop Down Table helps draws the statistics
for the drafted Target Goal, the Last Exercise performed, date-specified
statistics for Past Exercises, and a 1RM Convertor based on mentioned
statistics to serve as a benchmark goal to help finish off that last set
of exercise. Available on the mobile Excel app, replacing the Pivot
Table as it is unavailable on certain mobile versions of the
app. *This workbook is set to View; you’ll need to “Edit a copy” and
save this file to filter criteria and properly explore this sheet.
|
Raw_Data_Pivot
|
Provides pivot tables created from the raw data for summarizing and
analyzing data trends. Pivot Table is useful, but some mobile
versions or phones, does not support ability to change criteria with
Pivot Tables. Unable to find solution to properly utilize Pivot
Tables on the mobile phone app. Therefore Stats_Specific datasheet was
created for mobile review and filtering using the drop-down menu.
|
Demographic
|
Includes demographic information about individuals or groups, such as
age, gender, and other relevant characteristics.
|
DEMOGRAPHIC
This section includes demographic information about individuals or
groups, such as age, gender, and other relevant characteristics. Lists
information for individuals with at least one set of data in the
workbook.
Name
|
Birthdate
|
Age
|
City
|
Alex
|
1996-03-04
|
28
|
Santa Barbara
|
Josh
|
1991-11-03
|
33
|
Santa Barbara
|
Kendrick
|
1992-05-24
|
32
|
Santa Barbara
|
Lucien
|
1995-05-04
|
29
|
Los Angeles
|
Nathan
|
1992-06-11
|
32
|
Los Angeles
|
Sam
|
NA
|
NA
|
NA
|
Test Dummy
|
1986-06-09
|
38
|
San Diego
|
VITALS
The “Vitals” section tracks key health metrics such as weight, body
measurements, and heart rate to monitor progress, set personalized
fitness goals, and identify patterns or potential health risks. By
providing a comprehensive view of an individual’s health, it enables
data-driven decisions for adjusting fitness plans and ensures a balanced
approach to achieving long-term fitness goals. Regular updates and
visual tools like charts help in understanding trends and motivating
continued effort. If performance data is the brain of this report, then
vitals are the heart—pun intended.
Vitals Category
The Vitals columns provide a list of measurement categories available
for further analysis.
Entries
|
Entries
|
Date
|
Weight lbs
|
Time
|
Height
|
Name
|
Body Fat %
|
Systolic Blood Pressure (Top Number 120)
|
BMI
|
Diastolic Blood Pressure (Bottom Number 80)
|
Subjective Symptoms
|
Pulse
|
Other Notes
|
Heart Rate
|
|
Individuals with a Vital
Alex |
Josh |
Kendrick |
Lucien |
Test Dummy |
BODYWEIGHT
An individual’s bodyweight is used to match performance measurements
with Mark Rippetoe’s Strength Standards mentioned earlier in this
report. Bodyweight allows for simple techniques and methods to assess
performance, such as when hitting plateaus or experiencing changes in
bodyweight.
All Bodyweight Over Time
The plot displays all bodyweight measurements over time, with all
individuals plotted on a single graph. Individuals represented by just a
dot and no line have only one measurement recorded, indicating that more
data is required; any two points will create a line.
Name
|
Date Range
|
Entries
|
Min BW
|
Min BW Date
|
Max BW
|
Max BW Date
|
Alex
|
2023-10-04 to 2024-07-31
|
15
|
175.0
|
2024-07-31
|
196.0
|
2024-05-08
|
Josh
|
2024-03-05 to 2024-06-06
|
2
|
185.0
|
2024-03-05
|
190.0
|
2024-06-06
|
Kendrick
|
2023-09-05 to 2024-07-31
|
95
|
130.3
|
2023-09-22
|
141.8
|
2024-02-11
|
Lucien
|
2024-02-06 to 2024-05-05
|
2
|
155.0
|
2024-02-06
|
160.0
|
2024-05-05
|
Test Dummy
|
2023-08-20 to 2024-01-05
|
5
|
158.0
|
2024-01-02
|
169.0
|
2023-08-20
|

Individual Bodyweight Over Time
These visualizations allow us to:
- Track the bodyweight trends of each individual.
- Identify key metrics such as the minimum and maximum bodyweight, and
the number of measurements taken.
- Understand how bodyweight changes over time may correlate with
performance in strength exercises and other fitness metrics.
- By comparing these plots with strength standards, demographic
information, and other performance data, we can gain valuable insights
into each individual’s progress and areas needing improvement.





Individual Bodyweight Over Last 2 Weeks
Unable to plot all individuals in one figure as each individual’s
last 2-week date range differs from one another. The average bodyweight
over the date range is presented in the subtitles.





Avg Bodyweight Over Last 2 Weeks
Side by side comparison between individuals.
Name
|
Age At Avg BW
|
Average Bodyweight (lbs)
|
Date Range
|
Alex
|
29
|
175.00
|
2024-07-18 to 2024-07-31
|
Josh
|
33
|
190.00
|
2024-05-24 to 2024-06-06
|
Kendrick
|
33
|
133.46
|
2024-07-18 to 2024-07-31
|
Lucien
|
29
|
160.00
|
2024-04-22 to 2024-05-05
|
Test Dummy
|
38
|
160.68
|
2023-12-23 to 2024-01-05
|

Avg Bodyweight by Age
This plot displays data for all individuals. This dataset–age vs
bodyweight–is seems limited in isolation. With other data points and
contextion information it can offer valuable insights,particularly if
used it can still offer valuable insights when used alongside other data
points and contextual information. Its usefulness largely depends on how
it’s interpreted and integrated into a broader analysis framework.

DATA ENTRIES
The meat and potatoes. Without this, what’s the point?
Column Entries
The following columns from the Raw Data sheet in the KDF Workbook are
shown below. Nearly all columns are utilized for analyses, although
some, like “Duration,” are rarely recorded. Most of my workouts are
focused on strength training rather than cardio and endurance training.
Volume measures have also not been a primary focus compared to 1RM
(one-rep max), which serves as a requirement for comparison with 1RM
strength standards.
Column Names
|
Column Names
|
Date
|
Barbell_Weight
|
Day
|
Total_Weight
|
Name
|
Raw_Reps
|
Position
|
Final_Reps
|
Instrument
|
1RM_Calculated_Weight
|
Activity
|
Volume
|
Lateral_Movement_Performed
|
Duration
|
Variation
|
TIMEVALUE
|
Grip
|
Distance
|
Set_#
|
Notes
|
Weight
|
|
Exercise Variations List
Entries from raw_data
are extracted for further
analysis. Whether indoors or outdoors, using calisthenics or equipment,
performing aerobic or anaerobic exercises, there are countless exercises
and ways to perform them. These exercises vary in instruments and
positions used, requiring different muscle activation.
Not included in the lists are entries pertaining to different grips
(wide, mid, close, underhand, overhand, neutral, and alternate, such as
for deadlifts). A Wide Grip Bicep Curl utilizes external rotation,
limits the range of motion allowing for heavier weights, places more
stress on the elbow joints, emphasizes the short head of the biceps, and
engages the upper back and shoulders. In contrast, a Close Grip Bicep
Curl utilizes internal rotation, allows for a greater range of motion,
focuses on the long head of the biceps, and engages the forearm
muscles.
Positions
Incorporating different positions into a workout routine can enhance
muscle development, improve functional strength, and reduce the risk of
overuse injuries by ensuring a balanced workout.
Positions
|
Positions
|
Bent Over
|
Seated
|
Decline Bench
|
Seated Back Pad
|
Flat Bench
|
Seated Front Pad
|
Incline Bench
|
Side-Lying
|
Kneeled
|
Standing
|
Prone
|
|
Instruments
Using a variety of instruments in workouts can help target different
muscle groups more effectively, provide varied resistance, and offer
safer and more efficient exercise options.
Instruments
|
Instruments
|
Instruments
|
Ab Crunch Machine
|
Full Crunch Machine
|
Machine (Loaded)
|
Barbell
|
Hack Squat Machine (Loaded)
|
Machine (Nitro Pec Fly / Rear Delt)
|
Barbell (Loaded)
|
Hammer Strength Machine
|
Plate
|
Bicep Curl Machine
|
Hip Abduction Machine
|
Plyo Box
|
Cable
|
Horizontal Leg Press Machine
|
Preacher Pad
|
Cable Bar
|
Hyperextension Bench
|
Preacher Pad, Dumbell
|
Cable EZ Bar
|
Iso-Lateral Incline Press Machine (Loaded)
|
Prone Leg Curl Machine
|
Cable Handle
|
Isolateral Row Machine
|
Pull-Up Bar
|
Cable Handle (Doube Pulley)
|
Kettlebell
|
Resistance Band
|
Cable Rope
|
Lat Pulldown Bar
|
Smith Machine (Loaded)
|
Cable Rope (Long)
|
Lat Pulldown Bar (Double Pulley)
|
Straight Barbell
|
Cable Straight Bar
|
Lat Pulldown Bar (Mag Grip)
|
T-Bar Row Landmine
|
Cable V-bar
|
Lat Pulldown Bar (Mag Grip) (Double Pulley)
|
Track Lane 6
|
Cable V-bar (Double Pulley)
|
Leg Curl Machine
|
Trap Bar (Loaded)
|
Dumbbell
|
Leg Extension Machine
|
Treadmill
|
EZ Bar
|
Leg Press Machine (Loaded)
|
|
EZ Bar (Loaded)
|
Machine
|
|
Activities
The following activities (or exercises) have been entered into the
Raw Data. Other activities are available but have not been performed.
Some, such as “Track Lane 6,” are specific to location and time
duration. Other activities are universally available, such as Chest
Press and Lat Pulldown. However, when time is a factor, sometimes you’ve
got to pivot and improvise your workout with other types of
exercises.
Activities
|
Activities
|
Activities
|
Abdominal Crunch
|
High-Bar Squat
|
Plate Curls
|
Abdominal Crunch (Arm & Leg Pull)
|
High Chest Fly
|
Pull-Up
|
Arnold Press
|
Hip Abduction
|
Push-up
|
Back Extension
|
Hip Thrust
|
Push Press
|
Behind The Neck Press
|
Jump
|
Rear Delt Fly
|
Bench Row
|
Kettlebell Swing (One Handed)
|
Row
|
Bicep Curl
|
Kettlebell Swing (Two Handed)
|
Run
|
Box Jump 30”
|
Lat Pulldown
|
Run (Nona’s->Mission)
|
Box Jump 36”
|
Lateral Shoulder Raise
|
Run (Nona’s<-Mission)
|
Box Jump 39”
|
Leg Curl
|
Run (Treadmill)
|
Box Jump 40”
|
Leg Extension
|
Shoulder Raise
|
Box Jump 42”
|
Leg Press
|
Shoulder Shrug
|
Cable Y Raise
|
Low-Bar Squat
|
Side Bend
|
Calf Raise
|
Low Chest Fly
|
Side Leg Circles
|
Chest Fly
|
Low Row
|
Skull Crusher
|
Chest Press
|
Lunge
|
Sprint
|
Crunch
|
Military Press
|
Squat
|
Deadlift
|
Oblique Twist
|
Straight Arm Lat Pulldown
|
Diagonal Shoulder Raise
|
Oblique Twist Down
|
Tricep Pulldown
|
Face Pull
|
Oblique Twist Up
|
Tricep Pushdown
|
Face Pull with External Rotation
|
One Arm Oblique Crunch
|
Upright Row
|
Face Pull with Externatl Rotation
|
Overhead Shoulder Press
|
Wrist Curl
|
Front Shoulder Raise
|
Overhead Tricep Extension
|
|
Hammer Curl
|
Pallof Press
|
|
STATS TO STANDARDS
Performance measures are evaluated to match Standards.
1RM Table - Avg L2W & ATH
The following table reports the average one-repetition maximum (1RM)
for each individual across each specified exercise (“Selected Criteria”)
over the last two weeks (L2W). The data for the last two weeks serves as
a benchmark for the next workout. As the saying goes, “You’re only as
good as your last.”
The table also reports highest 1RM over all time (ATH) for each
selected criterion, accompanied by the last date the highest 1RM was
achieved.
The 1RM is calculated based on weight executed per set and the number
of reps, using the 1RM Conversion Table.
All calculations assume the standard barbell (Olympic barbell) weight
of 45 pounds (20.4 kg).
Keep in mind there are many variables affecting performance. One of
these variables includes grip (wide, alternating, overhand, etc.). One
individual may use alternating grips (left underhand/right overhand, or
vice-versa) when performing deadlifts to maximize weight execution.
Another individual might utilize an overhand grip. Someone else may
perform a combination of alternating grip and overhand grip. The
following datasets and figures do not account for these differences.
Selected Criteria
|
Index
|
Name
|
Average 1RM (lbs)
|
Date Range (Last 2 Weeks)
|
Highest 1RM All Time (lbs)
|
Last Date Highest 1RM All Time
|
Flat Bench Barbell Chest Press
|
1
|
Alex
|
185.83
|
2024-07-18 to 2024-07-31
|
228.40
|
2024-01-13
|
Flat Bench Barbell Chest Press
|
1
|
Josh
|
213.15
|
2024-04-25 to 2024-05-08
|
253.09
|
2023-09-13
|
Flat Bench Barbell Chest Press
|
1
|
Kendrick
|
152.97
|
2024-07-20 to 2024-08-02
|
174.70
|
2024-02-07
|
Flat Bench Barbell Chest Press
|
1
|
Nathan
|
162.92
|
2023-04-07 to 2023-04-20
|
162.92
|
2023-04-20
|
Barbell Deadlift
|
2
|
Alex
|
224.51
|
2024-07-18 to 2024-07-31
|
261.63
|
2023-12-17
|
Barbell Deadlift
|
2
|
Josh
|
236.38
|
2023-10-27 to 2023-11-09
|
275.28
|
2023-10-18
|
Barbell Deadlift
|
2
|
Kendrick
|
219.21
|
2024-07-18 to 2024-07-31
|
267.44
|
2024-02-14
|
Barbell Overhead Shoulder Press
|
3
|
Alex
|
106.09
|
2024-06-29 to 2024-07-12
|
110.47
|
2024-07-12
|
Barbell Overhead Shoulder Press
|
3
|
Kendrick
|
72.33
|
2024-07-13 to 2024-07-26
|
87.21
|
2024-01-02
|
Barbell Squat
|
4
|
Alex
|
185.75
|
2024-07-02 to 2024-07-15
|
191.86
|
2024-07-15
|
Barbell Squat
|
4
|
Kendrick
|
173.99
|
2024-07-13 to 2024-07-26
|
207.87
|
2024-02-12
|
NOTES FOR FUTURE IMPLEMENTATION
Power BI vs. RStudio (R Programming)
Creating this report requires knowledge in managing the different
coding structure–it requires knowing R. It can be replicated through
other programs, and by doing so may enhance the overall experience. Such
is the case with Power BI but in a different aspect.
Power BI is better suited for creating interactive dashboards and
visualizations quickly and sharing them easily with non-technical
stakeholders. It is user-friendly and integrates well with other
Microsoft products, making it ideal for business environments that
require regular reporting and real-time data updates.
R Studio, on the other hand, is ideal for in-depth data analysis,
complex statistical modeling, and projects that require extensive data
manipulation and custom analyses. It offers greater flexibility and is
more powerful for advanced analytics, although it requires programming
knowledge and may have a steeper learning curve.
If the primary goal is to perform advanced statistical analysis and
custom modeling, R Studio would be more appropriate. However, if the
goal is to create interactive, shareable dashboards and reports, Power
BI might be the better choice. This is a toss coin, as my intent is to
provide both advanced statistical analysis and custom modeling as I
learn to further utilize the program, but also allow others the
availability to interact with their data, should that time ever
come.
Prospective Statistical Analyses
More advanced statistical measurements are available to derive from
the dataset. I’ve included a general list of ideas to explore. Some
ideas require a larger demographic group, and others necessitate the use
of technological devices (watches and phone apps). At the time of
publishing, I am working on plotting multiple variables on one plot. One
includes plotting Average Bodyweight (y) and Daily Average 1RM (y) over
time (x). This will help explore any correlation and causation between
variables. Findings are expected to provide adaptable insights into
daily routines and habits.
Dual Y-axis Plot
This type of plot allows you to visualize two different y-variables
against the same x-variable
Average Bodyweight and Daily Average 1RM over time
- Plot = Selected Criteria for Individual
- Y-variable = Avg Bodyweight
- Requires creating new df from Vitals; Daily_Avg_Bodyweight (Avg BW
of all measures on each date)
- Y-variable = Daily Avg 1RM
- Requires creating new df from Raw Data; Daily_Avg_1RM (Avg 1RM of
all sets on each date)
- X-variable = Date (day/month)
- Imagine plot similar to Webull Comparison Quotes structure
Daily Average 1RM and # Daily Sets over time for Selected
Criteria
- Plot = Selected Criteria for Individual
- Y-variable = Daily Average 1RM
- Consider to sub Daily with Weekly or Monthly Average 1RM
- Consider to sub Average 1RM with Highest 1RM
- Y-variable = Daily Sets Performed
- Consider to sub Daily Sets with Weekly or Monthly Sets
- A larger and consistent number of sets performed is expected
correlate or maintain 1RM measures.
- Gaps in number of sets performed is expected to correlate with
decreased 1RM measures

Stacked Bar Chart
Loop plots for each individual. Show [total] value of a category and
segments that bar to show the proportion of different sub-categories
within that [total] value.
% Sets of Selected Criteria by ATH 1RM and Avg 1RM
- Value = % sets performed (ie 2% of all sets performed)
- Sub-category = ATH 1RM as the full bar (ie Chest Press 174.70,
Deadlift 267.44)
- Sub-category = Avg 1RM L2W as percentage of ATH 1RM (ie Chest Press
152.71, Deadlift 219.21)
- Category = Selected Criteria (Chest Press, Deadlift, etc.)
- *Avg 1RM L2W as percentage of ATH 1RM may not be the best dataset to
compare with respect to % sets of selected criteria performed. Consider
other Sub-categories for comparison.
Other Charts
Vertical Bar Chart
- Usage: Comparing the frequency or count of different exercise
types.
- Example: Number of sessions for each exercise type (e.g., Bench
Press, Deadlift, Squat, Press).
Horizontal Bar Chart:
- Comparing the frequency or count of different exercise types with
long labels.
- Example: Number of sessions for each exercise type, displayed
horizontally for better readability.
Stacked Bar Chart:
- Usage: Showing the composition of sets within each exercise.
- Example: Number of sets performed for each exercise type, broken
down by reps range (e.g., 1-5 reps, 6-10 reps, 11+ reps).
Grouped (Clustered) Bar Chart:
- Usage: Comparing the average 1RM (one-repetition maximum) for
different exercises across multiple individuals.
- Example: Average 1RM for Bench Press, Deadlift, Squat, and Press for
Alex, Kendrick, and Josh.
100% Stacked Bar Chart:
- Usage: Showing the percentage composition of total workouts by
exercise type.
- Example: Percentage of total workouts made up by Bench Press,
Deadlift, Squat, and Press.
Diverging Bar Chart:
- Usage: Displaying positive and negative performance changes in
different exercises.
- Example: Performance improvement or decline in 1RM for each exercise
type over a period.
Lollipop Chart:
- Usage: Showing average bodyweight for each individual.
- Example: Average bodyweight of Alex, Kendrick, and Josh represented
as dots on lines.
Dot Plot:
- Usage: Showing the performance scores (e.g., 1RM) for different
exercises.
- Example: Performance scores for Bench Press, Deadlift, Squat, and
Press for each individual.
Variable Width Bar Chart:
- Usage: Showing the volume of weight lifted where the width
represents the number of sessions.
- Example: Total weight lifted in Bench Press, Deadlift, Squat, and
Press with bar width representing the number of sessions.
Waterfall Chart:
- Usage: Showing the cumulative effect of different exercise sessions
on total weight lifted.
- Example: Cumulative total weight lifted over time for different
exercises.
Pareto Chart:
- Usage: Highlighting the most significant exercises contributing to
total weight lifted.
- Example: Total weight lifted for each exercise type in descending
order with a line showing cumulative percentage.
Gantt Chart:
- Usage: Project planning and tracking progress over time.
- Example: Workout schedule showing the timeline for different
exercises and sets over a month.
Histogram:
- Usage: Showing the frequency distribution of bodyweight.
- Example: Distribution of bodyweight entries over the last
month.
Mosaic Plot:
- Usage: Displaying the relationship between exercise type and the
number of sets performed.
- Example: Visualization of how different exercises are associated
with the number of sets performed.
Utilization of Datasets
Longitudinal Analysis of Training Progress:
- Objective: To track and analyze the training progress over an
extended period.
- Implementation: Utilize time series analysis to monitor changes in
1RM and bodyweight over time. This will help identify trends, plateaus,
and periods of significant improvement. Applying smoothing techniques
and ARIMA models can aid in forecasting future performance based on
historical data.
Correlation and Causation Studies:
- Objective: To understand the relationship between different
variables.
- Implementation: Perform correlation analysis to investigate the
relationships between bodyweight, training frequency, nutritional
intake, and 1RM performance. Follow up with regression analysis to
identify potential causal relationships and quantify the impact of
various factors on strength gains.
Comparative Analysis Across Different Demographics:
- Objective: To compare performance metrics across different
demographics such as age, gender, and training experience.
- Implementation: Use t-tests and ANOVA to compare 1RM performance and
bodyweight across different demographic groups. This can help tailor
training programs to specific groups and improve overall
effectiveness.
Enhanced Visualization Techniques:
- Objective: To create more insightful and interactive
visualizations.
- Implementation: Incorporate advanced visualization libraries such as
plotly for interactive plots that allow for deeper exploration of the
data. This can include dynamic plots where users can filter and view
specific subsets of the data.
Predictive Modeling for Performance Optimization:
- Objective: To develop models that predict performance outcomes and
optimize training regimens.
- Implementation: Apply machine learning techniques such as random
forests, support vector machines, or neural networks to predict 1RM
performance based on a range of input variables. These models can be
used to suggest optimal training loads and volumes for individuals.
Cluster Analysis for Identifying Training Patterns:
- Objective: To identify distinct training patterns and group similar
individuals.
- Implementation: Use clustering techniques such as K-means clustering
to identify groups of individuals with similar training patterns and
performance metrics. This can help in customizing training programs for
different clusters based on their unique characteristics.
Survival Analysis for Training Attrition:
- Objective: To analyze the dropout rates and factors affecting
training adherence.
- Implementation: Perform survival analysis to study the
time-to-dropout for trainees and identify factors that influence
training adherence. This can guide interventions aimed at reducing
dropout rates and improving long-term training commitment.
Principal Component Analysis (PCA) for Dimensionality Reduction:
- Objective: To simplify complex datasets and identify key performance
indicators.
- Implementation: Apply PCA to reduce the dimensionality of your
dataset while retaining most of the variance. This can help in
identifying key variables that most influence training outcomes and
performance.
Nutrition and Recovery Analysis:
- Objective: To study the impact of nutrition and recovery on
performance.
- Implementation: Collect and analyze data on dietary intake, sleep
patterns, and recovery methods. Use regression analysis to determine how
these factors influence strength gains and overall performance.
Benchmarking and Standard Development:
- Objective: To establish performance benchmarks and standards for
different populations.
- Implementation: Utilize the collected data to develop performance
standards and benchmarks. This can involve creating percentile ranks and
performance categories that can be used for setting realistic goals and
expectations for trainees.
Data Collection Enhancements
Incorporating Wearable Technology:
- Utilize wearable devices to collect real-time data on heart rate,
sleep quality, and daily activity levels. This data can provide deeper
insights into how lifestyle factors influence training performance.
Detailed Nutritional Logging:
- Implement a system for detailed nutritional logging, allowing for
the collection of data on macronutrient and micronutrient intake.
Analyzing this data can help optimize dietary plans to support training
goals.
Psychological and Motivational Assessments:
- Regularly assess psychological factors such as motivation, stress
levels, and mental fatigue. Understanding the psychological aspects of
training can provide a more holistic view of an individual’s performance
and adherence to the training program.
Recovery and Injury Data:
- Collect data on recovery strategies and injury occurrences.
Analyzing this information can help in developing injury prevention
programs and optimizing recovery protocols to enhance performance.
Advanced Biometric Data:
- Incorporate the collection of advanced biometric data such as blood
markers, hormone levels, and muscle oxygenation. This can provide a
deeper understanding of physiological responses to training.
REFERENCES
---
title: "Project KDF"
author: 'Author: Kendrick Tham'
output:
  html_document:
    toc: yes
    toc_depth: 3
    toc_float:
      collapsed: no
      smooth_scroll: yes
      placement: top
      toc_subtree: yes  # Ensures that sub-sections are hidden until the primary section is clicked
    # code_folding: hide   # Hide the code by default
    code_download: true  # Add a button to download the R code    
---

Initial RMD Start Date: 2024-04-24

Last Updated: 2024-07-31

Print Date: `r Sys.Date()`

<!-- # RMD Setup -->

<style>

/* Reduce the font size of the TOC */
{
  font-size: 0.90em; /* Adjust the size as needed */
}

/* Add borders and spacing to all figures */
img {
  border: 1px solid #ddd; /* Specify border style */
  padding: 5px; /* Add padding around the figures */
  border-radius: 5px; /* Add rounded corners to the border */
  display: block; /* Ensure the image is treated as a block element to respect margin */
  margin-bottom: 20px; /* Add bottom margin to each image for spacing */
}

/* Add borders to all tables */
table {
  border-collapse: collapse; /* Collapse borders into a single border */
  border: 1px solid #ddd; /* Specify border style */
  margin-bottom: 20px; /* Add bottom margin to each table for spacing */
}

/* Add borders to table cells */
th, td {
  border: 1px solid #ddd; /* Specify border style for table cells */
  padding: 8px; /* Add padding inside the table cells */
  text-align: left; /* Align text to the left within table cells */
}


/* Ensuring all column headers are aligned to the left */
table.kable-table th, table.kable-table td {
   text-align: left !important;
}


/* Apply styles to all headers */
th {
  position: sticky; /* Ensure headers stick when scrolling */
  top: 0; /* Position headers at the top */
  background-color: #f2f2f2; /* Background color for headers */
  z-index: 999; /* Ensure headers stay on top of other elements */
}

/* Apply styles to all tables */
table {
  border-collapse: collapse; /* Collapse borders into a single border */
  border: 1px solid #ddd; /* Specify border style */
}

/* Apply styles to table cells */
th, td {
  border: 1px solid #ddd; /* Specify border style for table cells */
  padding: 8px; /* Add padding inside the table cells */
  text-align: left; /* Align text to the left within table cells */
}

/* Style tables with alternating striped rows */
table {
  border-collapse: collapse;
  width: 100%;
}

th, td {
  border: 1px solid #dddddd;
  text-align: left;
  padding: 8px;
}

/* Add alternating background colors to table rows */
tr:nth-child(even) {
  background-color: #f2f2f2;
}

tr:nth-child(odd) {
  background-color: #ffffff; /* White background for odd rows */
}

</style>

```{r default_chunk_options, include=FALSE}
# Set repository and default chunk options
options(repos = c(CRAN = "https://cloud.r-project.org"))
knitr::opts_chunk$set(echo=FALSE, eval=FALSE, results='hide', message=FALSE, warning=FALSE)
```

<!-- 
Options such as echo, message, warning, results, include, eval, cache, and more. 

- echo: Show or hide code in the output (echo = FALSE hides the code).
- message: Show or hide messages in the output (message = FALSE suppresses messages).
- warning: Show or hide warnings in the output (warning = FALSE suppresses warnings).
- results: Control how results are shown:
  - 'markup': Display results as they are.
  - 'asis': Include results as raw HTML or LaTeX.
  - 'hide': Hide results from output.
- include: Include or exclude both code and results from the output (include = FALSE excludes everything).
- eval: Evaluate the code chunk or not (eval = FALSE prevents code execution).
- cache: Cache the results of the chunk to speed up rendering (cache = TRUE enables caching).
- fig.cap: Add a caption to graphical outputs.
- out.width/out.height: Set the output size of plots.
- collapse: Concatenate textual output into a single block.
- tidy: Tidy up the display of code or results (tidy = TRUE tidies the output).
-->

```{r packages_and_libraries, eval=TRUE}

# List of all required libraries
required_libraries <- c("ggplot2", "stringr", "readxl", "dplyr", "lubridate", "hms", "openxlsx", "writexl", "rmarkdown", "knitr", "kableExtra", "ggnewscale", "gridExtra")

# ggplot2     for 'declaratively' creating graphics, based on "The Grammar of Graphics". 
# stringr     for working with strings, i.e., character data.
# readxl      for reading Excel files by providing a way to read xls and xlsx files into R without dependencies on external software.
# dplyr       for data manipulation, providing a consistent set of verbs that help you solve the most common data manipulation challenges.
# lubridate   for working with dates and times in R.
# hms         for storing time-of-day values and interfacing these values with base R's date-time classes.
# openxlsx    for reading from and writing to Microsoft Excel .xlsx files. It provides more functionality than just handling data; it also allows you to modify various aspects of the Excel file itself.
# writexl     for a fast and portable way to write data frames to Excel .xlsx files, focusing solely on writing data without any dependencies.
# rmarkdown   for dynamic report generation from R. It integrates the core syntax of markdown (a simple markup language) with R code, allowing for the automatic update of reports to reflect data changes and analysis updates.
# knitr       for using functions such as kable() to format dataframes as markdown tables. 
# kableExtra  for enhanced functionality for styling tables such as displaying columns evenly
# ggnewscale  for creating new color palettes and more
# gridExtra

# Check which libraries are not installed
installed_libs <- rownames(installed.packages())
libraries_to_install <- required_libraries[!required_libraries %in% installed_libs]

# Install missing libraries
if(length(libraries_to_install) > 0) {
  install.packages(libraries_to_install)
}

# Load all required libraries
lapply(required_libraries, library, character.only = TRUE)
```


# TABLE OF CONTENTS
Utilize the Table of Contents (TOC) menu on the left to scroll through the different sections and help navigate this report. You can click on the title to go directly to the section.

If you're viewing the PDF version, the Table of Contents stays at the top of this report. To go directly to a section, copy and paste the title into the search box. Use the following shortcut keys to help navigate with ease:

Shortcut Keys:

- Hold `Ctrl` and press the `Home` key to go to the top of the page.
- Hold `Ctrl` and press the `End` key to go to the bottom of the page.
- Hold `Ctrl` and press the `F` key to search for specific text or strings of text.
- Press the `Esc` key to quickly exit the search box.

## About This Report 

This report is published on RPubs[^1]. KDF stands for Ken Does Fitness. Refer to References at the bottom of this page for cited sources. The RMD file to produce this report, and some of the figures from this report, may be found in the Google Drive folder [^2]. 

This report systematically records and analyzes workout statistics to enhance individual and collective performance by achieving the following goals:

- Track and Improve Performance - Regularly measuring workout statistics helps individuals monitor progress, set realistic goals, and tailor training plans for optimal results.
- Informed Decision Making - Detailed data analysis allows for personalized adjustments in training loads, reducing the risk of injury and enhancing overall performance.
- Foster Healthy Competition - Comparing personal performance over time and with peers creates a motivating and competitive environment that encourages continuous improvement.
- Data-Driven Insights - Identifying trends and patterns in performance metrics helps athletes make informed adjustments to their training for sustained improvement and injury prevention.

```{r reports_pnc, eval=TRUE, results='asis'}
# Load necessary libraries
library(knitr)
library(kableExtra)

# Define the pros and cons as lists
pros <- c("Enhanced performance tracking: Helps monitor progress and identify strengths and weaknesses.",
          "Motivation and accountability: Sets clear benchmarks and goals, boosting motivation.",
          "Data-driven decision making: Informs better training choices based on data analysis.",
          "Facilitates comparison and competition: Encourages healthy competition and self-improvement.")


cons <- c("Resource intensive: Requires time and effort for accurate data entry and maintenance.",
          "Potential for misinterpretation: Incorrect data analysis can lead to wrong conclusions.",
          "Privacy concerns: Risks of handling personal performance data insecurely.",
          "Emphasis on quantitative metrics: May overlook qualitative aspects like mental well-being and enjoyment.")

# Combine the pros and cons into a data frame
reports_pnc <- data.frame(
  Pros = pros,
  Cons = cons,
  stringsAsFactors = FALSE
)

# Print the table using kable and kableExtra
kable(reports_pnc, format = "html", escape = FALSE, col.names = c("Pros", "Cons")) %>%
  kable_styling(bootstrap_options = c("striped", "hover", "condensed", "responsive"), full_width = TRUE) %>%
  column_spec(1, width = "50%") %>%
  column_spec(2, width = "50%")

```

Formatting - This report is initially printed as one continuous page in HTML format and may appear differently when viewed in PDF format.

Terminology - The terms activity, workout, and exercise may be used interchangeably in this context, even though they have distinct meanings.

Data Assumptions - The data used for analyses assumes each set of exercise is performed to Rep Max (RM), which isn't always the case, particularly if the individual isn't aiming to perform the set to RM. Assuming each set of exercise is performed to Rep Max helps standardize data and provide clear insights into maximal strength, which is beneficial for benchmarking and tracking progress. However, it can also lead to inaccuracies, potential overtraining, and misleading data if not all exercises are performed to failure. Balancing the use of 1RM assumptions with realistic training practices is essential for accurate performance assessment and effective training programming.

<!-- Links will appear back to back, twice in a row when HTML format is Saved As PDF. -->

<!-- Explain the origin and development process of this report, for prospective employers -->

# STRENGTH STANDARDS

Strength Standards provide valuable benchmarks for assessing an individual's performance in key strength exercises that are widely practiced in fitness and strength training. These standards categorize performance levels based on either amount of weight executed relative to bodyweight, or repetitions executed, and are segmented by gender and experience levels. By comparing an individual's performance against these established standards, trainers and athletes can identify strengths and weaknesses in their training regimens, set realistic goals, and measure progress in a structured manner.

## Strength Standards Overview 

Several strength standards are readily available online:

- ExRx Strength Standards: <https://exrx.net/Testing/WeightLifting/StrengthStandards>
- StrengthLevel.com Strength Standards: <https://strengthlevel.com/strength-standards>
- Symmetric Strength: <https://symmetricstrength.com/>
- Mark Rippetoe's Strength Standards: <https://startingstrength.com/files/standards.pdf>

Finding accurate strength standards depends on the demographic, exercise type, and required accuracy. Each has its pros and cons: some cater to specific demographics, others focus on certain exercises, and their credibility affects reliability. Usability and comprehensive data for benchmarking are crucial, as are regular updates to reflect current trends. Considering these factors helps trainers and athletes choose the best standards for assessing and improving performance.

```{r strength_standards_pnc, eval=TRUE, results='asis'}
# Load necessary libraries
library(knitr)
library(kableExtra)

# Define the pros and cons as lists
pros <- c("Benchmarking: Strength standards provide clear benchmarks to measure progress and set goals.",
          "Motivation: Seeing how your performance stacks up against established standards can be highly motivating.",
          "Guidance: They offer guidance on what is considered strong or weak for different exercises, helping you tailor your training accordingly.",
          "Comparison: They allow for meaningful comparisons between individuals, helping to identify strengths and weaknesses.",
          "Structured Training: Strength standards can help in designing more structured and effective training programs.")

cons <- c("Generic: Strength standards can be too generic and may not account for individual differences in genetics, body composition, and training history.",
          "Pressure: They can create undue pressure to meet certain benchmarks, potentially leading to injury or burnout.",
          "Outdated Information: Some standards may be based on outdated information or not reflect recent advancements in sports science.",
          "Not Comprehensive: They might not cover all exercises or account for different training goals, such as hypertrophy versus strength.",
          "Misleading: For beginners, strength standards can be misleading if they don’t account for the starting point or rate of progress appropriate for different fitness levels.")

# Combine the pros and cons into a data frame
strength_standards_pnc <- data.frame(
  Pros = pros,
  Cons = cons,
  stringsAsFactors = FALSE
)

# Print the table using kable and kableExtra
kable(strength_standards_pnc, format = "html", escape = FALSE, col.names = c("Pros", "Cons")) %>%
  kable_styling(bootstrap_options = c("striped", "hover", "condensed", "responsive"), full_width = TRUE) %>%
  column_spec(1, width = "50%") %>%
  column_spec(2, width = "50%")

```
I've compiled some of the standards into an Excel file[^3]. This allows for comparison between personal performance measures and the specified strength standards. I recommend consulting each source for confirmation on the latest version, as the structure or numbers may have changed over time. 

```{r read_strength_standards, eval=TRUE}

# File Path Definition: The script defines a path to an Excel file containing "Strength Standards".
file_path_strength_standards <- "C://Users//kendr//OneDrive//Documents//Data Analytics//KenDoesFitness/Strength Standards Final.xlsm"

# Workbook Loading: Loads the Excel workbook into R using the openxlsx package, which provides more advanced control over Excel files than readxl.
strength_standards <- loadWorkbook(file_path_strength_standards)
```

## Standards Categories  
The standards compiled are separated into categories, as shown in the following table, making it easy to determine which exercise to use as a benchmark.

```{r strength_standards_sheet_names, eval=TRUE, results='markup'}

# Sheet Names Display: Retrieves and displays the names of sheets within the Excel workbook to help identify available data.

# Check to confirm which number of names to display
# names(strength_standards)

# Display specific elements from the strength_standards object with Indexing
names_subset_strength_standards <- names(strength_standards)[4:15]
names_subset_strength_standards_gsub = gsub("_"," ", names_subset_strength_standards)
# names_subset_strength_standards

# Calculate the number of names
num_names <- length(names_subset_strength_standards_gsub)

# Calculate the number of names per column
names_per_column <- ceiling(num_names / 2)

# Split the names into two columns
column_1 <- names_subset_strength_standards_gsub[1:min(names_per_column, num_names)]
column_2 <- names_subset_strength_standards_gsub[(min(names_per_column, num_names) + 1):num_names]

# Pad the shorter column with NA values if necessary
if(length(column_1) < length(column_2)) {
  column_1 <- c(column_1, rep("", length(column_2) - length(column_1)))
} else if(length(column_2) < length(column_1)) {
  column_2 <- c(column_2, rep("", length(column_1) - length(column_2)))
}
  
# Create a data frame with two columns
names_subset_strength_standards_gsub_df <- data.frame(Column_1 = column_1,
                              Column_2 = column_2)

# Calculate the width dynamically
col_width <- paste0(100 / ncol(names_subset_strength_standards_gsub_df), "%")

# Print the data frame using knitr::kable
knitr::kable(names_subset_strength_standards_gsub_df, format = "markdown", col.names = c("Categories", "Categories")) %>%
  column_spec(1:2, width = col_width, extra_css = "text-align: left;")

```


## Strength Standards List 
Below is the list of Strength Standards compiled into the SS Excel file[^3]. This file references the source of each standard table. We'll delve into Mark Rippetoe's Starting Strength Standards, as his and his colleagues' methods and research are based on comprehensive and technical information.

```{r named_ranges, eval=TRUE, results='markup'}

library(openxlsx)
library(readxl)
library(knitr)
library(kableExtra)

# Read Named Ranges: Reads a specific sheet named "Named_Ranges" that presumably contains metadata about other named ranges in the workbook. It keeps only the columns related to named ranges and their respective sheets.
named_ranges <- read_excel(file_path_strength_standards, sheet = "Named_Ranges")
sheet_names_filtered <- unique(named_ranges$`Category Sheet`)
named_ranges <- named_ranges[named_ranges$`Category Sheet` %in% sheet_names_filtered, c("Strength Standard", "Category Sheet")]

# Display the modified 'Strength Standard' column with spaces instead of underscores
strength_standards_list_display <- named_ranges %>%
  mutate(`Strength Standard` = gsub("_", " ", `Strength Standard`)) %>%
  mutate(`Category Name` = gsub("_", " ", `Category Sheet`)) %>%
  select(-`Category Sheet`)  # This excludes the 'Category Sheet' column

# View the data frame if needed (not during knitting). View function opens the raw_data dataframe in the RStudio data viewer for interactive exploration.
View(named_ranges)
View(strength_standards_list_display)

# Define the ratios and total width for column widths
total_width <- 100
ratio_strength_standards <- 2
ratio_category <- 1.25
total_ratio <- ratio_strength_standards + ratio_category

# Calculate the column widths
width_strength_standards <- (ratio_strength_standards / total_ratio) * total_width
width_category <- (ratio_category / total_ratio) * total_width

# Print the dataframe using kable for a nicely formatted markdown table.
kable(strength_standards_list_display, format = "html", col.names = c("Strength Standard", "Category")) %>%
  kable_styling(full_width = TRUE) %>%
  column_spec(1, width = paste0(width_strength_standards, "%")) %>%
  column_spec(2, width = paste0(width_category, "%"))

# Create lists to store data frames
named_ranges_list <- list()
named_ranges_sheets_list <- list()

# Loop through the named ranges
for (i in 1:nrow(named_ranges)) {
  named_range_value <- named_ranges$`Strength Standard`[i]
  named_range_sheet_value <- named_ranges$`Category Sheet`[i]
  named_range_lower <- tolower(gsub(" ", "_", named_range_value))
  named_range_sheet_lower <- tolower(gsub(" ", "_", named_range_sheet_value))
  
  if (named_range_sheet_value == "Named_Ranges") {
    next  # Skip iteration if the sheet is "Named_Ranges" to avoid redundancy
  }
  
  # Read data from the named range
  named_range_data <- tryCatch({
    read.xlsx(file_path_strength_standards, namedRegion = named_range_value)
  }, error = function(e) {
    message(paste("Warning: Workbook has no such named region:", named_range_value))
    NULL
  })
  
  if (!is.null(named_range_data) && is.data.frame(named_range_data)) {
    # Update column names
    colnames(named_range_data) <- gsub("\\.", " ", colnames(named_range_data))
    # Store the data frame in the list
    named_ranges_list[[named_range_lower]] <- named_range_data
    # Assign the data frame to the global environment with the lower case name
    assign(named_range_lower, named_range_data, envir = .GlobalEnv)
  }
  
  # Read data from the sheet
  named_range_sheet_data <- tryCatch({
    read.xlsx(file_path_strength_standards, sheet = named_range_sheet_value)
  }, error = function(e) {
    message(paste("Warning: Workbook has no such sheet:", named_range_sheet_value))
    NULL
  })
  
  if (!is.null(named_range_sheet_data) && is.data.frame(named_range_sheet_data)) {
    # Update column names
    colnames(named_range_sheet_data) <- gsub("\\.", " ", colnames(named_range_sheet_data))
    # Store the data frame in the list
    named_ranges_sheets_list[[named_range_sheet_lower]] <- named_range_sheet_data
    # Assign the data frame to the global environment with the lower case name
    assign(named_range_sheet_lower, named_range_sheet_data, envir = .GlobalEnv)
  }
}

# Display the structure of the lists
# str(named_ranges_list)
# str(named_ranges_sheets_list)

# Optional: View the contents of the lists in RStudio Viewer
# Instead of viewing the whole list, view individual data frames
# Uncomment the below lines to view specific named ranges or sheets
# View(named_ranges_list[["male_barbell_curl_std_1rm_by_age_strengthlevel"]])
# View(male_barbell_curl_std_1rm_by_age_strengthlevel)

```

# MARK RIPPETOE'S STANDARDS

Mark Rippetoe's Strength Standard: 1 Rep Max By Bodyweight.  

Here we utilize Mark Rippetoe's Strength Standards[^4] to benchmark individual performance against established criteria for primary strength exercises. These standards are instrumental in evaluating an individual's lifting capabilities in relation to their bodyweight and training experience. By doing so, we can effectively gauge progress, pinpoint areas that need improvement, and tailor training programs to better meet the specific strength goals of each individual.

The exercises use the technique described in "Starting Strength: Basic Barbell Training, 3rd ed." The Bluebook may be purchased through The Aasgaard Company online store[^5] and on Amazon in various mediums (paperback, Kindle, audio). All lifts are performed with the barbell, as indicated by the book title.

This project has been a continuous process of editing, updating, and reviewing as I progress. After considering both StrengthLevel.com and Mark Rippetoe's Standards via an article by Legion Athletics[^6], I decided to utilize Mark Rippetoe's Standards to create visual plot benchmarks. (My primary goal is to create spider charts for creative and visually motivating targets.) However, the Legion Athletics article lacked information on how these measures are obtained and what techniques or instruments are used in executing these standards.

I emailed rip@startingstrength.com for clarification regarding the Male Overhead Shoulder Press Strength Standards and was advised to "Ignore this table. It is invalid, and we haven’t used it in 20 years. As you would know if you had read the blue book." Upon further research, Mark Rippetoe confirmed the changes in categories on Starting Strength Forum[^7] from the previous outdated categories (Beginner, Novice, Intermediate, Advanced, and Elite) to the current Cat. I through Cat. V.

To properly utilize these standards, it is recommended to review "Starting Strength: Basic Barbell Training"[^8] and "Practical Programming for Strength Training."[^9] The most recent and relevant 3rd editions (2012) are available on their website, as well as in different mediums on Amazon. You can download online book samples to determine if Starting Strength suits your needs.

```{r mark_rippetoe_standards_list, eval=TRUE, results='markup'}

# Code extracts and refines a subset of data related to "Mark Rippetoe" standards from a broader dataset, making it more accessible and easier to work with by cleaning up the category names for further analysis or reporting

# subset() filters rows from the named_ranges dataframe.
# grepl() checks each entry in the "Strength Standard" column for the substring 
mark_rippetoe_standards_list <- subset(named_ranges, grepl("Mark_Rippetoe", `Strength Standard`))

# Pipe Operator (%>%) is used to pass the result of one function to the next. It helps in writing cleaner and more readable code, especially when performing multiple operations on a dataset.
# gsub("_", " ", Category Sheet) replaces underscores (_) with spaces in the Category Sheet column.
# New column is created. 
mark_rippetoe_standards_list <- mark_rippetoe_standards_list %>%
  mutate(`Category Name` = gsub("_", " ", `Category Sheet`))

# Display the modified 'Strength Standard' column with spaces instead of underscores
display_mark_rippetoe_standards_list <- mark_rippetoe_standards_list %>%
  mutate(`Strength Standard` = gsub("_", " ", `Strength Standard`)) %>%
  select(-`Category Sheet`)  # This excludes the 'Category Sheet' column

# The View function is used to open the raw_data dataframe in the RStudio data viewer for interactive exploration.
View(mark_rippetoe_standards_list) #Run this line of code to view table. 

# Print for reference. 
# print(mark_rippetoe_standards_list)
# print(display_mark_rippetoe_standards_list)

# Define the ratios and total width for column widths
total_width <- 100
ratio_strength_standards <- 2
ratio_category <- 1.25
total_ratio <- ratio_strength_standards + ratio_category

# Calculate the column widths
width_strength_standards <- (ratio_strength_standards / total_ratio) * total_width
width_category <- (ratio_category / total_ratio) * total_width

# Print the dataframe using kable for a nicely formatted markdown table.
kable(display_mark_rippetoe_standards_list, format = "html", col.names = c("Strength Standard", "Category")) %>%
  kable_styling(full_width = TRUE) %>%
  column_spec(1, width = paste0(width_strength_standards, "%")) %>%
  column_spec(2, width = paste0(width_category, "%"))


```

There are five Starting Strength Standards, with the Power Clean excluded in this report. Each of the four remaining Standards is presented below, followed by a plot to visualize performances for comparison and motivation.

```{r mark_rippetoe_standards_tables_and_plots, fig.width=10, fig.height=7, eval=TRUE, results='asis'}

# Keep fig.width=10 for PDF display (via browser > Save As PDF). While fig.width=9 (instead of 10) fits width of output, saving as PDF with fig.width=9 will result in small figure; width=10 will fill the PDF page.

# Load the required packages if not already installed
library(ggplot2)
library(dplyr)
library(knitr)
library(kableExtra) # Ensure kableExtra is loaded for styling

# Function to generate ggplot for each data frame
generate_ggplot <- function(data, category_name, mark_rippetoe_strength_standard) {
  data_number <- data %>%
    mutate(Bodyweight = as.numeric(gsub("[^0-9]", "", Bodyweight)))

  # Rename columns to remove double periods
  colnames(data_number) <- gsub("\\.\\.", ". ", colnames(data_number))

  # Print column names for debugging
  # print(colnames(data_number))
    
  ggplot(data_number, aes(x = Bodyweight)) +
    geom_ribbon(aes(ymin = `Cat. I`, ymax = `Cat. II`), fill = "blue", alpha = 0.2) +
    geom_ribbon(aes(ymin = `Cat. II`, ymax = `Cat. III`), fill = "green", alpha = 0.2) +
    geom_ribbon(aes(ymin = `Cat. III`, ymax = `Cat. IV`), fill = "orange", alpha = 0.2) +
    geom_ribbon(aes(ymin = `Cat. IV`, ymax = `Cat. V`), fill = "red", alpha = 0.2) +
    geom_ribbon(aes(ymin = `Cat. V`, ymax = Inf), fill = "purple", alpha = 0.2) +
    
    # Fill the remaining non-colored part
    geom_ribbon(aes(ymin = -Inf, ymax = `Cat. I`), fill = "gray", alpha = 0.2) +
    
    # Create lines
    geom_line(aes(y = `Cat. I`), color = "blue", size = 1) +
    geom_line(aes(y = `Cat. II`), color = "green", size = 1) +
    geom_line(aes(y = `Cat. III`), color = "orange", size = 1) +
    geom_line(aes(y = `Cat. IV`), color = "red", size = 1) +
    geom_line(aes(y = `Cat. V`), color = "purple", size = 1) +
    
    # Set labels
    labs(
      x = "Bodyweight (lb)",
      y = "1RM Weight Load Execution",
      title = paste("Male", category_name, "Standards"),
      subtitle = "Comparison of Strength Standards Across Experience Levels",
      caption = paste("Source:", mark_rippetoe_strength_standard) # Change Source info PRN
    ) +
    
    # Add more markers on the y-axis at increments of 20
    scale_y_continuous(breaks = seq(0, 999, by = 20)) +
    # Add more markers on the x-axis at increments of 10
    scale_x_continuous(breaks = seq(0, 999, by = 10)) +
    
    # Add text labels for skill levels with matching colors
    geom_text(aes(x = max(Bodyweight), y = min(`Cat. I`), label = ""), color = "dimgray", vjust = -1) +
    geom_text(aes(x = max(Bodyweight), y = max(`Cat. I`), label = "Cat. I"), color = "darkblue", vjust = -1) +
    geom_text(aes(x = max(Bodyweight), y = max(`Cat. II`), label = "Cat. II"), color = "darkgreen", vjust = -1) +
    geom_text(aes(x = max(Bodyweight), y = max(`Cat. III`), label = "Cat. III"), color = "darkorange", vjust = -1) +
    geom_text(aes(x = max(Bodyweight), y = max(`Cat. IV`), label = "Cat. IV"), color = "darkred", vjust = -1) +
    geom_text(aes(x = max(Bodyweight), y = max(`Cat. V`), label = "Cat. V"), color = "purple", vjust = -1) +
    
    # Customize theme
    theme_minimal() +
    theme(
      plot.title = element_text(hjust = 0.5, size = 16),
      plot.subtitle = element_text(hjust = 0.5, size = 12),
      plot.caption = element_text(hjust = 0.5),
      axis.title.x = element_text(size = 14),
      axis.title.y = element_text(size = 14),
      axis.text = element_text(size = 12),
      legend.position = "right",
      legend.title = element_blank(),
      legend.text = element_text(size = 12),
      legend.key.size = unit(1, "cm")
    )
}

# Lists to store the plots and tables
plots_list_strength_standards_mark_rippetoe <- list()
tables_list_strength_standards_mark_rippetoe <- list()

# Check and create the 'plots' directory if it doesn't exist
if (!dir.exists("plots")) {
  dir.create("plots")
}

# Check and create the 'tables' directory if it doesn't exist
if (!dir.exists("tables")) {
  dir.create("tables")
}

# Loop through each named range from the filtered mark_rippetoe_standards_list dataframe
for (i in 1:nrow(mark_rippetoe_standards_list)) {
  strength_standard_named_range <- mark_rippetoe_standards_list$`Strength Standard`[i]
  category_name <- mark_rippetoe_standards_list$`Category Name`[i]
  
  # Load the data from the named range
  data_frame <- readWorkbook(strength_standards, namedRegion = strength_standard_named_range)
  
  # Check if the 'Bodyweight' column exists in the data frame
  if (!"Bodyweight" %in% colnames(data_frame)) {
    cat(paste("Skipping", strength_standard_named_range, "- 'Bodyweight' column missing.\n"))
    next
  }

  # Ensure column names are trimmed of whitespace
  colnames(data_frame) <- trimws(colnames(data_frame))

  # Rename columns to remove double periods for both table and plot
  colnames(data_frame) <- gsub("\\.\\.", ". ", colnames(data_frame))

  # Format the title of the plot and head of the table using the cleaned category name
  formatted_header <- gsub("_", " ", strength_standard_named_range)

  # Generate the plot
  plot <- generate_ggplot(data_frame, category_name, strength_standard_named_range)
  
  # Print the name of the named range as a markdown header
  cat(paste0("\n\n<h4> ", formatted_header, "</h4>\n"))

  # Rename 'Bodyweight' column to 'Bodyweight (lb)'
  names(data_frame)[names(data_frame) == "Bodyweight"] <- "Bodyweight (lb)"

  # Print the plot
  print(plot)

  # Save the plot to the list
  plots_list_strength_standards_mark_rippetoe[[formatted_header]] <- plot

  # Output the data frame as a markdown table with even column widths
  print(
    table_html <- kable(data_frame, format = "html", align = 'l') %>%
      kable_styling(full_width = TRUE) %>%
      column_spec(1:ncol(data_frame), width = paste0(100 / ncol(data_frame), "%"))
  )
  
  # Save the table to the list
  tables_list_strength_standards_mark_rippetoe[[formatted_header]] <- table_html

  # Optionally save the plot and table
  ggsave(filename = paste0("plots/", tolower(strength_standard_named_range), ".png"), plot = plot, width = 12, height = 8)

  # Save the data frame as a CSV file
  write.csv(data_frame, file = paste0("tables/", tolower(strength_standard_named_range), ".csv"), row.names = FALSE)
}

# Example of how to recall and print a specific plot and table from the lists
# print(plots_list_strength_standards_mark_rippetoe[["Male Bench Press Std 1RM By Bodyweight Mark Rippetoe"]])
# print(tables_list_strength_standards_mark_rippetoe)
# print(tables_list_strength_standards_mark_rippetoe["Male Bench Press Std 1RM By Bodyweight Mark Rippetoe"])
# cat(as.character(tables_list_strength_standards_mark_rippetoe[["Male Bench Press Std 1RM By Bodyweight Mark Rippetoe"]]))

```

# 1RM CALCULATOR 
This 1RM Conversion Table from Strength Level[^10] displays the Repetition Percentages of 1RM, helping to calculate the approximate one-rep max (1RM). The one-rep max is the maximum weight you can lift for a single repetition for a given exercise.

```{r calculator_1rm, eval=TRUE, results='asis'}

library(openxlsx)
library(readxl)
library(knitr)
library(kableExtra)

file_path_1rm_calculator <- "C://Users//kendr//OneDrive//Documents//Data Analytics//KenDoesFitness//1RM Calculator.xlsm"

sheet_names_1rm_calculator <- excel_sheets(file_path_1rm_calculator) # Update selected sheets as needed.

# Use knitr::kable to format as a markdown table
# knitr::kable(sheet_names_1rm_calculator, format = "markdown", col.names = "Sheets") 

# Read Named Ranges: Reads a specific sheet named "Named_Ranges" that presumably contains metadata about other named ranges in the workbook. It keeps only the columns related to named ranges and their respective sheets.
named_ranges_1rm_calculator <- read_excel(file_path_1rm_calculator, sheet = "Named_Ranges")
sheet_names_filtered <- unique(named_ranges_1rm_calculator$`Named Range Sheet`)
named_ranges_1rm_calculator <- named_ranges_1rm_calculator[named_ranges_1rm_calculator$`Named Range Sheet` %in% sheet_names_filtered, c("Named Range", "Named Range Sheet")]

# View the data frame if needed (not during knitting). View function opens the raw_data dataframe in the RStudio data viewer for interactive exploration.
# View(named_ranges_1rm_calculator)

num_columns <- ncol(named_ranges_1rm_calculator)
column_width <- 100 / num_columns

# Display the dataframe using knitr::kable 
# knitr::kable(named_ranges_1rm_calculator, format = "markdown", col.names = c("Named Range", "Named Range Sheet")) %>%
#   kable_styling(full_width = TRUE) %>%
#   column_spec(1:num_columns, width = paste0(column_width, "%"))

# Create lists to store data frames
named_ranges_1rm_calculator_list <- list()
named_ranges_1rm_calculator_sheets_list <- list()

# Loop through the named ranges
for (i in 1:nrow(named_ranges_1rm_calculator)) {
  named_range_1rm_calculator_value <- named_ranges_1rm_calculator$`Named Range`[i]
  named_range_1rm_calculator_sheet_value <- named_ranges_1rm_calculator$`Named Range Sheet`[i]
  
  # Skip if any values are missing
  if (is.na(named_range_1rm_calculator_value) || is.na(named_range_1rm_calculator_sheet_value)) {
    next
  }
  
  named_range_1rm_calculator_lower <- tolower(gsub(" ", "_", named_range_1rm_calculator_value))
  named_range_1rm_calculator_sheet_lower <- tolower(gsub(" ", "_", named_range_1rm_calculator_sheet_value))
  
  if (named_range_1rm_calculator_sheet_value == "Named_Ranges") {
    next  # Skip iteration if the sheet is "Named_Ranges" to avoid redundancy
  }
  
  # Read data from the named range
  named_range_1rm_calculator_data <- tryCatch({
    read.xlsx(file_path_1rm_calculator, namedRegion = named_range_1rm_calculator_value)
  }, error = function(e) {
    message(paste("Warning: Workbook has no such named region:", named_range_1rm_calculator_value))
    NULL
  })
  
  if (!is.null(named_range_1rm_calculator_data) && is.data.frame(named_range_1rm_calculator_data)) {
    # Update column names
    colnames(named_range_1rm_calculator_data) <- gsub("\\.", " ", colnames(named_range_1rm_calculator_data))
    # Store the data frame in the list
    named_ranges_1rm_calculator_list[[named_range_1rm_calculator_lower]] <- named_range_1rm_calculator_data
    # Assign the data frame to the global environment with the lower case name
    assign(named_range_1rm_calculator_lower, named_range_1rm_calculator_data, envir = .GlobalEnv)
  }
  
  # Read data from the sheet
  named_range_1rm_calculator_sheet_data <- tryCatch({
    read.xlsx(file_path_1rm_calculator, sheet = named_range_1rm_calculator_sheet_value)
  }, error = function(e) {
    message(paste("Warning: Workbook has no such sheet:", named_range_1rm_calculator_sheet_value))
    NULL
  })
  
  if (!is.null(named_range_1rm_calculator_sheet_data) && is.data.frame(named_range_1rm_calculator_sheet_data)) {
    # Update column names
    colnames(named_range_1rm_calculator_sheet_data) <- gsub("\\.", " ", colnames(named_range_1rm_calculator_sheet_data))
    # Store the data frame in the list
    named_ranges_1rm_calculator_sheets_list[[named_range_1rm_calculator_sheet_lower]] <- named_range_1rm_calculator_sheet_data
    # Assign the data frame to the global environment with the lower case name
    assign(named_range_1rm_calculator_sheet_lower, named_range_1rm_calculator_sheet_data, envir = .GlobalEnv)
  }
}

# Display the structure of the lists
# str(named_ranges_1rm_calculator_list)
# str(named_ranges_1rm_calculator_sheets_list)

# Optional: View the contents of the lists in RStudio Viewer
# Instead of viewing the whole list, view individual data frames
# Uncomment the below lines to view specific named ranges or sheets
# View(named_ranges_1rm_calculator_list[["demographic"]])

# knitr::kable(named_ranges_1rm_calculator, format = "markdown") 
# Rename the column
colnames(calculator_1rm_by_percentage_strength_level)[2] <- "Percentage of 1RM"

# Replace periods with spaces in the column names
colnames(calculator_1rm_by_percentage_strength_level) <- gsub("\\.", " ", colnames(calculator_1rm_by_percentage_strength_level))

# Format the Percentage of 1RM column as percentages without decimal places
calculator_1rm_by_percentage_strength_level$`Percentage of 1RM` <- paste0(
  round(calculator_1rm_by_percentage_strength_level$`Percentage of 1RM` * 100), "%"
)

# Set the last two columns to 1 decimal place if they exist
if (ncol(calculator_1rm_by_percentage_strength_level) > 2) {
  calculator_1rm_by_percentage_strength_level[, (ncol(calculator_1rm_by_percentage_strength_level)-1):ncol(calculator_1rm_by_percentage_strength_level)] <- 
    round(calculator_1rm_by_percentage_strength_level[, (ncol(calculator_1rm_by_percentage_strength_level)-1):ncol(calculator_1rm_by_percentage_strength_level)], 1)
}

# Use knitr::kable to format as a markdown table
knitr::kable(calculator_1rm_by_percentage_strength_level, format = "html", align = rep('l', ncol(calculator_1rm_by_percentage_strength_level))) %>%  # Use 'html' format for kable_styling
  kable_styling(full_width = TRUE) %>%  # Ensure full-width table with equal column width
  column_spec(1:ncol(calculator_1rm_by_percentage_strength_level), width = paste0(100 / ncol(calculator_1rm_by_percentage_strength_level), "%"))

cat("\n\n### How to Calculate Your 1RM\n")
cat("To calculate the One Rep Max (1RM) based on the weight lifted and the number of repetitions performed, use the following formula:\n")
cat("Y lbs at X Reps / % at X Reps = Y lbs 1RM\n")
cat("\nFor example, if you did 5 reps at 200 lbs, and 5 reps corresponds to 89%, then \n")
cat("200 lbs / 0.89 = ~225 lbs 1RM\n")

cat("\n\n### How to Calculate Your X-RM\n")
cat("To calculate the X Repetition Max (X-RM) from a given number of repetitions and weight lifted, first calculate the 1RM, then use the 1RM to get the X-RM. We do this through the following formula:\n")
cat("Y lbs at X Reps / % at X Reps = Y lbs 1RM\n")
cat("\nThen multiply by the percentage of your target repetitions to get X-RM:\n")
cat("1RM * % at Target Reps = Y lbs X-RM\n")
cat("\nFor example, if you did 5 reps at 45 lbs, and 5 reps corresponds to 89%, and you want to find the 3RM:\n")
cat("First, calculate the 1RM:\n")
cat("45 lbs / 0.89 = ~50.56 lbs 1RM\n")
cat("Then calculate the 3RM using the 3 rep percentage (94%):\n")
cat("50.56 lbs * 0.94 = ~47.53 lbs 3RM\n")

```

# KDF WORKBOOK
## A Brief Overview 

This project derives data from the primary workbook, named KDF (Ken Does Fitness)[^11]. For years, I recorded my personal performance to recall my previous workouts for the next session, as I often had trouble remembering the weights used. As I started learning analytics and coding methods, I wanted to apply those techniques to something personal and applicable, which led to this report. It has indeed been a learning process. Not to mention, the data cleaning and organization have also been a time-consuming process. As of `r Sys.Date()`, the contents of this workbook include a combination of performance data, formulas, code, and reference resources. Until a separate file with a more user-friendly experience is available without the necessary back-end coding information, this file will have to suffice.

```{r read_ken_does_fitness, eval=TRUE}

# Read Excel file and Define file path.
# Replace \ with // (Windows compatibility). 
file_path_kdf <- "C://Users//kendr//OneDrive//Documents//Data Analytics//KenDoesFitness//Ken Does Fitness Final.xlsx"

# Raw String Literal: Supposedly you can also use a raw string literal by prefixing the string with r or R. This tells R to interpret the string as-is, without processing escape sequences. However, doing the following method below may result in "malformed raw string literal (<input>:1:16)" as it has for me. Therefore consider to use the method above instead.
```

```{r named_ranges_kdf, eval=TRUE, results='hide'}

library(openxlsx)
library(readxl)
library(knitr)
library(kableExtra)

# Set the file path
# file_path_kdf <- "your_file_path_here.xlsx"

# Read Named Ranges: Reads a specific sheet named "Named_Ranges" that presumably contains metadata about other named ranges in the workbook. It keeps only the columns related to named ranges and their respective sheets.
named_ranges_kdf <- read_excel(file_path_kdf, sheet = "Named_Ranges")
sheet_names_filtered <- unique(named_ranges_kdf$`Named Range Sheet`)
named_ranges_kdf <- named_ranges_kdf[named_ranges_kdf$`Named Range Sheet` %in% sheet_names_filtered, c("Named Range", "Named Range Sheet")]

# View the data frame if needed (not during knitting). View function opens the raw_data dataframe in the RStudio data viewer for interactive exploration.
View(named_ranges_kdf)

# Display the dataframe using knitr::kable 
num_columns <- ncol(named_ranges_kdf)
column_width <- 100 / num_columns

# Display the dataframe using knitr::kable 
knitr::kable(named_ranges_kdf, format = "markdown", col.names = c("Named Range", "Named Range Sheet")) %>%
  kable_styling(full_width = TRUE) %>%
  column_spec(1:num_columns, width = paste0(column_width, "%"))

# Named Ranges Lower Case Assignment - Dynamic Data Loading and Variable Assignment
# - Loop Through Named Ranges: The script iterates over each row of the named_ranges_kdf dataframe. For each row, it reads the actual data from the Excel workbook for both the named range and its corresponding sheet.
# - Conditional Skipping: If the sheet for a named range is itself "Named_Ranges", the loop skips to the next iteration to avoid redundancy or potential errors.
# - Dynamic Variable Assignment: Using the assign() function, it dynamically creates variables in the R environment named after each named range and sheet. This is done in lowercase to standardize variable names.
# - Reading Data: Data is read into these dynamically named variables using the read.xlsx() function, which can access both specific named ranges and entire sheets.

# Create lists to store data frames
named_ranges_kdf_list <- list()
named_ranges_kdf_sheets_list <- list()

# Loop through the named ranges
for (i in 1:nrow(named_ranges_kdf)) {
  named_range_kdf_value <- named_ranges_kdf$`Named Range`[i]
  named_range_kdf_sheet_value <- named_ranges_kdf$`Named Range Sheet`[i]
  named_range_kdf_lower <- tolower(gsub(" ", "_", named_range_kdf_value))
  named_range_kdf_sheet_lower <- tolower(gsub(" ", "_", named_range_kdf_sheet_value))
  
  if (named_range_kdf_sheet_value == "Named_Ranges") {
    next  # Skip iteration if the sheet is "Named_Ranges" to avoid redundancy
  }
  
  # Read data from the named range
  named_range_kdf_data <- tryCatch({
    read.xlsx(file_path_kdf, namedRegion = named_range_kdf_value)
  }, error = function(e) {
    message(paste("Warning: Workbook has no such named region:", named_range_kdf_value))
    NULL
  })
  
  if (!is.null(named_range_kdf_data) && is.data.frame(named_range_kdf_data)) {
    # Update column names
    colnames(named_range_kdf_data) <- gsub("\\.", " ", colnames(named_range_kdf_data))
    # Store the data frame in the list
    named_ranges_kdf_list[[named_range_kdf_lower]] <- named_range_kdf_data
    # Assign the data frame to the global environment with the lower case name
    assign(named_range_kdf_lower, named_range_kdf_data, envir = .GlobalEnv)
  }
  
  # Read data from the sheet
  named_range_kdf_sheet_data <- tryCatch({
    read.xlsx(file_path_kdf, sheet = named_range_kdf_sheet_value)
  }, error = function(e) {
    message(paste("Warning: Workbook has no such sheet:", named_range_kdf_sheet_value))
    NULL
  })
  
  if (!is.null(named_range_kdf_sheet_data) && is.data.frame(named_range_kdf_sheet_data)) {
    # Update column names
    colnames(named_range_kdf_sheet_data) <- gsub("\\.", " ", colnames(named_range_kdf_sheet_data))
    # Store the data frame in the list
    named_ranges_kdf_sheets_list[[named_range_kdf_sheet_lower]] <- named_range_kdf_sheet_data
    # Assign the data frame to the global environment with the lower case name
    assign(named_range_kdf_sheet_lower, named_range_kdf_sheet_data, envir = .GlobalEnv)
  }
}

# Display the structure of the lists
# str(named_ranges_kdf_list)
# str(named_ranges_kdf_sheets_list)

# Optional: View the contents of the lists in RStudio Viewer
# Instead of viewing the whole list, view individual data frames
# Uncomment the below lines to view specific named ranges or sheets
# View(named_ranges_kdf_list[["demographic"]])
# View(demographic)

# knitr::kable(named_ranges_kdf, format = "markdown") 


```

The following table from KDF addresses the primary sheets helpful in improving workout performance and performance analyses. Some of the sheets listed serve as sections in this report.

```{r table_of_contents_nr , eval=TRUE, results='markup'}

colnames(table_of_contents_nr)[2] <- "EXPLANATION" 
table_of_contents_nr <- table_of_contents_nr %>%
  select(-`Coding Notes`)
# print(table_of_contents_nr)
# View(table_of_contents_nr)
# str(table_of_contents_nr)

# Filter TOC for display by filtering to the specific rows.
table_of_contents_nr_display <- table_of_contents_nr[2:7, ]

# Maintain line breaks in character columns
table_of_contents_nr_display <- table_of_contents_nr_display %>%
  mutate_if(is.character, ~gsub("\n", "<br>", .))

# Knit table without row names
knitr::kable(table_of_contents_nr_display, format = "html", escape=FALSE, row.names = FALSE) %>%
  kable_styling(full_width = TRUE)

```

# DEMOGRAPHIC
This section includes demographic information about individuals or groups, such as age, gender, and other relevant characteristics. Lists information for individuals with at least one set of data in the workbook. 

```{r demographic, eval=TRUE, results='asis'}

library(dplyr)
library(knitr)

# print(demographic_nr)
# View(demographic_nr)
# str(demographic_nr)

# Update all character columns and replace any instance of "-" with NA. Values look better with NA than an empty string.
demographic_nr <- demographic_nr %>%
  mutate_if(is.character, ~na_if(., "-"))

# Remove rows where Name is NA
demographic_nr <- demographic_nr %>%
  filter(!is.na(Name))

# Convert Birthdate column from character to numeric and then to Date format
demographic_nr$'Birthdate' <- as.numeric(demographic_nr$'Birthdate')
demographic_nr$'Birthdate' <- as.Date(demographic_nr$'Birthdate', origin = "1970-01-01") 

# Adjust the dates by subtracting X years
demographic_nr$`Birthdate` <- demographic_nr$`Birthdate` - years(70)

# Verify the conversion
# head(demographic_nr$`Birthdate`)

# Select the specific columns
demographic_nr <- demographic_nr %>%
  select(Name, Birthdate, Age, City) %>%
  arrange(Name)

# print(demographic_nr)
# str(demographic_nr)
# View(demographic_nr)

# Create demographic_display for displaying the table
demographic_display <- demographic_nr %>%
  mutate(Name = gsub("_", " ", Name))

# Display the table using knitr::kable
knitr::kable(demographic_display, format = "html", escape = FALSE) %>%
  kable_styling(full_width = TRUE) %>%
  # column_spec(1:num_columns, width = paste0(column_width, "%")) %>% # Equal column width
  print()

# str(demographic_nr)
# str(demographic_display)
```

# VITALS
The "Vitals" section tracks key health metrics such as weight, body measurements, and heart rate to monitor progress, set personalized fitness goals, and identify patterns or potential health risks. By providing a comprehensive view of an individual's health, it enables data-driven decisions for adjusting fitness plans and ensures a balanced approach to achieving long-term fitness goals. Regular updates and visual tools like charts help in understanding trends and motivating continued effort. If performance data is the brain of this report, then vitals are the heart—pun intended.

<!-- ## Vitals Dataframe-->
```{r vitals_df_transformation, eval=TRUE}

# Select the desired sheet
selected_sheet <- "Vitals"
vitals_data <- read_excel(file_path_kdf, sheet = selected_sheet)

# Display first few rows and structure of the dataframe. Run for confirmation as needed.
# head(vitals_data)
# str(vitals_data)

# Convert selected columns to numeric
# Define a function to convert columns to numeric, handling non-numeric values gracefully. In this approach, the convert_to_numeric function checks for non-numeric and infinite values after attempting the conversion using as.numeric. This helps ensure that problematic values are replaced with NA, and the column retains its numeric nature.
convert_to_numeric <- function(x) {
  numeric_values <- as.numeric(x)       # Try converting to numeric
  ifelse(is.na(numeric_values) | is.infinite(numeric_values), NA, numeric_values) # Return NA if conversion fails or results in infinite
}

# Apply the conversion function to specific columns in the vitals data
vitals_data <- vitals_data %>%
  mutate_at(vars(`Systolic_Blood_Pressure_(Top_Number_120)`, `Diastolic_Blood_Pressure_(Bottom_Number_80)`, Pulse, Heart_Rate, BMI, `Body_Fat_%`), convert_to_numeric)

# Convert 'Date' column to date format
vitals_data$Date <- as.Date(vitals_data$Date, format = "%m/%d/%Y")

# Handle missing time values and convert time to hms format
# vitals_data$Time <- hms(vitals_data$Time) #Code Bad; "Error: All arguments must be numeric or NA" Therefore:
vitals_data$Time <- ifelse(vitals_data$Time == "-", NA, vitals_data$Time)
vitals_data$Time <- as.numeric(vitals_data$Time)                # Convert time to numeric assuming it's in a suitable format
vitals_data$Time <- as_hms(vitals_data$Time * 86400)            # Convert numeric time to hms format (seconds in a day)  

# The View function is used to open the raw_data dataframe in the RStudio data viewer for interactive exploration.
View(vitals_data) #Run this line of code to view table. Does not apply with knit.

# The print function is used to display the transformed raw_data dataframe in the R console.
print(vitals_data)

# str function is used to display the structure of the raw_data dataframe.
str(vitals_data)
```


## Vitals Category 
The Vitals columns provide a list of measurement categories available for further analysis.

```{r vitals_df_columns, eval=TRUE, results='markup'}

# Display the modified column names with spaces instead of underscores
vitals_columns <- vitals_data
# Replace underscores with spaces in column names
colnames(vitals_columns) <- gsub("_", " ", colnames(vitals_columns))

# Extract column names vitals_columns
column_names <- colnames(vitals_columns)
# print(column_names)

# Calculate the number of names
num_names <- length(column_names)

# Calculate the number of names per column
names_per_column <- ceiling(num_names / 2)

# Split the names into two columns
column_1 <- column_names[1:min(names_per_column, num_names)]
column_2 <- column_names[(min(names_per_column, num_names) + 1):num_names]

# Pad the shorter column with NA values if necessary
if(length(column_1) < length(column_2)) {
  column_1 <- c(column_1, rep("", length(column_2) - length(column_1)))
} else if(length(column_2) < length(column_1)) {
  column_2 <- c(column_2, rep("", length(column_1) - length(column_2)))
}

# Create a data frame with two columns
column_names_df <- data.frame(Column_1 = column_1,
                              Column_2 = column_2)

# Calculate the width dynamically
col_width <- paste0(100 / ncol(column_names_df), "%")

# Print the data frame using knitr::kable
knitr::kable(column_names_df, format = "markdown", col.names = c("Entries", "Entries")) %>%
  column_spec(1:2, width = col_width, extra_css = "text-align: left;")

```

## Individuals with a Vital
```{r vitals_data_unique_name, eval=TRUE, results='markup'}

# Unique Names Extraction: Extracts and prints unique names from the Name column to identify distinct individuals in the dataset.
vitals_data_unique_name <- gsub("_", " ", vitals_data$Name)
vitals_data_unique_name <- sort(unique(vitals_data_unique_name))
vitals_data_unique_name <- data.frame(Individuals = vitals_data_unique_name)


# Use knitr::kable to format as a markdown table
knitr::kable(vitals_data_unique_name, format = "markdown", "Individuals") 

# Print for reference
# print(vitals_data_unique_name) 
```

# BODYWEIGHT 

An individual's bodyweight is used to match performance measurements with Mark Rippetoe's Strength Standards mentioned earlier in this report. Bodyweight allows for simple techniques and methods to assess performance, such as when hitting plateaus or experiencing changes in bodyweight.  

## All Bodyweight Over Time

The plot displays all bodyweight measurements over time, with all individuals plotted on a single graph. Individuals represented by just a dot and no line have only one measurement recorded, indicating that more data is required; any two points will create a line.

<!--
PLAN: Consider to update code for Average Vitals (Avg Weight, Avg Height, etc) if multiple entries exist on the same date. Unable to figure out as of 05/10/2024.
-->

```{r table_all_bodyweight_over_time, eval=TRUE, results='markup'}

# Ensure the necessary libraries are loaded
library(dplyr)
library(knitr)
library(kableExtra)
library(ggplot2)
library(RColorBrewer)

# Define a function to calculate the date range, number of entries, and bodyweight metrics for each individual
assign_all_bodyweight_over_time <- function(vitals_data) {
  vitals_data_weight_lbs <- vitals_data[!is.na(as.numeric(vitals_data$Weight_lbs)), ]
  vitals_data_weight_lbs_unique_names <- unique(vitals_data_weight_lbs$Name)
  
  # Initialize a data frame to store the results
  table_all_bodyweight_over_time <- data.frame(Name = character(), Age = numeric(), Date_Range = character(), Entries = numeric(), Min_BW = numeric(), Min_BW_Date = character(), Max_BW = numeric(), Max_BW_Date = character(), stringsAsFactors = FALSE)
  
  # Loop through each individual to compile date range and number of entries information
  for (name in vitals_data_weight_lbs_unique_names) {
    # Subset data for the current individual
    individual_data <- subset(vitals_data_weight_lbs, Name == name)
    
    # Calculate date range and number of entries
    date_range <- paste(min(individual_data$Date), max(individual_data$Date), sep = " to ")
    number_of_entries <- nrow(individual_data)
    
    # Calculate min and max bodyweight and their respective dates
    min_bodyweight <- min(individual_data$Weight_lbs, na.rm = TRUE)
    min_bodyweight_date <- individual_data$Date[which.min(individual_data$Weight_lbs)]
    max_bodyweight <- max(individual_data$Weight_lbs, na.rm = TRUE)
    max_bodyweight_date <- individual_data$Date[which.max(individual_data$Weight_lbs)]
    
    # Add the results to the data frame
    table_all_bodyweight_over_time <- rbind(table_all_bodyweight_over_time, data.frame(Name = name, Date_Range = date_range, Entries = number_of_entries, Min_BW = min_bodyweight, Min_BW_Date = min_bodyweight_date, Max_BW = max_bodyweight, Max_BW_Date = max_bodyweight_date, stringsAsFactors = FALSE))
  }

  # Sort the results by Name
  table_all_bodyweight_over_time <- table_all_bodyweight_over_time %>%
    arrange(Name)
  
  # Return the results data frame
  return(table_all_bodyweight_over_time)
}

# Call the function to get the results
table_all_bodyweight_over_time <- assign_all_bodyweight_over_time(vitals_data)

# Perform a left join to combine the data frames based on the Name column
table_all_bodyweight_over_time <- left_join(table_all_bodyweight_over_time, demographic_nr, by = "Name")

# Select and order the columns
table_all_bodyweight_over_time <- table_all_bodyweight_over_time %>%
  select(Name, Date_Range, Entries, Min_BW, Min_BW_Date, Max_BW, Max_BW_Date)

# Rename columns to replace underscores with spaces
colnames(table_all_bodyweight_over_time) <- gsub("_", " ", colnames(table_all_bodyweight_over_time))

# Calculate the number of columns
ncolumns <- ncol(table_all_bodyweight_over_time)
column_width <- 100 / ncolumns

# Substitute underscores with spaces in the Name column of vitals_data
table_all_bodyweight_over_time$Name <- gsub("_", " ", table_all_bodyweight_over_time$Name)

# Print the combined table using knitr::kable
knitr::kable(table_all_bodyweight_over_time, format = "html", align = c("l", "l", "l", "r", "l", "r", "l")) %>%
  kable_styling(full_width = TRUE)

# str(table_all_bodyweight_over_time)

```

```{r plot_all_bodyweight_over_time, fig.width=10, fig.height=7, eval=TRUE, results='markup'}

library(dplyr)
library(ggplot2)
library(RColorBrewer)
library(lubridate)

# Create a color palette using RColorBrewer's "Dark2" (or another suitable palette)
color_palette <- brewer.pal(n = min(8, length(unique(vitals_data$Name))), "Dark2")
if (length(unique(vitals_data$Name)) > 8) {
  color_palette <- rep(color_palette, length.out = length(unique(vitals_data$Name)))
}
names(color_palette) <- unique(vitals_data$Name)

# Custom date label function to add a newline between the month and the year
date_label_func <- function(x) {
  format(x, "%b\n%Y")  # Displays as "Sep\n2023", separating month and year with a newline
}

# Extend the x-axis by X-additional months using lubridate.
extend_months <- 1

# Create a modified Name column for legend display
vitals_data$Name_Legend <- gsub("_", " ", vitals_data$Name)

# FIRST PLOT 
# Bodyweight over time for all individuals with two extra grid lines on the x-axis
all_data_max_date <- max(vitals_data$Date, na.rm = TRUE)
extended_all_data_max_date <- all_data_max_date + months(extend_months)
first_plot <- ggplot(vitals_data, aes(x = Date, y = Weight_lbs, color = Name_Legend)) +
  geom_line() +
  geom_point(data = vitals_data %>% group_by(Name) %>% filter(Date == min(Date) | Date == max(Date)), 
           aes(x = Date, y = Weight_lbs, fill = Name_Legend), shape = 21, color = "black", size = 3) +  # Add marker on first and last data point
  scale_x_date(date_breaks = "1 month", labels = date_label_func,
               limits = c(min(vitals_data$Date, na.rm = TRUE), extended_all_data_max_date)) +
  scale_y_continuous(labels = scales::number_format(accuracy = 0.1)) +  # Adjust y-axis to 1st decimal
  scale_color_manual(values = setNames(color_palette, gsub("_", " ", names(color_palette)))) +
  scale_fill_manual(values = setNames(color_palette, gsub("_", " ", names(color_palette)))) +  # Matching fill colors with color palette
  labs(title = "Bodyweight Over Time for All Individuals",
       x = "Date",
       y = "Weight (lbs)",
       color = "Name",
       fill = "Name") +
  theme_minimal() +
  theme(axis.text.x = element_text(hjust = 0.5),
        panel.grid.minor.x = element_blank(),  # Remove minor grid lines on the x-axis
        panel.grid.major = element_line(color = "grey90"))  # Make grid lines lighter

# Print the first plot
print(first_plot)

# Markdown separator to add spacing between plots
cat("\n\n")

```

## Individual Bodyweight Over Time

These visualizations allow us to:

- Track the bodyweight trends of each individual.
- Identify key metrics such as the minimum and maximum bodyweight, and the number of measurements taken.
- Understand how bodyweight changes over time may correlate with performance in strength exercises and other fitness metrics.
- By comparing these plots with strength standards, demographic information, and other performance data, we can gain valuable insights into each individual's progress and areas needing improvement.

```{r plot_individual_bodyweight_over_time, fig.width=10, fig.height=7, eval=TRUE, results='markup'}

# SECOND PLOTS (LOOP)
# Second: Loop through each individual to create separate plots with two extra grid lines on the x-axis
plots_list <- list()
individual_names <- unique(vitals_data$Name)

# Merge the table_all_bodyweight_over_time data frame with individual_data
table_all_bodyweight_over_time <- assign_all_bodyweight_over_time(vitals_data)

for (name in individual_names) {
  # Subset data for the current individual
  individual_data <- subset(vitals_data, Name == name)
  
  # Extend the x-axis by X additional month(s)
  max_date_individual <- max(individual_data$Date, na.rm = TRUE)
  extended_individual_max_date <- max_date_individual + months(extend_months)
  
  # Get metrics for the current individual from the table_all_bodyweight_over_time
  individual_metrics <- table_all_bodyweight_over_time %>% filter(Name == name)
  min_bodyweight <- individual_metrics$Min_BW
  min_bodyweight_date <- individual_metrics$Min_BW_Date
  max_bodyweight <- individual_metrics$Max_BW
  max_bodyweight_date <- individual_metrics$Max_BW_Date
  date_range <- individual_metrics$Date_Range
  number_of_entries <- individual_metrics$Entries
  
  # Create a plot for the current individual with two extra grid lines on the x-axis
  plot <- ggplot(individual_data, aes(x = Date, y = Weight_lbs)) +
    geom_line(color = color_palette[name]) +
    geom_point(data = individual_data %>% filter(Date == min(Date) | Date == max(Date)), 
           aes(x = Date, y = Weight_lbs, fill = Name_Legend), shape = 21, color = "black", size = 3) +  # Add marker on first and last data point
    scale_x_date(date_breaks = "1 month", labels = date_label_func, limits = c(min(individual_data$Date, na.rm = TRUE), extended_individual_max_date)) +
    scale_y_continuous(labels = scales::number_format(accuracy = 0.1)) +  # Adjust y-axis to 1st decimal
    scale_color_manual(values = setNames(color_palette, gsub("_", " ", names(color_palette)))) +
    scale_fill_manual(values = setNames(color_palette, gsub("_", " ", names(color_palette)))) +  # Matching fill colors with color palette
    labs(title = paste("Bodyweight Over Time for", gsub("_", " ", name)),
         subtitle = paste("    Min Bodyweight:", min_bodyweight, "lbs on", min_bodyweight_date, "\n", 
                          "   Max Bodyweight:", max_bodyweight, "lbs on", max_bodyweight_date, "\n",
                          "   Date Range:", date_range, "\n",
                          "   Count:", number_of_entries, "\n\n", sep = " "),
         x = "Date",
         y = "Weight (lbs)") +
    theme_minimal() +
    theme(axis.text.x = element_text(hjust = 0.5),
          panel.grid.minor.x = element_blank(),  # Remove minor grid lines on the x-axis
          panel.grid.major = element_line(color = "grey90"),  # Make grid lines lighter
          plot.subtitle = element_text(size = 0.85 * 12, face = "italic")) + 
    guides(color = FALSE, fill = FALSE)  # Remove legends for color and fill
  
  # Print each plot separately
  print(plot)
  
  # Markdown separator to add spacing between plots
  cat("\n\n")
  
  # Optionally, add the plot to a list if you want to further manipulate or save them
  plots_list[[name]] <- plot
}

```

## Individual Bodyweight Over Last 2 Weeks

Unable to plot all individuals in one figure as each individual's last 2-week date range differs from one another. The average bodyweight over the date range is presented in the subtitles.

```{r bodyweight_over_time_last_2wk, fig.width=10, fig.height=7, eval=TRUE, results='markup'}

# Custom date label function to add a newline between the month and the year, including the year on the first label
date_label_func <- function(x) {
  labels <- format(x, "%b\n%d") # Displays as "Sep\n15", separating month and year with a newline
  labels[1] <- paste0(labels[1], "\n", format(x[1], "%Y"))  # Append the year to the first label
  return(labels)
}

# Extend the x-axis by X-additional months using lubridate.
extend_months <- 1

# Select rows where Weight_lbs is not NA and convert to numeric just in case.
vitals_data_weight_lbs <- vitals_data[!is.na(as.numeric(vitals_data$Weight_lbs)), ]
# Extract unique names from the data
vitals_data_weight_lbs_unique_names <- unique(vitals_data_weight_lbs$Name)

# Calculate number of entries for each individual
number_of_entries <- table(vitals_data$Name)

# Loop through each individual to create separate plots with two extra grid lines on the x-axis
plots_list <- list()
individual_names <- unique(vitals_data$Name)

for (name in individual_names) {
  # Subset data for the current individual
  individual_data <- subset(vitals_data, Name == name)
  
  min_date <- min(individual_data$Date)
  max_date <- max(individual_data$Date)
  min_date_last_2wk <- max_date - 13
  vitals_last_2wk <- subset (individual_data, Date >= min_date_last_2wk & Date <= max_date)
    
  # Calculate average bodyweight over the last 2 weeks
  avg_bodyweight_last_2wk <- mean(vitals_last_2wk$Weight_lbs, na.rm = TRUE)
  date_range <- paste(min_date_last_2wk, max_date, sep = " to ")
    
  # Calculate number of entries for the last 2 weeks
  number_of_entries <- nrow(vitals_last_2wk)
  
  # Create a plot for the last 2 weeks of data for the current individual
  plot <- ggplot(vitals_last_2wk, aes(x = Date, y = Weight_lbs)) +
    geom_line(color = color_palette[name]) +
    geom_point(data = vitals_last_2wk[1,], aes(x = Date, y = Weight_lbs, fill = Name), shape = 21, color = "black", size = 3) +  # Add marker on first data point
    geom_point(data = vitals_last_2wk[nrow(vitals_last_2wk),], aes(x = Date, y = Weight_lbs, fill = Name), shape = 21, color = "black", size = 3) +  # Add marker on last data point
    scale_x_date(breaks = seq(min(vitals_last_2wk$Date), max(vitals_last_2wk$Date), by = "1 day"),
                 labels = date_label_func,
                 limits = c(min(vitals_last_2wk$Date, na.rm = TRUE), max(vitals_last_2wk$Date, na.rm = TRUE))) +
    scale_y_continuous(labels = scales::number_format(accuracy = 0.1)) +  # Adjust y-axis to 1st decimal
    scale_color_manual(values = color_palette[name]) +
    scale_fill_manual(values = color_palette[name]) +  # Matching fill colors with color palette
    labs(title = paste("Bodyweight Over Last 2 Weeks for", gsub("_", " ", name)),
         subtitle = paste("    Average Bodyweight:", round(avg_bodyweight_last_2wk, 2), "lbs\n",
                          "   Date Range:", date_range, "\n",
                          "   Count:", number_of_entries, "\n\n", sep = " "),
         x = "Date",
         y = "Weight (lbs)") +
    theme_minimal() +
    theme(axis.text.x = element_text(hjust = 0.5),
          panel.grid.minor.x = element_blank(),  # Remove minor grid lines on the x-axis
          panel.grid.major = element_line(color = "grey90"), # Make grid lines lighter
          plot.subtitle = element_text(size = 0.85 * 12, face = "italic")) + 
    guides(color = FALSE, fill = FALSE)  # Remove legends for color and fill
  
  # Print each plot separately
  print(plot)
  
  # Markdown separator to add spacing between plots
  cat("\n\n")
  
  # Optionally, add the plot to a list if you want to further manipulate or save them
  plots_list[[name]] <- plot
}

```

## Avg Bodyweight Over Last 2 Weeks
Side by side comparison between individuals.

```{r table_avg_bodyweights_last_2wk, fig.width=10, fig.height=7, eval=TRUE, results='markup'}

# Ensure the necessary libraries are loaded
library(dplyr)
library(knitr)
library(kableExtra)
library(ggplot2)
library(RColorBrewer)
library(lubridate)

# Define a function to calculate and assign average bodyweight variables
assign_average_bodyweight <- function(vitals_data) {
  vitals_data_weight_lbs <- vitals_data[!is.na(as.numeric(vitals_data$Weight_lbs)), ]
  vitals_data_weight_lbs_unique_names <- unique(vitals_data_weight_lbs$Name)
  
  # Initialize a data frame to store the results
  table_avg_bodyweights_last_2wk <- data.frame(Name = character(), Average_Bodyweight_Last_2Wk = numeric(), Date_Range = character(), Age_At_Avg_Bodyweight_Last_2Wk = numeric(), stringsAsFactors = FALSE)
  
  # Loop through each individual to compile average bodyweight information
  for (name in vitals_data_weight_lbs_unique_names) {
    # Subset data for the current individual
    individual_data <- subset(vitals_data_weight_lbs, Name == name)
    
    min_date <- min(individual_data$Date)
    max_date <- max(individual_data$Date)
    min_date_last_2wk <- max_date - 13
    vitals_last_2wk <- subset (individual_data, Date >= min_date_last_2wk & Date <= max_date)
    
    # Calculate average bodyweight over the last 2 weeks
    avg_bodyweight_last_2wk <- mean(vitals_last_2wk$Weight_lbs, na.rm = TRUE)
    date_range <- paste(min_date_last_2wk, max_date, sep = " to ")
    
    # Calculate the age at the time of the average bodyweight measurement
    avg_bodyweight_last_2wk_date <- max(vitals_last_2wk$Date, na.rm = TRUE)
    birthdate <- demographic_nr$Birthdate[demographic_nr$Name == name]
    age_at_avg_bodyweight_last_2wk <- as.numeric(difftime(avg_bodyweight_last_2wk_date, birthdate, units = "weeks")) / 52.25
    
    # Add the results to the data frame
    table_avg_bodyweights_last_2wk <- rbind(table_avg_bodyweights_last_2wk, data.frame(Name = gsub("_", " ", name), Average_Bodyweight_Last_2Wk = round(avg_bodyweight_last_2wk, 2), Date_Range = date_range, Age_At_Avg_Bodyweight_Last_2Wk = ceiling(age_at_avg_bodyweight_last_2wk), stringsAsFactors = FALSE))
    
    # Define variable names for average bodyweight, date range, and age at average bodyweight
    avg_bodyweight_last_2wk_var_name <- paste("avg_bodyweight_last_2wk", tolower(name), sep = "_")
    date_range_var_name <- paste("avg_bodyweight_last_2wk_date_range", tolower(name), sep = "_")
    age_var_name <- paste("age_at_avg_bodyweight_last_2wk", tolower(name), sep = "_")
    
    # Assign calculated values to the corresponding variables
    assign(avg_bodyweight_last_2wk_var_name, avg_bodyweight_last_2wk, envir = .GlobalEnv)
    assign(date_range_var_name, date_range, envir = .GlobalEnv)
    assign(age_var_name, ceiling(age_at_avg_bodyweight_last_2wk), envir = .GlobalEnv)
  }

  # Sort the results by Name
  table_avg_bodyweights_last_2wk <- table_avg_bodyweights_last_2wk %>%
    arrange(Name)
  
  # Return the results data frame
  return(table_avg_bodyweights_last_2wk)
}

# Call the function to perform the assignments and get the results
table_avg_bodyweights_last_2wk <- assign_average_bodyweight(vitals_data)

# Perform a left join to combine the data frames based on the Name column
table_avg_bodyweights_last_2wk <- left_join(table_avg_bodyweights_last_2wk, demographic_nr, by = "Name")

# Select and order the columns
table_avg_bodyweights_last_2wk <- table_avg_bodyweights_last_2wk %>%
  select(Name, Age_At_Avg_Bodyweight_Last_2Wk, Average_Bodyweight_Last_2Wk, Date_Range)

# Calculate the number of columns
ncolumns <- ncol(table_avg_bodyweights_last_2wk)

# Define the column width ratios
column_ratios <- c(1, 1, 1.5, 1.5)

# Calculate the total ratio
total_ratio <- sum(column_ratios)

# Calculate the percentage width for each column
column_widths <- (column_ratios / total_ratio) * 100

# Print the combined table using knitr::kable
knitr::kable(table_avg_bodyweights_last_2wk, format = "html", col.names = c("Name", "Age At Avg BW", "Average Bodyweight (lbs)", "Date Range"), align = c("l", "l", "l", "l")) %>%
  kable_styling(full_width = TRUE) %>%
  column_spec(1, width = paste0(column_widths[1], "%")) %>%
  column_spec(2, width = paste0(column_widths[2], "%")) %>%
  column_spec(3, width = paste0(column_widths[3], "%")) %>%
  column_spec(4, width = paste0(column_widths[4], "%"))

# str(table_avg_bodyweights_last_2wk)

```


<!-- Bar Graph-->
```{r plot_avg_bodyweights_last_2wk, fig.width=10, fig.height=7, eval=TRUE, results='markup'}

# Ensure Name column has consistent names
table_avg_bodyweights_last_2wk$Name <- gsub("_", " ", table_avg_bodyweights_last_2wk$Name)

# Update the color palette to ensure all names are included
color_palette <- brewer.pal(n = min(8, length(unique(table_avg_bodyweights_last_2wk$Name))), "Dark2")
if (length(unique(table_avg_bodyweights_last_2wk$Name)) > 8) {
  color_palette <- rep(color_palette, length.out = length(unique(table_avg_bodyweights_last_2wk$Name)))
}
names(color_palette) <- unique(table_avg_bodyweights_last_2wk$Name)

# Plot the results using ggplot2
plot_avg_bodyweights_last_2wk <- ggplot(table_avg_bodyweights_last_2wk, aes(x = Name, y = Average_Bodyweight_Last_2Wk, fill = Name)) +
  geom_bar(stat = "identity") +
  geom_text(aes(label = paste0(Average_Bodyweight_Last_2Wk, " lbs\n",
                               "Bodyweight\n\n", 
                               "Date Range \n", 
                               sub(" to.*", "", Date_Range), "\n", 
                               "to\n", 
                               sub(".* to ", "", Date_Range), "\n\n",
                               Age_At_Avg_Bodyweight_Last_2Wk, " years old")), 
            vjust = 1.5, color = "black", size = 4) +
  scale_fill_manual(values = color_palette) +  # Matching fill colors with color palette
  labs(title = "Average Bodyweight Over Last 2 Weeks", y = "Average Bodyweight (lbs)") +
  theme_minimal() +
  theme(
    plot.title = element_text(hjust = 0.5, size = 16),  # Larger font size for the title
    axis.title.x = element_blank(),  # Remove x-axis title
    axis.title.y = element_text(size = 12),  # Font size for y-axis title
    axis.text.x = element_text(size = 12),  # Font size for x-axis text; For bold - face = "bold"
    axis.text.y = element_text(size = 10),  # Font size for y-axis text
    legend.position = "none",  # Remove legend
    panel.background = element_rect(fill = "lightblue", color = NA),  # Light blue background
    panel.grid.major = element_line(color = "white", size = 0.5),  # White major grid lines
    panel.grid.minor = element_line(color = "white", size = 0.5)  # White minor grid lines
  )

# Print the plot
print(plot_avg_bodyweights_last_2wk)


```

## Avg Bodyweight by Age
This plot displays data for all individuals. This dataset--age vs bodyweight--is seems limited in isolation. With other data points and contextion information it can offer valuable insights,particularly if used  it can still offer valuable insights when used alongside other data points and contextual information. Its usefulness largely depends on how it's interpreted and integrated into a broader analysis framework.

<!-- Plan to create set of horizontal bar charts, one for bodyweight, another for for age. Should look like column left = names, column middle = bodyweight, = column right = age. --> 

```{r plot_avg_bodyweight_by_age, fig.width=10, fig.height=7, eval=TRUE, results='markup'}

# Ensure the necessary libraries are loaded
library(ggplot2)
library(RColorBrewer)

# Ensure Name column has consistent names
table_avg_bodyweights_last_2wk$Name <- gsub("_", " ", table_avg_bodyweights_last_2wk$Name)

# Update the color palette to ensure all names are included
color_palette <- brewer.pal(n = min(8, length(unique(table_avg_bodyweights_last_2wk$Name))), "Dark2")
if (length(unique(table_avg_bodyweights_last_2wk$Name)) > 8) {
  color_palette <- rep(color_palette, length.out = length(unique(table_avg_bodyweights_last_2wk$Name)))
}
names(color_palette) <- unique(table_avg_bodyweights_last_2wk$Name)

# Convert Age to numeric 
table_avg_bodyweights_last_2wk$Age_At_Avg_Bodyweight_Last_2Wk <- as.numeric(table_avg_bodyweights_last_2wk$Age_At_Avg_Bodyweight_Last_2Wk)

# Calculate min and max ages with buffer
min_age <- min(table_avg_bodyweights_last_2wk$Age_At_Avg_Bodyweight_Last_2Wk, na.rm = TRUE) - 4
max_age <- max(table_avg_bodyweights_last_2wk$Age_At_Avg_Bodyweight_Last_2Wk, na.rm = TRUE) + 4

# Calculate min and max bodyweights with buffer
min_bodyweight <- floor(min(table_avg_bodyweights_last_2wk$Average_Bodyweight_Last_2Wk, na.rm = TRUE) / 10) * 10
max_bodyweight <- ceiling(max(table_avg_bodyweights_last_2wk$Average_Bodyweight_Last_2Wk, na.rm = TRUE) / 10) * 10

# Create a plot for Bodyweight by Age
plot_age_by_bodyweight <- ggplot(table_avg_bodyweights_last_2wk, aes(x = Age_At_Avg_Bodyweight_Last_2Wk, y = Average_Bodyweight_Last_2Wk, color = Name, fill = Name)) +
  geom_point(size = 5, shape = 21, color = "black") +
  scale_color_manual(values = color_palette) +
  scale_fill_manual(values = color_palette) +
  scale_x_continuous(limits = c(min_age, max_age), breaks = seq(min_age, max_age, by = 2)) +
  scale_y_continuous(limits = c(min_bodyweight, max_bodyweight), breaks = seq(min_bodyweight, max_bodyweight, by = 10)) +
  labs(title = "Average Bodyweight by Age, Last 2 Weeks", x = "Age", y = "Average Bodyweight (lbs)") +
  theme_minimal() +
  theme(plot.title = element_text(hjust = 0.5, size = 16),
        axis.text.x = element_text(hjust = 0.5, face = "bold"),
        panel.grid.minor.x = element_blank(),  # Remove minor grid lines on the x-axis
        panel.grid.major = element_line(color = "grey90"), # Make grid lines lighter
        plot.subtitle = element_text(size = 0.85 * 12, face = "italic"),
        legend.position = "right") 

# Print the plot
print(plot_age_by_bodyweight)

```

# DATA ENTRIES 
The meat and potatoes. Without this, what's the point?

```{r raw_data_transformation, eval=TRUE, results='asis', message=FALSE, warning=FALSE}

# Select the desired sheet
selected_sheet <- "Raw_Data"
raw_data <- read_excel(file_path_kdf, sheet = selected_sheet)

# Display first few rows and structure of the dataframe. Run for confirmation as needed.
# head(raw_data)
# str(raw_data)

# Markdown separator to add spacing between plots
cat("\n\n")

# Convert selected columns to numeric
raw_data <- raw_data %>%
  mutate_at(vars(Weight, Barbell_Weight, Total_Weight, Final_Reps, `1RM_Calculated_Weight`, Volume, TIMEVALUE), as.numeric)

# Convert Date column to Date format
raw_data$Date <- as.Date(raw_data$Date, format = "%m/%d/%Y") #Code OK as Date "2023-03-29"
# Following Date conversions below were either unsuccessful or undesirable. Unable to get proper code as Date "MM/DD/YYYY". Therefore settled with the above Date conversion as "YYYY-MM-DD."
# raw_data$Date <- mdy(raw_data$Date) #Came Bad; result NA - Do Not Use 
# raw_data$Date <- format(raw_data$Date, format = "%m/%d/%Y") #Code OK as chr "03/29/2023"

# Handle missing Duration values
# raw_data$Duration <- hms(raw_data$Duration) #Code Bad; "Error: All arguments must be numeric or NA". Since errors exist, it is best to proceed with the following: 
raw_data$Duration <- ifelse(raw_data$Duration == "-", NA, raw_data$Duration) #Code OK; this allows next step hms() or as_hms()

# Convert Duration to numeric and then to hms format
raw_data$Duration <- as.numeric(raw_data$Duration) #Code OK; same result as if converting to data type Number in Excel
raw_data$Duration <- as_hms(raw_data$Duration * 86400) #Code OK; numeric is first multiplied by the number of secs in a day.
    # Note: `as.hms()` was deprecated in hms 0.5.0.Therefore used `as_hms()` instead.
    # Consider to check for errors: original CSV file had one value "0:01:45:04" instead of "0:01:45.04" resulting in error and therefore had not converted to numeric chr. Issue has been resolved for data transformation.

# Format Duration column (optional)
formatted_duration <- sprintf("%02d:%02d:%05.2f", hour(raw_data$Duration), minute(raw_data$Duration), second(raw_data$Duration)) #May not be necessary.

# The View function is used to open the raw_data dataframe in the RStudio data viewer for interactive exploration.
View(raw_data) #Run this line of code to view table. Does not apply with knit.

# The print function is used to display the transformed raw_data dataframe in the R console.
# print(raw_data)

# str function is used to display the structure of the raw_data dataframe.
# str(raw_data)
```

## Column Entries
The following columns from the Raw Data sheet in the KDF Workbook are shown below. Nearly all columns are utilized for analyses, although some, like "Duration," are rarely recorded. Most of my workouts are focused on strength training rather than cardio and endurance training. Volume measures have also not been a primary focus compared to 1RM (one-rep max), which serves as a requirement for comparison with 1RM strength standards.

```{r raw_data_columns, eval=TRUE, results='markup'}

## Plan: Merge header/column.names if the same. Unable to figure out 05/11/2024.

# Extract column names vitals_data
column_names <- colnames(raw_data)
# print(column_names)
# str(column_names)

# Calculate the number of names
num_names <- length(column_names)

# Calculate the number of names per column
names_per_column <- ceiling(num_names / 2)

# Split the names into two columns
column_1 <- column_names[1:min(names_per_column, num_names)]
column_2 <- column_names[(min(names_per_column, num_names) + 1):num_names]

# Pad the shorter column with NA values if necessary
if(length(column_1) < length(column_2)) {
  column_1 <- c(column_1, rep("", length(column_2) - length(column_1)))
} else if(length(column_2) < length(column_1)) {
  column_2 <- c(column_2, rep("", length(column_1) - length(column_2)))
}

# Create a data frame with two columns
column_names_df <- data.frame(Column_1 = column_1,
                              Column_2 = column_2)

# Calculate the width dynamically
col_width <- paste0(100 / ncol(column_names_df), "%")

# Print the data frame using knitr::kable
knitr::kable(column_names_df, format = "markdown", col.names = c("Column Names", "Column Names")) %>%

  column_spec(1:2, width = col_width, extra_css = "text-align: left;")

```

## Exercise Variations List

Entries from `raw_data` are extracted for further analysis. Whether indoors or outdoors, using calisthenics or equipment, performing aerobic or anaerobic exercises, there are countless exercises and ways to perform them. These exercises vary in instruments and positions used, requiring different muscle activation.

Not included in the lists are entries pertaining to different grips (wide, mid, close, underhand, overhand, neutral, and alternate, such as for deadlifts). A Wide Grip Bicep Curl utilizes external rotation, limits the range of motion allowing for heavier weights, places more stress on the elbow joints, emphasizes the short head of the biceps, and engages the upper back and shoulders. In contrast, a Close Grip Bicep Curl utilizes internal rotation, allows for a greater range of motion, focuses on the long head of the biceps, and engages the forearm muscles.[^12] [^13]

```{r unique_criteria, eval=TRUE}

raw_data_unique_name <- sort(unique(raw_data$Name))
raw_data_unique_position <- sort(unique(raw_data$Position))
raw_data_unique_instrument <- sort(unique(raw_data$Instrument))
raw_data_unique_activity <- sort(unique(raw_data$Activity))

# Filter out "-" from raw_data_unique's
raw_data_unique_name <- raw_data_unique_name[raw_data_unique_name != "-"]
raw_data_unique_position <- raw_data_unique_position[raw_data_unique_position != "-"]
raw_data_unique_instrument <- raw_data_unique_instrument[raw_data_unique_instrument != "-"]
raw_data_unique_activity <- raw_data_unique_activity[raw_data_unique_activity != "-"]

```

<!--Individuals entered:
```{r raw_data_unique_name, eval=TRUE, results='markup'}
# INDIVIDUALS ENTERED:
# print(raw_data_unique_name)

# Print the data frame using knitr::kable
knitr::kable(raw_data_unique_name, format = "markdown", col.names = "Individuals") %>%
  kable_styling(full_width = TRUE)  # Ensure full-width table with equal column width

```
-->

## Positions
Incorporating different positions into a workout routine can enhance muscle development, improve functional strength, and reduce the risk of overuse injuries by ensuring a balanced workout.

```{r raw_data_unique_position, eval=TRUE, results='markup'}
# POSITIONS USED:
# print(raw_data_unique_position)

# Calculate the number of names
num_names <- length(raw_data_unique_position)

# Calculate the number of names per column
names_per_column <- ceiling(num_names / 2)

# Split the names into two columns
column_1 <- raw_data_unique_position[1:min(names_per_column, num_names)]
column_2 <- raw_data_unique_position[(min(names_per_column, num_names) + 1):num_names]

# Pad the shorter column with NA values if necessary
if(length(column_1) < length(column_2)) {
  column_1 <- c(column_1, rep("", length(column_2) - length(column_1)))
} else if(length(column_2) < length(column_1)) {
  column_2 <- c(column_2, rep("", length(column_1) - length(column_2)))
}

# Create a data frame with two columns
raw_data_unique_position_df <- data.frame(Column_1 = column_1,
                              Column_2 = column_2)

# Calculate the width dynamically
col_width <- paste0(100 / ncol(raw_data_unique_position_df), "%")

# Print the data frame using knitr::kable
knitr::kable(raw_data_unique_position_df, format = "markdown", col.names = c("Positions", "Positions")) %>%
  column_spec(1:2, width = col_width, extra_css = "text-align: left;")

```

## Instruments 
Using a variety of instruments in workouts can help target different muscle groups more effectively, provide varied resistance, and offer safer and more efficient exercise options.

```{r raw_data_unique_instrument, eval=TRUE, results='markup'}
# INSTRUMENTS USED: 
# print(raw_data_unique_instrument)

# Calculate the number of names
num_names <- length(raw_data_unique_instrument)

# Calculate the number of names per column
names_per_column <- ceiling(num_names / 3)

# Split the names into three columns
column_1 <- raw_data_unique_instrument[1:min(names_per_column, num_names)]
column_2 <- raw_data_unique_instrument[(min(names_per_column, num_names) + 1):(2 * min(names_per_column, num_names))]
column_3 <- raw_data_unique_instrument[((2 * min(names_per_column, num_names)) + 1):num_names]

# Pad the shorter columns with NA values if necessary
max_length <- max(length(column_1), length(column_2), length(column_3))
column_1 <- c(column_1, rep("", max_length - length(column_1)))
column_2 <- c(column_2, rep("", max_length - length(column_2)))
column_3 <- c(column_3, rep("", max_length - length(column_3)))

# Create a data frame with two columns
raw_data_unique_instrument_df <- data.frame(Column_1 = column_1,
                                            Column_2 = column_2,
                                            Column_3 = column_3)

# Calculate the width dynamically
col_width <- paste0(100 / ncol(raw_data_unique_instrument_df), "%")

# Print the data frame using knitr::kable
knitr::kable(raw_data_unique_instrument_df, format = "markdown", col.names = c("Instruments", "Instruments", "Instruments")) %>%
  column_spec(1:3, width = col_width, extra_css = "text-align: left;")

```

## Activities 
The following activities (or exercises) have been entered into the Raw Data. Other activities are available but have not been performed. Some, such as “Track Lane 6,” are specific to location and time duration. Other activities are universally available, such as Chest Press and Lat Pulldown. However, when time is a factor, sometimes you’ve got to pivot and improvise your workout with other types of exercises.

```{r raw_data_unique_activity, eval=TRUE, results='markup'}
# ACTIVITIES PERFORMED
# print(raw_data_unique_activity)

# Calculate the number of names
num_names <- length(raw_data_unique_activity)

# Calculate the number of names per column
names_per_column <- ceiling(num_names / 3)

# Split the names into three columns
column_1 <- raw_data_unique_activity[1:min(names_per_column, num_names)]
column_2 <- raw_data_unique_activity[(min(names_per_column, num_names) + 1):(2 * min(names_per_column, num_names))]
column_3 <- raw_data_unique_activity[((2 * min(names_per_column, num_names)) + 1):num_names]

# Pad the shorter columns with NA values if necessary
max_length <- max(length(column_1), length(column_2), length(column_3))
column_1 <- c(column_1, rep("", max_length - length(column_1)))
column_2 <- c(column_2, rep("", max_length - length(column_2)))
column_3 <- c(column_3, rep("", max_length - length(column_3)))

# Create a data frame with two columns
raw_data_unique_activity_df <- data.frame(Column_1 = column_1,
                                          Column_2 = column_2,
                                          Column_3 = column_3)

# Calculate the width dynamically
col_width <- paste0(100 / ncol(raw_data_unique_activity_df), "%")

# Print the data frame using knitr::kable
knitr::kable(raw_data_unique_activity_df, format = "markdown", col.names = c("Activities", "Activities", "Activities")) %>%
  column_spec(1:3, width = col_width, extra_css = "text-align: left;")
```

# PERFORMANCE ANALYSES 
## Weights Executed Over Time, 3-4-5+ Reps

This set of statistics is ideal for athletes or trainers who want to track performance over time, particularly in observing how maximum lifting capacity evolves across different workouts and rep thresholds. The visualizations and statistics generated can assist in adjusting training programs based on observed trends and performance plateaus.

The following figures are inspired by the 5x5 Training Model. From around September 2023 to April 2024, I primarily focused on 5 sets of 5 reps, incrementally adding weight after successfully executing 5 reps. I did not adhere to a single approach for performing the 5x5's, as there are various perspectives and workout models. For instance, in his "The 5×5 Workout Explained" (2023) article[^14], Colucci explores the 5x5 workout program. He mentions its history with Reg Park whom proposed starting with progressively heavier warm-up sets for the first two sets, followed by three sets at the same heavier weight for the final three sets. 

The 5×5 workout program is designed to build muscle size and strength by focusing on performing five sets of five repetitions of major compound exercises like squats, bench presses, and deadlifts. It emphasizes using progressively heavier weights with adequate rest intervals to ensure proper recovery and efficient muscle development. This routine is particularly beneficial for beginners to establish a solid strength foundation and for experienced lifters looking to maintain or increase muscle mass and strength. 

## Metrics Table
The following tables and plots highlight performance over time for workouts based on selected criteria. This table serves as a reference for how each metric may be used to compare with the others.

```{r statistics_table_guide, eval=TRUE, results='asis'}  

# Ensure the necessary libraries are loaded
library(dplyr)
library(knitr)
library(kableExtra)
library(ggplot2)
library(RColorBrewer)
library(lubridate)

# Print the statistics for the current plot as a table
statistics_table <- data.frame(
  Metric = c(
    "Average Weight Executed Per Set",
    "Average 1RM Calculated Weight Per Set",
    "Highest 1RM Calculated Weight",
    "Highest 1RM Calculated Weight",
    "Highest Weight Executed at 5+ Reps"
  ),

  Range = c(
    "Last 2 Weeks",
    "Last 2 Weeks",
    "Last 2 Weeks",
    "All Time",
    "All Time"
  ),
  
  Explanation = c(
    "Provides context to the Average 1RM Calculated Weight Last 2 Weeks by indicating the average effort per set",
    "Helps to gauge if recent training sessions have been effective in maintaining or increasing strength levels, compared to the average weights executed per set.",
    "Indicates the highest strength achievement in the recent period and can be compared against the all-time 1RM to see if recent training is aligning with peak historical performance.",
    "Provides a reference point for evaluating current performance trends. For instance, comparing the Highest 1RM Calculated Weight Last 2 Weeks to this value shows if current strength levels are approaching, maintaining, or falling short of peak historical performance. This ATH (all time high) metric is the one to be constantly striving to meet, if not surpass. Reviewing the timeline as to when ATH's occured helps to provide insight into the patterns and events that resulted in either growth, maintenance, or decline. For example, a trip x2 weeks out of the country was accompanied by decline in strength-based activities, leading to a decline on performance.",
    "This metric complements the 1RM values by providing insight into strength endurance. Comparing it with the Average Weight Executed Per Set Last 2 Weeks can indicate improvements or declines in sustained strength over time. "
  )
)

# Print the table using knitr::kable
kable(statistics_table, format = "html", table.attr = "class='table table-striped'") %>%
  kable_styling(full_width = TRUE) %>%
  column_spec(1, width = "2%") %>%
  column_spec(2, width = "0.5%") %>%
  column_spec(3, width = "3%") 

```

The following plots display the Total Weight Per Set. Data points represent sets performed at 3, 4, and 5+ reps.

- Data points are intentionally colored:
    - Red means Stop.
    - Green means Go.

- After 5+ reps (red dot) have been met, the weight is increased.
- If either 4 reps (blue dot) or 3 reps (green dot) have been met, the weight stays the same.

These visualizations help track progress and make data-driven decisions about adjusting weights based on performance. By observing the trends and patterns in these plots, one can ensure that training intensity is appropriately adjusted to meet performance goals.

## Performances Over Time, Selected Criteria

Not all datapoints in this section were performed using 5x5 RM. Some exercises, like Hammer Curls or Lat Pulldowns, were sometimes performed in sets of 8-12+ reps, requiring less weight and producing noticeable differences in weight load execution on the plots. This is particularly evident in Kendrick's Lat Pulldown plot. Others, such as Kendrick's Flat Bench Barbell Chest Press, show a gradual rise in performance, which is more accurate and reflects drops in performance and absences of datapoints due to life events and decreases in exercise frequency. Generally, exercises that show a higher incline angle, such as Kendrick's Barbell Squat and Kendrick's Barbell Deadlift, exhibit faster-than-normal increases in weight over time. This faster-than-normal increase in weight may be due to starting low due to uncertainty with the amount of weight required for max execution and the need to avoid injury, or it could be caused by "beginner gains."

This variance can lead to misleading interpretations when comparing performance across different exercises and rep ranges. For accurate comparison, it's best to review sections containing 1-Rep Max (1RM) data, which provide a standardized measure for evaluating performance between different sets and on a daily basis. By focusing on 1RM, you ensure consistency and comparability, mitigating the issues caused by varied rep ranges in other exercises.

<!--
### Explanation:

Multiple Performances Over Time, Selected Criteria

- Data Frame Setup: The criteria_list includes NA for optional criteria like "Position," where filtering should be skipped.
- Conditional Filtering: The plotting function checks if each criterion is not NA before applying it as a filter. This prevents unwanted filtering and ensures that only specified criteria affect the dataset.
- Plotting: Plots are still generated based on the available subset of data after applying the relevant filters.

This approach gives the flexibility to define each plot's criteria dynamically, including or excluding filters as needed per plot without altering the overall structure of the code.
-->

```{r performance_over_time_plots, fig.width=10, fig.height=7, eval=TRUE, results='asis'}

# Load necessary libraries
library(dplyr)
library(ggplot2)
library(knitr)
library(kableExtra)

# Define a list or dataframe of different criteria with NA for optional criteria
# Place NA where you don't want to filter in criteria_sets 
# Use \\(***\\) for parentheses.
# exclude_*** to specify exclusion directly within list definition for criteria. 
# Attempting to use array to handle potentially multiple criteria (ie `activities = c("Bicep Curl", "Hammer Curl")` using sapply and combining results with any() has been unsuccessful and time-consuming for review; consider to review again as it is  suitable for scenarios where multiple criteria can apply to a single attribute. As an alternative consider to use "NA" with "exclude_".

# Common criteria information
common_criteria <- list(
  name = "Kendrick"
)

# Additional notes for future filtering - 
# Define specific criteria without the 'name' field. Include 'name' for exceptions
# [instrument = "Barbell \\(Loaded\\)] # Specifying strings containing parenthesis by including back. 
# [instrument = ".*Barbell.*"]  # Use a regular expression to match any variation of "Barbell". Specifying "Barbell (Loaded)" instead of "Barbell" filters the data. "Barbell" will include all strings, similar to ".*Barbell.*" which may not be useful if looking to exclude certain data (ie Straight Barbell). 
# [exclude_instrument = "Double Pulley"]  # Exclude a single criteria. Attempting to exclude multiple criteria such as [exclude_instrument = c(criteria_one, criteria_2)] unsuccessful. Consider to try code again in future. 


criteria_sets <- list(
  list(
    position = NA,
    instrument = "Barbell",
    activity = "Squat"
  ),
  list(
    position = "Flat Bench",
    instrument = "Barbell",
    activity = "Chest Press"
  ),
  list(
    position = NA,
    instrument = "Barbell",
    activity = "Deadlift"
  ),
  list(
    position = NA,
    instrument = ".*Barbell.*",  # Use a regular expression to match any variation of "Barbell"
    activity = "Overhead Shoulder Press"
  ),
  list(
    position = NA,
    instrument = "Lat Pulldown Bar",
    exclude_instrument = "Double Pulley",
    activity = "Lat Pulldown"
  ),
  list(
    name = "Alex",
    position = "Flat Bench",
    instrument = "Barbell \\(Loaded\\)",
    activity = "Chest Press"
  ),
  list(
    position = NA,
    instrument = "Barbell",
    activity = "Bicep Curl"
  ),
  list(
    position = NA,
    instrument = "Cable",
    activity = "Hammer Curl"
  )
  # Additional criteria sets can be added here without repeating the 'name'
)

# Function to clean up label strings:
# To update the provided code so that it does not display escaped parentheses \\( and \\) in the plot labels/titles, you can introduce a cleaning step for the labels. This can be done using the gsub function to remove or replace the escape characters before using them in the plot labels. 
clean_label <- function(label) {
  gsub("\\\\|\\.|\\*", "", label)  # Remove the escape characters for parentheses and ".*"
}

# Define a plotting function that takes criteria_sets as an argument and checks for NA
generate_plot <- function(criteria_sets) {

  # If 'name' is not in the criteria, use the common 'name'
  if (!"name" %in% names(criteria_sets)) {
    criteria_sets$name <- common_criteria$name
  }
  
  # Start filtering with mandatory criteria_sets (Name in this case)
  raw_data_criteria <- raw_data %>%
    filter(grepl(criteria_sets$name, Name, ignore.case = TRUE))

  # Conditionally add filters if criteria_sets are not NA
  # Filtering for instrument with inclusion and specific exclusion criteria
  if (!is.na(criteria_sets$position) && criteria_sets$position != "") {
    raw_data_criteria <- raw_data_criteria %>%
      filter(grepl(criteria_sets$position, Position, ignore.case = TRUE))
      if (!is.null(criteria_sets$exclude_position)) {
        raw_data_criteria <- raw_data_criteria %>%
          filter(!grepl(criteria_sets$exclude_position, Position, ignore.case = TRUE))
      }
  }
  
  if (!is.na(criteria_sets$instrument) && criteria_sets$instrument != "") {
    raw_data_criteria <- raw_data_criteria %>%
      filter(grepl(criteria_sets$instrument, Instrument, ignore.case = TRUE))
       if (!is.null(criteria_sets$exclude_instrument)) {
        raw_data_criteria <- raw_data_criteria %>%
          filter(!grepl(criteria_sets$exclude_instrument, Instrument, ignore.case = TRUE))
      }
  }
  
  if (!is.na(criteria_sets$activity) && criteria_sets$activity != "") {
    raw_data_criteria <- raw_data_criteria %>%
      filter(grepl(criteria_sets$activity, Activity, ignore.case = TRUE))
       if (!is.null(criteria_sets$exclude_activity)) {
        raw_data_criteria <- raw_data_criteria %>%
          filter(!grepl(criteria_sets$exclude_activity, Activity, ignore.case = TRUE))
      }
  }

  filtered_data_reps_5 <- subset(raw_data_criteria, Final_Reps >= 5)
  filtered_data_reps_4 <- subset(raw_data_criteria, Final_Reps >= 4)
  filtered_data_reps_3 <- subset(raw_data_criteria, Final_Reps >= 3)
  
  max_weight_reps_5 <- max(filtered_data_reps_5$Total_Weight)
  max_weight_reps_4 <- max(filtered_data_reps_4$Total_Weight)
  max_weight_reps_3 <- max(filtered_data_reps_3$Total_Weight)

  regression_line_reps_5 <- lm(Total_Weight ~ Date, data = filtered_data_reps_5)
  regression_line_reps_4 <- lm(Total_Weight ~ Date, data = filtered_data_reps_4)
  regression_line_reps_3 <- lm(Total_Weight ~ Date, data = filtered_data_reps_3)
  
  slope_reps_5 <- coef(regression_line_reps_5)[2]
  slope_reps_4 <- coef(regression_line_reps_4)[2]
  slope_reps_3 <- coef(regression_line_reps_3)[2]
  
  slope_sign_reps_5 <- ifelse(slope_reps_5 >= 0, "+", "-")
  slope_sign_reps_4 <- ifelse(slope_reps_4 >= 0, "+", "-")
  slope_sign_reps_3 <- ifelse(slope_reps_3 >= 0, "+", "-")
  
  subtitle_left_reps_5 <- str_pad(paste("Highest Weight Executed At 5+ Reps:", max_weight_reps_5, "lbs."), 
                                  width = nchar("Highest Weight Executed At 5+ Reps: "), side = "left")
  subtitle_left_reps_4 <- str_pad(paste("Highest Weight Executed At 4 Reps:", max_weight_reps_4, "lbs."), 
                                  width = nchar("Highest Weight Executed At 4 Reps: "), side = "left")
  subtitle_left_reps_3 <- str_pad(paste("Highest Weight Executed At 3 Reps:", max_weight_reps_3, "lbs."), 
                                  width = nchar("Highest Weight Executed At 3 Reps: "), side = "left")
  
  subtitle_right_reps_5 <- str_pad(paste("Slope:", slope_sign_reps_5, abs(round(slope_reps_5, 3)), "lbs per day"), 
                                   width = nchar("Slope: "), side = "right")
  subtitle_right_reps_4 <- str_pad(paste("Slope:", slope_sign_reps_4, abs(round(slope_reps_4, 3)), "lbs per day"), 
                                   width = nchar("Slope: "), side = "right")
  subtitle_right_reps_3 <- str_pad(paste("Slope:", slope_sign_reps_3, abs(round(slope_reps_3, 3)), "lbs per day"), 
                                   width = nchar("Slope: "), side = "right")
  
  min_date <- min(c(min(filtered_data_reps_5$Date), 
                    min(filtered_data_reps_4$Date), 
                    min(filtered_data_reps_3$Date)))
  max_date <- max(c(max(filtered_data_reps_5$Date), 
                    max(filtered_data_reps_4$Date), 
                    max(filtered_data_reps_3$Date)))
  date_range <- paste("Date Range ", format(min_date, "%m/%d/%Y"), " to ", format(max_date, "%m/%d/%Y"))

  clean_label_title = paste(criteria_sets$name,
                "-",
                ifelse(is.na(criteria_sets$position), "", clean_label(criteria_sets$position)), 
                ifelse(is.na(criteria_sets$instrument), "", clean_label(criteria_sets$instrument)), 
                ifelse(is.na(criteria_sets$activity), "", clean_label(criteria_sets$activity))  
  )  

  plot <- ggplot() +
    geom_point(data = filtered_data_reps_3, aes(x = Date, y = Total_Weight, color = "3"),
               stat = "summary", fun = max, size = 3) +
    geom_smooth(data = filtered_data_reps_3, aes(x = Date, y = Total_Weight), 
                method = "lm", se = FALSE, linetype = "dashed", color = "#008000", size = 1) +
    geom_point(data = filtered_data_reps_4, aes(x = Date, y = Total_Weight, color = "4"),
               stat = "summary", fun = max, size = 3) +
    geom_smooth(data = filtered_data_reps_4, aes(x = Date, y = Total_Weight), 
                method = "lm", se = FALSE, linetype = "dashed", color = "#1f78b4", size = 1) +
    geom_point(data = filtered_data_reps_5, aes(x = Date, y = Total_Weight, color = "5+"),
               stat = "summary", fun = max, size = 3) +
    geom_smooth(data = filtered_data_reps_5, aes(x = Date, y = Total_Weight), 
                method = "lm", se = FALSE, linetype = "dashed", color = "#e31a1c", size = 1) +
    
    labs(title = clean_label_title,
         x = "Date", 
         y = "Weight (lbs) Executed",
         subtitle = paste(date_range, "\n", subtitle_left_reps_5, " ", subtitle_right_reps_5, "\n", subtitle_left_reps_4, " ", subtitle_right_reps_4, "\n", subtitle_left_reps_3, " ", subtitle_right_reps_3),
         color = "Minimum Reps"
    ) +
    theme_minimal() +
    scale_x_date(labels = scales::date_format("%m/%d/%Y"), date_breaks = "1 week") +
    theme(axis.text.x = element_text(angle = 90, hjust = 1),
          plot.title = element_text(size = 0.85 * 16, face = "bold"),
          plot.subtitle = element_text(size = 0.85 * 12, face = "italic"),
          axis.title = element_text(size = 0.85 * 14),
          axis.text = element_text(size = 0.85 * 10),
          legend.position = "bottom",
          legend.box = "horizontal",
          legend.direction = "horizontal"
    ) +
    scale_color_manual(values = c("3" = "#008000", "4" = "#1f78b4", "5+" = "#e31a1c"), name = "Minimum Reps")

  # Calculate max 1RM weights and corresponding dates
  max_1rm_all_time <- max(raw_data_criteria$`1RM_Calculated_Weight`, na.rm = TRUE) 
  last_date_max_1rm_all_time <- max(raw_data_criteria$Date[raw_data_criteria$`1RM_Calculated_Weight` == max_1rm_all_time], na.rm = TRUE) # Most recent date for highest 1RM over all time.

  # Most recent date for highest Total Weight executed at 5+ reps. 
  last_date_max_weight_reps_5 <- max(filtered_data_reps_5$Date[filtered_data_reps_5$Total_Weight == max_weight_reps_5], na.rm = TRUE)
      
  # Data for last 2 weeks
  min_date_last_2wk <- max_date - 13
  raw_data_last_2wk <- subset(raw_data_criteria, Date >= min_date_last_2wk & Date <= max_date)
  
  max_1rm_last_2wk <- max(raw_data_last_2wk$`1RM_Calculated_Weight`, na.rm = TRUE)
  last_date_max_1rm_last_2wk <- max(raw_data_last_2wk$Date[raw_data_last_2wk$`1RM_Calculated_Weight` == max_1rm_last_2wk], na.rm = TRUE) # Most recent date for highest 1RM over last 2 weeks. 

  avg_weight_executed_per_set_last_2wk <- mean(raw_data_last_2wk$Total_Weight, na.rm = TRUE)
  avg_1rm_calculated_weight_last_2wk <- mean(raw_data_last_2wk$`1RM_Calculated_Weight`, na.rm = TRUE)
  
  # Print the statistics for the current plot as a table
  statistics_table <- data.frame(
    Metric = c(
      "Average Weight Executed Per Set:",
      "Average 1RM Calculated Weight Per Set:",
      "Highest 1RM Calculated Weight:",
      "Highest 1RM Calculated Weight:",
      "Highest Weight Executed at 5+ Reps:"
    ),

    `Value (lbs)` = c(
      paste(round(avg_weight_executed_per_set_last_2wk, 2)),
      paste(round(avg_1rm_calculated_weight_last_2wk, 2)),
      paste(round(max_1rm_last_2wk,2)),
      paste(round(max_1rm_all_time,2)),
      paste(round(max_weight_reps_5,2))
    ),
    
    Range = c(
      "Last 2 Weeks",
      "Last 2 Weeks",
      "Last 2 Weeks",
      "All Time",
      "All Time"
    ),
    
    Date = c(
      paste(format(min_date_last_2wk, "%m/%d/%Y"), " to " , format(max_date, "%m/%d/%Y")),
      paste(format(min_date_last_2wk, "%m/%d/%Y"), " to " , format(max_date, "%m/%d/%Y")),
      format(last_date_max_1rm_last_2wk, "%m/%d/%Y"),
      format(last_date_max_1rm_all_time, "%m/%d/%Y"),
      format(last_date_max_weight_reps_5, "%m/%d/%Y")
    )
  )
  
  # Update the column name
  colnames(statistics_table)[1] <- paste(clean_label_title)
  
  cat(paste0("\n\n<h4> ", clean_label_title, "</h4>\n"))
  print(
    kable(statistics_table, format = "html", align = 'l') %>%
      kable_styling(full_width = TRUE) %>%
      column_spec(1:ncol(statistics_table), width = paste0(100 / ncol(statistics_table)))
  )
  
  plot <- plot + theme(
    axis.text.x = element_text(size = rel(0.8), angle = 90, hjust = 1),
    axis.title = element_text(size = rel(1)),
    plot.title = element_text(size = rel(1.2), face = "bold"),
    plot.subtitle = element_text(size = rel(1))
  )

  # Print the plot immediately after the statistics
  # To ensure that the output of each plot is printed immediately after its corresponding summary statistics, integrate the printing of the statistics with the plot within the loop. The print function will print the plot to the R console or viewer pane, depending on the setup.
  print(plot)
}

# Loop over each set of criteria_sets and generate a plot along with their corresponding statistics
invisible(lapply(criteria_sets, function(criteria) {
  generate_plot(criteria)
}))


```

# STATS TO STANDARDS

Performance measures are evaluated to match Standards.  

<!-- 
PLAN: DO A SEPARATE MAX 1RM PERFORMANCE COMPARISON (INSTEAD OF AVG 1RM)
-->

## Integrating Performance Metrics with Mark Rippetoe Strength Standards 

Performance measures are evaluated to match standards. This section combines practical workout data analysis with Mark Rippetoe's Strength Standards. The standards are essential for evaluating and contextualizing individual fitness achievements against standardized criteria.

For a higher quality of personal performance comparison, we define specific criteria to match the standards presented by Mark. Should we choose, we can specify our criteria for Flat Bench Dumbbell Chest Press or Incline Bench Dumbbell Chest Press, each likely requiring different weights and numbers of reps for execution. As a reminder, Mark Rippetoe's training utilizes the barbell as the instrument for growth. By explaining in detail the Bench Press in his Starting Strength book [^5], we know the strength standard is measured via the flat bench. Therefore, we specify Flat Bench Barbell (Loaded) Chest Press in our criteria to compare with Mark Rippetoe's 1RM Male Bench Press Standards.

```{r setup_performance_on_mark_rippetoe_standards, eval=TRUE, results='markup'}

# Setup and Library Import: 
# All necessary R packages (dplyr, tidyr, readr, lubridate) should be installed and loaded. These libraries are critical for data manipulation (dplyr), data tidying (tidyr), handling date objects (lubridate), and reading data (readr).

# Define Exercise Criteria: 
# A list named exercise_criteria is defined where each exercise (e.g., Bench Press, Squat) has specific criteria such as position, instrument, and activity pattern. This manual setup aids in filtering the workout data based on the predefined standards. 
# Note: Mark Rippetoe's Standards does not specify positioning (flat bench, incline bench, standing) nor instrument used (barbell, dumbbell). These additional criteria is defined into the list. 

# Creating Criteria Strings: 
# A function is developed to convert the exercise criteria into a single string that can be used for filtering or displaying. This function takes into account various aspects of the exercise like position, instrument, and activity, concatenating them into a readable string.

# Matching Criteria with Standards:
# The function create_criteria_string is applied row-wise to the mark_rippetoe_standards_list dataframe, which is likely a dataset containing predefined standards for different exercises. This application enriches the dataframe by adding a Selected Criteria that provides human-readable criteria for each standard.

library(dplyr)
library(tidyr)
library(knitr)

# Define Exercise Criteria: 
exercise_criteria <- list(
  "Bench Press" = list(Position = "Flat Bench", Instrument = "Barbell", Activity = "Chest Press"),
  "Press" = list(Instrument = "Barbell", Activity = "Overhead Shoulder Press"),
  "Squat" = list(Instrument = "Barbell", Activity = "Squat"),
  "Deadlift" = list(Instrument = "Barbell", Activity = "Deadlift")
)

# Function to create criteria strings
create_criteria_string <- function(category_name) {
  criteria <- exercise_criteria[[category_name]]
  if (is.null(criteria)) return(NA)
  
  criteria_strings <- c()
  if (!is.null(criteria$Position)) {
    modified_position <- gsub("\\.\\*(.*?)\\.\\*", "\\1", criteria$Position)
    criteria_strings <- c(criteria_strings, modified_position)
  }
  if (!is.null(criteria$Instrument)) {
    modified_instrument <- gsub("\\.\\*(.*?)\\.\\*", "\\1", criteria$Instrument)
    criteria_strings <- c(criteria_strings, modified_instrument)
  }
  if (!is.null(criteria$Activity)) {
    modified_activity <- gsub("\\.\\*(.*?)\\.\\*", "\\1", criteria$Activity)
    criteria_strings <- c(criteria_strings, modified_activity)
  }
  
  return(paste(criteria_strings, collapse = " "))
}

# Apply criteria string creation and remove 'Category Sheet'
mark_rippetoe_standards_criteria_match <- mark_rippetoe_standards_list %>%
  rowwise() %>%
  mutate(`Selected Criteria` = create_criteria_string(`Category Name`)) %>%
  ungroup() %>%
  select(-`Category Sheet`)  # This excludes the 'Category Sheet' column

# Add Index to standards
mark_rippetoe_standards_criteria_match <- mark_rippetoe_standards_criteria_match %>%
  mutate(Index = row_number())

# Display the modified 'Strength Standard' column with spaces instead of underscores
display_mark_rippetoe_standards_criteria_match <- mark_rippetoe_standards_criteria_match %>%
  mutate(`Strength Standard` = gsub("_", " ", `Strength Standard`))

# Use knitr::kable to format as a markdown table
knitr::kable(display_mark_rippetoe_standards_criteria_match, format = "markdown", caption = "Specified Criteria with Standards For Reference")
# print(mark_rippetoe_standards_criteria_match) # Print tibble for reference

```

## 1RM Table - Avg L2W & ATH 
The following table reports the average one-repetition maximum (1RM) for each individual across each specified exercise ("Selected Criteria") over the last two weeks (L2W). The data for the last two weeks serves as a benchmark for the next workout. As the saying goes, "You're only as good as your last."

The table also reports highest 1RM over all time (ATH) for each selected criterion, accompanied by the last date the highest 1RM was achieved.

The 1RM is calculated based on weight executed per set and the number of reps, using the 1RM Conversion Table[^10].

All calculations assume the standard barbell (Olympic barbell) weight of 45 pounds (20.4 kg).

Keep in mind there are many variables affecting performance. One of these variables includes grip (wide, alternating, overhand, etc.). One individual may use alternating grips (left underhand/right overhand, or vice-versa) when performing deadlifts to maximize weight execution. Another individual might utilize an overhand grip. Someone else may perform a combination of alternating grip and overhand grip. The following datasets and figures do not account for these differences.

<!-- Plan 
PLAN: Create code to save plots as file.**
--> 

```{r table_1rm_performance_on_mark_rippetoe_standards, eval=TRUE, results='markup'}

# This block of R code adjusts the approach to analyzing the average one-repetition maximum (1RM) for each individual over the last two weeks from the most recent date of each specific exercise.

# This block of R code performs a detailed analysis and reporting of the average one-repetition maximum (1RM) for each individual in the dataset over the last two weeks for specific exercises. The report includes several details about each exercise, such as the number of sets, number of workout days, and the actual date range of the workouts. 

# cat("AVERAGE 1 REP MAX FOR LAST 2 WEEKS OF DATA \n")
# cat("Note: 1RM is calculated based on Weight Executed Per Set, Number of Reps, and a Conversion Table for Repetition Percentages of 1RM (source: https://strengthlevel.com/one-rep-max-calculator as of 02/2024).\n\n")


# Load necessary libraries
library(dplyr)
library(lubridate)
library(knitr)
library(kableExtra)

# Filtering Data
raw_data <- raw_data %>%
  mutate(`1RM_Calculated_Weight` = as.numeric(`1RM_Calculated_Weight`, warn = FALSE)) %>%
  filter(!is.na(`1RM_Calculated_Weight`)) %>%
  mutate(Date = as.Date(Date, format="%m/%d/%Y"))

# Unique names vector
unique_names <- unique(raw_data$Name)

# Create Criteria Strings
create_criteria_string <- function(category_name) {
  criteria <- exercise_criteria[[category_name]]
  if (is.null(criteria)) return(NA)
  criteria_strings <- c()
  if (!is.null(criteria$Position)) criteria_strings <- append(criteria_strings, criteria$Position)
  if (!is.null(criteria$Instrument)) criteria_strings <- append(criteria_strings, criteria$Instrument)
  if (!is.null(criteria$Activity)) {
    modified_activity <- gsub("\\.\\*(.*?)\\.\\*", "\\1", criteria$Activity)
    criteria_strings <- append(criteria_strings, modified_activity)
  }
  return(gsub("_", " ", paste(criteria_strings, collapse = " ")))
}

# Function to expand standards with performance data
expand_standards_with_performance <- function(standards, raw_data, unique_names, exercise_criteria) {
  performance_data <- list()
  for (name in unique_names) {
    for (exercise in names(exercise_criteria)) {
      criteria <- exercise_criteria[[exercise]]
      selected_criteria <- create_criteria_string(exercise)
      
      # Filter raw_data by name and exercise criteria
      raw_data_criteria <- raw_data %>%
        filter(Name == name) %>%
        filter(if (!is.null(criteria$Position)) Position == criteria$Position else TRUE,
               grepl(criteria$Instrument, Instrument, ignore.case = TRUE),
               grepl(criteria$Activity, Activity, ignore.case = TRUE))
      
      # Most recent workout date for the specific exercise
      individual_max_date <- max(raw_data_criteria$Date, na.rm = TRUE)
      
      # Filter for the last two weeks from the most recent date of the exercise
      individual_min_date_last_2wk <- individual_max_date - days(13)
      raw_data_criteria_last_2wk <- raw_data_criteria %>%
        filter(Date >= individual_min_date_last_2wk & Date <= individual_max_date)
      
      avg_1rm_last_2wk <- if(nrow(raw_data_criteria_last_2wk) > 0) round(mean(raw_data_criteria_last_2wk$`1RM_Calculated_Weight`, na.rm = TRUE), 2) else "No data available"
      date_range <- if(nrow(raw_data_criteria_last_2wk) > 0) paste(individual_min_date_last_2wk, individual_max_date, sep = " to ") else "No data available"
      num_sets <- nrow(raw_data_criteria_last_2wk)
      num_workout_days <- length(unique(raw_data_criteria_last_2wk$Date))
      
      # Calculate highest 1RM all time and the corresponding date
      max_1rm_all_time <- if (nrow(raw_data_criteria) > 0) max(raw_data_criteria$`1RM_Calculated_Weight`, na.rm = TRUE) else NA
      max_1rm_all_time_date <- if (!is.na(max_1rm_all_time)) {
        max(raw_data_criteria$Date[raw_data_criteria$`1RM_Calculated_Weight` == max_1rm_all_time], na.rm = TRUE)
      } else {
        NA
      }
      
#      Exclude for display purposes. Refer to the table for same data. 
#      cat("Category Name:", exercise, "\n")
#      cat("Selected Criteria:", selected_criteria, "\n")
#      cat("Average 1RM (lbs):", ifelse(is.nan(avg_1rm_last_2wk) | num_sets == 0, "No data available", round(avg_1rm_last_2wk, 2)), "\n")
#      cat("Date Range (Last 2 Weeks of Data):", date_range, "\n")
#      cat("Based on", num_sets, "sets\n")
#      cat("Based on", num_workout_days, "workout days\n")
#      cat("Highest 1RM All Time (lbs):", ifelse(is.na(max_1rm_all_time), "No data available", round(max_1rm_all_time, 2)), "\n")
#      cat("Date of Last 1RM ATH:", ifelse(is.na(max_1rm_all_time_date), "No data available", format(max_1rm_all_time_date, "%Y-%m-%d")), "\n")
#      cat("-\n")

      performance_data[[length(performance_data) + 1]] <- tibble(
        Name = name,
        `Category Name` = exercise,
        `Selected Criteria` = selected_criteria,
        `Average 1RM (lbs)` = ifelse(is.numeric(avg_1rm_last_2wk), sprintf("%.2f", avg_1rm_last_2wk), avg_1rm_last_2wk),
        `Date Range (Last 2 Weeks)` = date_range,
        `Number of Sets` = num_sets,
        `Number of Days` = num_workout_days,
        `Highest 1RM All Time (lbs)` = ifelse(is.numeric(max_1rm_all_time), sprintf("%.2f", max_1rm_all_time), max_1rm_all_time),
        `Last Date Highest 1RM All Time` = ifelse(!is.na(max_1rm_all_time_date), format(max_1rm_all_time_date, "%Y-%m-%d"), "No data available"),
        Index = match(exercise, standards$`Category Name`) # Use match to find the original order
      )
    }
  }

  combined_performance_data <- bind_rows(performance_data)
  expanded_standards <- standards %>%
    right_join(combined_performance_data, by = c("Category Name", "Selected Criteria", "Index")) %>%
    arrange(Name, Index) # Sort by Name and then by the original order
  return(expanded_standards)
}

# Execute and display results
performance_on_mark_rippetoe_standards <- expand_standards_with_performance(mark_rippetoe_standards_criteria_match, raw_data, unique_names, exercise_criteria)

# Modify the 'Strength Standards' column for display purposes by replacing "_" with " "
performance_on_mark_rippetoe_standards <- performance_on_mark_rippetoe_standards %>%
  mutate(`Strength Standard` = gsub("_", " ", `Strength Standard`)) %>%
  arrange(
    `Index`,  # Ascending by default
    `Name`
    # desc(`Average 1RM (lbs)` # Descending
  )

# Select specific columns for display
display_performance_on_mark_rippetoe_standards <- performance_on_mark_rippetoe_standards %>%
  select(`Selected Criteria`, 
         `Index`,
         `Name`,
         `Average 1RM (lbs)`, 
         `Date Range (Last 2 Weeks)`, 
         `Highest 1RM All Time (lbs)`, 
         `Last Date Highest 1RM All Time`) %>%
  filter_all(all_vars(. != "No data available" & !is.na(.)))

# Display the first few rows of the dataframe
# head(performance_on_mark_rippetoe_standards)

# Use kable and kableExtra for formatting of table output
kable(display_performance_on_mark_rippetoe_standards, "html", align = 'l') %>% 
  kable_styling(full_width = TRUE, 
                fixed_thead = TRUE) %>%
  column_spec(1:ncol(display_performance_on_mark_rippetoe_standards), width = "auto")

# Print the column headers to choose Dynamic Ordering. 
column_headers <- colnames(performance_on_mark_rippetoe_standards)
column_headers_display <- colnames(display_performance_on_mark_rippetoe_standards)
# print(column_headers)
# print(column_headers_display)

```

## Individual Performance Tables
Filtered results are grouped by individual. Some individuals do not have recorded data that meets the criteria.

```{r individual_table_performance_to_mark_rippetoe_standards, eval=TRUE, results='asis'}

# Load necessary libraries
library(dplyr)
library(knitr)
library(kableExtra)

# Loop through each unique name and generate a table for each
for (i in seq_along(unique(display_performance_on_mark_rippetoe_standards$Name))) {
  
  name <- unique(display_performance_on_mark_rippetoe_standards$Name)[i]
  
  # Add this conditional check to process only "Alex"
  # if (name == "Alex") {

  # Filter the results for the current name
  individual_performance <- display_performance_on_mark_rippetoe_standards %>%
    filter(Name == name) %>%
    select(-Name)  # Remove the "Name" column for the table
  
  # Print the section header
  cat(paste0("\n\n<h4>", name, " - 1RM Performance </h4>\n\n"))
  
  # Use kable and kableExtra for formatting of table output
  kable(individual_performance, "html", align = 'l') %>% 
    kable_styling(full_width = TRUE, 
                  fixed_thead = TRUE) %>%
    column_spec(1:ncol(individual_performance), width = "auto") %>%
    print()  # Print each table
  
  cat("\n\n")
}

```


# 1RM Plots - Performance Over Standards 
Performance is plotted over Mark Rippetoe's Strength Standards.

Some considerations for the following plots:

- Average 1RM for each individual is based on data from the last 2 weeks (L2W). As presented in the 1RM Table above, date ranges between each individual differ, ranging from a few weeks to several months or even years. Despite these differences, this plot is still beneficial for comparing performance among individuals, fostering friendly competition, and motivating action toward higher performance.

- Highest 1RM (ATH - All Time Highs) for each individual is based on data from all entries.

- Average Bodyweight represents data from the last 2 weeks of entries for each individual. The following plots and Category Levels use Average Bodyweight for both Average 1RM L2W and ATH executions. Separate calculations are required to obtain Bodyweight at the time when Average 1RM L2W was executed and Bodyweight when ATH 1RM was executed.

- The date and date ranges for Average Bodyweight (lbs) and 1RM statistics may not align. For example: 
    - Date Range for Average 1RM L2W: 2023-10-27 to 2023-11-09		
    - Date for ATH 1RM: 2023-10-18	
    - Date Range for Average Bodyweight L2W: 2024-05-24 to 2024-06-06
    - Further calculations are required to align the date ranges between bodyweight and performance, allowing greater reliability in the 1RM weight execution-to-bodyweight ratio.

## Average 1RM Last 2 Weeks (L2W)

```{r avg_1rm_last_2wk_on_standards, fig.width=10, fig.height=7, eval=TRUE, results='asis'}

# Ensure the necessary libraries are loaded
library(dplyr)
library(ggplot2)
library(RColorBrewer)
library(knitr)
library(kableExtra)

# Load the standards tables and store them in a list
standards_tables <- list(
  "male_bench_press_std_1rm_by_bodyweight_mark_rippetoe" = male_bench_press_std_1rm_by_bodyweight_mark_rippetoe,
  "male_deadlift_std_1rm_by_bodyweight_mark_rippetoe" = male_deadlift_std_1rm_by_bodyweight_mark_rippetoe,
  "male_press_std_1rm_by_bodyweight_mark_rippetoe" = male_press_std_1rm_by_bodyweight_mark_rippetoe,
  "male_squat_std_1rm_by_bodyweight_mark_rippetoe" = male_squat_std_1rm_by_bodyweight_mark_rippetoe
)

# Function to find category level
find_category_level <- function(weight, bw, standards_table) {
  standards <- standards_tables[[standards_table]]
  if (is.null(standards)) {
    stop("Standards table not found.")
  }
  
  # Print for debugging
  # print(standards)
  # print(bw)
  
  rounded_bw <- standards %>% filter(`Bodyweight` >= bw) %>% slice(1) %>% pull(`Bodyweight`)
  categories <- standards %>% filter(`Bodyweight` == rounded_bw)
  
  # Print for debugging
  # print(categories)
  
  # Column names in the standards tables have extra spaces, such as "Cat I" instead of "Cat I"
  category <- ifelse(weight < categories$`Cat  I`, "Below Cat. I",
               ifelse(weight < categories$`Cat  II`, "Cat. I",
               ifelse(weight < categories$`Cat  III`, "Cat. II",
               ifelse(weight < categories$`Cat  IV`, "Cat. III",
               ifelse(weight < categories$`Cat  V`, "Cat. IV", "Cat. V")))))
  return(category)
}

# Define a function to add data points to existing plots and generate a table
add_bodyweight_and_1rm_to_plot <- function(existing_plot, performance_data, table_avg_bodyweights_last_2wk, standards_table) {
  # Filter and join the data
  combined_data <- performance_data %>%
    inner_join(table_avg_bodyweights_last_2wk, by = "Name") %>%
    mutate(`Average 1RM (lbs)` = as.numeric(`Average 1RM (lbs)`)) %>%
    filter(!is.na(`Average 1RM (lbs)`) & `Average 1RM (lbs)` != "No data available") %>%
    mutate(Category_Level_L2W = mapply(find_category_level, `Average 1RM (lbs)`, `Average_Bodyweight_Last_2Wk`, MoreArgs = list(standards_table = standards_table)))

  # Print for debugging
  # print(combined_data)
  
  # Add points to the existing plot
  updated_plot <- existing_plot +
    geom_point(data = combined_data, aes(x = `Average_Bodyweight_Last_2Wk`, y = `Average 1RM (lbs)`, color = Name, fill = Name), 
               size = 4, shape = 21, color = "black") +
    scale_color_manual(values = color_palette) +
    scale_fill_manual(values = color_palette) +
    labs(color = "Name", fill = "Name") +
    theme(legend.text = element_text(size = 11), # Decrease legend text size
          legend.title = element_text(size = 11), # Decrease legend title size
          legend.key.size = unit(0.5, "cm")) # Decrease legend key size

  # Create a table with the specified columns
  table_data <- combined_data %>%
    select(Name, 
           `Average 1RM (lbs)`, 
           `Date Range (Last 2 Weeks)`, 
           `Average_Bodyweight_Last_2Wk`, 
           Date_Range,
           Category_Level_L2W) %>%
    rename(
        `Date Range` = `Date Range (Last 2 Weeks)`,
        `Avg BW L2W (lbs)` = `Average_Bodyweight_Last_2Wk`,
        Category_Level = Category_Level_L2W
        )

  # Update table further with the specified columns
  table_data <- table_data %>%
    select(Name, 
           `Average 1RM (lbs)`, 
           # `Date Range`, 
           `Avg BW L2W (lbs)`, 
           Category_Level
           # Date_Range
           )
  
  # Substitute underscores with spaces in column names
  colnames(table_data) <- gsub("_", " ", colnames(table_data))
  
  # Print the combined table using knitr::kable
  num_cols <- ncol(table_data)
  table_html <- kable(table_data, format = "html", escape = FALSE, align = 'l') %>%
    kable_styling(full_width = TRUE) %>%
    column_spec(1:num_cols, width = paste0(100 / num_cols, "%"), extra_css = "text-align: left;")
  
  return(list(plot = updated_plot, table = table_html, table_data = table_data))
}

# Retrieve existing plots and titles
existing_plots <- plots_list_strength_standards_mark_rippetoe
plot_titles <- names(existing_plots)

# Map plot titles to standards table names
plot_to_standards_table <- list(
  "Male Bench Press Std 1RM By Bodyweight Mark Rippetoe" = "male_bench_press_std_1rm_by_bodyweight_mark_rippetoe",
  "Male Deadlift Std 1RM By Bodyweight Mark Rippetoe" = "male_deadlift_std_1rm_by_bodyweight_mark_rippetoe",
  "Male Press Std 1RM By Bodyweight Mark Rippetoe" = "male_press_std_1rm_by_bodyweight_mark_rippetoe",
  "Male Squat Std 1RM By Bodyweight Mark Rippetoe" = "male_squat_std_1rm_by_bodyweight_mark_rippetoe"
)

# Iterate through each standard and add data points to the existing plots
for (plot_title in plot_titles) {
  
  # Filter performance data for the current standard
  performance_data <- filter(performance_on_mark_rippetoe_standards, `Strength Standard` == plot_title)
  
  # Get the existing plot
  existing_plot <- existing_plots[[plot_title]]
  
  # Get the corresponding standards table name
  standards_table <- plot_to_standards_table[[plot_title]]
  
  # Add bodyweight and 1RM data points to the existing plot
  result <- add_bodyweight_and_1rm_to_plot(existing_plot, performance_data, table_avg_bodyweights_last_2wk, standards_table)
  
  clean_title <- (sub(" Std 1RM By Bodyweight Mark Rippetoe", " Standards, Avg 1RM L2W", plot_title))
  cat(paste0("\n\n<h4>", clean_title, "</h4>\n\n"))

  # Print the updated plot
  print(result$plot)
  
  # Print the table
  print(result$table)
  
  # Save the plot
  # plot_filename <- paste0("plots/avg_1rm_last_2wk_on_", tolower(gsub(" ", "_", plot_title)), "_", Sys.Date(), ".png")
  # ggsave(plot_filename, plot = result$plot, width = 10, height = 7)
  
  # Save the table
  # table_filename <- paste0("tables/avg_1rm_last_2wk_on_", tolower(gsub(" ", "_", plot_title)), "_", Sys.Date(), ".csv")
  # write.csv(result$table_data, file = table_filename, row.names = FALSE)
}

```

## Highest 1RM All Time (ATH)

```{r highest_1rm_all_time_on_standards, fig.width=10, fig.height=7, eval=TRUE, results='asis'}

# Ensure the necessary libraries are loaded
library(dplyr)
library(ggplot2)
library(RColorBrewer)
library(knitr)
library(kableExtra)

# Function to find category level for ATH
find_category_level_ath <- function(weight, bw, standards_table) {
  standards <- standards_tables[[standards_table]]
  if (is.null(standards)) {
    stop("Standards table not found.")
  }
  
  rounded_bw <- standards %>% filter(`Bodyweight` >= bw) %>% slice(1) %>% pull(`Bodyweight`)
  categories <- standards %>% filter(`Bodyweight` == rounded_bw)
  
  category <- ifelse(weight < categories$`Cat  I`, "Below Cat. I",
               ifelse(weight < categories$`Cat  II`, "Cat. I",
               ifelse(weight < categories$`Cat  III`, "Cat. II",
               ifelse(weight < categories$`Cat  IV`, "Cat. III",
               ifelse(weight < categories$`Cat  V`, "Cat. IV", "Cat. V")))))
  return(category)
}

# Define a function to add data points to existing plots and generate a table for ATH
add_bodyweight_and_1rm_to_plot_ath <- function(existing_plot, performance_data, table_avg_bodyweights_last_2wk, standards_table) {
  # Filter and join the data
  combined_data <- performance_data %>%
    inner_join(table_avg_bodyweights_last_2wk, by = "Name") %>%
    mutate(`1RM ATH (lbs)` = as.numeric(`Highest 1RM All Time (lbs)`)) %>%
    filter(!is.na(`1RM ATH (lbs)`) & `1RM ATH (lbs)` != "No data available") %>%
    mutate(Category_Level_ATH = mapply(find_category_level_ath, `1RM ATH (lbs)`, `Average_Bodyweight_Last_2Wk`, MoreArgs = list(standards_table = standards_table)))

  # Add points to the existing plot
  updated_plot <- existing_plot +
    geom_point(data = combined_data, aes(x = `Average_Bodyweight_Last_2Wk`, y = `1RM ATH (lbs)`, color = Name, fill = Name), 
               size = 4, shape = 24, color = "black") +
    scale_color_manual(values = color_palette) +
    scale_fill_manual(values = color_palette) +
    labs(color = "Name", fill = "Name") +
    theme(legend.text = element_text(size = 11), # Decrease legend text size
          legend.title = element_text(size = 11), # Decrease legend title size
          legend.key.size = unit(0.5, "cm")) # Decrease legend key size

  # Create a table with the specified columns
  table_data <- combined_data %>%
    select(Name, 
           `Highest 1RM All Time (lbs)`, 
           `Last Date Highest 1RM All Time`, 
           `Average_Bodyweight_Last_2Wk`, 
           `Date Range (Last 2 Weeks)`,
           Category_Level_ATH) %>%
    rename(
        `1RM ATH (lbs)` = `Highest 1RM All Time (lbs)`,
        Date = `Last Date Highest 1RM All Time`,
        `Avg BW L2W (lbs)` = `Average_Bodyweight_Last_2Wk`,
        `Date Range` = `Date Range (Last 2 Weeks)`,
        Category_Level = Category_Level_ATH
        )

   # Update table further with the specified columns
    table_data <- table_data %>%
      select(Name, 
             `1RM ATH (lbs)`, 
             # Date, 
             `Avg BW L2W (lbs)`, 
             Category_Level
             # Date_Range
             )
  
  # Substitute underscores with spaces in column names
  colnames(table_data) <- gsub("_", " ", colnames(table_data))
  
  # Print the combined table using knitr::kable
  num_cols <- ncol(table_data)
  table_html <- kable(table_data, format = "html", escape = FALSE, align = 'l') %>%
    kable_styling(full_width = TRUE) %>%
    column_spec(1:num_cols, width = paste0(100 / num_cols, "%"), extra_css = "text-align: left;")

  return(list(plot = updated_plot, table = table_html, table_data = table_data))
}

# Retrieve existing plots and titles
existing_plots <- plots_list_strength_standards_mark_rippetoe
plot_titles <- names(existing_plots)

# Map plot titles to standards table names
plot_to_standards_table <- list(
  "Male Bench Press Std 1RM By Bodyweight Mark Rippetoe" = "male_bench_press_std_1rm_by_bodyweight_mark_rippetoe",
  "Male Deadlift Std 1RM By Bodyweight Mark Rippetoe" = "male_deadlift_std_1rm_by_bodyweight_mark_rippetoe",
  "Male Press Std 1RM By Bodyweight Mark Rippetoe" = "male_press_std_1rm_by_bodyweight_mark_rippetoe",
  "Male Squat Std 1RM By Bodyweight Mark Rippetoe" = "male_squat_std_1rm_by_bodyweight_mark_rippetoe"
)

# Iterate through each standard and add data points to the existing plots for ATH
for (plot_title in plot_titles) {
  # Filter performance data for the current standard
  performance_data <- filter(performance_on_mark_rippetoe_standards, `Strength Standard` == plot_title)
  
  # Get the existing plot
  existing_plot <- existing_plots[[plot_title]]
  
  # Get the corresponding standards table name
  standards_table <- plot_to_standards_table[[plot_title]]
  
  # Add bodyweight and 1RM data points to the existing plot for ATH
  result <- add_bodyweight_and_1rm_to_plot_ath(existing_plot, performance_data, table_avg_bodyweights_last_2wk, standards_table)

  clean_title <- (sub(" Std 1RM By Bodyweight Mark Rippetoe", " Standards, 1RM ATH", plot_title))
  cat(paste0("\n\n<h4>", clean_title, "</h4>\n\n"))
  
  # Print the updated plot
  print(result$plot)
  
  # Print the table
  print(result$table)
  
  # Save the plot
  # plot_filename <- paste0("plots/highest_1rm_all_time_on_", tolower(gsub(" ", "_", plot_title)), "_", Sys.Date(), ".png")
  # ggsave(plot_filename, plot = result$plot, width = 10, height = 7)
  
  # Save the table
  # table_filename <- paste0("tables/highest_1rm_all_time_on_", tolower(gsub(" ", "_", plot_title)), "_", Sys.Date(), ".csv")
  # write.csv(result$table_data, file = table_filename, row.names = FALSE)
}

```

## L2W & ATH - Side By Side 
Performance measures for Average 1RM Last 2 Weeks (L2W) and Highest 1RM All Time (ATH) are merged into one plot for side-by-side comparison. This visualization helps individuals reach higher performance by highlighting recent performance and identifying personal bests, thereby fostering healthy competition and motivating consistent progress.

This combined plot allows for an easy comparison of recent performance and all-time bests, providing a clear visual representation of each individual's progress and potential. By understanding these metrics, individuals can identify areas for improvement and set goals to surpass their personal bests, ultimately leveling up their performance.

Note: Each exercise plot is followed by its table for reference. 

Using Average 1RM Last 2 Weeks (L2W) for training purposes is more effective than comparing Highest 1RM Last 2 Weeks to 1RM All Time High (ATH). This is because the average 1RM over the last two weeks provides a more consistent and realistic measure of current strength levels, which can be influenced by various factors such as fatigue, recovery, and daily performance variability. Relying solely on the highest 1RM may not accurately reflect true performance capability and can lead to unrealistic training loads and potential overtraining. 

```{r avg_1rm__l2w_and_ath_1rm_on_standards, fig.width=10, fig.height=7, eval=TRUE, results='asis'}

# Ensure the necessary libraries are loaded
library(dplyr)
library(ggplot2)
library(RColorBrewer)
library(knitr)
library(kableExtra)

# Define color palettes
color_palette_l2w <- c("Alex - Avg L2W" = "#1f77b4", 
                       "Kendrick - Avg L2W" = "#ff7f0e", 
                       "Lucien - Avg L2W" = "#9467bd", 
                       "Josh - Avg L2W" = "#8c564b")
color_palette_ath <- c("Alex - ATH" = "#2ca02c", 
                       "Kendrick - ATH" = "#d62728", 
                       "Lucien - ATH" = "#e377c2", 
                       "Josh - ATH" = "#7f7f7f")

# Load the standards tables and store them in a list
standards_tables <- list(
  "male_bench_press_std_1rm_by_bodyweight_mark_rippetoe" = male_bench_press_std_1rm_by_bodyweight_mark_rippetoe,
  "male_deadlift_std_1rm_by_bodyweight_mark_rippetoe" = male_deadlift_std_1rm_by_bodyweight_mark_rippetoe,
  "male_press_std_1rm_by_bodyweight_mark_rippetoe" = male_press_std_1rm_by_bodyweight_mark_rippetoe,
  "male_squat_std_1rm_by_bodyweight_mark_rippetoe" = male_squat_std_1rm_by_bodyweight_mark_rippetoe
)

# Function to find category level
find_category_level <- function(weight, bw, standards_table) {
  standards <- standards_tables[[standards_table]]
  if (is.null(standards)) {
    stop("Standards table not found.")
  }
  
  rounded_bw <- standards %>% filter(`Bodyweight` >= bw) %>% slice(1) %>% pull(`Bodyweight`)
  categories <- standards %>% filter(`Bodyweight` == rounded_bw)
  
  category <- ifelse(weight < categories$`Cat  I`, "Below Cat. I",
               ifelse(weight < categories$`Cat  II`, "Cat. I",
               ifelse(weight < categories$`Cat  III`, "Cat. II",
               ifelse(weight < categories$`Cat  IV`, "Cat. III",
               ifelse(weight < categories$`Cat  V`, "Cat. IV", "Cat. V")))))
  return(category)
}

# Define a function to add data points to existing plots
add_bodyweight_and_1rm_to_plot <- function(existing_plot, performance_data, table_avg_bodyweights_last_2wk, standards_table) {
  # Filter and join the data
  combined_data <- performance_data %>%
    inner_join(table_avg_bodyweights_last_2wk, by = "Name") %>%
    mutate(`Average 1RM (lbs)` = as.numeric(`Average 1RM (lbs)`),
           `Highest 1RM All Time (lbs)` = as.numeric(`Highest 1RM All Time (lbs)`)) %>%
    filter(!is.na(`Average 1RM (lbs)`) & `Average 1RM (lbs)` != "No data available") %>%
    mutate(Name_Avg_L2W = paste(Name, "- Avg L2W"), 
           Name_ATH = paste(Name, "- ATH"),
           Category_Level_L2W = mapply(find_category_level, `Average 1RM (lbs)`, `Average_Bodyweight_Last_2Wk`, MoreArgs = list(standards_table = standards_table)),
           Category_Level_ATH = mapply(find_category_level, `Highest 1RM All Time (lbs)`, `Average_Bodyweight_Last_2Wk`, MoreArgs = list(standards_table = standards_table)))

  # Add points to the existing plot
  updated_plot <- existing_plot +
    geom_point(data = combined_data, aes(x = Average_Bodyweight_Last_2Wk, y = `Average 1RM (lbs)`, color = Name_Avg_L2W, fill = Name_Avg_L2W), 
               size = 4, shape = 21, color = "black") +
    geom_point(data = combined_data, aes(x = Average_Bodyweight_Last_2Wk, y = `Highest 1RM All Time (lbs)`, color = Name_ATH, fill = Name_ATH), 
               size = 4, shape = 24, color = "black") +
    scale_color_manual(values = c(color_palette_l2w, color_palette_ath)) +
    scale_fill_manual(values = c(color_palette_l2w, color_palette_ath)) +
    guides(color = guide_legend(order = 1), fill = guide_legend(order = 2)) +
    labs(color = "Name", fill = "Name") +
    theme(legend.text = element_text(size = 11), # Decrease legend text size
          legend.title = element_text(size = 11), # Decrease legend title size
          legend.key.size = unit(0.5, "cm")) # Decrease legend key size

  
  # Create a table with the specified columns
  table_data <- combined_data %>%
    select(Name, 
           `Average 1RM (lbs)`, 
           `Date Range (Last 2 Weeks)`, 
           Category_Level_L2W,
           `Highest 1RM All Time (lbs)`, 
           `Last Date Highest 1RM All Time`, 
           Category_Level_ATH,
           `Average_Bodyweight_Last_2Wk`, 
           Date_Range) %>%
    rename(
        `Date Range L2W` = `Date Range (Last 2 Weeks)`,
        `1RM ATH (lbs)` = `Highest 1RM All Time (lbs)`,
        `Date ATH` = `Last Date Highest 1RM All Time`,
        `Avg BW L2W (lbs)` = `Average_Bodyweight_Last_2Wk`
        )
  
  # Update table further with the specified columns
  table_data <- table_data %>%
    select(Name, 
           `Average 1RM (lbs)`, 
           # `Date Range L2W`, 
           Category_Level_L2W,
           `1RM ATH (lbs)`, 
           # `Date ATH`, 
           Category_Level_ATH,
           `Avg BW L2W (lbs)` 
           # `Date_Range`
           )
  
  # Substitute underscores with spaces in column names
  colnames(table_data) <- gsub("_", " ", colnames(table_data))

  # Print the combined table using knitr::kable
  table_html <- kable(table_data, format = "html", escape = FALSE, align = 'l') %>%
    kable_styling(full_width = TRUE) %>%
    column_spec(1:ncol(table_data), extra_css = "text-align: left;")

  return(list(plot = updated_plot, table = table_html, table_data = table_data))
}

# Retrieve existing plots and titles
existing_plots <- plots_list_strength_standards_mark_rippetoe
plot_titles <- names(existing_plots)

# Map plot titles to standards table names
plot_to_standards_table <- list(
  "Male Bench Press Std 1RM By Bodyweight Mark Rippetoe" = "male_bench_press_std_1rm_by_bodyweight_mark_rippetoe",
  "Male Deadlift Std 1RM By Bodyweight Mark Rippetoe" = "male_deadlift_std_1rm_by_bodyweight_mark_rippetoe",
  "Male Press Std 1RM By Bodyweight Mark Rippetoe" = "male_press_std_1rm_by_bodyweight_mark_rippetoe",
  "Male Squat Std 1RM By Bodyweight Mark Rippetoe" = "male_squat_std_1rm_by_bodyweight_mark_rippetoe"
)

# Iterate through each standard and add data points to the existing plots
for (plot_title in plot_titles) {
  # Filter performance data for the current standard
  performance_data <- filter(performance_on_mark_rippetoe_standards, `Strength Standard` == plot_title)
  
  # Get the existing plot
  existing_plot <- existing_plots[[plot_title]]
  
  # Get the corresponding standards table name
  standards_table <- plot_to_standards_table[[plot_title]]
  
  # Add bodyweight and 1RM data points to the existing plot
  result <- add_bodyweight_and_1rm_to_plot(existing_plot, performance_data, table_avg_bodyweights_last_2wk, standards_table)
  
  clean_title <- (sub(" Std 1RM By Bodyweight Mark Rippetoe", " Standards, Avg 1RM L2W & 1RM ATH", plot_title))
  cat(paste0("\n\n<h4>", clean_title, "</h4>\n\n"))

  # Print the updated plot
  print(result$plot)
  
  # Print the table
  print(result$table)
  
  # Save the plot
  plot_filename <- paste0("plots/avg_1rm_l2w_and_ath_1rm_on__", tolower(gsub(" ", "_", plot_title)), "_", Sys.Date(), ".png")
  ggsave(plot_filename, plot = result$plot, width = 10, height = 7)
  
  # Save the table data to CSV
  table_filename <- paste0("tables/avg_1rm_l2w_and_ath_1rm_on_", tolower(gsub(" ", "_", plot_title)), "_", Sys.Date(), ".csv")
  write.csv(result$table_data, file = table_filename, row.names = FALSE)
}

```



<!-- # TEST  

```{r test_performance_and_bodyweight_over_time, fig.width=10, fig.height=7, eval=FALSE, results='asis'}


# Ensure the necessary libraries are loaded
library(dplyr)
library(ggplot2)
library(knitr)
library(kableExtra)

# Rename the column in vitals_data_weight_lbs if it exists, otherwise leave unchanged
vitals_data_weight_lbs <- vitals_data_weight_lbs %>%
  rename_with(~ifelse(. == "Weight_lbs", "Bodyweight (lbs)", .))

# Filter bodyweight data for Kendrick and calculate Daily Average Bodyweight
kendrick_bodyweight_data <- vitals_data_weight_lbs %>%
  filter(Name == "Kendrick") %>%
  group_by(Date) %>%
  summarize(Daily_Avg_BW = mean(`Bodyweight (lbs)`, na.rm = TRUE)) %>%
  ungroup()

# Print unique values to debug criteria matching
print("Unique values in Position column:")
print(unique(kendrick_performance_data$Position))

print("Unique values in Instrument column:")
print(unique(kendrick_performance_data$Instrument))

print("Unique values in Activity column:")
print(unique(kendrick_performance_data$Activity))

# Define Exercise Criteria (adjusted based on the unique values in the dataset)
exercise_criteria <- list(
  "Bench Press" = list(Position = "Flat Bench", Instrument = "Barbell", Activity = "Chest Press"),
  "Press" = list(Instrument = "Barbell", Activity = "Overhead Shoulder Press"),
  "Squat" = list(Instrument = "Barbell", Activity = "Squat"),
  "Deadlift" = list(Instrument = "Barbell", Activity = "Deadlift")
)

# Filter performance data for Kendrick
kendrick_performance_data <- raw_data %>%
  filter(Name == "Kendrick")

# Function to apply the criteria filter
apply_criteria_filter <- function(data, criteria) {
  if (!is.null(criteria$Position)) {
    data <- data %>% filter(Position == criteria$Position)
  }
  if (!is.null(criteria$Instrument)) {
    data <- data %>% filter(Instrument == criteria$Instrument)
  }
  if (!is.null(criteria$Activity)) {
    data <- data %>% filter(Activity == criteria$Activity)
  }
  return(data)
}

# Loop through each exercise criteria
for (exercise in names(exercise_criteria)) {
  criteria <- exercise_criteria[[exercise]]
  
  # Print criteria for debugging
  print(paste("Applying criteria for", exercise))
  print(criteria)
  
  # Apply criteria filter to performance data
  filtered_performance_data <- apply_criteria_filter(kendrick_performance_data, criteria)
  
  # Print summary of filtered_performance_data for debugging
  print(paste("Filtered data for", exercise))
  print(summary(filtered_performance_data))
  
  # Calculate Daily Average 1RM for the current criteria
  daily_avg_1rm_data <- filtered_performance_data %>%
    group_by(Date) %>%
    summarize(Daily_Avg_1RM = mean(`1RM_Calculated_Weight`, na.rm = TRUE)) %>%
    ungroup()
  
  # Print summary of daily_avg_1rm_data
  print(paste("Summary for", exercise))
  print(summary(daily_avg_1rm_data))
  
  # Merge the datasets on Date
  kendrick_combined_data <- daily_avg_1rm_data %>%
    full_join(kendrick_bodyweight_data, by = "Date")
  
  # Handle missing data by filling gaps
  kendrick_combined_data <- kendrick_combined_data %>%
    arrange(Date) %>%
    tidyr::fill(Daily_Avg_1RM, Daily_Avg_BW, .direction = "downup")
  
  # Plot Daily Average 1RM and Bodyweight over time for Kendrick
  p <- ggplot(kendrick_combined_data, aes(x = Date)) +
    geom_line(aes(y = Daily_Avg_1RM, color = "Daily Avg 1RM"), size = 1) +
    geom_line(aes(y = Daily_Avg_BW, color = "Daily Avg BW"), size = 1) +
    scale_color_manual(values = c("Daily Avg 1RM" = "blue", "Daily Avg BW" = "red")) +
    labs(title = paste("Kendrick's Daily Avg 1RM and Bodyweight Over Time for", exercise),
         x = "Date",
         y = "Value",
         color = "Metric") +
    theme_minimal() +
    theme(
      plot.title = element_text(hjust = 0.5, size = 16),
      axis.title.x = element_text(size = 14),
      axis.title.y = element_text(size = 14),
      axis.text = element_text(size = 12),
      legend.position = "right",
      legend.title = element_blank(),
      legend.text = element_text(size = 12),
      legend.key.size = unit(1, "cm")
    )
  
  print(p)
}


```
--> 

# NOTES FOR FUTURE IMPLEMENTATION

## Power BI vs. RStudio (R Programming)
Creating this report requires knowledge in managing the different coding structure--it requires knowing R. It can be replicated through other programs, and by doing so may enhance the overall experience. Such is the case with Power BI but in a different aspect.

Power BI is better suited for creating interactive dashboards and visualizations quickly and sharing them easily with non-technical stakeholders. It is user-friendly and integrates well with other Microsoft products, making it ideal for business environments that require regular reporting and real-time data updates.

R Studio, on the other hand, is ideal for in-depth data analysis, complex statistical modeling, and projects that require extensive data manipulation and custom analyses. It offers greater flexibility and is more powerful for advanced analytics, although it requires programming knowledge and may have a steeper learning curve.

If the primary goal is to perform advanced statistical analysis and custom modeling, R Studio would be more appropriate. However, if the goal is to create interactive, shareable dashboards and reports, Power BI might be the better choice. This is a toss coin, as my intent is to provide both advanced statistical analysis and custom modeling as I learn to further utilize the program, but also allow others the availability to interact with their data, should that time ever come. 

## Prospective Statistical Analyses 
More advanced statistical measurements are available to derive from the dataset. I've included a general list of ideas to explore. Some ideas require a larger demographic group, and others necessitate the use of technological devices (watches and phone apps). At the time of publishing, I am working on plotting multiple variables on one plot. One includes plotting Average Bodyweight (y) and Daily Average 1RM (y) over time (x). This will help explore any correlation and causation between variables. Findings are expected to provide adaptable insights into daily routines and habits. 

### Dual Y-axis Plot
This type of plot allows you to visualize two different y-variables against the same x-variable 

Average Bodyweight and Daily Average 1RM over time

  - Plot = Selected Criteria for Individual
  - Y-variable = Avg Bodyweight
      - Requires creating new df from Vitals; Daily_Avg_Bodyweight (Avg BW of all measures on each date)
  - Y-variable = Daily Avg 1RM 
      - Requires creating new df from Raw Data; Daily_Avg_1RM (Avg 1RM of all sets on each date)
  - X-variable = Date (day/month)
  - Imagine plot similar to Webull Comparison Quotes structure
  

Daily Average 1RM and # Daily Sets over time for Selected Criteria 

  - Plot = Selected Criteria for Individual
  - Y-variable = Daily Average 1RM
      - Consider to sub Daily with Weekly or Monthly Average 1RM 
      - Consider to sub Average 1RM with Highest 1RM 
  - Y-variable = Daily Sets Performed
      - Consider to sub Daily Sets with Weekly or Monthly Sets
      - A larger and consistent number of sets performed is expected correlate or maintain 1RM measures.
      - Gaps in number of sets performed is expected to correlate with decreased 1RM measures

```{r sample, fig.width=10, fig.height=7, eval=TRUE, results='asis'}

# Install ggpubr package if not already installed
if (!requireNamespace("ggpubr", quietly = TRUE)) {
  install.packages("ggpubr")
}

# Load necessary libraries
library(ggplot2)
library(scales)
library(ggpubr)

# Create the dataset
data <- data.frame(
  Date = as.Date(c('2023-01-01', '2023-01-02', '2023-01-03', '2023-01-04', 
                   '2023-01-05', '2023-01-06', '2023-01-07', '2023-01-08', 
                   '2023-01-09', '2023-01-10')),
  Average_Bodyweight = c(70, 71, 70, 70.5, 71.2, 70.8, 70.5, 71, 71.5, 70.7),
  Daily_Average_1RM = c(150, 152, 151, 153, 155, 154, 153, 156, 158, 157)
)

# Determine the y-axis limits for both variables
y1_min <- min(data$Daily_Average_1RM) - 1
y1_max <- max(data$Daily_Average_1RM) + 1
y2_min <- min(data$Average_Bodyweight) - 1
y2_max <- max(data$Average_Bodyweight) + 1

# Scaling function for the secondary axis
scale_factor <- (y2_max - y2_min) / (y1_max - y1_min)

# Define the date label function to add a newline between the month and the day  
date_label_func <- function(x) {
  labels <- format(x, "%b\n%d") # Displays as "Sep\n15", separating month and day with a newline
  labels[1] <- paste0(labels[1], "\n", format(x[1], "%Y"))  # Append the year to the first label
  return(labels)
}

# Create the plot
p <- ggplot(data, aes(x = Date)) +
  geom_line(aes(y = Daily_Average_1RM, color = "Daily Average 1RM")) +
  geom_line(aes(y = (Average_Bodyweight - y2_min) / scale_factor + y1_min, color = "Average Bodyweight")) +
  geom_point(aes(y = Daily_Average_1RM, color = "Daily Average 1RM"), size = 3, data = data[c(1, nrow(data)), ]) +
  geom_point(aes(y = (Average_Bodyweight - y2_min) / scale_factor + y1_min, color = "Average Bodyweight"), size = 3, data = data[c(1, nrow(data)), ]) +
  scale_y_continuous(
    name = "Daily Average 1RM",
    limits = c(y1_min, y1_max),
    labels = scales::number_format(accuracy = 0.1),
    sec.axis = sec_axis(~ . * scale_factor + y2_min, name = "Average Bodyweight (kg)", labels = scales::number_format(accuracy = 0.1))
  ) +
  scale_x_date(
    breaks = seq(min(data$Date), max(data$Date), by = "1 day"),
    minor_breaks = seq(min(data$Date), max(data$Date), by = "1 day"),
    labels = date_label_func
  ) +
  scale_color_manual(
    name = "Variables",
    values = c("Daily Average 1RM" = "blue", "Average Bodyweight" = "red")
  ) +
  labs(
    title = "Average Bodyweight and Daily Average 1RM Over Time",
    x = "" # Date or other if desired
  ) +
  theme_minimal() +
  theme(
    axis.title.y.right = element_text(color = "red", margin = margin(l = 15)),
    axis.title.y.left = element_text(color = "blue", margin = margin(r = 15)),
    axis.title.x = element_text(margin = margin(t = 5)),
    panel.grid.major = element_line(color = "grey80"),
    panel.grid.minor = element_line(color = "grey90"),
    panel.grid.major.x = element_line(color = "grey80"),
    panel.grid.minor.x = element_line(color = "grey90"),
    panel.grid.major.y = element_line(color = "grey80"),
    panel.grid.minor.y = element_line(color = "grey90"),
    axis.ticks.length = unit(0.2, "cm"),
    axis.text.y.left = element_text(margin = margin(r = 2)), # Adjust margin here
    axis.text.y.right = element_text(margin = margin(l = 2)), # Adjust margin here
    axis.text.x = element_text(margin = margin(t = 0.5)),  # Add some margin to x-axis labels for better readability,
    legend.title = element_blank(),
    legend.spacing.x = unit(0.5, 'cm')
  )

# Arrange the plot and the legend
ggarrange(p, ncol = 1, legend = "top")


```

<!--
  scale_x_date(date_breaks = "1 month", labels = date_label_func,
               limits = c(min(vitals_data$Date, na.rm = TRUE), extended_all_data_max_date)) +
  scale_y_continuous(labels = scales::number_format(accuracy = 0.1)) +  # Adjust y-axis to 1st decimal
  scale_color_manual(values = setNames(color_palette, gsub("_", " ", names(color_palette)))) +
  scale_fill_manual(values = setNames(color_palette, gsub("_", " ", names(color_palette)))) +  # Matching fill colors with color palette
  labs(title = "Bodyweight Over Time for All Individuals",
       x = "Date",
       y = "Weight (lbs)",
       color = "Name",
       fill = "Name") +
  theme_minimal() +
  theme(axis.text.x = element_text(hjust = 0.5),
        panel.grid.minor.x = element_blank(),  # Remove minor grid lines on the x-axis
        panel.grid.major = element_line(color = "grey90"))  # Make grid lines lighter
-->


<!-- 
scale_x_date(breaks by x1 month (the data will be spread out over months). 
Create geom_market; Add marker on first data point and last data point
-->

### Stacked Bar Chart 
Loop plots for each individual. Show [total] value of a category and segments that bar to show the proportion of different sub-categories within that [total] value.

% Sets of Selected Criteria by ATH 1RM and Avg 1RM 

  - Value = % sets performed (ie 2% of all sets performed)
  - Sub-category = ATH 1RM as the full bar (ie Chest Press 174.70, Deadlift 267.44)
  - Sub-category = Avg 1RM L2W as percentage of ATH 1RM (ie Chest Press 152.71, Deadlift 219.21)
  - Category = Selected Criteria (Chest Press, Deadlift, etc.)
  - *Avg 1RM L2W as percentage of ATH 1RM may not be the best dataset to compare with respect to % sets of selected criteria performed. Consider other Sub-categories for comparison.
  
### Other Charts

Vertical Bar Chart

- Usage: Comparing the frequency or count of different exercise types.
- Example: Number of sessions for each exercise type (e.g., Bench Press, Deadlift, Squat, Press).

Horizontal Bar Chart:

- Comparing the frequency or count of different exercise types with long labels.
- Example: Number of sessions for each exercise type, displayed horizontally for better readability.

Stacked Bar Chart:

- Usage: Showing the composition of sets within each exercise.
- Example: Number of sets performed for each exercise type, broken down by reps range (e.g., 1-5 reps, 6-10 reps, 11+ reps).

Grouped (Clustered) Bar Chart:

- Usage: Comparing the average 1RM (one-repetition maximum) for different exercises across multiple individuals.
- Example: Average 1RM for Bench Press, Deadlift, Squat, and Press for Alex, Kendrick, and Josh.

100% Stacked Bar Chart:

- Usage: Showing the percentage composition of total workouts by exercise type.
- Example: Percentage of total workouts made up by Bench Press, Deadlift, Squat, and Press.

Diverging Bar Chart:

- Usage: Displaying positive and negative performance changes in different exercises.
- Example: Performance improvement or decline in 1RM for each exercise type over a period.

Lollipop Chart:

- Usage: Showing average bodyweight for each individual.
- Example: Average bodyweight of Alex, Kendrick, and Josh represented as dots on lines.

Dot Plot:

- Usage: Showing the performance scores (e.g., 1RM) for different exercises.
- Example: Performance scores for Bench Press, Deadlift, Squat, and Press for each individual.

Variable Width Bar Chart:

- Usage: Showing the volume of weight lifted where the width represents the number of sessions.
- Example: Total weight lifted in Bench Press, Deadlift, Squat, and Press with bar width representing the number of sessions.

Waterfall Chart:

- Usage: Showing the cumulative effect of different exercise sessions on total weight lifted.
- Example: Cumulative total weight lifted over time for different exercises.

Pareto Chart:

- Usage: Highlighting the most significant exercises contributing to total weight lifted.
- Example: Total weight lifted for each exercise type in descending order with a line showing cumulative percentage.

Gantt Chart:

- Usage: Project planning and tracking progress over time.
- Example: Workout schedule showing the timeline for different exercises and sets over a month.

Histogram:

- Usage: Showing the frequency distribution of bodyweight.
- Example: Distribution of bodyweight entries over the last month.

Mosaic Plot:

- Usage: Displaying the relationship between exercise type and the number of sets performed.
- Example: Visualization of how different exercises are associated with the number of sets performed.

## Utilization of Datasets

Longitudinal Analysis of Training Progress:

- Objective: To track and analyze the training progress over an extended period.
- Implementation: Utilize time series analysis to monitor changes in 1RM and bodyweight over time. This will help identify trends, plateaus, and periods of significant improvement. Applying smoothing techniques and ARIMA models can aid in forecasting future performance based on historical data.

Correlation and Causation Studies:

- Objective: To understand the relationship between different variables.
- Implementation: Perform correlation analysis to investigate the relationships between bodyweight, training frequency, nutritional intake, and 1RM performance. Follow up with regression analysis to identify potential causal relationships and quantify the impact of various factors on strength gains.

Comparative Analysis Across Different Demographics:

- Objective: To compare performance metrics across different demographics such as age, gender, and training experience.
- Implementation: Use t-tests and ANOVA to compare 1RM performance and bodyweight across different demographic groups. This can help tailor training programs to specific groups and improve overall effectiveness.

Enhanced Visualization Techniques:

- Objective: To create more insightful and interactive visualizations.
- Implementation: Incorporate advanced visualization libraries such as plotly for interactive plots that allow for deeper exploration of the data. This can include dynamic plots where users can filter and view specific subsets of the data.

Predictive Modeling for Performance Optimization:

- Objective: To develop models that predict performance outcomes and optimize training regimens.
- Implementation: Apply machine learning techniques such as random forests, support vector machines, or neural networks to predict 1RM performance based on a range of input variables. These models can be used to suggest optimal training loads and volumes for individuals.

Cluster Analysis for Identifying Training Patterns:

- Objective: To identify distinct training patterns and group similar individuals.
- Implementation: Use clustering techniques such as K-means clustering to identify groups of individuals with similar training patterns and performance metrics. This can help in customizing training programs for different clusters based on their unique characteristics.

Survival Analysis for Training Attrition:

- Objective: To analyze the dropout rates and factors affecting training adherence.
- Implementation: Perform survival analysis to study the time-to-dropout for trainees and identify factors that influence training adherence. This can guide interventions aimed at reducing dropout rates and improving long-term training commitment.

Principal Component Analysis (PCA) for Dimensionality Reduction:

- Objective: To simplify complex datasets and identify key performance indicators.
- Implementation: Apply PCA to reduce the dimensionality of your dataset while retaining most of the variance. This can help in identifying key variables that most influence training outcomes and performance.

Nutrition and Recovery Analysis:

- Objective: To study the impact of nutrition and recovery on performance.
- Implementation: Collect and analyze data on dietary intake, sleep patterns, and recovery methods. Use regression analysis to determine how these factors influence strength gains and overall performance.

Benchmarking and Standard Development:

- Objective: To establish performance benchmarks and standards for different populations.
- Implementation: Utilize the collected data to develop performance standards and benchmarks. This can involve creating percentile ranks and performance categories that can be used for setting realistic goals and expectations for trainees.

**Data Collection Enhancements**

Incorporating Wearable Technology: 

- Utilize wearable devices to collect real-time data on heart rate, sleep quality, and daily activity levels. This data can provide deeper insights into how lifestyle factors influence training performance.

Detailed Nutritional Logging: 

- Implement a system for detailed nutritional logging, allowing for the collection of data on macronutrient and micronutrient intake. Analyzing this data can help optimize dietary plans to support training goals.

Psychological and Motivational Assessments: 

- Regularly assess psychological factors such as motivation, stress levels, and mental fatigue. Understanding the psychological aspects of training can provide a more holistic view of an individual’s performance and adherence to the training program.

Recovery and Injury Data: 

- Collect data on recovery strategies and injury occurrences. Analyzing this information can help in developing injury prevention programs and optimizing recovery protocols to enhance performance.

Advanced Biometric Data: 

- Incorporate the collection of advanced biometric data such as blood markers, hormone levels, and muscle oxygenation. This can provide a deeper understanding of physiological responses to training.

# REFERENCES 

<!-- 
There are multiple references above for the same notation below. R has default settings to assign one notation per reference, while keeping the order you have designed. For example [^3] is used at least twice, but will appear as [^3] and [^4] in the knit product. The following numbers will bump up. Same goes for One Rep Max Calculator [^10], which at time of review prints as [^11] and [^14]. Annotation numbers may have changed but hopefully you understand my point.
-->

[^1]: Tham, K. (2024). *KDF: Strength and Bodyweight Performance Analysis (`r Sys.Date()`).* RPubs by RStudio. Retrieved from <https://rpubs.com/KenDoesData/>.

[^2]: Tham, K. (2024). *KDF (KEN DOES FITNESS).* Google Drive. Available at <https://drive.google.com/drive/folders/1-41FJ_c41Kj7KaK0h1aJ5EBvEhPwwIsi?usp=sharing>

[^3]: Tham, K. (2024). *Strength Standards Final.xlsm.* Ken Does Data. Retrieved from <https://1drv.ms/x/s!AsYYZA9nL8lf8CK6k6rAP3LSkvry?e=ylARXi>.

[^4]: Rippetoe, M. (2012). *Strength Standards.* Starting Strength. Retrieved from <https://startingstrength.com/files/standards.pdf>.

[^5]: Rippetoe, M. (2017). *Starting Strength: Basic Barbell Training (3rd ed.).* The Aasgaard Company. Retrieved from <https://aasgaardco.com/store/books-posters-dvd/books/starting-strength-basic-barbell-training/>.

[^6]: Legion Athletics. (2019). *These Are the Best Strength Standards on the Internet.* Retrieved from <https://legionathletics.com/strength-standards/>.

[^7]: Rippetoe, M. (2012). *The Strength Standards Tables are back up.* Starting Strength Forum. Retrieved from <https://startingstrength.com/resources/forum/mark-rippetoe-q-and-a/33029-strength-standards-tables.html>.

[^8]: Rippetoe, M. (2017). *Starting Strength: Basic Barbell Training, 3rd Edition* by Mark Rippetoe. Available at <https://www.amazon.com/Starting-Strength-Mark-Rippetoe-ebook/dp/B006XJR5ZA/ref=sr_1_1?crid=21521YXBKX4AI&dib=eyJ2IjoiMSJ9.1qTeh_8DJhqDmxrrajmjTbLiQz-wPaKOifgi787M7WurNxwUwKVbhQadJEw8Wdu2ygDP8X0D23Y5xGqEZAJ2L5luEw0gvCIIyuTmugYZ_bRmG35hqLRPlbOyQx4e5pRf2wVoGnsQPzFQEfEOlrhiHKp8BSWCuQkf2RELFKSOLQO49Fq8Y57PU_MuPm9ge91UcC3ebmCZ8pCWHmqbUURN6tyocDT-CR3ZaQAKPJXnmYPsocCP4jz4Y_QTLE1hecsk8Vuugf3HIBjBWkOVMZFmSDlS9ujVxqF4vRqC7GsVJlw._9jNplpd9L6PPFzk7WMC-nSDfWXPhmckFiiaMSwaCyo>.

[^9]: Rippetoe, M. *Practical Programming for Strength Training, 3rd Edition* by Mark Rippetoe. Available at <https://www.amazon.com/Practical-Programming-Strength-Training-Rippetoe-ebook/dp/B00IU8YETW/ref=sr_1_2?crid=21521YXBKX4AI&dib=eyJ2IjoiMSJ9.1qTeh_8DJhqDmxrrajmjTbLiQz-wPaKOifgi787M7WurNxwUwKVbhQadJEw8Wdu2ygDP8X0D23Y5xGqEZAJ2L5luEw0gvCIIyuTmugYZ_bRmG35hqLRPlbOyQx4e5pRf2wVoGnsQPzFQEfEOlrhiHKp8BSWCuQkf2RELFKSOLQO49Fq8Y57PU_MuPm9ge91UcC3ebmCZ8pCWHmqbUURN6tyocDT-CR3ZaQAKPJXnmYPsocCP4jz4Y_QTLE1hecsk8Vuugf3HIBjBWkOVMZFmSDlS9ujVxqF4vRqC7GsVJlw._9jNplpd9L6PPFzk7WMC-nSDfWXPhmckFiiaMSwaCyo>.

[^10]: Strength Level. (n.d.). *One Rep Max Calculator.* Retrieved from <https://strengthlevel.com/one-rep-max-calculator>.

[^11]: Tham, K. (2024). *Ken Does Fitness Final.xlsm.* Retrieved from <https://1drv.ms/x/s!AsYYZA9nL8lf8ybTNxlW-2c6sDXB?e=KLfhT4>.

[^12]: Flex Staff. (2023). *Wide-Grip vs. Close-Grip Barbell Curls.* Muscle & Fitness. Retrieved from <https://www.muscleandfitness.com/flexonline/training/wide-grip-vs-close-grip-barbell-curls/>.

[^13]: TuffWraps Editors. (2023). *Close-Grip vs. Wide-Grip Bicep Curls: Choosing the Right Grip for Maximum Results.* TuffWraps. Retrieved from <https://www.tuffwraps.com/blogs/news/close-grip-vs-wide-grip-bicep-curls-choosing-the-right-grip-for-maximum-results>.

[^14]: Colucci, C. (2023). *The 5×5 Workout Explained: The Classic Program for Size and Strength.* Breaking Muscle. Retrieved from <https://breakingmuscle.com/5-x-5-workout-explained/>.


