\(~\)

Polycystic Ovarian Syndrome (PCOS)

\(~\)

Overview:

Polycystic ovarian syndrome (PCOS) is a reproductive hormonal imbalance condition affecting as much as 5 million women in the US alone. It’s believed that genetics and environmental factors can cause PCOS that affect their body physically and emotionally along with their metabolism, overall health and appearance in women. PCOS causes problems in ovaries making it hard for women to have a healthy menstrual cycle leading to the development of cysts and infertility. Although very common in women of reproductive age, PCOS may begin shortly after puberty but can also develop during the later teenage years and early adulthood.

\(~\)

Hormones that are involved in PCOS are:

  • Androgens: aka “male hormones” are present in women with PCOS at higher levels than usual. Excess in androgens can cause symptoms such as acne, unwanted hair, thinning hair, and irregular periods.

  • Insulin: allows the body to absorb glucose (blood sugar) into the cells for energy. In PCOS, the body doesn’t respond to insulin as intended therefore, elevations in blood glucose levels can be assessed. Such elevations then lead to increased production of androgen.

  • Progesterone: vital hormone for menstruation and pregnancy; lack of progesterone contributes to irregular periods.

\(~\)

PCOS Symptoms:

Many of these symptoms can be attributed to other causes or go unnoticed but it is very common for PCOS to go undiagnosed for some time. Here are some symptoms that help with the diagnosis:

  • Irregular periods: irregular or missed periods as are a result of not ovulating is a common signs of PCOS

  • Polycystic ovaries: some may develop cysts in their ovaries but some don’t. Ovaries may be enlarged and follicles surrounding their eggs therefore failing to function regularly.

  • Excess androgen: elevated levels of male hormones can cause excess hair and acne.

\(~\)

Other symptoms may include:

  • Weight gain: many people with PCOS will have weight gain or obesity that is difficult to manage.

  • Fatigue: increase in fatigue or low energy is also common

  • Unwanted hair growth: due to excess androgen, areas such as face, arms, back, chest, hand, toes and abdomen may have excess hair growth.

  • Thinning hair on the head: hair loss may increase in middle age for those with PCOS

  • Infertility: PCOS is a leading cause for infertility but not everyone is the same.

  • Acne: due to hormonal changes, acne can be arise and make skin oilier than usual and cause breakout in the face, chest and upper back.

  • Darkening of skin: areas such as under arms, breasts or back of your neck may get dark, patchy or thicken

  • Mood changes: mood swings, depression and anxiety can increase

  • Pelvic pain: pain may occur with periods along with heavy bleeding or without bleeding

  • Headaches: can occur due to hormonal changes

  • Sleep problems: most people often suffer with problem such as insomnia or poor sleep. These arise due to many factors but a common one is having sleep apnea (sleep disorder). Even when you fall asleep you wake up as if you have not slept at all or have trouble falling asleep.

  • Depression: can arise due to symptoms that can alter your appearance and have a negative impact on your emotions.

It’s good to note that not everyone who is diagnosed with PCOS experiences all of these symptoms and should always consult with a their PCP or OBGYN to get an accurate diagnosis.

\(~\)

Load Libraries:

These are the libraries used to explore, prepare and visualize the data

library(tidyverse)
library(dplyr)
library(DataExplorer)
library(hrbrthemes)
library(ggplot2)
library(gridExtra)
library(shiny)
library(plotly)
library(shinythemes)

\(~\)

Load data:

Below is a short description of the variables of interest in the data sets:

  • The data sets I will be using are from Kaggle collected from 10 different hospital across Kerala, India.

Things to consider and understand the data set:

  • Units used are feet to cm
  • For Yes | No questions
    • Yes = 1
    • No = 0
  • Blood Groups:
    • A+ = 11
    • A- = 12
    • B+ = 13
    • B- = 14
    • O+ = 15
    • O- = 16
    • AB+ = 17
    • AB- = 18
  • RBS means Random glucose test
  • PRG means PCOS related genes
  • Blood pressure entered as systolic and diastolic separately
  • Beta-HCG cases are mentioned as Case I and II: these are pregnancy hormones
    • The detection of beta hCG (Case I) shows more accuracy in pregnancy test than the hCG (Case II) detection test.
  • AMH(ng/mL) detects ovarian reserve (egg count)
  • RR(breathes/min) is respiration rates
  • Hb(g/dl) is hemoglobin levels that measures how much hemoglobin is in your blood.
  • FSH is follicle stimulating hormone
    • FSH helps control the menstrual cycle and stimulates the growth of eggs in the ovaries. FSH levels in women change throughout the menstrual cycle, with the highest levels happening just before an egg is released by the ovary.
  • LH is luteinizing hormone (increases during ovulation)
    • Before menopause - 5 to 25 IU/L
    • Level peaks even higher around the middle of the menstrual cycle
    • Level then becomes higher after menopause - 14.2 to 52.3 IU/L
  • TSH is thyroid stimulating hormone
    • levels too high or low may be a sign of thyroid problem
  • PRL is prolactin in blood
    • causes the breasts to grow and make milk during pregnancy and after birth. Prolactin levels are normally high for pregnant women and new mothers. Levels are normally low for nonpregnant women and for men.
  • Cycle(R/I) values are as follows:
    • 4 indicates irregular menstrual cycle
    • 2 indicates a regular menstrual cycle

\(~\)

I have included the original data sets in my GitHub account and read from this location.

Lets view the pcos data:

##   Sl..No Patient.File.No. PCOS..Y.N. I...beta.HCG.mIU.mL.
## 1      1            10001          0                 1.99
## 2      2            10002          0                60.80
## 3      3            10003          1               494.08
## 4      4            10004          0                 1.99
##   II....beta.HCG.mIU.mL. AMH.ng.mL.
## 1                   1.99       2.07
## 2                   1.99       1.53
## 3                 494.08       6.63
## 4                   1.99       1.22

… And the pcos_infertility data:

##   Sl..No Patient.File.No. PCOS..Y.N. Age..yrs. Weight..Kg. Height.Cm.    BMI
## 1      1                1          0        28        44.6      152.0   19.3
## 2      2                2          0        36        65.0      161.5 #NAME?
## 3      3                3          1        33        68.8      165.0 #NAME?
## 4      4                4          0        37        65.0      148.0 #NAME?
##   Blood.Group Pulse.rate.bpm. RR..breaths.min. Hb.g.dl. Cycle.R.I.
## 1          15              78               22    10.48          2
## 2          15              74               20    11.70          2
## 3          11              72               18    11.80          2
## 4          13              72               20    12.00          2
##   Cycle.length.days. Marraige.Status..Yrs. Pregnant.Y.N. No..of.aborptions
## 1                  5                     7             0                 0
## 2                  5                    11             1                 0
## 3                  5                    10             1                 0
## 4                  5                     4             0                 0
##   I...beta.HCG.mIU.mL. II....beta.HCG.mIU.mL. FSH.mIU.mL. LH.mIU.mL. FSH.LH
## 1                 1.99                   1.99        7.95       3.68 #NAME?
## 2                60.80                   1.99        6.73       1.09 #NAME?
## 3               494.08                 494.08        5.54       0.88 #NAME?
## 4                 1.99                   1.99        8.06       2.36 #NAME?
##   Hip.inch. Waist.inch. Waist.Hip.Ratio TSH..mIU.L. AMH.ng.mL. PRL.ng.mL.
## 1        36          30          #NAME?        0.68       2.07      45.16
## 2        38          32          #NAME?        3.16       1.53      20.09
## 3        40          36          #NAME?        2.54       6.63      10.52
## 4        42          36          #NAME?       16.41       1.22      36.90
##   Vit.D3..ng.mL. PRG.ng.mL. RBS.mg.dl. Weight.gain.Y.N. hair.growth.Y.N.
## 1           17.1       0.57         92                0                0
## 2           61.3       0.97         92                0                0
## 3           49.7       0.36         84                0                0
## 4           33.4       0.36         76                0                0
##   Skin.darkening..Y.N. Hair.loss.Y.N. Pimples.Y.N. Fast.food..Y.N.
## 1                    0              0            0               1
## 2                    0              0            0               0
## 3                    0              1            1               1
## 4                    0              0            0               0
##   Reg.Exercise.Y.N. BP._Systolic..mmHg. BP._Diastolic..mmHg. Follicle.No...L.
## 1                 0                 110                   80                3
## 2                 0                 120                   70                3
## 3                 0                 120                   80               13
## 4                 0                 120                   70                2
##   Follicle.No...R. Avg..F.size..L...mm. Avg..F.size..R...mm. Endometrium..mm. X
## 1                3                   18                   18              8.5  
## 2                5                   15                   14              3.7  
## 3               15                   18                   20             10.0  
## 4                2                   15                   14              7.5

\(~\)

Data Exploration:

The first data set pcos includes 541 observations and a total of 6 variables. The second data set pcos_infertility includes 541 observations and a total of 45 variables. Notice that the column names are not clear enough for readers, this will be tackled in the data preparation section.

\(~\)

The pcos data:

## Rows: 541
## Columns: 6
## $ Sl..No                 <int> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, …
## $ Patient.File.No.       <int> 10001, 10002, 10003, 10004, 10005, 10006, 10007…
## $ PCOS..Y.N.             <int> 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0,…
## $ I...beta.HCG.mIU.mL.   <dbl> 1.99, 60.80, 494.08, 1.99, 801.45, 237.97, 1.99…
## $ II....beta.HCG.mIU.mL. <dbl> 1.99, 1.99, 494.08, 1.99, 801.45, 1.99, 1.99, 1…
## $ AMH.ng.mL.             <chr> "2.07", "1.53", "6.63", "1.22", "2.26", "6.74",…

\(~\)

The pcos_infertility data:

## Rows: 541
## Columns: 45
## $ Sl..No                 <int> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, …
## $ Patient.File.No.       <int> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, …
## $ PCOS..Y.N.             <int> 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0,…
## $ Age..yrs.              <int> 28, 36, 33, 37, 25, 36, 34, 33, 32, 36, 20, 26,…
## $ Weight..Kg.            <dbl> 44.6, 65.0, 68.8, 65.0, 52.0, 74.1, 64.0, 58.5,…
## $ Height.Cm.             <dbl> 152.0, 161.5, 165.0, 148.0, 161.0, 165.0, 156.0…
## $ BMI                    <chr> "19.3", "#NAME?", "#NAME?", "#NAME?", "#NAME?",…
## $ Blood.Group            <int> 15, 15, 11, 13, 11, 15, 11, 13, 11, 15, 15, 13,…
## $ Pulse.rate.bpm.        <int> 78, 74, 72, 72, 72, 78, 72, 72, 72, 80, 80, 72,…
## $ RR..breaths.min.       <int> 22, 20, 18, 20, 18, 28, 18, 20, 18, 20, 20, 20,…
## $ Hb.g.dl.               <dbl> 10.48, 11.70, 11.80, 12.00, 10.00, 11.20, 10.90…
## $ Cycle.R.I.             <int> 2, 2, 2, 2, 2, 2, 2, 2, 2, 4, 2, 2, 4, 2, 2, 2,…
## $ Cycle.length.days.     <int> 5, 5, 5, 5, 5, 5, 5, 5, 5, 2, 5, 5, 2, 5, 5, 5,…
## $ Marraige.Status..Yrs.  <dbl> 7, 11, 10, 4, 1, 8, 2, 13, 8, 4, 4, 3, 7, 15, 9…
## $ Pregnant.Y.N.          <int> 0, 1, 1, 0, 1, 1, 0, 1, 0, 0, 1, 0, 1, 0, 0, 0,…
## $ No..of.aborptions      <int> 0, 0, 0, 0, 0, 0, 0, 2, 1, 0, 2, 1, 0, 0, 0, 0,…
## $ I...beta.HCG.mIU.mL.   <dbl> 1.99, 60.80, 494.08, 1.99, 801.45, 237.97, 1.99…
## $ II....beta.HCG.mIU.mL. <chr> "1.99", "1.99", "494.08", "1.99", "801.45", "1.…
## $ FSH.mIU.mL.            <dbl> 7.95, 6.73, 5.54, 8.06, 3.98, 3.24, 2.85, 4.86,…
## $ LH.mIU.mL.             <dbl> 3.68, 1.09, 0.88, 2.36, 0.90, 1.07, 0.31, 3.07,…
## $ FSH.LH                 <chr> "#NAME?", "#NAME?", "#NAME?", "#NAME?", "#NAME?…
## $ Hip.inch.              <int> 36, 38, 40, 42, 37, 44, 39, 44, 39, 40, 39, 39,…
## $ Waist.inch.            <int> 30, 32, 36, 36, 30, 38, 33, 38, 35, 38, 35, 33,…
## $ Waist.Hip.Ratio        <chr> "#NAME?", "#NAME?", "#NAME?", "#NAME?", "#NAME?…
## $ TSH..mIU.L.            <dbl> 0.68, 3.16, 2.54, 16.41, 3.57, 1.60, 1.51, 12.1…
## $ AMH.ng.mL.             <chr> "2.07", "1.53", "6.63", "1.22", "2.26", "6.74",…
## $ PRL.ng.mL.             <dbl> 45.16, 20.09, 10.52, 36.90, 30.09, 16.18, 26.41…
## $ Vit.D3..ng.mL.         <dbl> 17.10, 61.30, 49.70, 33.40, 43.80, 52.40, 42.70…
## $ PRG.ng.mL.             <dbl> 0.57, 0.97, 0.36, 0.36, 0.38, 0.30, 0.46, 0.26,…
## $ RBS.mg.dl.             <dbl> 92, 92, 84, 76, 84, 76, 93, 91, 116, 125, 108, …
## $ Weight.gain.Y.N.       <int> 0, 0, 0, 0, 0, 1, 0, 1, 0, 0, 0, 0, 1, 0, 0, 1,…
## $ hair.growth.Y.N.       <int> 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0,…
## $ Skin.darkening..Y.N.   <int> 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0,…
## $ Hair.loss.Y.N.         <int> 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 0, 0, 1, 0, 1, 0,…
## $ Pimples.Y.N.           <int> 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 1, 0,…
## $ Fast.food..Y.N.        <int> 1, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 1, 0,…
## $ Reg.Exercise.Y.N.      <int> 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0,…
## $ BP._Systolic..mmHg.    <int> 110, 120, 120, 120, 120, 110, 120, 120, 120, 11…
## $ BP._Diastolic..mmHg.   <int> 80, 70, 80, 70, 80, 70, 80, 80, 80, 80, 80, 80,…
## $ Follicle.No...L.       <int> 3, 3, 13, 2, 3, 9, 6, 7, 5, 1, 7, 4, 15, 3, 4, …
## $ Follicle.No...R.       <int> 3, 5, 15, 2, 4, 6, 6, 6, 7, 1, 15, 2, 8, 3, 1, …
## $ Avg..F.size..L...mm.   <dbl> 18, 15, 18, 15, 16, 16, 15, 15, 17, 14, 17, 18,…
## $ Avg..F.size..R...mm.   <dbl> 18, 14, 20, 14, 14, 20, 16, 18, 17, 17, 20, 19,…
## $ Endometrium..mm.       <dbl> 8.5, 3.7, 10.0, 7.5, 7.0, 8.0, 6.8, 7.1, 4.2, 2…
## $ X                      <chr> "", "", "", "", "", "", "", "", "", "", "", "",…

\(~\)

Let’s look at the distribution of our data sets using histograms:

pcos data:

\(~\)

pcos_infertility data:

## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

\(~\)

Data Preparation:

Before I start the cleaning process I converted the data sets as numeric since the class of some of the variables differ.

\(~\)

Lets look at the column names of both data sets to be able to start tidying the data:

pcos:

## [1] "Sl..No"                 "Patient.File.No."       "PCOS..Y.N."            
## [4] "I...beta.HCG.mIU.mL."   "II....beta.HCG.mIU.mL." "AMH.ng.mL."

\(~\)

pcos_infertility:

##  [1] "Sl..No"                 "Patient.File.No."       "PCOS..Y.N."            
##  [4] "Age..yrs."              "Weight..Kg."            "Height.Cm."            
##  [7] "BMI"                    "Blood.Group"            "Pulse.rate.bpm."       
## [10] "RR..breaths.min."       "Hb.g.dl."               "Cycle.R.I."            
## [13] "Cycle.length.days."     "Marraige.Status..Yrs."  "Pregnant.Y.N."         
## [16] "No..of.aborptions"      "I...beta.HCG.mIU.mL."   "II....beta.HCG.mIU.mL."
## [19] "FSH.mIU.mL."            "LH.mIU.mL."             "FSH.LH"                
## [22] "Hip.inch."              "Waist.inch."            "Waist.Hip.Ratio"       
## [25] "TSH..mIU.L."            "AMH.ng.mL."             "PRL.ng.mL."            
## [28] "Vit.D3..ng.mL."         "PRG.ng.mL."             "RBS.mg.dl."            
## [31] "Weight.gain.Y.N."       "hair.growth.Y.N."       "Skin.darkening..Y.N."  
## [34] "Hair.loss.Y.N."         "Pimples.Y.N."           "Fast.food..Y.N."       
## [37] "Reg.Exercise.Y.N."      "BP._Systolic..mmHg."    "BP._Diastolic..mmHg."  
## [40] "Follicle.No...L."       "Follicle.No...R."       "Avg..F.size..L...mm."  
## [43] "Avg..F.size..R...mm."   "Endometrium..mm."       "X"

\(~\)

As mentioned earlier, my data preparation will mainly consist of renaming columns, targeting missing values whether it is by replacing them with a mean, median or mode for Marriage Status (Yrs) and Fast Food (Y/N) or calculating the values as a whole as its the case of BMI, FSH.LH and Waist.Hip.Ratio. I’ll also be converting certain columns to metric, remove columns that are duplicate or not need so that it’s easier to manage when I merge my two data sets.

\(~\)

Here’s the results after part 1 of the preparation process:

##   Sl.No PCOS Ibeta-HCG(mIU/mL) IIbeta-HCG(mIU/mL) AMH(ng/mL) Age(yrs)
## 1     1    0              1.99               1.99       2.07       28
## 2     2    0             60.80               1.99       1.53       36
## 3     3    1            494.08             494.08       6.63       33
## 4     4    0              1.99               1.99       1.22       37
## 5     5    0            801.45             801.45       2.26       25
## 6     6    0            237.97               1.99       6.74       36
##   Weight(kg) Height(cm)  BMI Blood_Group Pulse rate(bpm) RR(breaths/min)
## 1       44.6      152.0 19.3          15              78              22
## 2       65.0      161.5   NA          15              74              20
## 3       68.8      165.0   NA          11              72              18
## 4       65.0      148.0   NA          13              72              20
## 5       52.0      161.0   NA          11              72              18
## 6       74.1      165.0   NA          15              78              28
##   Hb(g/dl) Cycle(R/I) Cycle_length(days) Marriage_Status(yrs) Pregnant
## 1    10.48          2                  5                    7        0
## 2    11.70          2                  5                   11        1
## 3    11.80          2                  5                   10        1
## 4    12.00          2                  5                    4        0
## 5    10.00          2                  5                    1        1
## 6    11.20          2                  5                    8        1
##   No_of_abortions FSH(mIU/mL) LH(mIU/mL) FSH-LH Hip(in) Waist(in)
## 1               0        7.95       3.68     NA      36        30
## 2               0        6.73       1.09     NA      38        32
## 3               0        5.54       0.88     NA      40        36
## 4               0        8.06       2.36     NA      42        36
## 5               0        3.98       0.90     NA      37        30
## 6               0        3.24       1.07     NA      44        38
##   Waist-Hip_Ratio TSH (mIU/L) PRL(ng/mL) Vit_D3(ng/mL) PRG(ng/mL) RBS(mg/dl)
## 1              NA        0.68      45.16          17.1       0.57         92
## 2              NA        3.16      20.09          61.3       0.97         92
## 3              NA        2.54      10.52          49.7       0.36         84
## 4              NA       16.41      36.90          33.4       0.36         76
## 5              NA        3.57      30.09          43.8       0.38         84
## 6              NA        1.60      16.18          52.4       0.30         76
##   Weight_gain hair_growth Skin_darkening Hair_loss Pimples Fast_food
## 1           0           0              0         0       0         1
## 2           0           0              0         0       0         0
## 3           0           0              0         1       1         1
## 4           0           0              0         0       0         0
## 5           0           0              0         1       0         0
## 6           1           0              0         1       0         0
##   Reg_Exercise BP_Systolic(mmHg) BP_Diastolic(mmHg) Follicle_No(L)
## 1            0               110                 80              3
## 2            0               120                 70              3
## 3            0               120                 80             13
## 4            0               120                 70              2
## 5            0               120                 80              3
## 6            0               110                 70              9
##   Follicle_No(R) Avg_F_size(L)(mm) Avg_F_size(R)(mm) Endometrium(mm)
## 1              3                18                18             8.5
## 2              5                15                14             3.7
## 3             15                18                20            10.0
## 4              2                15                14             7.5
## 5              4                16                14             7.0
## 6              6                16                20             8.0

\(~\)

Checking for missing data:

##                Sl.No                 PCOS    Ibeta-HCG(mIU/mL) 
##                    0                    0                    0 
##   IIbeta-HCG(mIU/mL)           AMH(ng/mL)             Age(yrs) 
##                    0                    1                    0 
##           Weight(kg)           Height(cm)                  BMI 
##                    0                    0                  299 
##          Blood_Group      Pulse rate(bpm)      RR(breaths/min) 
##                    0                    0                    0 
##             Hb(g/dl)           Cycle(R/I)   Cycle_length(days) 
##                    0                    0                    0 
## Marriage_Status(yrs)             Pregnant      No_of_abortions 
##                    1                    0                    0 
##          FSH(mIU/mL)           LH(mIU/mL)               FSH-LH 
##                    0                    0                  532 
##              Hip(in)            Waist(in)      Waist-Hip_Ratio 
##                    0                    0                  532 
##          TSH (mIU/L)           PRL(ng/mL)        Vit_D3(ng/mL) 
##                    0                    0                    0 
##           PRG(ng/mL)           RBS(mg/dl)          Weight_gain 
##                    0                    0                    0 
##          hair_growth       Skin_darkening            Hair_loss 
##                    0                    0                    0 
##              Pimples            Fast_food         Reg_Exercise 
##                    0                    1                    0 
##    BP_Systolic(mmHg)   BP_Diastolic(mmHg)       Follicle_No(L) 
##                    0                    0                    0 
##       Follicle_No(R)    Avg_F_size(L)(mm)    Avg_F_size(R)(mm) 
##                    0                    0                    0 
##      Endometrium(mm) 
##                    0

\(~\)

For part 2, I tackled the missing values by first transforming Height from cm to m, calculate BMI, Waist-hip ratio and FSH/LH. Then after careful consideration to the data, I decided to use the median number to replace the missing values in Marriage_Status(yrs), AMH(ng/mL) and Fast_food since it didn’t disrupt the data’s distribution.

\(~\)

After tidying up the data, below is the final distribution:

## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

\(~\)

Plots:

Now that the data is ready, I have created some plots to better understand the relationship of each variable.

  • It appears that the age and weight of the women are distributed evenly throughout as opposed to being clustered in a certain age and weight group.

  • Hip to waist counts increase for those with and without PCOS.

  • There’s a steady relationship between random glucose and waist-hip ratio but I don’t think this is valuable information.

\(~\)

Below I have statistics based on yes / no variables we were given. Out of 541 women:

  • 32.72% reported to have PCOS

  • 38.08% of women reported to being pregnant

  • 37.71% reported to experience weight gain

  • 27.36% reported to experience hair growth

  • 30.68% experience skin darkening

  • 45.29% reported to experience hair loss

  • 48.98% reported to experience pimples

  • 51.57% reported to have fast food

  • 24.77% reported to exercise regularly

\(~\)

Based on these next few plots:

  • Majority of the women in this study have the blood type of O+, B+ and A+.

  • There is a wide range of distribution between the age of these women most falling between 23 to 38 years old.

  • Similarly, weight also has a wide range of distribution.

  • Most of the women are between 1.5 - 1.6 meters tall (4’9” - 5’2”).

  • The distribution of years married also varies where the majority of the women fall between the first 10 years.

  • I looked into PCOS related genes … but can’t make much sense on this variable.

\(~\)

Below are scatterplots to show the relationship between a couple of variables and the women’s Age(yrs):

  • Cycle length seems to be very consistent regardless of PCOS diagnosis

  • Women with PCOS (blue) experience more irregular periods with a few with no PCOS(coral) do as well

  • With or without PCOS women’s BMI fluctuate

  • The distribution of number of follicles is greater for those women without PCOS.

  • The distribution of number of size in follicles left or right differ for those with and without PCOS

  • The endometrium is the inner layer. During a woman’s menstrual cycle, hormones cause the endometrium to change. Estrogen causes the endometrium to thicken so that it could nourish an embryo if pregnancy occurs. In the plot below you notice the difference in endometrium size for women with PCOS (lower) than with those without PCOS (slightly higher).

\(~\)

Blood tests are very important to notice abnormalities to detect PCOS, below my findings after examining these variables:

  • Pregnancy hormones are higher with those who reported to have PCOS than with those who reported to not have PCOS.

  • Vitamin D3 levels are spread out for those who reported with or without PCOS.

  • FSH/LH levels are also spread out for those who reported with or without PCOS.

  • Blood pressure levels seem a little off but are all in the high range.

  • Pulse rate (normal 60-100 bpm) and respiration rate (normal 12-20 breathes/min) seems normal for those who reported to be with or without PCOS.

  • The egg count and follicle growth are clustered for women who reported to have PCOS and those who don’t.

  • TSH levels are also spread out for women who reported to have PCOS and those who don’t.

  • Hemoglobin levels seem to be higher for those who reported to have PCOS.

  • PRL levels (prolactin in the blood) are consistent at 1 or 2 for those with or without PCOS.

  • RBS (random glucose) is fairly distributed with those with and without PCOS.

\(~\)

The plots below were created to visualize the percentage of women with PCOS who experience symptoms:

  • Women with blood type of A+, B+, and O+ reported to have PCOS more than those with blood types of A-, B-, O-, AB+ and AB-

  • Being that PCOS and infertility are linked, there are 11.83% of women who reported to being pregnant with PCOS and 20.89% who reported not being pregnant with PCOS.

  • There are 22.37% of women who reported to experience weight gain with PCOS and 23.24% reported to not experience weight gain with PCOS.

  • There are 18.67% of women who reported to experience hair growth with PCOS and 14.05% reported to not experience hair growth with PCOS.

  • There are 20.33% of women who reported to experience skin darkening with PCOS and 12.38% reported to not experience skin darkening with PCOS.

  • There are 18.85% of women who reported to experience hair loss with PCOS and 13.86% reported to not experience hair loss with PCOS.

  • There are 22.74% of women who reported to experience pimples with PCOS and 9.98% reported to not experience pimples with PCOS.

  • There are 25.13% of women who reported to consume fast food with PCOS and 7.02% reported to not consume fast food with PCOS.

  • There are 9.43% of women who reported to regularly exercise with PCOS and 23.29% reported to not regularly exercise with PCOS.

  • Not surprised to see how fluctuated the cycle length is especially for women without PCOS since they tend to have a more regular cycle.

  • Sadly, abortions also plays a part in PCOS for women trying to conceive. I am however happy to see that the number isn’t as high for women with PCOS

\(~\)

Conclusion:

Being diagnosed almost 2 years ago with PCOS I was intrigued to learn more and explore the data that was available. Although not much data was easily accessible this is a start to explore differences or similarities women share with their physical aspect and blood work. It’s also important to be aware of symptoms experienced by PCOS because it’s such symptoms that can always be misinterpreted as “too much stress” or “lose weight” by doctors. Essentially we could use these commonalities to help women all over learn to distinguish symptoms not only by the physical aspects of PCOS but also during a routine lab work. After advocating for myself with multiple doctors over the span of 6 years I saw the true value in listening to your body and sharing my experience with others. I hope this project is able to make women aware of what to look out for, what kind of questions to ask, and to be persistent with their health because who knows our bodies better than ourselves.

\(~\)

References: