DATA 606 Project Presentation

Bikram Barua
12/9/2021

Research Question

Myth or Fact: Do More Women Go into Labor During a Full Moon?

Compare the birth counts on regular days with those on full moon days.
Is there any difference?
Can we draw any inference based on the data?

Null Hypothesis: A full moon has no effect on women going into labor; the mean number of births on full moon days equals the mean on regular days.

Alternative Hypothesis: A full moon induces labor; there are more births on full moon days than on regular days.

Below is the reference to an article on this topic:
Ref: https://www.dukehealth.org/blog/myth-or-fact-more-women-go-labor-during-full-moon

US Birth Data

year month date_of_month day_of_week births
2000 1 1 6 9083
2000 1 2 7 8006
2000 1 3 1 11363
2000 1 4 2 13032
2000 1 5 3 12558
2000 1 6 4 12466
2000 1 7 5 12516
2000 1 8 6 8934
2000 1 9 7 7949
2000 1 10 1 11668
2000 1 11 2 12611
2000 1 12 3 12398
2000 1 13 4 11815
2000 1 14 5 12180
2000 1 15 6 8525
2000 1 16 7 7657
2000 1 17 1 10824
2000 1 18 2 12350
2000 1 19 3 12405
2000 1 20 4 12506
2000 1 21 5 11953
2000 1 22 6 8855
2000 1 23 7 7856
2000 1 24 1 11449
2000 1 25 2 12593

Full Moon Days

Day Date
Monday 15 January 1900
Wednesday 14 February 1900
Friday 16 March 1900
Sunday 15 April 1900
Monday 14 May 1900
Wednesday 13 June 1900
Thursday 12 July 1900
Friday 10 August 1900
Sunday 9 September 1900
Monday 8 October 1900
Tuesday 6 November 1900
Thursday 6 December 1900
Saturday 5 January 1901
Sunday 3 February 1901
Tuesday 5 March 1901
Thursday 4 April 1901
Friday 3 May 1901
Sunday 2 June 1901
Tuesday 2 July 1901
Wednesday 31 July 1901
Thursday 29 August 1901
Saturday 28 September 1901
Sunday 27 October 1901
Tuesday 26 November 1901
Wednesday 25 December 1901

Format Full Moon Data

Split the date description into day of month, month, and year.

FM_Day FM_Full_Date FM_Date FM_Month FM_Year
Monday 15 January 1900 15 1 1900
Wednesday 14 February 1900 14 2 1900
Friday 16 March 1900 16 3 1900
Sunday 15 April 1900 15 4 1900
Monday 14 May 1900 14 5 1900
Wednesday 13 June 1900 13 6 1900
Thursday 12 July 1900 12 7 1900
Friday 10 August 1900 10 8 1900
Sunday 9 September 1900 9 9 1900
Monday 8 October 1900 8 10 1900
Tuesday 6 November 1900 6 11 1900
Thursday 6 December 1900 6 12 1900
Saturday 5 January 1901 5 1 1901
Sunday 3 February 1901 3 2 1901
Tuesday 5 March 1901 5 3 1901
Thursday 4 April 1901 4 4 1901
Friday 3 May 1901 3 5 1901
Sunday 2 June 1901 2 6 1901
Tuesday 2 July 1901 2 7 1901
Wednesday 31 July 1901 31 7 1901
Thursday 29 August 1901 29 8 1901
Saturday 28 September 1901 28 9 1901
Sunday 27 October 1901 27 10 1901
Tuesday 26 November 1901 26 11 1901
Wednesday 25 December 1901 25 12 1901
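
The split shown above can be sketched in base R as follows. This is a minimal illustration and not necessarily the code used for the slides: the data frame name `full_moon` is an assumption, and only the first three rows of the table are reproduced.

```r
# Hypothetical data frame holding the first three rows of the full moon table.
full_moon <- data.frame(
  FM_Day       = c("Monday", "Wednesday", "Friday"),
  FM_Full_Date = c("15 January 1900", "14 February 1900", "16 March 1900"),
  stringsAsFactors = FALSE
)

# Split "15 January 1900" into its three space-separated pieces.
parts <- strsplit(full_moon$FM_Full_Date, " ", fixed = TRUE)

full_moon$FM_Date  <- as.integer(sapply(parts, `[`, 1))         # day of month
full_moon$FM_Month <- match(sapply(parts, `[`, 2), month.name)  # month name to 1..12
full_moon$FM_Year  <- as.integer(sapply(parts, `[`, 3))         # four-digit year

print(full_moon)
```

`month.name` is a built-in R constant with the English month names, so the lookup does not depend on the system locale.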

Highest Birth Count By Year (Top 6)

Births are grouped by year, summed, and sorted in descending order.

year sum
2007 4380784
2006 4335154
2008 4310737
2005 4211941
2009 4190991
2004 4186863

The year 2007 has the highest number of births.
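
A group-sum-sort step of this kind can be sketched in base R as below. The data frame name `us_births` is an assumption, and the four rows are a tiny excerpt copied from the tables in this presentation, so the totals printed here are not the yearly totals shown above.

```r
# Four daily rows copied from the tables in this presentation (not the full data set).
us_births <- data.frame(
  year          = c(2000, 2000, 2007, 2007),
  month         = c(1, 1, 1, 2),
  date_of_month = c(1, 2, 3, 2),
  births        = c(9083, 8006, 13687, 13305)
)

# Group by year and sum the daily counts.
by_year <- aggregate(births ~ year, data = us_births, FUN = sum)
names(by_year)[2] <- "sum"

# Sort by total in descending order and keep at most the top six years.
top_years <- head(by_year[order(-by_year$sum), ], 6)
print(top_years)
```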

Mean birth count by month

month mean
1 11629.71
2 11843.46
3 11802.84
4 11432.70
5 11854.55
6 12116.20
7 12414.87
8 12785.65
9 12425.80
10 12105.55
11 11977.83
12 11619.45
[1] "Total Births(mean) in 2007 is 144008.61"
[1] "Mean Birth per day in 2007 is 12000.72"
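
The two printed figures can be checked from the monthly table. The sketch below assumes that the "Total Births(mean)" figure is the sum of the twelve monthly means and that "Mean Birth per day" is their unweighted average; both assumptions are consistent with the numbers shown. Because months differ in length, the unweighted average is slightly different from the day-weighted mean, which equals the 2007 total from the yearly table divided by 365.

```r
# Monthly means for 2007, copied from the table above (January to December).
monthly_mean <- c(11629.71, 11843.46, 11802.84, 11432.70, 11854.55, 12116.20,
                  12414.87, 12785.65, 12425.80, 12105.55, 11977.83, 11619.45)

# Days per month in 2007 (not a leap year).
days_in_month <- c(31, 28, 31, 30, 31, 30, 31, 31, 30, 31, 30, 31)

sum_of_means  <- sum(monthly_mean)   # matches the "Total Births(mean)" figure
mean_of_means <- mean(monthly_mean)  # matches the "Mean Birth per day" figure

# Day-weighted mean: every day counts equally instead of every month.
weighted_daily_mean <- weighted.mean(monthly_mean, days_in_month)

# Same quantity from the yearly total reported for 2007.
daily_mean_from_total <- 4380784 / 365

print(c(sum_of_means, mean_of_means, weighted_daily_mean, daily_mean_from_total))
```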

Full Moon Births in 2007

The table below lists births on the full moon days of 2007, the year with the highest number of births.

year month date_of_month day_of_week births FM_Day FM_Full_Date
2007 1 3 3 13687 Wednesday 3 January 2007
2007 2 2 5 13305 Friday 2 February 2007
2007 3 4 7 7566 Sunday 4 March 2007
2007 4 2 1 12450 Monday 2 April 2007
2007 5 2 3 13325 Wednesday 2 May 2007
2007 6 1 5 13714 Friday 1 June 2007
2007 6 30 6 9016 Saturday 30 June 2007
2007 7 30 1 13439 Monday 30 July 2007
2007 8 28 2 14959 Tuesday 28 August 2007
2007 9 26 3 14417 Wednesday 26 September 2007
2007 10 26 5 13223 Friday 26 October 2007
2007 11 24 6 8380 Saturday 24 November 2007
2007 12 24 1 7727 Monday 24 December 2007
[1] "Mean Birth per full moon day in 2007 is 11939.08"
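
As a check on the arithmetic, the mean can be recomputed from the table. The table has 13 rows because June 2007 contains two full moons, so the total of 155208 births is divided by 13, giving about 11939.08. This agrees with the "mean of y" reported by the t-test later in the presentation. Dividing by 12, the number of months, would instead give 12934.00. The vector name `fm_births_2007` is an assumption.

```r
# Births on the 13 full moon days of 2007, copied from the table above.
fm_births_2007 <- c(13687, 13305, 7566, 12450, 13325, 13714, 9016,
                    13439, 14959, 14417, 13223, 8380, 7727)

n_full_moons <- length(fm_births_2007)  # number of full moon days in the table
total_births <- sum(fm_births_2007)     # total births on those days

mean_per_full_moon <- total_births / n_full_moons
sprintf("Mean births per full moon day in 2007: %.2f", mean_per_full_moon)
```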

Test differences in means

One of the most common statistical tasks is to compare an outcome between two groups. Here the two groups are the monthly mean births and the births on full moon days in 2007.

month mean year date_of_month day_of_week births FM_Day FM_Full_Date
1 11629.71 2007 3 3 13687 Wednesday 3 January 2007
2 11843.46 2007 2 5 13305 Friday 2 February 2007
3 11802.84 2007 4 7 7566 Sunday 4 March 2007
4 11432.70 2007 2 1 12450 Monday 2 April 2007
5 11854.55 2007 2 3 13325 Wednesday 2 May 2007
6 12116.20 2007 1 5 13714 Friday 1 June 2007
6 12116.20 2007 30 6 9016 Saturday 30 June 2007
7 12414.87 2007 30 1 13439 Monday 30 July 2007
8 12785.65 2007 28 2 14959 Tuesday 28 August 2007
9 12425.80 2007 26 3 14417 Wednesday 26 September 2007
10 12105.55 2007 26 5 13223 Friday 26 October 2007
11 11977.83 2007 24 6 8380 Saturday 24 November 2007
12 11619.45 2007 24 1 7727 Monday 24 December 2007
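
A table like the one above can be produced by joining the monthly means to the full moon rows on `month`. The sketch below uses three months copied from the tables; the data frame names are assumptions. Because June 2007 has two full moons, the June mean appears twice after the join. The same duplication would explain why the t-test reports a "mean of x" of 12009.60 (the average over 13 rows) rather than 12000.72 (the average over 12 months).

```r
# Monthly means for three months of 2007, copied from the table above.
monthly_means <- data.frame(
  month = c(5, 6, 7),
  mean  = c(11854.55, 12116.20, 12414.87)
)

# Full moon days in those months; June 2007 has two full moons.
full_moon_births <- data.frame(
  month         = c(5, 6, 6, 7),
  date_of_month = c(2, 1, 30, 30),
  births        = c(13325, 13714, 9016, 13439)
)

# Join on month: each full moon day picks up the mean of its month,
# so the June mean is repeated in the result.
merged <- merge(monthly_means, full_moon_births, by = "month")
print(merged)
```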

T-Test Analysis

# Welch two-sample t-test: monthly mean births vs. births on full moon days
birth.t.test <- t.test(merge_by_month_data_2007$mean, merge_by_month_data_2007$births)
birth.t.test

    Welch Two Sample t-test

data:  merge_by_month_data_2007$mean and merge_by_month_data_2007$births
t = 0.093272, df = 12.465, p-value = 0.9272
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -1570.097  1711.144
sample estimates:
mean of x mean of y 
 12009.60  11939.08 
# p-value
birth.t.test$p.value
[1] 0.9271695
# confidence interval
birth.t.test$conf.int
[1] -1570.097  1711.144
attr(,"conf.level")
[1] 0.95
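
The test can be reproduced without the original data frame by typing in the two columns of the merged table. This is a sketch under that assumption; the vector names are hypothetical, and the monthly means are rounded to two decimals, so the results may differ from the output above in the last digits. The second call is a one-sided variant that matches the directional alternative hypothesis (more births on full moon days).

```r
# Monthly mean for each full moon day (June appears twice).
monthly_mean <- c(11629.71, 11843.46, 11802.84, 11432.70, 11854.55, 12116.20,
                  12116.20, 12414.87, 12785.65, 12425.80, 12105.55, 11977.83,
                  11619.45)

# Births on the matching full moon days of 2007.
fm_births <- c(13687, 13305, 7566, 12450, 13325, 13714, 9016,
               13439, 14959, 14417, 13223, 8380, 7727)

# Two-sided Welch t-test (Welch is the default because var.equal = FALSE).
res <- t.test(monthly_mean, fm_births)
print(res)

# Decision at the 5 percent significance level.
reject_null <- res$p.value < 0.05

# One-sided variant: are full moon births greater than the monthly means?
one_sided <- t.test(fm_births, monthly_mean, alternative = "greater")
print(one_sided$p.value)
```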

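The p-value of 0.927 is far above 0.05, and the 95 percent confidence interval (-1570.097, 1711.144) contains 0, so we fail to reject the null hypothesis: the 2007 data provide no evidence that more births occur on full moon days. A high p-value does not prove that no effect exists; with only 13 full moon days, the sample may be too small for the test to detect a small effect.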
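
The sample size concern can be explored with `power.t.test` from base R's stats package. The inputs below are illustrative assumptions, not values estimated in this presentation: a standard deviation of 2700 (roughly the spread of the 13 full moon day counts) and a hypothetical effect of 500 extra births per day. The function also assumes equal variance in both groups, which the Welch test above does not, so the result is only a rough guide.

```r
# Assumed inputs (illustrative only):
#   n     = 13 observations per group, as in the 2007 comparison
#   sd    = 2700, roughly the spread of the 13 full moon day counts
#   delta = 500 births per day, a hypothetical effect size
pw <- power.t.test(n = 13, delta = 500, sd = 2700, sig.level = 0.05)
print(pw)

# Sample size per group needed for 80 percent power under the same assumptions.
needed <- power.t.test(delta = 500, sd = 2700, sig.level = 0.05, power = 0.80)
print(ceiling(needed$n))
```

Under these assumptions the power with 13 observations per group is low, which is consistent with the caution about sample size.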