LET'S SEE HOW CONFIRMED CASES ARE DISTRIBUTED ACROSS CHINA AND OTHER PARTS OF THE WORLD OVER THE DAYS SINCE THE 100TH CASE
Let's load the data for this analysis, which comes from "ourworldindata.org".
covid19_confirmed_cases_since_100th_case<-read.csv('covid19_cases.csv')
## Now let's see the structure of the data
str(covid19_confirmed_cases_since_100th_case)
'data.frame': 13332 obs. of 6 variables:
$ X : int 1 2 3 4 5 6 7 8 9 10 ...
$ Entity : Factor w/ 222 levels "Afghanistan",..: 1 1 1 1 1 1 1 1 1 1 ...
$ Code : Factor w/ 206 levels "ABW","AFG","AGO",..: 2 2 2 2 2 2 2 2 2 2 ...
$ Date : Factor w/ 110 levels "Apr 1, 2020",..: 19 49 60 71 74 75 76 77 78 79 ...
$ X.cases. : int 0 0 0 0 0 0 0 0 0 0 ...
$ Days.since.the.100th.confirmed.case..days.: int NA NA NA NA NA NA NA NA NA NA ...
##We are interested in Entity, cases and days since the 100th case
library(dplyr)
covid_confirmed_cases_since_100th_case<-select(covid19_confirmed_cases_since_100th_case,Entity,X.cases.,Days.since.the.100th.confirmed.case..days.)
##Since the data contains missing values, let's remove all rows with NA
covid_confirmed_cases_since_100th_case<-na.omit(covid_confirmed_cases_since_100th_case)
str(covid_confirmed_cases_since_100th_case)
'data.frame': 4852 obs. of 3 variables:
$ Entity : Factor w/ 222 levels "Afghanistan",..: 1 1 1 1 1 1 1 1 1 1 ...
$ X.cases. : int 106 114 141 166 192 235 235 270 299 337 ...
$ Days.since.the.100th.confirmed.case..days.: int 0 1 2 3 4 5 6 7 8 9 ...
- attr(*, "na.action")= 'omit' Named int 1 2 3 4 5 6 7 8 9 10 ...
..- attr(*, "names")= chr "1" "2" "3" "4" ...
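The column selection and NA removal above can also be written as a single pipeline. The following is a minimal sketch, not the original workflow: `toy_cases` is a hypothetical data frame with made-up values that only reuses the CSV's column names, and the shorter names `cases` and `days_since_100th` are illustrative assumptions.

```r
library(dplyr)

# Hypothetical toy data with the same column names as the CSV (values are made up)
toy_cases <- data.frame(
  Entity = c("A", "A", "A", "B", "B"),
  X.cases. = c(0L, 106L, 114L, 50L, 120L),
  Days.since.the.100th.confirmed.case..days. = c(NA, 0L, 1L, NA, 0L)
)

# Keep the three columns of interest under shorter names,
# then drop every row that still contains an NA
toy_since_100th <- toy_cases %>%
  select(Entity,
         cases = X.cases.,
         days_since_100th = Days.since.the.100th.confirmed.case..days.) %>%
  na.omit()

str(toy_since_100th)
```

Renaming inside `select()` keeps later plotting code shorter, at the cost of no longer matching the column names in the raw file.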
##Next we look at China's distribution of cases since the 100th case
library(ggplot2)
library(ggrepel)
covid19_china_case_since_100th_case<-filter(covid_confirmed_cases_since_100th_case,Entity=='China')
ggplot(covid19_china_case_since_100th_case,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')

##Now let's see how cases are distributed per continent and across the countries that are above China in number of Covid19 cases.
covid19_Africa_cases_since_100th_case<-filter(covid_confirmed_cases_since_100th_case,Entity=='Africa')
covid19_AsiaExcl_China_cases_since_100th_cases<-filter(covid_confirmed_cases_since_100th_case,Entity=='Asia excl. China')
covid19_Europe_cases_since_100th_cases<-filter(covid_confirmed_cases_since_100th_case,Entity=='Europe')
covid19_NorthAmerica_cases_since_100th_case<-filter(covid_confirmed_cases_since_100th_case,Entity=='North America')
covid19_SouthAmerica_cases_since_100th_case<-filter(covid_confirmed_cases_since_100th_case,Entity=='South America')
covid19_Italy<-filter(covid_confirmed_cases_since_100th_case,Entity=='Italy')
covid19_Spain<-filter(covid_confirmed_cases_since_100th_case,Entity=='Spain')
covid19_UK<-filter(covid_confirmed_cases_since_100th_case,Entity=='United Kingdom')
covid19_USA<-filter(covid_confirmed_cases_since_100th_case,Entity=='United States')
covid19_France<-filter(covid_confirmed_cases_since_100th_case,Entity=='France')
##Africa cases
ggplot(covid19_Africa_cases_since_100th_case,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')

##Asia cases excl China
options(scipen = 10000)
ggplot(covid19_AsiaExcl_China_cases_since_100th_cases,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')

##Europe cases
ggplot(covid19_Europe_cases_since_100th_cases,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')

##North America cases
ggplot(covid19_NorthAmerica_cases_since_100th_case,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')

##South America cases
ggplot(covid19_SouthAmerica_cases_since_100th_case,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')
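
The five continent plots repeat the same filter-then-plot pattern, so they could also be drawn in one call with facets. This is only a sketch under stated assumptions: `toy_regions` is a hypothetical stand-in with made-up values for the cleaned data frame, and `regions` and `region_plot` are illustrative names.

```r
library(dplyr)
library(ggplot2)

# Hypothetical toy data standing in for the cleaned data frame (values are made up)
toy_regions <- data.frame(
  Entity = rep(c("Africa", "Europe"), each = 4),
  X.cases. = c(100, 150, 230, 340, 100, 300, 900, 2500),
  Days.since.the.100th.confirmed.case..days. = rep(0:3, times = 2)
)

# Entity labels used for the continent-level plots
regions <- c("Africa", "Asia excl. China", "Europe",
             "North America", "South America")

# One panel per region; free y scales because the totals differ widely
region_plot <- toy_regions %>%
  filter(Entity %in% regions) %>%
  ggplot(aes(x = Days.since.the.100th.confirmed.case..days., y = X.cases.)) +
  geom_line(color = "red") +
  facet_wrap(~ Entity, scales = "free_y") +
  labs(x = "Days since the 100th confirmed case", y = "Confirmed cases")

print(region_plot)
```

With the real data, replacing `toy_regions` by the cleaned data frame should give one panel for each of the five regions.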

##Now let's look at the top 5 nations ahead of China, which currently ranks 6th by number of cases
##United States cases
options(scipen = 10000)
ggplot(covid19_USA,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')

##Spain cases
ggplot(covid19_Spain,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')

##United Kingdom cases
ggplot(covid19_UK,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')

##France cases
ggplot(covid19_France,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')

##Italy cases
ggplot(covid19_Italy,aes(x=Days.since.the.100th.confirmed.case..days.,y=X.cases.,group=1))+geom_line(color='red')

From the first plot it is clear that China started to flatten the curve around 50 days after its 100th case.
In Africa, cases have risen to over 20k in less than 40 days since the 100th case.
In Asia excluding China, cases have gone up by more than 200k in less than 70 days since the 100th case.
In Europe, cases have gone up to 1M in less than 60 days since the 100th case.
In North America, infections are now over 600k in less than 50 days since the 100th case.
In South America, infections now stand at over 60k in less than 40 days since the 100th case.
Now Let's Look At How China Compares With Other Nations In Number Of Covid19 Infections Since The 100th Case
##United States
It has been less than 50 days since the USA recorded its 100th case of coronavirus, but the number of cases has skyrocketed to over 600k, while China's number stands at 83k more than 85 days after its 100th case.
##Spain
Less than 50 days since Spain recorded its 100th case, infections have gone up to over 150k, against China's 83k cases in over 85 days since its 100th case.
##United Kingdom
In the UK, infections have reached over 100k in less than 50 days since the 100th case, against China's 83k cases in over 85 days.
##France
Less than 50 days since France recorded its 100th case, infections have climbed to over 100k, which is more than China's cases in over 85 days.
##Italy
In Italy, less than 60 days since the 100th case, infections stand at over 170k, against China's 83k in over 85 days.
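The comparisons above read two numbers off each plot: the days elapsed since the 100th case and the latest case count. A per-entity summary table gives the same figures directly. This is a minimal sketch on a hypothetical `toy_countries` data frame with made-up values. It assumes the case counts are cumulative and never decrease, so the maximum is also the latest value.

```r
library(dplyr)

# Hypothetical toy data (values are made up, not the real counts)
toy_countries <- data.frame(
  Entity = rep(c("China", "Italy"), each = 3),
  X.cases. = c(100, 500, 900, 100, 800, 2000),
  Days.since.the.100th.confirmed.case..days. = rep(0:2, times = 2)
)

# Days elapsed and latest cumulative count per entity, largest first
latest_by_entity <- toy_countries %>%
  group_by(Entity) %>%
  summarise(
    days_since_100th = max(Days.since.the.100th.confirmed.case..days.),
    latest_cases = max(X.cases.)  # assumes cumulative, non-decreasing counts
  ) %>%
  arrange(desc(latest_cases))

print(latest_by_entity)
```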
##Conclusion
The first case of covid19 was reported in Wuhan, China in late 2019, but it was not until March that it reached global magnitude, probably two months after the first case. As per the Johns Hopkins University covid19 datasets, the number of cases globally on 1st March 2020 was 88.4k, while by 29th March the number of confirmed cases had reached over 720k, a difference of 631,600 cases in 29 days, which translates to about 21.8k new infections per day on average worldwide.

It was only after this reality check that leaders and analysts from all walks of life started to raise doubts about the actual number of cases in China. Despite each country's shortcomings in containing the coronavirus, most of the measures put in place in China have been replicated almost everywhere, yet the numbers continue to rise even further, suggesting that China may have blindfolded everyone into thinking the virus was not as infectious as it is. Cases stood above 2.2m as of 18th April 2020, up from 88.4k at the start of March, a span of less than two months. China battled the virus before it hit every corner of the world, making the coronavirus a black swan from China, with the world watching in the rear-view mirror.
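
The average daily increase quoted in the conclusion can be checked with a few lines of arithmetic. The sketch below uses only the figures given in the text; 720k is treated as exactly 720,000 even though the text says "over 720k", so the result is a lower bound under that assumption.

```r
# Figures taken from the text; 720k is rounded down from "over 720k"
cases_1_march  <- 88400
cases_29_march <- 720000
days_elapsed   <- 29  # duration used in the text

# Total increase over the period and the average per day
increase    <- cases_29_march - cases_1_march
avg_per_day <- increase / days_elapsed

increase            # 631600
round(avg_per_day)  # 21779
```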
