Set working directory
list.files()
[1] "gapminder_aids.nb.html" "gapminder_aids.R" "gapminder_aids.Rmd" "hiv_prev.csv" "lesson3_student.rmd"
[6] "pseudo_facebook.tsv" "rsconnect"
Steps:
- Read data from csv into a dataframe
- Look at the data’s rows and values
- Put the original dataframe into a holder variable
Reasoning:
- Get a sense of what to expect
- Determine what kind of questions need to be asked
- Determine what kind of problems may arise
- Save the data in it’s original form
Load data from csv file
Steps:
- Install tidyr package
- Get row names for the years
- Collapse original dataframe into “years” and “cases” columns
Reasoning:
- Resulting dataframe is more intuituve
Summary of HIV prevalence in 1999
summary(hp_1999)
country year prevalence
Algeria : 1 Length:146 Min. : 0.060
Angola : 1 Class :character 1st Qu.: 0.100
Argentina: 1 Mode :character Median : 0.300
Armenia : 1 Mean : 2.067
Australia: 1 3rd Qu.: 1.475
Austria : 1 Max. :25.700
(Other) :140
max_row$country
[1] Zimbabwe
275 Levels: \x81land Abkhazia Afghanistan Akrotiri and Dhekelia Albania Algeria American Samoa Andorra Angola Anguilla ... Zimbabwe
Notes:
Highest HIV rate is nearly over a quarter of the population. The average HIV rate globally was measured at around 2%. 34 countries, including Algeria, Bangladesh, and Egypt reported an HIV prevalence of 0.06. Zimbabwe had the highest prevalence of HIV (25.7).
I have a vague distrust of the data, however, when I consider how culture and social norms may play into reporting. In the US, for example, gay and bisexual people make up only 2% of the population but 55% of PLWH (people living with HIV). In more conservative countries where homosexuality is supressed and hidden, I’m afraid that some men who are gay and bisexual are less likely to come out and make their conditions known or that officials might be understating HIV prevalence in order to hide poor leadership or downplay the issue. Some countries may also overstate their HIV prevalence in order to attract more international funding. The possibility of such distortions must be taken into account.
Summary of HIV prevalence in 1999

Notes:
Data points for the various countries are shown. Vast majority of countries have HIV prevalance below 2%, there are quite a few that are above 2% and some that are extremely high.
Make a boxplot of the data

Notes:
The IQR for HIV prevalence appears to fall between .1 and 1.5 with a median at .3
Separate the values above and below 2% prevalence
Notes:
Highest prevalence seems to situated in Sub-Saharan Africa and countries with a sizable diaspora from Sub-Saharan Africa. Researchers have shown that a genetic variation called Duffy Antigen Receptor for Chemokines (DARC) which was evolved as a protection against a now extinct form of malaria has led to a greater susceptibility to HIV. DARC is present in an overwhelming percentage of people with African descent making them more susceptible (but also more resilient) to HIV. This increased susceptibility, in concert with behavioral factors, helps to explain the high prevalence of HIV in geographically dispersed countries like Botswana and Haiti.
Has prevalence diminished over time?

Notes:
In Zimbabwe, consistently the hardest hit country, it looks like HIV prevalence increased throughout the 1980s and 90s and began to level-off after 1997. Prevalence seems to have started decreasing in the 2000s as testing and treatment were made more accessible. The most significant cause of the decrease in the case of Zimbabwe, however, was the reduction of the number of secual partners by 30% for men (UNAIDS, 2011). This reductions in new cases of HIV spells good news for future reduction efforts because it shows that the issue, despite whatever negative predispositions may be at play, can be contained and current cases can be treated with increasing effectiveness.
LS0tCnRpdGxlOiAiR2xvYmFsIEhJViBQcmV2YWxlbmNlIgpvdXRwdXQ6IGh0bWxfbm90ZWJvb2sKLS0tCgojIyMgU2V0IHdvcmtpbmcgZGlyZWN0b3J5CmBgYHtyfQpzZXR3ZCgiL1VzZXJzL2Rtb3Rvbi9kYW5kL29uZS12YXIiKQpsaXN0LmZpbGVzKCkKYGBgCgojIyMjI1N0ZXBzOgoqIFJlYWQgZGF0YSBmcm9tIGNzdiBpbnRvIGEgZGF0YWZyYW1lCiogTG9vayBhdCB0aGUgZGF0YSdzIHJvd3MgYW5kIHZhbHVlcwoqIFB1dCB0aGUgb3JpZ2luYWwgZGF0YWZyYW1lIGludG8gYSBob2xkZXIgdmFyaWFibGUKCiMjIyMjUmVhc29uaW5nOgoqIEdldCBhIHNlbnNlIG9mIHdoYXQgdG8gZXhwZWN0CiogRGV0ZXJtaW5lIHdoYXQga2luZCBvZiBxdWVzdGlvbnMgbmVlZCB0byBiZSBhc2tlZAoqIERldGVybWluZSB3aGF0IGtpbmQgb2YgcHJvYmxlbXMgbWF5IGFyaXNlCiogU2F2ZSB0aGUgZGF0YSBpbiBpdCdzIG9yaWdpbmFsIGZvcm0KCioqKgoKIyMjIExvYWQgZGF0YSBmcm9tIGNzdiBmaWxlCmBgYHtyfQpocCA8LSByZWFkLmNzdignaGl2X3ByZXYuY3N2JykKc3RyKGhwKQpuYW1lcyhocCkKaHAKaHBfb3JpZyA8LSBocApgYGAKCiMjIyMjU3RlcHM6CiogSW5zdGFsbCB0aWR5ciBwYWNrYWdlCiogR2V0IHJvdyBuYW1lcyBmb3IgdGhlIHllYXJzCiogQ29sbGFwc2Ugb3JpZ2luYWwgZGF0YWZyYW1lIGludG8gInllYXJzIiBhbmQgImNhc2VzIiBjb2x1bW5zCgojIyMjI1JlYXNvbmluZzoKKiBSZXN1bHRpbmcgZGF0YWZyYW1lIGlzIG1vcmUgaW50dWl0dXZlCgoqKioKCiMjIyBDbGVhbiB0aGUgZGF0YQpgYGB7cn0KaW5zdGFsbC5wYWNrYWdlcygidGlkeXIiKQpsaWJyYXJ5KCd0aWR5cicpCgojIGV4dHJhY3QgdGhlIHJvdyBuYW1lcyAoeWVhcnMpCnJvd19uYW1lcyA8LSBuYW1lcyhocFstKDA6MSldKQpoZWFkKGhwKQoKIyBDb2xsYXBzZSB0aGUgZGYgaW50byB5ZWFycyBhbmQgcHJldmFsZW5jZSB1c2luZyB0aWR5cgpocCA8LSBocF9vcmlnICU+JQogIGdhdGhlcihyb3dfbmFtZXMsIGtleSA9ICJ5ZWFyIiwgdmFsdWUgPSAicHJldmFsZW5jZSIpCmhlYWQoaHApCgojIHJlbmFtZSBmaXJzdCBjb2x1bW4gdG8gc29tZXRoaW5nIHNpbXBsZXIKY29sbmFtZXMoaHApWzFdIDwtICJjb3VudHJ5IgpoZWFkKGhwKQoKIyByZW1vdmUgdGhlIFggZnJvbSB0aGUgcm93IG5hbWVzCmhwJHllYXIgPC0gZ3N1YignW1hdJywgJycsIGhwJHllYXIpCmhlYWQoaHApICMgbG9va3MgZ29vZAoKIyByZW1vdmUgdGhlIG1pc3NpbmcgdmFsdWVzIC0gdGhleSdyZSBub3QgcmVhbGx5IGNvbnRyaWJ1dGluZwpocCA8LSBzdWJzZXQoaHAsICFpcy5uYShocCRwcmV2YWxlbmNlKSkKCmhwCmBgYAoKCiMjIyBTdW1tYXJ5IG9mIEhJViBwcmV2YWxlbmNlIGluIDE5OTkKIApgYGB7cn0KIyBnZXQgSElWIHByZXZhbGVuY2UgZm9yIDE5OTkgLS0gbG9vayBhdCBzdW1tYXJ5IHN0YXRzCmhwXzE5OTkgPC0gc3Vic2V0KGhwLCBocCR5ZWFyID09ICcxOTk5JykKc3VtbWFyeShocF8xOTk5KQpgYGAKCmBgYHtyfQojIGNvdW50cmllcyB3aXRoIHRoZSBsb3dlc3QgSElWIHByZXZhbGVuY2UKc3Vic2V0KGhwXzE5OTksIHByZXZhbGVuY2UgPT0gMC4wNikKYGBgCgpgYGB7cn0KIyBjb3VudHJpZXMgd2l0aCB0aGUgaGlnaGVzdCBISVYgcHJldmFsZW5jZQpzdWJzZXQoaHBfMTk5OSwgcHJldmFsZW5jZSA9PSAyNS43KQpgYGAKCiMjIyMjTm90ZXM6IApIaWdoZXN0IEhJViByYXRlIGlzIG5lYXJseSBvdmVyIGEgcXVhcnRlciBvZiB0aGUgcG9wdWxhdGlvbi4gVGhlIGF2ZXJhZ2UgSElWIHJhdGUgZ2xvYmFsbHkgd2FzIG1lYXN1cmVkIGF0IGFyb3VuZCAyJS4gMzQgY291bnRyaWVzLCBpbmNsdWRpbmcgQWxnZXJpYSwgQmFuZ2xhZGVzaCwgYW5kIEVneXB0IHJlcG9ydGVkIGFuIEhJViBwcmV2YWxlbmNlIG9mIDAuMDYuIFppbWJhYndlIGhhZCB0aGUgaGlnaGVzdCBwcmV2YWxlbmNlIG9mIEhJViAoMjUuNykuIAoKSSBoYXZlIGEgdmFndWUgZGlzdHJ1c3Qgb2YgdGhlIGRhdGEsIGhvd2V2ZXIsIHdoZW4gSSBjb25zaWRlciBob3cgY3VsdHVyZSBhbmQgc29jaWFsIG5vcm1zIG1heSBwbGF5IGludG8gcmVwb3J0aW5nLiBJbiB0aGUgVVMsIGZvciBleGFtcGxlLCBnYXkgYW5kIGJpc2V4dWFsIHBlb3BsZSBtYWtlIHVwIG9ubHkgMiUgb2YgdGhlIHBvcHVsYXRpb24gYnV0IDU1JSBvZiBQTFdIIChwZW9wbGUgbGl2aW5nIHdpdGggSElWKS4gSW4gbW9yZSBjb25zZXJ2YXRpdmUgY291bnRyaWVzIHdoZXJlIGhvbW9zZXh1YWxpdHkgaXMgc3VwcmVzc2VkIGFuZCBoaWRkZW4sIEknbSBhZnJhaWQgdGhhdCBzb21lIG1lbiB3aG8gYXJlIGdheSBhbmQgYmlzZXh1YWwgYXJlIGxlc3MgbGlrZWx5IHRvIGNvbWUgb3V0IGFuZCBtYWtlIHRoZWlyIGNvbmRpdGlvbnMga25vd24gb3IgdGhhdCBvZmZpY2lhbHMgbWlnaHQgYmUgdW5kZXJzdGF0aW5nIEhJViBwcmV2YWxlbmNlIGluIG9yZGVyIHRvIGhpZGUgcG9vciBsZWFkZXJzaGlwIG9yIGRvd25wbGF5IHRoZSBpc3N1ZS4gU29tZSBjb3VudHJpZXMgbWF5IGFsc28gb3ZlcnN0YXRlIHRoZWlyIEhJViBwcmV2YWxlbmNlIGluIG9yZGVyIHRvIGF0dHJhY3QgbW9yZSBpbnRlcm5hdGlvbmFsIGZ1bmRpbmcuIFRoZSBwb3NzaWJpbGl0eSBvZiBzdWNoIGRpc3RvcnRpb25zIG11c3QgYmUgdGFrZW4gaW50byBhY2NvdW50LgoKIyMjIyNSZXNvdXJjZXM6IAoqIGh0dHBzOi8vd3d3LmhyYy5vcmcvcmVzb3VyY2VzL2hyYy1pc3N1ZS1icmllZi1oaXYtYWlkcy1hbmQtdGhlLWxnYnQtY29tbXVuaXR5CiogaHR0cDovL2ZpbGVzLnVuYWlkcy5vcmcvZW4vbWVkaWEvdW5haWRzL2NvbnRlbnRhc3NldHMvZG9jdW1lbnRzL3VuYWlkc3B1YmxpY2F0aW9uLzIwMDQvR0FSMjAwNF9lbi5wZGYKCioqKgoKIyMjIFN1bW1hcnkgb2YgSElWIHByZXZhbGVuY2UgaW4gMTk5OQoKYGBge3J9CmxpYnJhcnkoJ2dncGxvdDInKQpnZ3Bsb3QoYWVzKHggPSBjb3VudHJ5LCB5ID0gcHJldmFsZW5jZSksIGRhdGEgPSBzdWJzZXQoaHBfMTk5OSwgIWlzLm5hKHByZXZhbGVuY2UpKSkgKyAKICBnZW9tX3BvaW50KCkgKyAKICBzY2FsZV95X2NvbnRpbnVvdXMoYnJlYWtzID0gc2VxKDAsMzAsMSkpICsKICBnZW9tX2xpbmUoc3RhdCA9ICJzdW1tYXJ5IiwgZnVuLnkgPSAyLjA2NywgY29sb3IgPSAicmVkIikgKwogIHRoZW1lKGF4aXMudGV4dC54ID0gZWxlbWVudF9ibGFuaygpLAogICAgICAgIGF4aXMudGlja3MueCA9IGVsZW1lbnRfYmxhbmsoKSkKYGBgCiMjIyMjTm90ZXM6IApEYXRhIHBvaW50cyBmb3IgdGhlIHZhcmlvdXMgY291bnRyaWVzIGFyZSBzaG93bi4gVmFzdCBtYWpvcml0eSBvZiBjb3VudHJpZXMgaGF2ZSBISVYgcHJldmFsYW5jZSBiZWxvdyAyJSwgdGhlcmUgYXJlIHF1aXRlIGEgZmV3IHRoYXQgYXJlIGFib3ZlIDIlIGFuZCBzb21lIHRoYXQgYXJlIGV4dHJlbWVseSBoaWdoLgoKIyMjIE1ha2UgYSBib3hwbG90IG9mIHRoZSBkYXRhCmBgYHtyfQpxcGxvdCh4ID0geWVhciwgeSA9IHByZXZhbGVuY2UsIGRhdGEgPSBzdWJzZXQoaHBfMTk5OSwgIWlzLm5hKHByZXZhbGVuY2UpKSwgZ2VvbSA9ICJib3hwbG90IikgKyBjb29yZF9jYXJ0ZXNpYW4oeWxpbSA9IGMoMCwgNSkpCmBgYAojIyMjI05vdGVzOiAKVGhlIElRUiBmb3IgSElWIHByZXZhbGVuY2UgYXBwZWFycyB0byBmYWxsIGJldHdlZW4gLjEgYW5kIDEuNSB3aXRoIGEgbWVkaWFuIGF0IC4zCgoqKioKCiMjIyBTZXBhcmF0ZSB0aGUgdmFsdWVzIGFib3ZlIGFuZCBiZWxvdyAyJSBwcmV2YWxlbmNlCmBgYHtyfQpoaWdoX3ByZXYgPC0gc3Vic2V0KGhwXzE5OTksIGhwXzE5OTkkcHJldmFsZW5jZSA+IDIuMCkKaGlnaF9wcmV2CmBgYAoKYGBge3J9Cmxvd19wcmV2IDwtIHN1YnNldChocF8xOTk5LCBocF8xOTk5JHByZXZhbGVuY2UgPCAyLjApCmxvd19wcmV2CmBgYAoKIyMjIyNOb3RlczogCkhpZ2hlc3QgcHJldmFsZW5jZSBzZWVtcyB0byBzaXR1YXRlZCBpbiBTdWItU2FoYXJhbiBBZnJpY2EgYW5kIGNvdW50cmllcyB3aXRoIGEgc2l6YWJsZSBkaWFzcG9yYSBmcm9tIFN1Yi1TYWhhcmFuIEFmcmljYS4gUmVzZWFyY2hlcnMgaGF2ZSBzaG93biB0aGF0IGEgZ2VuZXRpYyB2YXJpYXRpb24gY2FsbGVkIER1ZmZ5IEFudGlnZW4gUmVjZXB0b3IgZm9yIENoZW1va2luZXMgKERBUkMpIHdoaWNoIHdhcyBldm9sdmVkIGFzIGEgcHJvdGVjdGlvbiBhZ2FpbnN0IGEgbm93IGV4dGluY3QgZm9ybSBvZiBtYWxhcmlhIGhhcyBsZWQgdG8gYSBncmVhdGVyIHN1c2NlcHRpYmlsaXR5IHRvIEhJVi4gREFSQyBpcyBwcmVzZW50IGluIGFuIG92ZXJ3aGVsbWluZyBwZXJjZW50YWdlIG9mIHBlb3BsZSB3aXRoIEFmcmljYW4gZGVzY2VudCBtYWtpbmcgdGhlbSBtb3JlIHN1c2NlcHRpYmxlIChidXQgYWxzbyBtb3JlIHJlc2lsaWVudCkgdG8gSElWLiBUaGlzIGluY3JlYXNlZCBzdXNjZXB0aWJpbGl0eSwgaW4gY29uY2VydCB3aXRoIGJlaGF2aW9yYWwgZmFjdG9ycywgaGVscHMgdG8gZXhwbGFpbiB0aGUgaGlnaCBwcmV2YWxlbmNlIG9mIEhJViBpbiBnZW9ncmFwaGljYWxseSBkaXNwZXJzZWQgY291bnRyaWVzIGxpa2UgQm90c3dhbmEgYW5kIEhhaXRpLiAKCiMjIyMjUmVzb3VyY2VzOgoqIGh0dHA6Ly93d3cubmF0dXJlLmNvbS9uZXdzLzIwMDgvMDgwNzE2L2Z1bGwvbmV3cy4yMDA4Ljk0OC5odG1sCiogaHR0cDovL3d3dy5zY2llbmNlZGlyZWN0LmNvbS9zY2llbmNlL2FydGljbGUvcGlpL1MwNzUzMzMyMjk5ODAwMjEzCgoqKioKCiMjIyBIYXMgcHJldmFsZW5jZSBkaW1pbmlzaGVkIG92ZXIgdGltZT8KYGBge3J9CnppbWIgPC0gc3Vic2V0KGhwLCBjb3VudHJ5ID09ICdaaW1iYWJ3ZScpCnppbWIKYGBgCgpgYGB7cn0KcXBsb3QoeCA9IHllYXIsIHkgPSBwcmV2YWxlbmNlLCBkYXRhID0gemltYikgCgpgYGAKCiMjIyMjTm90ZXM6CkluIFppbWJhYndlLCBjb25zaXN0ZW50bHkgdGhlIGhhcmRlc3QgaGl0IGNvdW50cnksIGl0IGxvb2tzIGxpa2UgSElWIHByZXZhbGVuY2UgaW5jcmVhc2VkIHRocm91Z2hvdXQgdGhlIDE5ODBzIGFuZCA5MHMgYW5kIGJlZ2FuIHRvIGxldmVsLW9mZiBhZnRlciAxOTk3LiBQcmV2YWxlbmNlIHNlZW1zIHRvIGhhdmUgc3RhcnRlZCBkZWNyZWFzaW5nIGluIHRoZSAyMDAwcyBhcyB0ZXN0aW5nIGFuZCB0cmVhdG1lbnQgd2VyZSBtYWRlIG1vcmUgYWNjZXNzaWJsZS4gVGhlIG1vc3Qgc2lnbmlmaWNhbnQgY2F1c2Ugb2YgdGhlIGRlY3JlYXNlIGluIHRoZSBjYXNlIG9mIFppbWJhYndlLCBob3dldmVyLCB3YXMgdGhlIHJlZHVjdGlvbiBvZiB0aGUgbnVtYmVyIG9mIHNlY3VhbCBwYXJ0bmVycyBieSAzMCUgZm9yIG1lbiAoVU5BSURTLCAyMDExKS4gVGhpcyByZWR1Y3Rpb25zIGluIG5ldyBjYXNlcyBvZiBISVYgc3BlbGxzIGdvb2QgbmV3cyBmb3IgZnV0dXJlIHJlZHVjdGlvbiBlZmZvcnRzIGJlY2F1c2UgaXQgc2hvd3MgdGhhdCB0aGUgaXNzdWUsIGRlc3BpdGUgd2hhdGV2ZXIgbmVnYXRpdmUgcHJlZGlzcG9zaXRpb25zIG1heSBiZSBhdCBwbGF5LCBjYW4gYmUgY29udGFpbmVkIGFuZCBjdXJyZW50IGNhc2VzIGNhbiBiZSB0cmVhdGVkIHdpdGggaW5jcmVhc2luZyBlZmZlY3RpdmVuZXNzLiAKCiMjIyMjUmVzb3VyY2VzOiAKKiBodHRwOi8vd3d3LnVuYWlkcy5vcmcvZW4vcmVzb3VyY2VzL3ByZXNzY2VudHJlL2ZlYXR1cmVzdG9yaWVzLzIwMTEvbWFyY2gvMjAxMTAzMTV6aW1iYWJ3ZQo=