It is estimated that on average about fifty-five million people die each year. People die from several reasons ranging from sickness to one taking his/her life by himself or herself. In 2018, it was also reported that about 61 million people died across the world respectively. Such questions as:
• What caused the death of all these people?
• How has causes of death changed over time?
• How different is the causes of death from one country to another?
Finding reliable answers to these questions led us to our interest in analyzing the causes of death across world with using data visualization techniques.
In this project, I made use if 14 different datasets from three main sources. These are:
Six separate sets of data from the World Health Organization (WHO).
Five sets of data from the Institute for Health Metrics and Evaluation’s (IHME), 2019 report on Burden and Disease.
Three sets of data 3 sets of data from the Global Burden of Disease Study for 2019.
In all, 195 countries were capture by these datasets which includes 192 UN recognized list of countries in addition to The Holy See and the state of Palestine. A total of 33 variables was used for the analyses including 31 causes of deaths recorded globally. Each data contains an average of 7000 observations from 1991 through to 2018 respectively. We used R programming to explore, cleaned and prepared all the datasets for the analysis.
Figure 1. Causes of death in the world (1991-2018)
Figure 2. Causes of death in the world in 2018
I begin my visualization technique with a bar graph showing deaths caused by the various categories in millions. From the figure below we observe that the leading cause of death worldwide in the year 2018 is Cardiovascular diseases followed by Cancers. Poisoning, heat and natural deaths were the least causes of death respectively for this given year.
Figure 3. Number of death by group of ages and by type of diseases
Figure 3 (left) shows the global distribution of causes of deaths by five defined age groups from 1991 to 2018. In this graph we see that age group between 5 and 14 are the least people that recorded deaths across these years while age 70 and able recorded the highest with increasing numbers as the year goes by. Also, children under age 5 significantly recorded an increasing number of deaths as the years goes by.
Figure 3 (right) also shows a distribution of causes of death under three categories (communicable diseases, non-communicable diseases and injuries). Non-communicable diseases turns to record the highest number of deaths for this period of time while injuries recorded the least number of deaths among this category.
Figure 4. Causes of deaths among age group 5 -14 between USA and the world in 2018.
Below we try to compare major causes of death of children (Between 5 and 14 years old) in USA and those from the rest of the world. We observed that Road accidents and teen cancer are the top two causes of death for both children in USA and the rest of the world in 2018. Also, while malaria is the least cause of death among this age group in the USA in 2018, it is significantly high among the same age group across the world. Homicide and suicide are among the leading causes of death for children in USA, while these two phenomena are relatively low in other parts of the world.
Figure 5. Causes of deaths among age group 15 to 49 between USA and the world in 2018
For people with age in the bucket 15-49, we see a different pattern. In figure 5, we see that Drug, cancers and cardiovascular diseases are the leading cause of death. We also see that suicide is a bit high for that age group in USA. Cardiovascular diseases and cancers are also the leading causes of death in the world for that age group, alongside with HIV. But, contrary to USA, the number of deaths by drugs is very low, which tell us that USA might have a drug problem (Further analysis and research needs to confirm that).
Figure 6. Percentage of total deaths due to cardiovascular disease and cancers for each world’s region (1991-2018)
Here, we try to see how the percentage of total deaths due to cardiovascular diseases and cancers evolve overtime for different region in the world. We see region like Eastern Europe, Central Europe and Central Asia have the highest percentage of deaths due to Cardiovascular disease during the period 1990-2018.But, the trend is different for some regions. For instance, while the percentage is going upward for Central Asia, this is going down for central Europe. Western Europe and South Asia tend to have the lowest percentage of deaths due to cardiovascular diseases during that same period, but with a rapid growth for South Asia for the last couple years. On the other hand, for cancer, Western Europe tend to have the highest percentage of deaths due to cancers, followed by North America. Sub-Saharan Africa and South Asia are the regions where the percentages of total deaths due to cancers are the least during our time frame.
Figure 7. Percentage of deaths due to cardiovascular diseases and cancers for each socio-demographic index level (1991-2018)
Socio-demographic index (SDI) is a summary measure that identifies where countries or other geographic areas sit on the spectrum of development. Expressed on a scale of 0 to 1, SDI is a composite average of the rankings of the incomes per capita, average educational attainment, and fertility rates of all areas in the GBD study. Here, we try to see how the percentage of total deaths due to cardiovascular diseases and cancers evolve overtime for each SDI levels, and in which SDI level that percentage is the highest (respectively lowest). In figure 6, we see that High-middle SDI countries tend to have the highest percentage of deaths by Cardiovascular disease during the period 1990-2018, while low SDI tend to have the lowest during that same period. For cancers, High SDI tend to have the highest percentage of deaths during the period 1990-2018. One explication maybe that in low SDI countries, people mostly die by infectious diseases because of lack of health advancement.
Figure 8 (D3). Number of Deaths caused by Cardiovascular Diseases across the world (2018)
I decided to explore more about the cardiovascular and Cancer related deaths. This is because as seen earlier, these two happens to be the two most leading causes of death globally. The above graph was built making use of the D3 library from JavaScript. It is an interactive choropleth map of the world showing the number of deaths recorded by each country in the world in 2018 related to Cardiovascular diseases. This map shows the raw count of the number of cases without adjusting for each country’s population size. From the graph, we can conclude that China and India recorded the highest number of cases followed by Russia and United States respectively. Africa as a continent recorded less cases on average while Greenland, an Island close to Canada reported the least number of deaths related to cardiovascular diseases.
After carefully exploring the various causes of death across countries, we concluded the following based on our findings from our visualization;
• Cardiovascular diseases are the leading cause of death globally.
• Cancer is the second leading cause of death globally.
• Causes of death varies across countries.
• Non-communicable diseases dominate causes of death in rich countries.
• Infectious diseases are common causes in poor countries.
• The world is making significant progress against infectious diseases related causes of death