Identify the subject of your project. What are you going to be making your visualizations about?
I am going to compare among 4 countries data which are Bangladesh, Malaysia, Pakistan, and Singapore. The analysis will be a panel analysis, which is a combination of Time Series and Cross-sectional data analysis. The analysis is going to be on Economic variables and Gender-based analysis.
The data is collected from World bank development indicators (WDI) dataset. The World Development Indicators (WDI) database is the World Bank’s primary, free, and comprehensive collection of internationally comparable statistics on global development, covering economic, social, and environmental data for over 217 economies, with many indicators extending back to 1960.
url: https://databank.worldbank.org/source/world-development-indicators
Please provide specific details about your data.
The data is a panel analysis from 2011-2021, of four countries, which are Bangladesh, Pakistan, Malaysia, and Singapore. The analysis is going to be mainly on economic indicators and Gender-based analysis. One exception only is the Women’s share of HIV Population among the countries, which is solely based on personal interest.
The available format of the dataset from WDI is in Excel file and not in rectangular format primarily. The data will be processed using data wrangling functions of the Tidyverse (mainly using the dplyr package).
After the wrangling process, the dataset is expected to contain atleast 40 rows and 15 columns.
The gender-based analyses will incorporate mostly barcharts. The continuous variables like population, gdp will need the use of scatterplot, boxplot, line graph, and the combination of scatterplot and trend line. As the comparison is made among 4 different regions, the visualization tool like faceting would be quite useful. It is ensured that the requirement of 8 graphs where 3 different chart type should be used; is considered strictly.