A typical data exploration activity consists of the following steps:
- Environment Setup and Data Import
- Variable Identification
- Univariate Analysis
- Bi-Variate Analysis
- Missing Value Treatment (Not in scope for our project)
- Outlier Treatment (Not in scope for our project)
- Variable Transformation / Feature Creation
- Feature Exploration
We shall follow these steps in exploring the provided dataset. Although steps 5 and 6 (Missing Value Treatment and Outlier Treatment) are not in scope for this project, every step, including these two, is described briefly, because all of them are important parts of the data exploration journey.
Environment Setup and Data Import
Install necessary Packages and Invoke Libraries
Use this section to install the necessary packages and load the associated libraries. Keeping all package installs and library calls in one place improves code readability.
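A minimal sketch of such a section is shown below. The package list is a hypothetical placeholder (`stats` and `utils` ship with R, so the snippet runs anywhere); replace it with the packages the project actually needs. The variable names `required_packages` and `missing_packages` are illustrative and do not come from the project code.

```r
# Hypothetical package list; replace with the packages the project actually uses.
required_packages <- c("stats", "utils")

# Install only the packages that are not already present on this machine.
missing_packages <- setdiff(required_packages, rownames(installed.packages()))
if (length(missing_packages) > 0) {
  install.packages(missing_packages)
}

# Attach every package in one place so the dependencies are easy to see.
invisible(lapply(required_packages, library, character.only = TRUE))
```

Checking `installed.packages()` first avoids reinstalling packages each time the script is run.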
Set up Working Directory
Setting a working directory at the start of the R session makes importing and exporting data files and code files easier. The working directory is the folder on the PC that holds the data, code, and other files related to the project.
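As an illustration, the sketch below sets a working directory and then reads and writes a file by name alone. The folder is an assumption: `tempdir()` stands in for the real project path, and `sample_data.csv` is a hypothetical file name built from R's bundled `mtcars` data, not the project dataset.

```r
# Hypothetical project folder; a temporary directory stands in for the real path.
project_dir <- tempdir()

# Point the R session at the project folder and confirm the change.
setwd(project_dir)
getwd()

# With the working directory set, files are referenced by name, not full path.
write.csv(head(mtcars), "sample_data.csv", row.names = FALSE)
sample_data <- read.csv("sample_data.csv")
```

In a real project, `project_dir` would be the path to the folder that holds the data and code files.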