4/25/2020

Project Objective

The objective of the report is to explore the cardio data set (“CardioGoodFitness”) in R and generate insights about the data set. This exploration report will consists of the following: Importing the dataset in R Understanding the structure of dataset Graphical exploration Descriptive statistics Insights from the dataset

Assumptions

<Think from practical Project Execution perspective. Add all your assumptions here.>

Exploratory Data Analysis – Step by step approach

A Typical Data exploration activity consists of the following steps:

  1. Environment Set up and Data Import
  2. Variable Identification
  3. Univariate Analysis
  4. Bi-Variate Analysis
  5. Missing Value Treatment (Not in scope for our project)
  6. Outlier Treatment (Not in scope for our project)
  7. Variable Transformation / Feature Creation
  8. Feature Exploration

We shall follow these steps in exploring the provided dataset. Although Steps 5 and 6 are not in scope for this project, a brief about these steps (and other steps as well) is given, as these are important steps for Data Exploration journey.

Environment Set up and Data Import

Install necessary Packages and Invoke Libraries

Use this section to install necessary packages and invoke associated libraries. Having all the packages at the same places increases code readability.

Set up working Directory

Setting a working directory on starting of the R session makes importing and exporting data files and code files easier. Basically, working directory is the location/ folder on the PC where you have the data, codes etc. related to the project.

Slide with Bullets

  • Bullet 1
  • Bullet 2
  • Bullet 3

Slide with R Output

summary(cars)
##      speed           dist       
##  Min.   : 4.0   Min.   :  2.00  
##  1st Qu.:12.0   1st Qu.: 26.00  
##  Median :15.0   Median : 36.00  
##  Mean   :15.4   Mean   : 42.98  
##  3rd Qu.:19.0   3rd Qu.: 56.00  
##  Max.   :25.0   Max.   :120.00

Slide with Plot