Exploratotry Data Analysis of Car Price Prediction

Tayyab Rajput

Life Cycle of a Machine Learning Project:

A) Problem Statement for Prediction

Importing data and required packages
Load the dataset
show top five rows of the dataset
Delete Columns Unnamed:0 and again show Top 5 Rows
Show Bottom 5 Rows
Shape of the Dataset
Show features/columns of the Dataset
Summary of Data
Check Null Value Counts and DataTypes of the features
Check duplicate values
Check the number of unique values of each column

C) Exploring Data

Show Categories in columns
Check Cars with 0 Seats
Define Numerical and Categorical Columns
proportion of count data on categorical columns

D) Univariate Analysis

E) Multivariate Analysis

Multivariate Analysis is the analysis of more than one variable.

Check Multicollinearity of Numerical Features

F) Visualization

F.1) Visualize the Target Feature
Selling Price Distribution
B.2) Most Selling Cars
Check mean price of Hyundai i20 which is most sold
B.3) Most Selling Brands
Check the Mean price of Maruti brand which is most sold
B.4) Costliest Brand and Costliest Car
B.5) Costliest Car
Most Mileage Brand and Car Name
Car with Highest Mileage
Kilometer driven vs Selling Price
Most sold Fuel type
Fuel types available and mileage given
Mileage vs Selling Price
Mileage Distribution
Vechile age vs Selling_price
Vehicle Age vs Mileage
Car_name vs Age
Transmission Type :
Transmission type vs Selling Price
Seller Type