Group 3
7th December, 2018
The goal of this project is to analyze what is the impact of COD mode of payment on the revenue(final total price) of an e-commerce company
getwd()
[1] "C:/Users/Risheel/Desktop/2018 DAM/Project Proposal"
projData = read.csv("PromotionDataV4.csv")
str(projData)
'data.frame': 45898 obs. of 29 variables:
$ OrderItemCode : int 50089810 50089812 50358742 48983376 49999082 50217970 49704460 49704458 49963528 49963526 ...
$ OrderID : int 29171450 29171450 29330182 28513266 29116292 29248820 28943024 28943024 29094620 29094620 ...
$ OrderDate : Factor w/ 19878 levels "4/10/2015 0:00",..: 8712 8712 10169 2286 8187 9447 6289 6289 7931 7931 ...
$ Category : Factor w/ 1 level "Apparels": 1 1 1 1 1 1 1 1 1 1 ...
$ SubCategory : Factor w/ 58 levels "3/4THS","ASYMMETRIC DRESS",..: 38 38 38 49 38 49 38 49 38 49 ...
$ Brand : Factor w/ 10 levels "ATHENA","FABALLEY",..: 2 2 2 2 2 2 2 2 2 2 ...
$ ProductColor : Factor w/ 44 levels "AQUA","AQUA BLUE",..: 43 4 4 35 5 33 42 5 42 5 ...
$ ProductSize : Factor w/ 18 levels "28","30","32",..: 13 16 13 13 15 15 15 15 15 15 ...
$ MRP : int 1100 800 1250 1200 1100 1250 950 850 950 850 ...
$ FinalTotalPrice : num 866 630 919 805 808 ...
$ VATPercent : int 5 5 5 5 5 5 5 5 5 5 ...
$ VAT : num 41.2 30 43.8 36 38.5 ...
$ CODCharge : int 0 0 0 49 0 0 0 0 0 49 ...
$ VendorDiscount : int 0 0 0 480 0 0 0 0 0 0 ...
$ WebsiteDiscountCode: Factor w/ 1894 levels "ACT200","ACT300",..: 1118 1118 1505 675 576 597 1108 1108 1118 1118 ...
$ WebsiteDiscount : num 275 200 375 0 330 ...
$ HasVendorDiscount : int 0 0 0 1 0 0 0 0 0 0 ...
$ HasWebsiteDiscount : int 1 1 1 0 1 1 1 1 1 1 ...
$ CustomerID : int 1224439 1224439 139127 15601014 6275116 6275116 3596101 3596101 3596101 3596101 ...
$ COD : int 1 1 1 1 1 1 1 1 1 1 ...
$ ShippingName : Factor w/ 30892 levels " Alok Karmakar",..: 18346 18346 8692 9116 17418 17418 17541 17541 17541 17541 ...
$ ShippingCity : Factor w/ 791 levels " Cuttack","110058",..: 590 590 321 427 590 590 321 321 321 321 ...
$ ShippingState : Factor w/ 32 levels "AN","AP","AR",..: 19 19 2 26 19 19 2 2 2 2 ...
$ ShippingPincode : int 411027 411027 500017 324007 411030 411030 500081 500081 500081 500081 ...
$ ShippingAddressType: Factor w/ 3 levels "Home","Null",..: 1 1 1 2 2 2 3 3 3 3 ...
$ BillingCity : Factor w/ 983 levels "190008","Abbowal",..: 719 719 400 527 719 719 400 400 400 400 ...
$ BillingState : Factor w/ 33 levels "AN","AP","AS",..: 18 18 2 27 18 18 2 2 2 2 ...
$ BillingPincode : int 411027 411027 500017 324007 411030 411030 500081 500081 500081 500081 ...
$ BillingAddressType : Factor w/ 3 levels "Home","Null",..: 2 2 1 2 2 2 3 3 3 3 ...
summary(projData[,c(5,6.9,10,13,14,15)])
SubCategory Brand FinalTotalPrice
TOPS :14605 HARPA :7271 Min. : 0.0
DRESSES : 8869 GRITSTONES :6901 1st Qu.: 525.0
POLO-T-SHIRTS : 3793 FABALLEY :6899 Median : 678.0
ROUND NECK T-SHIRTS: 2937 MONTEIL & MUNERO:6539 Mean : 754.8
SHIRTS : 2897 MEIRA :5422 3rd Qu.: 882.0
CASUAL SHIRTS : 1882 THE VANCA :4804 Max. :6977.9
(Other) :10915 (Other) :8062
CODCharge VendorDiscount WebsiteDiscountCode
Min. : 0.0 Min. : 0.0 NIL :29871
1st Qu.: 0.0 1st Qu.: 0.0 APP05 : 2686
Median : 0.0 Median : 110.0 STYLE30: 1699
Mean :13.7 Mean : 225.4 WOW30 : 1605
3rd Qu.:49.0 3rd Qu.: 400.0 SALE30 : 1299
Max. :49.0 Max. :4400.0 RUSH30 : 1144
(Other): 7594
psych::describe(projData)
vars n mean sd median
OrderItemCode 1 45898 49722785.04 940665.91 49663205.00
OrderID 2 45898 28953309.24 556520.76 28918355.00
OrderDate* 3 45898 8625.75 6092.07 8403.00
Category* 4 45898 1.00 0.00 1.00
SubCategory* 5 45898 34.13 16.36 38.00
Brand* 6 45898 4.68 2.58 4.00
ProductColor* 7 45898 18.86 14.38 16.00
ProductSize* 8 45898 14.21 2.08 14.00
MRP 9 45898 1015.03 464.76 950.00
FinalTotalPrice 10 45898 754.79 343.63 677.95
VATPercent 11 45898 5.00 0.00 5.00
VAT 12 45898 35.29 16.20 29.95
CODCharge 13 45898 13.70 21.99 0.00
VendorDiscount 14 45898 225.42 269.63 110.00
WebsiteDiscountCode* 15 45898 703.26 334.21 675.00
WebsiteDiscount 16 45898 83.81 167.46 0.00
HasVendorDiscount 17 45898 0.51 0.50 1.00
HasWebsiteDiscount 18 45898 0.35 0.48 0.00
CustomerID 19 45898 9014567.14 5248491.89 8963196.00
COD 20 45898 0.61 0.49 1.00
ShippingName* 21 45898 15433.57 8920.88 15471.00
ShippingCity* 22 45898 393.91 208.84 435.00
ShippingState* 23 45898 17.59 8.30 17.00
ShippingPincode 24 45898 400188.81 210286.71 400706.00
ShippingAddressType* 25 45898 1.66 0.66 2.00
BillingCity* 26 45898 482.17 254.31 535.00
BillingState* 27 45898 17.18 8.92 16.00
BillingPincode 28 45898 400234.40 210418.62 400706.00
BillingAddressType* 29 45898 1.73 0.61 2.00
trimmed mad min max
OrderItemCode 49709041.49 1313310.80 47856780 51238880.00
OrderID 28943786.35 776774.17 27856296 29857723.00
OrderDate* 8406.79 8375.21 1 19878.00
Category* 1.00 0.00 1 1.00
SubCategory* 34.61 16.31 1 58.00
Brand* 4.55 2.97 1 10.00
ProductColor* 17.95 17.79 1 44.00
ProductSize* 14.33 1.48 1 18.00
MRP 966.92 511.50 299 10999.00
FinalTotalPrice 711.12 235.07 0 6977.95
VATPercent 5.00 0.00 5 5.00
VAT 33.21 11.14 0 329.95
CODCharge 11.01 0.00 0 49.00
VendorDiscount 188.44 163.09 0 4400.00
WebsiteDiscountCode* 707.60 0.00 1 1894.00
WebsiteDiscount 45.35 0.00 0 3149.00
HasVendorDiscount 0.51 0.00 0 1.00
HasWebsiteDiscount 0.31 0.00 0 1.00
CustomerID 9185248.64 7705045.51 197 16007333.00
COD 0.64 0.00 0 1.00
ShippingName* 15436.44 11391.56 1 30892.00
ShippingCity* 396.36 212.01 1 791.00
ShippingState* 17.71 8.90 1 32.00
ShippingPincode 390462.46 251017.52 110001 854301.00
ShippingAddressType* 1.57 1.48 1 3.00
BillingCity* 485.21 262.42 1 983.00
BillingState* 17.12 8.90 1 33.00
BillingPincode 390493.96 251017.52 110001 900000.00
BillingAddressType* 1.68 0.00 1 3.00
range skew kurtosis se
OrderItemCode 3382100.00 0.13 -1.43 4390.75
OrderID 2001427.00 0.14 -1.43 2597.67
OrderDate* 19877.00 0.18 -1.30 28.44
Category* 0.00 NaN NaN 0.00
SubCategory* 57.00 -0.32 -1.47 0.08
Brand* 9.00 0.43 -0.90 0.01
ProductColor* 43.00 0.28 -1.54 0.07
ProductSize* 17.00 -2.03 8.69 0.01
MRP 10700.00 2.72 27.29 2.17
FinalTotalPrice 6977.95 2.39 17.32 1.60
VATPercent 0.00 NaN NaN 0.00
VAT 329.95 2.42 17.71 0.08
CODCharge 49.00 0.98 -1.04 0.10
VendorDiscount 4400.00 1.53 7.57 1.26
WebsiteDiscountCode* 1893.00 0.17 1.55 1.56
WebsiteDiscount 3149.00 2.88 16.66 0.78
HasVendorDiscount 1.00 -0.02 -2.00 0.00
HasWebsiteDiscount 1.00 0.63 -1.60 0.00
CustomerID 16007136.00 -0.09 -1.37 24498.40
COD 1.00 -0.46 -1.78 0.00
ShippingName* 30891.00 -0.01 -1.19 41.64
ShippingCity* 790.00 -0.26 -1.03 0.97
ShippingState* 31.00 0.00 -0.71 0.04
ShippingPincode 744300.00 0.16 -0.99 981.56
ShippingAddressType* 2.00 0.51 -0.73 0.00
BillingCity* 982.00 -0.27 -0.97 1.19
BillingState* 32.00 0.20 -0.82 0.04
BillingPincode 789999.00 0.15 -0.99 982.17
BillingAddressType* 2.00 0.22 -0.60 0.00
Dependent Variable: FinalTotalPriice
Independent Variable: MRP,CODCharge,VendorDiscount,WebsiteDiscount, HasWebsiteDiscount, HasVendorDiscount, COD, Brand, Category, SubCategory, ShippingCity, ShippingState, ShippingAddressType, BillingCity, BillingState, BillingAddressType,
Continuous Variable: OrderItemCode, OrderID, MRP, VendorDiscount, CustomerID, COD, FinalTotalPrice, VAT, WebsiteDiscount
Categorical Variable: OrderDate, Category, SubCategory, Brand, ProductColor, ProductSize, VATPercent, CODCharge, WebsiteDiscountCode, HasVendorDiscount, HasWebsiteDiscount, ShippingName, ShippingCity, ShippingState, ShippingPincode, ShippingAddressType, BillingCity, BillingState, BillingPincode, BillingAddressType
attach(projData)
round(addmargins(prop.table(table(projData$HasWebsiteDiscount,projData$COD,dnn=c("Web Site Discount","COD")),1)*100,2),1)
COD
Web Site Discount 0 1 Sum
0 35.6 64.4 100.0
1 44.4 55.6 100.0
Without Web site discount 64.4% people go for COD , while without web site discount only 55.6% goes for COD.
round(addmargins(prop.table(table(projData$COD,projData$Brand,dnn=c("COD","BRAND")),1)*100,2),1)
BRAND
COD ATHENA FABALLEY GRITSTONES HARPA MEIRA MISS CHASE MONTEIL & MUNERO
0 7.5 16.2 14.5 17.6 12.2 3.0 13.3
1 10.0 14.3 15.3 14.7 11.6 4.1 14.9
BRAND
COD MR BUTTON THE VANCA TSHIRT COMPANY Sum
0 1.8 10.8 3.1 100.0
1 1.9 10.2 3.0 100.0
round(addmargins(prop.table(table(projData$Brand,projData$COD,dnn=c("BRAND","COD")),1)*100,2),1)
COD
BRAND 0 1 Sum
ATHENA 32.1 67.9 100.0
FABALLEY 41.7 58.3 100.0
GRITSTONES 37.4 62.6 100.0
HARPA 43.0 57.0 100.0
MEIRA 39.9 60.1 100.0
MISS CHASE 32.0 68.0 100.0
MONTEIL & MUNERO 36.0 64.0 100.0
MR BUTTON 37.0 63.0 100.0
THE VANCA 40.0 60.0 100.0
TSHIRT COMPANY 39.9 60.1 100.0
Inference: 1. Gritstones has the maximum share of COD within customers who preferred COD. 2. Athena & Miss Chase has approximately 68% of their sales through COD.
table(projData$COD,projData$ShippingAddressType,dnn=c("COD","ADDRESS TYPE"),exclude = c("Null"))
ADDRESS TYPE
COD Home Office
0 9697 2891
1 10840 2015
Inference: In case of Home address customers prefer COD mode of payment than the case of office address.
aggregate (projData$FinalTotalPrice~projData$COD, FUN = mean)
projData$COD projData$FinalTotalPrice
1 0 723.2480
2 1 774.6938
boxplot(projData$FinalTotalPrice~projData$COD, ylab = "Final Total Price" , xlab =" COD (1= COD, 0 = Not COD" , main= "Mean price based on COD")
vartest1<- var.test(projData$FinalTotalPrice~projData$COD)
vartest1
F test to compare two variances
data: projData$FinalTotalPrice by projData$COD
F = 0.71574, num df = 17753, denom df = 28143, p-value < 2.2e-16
alternative hypothesis: true ratio of variances is not equal to 1
95 percent confidence interval:
0.6970046 0.7350407
sample estimates:
ratio of variances
0.715741
t.test(projData$FinalTotalPrice~projData$COD, var.equal= TRUE)
Two Sample t-test
data: projData$FinalTotalPrice by projData$COD
t = -15.662, df = 45896, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
-57.88375 -45.00781
sample estimates:
mean in group 0 mean in group 1
723.2480 774.6938
Inference:With and without COD the revenue for the company is significantly different.
aggregate(projData$FinalTotalPrice~projData$HasVendorDiscount, FUN = mean)
projData$HasVendorDiscount projData$FinalTotalPrice
1 0 808.4329
2 1 702.3885
boxplot(projData$FinalTotalPrice~projData$HasVendorDiscount, ylab = "Final Total Price" , xlab =" Vendor discount (1= Present, 0 = Not Present)" , main= "Mean price based on vendor discount")
library(gplots)
plotmeans(projData$FinalTotalPrice~projData$HasVendorDiscount, ylab = "Final Total Price" , xlab =" Vendor discount (1= Present, 0 = Not Present" , main= "Mean price based on vendor discount")
Inference:Without the Vendor discount the mean price is 808.4329 and with vendor discount the price is 702.
vartest2<- var.test(projData$FinalTotalPrice~projData$HasVendorDiscount)
vartest2
F test to compare two variances
data: projData$FinalTotalPrice by projData$HasVendorDiscount
F = 0.97696, num df = 22681, denom df = 23215, p-value = 0.07756
alternative hypothesis: true ratio of variances is not equal to 1
95 percent confidence interval:
0.9520066 1.0025778
sample estimates:
ratio of variances
0.9769631
t.test(projData$FinalTotalPrice~projData$HasVendorDiscount, var.equal= TRUE)
Two Sample t-test
data: projData$FinalTotalPrice by projData$HasVendorDiscount
t = 33.455, df = 45896, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
99.83165 112.25708
sample estimates:
mean in group 0 mean in group 1
808.4329 702.3885
Inference: Assuming Normality in the data, we run the T-test and reject the null hypothesis. The Final total price is significantly different with and without Vendor Discount.
aggregate(projData$FinalTotalPrice~projData$HasWebsiteDiscount, FUN = mean)
projData$HasWebsiteDiscount projData$FinalTotalPrice
1 0 747.507
2 1 768.375
boxplot(projData$FinalTotalPrice~projData$HasWebsiteDiscount, ylab = "Final Total Price" , xlab =" Website discount (1= Present, 0 = Not Present" , main= "Mean price based on Website discount")
The mean final price is 747 withpout Website DIscount and 768 without Website discount
vartest3<- var.test(projData$FinalTotalPrice~projData$HasWebsiteDiscount)
vartest3
F test to compare two variances
data: projData$FinalTotalPrice by projData$HasWebsiteDiscount
F = 0.9667, num df = 29870, denom df = 16026, p-value = 0.01424
alternative hypothesis: true ratio of variances is not equal to 1
95 percent confidence interval:
0.9407641 0.9932430
sample estimates:
ratio of variances
0.9667019
t.test(projData$FinalTotalPrice~projData$HasWebsiteDiscount, var.equal= FALSE)
Welch Two Sample t-test
data: projData$FinalTotalPrice by projData$HasWebsiteDiscount
t = -6.1731, df = 32300, p-value = 6.774e-10
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
-27.49373 -14.24216
sample estimates:
mean in group 0 mean in group 1
747.507 768.375
Inference: According to the two sample t-test the mean prices are significantly different than each other
attach(projData)
library(Hmisc)
CODselected <- c("MRP", "VendorDiscount", "COD", "WebsiteDiscount")
rcorr(as.matrix(projData[,CODselected]))
MRP VendorDiscount COD WebsiteDiscount
MRP 1.00 0.53 0.03 0.33
VendorDiscount 0.53 1.00 0.01 -0.29
COD 0.03 0.01 1.00 -0.02
WebsiteDiscount 0.33 -0.29 -0.02 1.00
n= 45898
P
MRP VendorDiscount COD WebsiteDiscount
MRP 0.0000 0.0000 0.0000
VendorDiscount 0.0000 0.0071 0.0000
COD 0.0000 0.0071 0.0003
WebsiteDiscount 0.0000 0.0000 0.0003
Inference:
library(corrgram)
corrgram(projData[,CODselected], order=TRUE,
main="Corrgram of discount and payment intercorrelations ",
lower.panel=panel.conf, upper.panel=panel.pie,
diag.panel=panel.minmax, text.panel=panel.txt)
library("PerformanceAnalytics")
chart.Correlation(projData[,CODselected], histogram = TRUE , main= "Correlation between Prices and discount")
Metrocity = c("Mumbai","New Delhi","Delhi","Kolkata","Hyderabad","Bangalore","Chennai","Pune","Ahmedabad","Thane")
projData = cbind(projData, ISMetro = as.integer(ShippingCity %in% Metrocity))
attach(projData)
apparel <- projData [!(projData$FinalTotalPrice=="0"),]
detach(projData)
attach(apparel)
OutlierPrice = boxplot.stats(FinalTotalPrice)$out
apparel = apparel[!FinalTotalPrice %in% OutlierPrice, ]
m1=lm(FinalTotalPrice~COD + Brand + CODCharge + HasVendorDiscount + HasWebsiteDiscount + ShippingAddressType+ SubCategory+ISMetro, data = apparel)
summary(m1)
Call:
lm(formula = FinalTotalPrice ~ COD + Brand + CODCharge + HasVendorDiscount +
HasWebsiteDiscount + ShippingAddressType + SubCategory +
ISMetro, data = apparel)
Residuals:
Min 1Q Median 3Q Max
-1458.75 -77.55 -5.41 72.88 825.82
Coefficients:
Estimate Std. Error t value
(Intercept) 811.40713 33.51815 24.208
COD 5.19632 1.65714 3.136
BrandFABALLEY 37.51509 3.30388 11.355
BrandGRITSTONES -276.00916 6.14647 -44.905
BrandHARPA 1.11545 3.18787 0.350
BrandMEIRA -287.11353 3.39003 -84.694
BrandMISS CHASE 62.35171 4.55052 13.702
BrandMONTEIL & MUNERO -113.45555 6.42743 -17.652
BrandMR BUTTON 438.09655 11.97368 36.588
BrandTHE VANCA -25.56861 3.32140 -7.698
BrandTSHIRT COMPANY -89.78190 5.38454 -16.674
CODCharge 1.16514 0.03691 31.570
HasVendorDiscount -254.14871 1.84241 -137.944
HasWebsiteDiscount -84.66874 1.68169 -50.348
ShippingAddressTypeNull 5.31712 1.49277 3.562
ShippingAddressTypeOffice -0.68542 2.31901 -0.296
SubCategoryASYMMETRIC DRESS 266.64008 49.49942 5.387
SubCategoryBEACHWEAR 284.68318 78.19913 3.640
SubCategoryBODYCON DRESS 307.22960 44.57398 6.893
SubCategoryCAMISOLES -231.45575 41.72724 -5.547
SubCategoryCAPRIS 251.87884 41.05256 6.136
SubCategoryCAPS & HATS 245.55157 34.66386 7.084
SubCategoryCASUAL JACKETS 583.72150 47.45534 12.300
SubCategoryCASUAL SHIRTS 378.11843 33.99591 11.122
SubCategoryCHINOS 248.05337 37.75185 6.571
SubCategoryDENIM JACKETS 368.93491 52.68833 7.002
SubCategoryDRESSES 311.52751 33.39428 9.329
SubCategoryFORMAL SHIRTS 406.26165 145.70270 2.788
SubCategoryFORMAL TROUSERS 406.38246 145.70206 2.789
SubCategoryGSTOP235BLK 277.90365 145.34359 1.912
SubCategoryHENLEY T-SHIRTS 30.48685 33.95129 0.898
SubCategoryHIGH NECK T SHIRTS 89.98477 34.03907 2.644
SubCategoryJEANS 396.83865 105.36787 3.766
SubCategoryJEGGINGS 251.15857 34.85152 7.207
SubCategoryJUMPSUITS 313.79794 35.80555 8.764
SubCategoryKURTAS 165.71783 36.74830 4.510
SubCategoryKURTIS 87.36755 35.56630 2.456
SubCategoryLEGGINGS 81.42774 42.27812 1.926
SubCategoryLOUNGEWEAR 22.34442 88.25737 0.253
SubCategoryMANDARIN T-SHIRTS 182.80434 39.20892 4.662
SubCategoryNIGHTWEAR 25.01172 40.08412 0.624
SubCategoryOFF SHOULDER/TUBE/HALTER 341.87075 88.20987 3.876
SubCategoryPOLO-T-SHIRTS 86.16100 33.82011 2.548
SubCategoryQUILTED JACKETS 435.34326 105.36411 4.132
SubCategoryROUND NECK SWEATERS 287.53990 44.47882 6.465
SubCategoryROUND NECK T-SHIRTS -5.97240 33.87443 -0.176
SubCategoryROUND NECK VESTS -10.87511 49.67314 -0.219
SubCategorySALWAR 172.01808 37.82473 4.548
SubCategorySCARVES 62.42531 63.02462 0.990
SubCategorySHIFT DRESS 155.30921 39.74272 3.908
SubCategorySHIRTS 174.03757 33.47412 5.199
SubCategorySHORTS 6.85632 34.49659 0.199
SubCategorySHRUGS 116.57972 34.78508 3.351
SubCategorySKATER DRESS 286.66838 42.21061 6.791
SubCategorySKIRTS 253.92316 33.79783 7.513
SubCategorySTOCKINGS -255.94811 145.29568 -1.762
SubCategorySUMMER JACKETS 134.49065 145.23749 0.926
SubCategorySWEATERS 202.52531 35.66933 5.678
SubCategorySWEATSHIRTS 451.11681 37.17538 12.135
SubCategoryT SHIRTS 31.17598 33.78345 0.923
SubCategoryTOPS 46.74409 33.38448 1.400
SubCategoryTRACK PANTS 361.12847 38.52904 9.373
SubCategoryTROUSERS 165.68612 33.70811 4.915
SubCategoryTUNICS 109.82342 35.78843 3.069
SubCategoryV NECK SWEATERS 190.40741 63.24858 3.010
SubCategoryV NECK T-SHIRTS -73.48711 33.95057 -2.165
SubCategoryWAISTCOATS 353.06889 35.05017 10.073
SubCategoryWINTER JACKETS 489.28531 34.00367 14.389
ISMetro -2.28556 1.39157 -1.642
Pr(>|t|)
(Intercept) < 2e-16 ***
COD 0.001716 **
BrandFABALLEY < 2e-16 ***
BrandGRITSTONES < 2e-16 ***
BrandHARPA 0.726413
BrandMEIRA < 2e-16 ***
BrandMISS CHASE < 2e-16 ***
BrandMONTEIL & MUNERO < 2e-16 ***
BrandMR BUTTON < 2e-16 ***
BrandTHE VANCA 1.41e-14 ***
BrandTSHIRT COMPANY < 2e-16 ***
CODCharge < 2e-16 ***
HasVendorDiscount < 2e-16 ***
HasWebsiteDiscount < 2e-16 ***
ShippingAddressTypeNull 0.000369 ***
ShippingAddressTypeOffice 0.767563
SubCategoryASYMMETRIC DRESS 7.21e-08 ***
SubCategoryBEACHWEAR 0.000272 ***
SubCategoryBODYCON DRESS 5.55e-12 ***
SubCategoryCAMISOLES 2.93e-08 ***
SubCategoryCAPRIS 8.56e-10 ***
SubCategoryCAPS & HATS 1.42e-12 ***
SubCategoryCASUAL JACKETS < 2e-16 ***
SubCategoryCASUAL SHIRTS < 2e-16 ***
SubCategoryCHINOS 5.07e-11 ***
SubCategoryDENIM JACKETS 2.56e-12 ***
SubCategoryDRESSES < 2e-16 ***
SubCategoryFORMAL SHIRTS 0.005301 **
SubCategoryFORMAL TROUSERS 0.005287 **
SubCategoryGSTOP235BLK 0.055877 .
SubCategoryHENLEY T-SHIRTS 0.369213
SubCategoryHIGH NECK T SHIRTS 0.008207 **
SubCategoryJEANS 0.000166 ***
SubCategoryJEGGINGS 5.83e-13 ***
SubCategoryJUMPSUITS < 2e-16 ***
SubCategoryKURTAS 6.51e-06 ***
SubCategoryKURTIS 0.014035 *
SubCategoryLEGGINGS 0.054111 .
SubCategoryLOUNGEWEAR 0.800135
SubCategoryMANDARIN T-SHIRTS 3.14e-06 ***
SubCategoryNIGHTWEAR 0.532643
SubCategoryOFF SHOULDER/TUBE/HALTER 0.000106 ***
SubCategoryPOLO-T-SHIRTS 0.010849 *
SubCategoryQUILTED JACKETS 3.61e-05 ***
SubCategoryROUND NECK SWEATERS 1.03e-10 ***
SubCategoryROUND NECK T-SHIRTS 0.860051
SubCategoryROUND NECK VESTS 0.826703
SubCategorySALWAR 5.44e-06 ***
SubCategorySCARVES 0.321940
SubCategorySHIFT DRESS 9.33e-05 ***
SubCategorySHIRTS 2.01e-07 ***
SubCategorySHORTS 0.842456
SubCategorySHRUGS 0.000805 ***
SubCategorySKATER DRESS 1.12e-11 ***
SubCategorySKIRTS 5.89e-14 ***
SubCategorySTOCKINGS 0.078150 .
SubCategorySUMMER JACKETS 0.354449
SubCategorySWEATERS 1.37e-08 ***
SubCategorySWEATSHIRTS < 2e-16 ***
SubCategoryT SHIRTS 0.356107
SubCategoryTOPS 0.161468
SubCategoryTRACK PANTS < 2e-16 ***
SubCategoryTROUSERS 8.90e-07 ***
SubCategoryTUNICS 0.002151 **
SubCategoryV NECK SWEATERS 0.002610 **
SubCategoryV NECK T-SHIRTS 0.030429 *
SubCategoryWAISTCOATS < 2e-16 ***
SubCategoryWINTER JACKETS < 2e-16 ***
ISMetro 0.100508
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 141.4 on 43246 degrees of freedom
Multiple R-squared: 0.6508, Adjusted R-squared: 0.6502
F-statistic: 1185 on 68 and 43246 DF, p-value: < 2.2e-16
m2=lm(FinalTotalPrice~COD + Brand + CODCharge + HasVendorDiscount + HasWebsiteDiscount + ShippingAddressType+ISMetro, data = apparel)
summary(m2)
Call:
lm(formula = FinalTotalPrice ~ COD + Brand + CODCharge + HasVendorDiscount +
HasWebsiteDiscount + ShippingAddressType + ISMetro, data = apparel)
Residuals:
Min 1Q Median 3Q Max
-1070.29 -115.20 -8.98 80.65 808.97
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 1031.76751 4.02773 256.166 < 2e-16 ***
COD 0.24983 2.15991 0.116 0.908
BrandFABALLEY -62.46442 3.80798 -16.404 < 2e-16 ***
BrandGRITSTONES -446.59226 4.14190 -107.823 < 2e-16 ***
BrandHARPA -117.02493 3.73463 -31.335 < 2e-16 ***
BrandMEIRA -387.70204 3.99070 -97.151 < 2e-16 ***
BrandMISS CHASE -43.99428 5.71879 -7.693 1.47e-14 ***
BrandMONTEIL & MUNERO -247.14130 3.80538 -64.945 < 2e-16 ***
BrandMR BUTTON 377.44409 13.59904 27.755 < 2e-16 ***
BrandTHE VANCA -89.23894 4.14970 -21.505 < 2e-16 ***
BrandTSHIRT COMPANY -271.12799 6.00221 -45.171 < 2e-16 ***
CODCharge 1.72452 0.04785 36.036 < 2e-16 ***
HasVendorDiscount -224.85663 2.34139 -96.035 < 2e-16 ***
HasWebsiteDiscount -78.41096 2.18592 -35.871 < 2e-16 ***
ShippingAddressTypeNull -1.69646 1.94361 -0.873 0.383
ShippingAddressTypeOffice -1.17480 3.02384 -0.389 0.698
ISMetro -2.05703 1.81429 -1.134 0.257
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 184.5 on 43298 degrees of freedom
Multiple R-squared: 0.4045, Adjusted R-squared: 0.4043
F-statistic: 1838 on 16 and 43298 DF, p-value: < 2.2e-16
m3=lm(FinalTotalPrice~COD + CODCharge + HasVendorDiscount + HasWebsiteDiscount + ShippingAddressType + SubCategory+ISMetro, data = apparel)
summary(m3)
Call:
lm(formula = FinalTotalPrice ~ COD + CODCharge + HasVendorDiscount +
HasWebsiteDiscount + ShippingAddressType + SubCategory +
ISMetro, data = apparel)
Residuals:
Min 1Q Median 3Q Max
-993.27 -91.84 -13.90 75.94 869.14
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 703.02773 40.58405 17.323 < 2e-16
COD 4.68786 2.01126 2.331 0.019768
CODCharge 1.52491 0.04459 34.199 < 2e-16
HasVendorDiscount -184.03449 1.89826 -96.949 < 2e-16
HasWebsiteDiscount -6.28909 1.88040 -3.345 0.000825
ShippingAddressTypeNull -0.65356 1.81200 -0.361 0.718338
ShippingAddressTypeOffice -4.11740 2.81773 -1.461 0.143954
SubCategoryASYMMETRIC DRESS 299.96239 60.07936 4.993 5.98e-07
SubCategoryBEACHWEAR 6.65931 94.99735 0.070 0.944114
SubCategoryBODYCON DRESS 378.13856 54.08031 6.992 2.75e-12
SubCategoryCAMISOLES -171.05404 50.63833 -3.378 0.000731
SubCategoryCAPRIS 321.43522 49.85934 6.447 1.15e-10
SubCategoryCAPS & HATS 194.11430 42.06485 4.615 3.95e-06
SubCategoryCASUAL JACKETS 516.22632 57.28704 9.011 < 2e-16
SubCategoryCASUAL SHIRTS 310.86776 40.74669 7.629 2.41e-14
SubCategoryCHINOS 281.23482 45.62190 6.164 7.13e-10
SubCategoryDENIM JACKETS 354.87244 64.04670 5.541 3.03e-08
SubCategoryDRESSES 306.91436 40.55341 7.568 3.86e-14
SubCategoryFORMAL SHIRTS 810.21441 176.55470 4.589 4.47e-06
SubCategoryFORMAL TROUSERS 804.87299 176.55711 4.559 5.16e-06
SubCategoryGSTOP235BLK 37.86492 176.56062 0.214 0.830191
SubCategoryHENLEY T-SHIRTS -132.26784 40.77514 -3.244 0.001180
SubCategoryHIGH NECK T SHIRTS -87.01757 40.87681 -2.129 0.033278
SubCategoryJEANS 395.39986 128.08716 3.087 0.002023
SubCategoryJEGGINGS 293.80983 42.26274 6.952 3.65e-12
SubCategoryJUMPSUITS 122.35583 43.41362 2.818 0.004829
SubCategoryKURTAS -64.09186 44.54954 -1.439 0.150253
SubCategoryKURTIS -97.60399 43.12624 -2.263 0.023627
SubCategoryLEGGINGS 2.12208 51.24145 0.041 0.966967
SubCategoryLOUNGEWEAR 151.73101 107.16992 1.416 0.156842
SubCategoryMANDARIN T-SHIRTS 106.91758 47.11885 2.269 0.023267
SubCategoryNIGHTWEAR 65.09678 48.58990 1.340 0.180344
SubCategoryOFF SHOULDER/TUBE/HALTER 272.69471 107.16776 2.545 0.010945
SubCategoryPOLO-T-SHIRTS -39.72901 40.62301 -0.978 0.328083
SubCategoryQUILTED JACKETS 403.27500 128.08198 3.149 0.001642
SubCategoryROUND NECK SWEATERS 204.22785 53.58417 3.811 0.000138
SubCategoryROUND NECK T-SHIRTS -109.21596 40.63871 -2.687 0.007202
SubCategoryROUND NECK VESTS -50.56165 60.10780 -0.841 0.400250
SubCategorySALWAR -82.49967 45.84725 -1.799 0.071955
SubCategorySCARVES -137.15161 76.55181 -1.792 0.073201
SubCategorySHIFT DRESS 90.14486 48.24446 1.869 0.061699
SubCategorySHIRTS 189.18138 40.64256 4.655 3.25e-06
SubCategorySHORTS -53.81860 41.84422 -1.286 0.198392
SubCategorySHRUGS 134.34991 42.21184 3.183 0.001460
SubCategorySKATER DRESS 336.51422 51.23407 6.568 5.15e-11
SubCategorySKIRTS 312.68148 41.00330 7.626 2.48e-14
SubCategorySTOCKINGS -172.45192 176.55213 -0.977 0.328685
SubCategorySUMMER JACKETS 138.76535 176.55413 0.786 0.431892
SubCategorySWEATERS 230.43724 43.25855 5.327 1.00e-07
SubCategorySWEATSHIRTS 349.22906 44.73153 7.807 5.98e-15
SubCategoryT SHIRTS -13.09968 40.89549 -0.320 0.748727
SubCategoryTOPS 36.47958 40.53749 0.900 0.368180
SubCategoryTRACK PANTS 307.80203 46.27802 6.651 2.94e-11
SubCategoryTROUSERS 9.56935 40.88253 0.234 0.814932
SubCategoryTUNICS -25.42721 43.42360 -0.586 0.558173
SubCategoryV NECK SWEATERS 113.02870 76.54577 1.477 0.139786
SubCategoryV NECK T-SHIRTS -199.89323 40.73250 -4.907 9.26e-07
SubCategoryWAISTCOATS 225.10656 42.09469 5.348 8.96e-08
SubCategoryWINTER JACKETS 499.45348 41.26729 12.103 < 2e-16
ISMetro -5.01370 1.69076 -2.965 0.003025
(Intercept) ***
COD *
CODCharge ***
HasVendorDiscount ***
HasWebsiteDiscount ***
ShippingAddressTypeNull
ShippingAddressTypeOffice
SubCategoryASYMMETRIC DRESS ***
SubCategoryBEACHWEAR
SubCategoryBODYCON DRESS ***
SubCategoryCAMISOLES ***
SubCategoryCAPRIS ***
SubCategoryCAPS & HATS ***
SubCategoryCASUAL JACKETS ***
SubCategoryCASUAL SHIRTS ***
SubCategoryCHINOS ***
SubCategoryDENIM JACKETS ***
SubCategoryDRESSES ***
SubCategoryFORMAL SHIRTS ***
SubCategoryFORMAL TROUSERS ***
SubCategoryGSTOP235BLK
SubCategoryHENLEY T-SHIRTS **
SubCategoryHIGH NECK T SHIRTS *
SubCategoryJEANS **
SubCategoryJEGGINGS ***
SubCategoryJUMPSUITS **
SubCategoryKURTAS
SubCategoryKURTIS *
SubCategoryLEGGINGS
SubCategoryLOUNGEWEAR
SubCategoryMANDARIN T-SHIRTS *
SubCategoryNIGHTWEAR
SubCategoryOFF SHOULDER/TUBE/HALTER *
SubCategoryPOLO-T-SHIRTS
SubCategoryQUILTED JACKETS **
SubCategoryROUND NECK SWEATERS ***
SubCategoryROUND NECK T-SHIRTS **
SubCategoryROUND NECK VESTS
SubCategorySALWAR .
SubCategorySCARVES .
SubCategorySHIFT DRESS .
SubCategorySHIRTS ***
SubCategorySHORTS
SubCategorySHRUGS **
SubCategorySKATER DRESS ***
SubCategorySKIRTS ***
SubCategorySTOCKINGS
SubCategorySUMMER JACKETS
SubCategorySWEATERS ***
SubCategorySWEATSHIRTS ***
SubCategoryT SHIRTS
SubCategoryTOPS
SubCategoryTRACK PANTS ***
SubCategoryTROUSERS
SubCategoryTUNICS
SubCategoryV NECK SWEATERS
SubCategoryV NECK T-SHIRTS ***
SubCategoryWAISTCOATS ***
SubCategoryWINTER JACKETS ***
ISMetro **
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 171.8 on 43255 degrees of freedom
Multiple R-squared: 0.4838, Adjusted R-squared: 0.4831
F-statistic: 687.2 on 59 and 43255 DF, p-value: < 2.2e-16
m4=lm(FinalTotalPrice~COD + Brand + CODCharge + HasVendorDiscount + HasWebsiteDiscount + SubCategory+ISMetro, data = apparel)
summary(m4)
Call:
lm(formula = FinalTotalPrice ~ COD + Brand + CODCharge + HasVendorDiscount +
HasWebsiteDiscount + SubCategory + ISMetro, data = apparel)
Residuals:
Min 1Q Median 3Q Max
-1460.07 -77.48 -4.92 72.89 823.37
Coefficients:
Estimate Std. Error t value
(Intercept) 812.73341 33.51486 24.250
COD 6.44592 1.62332 3.971
BrandFABALLEY 37.56966 3.30412 11.371
BrandGRITSTONES -275.71096 6.14684 -44.854
BrandHARPA 0.85504 3.18728 0.268
BrandMEIRA -287.07485 3.38989 -84.686
BrandMISS CHASE 62.38838 4.55116 13.708
BrandMONTEIL & MUNERO -113.70725 6.42801 -17.689
BrandMR BUTTON 437.32084 11.97362 36.524
BrandTHE VANCA -25.72819 3.32081 -7.748
BrandTSHIRT COMPANY -89.67320 5.38523 -16.652
CODCharge 1.17052 0.03689 31.734
HasVendorDiscount -254.14276 1.84256 -137.929
HasWebsiteDiscount -84.29900 1.67904 -50.207
SubCategoryASYMMETRIC DRESS 267.13089 49.50607 5.396
SubCategoryBEACHWEAR 284.62730 78.20951 3.639
SubCategoryBODYCON DRESS 307.66399 44.58002 6.901
SubCategoryCAMISOLES -231.12560 41.73302 -5.538
SubCategoryCAPRIS 251.44631 41.05784 6.124
SubCategoryCAPS & HATS 246.35051 34.66746 7.106
SubCategoryCASUAL JACKETS 584.11117 47.46056 12.307
SubCategoryCASUAL SHIRTS 378.36147 34.00051 11.128
SubCategoryCHINOS 248.39874 37.75607 6.579
SubCategoryDENIM JACKETS 369.68463 52.69440 7.016
SubCategoryDRESSES 311.61734 33.39862 9.330
SubCategoryFORMAL SHIRTS 410.65251 145.71921 2.818
SubCategoryFORMAL TROUSERS 404.20660 145.72110 2.774
SubCategoryGSTOP235BLK 281.22655 145.36194 1.935
SubCategoryHENLEY T-SHIRTS 30.91886 33.95561 0.911
SubCategoryHIGH NECK T SHIRTS 90.39928 34.04359 2.655
SubCategoryJEANS 396.81094 105.38244 3.765
SubCategoryJEGGINGS 251.53615 34.85551 7.217
SubCategoryJUMPSUITS 314.43885 35.81002 8.781
SubCategoryKURTAS 165.47218 36.75275 4.502
SubCategoryKURTIS 88.05811 35.57017 2.476
SubCategoryLEGGINGS 81.56233 42.28397 1.929
SubCategoryLOUNGEWEAR 23.44666 88.26931 0.266
SubCategoryMANDARIN T-SHIRTS 183.23289 39.21374 4.673
SubCategoryNIGHTWEAR 25.00943 40.08948 0.624
SubCategoryOFF SHOULDER/TUBE/HALTER 340.83844 88.22123 3.863
SubCategoryPOLO-T-SHIRTS 86.79495 33.82440 2.566
SubCategoryQUILTED JACKETS 435.88749 105.37849 4.136
SubCategoryROUND NECK SWEATERS 289.36690 44.48245 6.505
SubCategoryROUND NECK T-SHIRTS -5.25692 33.87845 -0.155
SubCategoryROUND NECK VESTS -10.83694 49.67959 -0.218
SubCategorySALWAR 172.25511 37.82975 4.553
SubCategorySCARVES 62.40259 63.03132 0.990
SubCategorySHIFT DRESS 155.61125 39.74839 3.915
SubCategorySHIRTS 174.07199 33.47863 5.199
SubCategorySHORTS 6.57147 34.50090 0.190
SubCategorySHRUGS 117.10255 34.78963 3.366
SubCategorySKATER DRESS 286.93813 42.21655 6.797
SubCategorySKIRTS 254.09385 33.80214 7.517
SubCategorySTOCKINGS -258.83062 145.31357 -1.781
SubCategorySUMMER JACKETS 137.64351 145.25604 0.948
SubCategorySWEATERS 203.17534 35.67345 5.695
SubCategorySWEATSHIRTS 451.20649 37.18037 12.136
SubCategoryT SHIRTS 30.91734 33.78770 0.915
SubCategoryTOPS 47.02173 33.38876 1.408
SubCategoryTRACK PANTS 362.38550 38.53306 9.405
SubCategoryTROUSERS 166.18569 33.71146 4.930
SubCategoryTUNICS 109.52613 35.79327 3.060
SubCategoryV NECK SWEATERS 190.02720 63.25600 3.004
SubCategoryV NECK T-SHIRTS -72.70481 33.95458 -2.141
SubCategoryWAISTCOATS 354.15279 35.05365 10.103
SubCategoryWINTER JACKETS 489.38387 34.00798 14.390
ISMetro -2.80756 1.38401 -2.029
Pr(>|t|)
(Intercept) < 2e-16 ***
COD 7.17e-05 ***
BrandFABALLEY < 2e-16 ***
BrandGRITSTONES < 2e-16 ***
BrandHARPA 0.788497
BrandMEIRA < 2e-16 ***
BrandMISS CHASE < 2e-16 ***
BrandMONTEIL & MUNERO < 2e-16 ***
BrandMR BUTTON < 2e-16 ***
BrandTHE VANCA 9.57e-15 ***
BrandTSHIRT COMPANY < 2e-16 ***
CODCharge < 2e-16 ***
HasVendorDiscount < 2e-16 ***
HasWebsiteDiscount < 2e-16 ***
SubCategoryASYMMETRIC DRESS 6.85e-08 ***
SubCategoryBEACHWEAR 0.000274 ***
SubCategoryBODYCON DRESS 5.22e-12 ***
SubCategoryCAMISOLES 3.07e-08 ***
SubCategoryCAPRIS 9.19e-10 ***
SubCategoryCAPS & HATS 1.21e-12 ***
SubCategoryCASUAL JACKETS < 2e-16 ***
SubCategoryCASUAL SHIRTS < 2e-16 ***
SubCategoryCHINOS 4.79e-11 ***
SubCategoryDENIM JACKETS 2.32e-12 ***
SubCategoryDRESSES < 2e-16 ***
SubCategoryFORMAL SHIRTS 0.004833 **
SubCategoryFORMAL TROUSERS 0.005542 **
SubCategoryGSTOP235BLK 0.053038 .
SubCategoryHENLEY T-SHIRTS 0.362529
SubCategoryHIGH NECK T SHIRTS 0.007924 **
SubCategoryJEANS 0.000166 ***
SubCategoryJEGGINGS 5.42e-13 ***
SubCategoryJUMPSUITS < 2e-16 ***
SubCategoryKURTAS 6.74e-06 ***
SubCategoryKURTIS 0.013304 *
SubCategoryLEGGINGS 0.053748 .
SubCategoryLOUNGEWEAR 0.790528
SubCategoryMANDARIN T-SHIRTS 2.98e-06 ***
SubCategoryNIGHTWEAR 0.532736
SubCategoryOFF SHOULDER/TUBE/HALTER 0.000112 ***
SubCategoryPOLO-T-SHIRTS 0.010290 *
SubCategoryQUILTED JACKETS 3.53e-05 ***
SubCategoryROUND NECK SWEATERS 7.84e-11 ***
SubCategoryROUND NECK T-SHIRTS 0.876688
SubCategoryROUND NECK VESTS 0.827324
SubCategorySALWAR 5.29e-06 ***
SubCategorySCARVES 0.322167
SubCategorySHIFT DRESS 9.06e-05 ***
SubCategorySHIRTS 2.01e-07 ***
SubCategorySHORTS 0.848940
SubCategorySHRUGS 0.000763 ***
SubCategorySKATER DRESS 1.08e-11 ***
SubCategorySKIRTS 5.71e-14 ***
SubCategorySTOCKINGS 0.074889 .
SubCategorySUMMER JACKETS 0.343342
SubCategorySWEATERS 1.24e-08 ***
SubCategorySWEATSHIRTS < 2e-16 ***
SubCategoryT SHIRTS 0.360172
SubCategoryTOPS 0.159047
SubCategoryTRACK PANTS < 2e-16 ***
SubCategoryTROUSERS 8.27e-07 ***
SubCategoryTUNICS 0.002215 **
SubCategoryV NECK SWEATERS 0.002665 **
SubCategoryV NECK T-SHIRTS 0.032260 *
SubCategoryWAISTCOATS < 2e-16 ***
SubCategoryWINTER JACKETS < 2e-16 ***
ISMetro 0.042508 *
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 141.4 on 43248 degrees of freedom
Multiple R-squared: 0.6507, Adjusted R-squared: 0.6501
F-statistic: 1221 on 66 and 43248 DF, p-value: < 2.2e-16
m5=lm(FinalTotalPrice~MRP + COD + Brand + CODCharge + HasVendorDiscount + HasWebsiteDiscount + ShippingAddressType+ SubCategory+ISMetro, data = apparel)
summary(m5)
Call:
lm(formula = FinalTotalPrice ~ MRP + COD + Brand + CODCharge +
HasVendorDiscount + HasWebsiteDiscount + ShippingAddressType +
SubCategory + ISMetro, data = apparel)
Residuals:
Min 1Q Median 3Q Max
-1547.27 -48.22 -8.22 34.12 531.32
Coefficients:
Estimate Std. Error t value
(Intercept) 3.671e+02 2.129e+01 17.246
MRP 5.974e-01 2.349e-03 254.341
COD 3.075e+00 1.049e+00 2.931
BrandFABALLEY -1.705e+00 2.097e+00 -0.813
BrandGRITSTONES -8.591e+01 3.962e+00 -21.684
BrandHARPA -3.009e+01 2.022e+00 -14.886
BrandMEIRA -8.090e+01 2.294e+00 -35.266
BrandMISS CHASE 1.735e+01 2.886e+00 6.014
BrandMONTEIL & MUNERO -3.907e+01 4.079e+00 -9.578
BrandMR BUTTON 1.115e+02 7.687e+00 14.509
BrandTHE VANCA -3.671e+01 2.103e+00 -17.455
BrandTSHIRT COMPANY -4.821e+01 3.412e+00 -14.128
CODCharge 9.979e-01 2.337e-02 42.697
HasVendorDiscount -2.729e+02 1.169e+00 -233.537
HasWebsiteDiscount -1.151e+02 1.071e+00 -107.482
ShippingAddressTypeNull 4.597e+00 9.449e-01 4.865
ShippingAddressTypeOffice 1.414e-01 1.468e+00 0.096
SubCategoryASYMMETRIC DRESS -1.596e+01 3.135e+01 -0.509
SubCategoryBEACHWEAR 8.372e+01 4.951e+01 1.691
SubCategoryBODYCON DRESS -6.391e+01 2.825e+01 -2.262
SubCategoryCAMISOLES -6.328e+01 2.642e+01 -2.395
SubCategoryCAPRIS 2.548e+01 2.600e+01 0.980
SubCategoryCAPS & HATS -1.995e+01 2.197e+01 -0.908
SubCategoryCASUAL JACKETS -3.629e+02 3.027e+01 -11.991
SubCategoryCASUAL SHIRTS -1.189e+00 2.157e+01 -0.055
SubCategoryCHINOS -5.567e+01 2.393e+01 -2.327
SubCategoryDENIM JACKETS -3.699e+01 3.339e+01 -1.108
SubCategoryDRESSES -2.491e+01 2.118e+01 -1.176
SubCategoryFORMAL SHIRTS 3.291e+01 9.224e+01 0.357
SubCategoryFORMAL TROUSERS 3.444e+01 9.224e+01 0.373
SubCategoryGSTOP235BLK 1.457e+02 9.200e+01 1.584
SubCategoryHENLEY T-SHIRTS -3.288e+01 2.149e+01 -1.530
SubCategoryHIGH NECK T SHIRTS -1.144e+01 2.155e+01 -0.531
SubCategoryJEANS -1.377e+01 6.672e+01 -0.206
SubCategoryJEGGINGS -5.455e+01 2.209e+01 -2.469
SubCategoryJUMPSUITS -4.481e+01 2.271e+01 -1.973
SubCategoryKURTAS -1.593e+01 2.327e+01 -0.685
SubCategoryKURTIS -1.641e+01 2.252e+01 -0.729
SubCategoryLEGGINGS 3.212e+01 2.676e+01 1.200
SubCategoryLOUNGEWEAR -9.980e+01 5.587e+01 -1.786
SubCategoryMANDARIN T-SHIRTS -1.757e+01 2.483e+01 -0.707
SubCategoryNIGHTWEAR -1.937e+01 2.537e+01 -0.763
SubCategoryOFF SHOULDER/TUBE/HALTER -9.606e+01 5.586e+01 -1.720
SubCategoryPOLO-T-SHIRTS -2.055e+01 2.141e+01 -0.960
SubCategoryQUILTED JACKETS -1.444e+02 6.673e+01 -2.164
SubCategoryROUND NECK SWEATERS -1.391e+02 2.820e+01 -4.933
SubCategoryROUND NECK T-SHIRTS -2.831e+01 2.144e+01 -1.320
SubCategoryROUND NECK VESTS -5.876e+01 3.144e+01 -1.869
SubCategorySALWAR -3.444e+01 2.396e+01 -1.438
SubCategorySCARVES -7.539e+00 3.989e+01 -0.189
SubCategorySHIFT DRESS -4.562e+01 2.517e+01 -1.813
SubCategorySHIRTS -2.883e+01 2.120e+01 -1.360
SubCategorySHORTS -5.044e+01 2.184e+01 -2.310
SubCategorySHRUGS -4.874e+01 2.203e+01 -2.213
SubCategorySKATER DRESS -1.050e+01 2.674e+01 -0.393
SubCategorySKIRTS -4.710e+00 2.142e+01 -0.220
SubCategorySTOCKINGS -9.543e+01 9.197e+01 -1.038
SubCategorySUMMER JACKETS -1.559e+02 9.194e+01 -1.696
SubCategorySWEATERS -3.524e+01 2.260e+01 -1.560
SubCategorySWEATSHIRTS -6.506e+01 2.362e+01 -2.755
SubCategoryT SHIRTS -4.792e+01 2.139e+01 -2.241
SubCategoryTOPS -3.026e+01 2.113e+01 -1.432
SubCategoryTRACK PANTS -1.046e+02 2.446e+01 -4.278
SubCategoryTROUSERS -1.627e+01 2.135e+01 -0.762
SubCategoryTUNICS -3.964e+01 2.266e+01 -1.749
SubCategoryV NECK SWEATERS -1.050e+02 4.005e+01 -2.623
SubCategoryV NECK T-SHIRTS -4.360e+01 2.149e+01 -2.029
SubCategoryWAISTCOATS -6.074e+01 2.225e+01 -2.731
SubCategoryWINTER JACKETS -7.235e+01 2.164e+01 -3.344
ISMetro -2.627e+00 8.808e-01 -2.983
Pr(>|t|)
(Intercept) < 2e-16 ***
MRP < 2e-16 ***
COD 0.003377 **
BrandFABALLEY 0.416168
BrandGRITSTONES < 2e-16 ***
BrandHARPA < 2e-16 ***
BrandMEIRA < 2e-16 ***
BrandMISS CHASE 1.83e-09 ***
BrandMONTEIL & MUNERO < 2e-16 ***
BrandMR BUTTON < 2e-16 ***
BrandTHE VANCA < 2e-16 ***
BrandTSHIRT COMPANY < 2e-16 ***
CODCharge < 2e-16 ***
HasVendorDiscount < 2e-16 ***
HasWebsiteDiscount < 2e-16 ***
ShippingAddressTypeNull 1.15e-06 ***
ShippingAddressTypeOffice 0.923275
SubCategoryASYMMETRIC DRESS 0.610653
SubCategoryBEACHWEAR 0.090816 .
SubCategoryBODYCON DRESS 0.023700 *
SubCategoryCAMISOLES 0.016630 *
SubCategoryCAPRIS 0.327046
SubCategoryCAPS & HATS 0.363881
SubCategoryCASUAL JACKETS < 2e-16 ***
SubCategoryCASUAL SHIRTS 0.956034
SubCategoryCHINOS 0.019985 *
SubCategoryDENIM JACKETS 0.267898
SubCategoryDRESSES 0.239567
SubCategoryFORMAL SHIRTS 0.721217
SubCategoryFORMAL TROUSERS 0.708900
SubCategoryGSTOP235BLK 0.113263
SubCategoryHENLEY T-SHIRTS 0.126050
SubCategoryHIGH NECK T SHIRTS 0.595364
SubCategoryJEANS 0.836535
SubCategoryJEGGINGS 0.013545 *
SubCategoryJUMPSUITS 0.048465 *
SubCategoryKURTAS 0.493572
SubCategoryKURTIS 0.466125
SubCategoryLEGGINGS 0.230104
SubCategoryLOUNGEWEAR 0.074044 .
SubCategoryMANDARIN T-SHIRTS 0.479271
SubCategoryNIGHTWEAR 0.445304
SubCategoryOFF SHOULDER/TUBE/HALTER 0.085515 .
SubCategoryPOLO-T-SHIRTS 0.337192
SubCategoryQUILTED JACKETS 0.030488 *
SubCategoryROUND NECK SWEATERS 8.14e-07 ***
SubCategoryROUND NECK T-SHIRTS 0.186734
SubCategoryROUND NECK VESTS 0.061670 .
SubCategorySALWAR 0.150568
SubCategorySCARVES 0.850124
SubCategorySHIFT DRESS 0.069882 .
SubCategorySHIRTS 0.173907
SubCategorySHORTS 0.020897 *
SubCategorySHRUGS 0.026917 *
SubCategorySKATER DRESS 0.694542
SubCategorySKIRTS 0.825924
SubCategorySTOCKINGS 0.299461
SubCategorySUMMER JACKETS 0.089874 .
SubCategorySWEATERS 0.118853
SubCategorySWEATSHIRTS 0.005880 **
SubCategoryT SHIRTS 0.025056 *
SubCategoryTOPS 0.152158
SubCategoryTRACK PANTS 1.89e-05 ***
SubCategoryTROUSERS 0.445947
SubCategoryTUNICS 0.080239 .
SubCategoryV NECK SWEATERS 0.008730 **
SubCategoryV NECK T-SHIRTS 0.042469 *
SubCategoryWAISTCOATS 0.006326 **
SubCategoryWINTER JACKETS 0.000826 ***
ISMetro 0.002858 **
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 89.47 on 43245 degrees of freedom
Multiple R-squared: 0.8601, Adjusted R-squared: 0.8599
F-statistic: 3853 on 69 and 43245 DF, p-value: < 2.2e-16
Sub-category: As expected, the sub-category has an impact on prices as according to the type of wear, the price would vary. Brand: The brand also has a significant impact on prices as some brands would be expensive and some would be cheap. The COD charge naturally has an impact as it would increase the final price. We also wish to see the magnitude of impact caused by having website and vendor discounts. Although the shipping address type does not have a significant impact, removing it decreases the adjusted R-squared value. From the results, we can see that the cost of products ordered at home is marginally higher than those ordered at office address.
lmModel=lm(FinalTotalPrice ~ MRP + COD + Brand + CODCharge + HasVendorDiscount + HasWebsiteDiscount + SubCategory + ISMetro)
summary(lmModel)
Call:
lm(formula = FinalTotalPrice ~ MRP + COD + Brand + CODCharge +
HasVendorDiscount + HasWebsiteDiscount + SubCategory + ISMetro)
Residuals:
Min 1Q Median 3Q Max
-1847.99 -60.19 -12.27 41.27 851.09
Coefficients:
Estimate Std. Error t value
(Intercept) 3.539e+02 2.570e+01 13.775
MRP 6.733e-01 2.254e-03 298.698
COD 4.145e+00 1.219e+00 3.400
BrandFABALLEY 3.972e+00 2.436e+00 1.630
BrandGRITSTONES -9.004e+01 4.623e+00 -19.478
BrandHARPA -3.014e+01 2.363e+00 -12.755
BrandMEIRA -7.300e+01 2.649e+00 -27.556
BrandMISS CHASE 3.981e+01 3.334e+00 11.939
BrandMONTEIL & MUNERO -2.759e+01 4.778e+00 -5.774
BrandMR BUTTON 1.989e+02 6.518e+00 30.517
BrandTHE VANCA -4.293e+01 2.488e+00 -17.257
BrandTSHIRT COMPANY -3.038e+01 3.961e+00 -7.670
CODCharge 9.941e-01 2.743e-02 36.242
HasVendorDiscount -3.076e+02 1.339e+00 -229.775
HasWebsiteDiscount -1.481e+02 1.237e+00 -119.719
SubCategoryASYMMETRIC DRESS -6.477e+01 3.790e+01 -1.709
SubCategoryBEACHWEAR 7.204e+01 5.986e+01 1.203
SubCategoryBODYCON DRESS -1.093e+02 3.354e+01 -3.258
SubCategoryCAMISOLES -6.583e+01 3.194e+01 -2.061
SubCategoryCAPRIS -2.817e+01 3.143e+01 -0.896
SubCategoryCAPS & HATS -5.293e+01 2.652e+01 -1.996
SubCategoryCASUAL JACKETS -2.734e+02 2.847e+01 -9.603
SubCategoryCASUAL SHIRTS -4.477e+01 2.601e+01 -1.721
SubCategoryCHINOS -7.855e+01 2.744e+01 -2.863
SubCategoryDENIM JACKETS -8.629e+01 4.036e+01 -2.138
SubCategoryDRESSES -5.091e+01 2.559e+01 -1.989
SubCategoryETHNIC JACKETS -3.037e+01 2.887e+01 -1.052
SubCategoryFORMAL SHIRTS -1.994e+02 8.087e+01 -2.465
SubCategoryFORMAL TROUSERS -1.448e+02 4.310e+01 -3.359
SubCategoryGSTOP235BLK 1.475e+02 1.112e+02 1.326
SubCategoryHENLEY T-SHIRTS -5.312e+01 2.597e+01 -2.045
SubCategoryHIGH NECK T SHIRTS -3.381e+01 2.604e+01 -1.298
SubCategoryJACKETS & BLAZERS -4.614e+02 3.333e+01 -13.843
SubCategoryJEANS -7.200e+01 8.067e+01 -0.893
SubCategoryJEGGINGS -5.369e+01 2.653e+01 -2.024
SubCategoryJUMPSUITS -3.754e+01 2.697e+01 -1.392
SubCategoryKURTAS -2.699e+01 2.746e+01 -0.983
SubCategoryKURTIS -4.907e+01 2.722e+01 -1.802
SubCategoryLEGGINGS 1.019e+01 3.236e+01 0.315
SubCategoryLOUNGEWEAR -1.633e+02 6.755e+01 -2.417
SubCategoryMANDARIN T-SHIRTS -5.728e+01 3.000e+01 -1.909
SubCategoryNIGHTWEAR -3.797e+01 3.068e+01 -1.238
SubCategoryOFF SHOULDER/TUBE/HALTER -1.699e+02 6.753e+01 -2.516
SubCategoryPOLO-T-SHIRTS -4.531e+01 2.587e+01 -1.751
SubCategoryQUILTED JACKETS -2.085e+02 8.068e+01 -2.584
SubCategoryROUND NECK SWEATERS -2.017e+02 3.406e+01 -5.922
SubCategoryROUND NECK T-SHIRTS -4.204e+01 2.591e+01 -1.623
SubCategoryROUND NECK VESTS -9.790e+01 3.801e+01 -2.576
SubCategorySALWAR -4.984e+01 2.896e+01 -1.721
SubCategorySCARVES -2.229e+01 4.824e+01 -0.462
SubCategorySHIFT DRESS -8.223e+01 3.033e+01 -2.711
SubCategorySHIRTS -7.104e+01 2.563e+01 -2.772
SubCategorySHORTS -6.362e+01 2.637e+01 -2.413
SubCategorySHRUGS -7.925e+01 2.663e+01 -2.977
SubCategorySKATER DRESS -4.858e+01 3.195e+01 -1.521
SubCategorySKIRTS -3.917e+01 2.586e+01 -1.515
SubCategorySTOCKINGS -1.163e+02 1.112e+02 -1.046
SubCategorySUITS -4.404e+02 3.078e+01 -14.306
SubCategorySUMMER JACKETS 1.031e+02 5.987e+01 1.723
SubCategorySWEATERS -7.447e+01 2.731e+01 -2.726
SubCategorySWEATSHIRTS -9.453e+01 2.814e+01 -3.359
SubCategoryT SHIRTS -8.422e+01 2.585e+01 -3.258
SubCategoryTOPS -5.182e+01 2.555e+01 -2.028
SubCategoryTRACK PANTS -1.640e+02 2.934e+01 -5.590
SubCategoryTRACKPANTS 5.041e+02 8.080e+01 6.239
SubCategoryTRACKSUITS 1.108e+03 1.114e+02 9.939
SubCategoryTROUSERS -4.102e+01 2.580e+01 -1.590
SubCategoryTUNICS -6.321e+01 2.740e+01 -2.307
SubCategoryV NECK SWEATERS -1.552e+02 4.841e+01 -3.207
SubCategoryV NECK T-SHIRTS -4.838e+01 2.597e+01 -1.863
SubCategoryWAISTCOATS -9.074e+01 2.658e+01 -3.414
SubCategoryWINTER JACKETS -1.396e+02 2.607e+01 -5.355
ISMetro -3.412e+00 1.034e+00 -3.299
Pr(>|t|)
(Intercept) < 2e-16 ***
MRP < 2e-16 ***
COD 0.000675 ***
BrandFABALLEY 0.103009
BrandGRITSTONES < 2e-16 ***
BrandHARPA < 2e-16 ***
BrandMEIRA < 2e-16 ***
BrandMISS CHASE < 2e-16 ***
BrandMONTEIL & MUNERO 7.78e-09 ***
BrandMR BUTTON < 2e-16 ***
BrandTHE VANCA < 2e-16 ***
BrandTSHIRT COMPANY 1.75e-14 ***
CODCharge < 2e-16 ***
HasVendorDiscount < 2e-16 ***
HasWebsiteDiscount < 2e-16 ***
SubCategoryASYMMETRIC DRESS 0.087422 .
SubCategoryBEACHWEAR 0.228805
SubCategoryBODYCON DRESS 0.001124 **
SubCategoryCAMISOLES 0.039345 *
SubCategoryCAPRIS 0.370146
SubCategoryCAPS & HATS 0.045937 *
SubCategoryCASUAL JACKETS < 2e-16 ***
SubCategoryCASUAL SHIRTS 0.085179 .
SubCategoryCHINOS 0.004202 **
SubCategoryDENIM JACKETS 0.032504 *
SubCategoryDRESSES 0.046705 *
SubCategoryETHNIC JACKETS 0.292972
SubCategoryFORMAL SHIRTS 0.013703 *
SubCategoryFORMAL TROUSERS 0.000782 ***
SubCategoryGSTOP235BLK 0.184788
SubCategoryHENLEY T-SHIRTS 0.040833 *
SubCategoryHIGH NECK T SHIRTS 0.194149
SubCategoryJACKETS & BLAZERS < 2e-16 ***
SubCategoryJEANS 0.372070
SubCategoryJEGGINGS 0.042971 *
SubCategoryJUMPSUITS 0.164045
SubCategoryKURTAS 0.325771
SubCategoryKURTIS 0.071493 .
SubCategoryLEGGINGS 0.752873
SubCategoryLOUNGEWEAR 0.015636 *
SubCategoryMANDARIN T-SHIRTS 0.056241 .
SubCategoryNIGHTWEAR 0.215767
SubCategoryOFF SHOULDER/TUBE/HALTER 0.011888 *
SubCategoryPOLO-T-SHIRTS 0.079880 .
SubCategoryQUILTED JACKETS 0.009767 **
SubCategoryROUND NECK SWEATERS 3.21e-09 ***
SubCategoryROUND NECK T-SHIRTS 0.104667
SubCategoryROUND NECK VESTS 0.010001 *
SubCategorySALWAR 0.085280 .
SubCategorySCARVES 0.643996
SubCategorySHIFT DRESS 0.006702 **
SubCategorySHIRTS 0.005574 **
SubCategorySHORTS 0.015846 *
SubCategorySHRUGS 0.002916 **
SubCategorySKATER DRESS 0.128319
SubCategorySKIRTS 0.129828
SubCategorySTOCKINGS 0.295594
SubCategorySUITS < 2e-16 ***
SubCategorySUMMER JACKETS 0.084933 .
SubCategorySWEATERS 0.006408 **
SubCategorySWEATSHIRTS 0.000784 ***
SubCategoryT SHIRTS 0.001124 **
SubCategoryTOPS 0.042562 *
SubCategoryTRACK PANTS 2.28e-08 ***
SubCategoryTRACKPANTS 4.45e-10 ***
SubCategoryTRACKSUITS < 2e-16 ***
SubCategoryTROUSERS 0.111894
SubCategoryTUNICS 0.021050 *
SubCategoryV NECK SWEATERS 0.001344 **
SubCategoryV NECK T-SHIRTS 0.062447 .
SubCategoryWAISTCOATS 0.000641 ***
SubCategoryWINTER JACKETS 8.60e-08 ***
ISMetro 0.000970 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 108.2 on 45434 degrees of freedom
Multiple R-squared: 0.8976, Adjusted R-squared: 0.8975
F-statistic: 5532 on 72 and 45434 DF, p-value: < 2.2e-16
plot(lmModel, 1)
library(car)
qqPlot(FinalTotalPrice)
[1] 43558 45503
library(nortest)
ad.test(FinalTotalPrice)
Anderson-Darling normality test
data: FinalTotalPrice
A = 1561.4, p-value < 2.2e-16
From Plot 1, we conclude that the model is not linear. From qqPlot and the ad test, we conclude that the data is not normally distributed.
We've used Box-Cox Transformation.
library(caret)
FinalTotalPriceTrans = BoxCoxTrans(FinalTotalPrice)
FinalTotalPriceTrans
Box-Cox Transformation
45507 data points used to estimate Lambda
Input data summary:
Min. 1st Qu. Median Mean 3rd Qu. Max.
7.35 532.35 677.95 761.28 883.91 6977.95
Largest/Smallest: 949
Sample Skewness: 2.58
Estimated Lambda: 0
With fudge factor, Lambda = 0 will be used for transformations
Lambda is found to be 0, hence we'll proceed with the log-linear model.
logModel=lm(log(FinalTotalPrice) ~ MRP + COD + Brand + CODCharge + HasVendorDiscount + HasWebsiteDiscount + SubCategory + ISMetro)
summary(logModel)
Call:
lm(formula = log(FinalTotalPrice) ~ MRP + COD + Brand + CODCharge +
HasVendorDiscount + HasWebsiteDiscount + SubCategory + ISMetro)
Residuals:
Min 1Q Median 3Q Max
-4.0687 -0.0683 -0.0022 0.0743 1.2049
Coefficients:
Estimate Std. Error t value
(Intercept) 6.166e+00 3.579e-02 172.294
MRP 6.866e-04 3.139e-06 218.698
COD 4.171e-03 1.698e-03 2.456
BrandFABALLEY 1.292e-02 3.393e-03 3.807
BrandGRITSTONES -1.483e-01 6.439e-03 -23.026
BrandHARPA -1.812e-02 3.291e-03 -5.506
BrandMEIRA -1.722e-01 3.690e-03 -46.678
BrandMISS CHASE 6.347e-02 4.644e-03 13.665
BrandMONTEIL & MUNERO -1.057e-01 6.654e-03 -15.877
BrandMR BUTTON -1.264e-02 9.079e-03 -1.392
BrandTHE VANCA -4.919e-02 3.465e-03 -14.196
BrandTSHIRT COMPANY -6.436e-02 5.517e-03 -11.666
CODCharge 1.474e-03 3.820e-05 38.589
HasVendorDiscount -3.845e-01 1.865e-03 -206.183
HasWebsiteDiscount -1.676e-01 1.723e-03 -97.276
SubCategoryASYMMETRIC DRESS 2.474e-02 5.278e-02 0.469
SubCategoryBEACHWEAR 1.940e-01 8.337e-02 2.327
SubCategoryBODYCON DRESS -4.456e-02 4.671e-02 -0.954
SubCategoryCAMISOLES -2.095e-01 4.449e-02 -4.709
SubCategoryCAPRIS 3.969e-02 4.378e-02 0.907
SubCategoryCAPS & HATS 2.936e-02 3.693e-02 0.795
SubCategoryCASUAL JACKETS -3.000e-01 3.966e-02 -7.565
SubCategoryCASUAL SHIRTS 1.321e-01 3.623e-02 3.646
SubCategoryCHINOS 1.540e-02 3.822e-02 0.403
SubCategoryDENIM JACKETS 5.137e-02 5.621e-02 0.914
SubCategoryDRESSES 2.709e-02 3.565e-02 0.760
SubCategoryETHNIC JACKETS -4.946e-02 4.022e-02 -1.230
SubCategoryFORMAL SHIRTS 7.630e-02 1.126e-01 0.677
SubCategoryFORMAL TROUSERS 7.056e-02 6.004e-02 1.175
SubCategoryGSTOP235BLK 2.681e-01 1.549e-01 1.730
SubCategoryHENLEY T-SHIRTS -3.976e-02 3.617e-02 -1.099
SubCategoryHIGH NECK T SHIRTS 1.356e-02 3.627e-02 0.374
SubCategoryJACKETS & BLAZERS -1.485e+00 4.642e-02 -31.991
SubCategoryJEANS 5.031e-02 1.124e-01 0.448
SubCategoryJEGGINGS 1.618e-02 3.695e-02 0.438
SubCategoryJUMPSUITS -1.187e-02 3.757e-02 -0.316
SubCategoryKURTAS 4.117e-02 3.825e-02 1.076
SubCategoryKURTIS 3.116e-02 3.792e-02 0.822
SubCategoryLEGGINGS 5.958e-02 4.507e-02 1.322
SubCategoryLOUNGEWEAR -1.531e-01 9.408e-02 -1.627
SubCategoryMANDARIN T-SHIRTS 8.099e-02 4.178e-02 1.938
SubCategoryNIGHTWEAR -1.159e-02 4.273e-02 -0.271
SubCategoryOFF SHOULDER/TUBE/HALTER -4.229e-02 9.406e-02 -0.450
SubCategoryPOLO-T-SHIRTS 1.553e-02 3.604e-02 0.431
SubCategoryQUILTED JACKETS -1.036e-01 1.124e-01 -0.922
SubCategoryROUND NECK SWEATERS -1.788e-02 4.744e-02 -0.377
SubCategoryROUND NECK T-SHIRTS -5.616e-02 3.609e-02 -1.556
SubCategoryROUND NECK VESTS -5.524e-02 5.294e-02 -1.044
SubCategorySALWAR -3.496e-02 4.034e-02 -0.867
SubCategorySCARVES -4.866e-02 6.719e-02 -0.724
SubCategorySHIFT DRESS -2.918e-03 4.224e-02 -0.069
SubCategorySHIRTS 5.843e-03 3.570e-02 0.164
SubCategorySHORTS -6.755e-02 3.673e-02 -1.839
SubCategorySHRUGS -3.161e-02 3.708e-02 -0.852
SubCategorySKATER DRESS 5.122e-02 4.450e-02 1.151
SubCategorySKIRTS 2.143e-02 3.602e-02 0.595
SubCategorySTOCKINGS -2.787e-01 1.549e-01 -1.799
SubCategorySUITS -6.016e-01 4.288e-02 -14.031
SubCategorySUMMER JACKETS 1.490e-02 8.339e-02 0.179
SubCategorySWEATERS 1.923e-02 3.804e-02 0.505
SubCategorySWEATSHIRTS 6.870e-02 3.920e-02 1.753
SubCategoryT SHIRTS -5.072e-02 3.601e-02 -1.409
SubCategoryTOPS -3.560e-02 3.559e-02 -1.000
SubCategoryTRACK PANTS 4.296e-02 4.086e-02 1.051
SubCategoryTRACKPANTS 1.342e-01 1.125e-01 1.193
SubCategoryTRACKSUITS -4.190e-01 1.552e-01 -2.700
SubCategoryTROUSERS 2.836e-02 3.594e-02 0.789
SubCategoryTUNICS -2.314e-02 3.816e-02 -0.607
SubCategoryV NECK SWEATERS -1.510e-02 6.743e-02 -0.224
SubCategoryV NECK T-SHIRTS -1.480e-01 3.617e-02 -4.092
SubCategoryWAISTCOATS -3.438e-02 3.702e-02 -0.929
SubCategoryWINTER JACKETS -4.691e-02 3.631e-02 -1.292
ISMetro -5.626e-03 1.440e-03 -3.906
Pr(>|t|)
(Intercept) < 2e-16 ***
MRP < 2e-16 ***
COD 0.014047 *
BrandFABALLEY 0.000141 ***
BrandGRITSTONES < 2e-16 ***
BrandHARPA 3.69e-08 ***
BrandMEIRA < 2e-16 ***
BrandMISS CHASE < 2e-16 ***
BrandMONTEIL & MUNERO < 2e-16 ***
BrandMR BUTTON 0.163871
BrandTHE VANCA < 2e-16 ***
BrandTSHIRT COMPANY < 2e-16 ***
CODCharge < 2e-16 ***
HasVendorDiscount < 2e-16 ***
HasWebsiteDiscount < 2e-16 ***
SubCategoryASYMMETRIC DRESS 0.639252
SubCategoryBEACHWEAR 0.019978 *
SubCategoryBODYCON DRESS 0.340127
SubCategoryCAMISOLES 2.50e-06 ***
SubCategoryCAPRIS 0.364536
SubCategoryCAPS & HATS 0.426619
SubCategoryCASUAL JACKETS 3.96e-14 ***
SubCategoryCASUAL SHIRTS 0.000266 ***
SubCategoryCHINOS 0.686927
SubCategoryDENIM JACKETS 0.360819
SubCategoryDRESSES 0.447304
SubCategoryETHNIC JACKETS 0.218731
SubCategoryFORMAL SHIRTS 0.498178
SubCategoryFORMAL TROUSERS 0.239894
SubCategoryGSTOP235BLK 0.083620 .
SubCategoryHENLEY T-SHIRTS 0.271658
SubCategoryHIGH NECK T SHIRTS 0.708570
SubCategoryJACKETS & BLAZERS < 2e-16 ***
SubCategoryJEANS 0.654317
SubCategoryJEGGINGS 0.661426
SubCategoryJUMPSUITS 0.751964
SubCategoryKURTAS 0.281743
SubCategoryKURTIS 0.411172
SubCategoryLEGGINGS 0.186170
SubCategoryLOUNGEWEAR 0.103742
SubCategoryMANDARIN T-SHIRTS 0.052587 .
SubCategoryNIGHTWEAR 0.786187
SubCategoryOFF SHOULDER/TUBE/HALTER 0.652982
SubCategoryPOLO-T-SHIRTS 0.666444
SubCategoryQUILTED JACKETS 0.356329
SubCategoryROUND NECK SWEATERS 0.706241
SubCategoryROUND NECK T-SHIRTS 0.119631
SubCategoryROUND NECK VESTS 0.296704
SubCategorySALWAR 0.386200
SubCategorySCARVES 0.468894
SubCategorySHIFT DRESS 0.944933
SubCategorySHIRTS 0.869988
SubCategorySHORTS 0.065874 .
SubCategorySHRUGS 0.394042
SubCategorySKATER DRESS 0.249643
SubCategorySKIRTS 0.551877
SubCategorySTOCKINGS 0.072028 .
SubCategorySUITS < 2e-16 ***
SubCategorySUMMER JACKETS 0.858198
SubCategorySWEATERS 0.613288
SubCategorySWEATSHIRTS 0.079675 .
SubCategoryT SHIRTS 0.158973
SubCategoryTOPS 0.317183
SubCategoryTRACK PANTS 0.293088
SubCategoryTRACKPANTS 0.233005
SubCategoryTRACKSUITS 0.006940 **
SubCategoryTROUSERS 0.429971
SubCategoryTUNICS 0.544167
SubCategoryV NECK SWEATERS 0.822828
SubCategoryV NECK T-SHIRTS 4.29e-05 ***
SubCategoryWAISTCOATS 0.353086
SubCategoryWINTER JACKETS 0.196385
ISMetro 9.38e-05 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 0.1507 on 45434 degrees of freedom
Multiple R-squared: 0.8512, Adjusted R-squared: 0.8509
F-statistic: 3609 on 72 and 45434 DF, p-value: < 2.2e-16
plot(logModel,3)
ncvTest(logModel)
Non-constant Variance Score Test
Variance formula: ~ fitted.values
Chisquare = 8760.055, Df = 1, p = < 2.22e-16
library(lmtest)
bptest(logModel)
studentized Breusch-Pagan test
data: logModel
BP = 2229.7, df = 72, p-value < 2.2e-16
Inference: From the plot and the hypothesis testing we conclude that the data is heteroscedastic.
vif(logModel)
GVIF Df GVIF^(1/(2*Df))
MRP 4.269575 1 2.066295
COD 1.374700 1 1.172476
Brand 182.604223 9 1.335486
CODCharge 1.422037 1 1.192492
HasVendorDiscount 1.741956 1 1.319832
HasWebsiteDiscount 1.341829 1 1.158373
SubCategory 198.978370 57 1.047526
ISMetro 1.038111 1 1.018877
sqrt(vif(logModel)) > 2
GVIF Df GVIF^(1/(2*Df))
MRP TRUE FALSE FALSE
COD FALSE FALSE FALSE
Brand TRUE TRUE FALSE
CODCharge FALSE FALSE FALSE
HasVendorDiscount FALSE FALSE FALSE
HasWebsiteDiscount FALSE FALSE FALSE
SubCategory TRUE TRUE FALSE
ISMetro FALSE FALSE FALSE
Inference: From the table we can see that multicollinearity doesn't exist between variables in the model.