Melbourne is the state capital and most populous city of the Australian state of Victoria, and the second-most populous city in Australia. Melbourne rates highly in education, entertainment, health care, research and development, tourism and sport making it the world’s most liveable city. Melbourne has minimal public housing and high demand for rental housing, which is becoming unaffordable for some.Public housing is usually provided by the Housing Commission of Victoria, and operates within the framework of the Commonwealth-State Housing Agreement, by which federal and state governments provide housing funding. But when it comes to properties, and its buying and selling, it becomes a very crutial issue. This is because a property’s price reflects an assessment of the value that the property has based on the locality and various other factors, and also their(buyers) willingness-to-pay for the property. This analysis is focussed on the “pricing of the property”, based on the quality of accomodation it provides.
Our research study concerns property/housing prices in Melbourne which is located in Australia. It is the beating heart of the city and a magnet for everyone who surrounds it. Melbourne is experiencing high population growth, generating high demand for housing. This housing boom has increased house prices and rents, as well as the availability of all types of housing. Subdivision regularly occurs in the outer areas of Melbourne, with numerous developers offering house and land packages. However, after 10 years of planning policies to encourage medium-density and high-density development in existing areas with greater access to public transport and other services, Melbourne’s middle and outer-ring suburbs have seen significant brownfields redevelopment. Based on strategies, the prices of the housing are decided based on the numbers of rooms and on other accomodation factors. We empirically study how the factors like distance from the housing, number of rooms, landsize and car-parking areas influence the housing prices.
The main motive behind this Study was to investigate the pricing strategy employed by housingslocated in the city of Melbourne. This study analyzed property prices at Melbourne, Australia. Our goal was to compare prices of housings on the basis of the number of rooms, with the parking area and landsize. The rationale behind this is summarized next.
Hypothesis H1: There is no significant difference in the prices of the housing which are having more landsize than those which occupy less area.
To conduct this Study, the data was collected from the website (https://www.kaggle.com/datasets). The Housing societies in the city of Melbourne has a lot of various. Beacuse this data was collected only from a few areas, so the sellers are common for most of it. The Housing Society has a lot of variations, not excatly variations but can be called types. The properties differ on the basis of the “number of rooms” it has and “how many bedrooms and bathrooms” it has and “the landsize of the property”.
Locality: Locality, that is, the place where the societies are situated are highly important because that will decide the kind of buyers of those housing. The finanial status of the customers will decide with housing they’d prefer according to their suitable locality. Locality also refers to the connectivity of that place to the rest of the city with ease.
Family Size: The family size also depends on how big or small a property you want to buy. again, if the family is huge, it might have kids and working ppeople who will require good and efficient means to commute from one place to another, connectivity with schools and shops around.
In order to test Hypothesis 1a, we proposed the following model:
Price ~ Rooms+Landsize+Bedroom+Bathroom+Car
regg<- lm(MHousing.df$Price~MHousing.df$Rooms+MHousing.df$Landsize+MHousing.df$Bedroom2+MHousing.df$Bathroom+MHousing.df$Car)
summary(regg)
##
## Call:
## lm(formula = MHousing.df$Price ~ MHousing.df$Rooms + MHousing.df$Landsize +
## MHousing.df$Bedroom2 + MHousing.df$Bathroom + MHousing.df$Car)
##
## Residuals:
## Min 1Q Median 3Q Max
## -3074991 -323854 -106127 229091 9557236
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 68734.805 15027.355 4.574 4.82e-06 ***
## MHousing.df$Rooms 220485.592 14835.579 14.862 < 2e-16 ***
## MHousing.df$Landsize 2.220 1.121 1.980 0.0478 *
## MHousing.df$Bedroom2 -14473.980 14602.386 -0.991 0.3216
## MHousing.df$Bathroom 243346.872 8000.809 30.415 < 2e-16 ***
## MHousing.df$Car 8416.958 4932.561 1.706 0.0880 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 560000 on 15760 degrees of freedom
## (15165 observations deleted due to missingness)
## Multiple R-squared: 0.2621, Adjusted R-squared: 0.2619
## F-statistic: 1120 on 5 and 15760 DF, p-value: < 2.2e-16
t.test(MHousing.df$Price, MHousing.df$Landsize)
##
## Welch Two Sample t-test
##
## data: MHousing.df$Price and MHousing.df$Landsize
## t = 253.9, df = 24198, p-value < 2.2e-16
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## 1043882 1060125
## sample estimates:
## mean of x mean of y
## 1052590.4437 587.0988
We here see the effect of the above mentioned factors on the price of a property with the simplest regression model.
From the regression analysis, we see that the price of the Melbourne housing is greatly affected by the number of rooms in total, especially The nuber of bathrooms as it is a great necessity. The landsize undoubtly matters for the buys because it gives him the value of asset he has bought.
Also through this t-test we reject our null hypothesis becuase the p-value is less than 0.05, which tells us that there is a significant diiference in the prices of the properties which have different landsizes. If we look at the regression model, we see that the price will increase by 1$, if the landsize changes by 2.2 unit square.
This paper was motivated by the need for research that could improve our understanding building patterns influences the pricing strategies in a housing society. The attrative research was that to show, prices actually influneces on many factors. It is just not the choice of a seller to set a price. There are strategies an importnat though-process which undergoes in deciding the prices of the housing.
https://www.kaggle.com/datasets https://en.wikipedia.org/wiki/Melbourne#Housing https://www.udemy.com/data-science-and-analytics-using-r/learn/v4/overview
library("psych", lib.loc="~/R/win-library/3.4")
describe(MHousing.df[,c(3,5,9,11:15)])
## vars n mean sd median trimmed mad
## Rooms 1 30931 3.02 0.97 3 2.99 1.48
## Price 2 24197 1052590.44 644499.49 875000 953681.82 437367.00
## Distance* 3 30931 117.62 77.58 101 117.00 109.71
## Bedroom2 4 23868 3.05 0.99 3 3.02 1.48
## Bathroom 5 23862 1.61 0.72 1 1.50 1.48
## Car 6 23497 1.70 1.00 2 1.61 1.48
## Landsize 7 20405 587.10 3540.11 500 451.57 329.14
## BuildingArea 8 12397 158.34 418.09 133 141.88 59.30
## min max range skew kurtosis se
## Rooms 1 16 15 0.54 2.75 0.01
## Price 85000 11200000 11115000 2.59 13.33 4143.26
## Distance* 1 232 231 0.09 -1.62 0.44
## Bedroom2 0 30 30 1.52 28.69 0.01
## Bathroom 0 12 12 1.42 5.30 0.00
## Car 0 26 26 2.14 22.71 0.01
## Landsize 0 433014 433014 95.26 11096.44 24.78
## BuildingArea 0 44515 44515 96.90 10223.76 3.76
table23<-table(MHousing.df$Type)
table23
##
## h t u
## 21160 3178 6593
prop.table(table23)*100
##
## h t u
## 68.41033 10.27448 21.31519
table22<-xtabs(~MHousing.df$Type+MHousing.df$Method)
table22
## MHousing.df$Method
## MHousing.df$Type PI PN S SA SN SP SS VB W
## h 2733 155 12555 138 944 2896 23 1630 86
## t 533 40 1693 24 108 446 4 319 11
## u 949 69 3439 40 171 1161 4 707 53
prop.table(table22)*100
## MHousing.df$Method
## MHousing.df$Type PI PN S SA
## h 8.83579580 0.50111539 40.59034625 0.44615434
## t 1.72319033 0.12932010 5.47347321 0.07759206
## u 3.06811936 0.22307717 11.11829556 0.12932010
## MHousing.df$Method
## MHousing.df$Type SN SP SS VB
## h 3.05195435 9.36277521 0.07435906 5.26979406
## t 0.34916427 1.44191911 0.01293201 1.03132779
## u 0.55284343 3.75351589 0.01293201 2.28573276
## MHousing.df$Method
## MHousing.df$Type W
## h 0.27803821
## t 0.03556303
## u 0.17134913
boxplot(MHousing.df$Price, xlab="Price of the property",ylab="Melbourne Housing", horizontal=TRUE)
boxplot(MHousing.df$Rooms, xlab="Number of rooms",ylab="Melbourne Housing", horizontal=TRUE)
boxplot(MHousing.df$Bedroom2, xlab="Number of bedrooms",ylab="Melbourne Housing", horizontal=TRUE)
boxplot(MHousing.df$Bathroom, xlab="Number of bathrooms",ylab="Melbourne Housing", horizontal=TRUE)
boxplot(MHousing.df$Car, xlab="Number of cars",ylab="Melbourne Housing", horizontal=TRUE)
boxplot(MHousing.df$Landsize, xlab="Landsize of the property",ylab="Melbourne Housing", horizontal=TRUE)
library("corrgram", lib.loc="~/R/win-library/3.4")
corrgram(MHousing.df, order=TRUE, lower.panel=panel.shade,
upper.panel=panel.pie, text.panel=panel.txt,
main="Corrgram of Variables")
cor.test(MHousing.df$Price, MHousing.df$Rooms)
##
## Pearson's product-moment correlation
##
## data: MHousing.df$Price and MHousing.df$Rooms
## t = 82.907, df = 24195, p-value < 2.2e-16
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
## 0.4604913 0.4801167
## sample estimates:
## cor
## 0.4703622
cor.test(MHousing.df$Price, MHousing.df$Landsize)
##
## Pearson's product-moment correlation
##
## data: MHousing.df$Price and MHousing.df$Landsize
## t = 4.2764, df = 15944, p-value = 1.91e-05
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
## 0.01833626 0.04934331
## sample estimates:
## cor
## 0.03384793
library("car", lib.loc="~/R/win-library/3.4")
##
## Attaching package: 'car'
## The following object is masked from 'package:psych':
##
## logit
scatterplot(MHousing.df$Price,MHousing.df$Rooms,xlab = "price", ylab = "number of rooms")
library("car", lib.loc="~/R/win-library/3.4")
scatterplot(MHousing.df$Price,MHousing.df$Landsize,xlab = "price", ylab = "landsize")