- Data Gathering
- Airbnb and Zillow Data Quality
- EDA of Final Data
- Analysis & Metadata
- Final Data Visuals
- ROI Zip Codes
- Popular Zip Codes
- Conclusion
- Future Research
2/5/2021
ABNB = read.csv('https://dataprojects1.s3.amazonaws.com/listingsCap1.csv')
Zillow_cost = read.csv('https://dataprojects1.s3.amazonaws.com/Zip_Zhvi_2B.csv')
dim(Zillow_cost)
[1] 8946 262
- Property Costs have been increasing steadily
| zipcode | Median_cost | Cities |
|---|---|---|
| 10003 | 2147000 | New York |
| 10011 | 2480400 | New York |
| 10013 | 3316500 | New York |
| 10014 | 2491600 | New York |
| 10021 | 1815600 | New York |
| 10022 | 2031600 | New York |
dim(ABNB)
[1] 48895 106
| zip | price | state | p_type | Neighbourhood | Cl_fee | Beds | Reviews | Rating |
|---|---|---|---|---|---|---|---|---|
| 10029 | 190 | NY | Apartment | Manhattan | NA | 2 | 0 | NA |
| 11221 | 115 | NY | Townhouse | Brooklyn | 85 | 2 | 11 | 94 |
| 11206 | 228 | NY | Loft | Brooklyn | 128 | 2 | 82 | 94 |
| 10001 | 375 | NY | Apartment | Manhattan | 120 | 2 | 5 | 100 |
| 10162 | 250 | NY | Apartment | Manhattan | 200 | 2 | 66 | 93 |
| 11215 | 225 | NY | Condominium | Brooklyn | NA | 2 | 4 | 100 |
Combined_data1 = ABNB_2 %>% inner_join(Zillow_final, by = 'zipcode')
dim(Combined_data1)
[1] 1563 15
[1] FALSE
The purpose of the study is to find zip codes with attractive ROIs. To do so, we created new variables on our cleaned and merged data which can assist the client to make an educated financial decision.
| zipcode | Median_cost | Annual_Rev | Profit | Breakeven_years | Profit_in_5 | ROI_in_5 | Profit_in_10 | ROI_in_10 |
|---|---|---|---|---|---|---|---|---|
| 10003 | 2147000 | 87417.50 | 75179.05 | 28.55852 | -1771105 | 0.1750793 | -1395210 | 0.3501586 |
| 10011 | 2480400 | 110020.12 | 94617.31 | 26.21541 | -2007313 | 0.1907299 | -1534227 | 0.3814599 |
| 10013 | 3316500 | 118716.25 | 102095.98 | 32.48414 | -2806020 | 0.1539213 | -2295540 | 0.3078425 |
| 10014 | 2491600 | 98066.38 | 84337.08 | 29.54407 | -2069915 | 0.1692428 | -1648229 | 0.3384856 |
| 10021 | 1815600 | 74460.00 | 64035.60 | 28.35298 | -1495422 | 0.1763483 | -1175244 | 0.3526966 |
| 10022 | 2031600 | 108843.00 | 93604.98 | 21.70397 | -1563575 | 0.2303726 | -1095550 | 0.4607451 |
According payback period, zip codes: - 10306 - 10303 - 11234 - 11234 - 10304 - 11434, will be paid off first
In terms of revenue, zip codes: 10022 - 10036 - 11201 - 11215 - 11231 have the highest sales
Looking at the ROI in 5 years, our best zip codes are:
- The outcome does not seem to change from the 5 year to 10 year mark
According to the analysis, the Zip codes that will maximize the return on investment are:
10306
10303
11234
10304
11434
Now, let us know more about these codes
| zipcode | mean.price | mean.review | median_cost | Revenue | location | review.score |
|---|---|---|---|---|---|---|
| 10303 | 104.00000 | 18.00000 | 327700 | 39766.75 | Staten Island | 91.75000 |
| 10304 | 93.33333 | 31.66667 | 328300 | 37047.50 | Staten Island | 91.66667 |
| 10306 | 117.50000 | 10.50000 | 352900 | 52240.62 | Staten Island | 89.00000 |
| 11234 | 135.11111 | 34.88889 | 476900 | 55315.75 | Brooklyn | 94.25000 |
| 11434 | 136.87500 | 37.12500 | 382300 | 42294.38 | Queens | 95.53846 |
Up to this point, we have analyzed zip codes without taking an important metric into account, demand. Airbnb demand is difficult to forecast, yet we can take the number of reviews and rating as sound estimates for popularity.
| zipcode | mean.review | Ratings |
|---|---|---|
| 10308 | 43.00000 | 100.0 |
| 11234 | 34.88889 | 96.5 |
| 11215 | 30.13228 | 97.0 |
| 11231 | 26.40860 | 97.0 |
| 10305 | 25.66667 | 98.0 |
| 11217 | 22.61789 | 98.0 |
| zipcode | mean.price | mean.review | median_cost | Revenue | Profit | Profit_in_5 | ROI_in_5 | location | review.score |
|---|---|---|---|---|---|---|---|---|---|
| 11234 | 125.0 | 34.88889 | 476900 | 55315.75 | 47571.54 | -239042.3 | 0.4987581 | Brooklyn | 94.25000 |
| 10305 | 111.0 | 25.66667 | 425100 | 44612.12 | 38366.43 | -233267.9 | 0.4512636 | Staten Island | 95.36364 |
| 10308 | 109.5 | 43.00000 | 409500 | 35998.12 | 30958.39 | -254708.1 | 0.3780023 | Staten Island | 100.00000 |
| 11231 | 188.0 | 26.40860 | 1202900 | 70810.00 | 60896.60 | -898417.0 | 0.2531241 | Brooklyn | 95.07595 |
| 11215 | 169.0 | 30.13228 | 1070800 | 62123.00 | 53425.78 | -803671.1 | 0.2494667 | Brooklyn | 95.28144 |
The goal of this study was to clear the noise in the data, and provide a list of zip codes which can provide superior ROIs in the NY real estate market. To accomplish this objective, we cleaned and join our tables, created variables that measure ROI, and analyzed those metrics. As a result, these are our findings:
ROI Zip Codes
Popular Zip Codes
Explore other potential costs (mortgage, reparations, property manager, taxes)
The idea of property appreciation is an important aspect to study.
For this study, we did not take occupancy and demand into account. This metric should be explored in the future.