Building energy consumption currently account for around 40% of total energy consumption and 70% of energy demand in the U.S.. By creating tool to help building managers to make decision on increasing building efficiency, it will be one of the easiest ways reduce environmental impact.
Energy benchmarking is a non-labor intensive process on creating a relative energy efficiency metrics by comparing similar buildings based on the characteristics, such as weather, gross area, location. The result can later be translate into an actionable insight for the building manager, to have a better understanding on the saving potential and drive cost effective change.
This project will be applying quantile regression to benchmark the total building consumption for universities in the United States. The methodology is based on research paper by Jonathan Roth and Ram Rajagopal from Stanford University, “Benchmarking building energy efficiency using quantile regression”. Although their original application is on individual buildings, a similar approach can be apply to university campus with slight adjustments. Roths address issues current benchmarking methods by creating the following practices:
1. Cumulative Distribution Plot that showing the individualized information, such as theoretical maximum performance.
2. Reduces outlier effect. Assuming there has no error in data, the outlier can be representing the very efficient building, therefore by applying the conventional way, which is ordinary lease square (OLS) regression, the end result will be either underestimated or overestimated.
This document uses Stevens Institute of Technology as an example to provides guidance on interpreting the dashboard. The example contains both baseline and performance year data, while the dashboard only contains performance year data at the current stage.
The dataset is extracted from The Association for the Advancement of Sustainability in Higher Education (AASHE), which contains sustainability related data of over 200 universities across United States and Canada. The features of the dataset are selected based on the result of research by Roth, 2018. If time is allowed. this project will perform feature selection with additional variables.
| School | Consumption | Employees | Student | Health | Cool | Area | Heat | Lab |
|---|---|---|---|---|---|---|---|---|
| Agnes Scott College | 63336.1 | 418 | 917.000 | 5601 | 1550 | 1022795 | 3148 | 25492.0 |
| American University | 316968.9 | 3338 | 12504.000 | 4815 | 1908 | 4553432 | 3311 | 107561.6 |
| Antioch College | 25589.0 | 154 | 133.000 | 0 | 900 | 388246 | 4858 | 20000.0 |
| Arizona State University | 1916420.6 | 10460 | 87955.000 | 30129 | 5071 | 23316686 | 829 | 1622712.0 |
| Auburn University | 1216793.0 | 5873 | 24849.000 | 163336 | 2331 | 11846397 | 1809 | 478837.0 |
| Austin College | 105806.7 | 338 | 1232.143 | 3900 | 2677 | 1011199 | 2344 | 29925.0 |
| Data Name | Brieft Description | Unit |
|---|---|---|
| Consumption | Total Building Energy Consumption | MMBtu |
| Employees | Number of Employees (Faculty + Staffs) | N/A |
| Student | Number of Full time Student | N/A |
| Health | Total Area of Healthcare | sqft |
| Cool | Cooling Degree Day (65°F Above) | Degree-Days (°F) |
| Area | Total Gross Building Area | sqft |
| Heat | Heating Degree Day (65°F Below) | Degree-Days (°F) |
| Lab | Total Area of Labortory | sqft |
| Year | Number of Employees | Number of Student | Area of Healthcare (sqft) | Coolday Degree | Total Gross Area (sqft) | Heatday | Lab Area (sqft) |
|---|---|---|---|---|---|---|---|
| 2015 | 1255 | 5129 | 0 | 1348 | 1480090 | 5172 | 131359 |
| 2019 | 863 | 6929 | 0 | 1454 | 1480090 | 4773 | 131359 |
There are several limitations and problems with the existing energy efficiency benchmark products and studies, including partial set of explanatory variable, low resolution data and using solely energy use intensity as a primary metrics. The following section will further discuss how the propose methodology solve the limitation to the existing problem:
The following section will briefly explain the proposed benchmarking model with the help of flowchart (Roth, 2018), and follow by step by step text description.
Quantile Regression: Quantile Regression is chosen because this method is useful when extreme is important, while in this case is to identify the best and worst energy efficient universities. It provides a complete picture with conditional distribution for each quantile (tau value). Unlike Ordinary Lease Square (OLS), it does not require the data to have normal residual and constant variance.
Individualized cumulative distribution function (CDF): The second step of the benchmarking model is to generate cumulative distribution function for the Stevens Institute of Technology. CDF plot can be interpret as theoretical distribution for each building.
By applying quantile regression on energy benchmarking, 99 models will be build (tau value from 0.01 to 0.99. In this case, tau value is from 0.05 to 0.95). Those models will later be aggregate conditional distribution across quantile, to form cumulative distribution plot for the target university based on the given input. From the chart, campus manager then able to realized the saving potential, and to set saving target based on the quantile (score) target to be achieve.
Influence Plot: Influence Plot is the plot of coefficient respect with quantile for each features from the model. The influence of each feature on the energy efficiency can be observed.
## Using School as id variables
Based on the actual energy consumption of Stevens Institute of Technology (SIT), the score can be obtained from the cumulative distribution plot. Using the information from year 2015 and year 2018, the scores are 71 and 92 respectively. It is alarming signal to the campus manager that the campus is becoming less energy efficient, and consuming more energy than their comparable peers that sharing the similar characteristics. The significant decrease in energy efficiency may due to the lack of change total gross area of building data from 2015 to 2018, therefore buildings that is built in between year 2015 and year 2018, such as Babbio Garage and new construction site are not include in the data analysis process.
The density plot shows the density of quantile point respect with the change in normalized energy usage. The higher the density at a specific level, it is more easier for the campus manager to alter the energy efficiency score. Comparing with year 2015 and year 2018, it is more difficult in year 2018 to increase energy efficiency. The tipping point for both years occur on the left end, and for year 2018 is further left on the chart, which means it is more difficult to reach the tipping point from the current level.
Jonathan Roth, Ram Rajagopal, Benchmarking building energy efficiency using quantile regression, Energy, Volume 152, 2018, Pages 866-876, ISSN 0360-5442, https://doi.org/10.1016/j.energy.2018.02.108. (http://www.sciencedirect.com/science/article/pii/S0360544218303360)