Obtaining the data

GSA open data web site

You can search GSA's APIs and datasets through its open data web site. The API is available here: https://open.gsa.gov/data.json

You have to be careful with the API because it returns a relatively complex list of lists. What do I mean? I mean that not every field is present in every entry of the list (as of 19/02/2020 there are 251 entries), and sometimes the same piece of information is stored in different ways (for example, to download a dataset you sometimes have to refer to downloadURL and sometimes to accessURL).
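To illustrate the downloadURL/accessURL inconsistency, here is a minimal sketch of a helper that tries both fields in turn. The function name get_distribution_url and the two example entries are my own assumptions for illustration; they are not part of the GSA catalog.

```r
# Return the first usable link found in one `distribution` entry.
# Some entries carry `downloadURL`, others only `accessURL`.
get_distribution_url <- function(distribution) {
  for (field in c("downloadURL", "accessURL")) {
    value <- distribution[[field]]
    if (!is.null(value) && nzchar(value)) {
      return(value)
    }
  }
  NA_character_  # neither field is present
}

# Two made-up entries that mimic the two shapes described above
with_download <- list(title = "CSV file", downloadURL = "https://example.org/data.csv")
with_access   <- list(title = "API", accessURL = "https://example.org/api")

get_distribution_url(with_download)
get_distribution_url(with_access)
get_distribution_url(list(title = "No link at all"))
```

Returning NA instead of stopping with an error makes it easy to apply the helper to every entry and filter out the missing links afterwards.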

Nevertheless, there is a wide range of information available. The following table shows which datasets and APIs are included:

From that table, I chose the “Fed. Data Center Consolidation Initiative (FDCCI) Data Center Closings 2010-2015” dataset, which can be accessed through the command raw_data$dataset[[75]]$distribution[[1]]$downloadURL (I defined raw_data as raw_data <- rjson::fromJSON(file="https://open.gsa.gov/data.json")).
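Because the position of a dataset in the list (75 in my case) may change whenever the catalog is updated, looking the entry up by title is probably safer. The sketch below uses a small stand-in list instead of the real raw_data; find_dataset and mock_catalog are hypothetical names, and the example.org URL is a placeholder.

```r
# Return the position(s) of the catalog entries whose title contains `pattern`.
find_dataset <- function(catalog, pattern) {
  titles <- vapply(
    catalog$dataset,
    function(d) if (is.null(d[["title"]])) NA_character_ else d[["title"]],
    character(1)
  )
  which(grepl(pattern, titles, fixed = TRUE))
}

# Tiny stand-in with the same nesting as the list returned by rjson::fromJSON()
mock_catalog <- list(dataset = list(
  list(title = "Some other dataset"),
  list(
    title = "Fed. Data Center Consolidation Initiative (FDCCI) Data Center Closings 2010-2015",
    distribution = list(list(downloadURL = "https://example.org/fdcci.csv"))
  )
))

idx <- find_dataset(mock_catalog, "FDCCI")
# Several titles could match, so take the first hit explicitly
mock_catalog$dataset[[idx[1]]]$distribution[[1]]$downloadURL
```

With the real data, the same call would be find_dataset(raw_data, "FDCCI"), assuming every entry stores its title in a title field.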



Data centers data analysis

The first thing that caught my attention was the number of government departments and agencies involved in the centralization process: 20 in total, ranging from the Department of Defense (DoD) to the National Science Foundation.
But the most important number is the amount of data center floor space involved in the centralization: 1.700.090 square feet. The following treemap shows how many square feet of data center were shut down in each Department during the 2010-2015 period (see footnote 5):

The full list with the number of data centers and square feet per Department can be found in the Annex of this document.
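The per-Department totals can be obtained with a simple aggregation. This is only a sketch on a made-up data frame: the column names department and square_feet are assumptions, and the real file may name them differently.

```r
# Made-up sample: one row per closed data center
closings <- data.frame(
  department  = c("Agency A", "Agency A", "Agency B"),
  square_feet = c(1200, 300, 500)
)

# Add a column of ones so that summing it counts the data centers,
# then sum both columns within each department
by_department <- aggregate(
  cbind(square_feet, quantity) ~ department,
  data = transform(closings, quantity = 1),
  FUN  = sum
)

# Largest floor space first
by_department <- by_department[order(-by_department$square_feet), ]
by_department
```

Note that the formula interface of aggregate() drops rows with missing values, so records without a square feet figure would have to be handled before this step.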



Location of closing data centers

Unfortunately, only 16,8% of the 4.116 data centers covered in the database have an address. The exact location of those data centers can be seen in the following map:


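The share of records with an address can be computed along these lines. It is a sketch that assumes the addresses sit in a character column and that missing ones appear either as NA or as empty strings; share_with_address is a hypothetical helper and the sample values are made up.

```r
# Percentage of entries that contain a non-empty address
share_with_address <- function(address) {
  has_address <- !is.na(address) & nzchar(trimws(address))
  100 * mean(has_address)
}

# One real-looking address, one NA, one empty string, one blank string
sample_addresses <- c("123 Example St", NA, "", "  ")
share_with_address(sample_addresses)
```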




Annex

The following table shows, by Department, the total square feet and the number of data centers shut down as a consequence of the Data Center Consolidation Initiative:

Department                                     SquareFeet  Quantity
Department of Defense                             700.939       577
Department of the Treasury                        351.701       575
Department of Agriculture                         135.535     2.264
Department of Commerce                             98.562        80
Department of Justice                              62.860        69
Department of the Interior                         62.628       149
Department of Health and Human Services            60.147        64
National Aeronautics and Space Administration      43.997        30
Department of State                                32.144         8
General Services Administration                    29.589        90
Department of Labor                                26.461        48
Department of Veterans Affairs                     23.874        24
Department of Homeland Security                    21.384        41
Department of Energy                               18.330        15
Environmental Protection Agency                    18.038        28
Department of Transportation                       11.584        49
Department of Education                             1.042         2
Nuclear Regulatory Commission                         740         1
U.S. Agency for International Development             535         1
National Science Foundation                             0         1
Total                                           1.700.090     4.116
1 Quantity = number of data centers per Department.
2 CAUTION: Numbers are formatted according to European notation: decimal.mark = “,”, big.mark = “.”
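For reference, this notation can be produced in base R with format(). The wrapper fmt_eu below is a hypothetical helper name, shown only as a sketch of the idea.

```r
# Format numbers with "." as thousands separator and "," as decimal mark
fmt_eu <- function(x) {
  format(x, big.mark = ".", decimal.mark = ",", scientific = FALSE)
}

fmt_eu(1700090)  # "1.700.090"
fmt_eu(16.8)     # "16,8"
```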

  1. I have put the concept of System in capital letters, because I want to emphasize the idea of a Socio-Economic System, as opposed to (solely) an E-procurement system.

  2. https://www.gsa.gov/

  3. https://datacenters.cio.gov/

  4. To see ITDashboard click here.

  5. I’m assuming they were closed.