For this project, I decided to use city data sets public and available to download. Most cities/counties are required to track certain services or metrics that could be useful for the communities.
Among the various cities available, I chose Chicago. The reason for this is the rapid expansion the city is having. Additionally, Chicago is an interesting city to analyze due to its diverse community.
To decide on a specific data set, I explored the Chicago Data Portal and its various catalogs (https://data.cityofchicago.org/)
Finally, I decided to work with the Business Licenses - Current Active data set. Link: https://data.cityofchicago.org/Community-Economic-Development/Business-Licenses-Current-Active/uupf-x98q/about_data
sapply(BusinessLic, class)
## ID LICENSE ID
## "character" "numeric"
## ACCOUNT NUMBER SITE NUMBER
## "numeric" "numeric"
## LEGAL NAME DOING BUSINESS AS NAME
## "character" "character"
## ADDRESS CITY
## "character" "character"
## STATE ZIP CODE
## "character" "character"
## WARD PRECINCT
## "numeric" "numeric"
## WARD PRECINCT POLICE DISTRICT
## "character" "numeric"
## COMMUNITY AREA COMMUNITY AREA NAME
## "numeric" "character"
## NEIGHBORHOOD LICENSE CODE
## "character" "numeric"
## LICENSE DESCRIPTION BUSINESS ACTIVITY ID
## "character" "character"
## BUSINESS ACTIVITY LICENSE NUMBER
## "character" "numeric"
## APPLICATION TYPE APPLICATION CREATED DATE
## "character" "character"
## APPLICATION REQUIREMENTS COMPLETE PAYMENT DATE
## "character" "character"
## CONDITIONAL APPROVAL LICENSE TERM START DATE
## "character" "character"
## LICENSE TERM EXPIRATION DATE LICENSE APPROVED FOR ISSUANCE
## "character" "character"
## DATE ISSUED LICENSE STATUS
## "character" "character"
## LICENSE STATUS CHANGE DATE SSA
## "logical" "numeric"
## LATITUDE LONGITUDE
## "numeric" "numeric"
## LOCATION
## "character"
The Business Licenses data set contains all the business licenses issued by the Department of Business Affairs and Consumer Protection in Chicago from 2022 - Present. Licenses issued include new requests, renewals and other types of applications. Each issued license includes:
The name of the business
The type of business
Location within Chicago
Issue and Expiration date
Additional Information of the business that requests the license to operate
The main idea of my project is to explore the how the business activity is moving through the city. I want to explore
Possible expansions in business activity
Trends in emerging new businesses
Shifts in business types and business districts
Below is a preview of the data
datatable(data = Test_Set,
class = 'compact',
options = list(scrollX = TRUE))
Note that the echo = FALSE parameter was added to the
code chunk to prevent printing of the R code that generated the
plot.