Head of the Dataset

    County License.Number Operation.Type Establishment.Type
1   ALBANY         752247          Store                  A
2   ALBANY         750931          Store                 AC
3 ALLEGANY         710458          Store                 AC
4 ALLEGANY         728010          Store                 AC
5    BRONX         747358          Store                 AC
6    BRONX         748332          Store                 AC
             Entity.Name              DBA.Name Street.Number  Street.Name
1          ANK PETRO INC                 CITGO          2450       RTE 9W
2            EVANS JULIA               TERROSA             8    VERDA AVE
3             HUBRIX LLC STEVES GAS N GRUB HUB            41      MAIN ST
4        REID STORES INC         CROSBYS 40090            66   GENESEE ST
5   361 DELI GROCERY LLC      361 DELI GROCERY           361 E 204 STREET
6 AMAA GOURMET DELI CORP     AMAA GOURMET DELI          1591   WATSON AVE
  Address.Line.2 Address.Line.3        City State Zip.Code Square.Footage
1                                    RAVENA    NY    12143             NA
2                               CLARKSVILLE    NY    12041             NA
3                                   ANDOVER    NY    14806           1100
4                                      CUBA    NY    14727             NA
5                                     BRONX    NY    10467             NA
6                                     BRONX    NY    10472             NA
                        Georeference
1  POINT (-73.816249346 42.46925125)
2 POINT (-73.949128787 42.577416307)
3 POINT (-77.795356304 42.158830586)
4 POINT (-78.277036649 42.220633641)
5 POINT (-73.877219853 40.871759695)
6 POINT (-73.875481195 40.826581028)
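The Georeference column holds text of the form "POINT (x y)", so the coordinates cannot be used numerically as they stand. A minimal sketch of splitting them into two numeric columns follows. It assumes every value follows the pattern shown above and that the first number is the longitude; the names geo, lon and lat are illustrative and not part of the dataset.

```r
# Three values copied from the head() output above.
geo <- c("POINT (-73.816249346 42.46925125)",
         "POINT (-73.949128787 42.577416307)",
         "POINT (-77.795356304 42.158830586)")

# Keep only the text between "POINT (" and ")".
coords <- sub("^POINT \\((.*)\\)$", "\\1", geo)

# Split each "x y" pair on whitespace.
parts <- strsplit(coords, " +")

# Assumption: x is longitude and y is latitude.
lon <- as.numeric(vapply(parts, `[`, character(1), 1))
lat <- as.numeric(vapply(parts, `[`, character(1), 2))

data.frame(lon = lon, lat = lat)
```

Rows whose Georeference does not match the pattern would produce NA with a coercion warning, so it may be worth checking for those before relying on the result.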

Summary of the Dataset

    County          License.Number   Operation.Type     Establishment.Type
 Length:24221       Min.   : 10008   Length:24221       Length:24221      
 Class :character   1st Qu.:610637   Class :character   Class :character  
 Mode  :character   Median :721708   Mode  :character   Mode  :character  
                    Mean   :625830                                        
                    3rd Qu.:745316                                        
                    Max.   :763163                                        
                                                                          
 Entity.Name          DBA.Name         Street.Number      Street.Name       
 Length:24221       Length:24221       Length:24221       Length:24221      
 Class :character   Class :character   Class :character   Class :character  
 Mode  :character   Mode  :character   Mode  :character   Mode  :character  
                                                                            
                                                                            
                                                                            
                                                                            
 Address.Line.2     Address.Line.3         City              State          
 Length:24221       Length:24221       Length:24221       Length:24221      
 Class :character   Class :character   Class :character   Class :character  
 Mode  :character   Mode  :character   Mode  :character   Mode  :character  
                                                                            
                                                                            
                                                                            
                                                                            
    Zip.Code     Square.Footage   Georeference      
 Min.   : 6390   Min.   :     0   Length:24221      
 1st Qu.:10958   1st Qu.:  1000   Class :character  
 Median :11379   Median :  1800   Mode  :character  
 Mean   :11856   Mean   :  6742                     
 3rd Qu.:12801   3rd Qu.:  4000                     
 Max.   :14905   Max.   :500000                     
                 NA's   :4927                       
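
For the character columns, the summary above reports only Length, Class and Mode, which says nothing about their values. One possible way to get counts per category is to convert such columns to factors before calling summary(). The sketch below uses a toy data frame built from the six rows printed earlier; the name stores is illustrative.

```r
# Toy rows copied from the head() output above.
stores <- data.frame(
  County             = c("ALBANY", "ALBANY", "ALLEGANY", "ALLEGANY", "BRONX", "BRONX"),
  Establishment.Type = c("A", "AC", "AC", "AC", "AC", "AC"),
  stringsAsFactors   = FALSE
)

# Convert every character column to a factor so summary() tabulates levels.
is_chr <- vapply(stores, is.character, logical(1))
stores[is_chr] <- lapply(stores[is_chr], factor)

summary(stores)

# The same counts for a single column.
county_counts <- table(stores$County)
county_counts
```

On the full data this is most useful for columns with few distinct values, such as Operation.Type or State. Columns like Entity.Name would yield thousands of levels.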

Structure of the Dataset

'data.frame':   24221 obs. of  15 variables:
 $ County            : chr  "ALBANY" "ALBANY" "ALLEGANY" "ALLEGANY" ...
 $ License.Number    : int  752247 750931 710458 728010 747358 748332 606849 753398 100196 728623 ...
 $ Operation.Type    : chr  "Store" "Store" "Store" "Store" ...
 $ Establishment.Type: chr  "A" "AC" "AC" "AC" ...
 $ Entity.Name       : chr  "ANK PETRO INC" "EVANS JULIA" "HUBRIX LLC" "REID STORES INC" ...
 $ DBA.Name          : chr  "CITGO" "TERROSA" "STEVES GAS N GRUB HUB" "CROSBYS 40090" ...
 $ Street.Number     : chr  "2450" "8" "41" "66" ...
 $ Street.Name       : chr  "RTE 9W" "VERDA AVE" "MAIN ST" "GENESEE ST" ...
 $ Address.Line.2    : chr  "" "" "" "" ...
 $ Address.Line.3    : chr  "" "" "" "" ...
 $ City              : chr  "RAVENA" "CLARKSVILLE" "ANDOVER" "CUBA" ...
 $ State             : chr  "NY" "NY" "NY" "NY" ...
 $ Zip.Code          : int  12143 12041 14806 14727 10467 10472 10462 13865 12037 12434 ...
 $ Square.Footage    : int  NA NA 1100 NA NA NA 1600 NA 1200 NA ...
 $ Georeference      : chr  "POINT (-73.816249346 42.46925125)" "POINT (-73.949128787 42.577416307)" "POINT (-77.795356304 42.158830586)" "POINT (-78.277036649 42.220633641)" ...
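
Zip.Code is stored as int, which is why the summary computes a mean for it and why its minimum (6390) has only four digits. A plausible explanation, stated here as an assumption, is that a leading zero was dropped when the column was read as a number. A minimal sketch of turning such values back into five-character codes:

```r
# Two zip codes from the str() output plus the minimum from the summary.
zip <- c(12143L, 12041L, 6390L)

# Pad to five digits with leading zeros and keep the result as text.
zip_chr <- sprintf("%05d", zip)
zip_chr
# "12143" "12041" "06390"
```

Treating the column as text also stops summary() from reporting quartiles and a mean that have no meaning for an identifier.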

Missing Values

 [1] "County"             "License.Number"     "Operation.Type"    
 [4] "Establishment.Type" "Entity.Name"        "DBA.Name"          
 [7] "Street.Number"      "Street.Name"        "Address.Line.2"    
[10] "Address.Line.3"     "City"               "State"             
[13] "Zip.Code"           "Square.Footage"     "Georeference"      
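
The listing above names all 15 columns, so on its own it does not show which of them contain missing values. According to the summary, Square.Footage has 4927 NA values, and the str() output shows empty strings in Address.Line.2 and Address.Line.3. A minimal sketch of counting both kinds per column follows. Treating "" as missing is an assumption, and the toy data frame stores is built from rows printed earlier.

```r
# Toy rows copied from the head() output above.
stores <- data.frame(
  City             = c("RAVENA", "CLARKSVILLE", "ANDOVER", "CUBA"),
  Address.Line.2   = c("", "", "", ""),
  Square.Footage   = c(NA, NA, 1100L, NA),
  stringsAsFactors = FALSE
)

# Number of NA values in each column.
na_count <- colSums(is.na(stores))

# Number of empty strings in each column (assumption: "" means not recorded).
blank_count <- vapply(stores, function(x) sum(!is.na(x) & x == ""), integer(1))

# Show only the columns that have at least one missing or blank value.
missing <- data.frame(na = na_count, blank = blank_count)
missing[missing$na + missing$blank > 0, ]
```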

Univariate

Histogram of License Number
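
The plot itself is not reproduced here. A histogram of this kind could be drawn with base R roughly as follows. The vector holds the first ten License.Number values from the str() output, and the number of breaks and the colour are arbitrary choices.

```r
# First ten license numbers from the str() output above.
license <- c(752247, 750931, 710458, 728010, 747358,
             748332, 606849, 753398, 100196, 728623)

# Draw the histogram and keep the returned bin information.
h <- hist(license,
          breaks = 20,
          main   = "Histogram of License Number",
          xlab   = "License.Number",
          col    = "grey80")

# Bin counts, useful for checking the plot against the data.
h$counts
```

Since License.Number is an identifier, the shape mostly reflects how numbers were issued rather than a measured quantity.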

Bar plot for a categorical column
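
The plot and the column it used are not shown. As an illustration only, the sketch below counts the values of Establishment.Type from the six rows printed earlier and draws them as bars. The choice of column is an assumption.

```r
# Establishment.Type values from the head() output above.
est_type <- c("A", "AC", "AC", "AC", "AC", "AC")

# Count each category and order the bars from most to least frequent.
counts <- sort(table(est_type), decreasing = TRUE)

barplot(counts,
        main = "Establishment.Type",
        ylab = "Number of licenses",
        las  = 2)  # rotate axis labels so long category names stay readable
```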

Bivariate

Boxplot for numerical data grouped by a categorical variable
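
The plot and the variables it used are not shown. The sketch below illustrates the technique with Square.Footage grouped by County. The county names come from the output above, but the square-footage values are made up for the example and are not taken from the dataset.

```r
# Hypothetical values for illustration; only the county names are real.
toy <- data.frame(
  County         = rep(c("ALBANY", "ALLEGANY", "BRONX"), each = 4),
  Square.Footage = c(900, 1500, 2400, NA,
                     1100, 1300, 1800, 2600,
                     600, 800, 1000, 4000)
)

# One box per county; NA values are left out of the statistics.
b <- boxplot(Square.Footage ~ County,
             data = toy,
             main = "Square.Footage by County",
             ylab = "Square.Footage")

# Number of non-missing observations behind each box.
b$n
```

In the real data the maximum of 500000 lies far above the median of 1800, so a plot on the raw scale would probably be dominated by a few outliers.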

Multivariate

Pair plot for numeric columns in the dataset
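
The plot is not reproduced here. A minimal sketch of selecting the numeric columns and passing them to pairs() follows. The values are the first ten entries of each column from the str() output, and the data frame name stores is illustrative.

```r
# First ten values of each column, copied from the str() output above.
stores <- data.frame(
  License.Number = c(752247, 750931, 710458, 728010, 747358,
                     748332, 606849, 753398, 100196, 728623),
  Zip.Code       = c(12143, 12041, 14806, 14727, 10467,
                     10472, 10462, 13865, 12037, 12434),
  Square.Footage = c(NA, NA, 1100, NA, NA, NA, 1600, NA, 1200, NA),
  State          = "NY",
  stringsAsFactors = FALSE
)

# Keep only the numeric columns; State is dropped here.
num_cols <- names(stores)[vapply(stores, is.numeric, logical(1))]

# Scatterplot matrix; points with an NA in either variable are not drawn.
pairs(stores[num_cols], main = "Pair plot for numeric columns")
```

Two of the three numeric columns, License.Number and Zip.Code, are identifiers, so any pattern between them should be read with caution.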