BASIC INFORMATION ABOUT THE DATASET

##   rows columns discrete_columns continuous_columns all_missing_columns
## 1 2798       7                5                  2                   0
##   total_missing_values complete_rows total_observations memory_usage
## 1                    0          2798              19586       262720
Data summary
Name SPD
Number of rows 2798
Number of columns 7
_______________________
Column type frequency:
character 4
factor 1
numeric 2
________________________
Group variables None

Variable type: character

skim_variable n_missing complete_rate min max empty n_unique whitespace
Owner 0 1 9 43 0 158 0
Team 0 1 9 59 0 115 0
League 0 1 3 14 0 16 0
Recipient 0 1 3 96 0 1274 0

Variable type: factor

skim_variable n_missing complete_rate ordered n_unique top_counts
Party 0 1 FALSE 7 Rep: 1625, Dem: 921, Bip: 195, Bip: 40

Variable type: numeric

skim_variable n_missing complete_rate mean sd p0 p25 p50 p75 p100 hist
Amount 0 1 16790.10 77994.48 36 2700 5000 5600 1850000 ▇▁▁▁▁
Election.Year 0 1 2017.93 1.60 2016 2016 2018 2020 2020 ▇▁▇▁▇

WHICH PARTY AND LEAGUE RECEIVED THE MOST DONATIONS?

## 3 columns ignored with more than 50 categories.
## Owner: 158 categories
## Team: 115 categories
## Recipient: 1274 categories

## 3 columns ignored with more than 50 categories.
## Owner: 158 categories
## Team: 115 categories
## Recipient: 1274 categories

## 3 columns ignored with more than 50 categories.
## Owner: 158 categories
## Team: 115 categories
## Recipient: 1274 categories

DISTRIBUTIONS OF CONTINUOUS FEATURES

QQ PLOT OF CONTINUOUS FEATURES

CORREALTION HEATMAP

## 3 features with more than 20 categories ignored!
## Owner: 158 categories
## Team: 115 categories
## Recipient: 1274 categories

PRINCIPAL COMPONENT ANALYSIS

## 3 features with more than 50 categories ignored!
## Owner: 158 categories
## Team: 115 categories
## Recipient: 1274 categories