Data report overview

The dataset examined has the following dimensions:

Feature Result
Number of observations 100
Number of variables 44

Checks performed

The following variable checks were performed, depending on the data type of each variable:

Check                                                character  factor  labelled  haven labelled  numeric  integer  logical  Date
Identify miscoded missing values                     ×          ×       ×         ×               ×        ×                 ×
Identify prefixed and suffixed whitespace            ×          ×       ×         ×
Identify levels with < 6 obs.                        ×          ×       ×         ×
Identify case issues                                 ×          ×       ×         ×
Identify misclassified numeric or integer variables  ×          ×       ×         ×
Identify outliers                                                                                 ×        ×                 ×
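
To make two of the character-variable checks listed above concrete, here is a minimal Python sketch. It is not dataMaid's implementation: the function names and the sample values are hypothetical. It flags values with leading or trailing whitespace, and groups of values that differ only in letter case.

```python
from collections import defaultdict


def whitespace_issues(values):
    """Return the distinct values that have leading or trailing whitespace."""
    return sorted({v for v in values if v is not None and v != v.strip()})


def case_issues(values):
    """Return groups of distinct values that are equal once case is ignored."""
    groups = defaultdict(set)
    for v in values:
        if v is not None:
            groups[v.lower()].add(v)
    return sorted(sorted(g) for g in groups.values() if len(g) > 1)


# Hypothetical sample, not taken from the dataset in this report.
sample = ["Yes", "yes", "No", " No", None, "Yes"]
print(whitespace_issues(sample))
print(case_issues(sample))
```

Missing entries (None here) are skipped by both checks, so they are never reported as whitespace or case problems.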

Please note that all numerical values reported below have been rounded to 2 decimal places.

Summary table

Variable                                            Variable class  # unique values  Missing observations  Any problems?
X1                                                  numeric         100              0.00 %
AGE                                                 numeric         41               0.00 %                ×
SEX                                                 numeric         2                0.00 %
HOSPITAL_STAY                                       numeric         11               0.00 %                ×
NYHA_CLASS_AT_ADMISSION                             numeric         3                0.00 %                ×
NYHA_CLASS_AT_DISCHARGE                             numeric         3                7.00 %
DM                                                  numeric         3                0.00 %                ×
HTN                                                 numeric         2                0.00 %
CAD                                                 numeric         2                0.00 %
CVA                                                 numeric         2                0.00 %                ×
CKD                                                 numeric         2                0.00 %
COPD                                                numeric         2                0.00 %
HEART_RATE                                          numeric         26               0.00 %
SBP                                                 numeric         34               0.00 %                ×
DBP                                                 numeric         26               0.00 %                ×
HEMOGLOBIN_GM_DL                                    numeric         40               0.00 %                ×
SERUM_UREA_MG_DL                                    numeric         52               0.00 %                ×
S_CREATININE_MG_DL                                  numeric         28               0.00 %                ×
S_SODIUM_M_EQ_L                                     numeric         22               0.00 %                ×
S_POTASSIUM_M_EQ_L                                  numeric         28               0.00 %                ×
BNP_AT_ADMISSION                                    numeric         65               0.00 %                ×
BNP_AT_DISCHARGE                                    numeric         43               7.00 %                ×
TROPONIN_AT_ADMISSION                               numeric         31               0.00 %
TROPONIN_AT_DISCHARGE                               numeric         16               7.00 %                ×
ECG                                                 numeric         2                0.00 %
CXR_PA_VIEW                                         numeric         3                0.00 %
EF_PERCENT                                          numeric         6                0.00 %
MR_GRADE                                            numeric         4                0.00 %
BETA_BLOCKER                                        numeric         2                0.00 %                ×
ACE_INHIBITORS                                      numeric         2                0.00 %
ALD_ANTA                                            numeric         2                0.00 %
DIURETICS                                           numeric         1                0.00 %                ×
DIGOXIN                                             numeric         2                0.00 %
ARNI                                                numeric         2                0.00 %
ICD                                                 numeric         2                0.00 %
CRTD                                                numeric         2                0.00 %
CRTP                                                numeric         2                0.00 %                ×
PRECIPITATING_FACTOR_FOR_ACUTE_HEART_FAILURE        numeric         5                0.00 %                ×
DEATH_DURING_CURRENT_HOSPITAL_STAY                  numeric         2                0.00 %
HOSPITALISATION_OR_DEATH_AFTER_DISCHARGE            numeric         5                7.00 %                ×
CAUSE_FOR_HOSPITALISATION_OR_DEATH_AFTER_DISCHARGE  numeric         6                66.00 %               ×
CARDIAC_CAUSE_FOR_REHOSPITALISATION                 numeric         5                77.00 %               ×
DEATH_AFTER_DISCHARGE                               numeric         3                7.00 %                ×
outcome                                             character       3                7.00 %
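
For variables with missing observations, the unique-value count in the summary table is one higher than the count in the per-variable sections of this report (for example 3 versus 2 for outcome). This is consistent with the summary table counting the missing value as a value of its own. The following Python sketch illustrates both counts and the missing percentage under that assumption. The function name and the sample column are hypothetical.

```python
def summarize(values):
    """Return (unique incl. missing, unique excl. missing, percent missing)."""
    n_missing = sum(v is None for v in values)
    non_missing = {v for v in values if v is not None}
    unique_excl = len(non_missing)
    # Assumption: a missing value counts as one extra "unique value".
    unique_incl = unique_excl + (1 if n_missing else 0)
    pct_missing = round(100 * n_missing / len(values), 2)
    return unique_incl, unique_excl, pct_missing


# Hypothetical column shaped like `outcome`: 100 rows, 7 missing, two levels.
# The 60/33 split between the two levels is invented for illustration.
column = ["Yes"] * 60 + ["No"] * 33 + [None] * 7
print(summarize(column))
```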

Variable list

X1

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 100
Median 50.5
1st and 3rd quartiles 25.75; 75.25
Min. and max. 1; 100
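
The quartiles reported for X1 can be reproduced if X1 is simply a row index running from 1 to 100. That is an assumption, though it is consistent with the 100 unique values, the minimum of 1 and the maximum of 100. The sketch below uses Python's `statistics.quantiles` with `method="inclusive"`, which interpolates linearly between order statistics. The agreement suggests, but does not prove, that the report uses the same convention.

```python
import statistics

# Assumption: X1 is the row index 1..100.
x1 = list(range(1, 101))

# n=4 returns the three cut points: 1st quartile, median, 3rd quartile.
q1, median, q3 = statistics.quantiles(x1, n=4, method="inclusive")
print(q1, median, q3)
```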

AGE

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 41
Median 56
1st and 3rd quartiles 47.75; 64
Min. and max. 22; 86

  • Note that the following possible outlier values were detected: "22".
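
As a rough illustration of how a value such as 22 can be flagged, the sketch below applies the standard boxplot rule (fences at 1.5 times the interquartile range beyond the quartiles) to the AGE quartiles reported above. This is a swapped-in textbook rule, not necessarily the one used to produce this report. The function names are hypothetical.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return the (lower, upper) fences of the standard boxplot rule."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def is_flagged(value, q1, q3, k=1.5):
    """Return True if `value` lies outside the boxplot fences."""
    lower, upper = tukey_fences(q1, q3, k)
    return value < lower or value > upper


# Quartiles and extremes as reported for AGE above.
age_q1, age_q3 = 47.75, 64
print(tukey_fences(age_q1, age_q3))
print(is_flagged(22, age_q1, age_q3))  # reported minimum
print(is_flagged(86, age_q1, age_q3))  # reported maximum
```

For AGE this agrees with the report: 22 falls below the lower fence and 86 stays inside the upper one. It does not agree for every variable. Applied to the SBP quartiles (92; 112), the same rule would flag the maximum of 196 and not the minimum of 76, the opposite of what is listed for SBP, so the report evidently applies a different rule.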

SEX

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “1”
Reference category 1
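
In this report, numeric variables with up to five distinct non-missing values (for example PRECIPITATING_FACTOR_FOR_ACUTE_HEART_FAILURE) are summarised as factors, while EF_PERCENT, with six, is summarised numerically. The Python sketch below assumes a cutoff of five inferred from those cases, not one taken from dataMaid's documentation. It also assumes the reference category is the smallest level, which matches the values shown here but is likewise an inference. The function name and sample data are hypothetical.

```python
from collections import Counter

MAX_FACTOR_LEVELS = 5  # assumption inferred from this report


def summarize_factor_like(values, max_levels=MAX_FACTOR_LEVELS):
    """Summarise a numeric column as a factor if it has few distinct values.

    Returns None when the column is empty or has more than `max_levels`
    distinct non-missing values (it would then be summarised numerically).
    """
    counts = Counter(v for v in values if v is not None)
    if not counts or len(counts) > max_levels:
        return None
    mode, _ = counts.most_common(1)[0]
    # Assumption: the reference category is the smallest observed level.
    return {"unique": len(counts), "mode": mode, "reference": min(counts)}


# Hypothetical two-level coding; the 70/30 split is invented.
sex = [1] * 70 + [2] * 30
print(summarize_factor_like(sex))
```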

HOSPITAL_STAY

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 11
Median 6
1st and 3rd quartiles 5; 8
Min. and max. 3; 15

  • Note that the following possible outlier values were detected: "3".

NYHA_CLASS_AT_ADMISSION

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “4”
Reference category 1

  • Note that the following levels have at most five observations: "1".
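
The low-count check behind notes like the one above ("levels with < 6 obs.", that is, at most five observations) can be sketched in Python as follows. The function name is hypothetical and the level counts are invented for illustration. Only the facts that level 1 is sparse and that 4 is the mode come from the report.

```python
from collections import Counter


def sparse_levels(values, max_count=5):
    """Return levels observed at most `max_count` times, ignoring missing."""
    counts = Counter(v for v in values if v is not None)
    return sorted(level for level, n in counts.items() if n <= max_count)


# Hypothetical NYHA-like coding with three levels; counts are invented.
nyha = [1] * 2 + [3] * 40 + [4] * 58
print(sparse_levels(nyha))
```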

NYHA_CLASS_AT_DISCHARGE

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 7 (7 %)
Number of unique values 2
Mode “2”
Reference category 1

DM

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “2”
Reference category 1

  • Note that the following levels have at most five observations: "4".

HTN

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1

CAD

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “1”
Reference category 1

CVA

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1

  • Note that the following levels have at most five observations: "1".

CKD

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1

COPD

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1

HEART_RATE

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 26
Median 110
1st and 3rd quartiles 100; 120
Min. and max. 90; 140

SBP

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 34
Median 98
1st and 3rd quartiles 92; 112
Min. and max. 76; 196

  • Note that the following possible outlier values were detected: "76", "78", "80", "82", "84", "86".

DBP

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 26
Median 60
1st and 3rd quartiles 55.5; 68
Min. and max. 40; 120

  • Note that the following possible outlier values were detected: "40", "44", "46", "110", "120".

HEMOGLOBIN_GM_DL

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 40
Median 10.75
1st and 3rd quartiles 9.17; 12
Min. and max. 2.6; 16

  • Note that the following possible outlier values were detected: "2.6", "16".

SERUM_UREA_MG_DL

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 52
Median 58.5
1st and 3rd quartiles 38.75; 78
Min. and max. 14; 196

  • Note that the following possible outlier values were detected: "14".

S_CREATININE_MG_DL

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 28
Median 1.4
1st and 3rd quartiles 1.1; 1.9
Min. and max. 0.7; 6.8

  • Note that the following possible outlier values were detected: "0.7", "0.8", "6.4", "6.5", "6.8".

S_SODIUM_M_EQ_L

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 22
Median 135.5
1st and 3rd quartiles 129; 139
Min. and max. 120; 148

  • Note that the following possible outlier values were detected: "148".

S_POTASSIUM_M_EQ_L

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 28
Median 4.5
1st and 3rd quartiles 4; 4.9
Min. and max. 2.8; 6

  • Note that the following possible outlier values were detected: "6".

BNP_AT_ADMISSION

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 65
Median 1200
1st and 3rd quartiles 897.5; 1900
Min. and max. 450; 5600

  • Note that the following possible outlier values were detected: "450", "520", "540", "550", "560", "600".

BNP_AT_DISCHARGE

Feature Result
Variable type numeric
Number of missing obs. 7 (7 %)
Number of unique values 42
Median 330
1st and 3rd quartiles 220; 500
Min. and max. 70; 1300

  • Note that the following possible outlier values were detected: "70", "100", "112", "120".

TROPONIN_AT_ADMISSION

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 31
Median 0.09
1st and 3rd quartiles 0.05; 0.9
Min. and max. 0.02; 6

TROPONIN_AT_DISCHARGE

Feature Result
Variable type numeric
Number of missing obs. 7 (7 %)
Number of unique values 15
Median 0.04
1st and 3rd quartiles 0.02; 0.06
Min. and max. 0.01; 0.7

  • Note that the following possible outlier values were detected: "0.3", "0.4", "0.5", "0.7".

ECG

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “1”
Reference category 1

CXR_PA_VIEW

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “2”
Reference category 1

EF_PERCENT

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 6
Median 30
1st and 3rd quartiles 25; 35
Min. and max. 15; 40

MR_GRADE

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “1”
Reference category 0

BETA_BLOCKER

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “1”
Reference category 1

  • Note that the following levels have at most five observations: "2".

ACE_INHIBITORS

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “1”
Reference category 1

ALD_ANTA

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “1”
Reference category 1

DIURETICS

  • The variable only takes one (non-missing) value: "1". The variable contains 0 % missing observations.

DIGOXIN

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1

ARNI

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1

ICD

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1

CRTD

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1

CRTP

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1

  • Note that the following levels have at most five observations: "1".

PRECIPITATING_FACTOR_FOR_ACUTE_HEART_FAILURE

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 5
Mode “3”
Reference category 1

  • Note that the following levels have at most five observations: "4".

DEATH_DURING_CURRENT_HOSPITAL_STAY

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1

HOSPITALISATION_OR_DEATH_AFTER_DISCHARGE

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 7 (7 %)
Number of unique values 4
Mode “0”
Reference category 0

  • Note that the following levels have at most five observations: "3".

CAUSE_FOR_HOSPITALISATION_OR_DEATH_AFTER_DISCHARGE

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 66 (66 %)
Number of unique values 5
Mode “1”
Reference category 1

  • Note that the following levels have at most five observations: "2", "3", "4", "5".

CARDIAC_CAUSE_FOR_REHOSPITALISATION

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 77 (77 %)
Number of unique values 4
Mode “1”
Reference category 0

  • Note that the following levels have at most five observations: "0", "2", "3".

DEATH_AFTER_DISCHARGE

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.
Feature Result
Variable type numeric
Number of missing obs. 7 (7 %)
Number of unique values 2
Mode “2”
Reference category 1

  • Note that the following levels have at most five observations: "1".

outcome

Feature Result
Variable type character
Number of missing obs. 7 (7 %)
Number of unique values 2
Mode “Yes”

Report generation information:

  • Created by anupamsingh81 (username: anupam).

  • Report creation time: Wed Aug 07 2019 16:54:17

  • Report was run from directory: /home/anupam/Documents/hf

  • dataMaid v1.2.0 [Pkg: 2018-10-03 from CRAN (R 3.4.3)]

  • R version 3.5.3 (2019-03-11).

  • Platform: x86_64-pc-linux-gnu (64-bit) (Ubuntu 16.04.5 LTS).

  • Function call: makeDataReport(data = hf)