Data Science Project Index

Miguel Ángel Pérez Vargas · vargasmiguel@uninorte.edu.co

1 Preface

I’m Miguel Ángel, a Data Science student at Universidad del Norte (Barranquilla, Colombia). This RPubs space gathers documents I’ve created throughout my degree, including coursework reports, class notes, and project write-ups, along with a few personal projects. The focus is on applied statistics, exploratory data analysis, data visualization, and machine learning, mostly using R and Quarto.

Because my studies are based in Colombia, where Spanish is the primary language of instruction, most posts are in Spanish. You’ll also find some publications in English. Each document notes its language at the beginning, and whenever possible I include code and data so the work is reproducible.

I’m still learning and not yet an expert, so these pieces may not be perfect.

2 Inferential Statistics

2.1 Accident / Traffic Crash Rate in Barranquilla [ES]

Inferential Statistics · 2024

Exploratory analysis for crash rates.

View report

2.2 EDA Global Power Plant [ES]

EDA · 2024

Cleaning and visualization for the Global Power Plant dataset.

View report

2.3 EDA Artificial Intelligence [ES]

EDA · 2024

Exploratory analysis of an AI-related dataset.

View report

3 Statistical Methods

3.1 EDA TOP 1000 Movies by IMDb Rating [ES]

Statistical Methods · 2025

Exploration of the IMDb Top 1000 dataset — ratings and trends.

View report

3.2 Multiple Linear Regression with Categorical Predictors [ES]

Regression · 2025

Modeling and encoding of categorical predictors in MLR.

View report

4 Mathematical Statistics

4.1 Activation Functions [ES]

ML Theory · 2025

Overview and plots of five lesser-known activation functions.

View report

4.2 Activity#4 [ENG]

Mathematical Statistics · 2025

Sample distribution of the minimum - Exercise.

View report

5 Data Visualization

5.1 Detection, Handling, and Imputation of Missing Data and Outliers [ES].

DataViz · 2025

Basic EDA on missing data and outliers.

View report