CleanDataAgent is an intelligent, automated data
cleaning tool for structured datasets.
Built in R, it simplifies preprocessing by handling messy columns,
formatting, missing data, outliers, scaling, and generating summary
reports — all with built-in logging.
janitortotal_guests, lead_time_category)```r # Install from GitHub (after publishing) devtools::install_github(“DanieG777/CleanDataAgent”) Or load locally:
r devtools::load_all(“path/to/CleanDataAgent”) 🧪 Usage
r library(CleanDataAgent)
clean_data_agent_v13_2( file_path = “hotel_bookings.csv”, output_prefix = “v13_cleaned”, log_file = “logfile.txt” ) 📂 Output Example v13_cleaned_data.csv → Cleaned + scaled data
v13_cleaned_summary.csv → Summary from summary()
v13_cleaned_plot.png → Lead time histogram by hotel
logfile.txt → Detailed log of processing
đź”§ Requirements tidyverse
janitor
readr
lubridate
skimr
ggplot2
stringr
👤 Author Daniel Dawit – Junior Data Analyst Connect with me on LinkedIn
🧠License MIT License – free for personal and commercial use.
yaml