Hi, if you are reading this then you will know that this is a template for data cleaning. Well i was training for power bi and i figured out that Data Cleaning was a good thing for me. I know that cleaning data is redundant and most data sets are needed to be cleaned. I was glad to see that Cleaning data sets made me happy when i was applying it in power bi and R. Well its a little boring for others because its redundant but for me it gives me power and energy and its a good gift. I would say thanks to my seniors in DEP for helping me reach my goal.

  1. Load data and Check for missing values.

  2. Amputate the missing values using the most frequent value within the column. If there are no missing values then summarize data.

  3. Use Upper or Lower Cases to columns especially for cleaning bad inputs

  4. Use Delimiters to extract important values from emails and, the uncleaned data after upper or lowering the letters.

  5. Combine Important columns into one such as Fname, MI, and Lname to FullName

  6. Change the data typing in the dataframe and use correct format(like date and time).

  7. Delete all the unused data and if theres no unused then save to excel sheet.