M06-Reflection Essay-Advanced Data Wrangling

Author

Austin Voth

Published

March 13, 2026

1 Summarize what you learned about Tidyverse tricks presented in Step 1 by focusing on the new things you learned that were not dealt with in M05.

  • There were a lot of new tips and tricks that I saw being used in Step 1 but I think the one that I found the most useful was creating multiple visual models because you can use only one model but multiple models help convey a bigger message

2 What do you think about David Robinson’s impromptu screencast performance in Step 2? How much were you able to understand?

  • I though that David Robinsons impromptu screencast performance in Step 2 was very benificial because there was a lot of information said which included some new things that we werent taught before. But there was a double edged sword because since there was so much information, I wasnt able to understand much but if I went back and watched the video again, it would make more sense.

3 In Step 3, you saw two videos, one showing initial data cleaning and the other showing further cleaning. Describe the two processes. What are the big differences between the two?

  • Initial data cleaning is when you first bring the data set into whatever you are doing and giving a quick lookover but when you do further cleaning you do a better cleaning by filtering out invalid values, removing duplicate data, and creating new variables. The big difference is that when yiu intially clean the data, you are just cracking the surface but further data cleaning is a deep dive into what the data actually contains.

4 In Step 4, you learned how to create an Revealjs presentation. What’s your impression of the Revealjs presentation? Describe its capabilities as you learned from the video. What are its strengths and weaknesses compared with PPT?

  • I really do think that Revealjs presentation tool is very usefull and a great standalone replacement for PPT. Some of the coolest capabilities are that you could impliment live R code into the presentation, you can embed code, and there are a lot of supporting themes and custom styles. I think that the only problem that I have with Revealjs compared to PPT is there is a very heavy learning curve and PPT is easier to share in the real world.

6 My advanced wrangling lecture in Step 6 introduced new concepts not covered in M05, such as joining relational databases. Pick three of them, describe when to use them, and give one coding example along with an output for each function. To clarify, you are expected to do literate programming by weaving your narratives with the code.

  • One new concept that you covered was the join function and you used it with the charts to to merge with other information from the other charts which is a great use. Another concept that you covered was the group by function and you used it in tables to group similar data which a a great use of coding to have the information be compared in a similar way. The last new concept you covered was the summarize function which you used to condense the data in the charts which is a great use of code because it condenses information so that you can use it for what you need.

7 Data wrangling is like laying a foundation/basement for a building, while visualization is building a structure above the basement. Working in a basement is not a glamorous job at all. No one notices it; people will see what they can see above the ground and judge it, but without the hard work you did in the basement, the building will never stand well. Besides, wrangling is hard. You learned dplyr and tidyr in this module, but there is still more to learn in M07 and M08. What did you like the most about working with the tools you learned in this module? Elaborate on your point.

  • Like I said in the response above I liked using the newer functions like filter, select, mutate, and arrange. I think those functions were my favorite tools that I learned in this module as they have so many different applications to them.

8 What seems to be the challenge, if any, for you when you try to master the wrangling tools? How may you be able to overcome the challenge? Let me share a little bit about my learning journey. The first time I saw David Robinson’s performance, I was so impressed that I paused the video and typed his code. It took all day to type the code. That experience was very helpful, but I realized that I could shorten my learning curve by focusing on the basics. I couldn’t understand a lot of the tools he was using. So, I tried to study those packages in Tidyverse and watch his video once in a while to check how much I improved. I found this method very effective. Thus, I believe that you will be ready to absorb online resources like YouTube much more effectively once you finish this program. Hope this experience helps you

  • I think that the main challenges I have been having are just learning all of this new information and how to apply it in my coding. The one way that I want to improve and master my skills were just to keep practicing and listening to all of the lecture.