class: center, middle, inverse, title-slide

.title[
# Text Mining of Reddit Data using R - (Meetup)
]
.author[
### Dr. Zhenning ‘Jimmy’ Xu, follow me on Twitter:
https://twitter.com/MKTJimmyxu
]
.date[
### 2023/03/08
]

---
background-image: url(https://upload.wikimedia.org/wikipedia/commons/b/be/Sharingan_triple.svg)

???

Image credit: [Wikimedia Commons](https://commons.wikimedia.org/wiki/File:Sharingan_triple.svg)

---
class: center, middle

# xaringan

### /ʃaː.'riŋ.ɡan/

---
class: inverse, center, middle

# Get Started

---

# Agenda for today

- Introduction?
- Why R?
- The Reddit platform and the API
- Tutorial
- Q&A

---
background-image: url(https://github.com/yihui/xaringan/releases/download/v0.0.2/karl-moustache.jpg)
background-position: 50% 50%
class: center, bottom, inverse

# You only live once!

---

# Why R?

.center[<img src="https://raw.githubusercontent.com/utjimmyx/resources/master/R_analytics.png" width='50%' align="middle"/>]

---

# What is Marketing?

.center[<img src="https://raw.githubusercontent.com/utjimmyx/resources/master/cartoon.jpg" width='50%' align="middle"/>]

---

# What is Text Mining?

- Text mining, also known as text analytics, is the process of analyzing large amounts of unstructured data, such as text, to extract insights and patterns that would otherwise be difficult to identify.
- Text mining techniques use natural language processing (NLP) to understand and categorize text data.

---

# The Reddit platform and the Reddit API

- Reddit is a social media platform that allows users to post and discuss content on various topics.
- Reddit has a large user base, and its content covers a wide range of subjects, from news and politics to entertainment and technology.
- To retrieve Reddit data, we use the Reddit API. The API provides access to a large amount of data, and we can filter the data by subreddit, time period, and keyword.
- 'RedditExtractoR', an R package that helps you access the Reddit API: https://github.com/ivan-rivera/RedditExtractor
- What Are APIs? - Simply Explained: https://www.youtube.com/watch?v=OVvTv9Hy91Q

---

# Steps and procedures

- Data scraping
- Data cleaning
- Topic modeling: a technique used to discover latent topics in a large corpus of text data.

Reference (free book): Text Mining with R: https://www.tidytextmining.com/

---

## Thank you all for your participation!

### Questions

### How to make these slides for your creative work

- https://www.rstudio.com/about/customer-stories/
- xaringan Presentations and R Markdown: https://bookdown.org/yihui/rmarkdown/xaringan.html
- Slides for today can be accessed at my RPubs site: [**rpubs.com**](https://rpubs.com/utjimmyx).

### Resources

- [**my Tableau site**](https://public.tableau.com/app/profile/zhenning.xu)
- [**my GitHub website**](https://github.com/utjimmyx)
- [**Co-organizer - Central Valley Data Analytics and R Users Meetup Group**](https://www.meetup.com/Bakersfield-Data-Analytics-Using-R-Meetup-Group/)
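
---

# Steps and procedures: a sketch in R

The three steps named earlier (data scraping, data cleaning, topic modeling) can be sketched roughly as follows. This is a minimal illustration, not the code used in the tutorial. The toy posts, the short stop-word list, and the helper names `scrape_threads()` and `clean_tokens()` are assumptions made up for this example. The scraping step is wrapped in a function so nothing is downloaded until you call it, and the topic model only runs if the `topicmodels` package is installed.

```r
# Toy posts standing in for scraped Reddit text (made up for illustration).
posts <- data.frame(
  id   = c("p1", "p2", "p3", "p4"),
  text = c(
    "Loving the new phone camera! Battery life is great too: https://example.com",
    "Phone battery drains fast after the update... any fix?",
    "Best pizza dough recipe? My crust always burns in the oven.",
    "Tried a sourdough pizza recipe, the crust came out perfect!"
  ),
  stringsAsFactors = FALSE
)

# Step 1 (data scraping): hypothetical wrapper around RedditExtractoR.
# Assumes the package is installed and internet access is available.
scrape_threads <- function(keyword, subreddit) {
  RedditExtractoR::find_thread_urls(
    keywords = keyword, subreddit = subreddit,
    sort_by = "top", period = "month"
  )
}

# Step 2 (data cleaning): lowercase, strip URLs and non-letters, tokenize,
# then drop very short tokens and a tiny hand-picked stop-word list.
stop_words_small <- c("the", "is", "a", "my", "in", "after", "any", "too",
                      "new", "out", "always", "came", "tried", "best")

clean_tokens <- function(text) {
  text   <- tolower(text)
  text   <- gsub("https?://[^ ]+", " ", text)  # remove URLs
  text   <- gsub("[^a-z]+", " ", text)         # keep letters only
  tokens <- strsplit(trimws(text), " +")[[1]]
  tokens[nchar(tokens) > 2 & !tokens %in% stop_words_small]
}

tokens_by_post <- lapply(posts$text, clean_tokens)
names(tokens_by_post) <- posts$id

# Document-term matrix: one row per post, one column per term.
vocab <- sort(unique(unlist(tokens_by_post)))
dtm <- t(vapply(
  tokens_by_post,
  function(tok) as.integer(table(factor(tok, levels = vocab))),
  integer(length(vocab))
))
colnames(dtm) <- vocab

# Overall term frequencies, most frequent first.
term_freq <- sort(colSums(dtm), decreasing = TRUE)
print(head(term_freq))

# Step 3 (topic modeling): fit a 2-topic LDA model if topicmodels is available.
if (requireNamespace("topicmodels", quietly = TRUE)) {
  library(topicmodels)
  lda_fit <- LDA(dtm, k = 2, control = list(seed = 1234))
  print(terms(lda_fit, 5))  # top 5 terms per topic
}
```

With real data, you would replace `posts` with text returned by the scraping step and use a fuller stop-word list, such as the one described in Text Mining with R.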