Click Here“Bamako Data Science Meetup”

“The Bamako Data Science Meetup is open to everyone who wants to contribute to building a Data Science community in Mali. It is meant to be a place for you to share what you do and to find people you can learn from or with.”

What can we learn from the website?

  1. It was founded on July 2, 2018 by Fousseyenou Ba. Mariam B. became an event orgainzer on September 24, 2019.

  2. As of January 12, 2021, there are 268 members.

  3. The most recent event was on October 23, 2020. No upcoming events are scheduled.

  4. To explore data about members, we can try to scrape the website and create a dataset. This exercise uses R and RMarkdown on a Mac computer.

Webscraping

After musch trial, error, and perserverance, I succeded in getting data into Excel, thanks to Outwit Hub software.

Click Here“Outwit Hub”

Steps taken:

  1. Go to the Bamako DS Meetup members website.

https://www.meetup.com/Bamako-Data-Science/members/

  1. Copy the URL
  2. Launch Outwit Hub
  3. Paste the URL into this browser
  4. Scroll down and click on the “show more members” 4 times.
  5. Click on guess.
  6. 267 joiners are identified
  7. Export the data to Excel.
  8. Edit lines 6 (Patrick K.) and 137 (Mariam B. Remove “Event organizer”)
  9. Note: I normally would save the data to a (csv) format. But when this file was read into R, accent marks in names were not recognized.

Load necessary packages

Load data into R

Start data wrangling

Clean Joined Variable with Regular Expressions

Why did Allahsera join twice?

## # A tibble: 2 x 3
##   Name                 Joined                            row
##   <chr>                <chr>                           <dbl>
## 1 Allahsera Auguste T. Joined Thu Jan 07 2021 09:52:02     6
## 2 Allahsera Auguste T. Joined Fri Oct 09 2020 13:08:47    26

Comments

After lots and lots of work to webscrape and wrangle the data into a tidy dataset, we are finally ready for some analysis.

We ended up with 267 members as of January 12, 2021.

Threes polts are produced that provide interesting insights.

  1. Why did new members drop significantly in 2019 and then bounce back in 2020?

  2. Are we surprised that most people sign up on Fridays?

  3. Any ideas on why most people sign up in July and September?

Think of analyses we could do if we had data on gender, educational status, age, R user, career goals, etc.