Funny semantics

I have spent many weekends and vacations on the DataCamp website, so I consider myself a DataCampeR.

Acknowledgement

I would like to give credit to all the DataCamp instructors for the great job they have done. Perhaps 90% of what I now know about data was gained there. It has been some years since I first started, but I am still excited about my next course.

Preview

The data was scraped from the DataCamp website to explore which courses are the most popular and how committed participants are to completing the tracks DataCamp offers. The focus is on the Data Analyst/Scientist tracks for R and Python. Some limitations apply to what we can learn this way because, for instance, it is not clear what ‘course participant’ means. Is it someone who just started the free first chapter, someone who finished the free chapter, or someone who completed the course? Either way, there is a staggering difference between the number of participants in the two most popular courses and the number of participants in the least popular courses that are part of a track.

Data

When the data were collected, on 2019-09-01, there were 296 courses available, some of which are selected to form the tracks for Data Analyst/Scientist.

Courses with the most participants for each language:

| course_title | course_page | course_participants | course_language |
|---|---|---|---|
| Introduction to Python | https://www.datacamp.com/courses/introduction-to-python | 1645360 | Py |
| Introduction to R | https://www.datacamp.com/courses/introduction-to-r | 1309099 | R |
| Intermediate Python for Data Science | https://www.datacamp.com/courses/intermediate-python-for-data-science | 363446 | Py |
| Intro to SQL for Data Science | https://www.datacamp.com/courses/intro-to-sql-for-data-science | 357158 | SQL |
| Intermediate R | https://www.datacamp.com/courses/intermediate-r | 322879 | R |
| Joining Data in SQL | https://www.datacamp.com/courses/joining-data-in-postgresql | 86133 | SQL |
| Introduction to Git for Data Science | https://www.datacamp.com/courses/introduction-to-git-for-data-science | 58598 | Other |
| Introduction to Shell for Data Science | https://www.datacamp.com/courses/introduction-to-shell-for-data-science | 33939 | Other |

Many begin to learn Python!
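
A table like the one above can be produced by grouping the scraped records by language and keeping the top entries in each group. The snippet below is only a minimal sketch of that idea, not the code actually used for this post: the rows are copied from the table, while the function name `top_courses_per_language` and the choice of two courses per language are assumptions made for illustration.

```python
from collections import defaultdict

# Rows copied from the table above:
# (course_title, course_participants, course_language)
courses = [
    ("Introduction to Python", 1645360, "Py"),
    ("Introduction to R", 1309099, "R"),
    ("Intermediate Python for Data Science", 363446, "Py"),
    ("Intro to SQL for Data Science", 357158, "SQL"),
    ("Intermediate R", 322879, "R"),
    ("Joining Data in SQL", 86133, "SQL"),
    ("Introduction to Git for Data Science", 58598, "Other"),
    ("Introduction to Shell for Data Science", 33939, "Other"),
]


def top_courses_per_language(rows, n=2):
    """Hypothetical helper: the n most attended courses for each language."""
    by_language = defaultdict(list)
    for title, participants, language in rows:
        by_language[language].append((title, participants))
    # Sort each group by participant count, largest first, and keep n entries.
    return {
        language: sorted(items, key=lambda item: item[1], reverse=True)[:n]
        for language, items in by_language.items()
    }


top = top_courses_per_language(courses)
for language, items in top.items():
    for title, participants in items:
        print(f"{language:<5} {participants:>9,} {title}")
```

With the full scraped data set in place of the eight rows, the same grouping would pick the leaders out of all 296 courses.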

A tiny percentage completes the tracks

To be awarded a certificate of completion for a given track, you need to successfully complete all the courses listed in it. But you could also work on individual courses without following a specific track, so these two conditions help us estimate approximately how many got certified as Data Analyst/Scientist.

So no more than 10916 participants could possibly have gained the certificate for Data Analyst with Python, because ‘Analyzing Police Activity with pandas’ is a course from the track and it is the one with the fewest participants. I would also assume that some people completed this course but not the other courses in the track, so the real number is even smaller. This is the only number we have, so at most 3.0035% of participants could possibly have been certified as Data Analyst with Python.
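
The reasoning here is a simple upper bound: nobody can finish a track without taking its least attended course, so the minimum participant count across the track caps the number of certificates. The sketch below illustrates that with the two counts that appear in this post. The post does not say which denominator was used for the percentage. The figure 3.0035% matches 10916 divided by the 363446 participants of Intermediate Python for Data Science, so that course is assumed here to be the baseline (and to belong to the track). The helper name `certification_upper_bound` is made up for illustration.

```python
def certification_upper_bound(track_participants):
    """Hypothetical helper: the least attended course in a track and its count.

    Completing a track requires every course in it, so this count is an
    upper bound on the number of people who could have been certified.
    """
    course = min(track_participants, key=track_participants.get)
    return course, track_participants[course]


# Only these two counts appear in the post; the real track has more courses.
track = {
    "Intermediate Python for Data Science": 363446,
    "Analyzing Police Activity with pandas": 10916,
}

bottleneck, upper_bound = certification_upper_bound(track)

# Assumed denominator: the source does not state it explicitly.
baseline = track["Intermediate Python for Data Science"]
share = 100 * upper_bound / baseline

print(f"{bottleneck}: at most {upper_bound} certified ({share:.4f}% of {baseline})")
```

Because some participants of the least attended course will not have finished the rest of the track, the true share can only be lower than this bound.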

And at most 0.3928% of participants could have been certified as Data Scientist with Python.

What’s in the title?