Table of Contents

  1. Mixed Links
  2. Math
  3. NN
  4. Careers
  5. RStudio
  6. R
  7. AI
  8. Visualization
  9. RMarkdown / R Notebook / learnr / RTutor
  10. SQL/ (R)SQLite
  11. Git/Github
  12. Emacs / ESS
  13. Blogs worth reading
  14. Teaching - other courses and ideas
  15. Datasets
  16. lessr
  17. NLP
  18. Blockchain
  19. Interesting applications

Math

NN

An introduction to weight pruning by Tivadar Danka

**Why are neural networks so powerful? The universal approximation theorem ** 22 articles about NN Jan 2019g

Careers

Why treating analytics like a second-class citizen will hurt you, by Cassie Kozyrkov, Oct 19, 2019

Describing Technical Concepts Simply in Interviews by Megan Dibble

Here’s what I learned along the way by Megan Dibble

Carl Howe, Sean Lopp 2020-05-27

How I Transitioned from a Non-Technical Background into Data Science

CLOSED: [2020-05-17 So 20:48]

RStudio

R

A quick introduction to performance optimization in R: the parallel and Rcpp packages.

AI

Understanding the Theoretical Foundations of Multi-Agent AI Systems by Jesus Rodriguez Jun 18 2020

When tuned up, old algorithms can match the abilities of their successors

Visualization

And how to select the right tool for your audience, by Zeming Yu

ein Tool um selber die Daten zu analysieren: Gapminder https://www.gapminder.org/tools-offline/

https://www.researchgate.net/publication/328759928_Good_Things_on_the_Rise_The_One-Sided_Worldview_of_Hans_Rosling_Translation_of_an_essay_published_in_Kvartal_Sept_20_2018_httpskvartalseartiklarbra-saker-pa-uppgang-roslings-varldsbild-ar-ensidigt-pos Exzerpt: https://quillette.com/2018/11/16/the-one-sided-worldview-of-hans-rosling/

RMarkdown / R Notebook / learnr / RTutor

SQL/ (R)SQLite

In this tutorial, you will learn about using SQLite, an extremely light-weight relational database management system (RDBMS) in R.

Learn the basics of SQLite databases from SQLite dot commands to an example of their practical applications using the command line interface.

An Essential Skill for Any Data Science Résumé

A Real-Life Data Analyst Case Study

Git/Github

Posted on November 28, 2018 by caitlinhudon

Emacs / ESS

Blogs worth reading

Teaching - other courses and ideas

by Lucie Heath Nov 12, 2018

Capstone projects in academic data science training should be a semester or two long, and should prioritize group projects over individual projects

The Johns Hopkins Data Science Lab (DaSL) is a group based in the Johns Hopkins Bloomberg School of Public Health whose mission is to enhance data science thinking everywhere and make data science accessible to the entire world. Data science is a fundamental way of thinking in many areas of science, business, and government. We believe all people should be able to develop literacy, fluency and skill in data science so they can make sense of the data they encounter in their personal and professional lives. We recognize data science as a fundamentally human activity and focus our activities on helping people build data analyses for people.

Datasets

Hint: Using a Kaggle data set might not be sufficient.

lessr

NLP

Named Entity Recognition, Sentiment Analysis, and More, Andre Ye, Jun 10

Blockchain

Implement a small blockchain in R and learn more about what a blockchain looks like and some of the core concepts behind it!

Interesting applications

My solution to this Riddler using R

Part 1: Reproducing the results

COVID-19 Open Research Dataset Challenge on Kaggle

https://fermatslibrary.com/s/inventing-the-randomized-double-blind-trial-the-nuremberg-salt-test-of-1835#email-newsletter

I’m interested in running R on the Raspberry Pi, and on Raspbian in particular. There are loads of Debian packages for R, and I’m hoping that many of these find there way into Raspbian eventually. Right now it is possible to install and run R from Raspbian, but relatively few packages are available. However, the package r-base can be installed, and that is enough to get up and running with a basic R installation.

A Data Scientist’s approach for visualizing their Exploratory Data Analysis

Analyzing data scientist job listings with LDA topic modeling

Let’s take a look at an algorithms question that has been used in interviews at Amazon, Google, and Uber. Included is a Python implementation: Write an algorithm for finding the median of two sorted arrays of length n and m respectively. The algorithm should run in O(log(n+m)) time.

Ryan A. Estrellado, Emily A. Bovee, Jesse Mostipak, Joshua M. Rosenberg, and Isabella C. Velásquez

BBC Visual and Data Journalism Feb 1, 2019

On Finding and Fixing Latent Racial Bias

There is so much data out there about all of us – thanks to the “big data revolution.” Every day it’s used to predict our behavior – from consumer habits to the likelihood of committing a crime. But all that data is useless if you don’t know how to interpret it.

Seth Stephens-Davidowitz, New York Times bestselling author and data scientist, understands that data can play an important role in challenging conventional wisdom and making better decisions. In this episode, he explains how to use data analysis to follow the data – wherever it leads.

Later in the show, we talk with Johnny Grant, whose company has launched FOMO Bones, CBD treats for dogs.

Published on June 3, 2020 by Thomas Brand

Published on October 22, 2019 by Thomas Brand, Sébastien Galais

A great little learning exercise that illustrates the range and flexibility of the R language