CS 424 Big Data Analytics

Session 1: Orientation

Instructor: Dr. Bob Batzinger
Academic year: 2021/2022
Semester: 1

Begins June 2021

Let’s start with you…

Who are you?

Why are you taking this course?

What do you want to accomplish this term?

Common Reasons for studying Big Data

  1. Data-driven decisions provide a competitive advantage

  2. Big Data provides a springboard for AI

  3. Big Data skills for discovering insights in data and turning them into value are in high demand

  4. Big Data combines data science, data architecture, statistics and machine learning principles to mine insights from data

  5. Studying Big Data will broaden your horizons

Instructor for this course

Definition of Terms: Big Data Analytics

Course details

Big Data - Big Opportunity

Big Data

Netflix uses big data to save $1 billion per year on customer retention.


The nature of Big Data

Velocity: speed, Volume: quantity, Value: worth, Variety: diversity, Veracity: truthfulness

Big Data Word Cloud

Big Data Tools for Data Analysis

Analytical Exercise

Data Analytics

HOW?

WHY?

Framing an experimental question

\[\begin{matrix} \hbox{Cause} & \rightarrow & \hbox{Mediator} &\rightarrow & \hbox{Effect}\\ \hbox{(independent)} & &\hbox{(moderating)} & &\hbox{(dependent)}\\ \\ \hbox{Weather}\atop\hbox{conditions} &\rightarrow & \left\{\begin{matrix}\hbox{Holidays and weekends}\\\hbox{Political unrest}\\\hbox{Daylight saving time}\\\end{matrix}\right\} &\rightarrow & \hbox{Athletic event}\atop\hbox{participation}\\ \\ \hbox{Cost of}\atop\hbox{food delivery} &\rightarrow & \left\{\begin{matrix}\hbox{Ease of use}\\\hbox{Work from home order}\\ \hbox{Curfew and road blocks}\end{matrix}\right\} &\rightarrow &\hbox{Grab Revenue}\\ \end{matrix}\]
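One way to make the cause, mediator, effect framing concrete is to write each question down as a small record before any data is collected. The sketch below is only an illustration: the class name `ExperimentalQuestion` and its `describe` method are hypothetical and not part of the course material. It encodes the two example questions from the diagram.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExperimentalQuestion:
    """Hypothetical record for one cause -> mediator -> effect question."""
    cause: str                  # independent variable
    effect: str                 # dependent variable
    mediators: tuple = ()       # moderating factors that sit between the two

    def describe(self) -> str:
        # Join the moderating factors so the whole chain reads on one line.
        middle = "; ".join(self.mediators) if self.mediators else "no listed mediators"
        return f"{self.cause} -> [{middle}] -> {self.effect}"


# The two example questions from the diagram.
weather = ExperimentalQuestion(
    cause="Weather conditions",
    effect="Athletic event participation",
    mediators=("Holidays and weekends", "Political unrest", "Daylight saving time"),
)
delivery = ExperimentalQuestion(
    cause="Cost of food delivery",
    effect="Grab revenue",
    mediators=("Ease of use", "Work from home order", "Curfew and road blocks"),
)

for question in (weather, delivery):
    print(question.describe())
```

Listing the mediators explicitly makes it easier to decide later which of them need to be controlled or recorded during an experiment.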

Experimentation

Control of factors that influence the result

Experiment designs

Common experimental issues