Introduction To Data Science Programming

Assignment Week 2

March 02, 2026

Profile Photo

KAYLA APRILIA

Data Science Student at ITSB

NIM: 52250057

Email: kaylaaprilia2142@gmail.com

R Programming Data Science Statistics


1 What is the main purpose of our study (Data Science Programming)?

The main purpose of studying Data Science Programming is to extract meaningful insights from data using programming, statistics, and analytical thinking.

As Data Science students, we aim to:

  • Collect and clean data
  • Analyze and visualize data
  • Build predictive models
  • Support decision-making using data

In short, the goal is to transform raw data into useful information that can solve real-world problems.


2 Why do we learn about it?

We learn Data Science Programming because today’s world is driven by data. Almost every industry such as business, healthcare, finance, education, and technology relies on data for decision-making.

By learning it, we can:

  • Identify patterns and trends in large datasets
  • Make data-driven decisions instead of assumptions
  • Solve complex problems using algorithms
  • Increase career opportunities in the digital era

This knowledge prepares us to become data analysts, data scientists, machine learning engineers, or business intelligence specialists.


3 What tools do we need to expert about?

As a Data Science student, these are the main tools we need to become experts in:

library(knitr)

tools <- data.frame(
  Category = c(
    "Programming Languages","Programming Languages","Programming Languages",
    "Data Analysis Libraries","Data Analysis Libraries",
    "Data Visualization Tools","Data Visualization Tools","Data Visualization Tools",
    "Supporting Tools","Supporting Tools","Supporting Tools","Supporting Tools"
  ),
  Tool = c(
    "Python","R","SQL",
    "Pandas & NumPy","dplyr & tidyverse",
    "Matplotlib & Seaborn","ggplot2","Tableau / Power BI",
    "Git","Jupyter Notebook / RStudio","Excel","Hadoop / Spark"
  ),
  Purpose = c(
    "Data analysis, machine learning, automation",
    "Statistics and visualization",
    "Database management and queries",
    "Data manipulation (Python)",
    "Data manipulation (R)",
    "Visualization in Python",
    "Visualization in R",
    "Business intelligence dashboards",
    "Version control system",
    "Coding environment",
    "Spreadsheet analysis",
    "Big data processing"
  )
)

kable(tools)
Category Tool Purpose
Programming Languages Python Data analysis, machine learning, automation
Programming Languages R Statistics and visualization
Programming Languages SQL Database management and queries
Data Analysis Libraries Pandas & NumPy Data manipulation (Python)
Data Analysis Libraries dplyr & tidyverse Data manipulation (R)
Data Visualization Tools Matplotlib & Seaborn Visualization in Python
Data Visualization Tools ggplot2 Visualization in R
Data Visualization Tools Tableau / Power BI Business intelligence dashboards
Supporting Tools Git Version control system
Supporting Tools Jupyter Notebook / RStudio Coding environment
Supporting Tools Excel Spreadsheet analysis
Supporting Tools Hadoop / Spark Big data processing

4 What is your interest domain knowledge in Data Science?

As a Data Science student, I am particularly interested in Business and Sales Analytics.

I want to:

  • Build sales dashboards
  • Analyze customer segmentation
  • Help companies improve business strategies through data

I am interested in applying data science to solve real-world business problems and create impactful solutions.