KAYLA APRILIA
Data Science Student at ITSB
NIM: 52250057
Email: kaylaaprilia2142@gmail.com
1 What is the main purpose of our study (Data Science Programming)?
The main purpose of studying Data Science Programming is to extract meaningful insights from data using programming, statistics, and analytical thinking.
As Data Science students, we aim to:
- Collect and clean data
- Analyze and visualize data
- Build predictive models
- Support decision-making using data
In short, the goal is to transform raw data into useful information that can solve real-world problems.
2 Why do we learn about it?
We learn Data Science Programming because today’s world is driven by data. Almost every industry such as business, healthcare, finance, education, and technology relies on data for decision-making.
By learning it, we can:
- Identify patterns and trends in large datasets
- Make data-driven decisions instead of assumptions
- Solve complex problems using algorithms
- Increase career opportunities in the digital era
This knowledge prepares us to become data analysts, data scientists, machine learning engineers, or business intelligence specialists.
3 What tools do we need to expert about?
As a Data Science student, these are the main tools we need to become experts in:
library(knitr)
tools <- data.frame(
Category = c(
"Programming Languages","Programming Languages","Programming Languages",
"Data Analysis Libraries","Data Analysis Libraries",
"Data Visualization Tools","Data Visualization Tools","Data Visualization Tools",
"Supporting Tools","Supporting Tools","Supporting Tools","Supporting Tools"
),
Tool = c(
"Python","R","SQL",
"Pandas & NumPy","dplyr & tidyverse",
"Matplotlib & Seaborn","ggplot2","Tableau / Power BI",
"Git","Jupyter Notebook / RStudio","Excel","Hadoop / Spark"
),
Purpose = c(
"Data analysis, machine learning, automation",
"Statistics and visualization",
"Database management and queries",
"Data manipulation (Python)",
"Data manipulation (R)",
"Visualization in Python",
"Visualization in R",
"Business intelligence dashboards",
"Version control system",
"Coding environment",
"Spreadsheet analysis",
"Big data processing"
)
)
kable(tools)
| Category | Tool | Purpose |
|---|---|---|
| Programming Languages | Python | Data analysis, machine learning, automation |
| Programming Languages | R | Statistics and visualization |
| Programming Languages | SQL | Database management and queries |
| Data Analysis Libraries | Pandas & NumPy | Data manipulation (Python) |
| Data Analysis Libraries | dplyr & tidyverse | Data manipulation (R) |
| Data Visualization Tools | Matplotlib & Seaborn | Visualization in Python |
| Data Visualization Tools | ggplot2 | Visualization in R |
| Data Visualization Tools | Tableau / Power BI | Business intelligence dashboards |
| Supporting Tools | Git | Version control system |
| Supporting Tools | Jupyter Notebook / RStudio | Coding environment |
| Supporting Tools | Excel | Spreadsheet analysis |
| Supporting Tools | Hadoop / Spark | Big data processing |
4 What is your interest domain knowledge in Data Science?
As a Data Science student, I am particularly interested in Business and Sales Analytics.
I want to:
- Build sales dashboards
- Analyze customer segmentation
- Help companies improve business strategies through data
I am interested in applying data science to solve real-world business problems and create impactful solutions.