Data Science Programming

Assignment Week 2

Ni.MD Aurora Sekarningrum

Student at Bandung Institute of Science and Technology

Major: Data Science

NIM: 5225072

Date: 2026-03-02

R Programming
Statistics
Data Science

1 Question 1

What is the main purpose of our study (Data Science Programming)?

The main purpose of this course is to equip students with fundamental programming skills to process, analyze, and visualize data. Beyond writing code, students also learn to think logically and systematically when solving data-driven problems.

The course introduces concepts such as data structures, conditionals, and loops, as well as the use of programming languages like R or Python to read, clean, explore, and analyze data, and produce visualizations. Therefore, the main focus of the course is to show how programming can support the entire data analysis workflow.

2 Question 2

Why do we learn about it?

Data Science Programming is important because almost all data analysis processes rely on programming. Real-world data is often large, unstructured, and requires specific techniques to clean and analyze efficiently.

The course also develops computational thinking, which is the ability to break down problems into logical steps that can be implemented in code. This skill enables students not only to analyze data but also to understand the workflow of data analysis systematically.

In the digital era, various sectors—such as business, technology, and government—depend on data for decision-making. Therefore, programming skills are a crucial and relevant competency for addressing real-world data challenges.

3 Question 3

What tools to have to expert about?

In Data Science, mastering certain tools is essential because each plays a critical role in the data analysis workflow. Five key tools are:

  1. Programming Languages (R or Python)
    Programming languages form the core of Data Science. Almost all stages—from data exploration and cleaning to modeling—are carried out through coding. Mastering a programming language is therefore a foundational competency that determines the quality of analysis.

  2. IDE, such as RStudio and Jupyter Notebook
    IDEs help make the analysis process more structured and well-documented. They are essential for reproducibility and clarity, especially when working on projects or collaborating in teams.

  3. Data Analysis and Visualization Libraries
    Libraries such as dplyr, pandas, and ggplot2 enable efficient and systematic data processing and visualization. Mastery of these libraries demonstrates the ability to leverage standardized tools widely used in the Data Science community.

  4. Databases and SQL
    Most real-world data is stored in databases. SQL skills are needed to access and prepare data before analysis, providing a strong foundation for subsequent analytical processes.

  5. Business Intelligence Tools, such as Microsoft Power BI or Tableau
    Analytical results must be communicated clearly to users or decision-makers. BI tools are used to present data in interactive and easy-to-understand dashboards, ensuring that analysis has a real-world impact.

Overall, these five tools form an integrated process, from data acquisition to presenting analytical results. Understanding their interconnection helps to see Data Science as a unified workflow rather than a collection of isolated tools.

4 Question 4

Give your interest domain knowledge (Data Science)?

What is Domain Knowledge?
Domain knowledge is the understanding of the specific field or context where data is applied. A data scientist is expected not only to process data technically but also to comprehend the background of the problem being analyzed. For instance, in healthcare, it is important to understand medical terminology, whereas in business, knowledge of profit, consumer behavior, and marketing strategies is necessary. Without domain knowledge, analytical results may be statistically correct but irrelevant in practical application.

My Interests in Data Science
My current interests focus on data visualization, business intelligence, and data-driven UI/UX design. I am keen to learn how complex data can be processed and presented visually in a simpler way, making it easier for users to understand.

Clear visualizations can support decision-making by providing information in an accessible format. Additionally, I am interested in understanding how dashboards or data-driven systems can be designed to be intuitive and user-friendly. My interest in this area is still at a foundational level, and I hope to develop my skills further through coursework and project experience.

5 References

Wickham, H., & Grolemund, G. (2017). R for Data Science. O’Reilly Media.
https://r4ds.had.co.nz/

IBM. (2023). What is Data Science? IBM.
https://www.ibm.com/topics/data-science

Data Science Labs. (2024). Data Science Programming. Bookdown.
https://bookdown.org/dsciencelabs/data_science_programming/00-Introduction-to-Programming.html