DATA SCIENCE PROGRAMMING

Assignment Week 1

NIM: 52250007
SEMESTER 2

Yosef Teofani Tamba

🎓 Active students in the 2025/2026 academic year

Data Science Undergraduate at Institut Teknologi dan Sains Bandung (ITSB)

👨‍🏫 Lecturer
Mr. Bakti Siregar, M.Sc., CDS
📊 R Programming
🤖 Data Science
📚 Currently in the Semester 2 (2026) - Focus on Data Science Programming

1 Introduction

Data science Programming is the art of using computer languages to process, analyze, and visualize information on a large scale. Here, code serves as a digital laboratory. Instead of performing time-consuming manual calculations, one can write instructions for a computer to identify hidden patterns, market trends, and even consumer behavior in a matter of seconds.

2 Question I

What is the main purpose of Data Science Programming?

The main purpose of Data Science programming is to process and clean data, create predictive models, perform Exploratory Data Analysis, create interactive visualizations, and automate the processing of large data sets quickly and accurately using computation so that decisions can be made. Additionally, this course aims to develop logical thinking, problem-solving skills, and computational efficiency. By understanding programming concepts such as control flow, functions, loops, and data transformation techniques, students can automate analytical processes.

3 Question II

Why do we learn about it?

Data Science Programming is important to learn, especially for someone who is going to enter the world of data. This is because professionals usually use it to process raw data efficiently, especially if the data is large, so that it is sometimes ineffective to use only manual methods. Furthermore, one can perform Exploratory Data Analysis to uncover trends and patterns within a dataset to support decision-making. Additionally, one can create compelling data visualizations from the results of previous processes and develop Machine Learning models to make more accurate prediction.

4 Question III

What tools do we have to be expert about?

The tools that will be used during the learning process are R and Python, because both have powerful libraries, statistical tools, and capabilities for machine learning, but each excels in different fields. For comparison, Python is generally intuitive and relatively easy for beginners who want to learn coding for AI, web development, and large-scale data analysis. Python uses several main libraries, including pandas, numpy, matplotlib, seaborn, plotly, and scikit-learn.

Meanwhile, R excels in specialized statistics and statistical computing, and can also be used to create easier visualizations of the given data, enabling more reasonable decisions to be made by viewing the available comparative visualizations. R uses several main libraries, namely tidyverse, dplyr, ggplot2, plotly, and caret.

So it can be concluded that:

  • Python is ideal for AI, machine learning, and large-scale data analysis
  • R excels in statistical analysis and data visualization

5 Question IV

Give your interest domain knowledge about Data Science

I have a keen interest in exploring the potential of data in the financial and retail sectors. My current focus is on mastering analytical techniques, namely determining trends and patterns from historical stock data to predict future price movements. I am also interested in learning about the implementation of forecasting in sales data to predict future sales using time series models.