Assignment Week 2 ~ Introduction to Data Science Programming

Chelsea Tesalonika
Patricia Hutajulu

52250041
Student

Bakti Siregar, M.Sc., CDS

Lecturer

Major : Data Science at ITSB

Data Science Programming

Introduction

Data Science Programming is one of the fundamental subjects in data science because it introduces how programming is used to work with data. In this course, we learn that data science is not only about analyzing numbers, but also about understanding how to process, structure, and manage data using code. Programming becomes an essential tool to transform raw data into meaningful insights. Through this introduction, we understand why coding skills, tools like Python or R, and domain knowledge are important to support data-driven decision making.

Question 1

What is the main purpose of Data Science programming?

The main purpose of Data Science Programming is to help us work with data using coding. In data science, data is usually raw and messy, so we need programming to clean it, organize it, and analyze it properly. Through coding, we can explore patterns in data, build models, and create visualizations. Basically, programming is the main tool that allows us to turn data into useful information that can support decision making.

In Data Science, programming is essential because code is what we use to:

  • Collect and organize data.

  • Clean and change it so it can be analyzed.

  • Explore patterns and relationships.

  • Build predictive models.

  • Make visual graphs, dashboards, or reports.

Question 2

Why do we learn about it?

We learn Data Science Programming because programming is the main foundation of data science. From the introduction, data science combines statistics, computer science, and domain knowledge, and programming is the tool that connects all of them. With coding, we can handle large and messy datasets more efficiently. We can clean data, transform it, explore patterns, and build models without doing everything manually. In real-world situations, data is rarely clean, so programming helps us manage that complexity.

We also learn this subject to understand how data is structured and stored. Programming helps us understand data types, variables, functions, and logical operations, which are basic but essential concepts. Without understanding these basics, it would be hard to move to more advanced topics like machine learning or predictive modeling.

Question 3

What tools should we master to become an expert?

To become good at Data Science Programming, we need to master tools like Python or R. Python is commonly used for data manipulation, machine learning, and automation. R is strong in statistics and data visualization. Besides that, we should also understand basic programming concepts like variables, loops, functions, and data structures. Libraries such as pandas, numpy, and visualization tools are also important. But honestly, practice with real datasets is the key to improving. We also need to master several important tools:

  • Python Python is popular in data science because it is flexible and supports machine learning, data manipulation, and automation.

  • RStudio RStudio is an environment for writing and running R code. It makes data analysis more organized and easier to manage.

  • Jupyter Notebook Jupyter Notebook is commonly used with Python to write code, show output, and explain analysis in one place.

  • Libraries / Packages For example: dplyr, ggplot2 (in R) or pandas, matplotlib (in Python). These help us clean, process, and visualize data more efficiently.

Question 4

Give your interest and domain knowledge of data science

My interest domain knowledge is business, economics, and financial engineering. In Data Science, domain knowledge is very important because it helps us understand the real meaning behind the data. Even if we have strong programming skills, without understanding the field, we might misinterpret the results or make wrong conclusions. In business and economics, data science can be used to analyze market trends, customer behavior, company performance, inflation, and financial growth. For example, we can use data to identify patterns in sales, forecast demand, or evaluate business strategies.

In financial engineering, data science plays an even more technical role. It can be used for quantitative modeling, risk analysis, stock price prediction, portfolio optimization, and financial forecasting. By using programming, we can simulate financial scenarios, calculate potential risks, and build predictive models based on historical financial data. With Data Science Programming, I can combine coding skills with financial and economic knowledge to create data-driven solutions. This combination allows better decision-making, especially in investment, risk management, and financial planning.

Conclusion

Data Science Programming helps us turn raw data into useful insights using coding. It connects programming, statistics, and domain knowledge to solve real-world problems. By mastering tools like Python, R, and related libraries, we can analyze data more efficiently. When combined with domain knowledge in business, economics, and financial engineering, programming skills allow us to make better and more data-driven decisions.