Data Science Programming Assignment

CONTACT INFORMATION
Email Address
Andremusari276@gmail.com
Student ID
52250065
LinkedIn Profile
View Profile

ANDRE

Data Science Student
ITSB
Institut Teknologi Sains Bandung
Institut Teknologi Sains Bandung • Data Science • 2026

1 1. What is the main purpose of our study Data Science Programming?

The main purpose of studying Data Science Programming is to develop the ability to process, analyze, and interpret data using programming languages so that raw data can be transformed into meaningful and valuable information.

Through this course, we do not only learn how to write code, but also how to think logically, systematically, and analytically in solving data-based problems. We learn to:

  • Clean and preprocess data
  • Identify patterns and relationships between variables
  • Create informative data visualizations
  • Build simple analytical and predictive models
  • Automate data processing to improve efficiency

Therefore, the ultimate goal is to build strong data-driven problem-solving skills.

2 2. Why do we learn about it?

We learn Data Science Programming because, in the digital era, data has become one of the most valuable assets. Almost every sector such as business, education, healthcare, and technology generates large amounts of data every day.

Without programming skills:

  • Data processing would take too much time
  • Handling large-scale data would be difficult
  • Analysis would be less efficient and less accurate

By studying this subject, we can make objective decisions based on data rather than assumptions. Moreover, these skills are highly relevant to industry demands and technological advancements.

3 3. What tools do we have to be expert in?

To become an expert in Data Science Programming, we need to master several important tools, including:

  • Python as the main programming language
  • Jupyter Notebook for interactive coding and analysis
  • Libraries such as:
    • Pandas for data manipulation
    • NumPy for numerical computation
    • Matplotlib and Seaborn for data visualization
    • Scikit-learn for basic machine learning
  • SQL for database management
  • Git for version control and collaboration

Mastering these tools will help us work professionally and efficiently in data science projects.

4 4. Give your domain knowledge.

I want to focus on Big Data. I am interested in processing large-scale data to generate useful insights that can be easily understood.

In my opinion, the biggest challenge in big data is not only the volume of data, but also how to manage, filter, and present it in a meaningful way. I want to:

  • Manage large-scale data efficiently
  • Simplify complex data into clear insights
  • Present analysis results through easy-to-understand visualizations
  • Support data-driven decision making

My goal is to transform complex data into simple, understandable, and valuable information for both experts and the general public.