Data Science Programming
Assignment Week 2
Angelica Florentina M.
52250063
Student Majoring in Data Science at Institut Teknologi Sains Bandung
Question 1
What is the main purpose of our study?
The main purpose of studying Data Science Programming is to develop the ability to collect, process, analyze and interpret data using programming techniques in order to generate meaningful insights and support decision-making.
Data Science Programming combines statistics, mathematics, and computer science to solve real-world problems using data. It helps transform raw data into useful information through data cleaning, visualization, modeling and prediction.
In short, the main purpose is :
- To understand data
- To extract patterns and insights
- To build predictive models
- To support data-driven decisions
Question 2
Why do we learn about it?
We learn Data Science Programming because:
- Data is everywhere
Almost every industry (healthcare, finace, education, technology, business) relies on data.
- High demand career
Data scientist and Data analyst are among the most in demand professions today.
- Better decision-making
It helps organizations make accurate and evidence based decisions.
- Problem solving skills
It improves logical thinking, analytical skills and computational thinking.
- Future technology
Artificial Intelligence (AI) and Machine Learning depend heavily on data science.
In the modern digital era, understanding data is a critical skill.
Question 3
What tools to have to expert about it?
To become an expert in Data Science Programming, we need to master several important tools and fundamental skills.
First, programming languages such as Python and R are essential. Python is widely used for data analysis and machine learning because of its simplicity and powerful ecosystem, while R is strong in statistical analysis and visualization.
Second, we need to understand key libraries like Pandas and NumPy for dat processing, Matplotlib for visualization, and Scikit-learn for building predictive models.
We also use tools such as Jupyter Notebook, Google Colab, SQL, and GIT to support coding, database management, and collaboration.
In addition to technical tools, strong knowledge of statistics, mathematics and critical thinking is necessary to analyze data accurately and build reliable models.
Question 4
Give your interest domain knowledge (Data Science)?
I am personally interested in Healthcare and Business as my domain knowledge in data science. In healthcare, I am interested in how data science can be used to support disease prediction, early diagnosis, and patient risk analysis. I believe technology and data have a very important role in improving medical services and helping doctors make more accurate decisions.
In business, I am curious about how data can help companies understand customer behavior, improve marketing strategies and make better strategic decisions. I find it interesting how data analysis can directly impact company growth and efficiency.
Through data science programming, I hope to contribute to both fields by creating solutions that are useful, impactful and data-driven.