Assignment Week 2

Data Science Programming 1

Data Science Study Program Institut Teknologi Sains Bandung
Student
Nadia Apriani
Nadia Apriani
52250006
Student Major in Data Science
R Programming Data Science Statistics
Lecturer
Bakti Siregar
Bakti Siregar, M.Sc., CDS
 
Lecturer in Data Science
& Statistical Computing
Data Science Statistics

1 Question 01

01 Question
What is the main purpose of our study?
The main purpose of this study is to learn programming as a tool in Data Science to process, manipulate, and analyze data effectively, systematically, and automatically to produce useful information.
✦ Explore Keywords
Data Processing Analysis Automation Insight
Data Processing
The act of collecting, cleaning, and transforming raw data into a structured format suitable for analysis and interpretation.
Analysis
Applying statistical and computational methods to examine data patterns, relationships, and trends to draw meaningful conclusions.
Automation
Using programming to automate repetitive data tasks, reducing manual effort and minimizing human error in the process.
Insight
Actionable knowledge extracted from data that helps understand problems, identify opportunities, and support better decision-making.

2 Question 02

02 Question
Why do we learn about it?
We learn this because programming helps us process data faster, more efficiently, and accurately, and allows us to handle large and complex data.
✦ Explore Keywords
Efficiency Accuracy Big Data Speed
Efficiency
Programming streamlines data workflows, allowing analysts to complete complex tasks in a fraction of the time compared to manual methods.
Accuracy
Automated code-based processes reduce human error, ensuring data results are consistent, reliable, and reproducible every time.
Big Data
Programming enables handling of massive datasets with millions of rows that would be impossible to process manually or with spreadsheet tools.
Speed
With programming, data tasks that would take hours manually can be executed in seconds, dramatically accelerating the analysis pipeline.

3 Question 03

03 Question
What tools to have to expert about?
The tools we need to master include R and RStudio, Python, Anaconda as programming environments, and libraries and programming structures such as variables, functions, and loops to help process and analyze data.
✦ Explore Keywords
R & RStudio Python Anaconda Programming Structures
R & RStudio
A powerful statistical programming language and its IDE, widely used for data analysis, visualization, and building reproducible research reports.
Python
A versatile and beginner-friendly programming language with rich libraries like Pandas, NumPy, and Scikit-learn for data science tasks.
Anaconda
A distribution platform that simplifies package management and deployment of Python and R environments for data science projects.
Programming Structures
Fundamental concepts including variables, functions, loops, and conditionals that form the building blocks of any data science program.

4 Question 04

04 Question
Give your domain knowledge / interest?
Domain knowledge refers to the field of data analysis, such as education, business, healthcare, or economics. Domain knowledge helps us understand data and produce more meaningful analysis.
✦ Explore Keywords
Education Business Healthcare Economics
Education
Data science in education helps analyze student performance, predict learning outcomes, and improve curriculum design through data-driven insights.
Business
In business, data science drives sales forecasting, customer segmentation, market analysis, and strategic decision-making for competitive advantage.
Healthcare
Healthcare data science enables disease prediction, patient risk assessment, drug discovery, and optimization of hospital resource management.
Economics
Economic data analysis supports policy evaluation, market trend forecasting, and understanding macroeconomic patterns through quantitative modeling.

5 Conclusion

Conclusion

Programming is an important tool in Data Science to process, manipulate, and analyze data effectively and automatically. It helps us handle large and complex data more efficiently and produce useful information for understanding problems and supporting decision-making.


To do this, we need to master tools such as R, RStudio, Python, and Anaconda, as well as basic programming concepts and libraries. Domain knowledge is also important to help us understand the data and produce meaningful analysis.

✦ Explore Keywords
Programming Data Science Tools Domain Knowledge
Programming
The foundation of data science that enables systematic, automated, and reproducible data processing and analysis at any scale.
Data Science
An interdisciplinary field combining statistics, programming, and domain expertise to extract meaningful insights from structured and unstructured data.
Tools
Mastering R, Python, and Anaconda equips data scientists with the technical capabilities needed to tackle real-world data challenges effectively.
Domain Knowledge
Understanding the context of your data field ensures that analysis is not only technically correct but also practically meaningful and impactful.