Data Science Programing
Assigment ~week 02~
R Programming Data Science Statistics
1 Question and Answer
1.1 What is the main purpose of our study?
The main purpose of this study is to develop proficiency in Data Science Programming, focusing on how to write structured and logical instructions for computers to process, analyze, and interpret data effectively. This study aims to build computational thinking and algorithmic problem-solving skills that transform raw data into meaningful and actionable insights.
1.2 Why do we learn about it?
Kita mempelajari Data Science Programming karena programming merupakan dasar dari analisis data modern. Programming membantu kita mengotomatisasi pekerjaan yang berulang dan memproses data dalam jumlah besar yang tidak bisa ditangani secara efisien oleh tools seperti Excel.
Melalui programming, kita juga dapat membangun model prediksi menggunakan machine learning untuk memperkirakan tren dan mendukung pengambilan keputusan. Selain itu, programming memastikan bahwa analisis yang kita lakukan akurat, konsisten, dan dapat direproduksi kembali.
1.3 What tools to have to expert about?
To become an expert, one must master a combination of languages, libraries, and specialized software:
- Programming Languages
Python
(versatile and popular)
R Programming
(for advanced statistical analysis)
- Integrated Development Environments (IDEs):
Jupyter Notebook Interactive Coding
RStudio Statistical Computing IDE
Visual Studio Code Code Editor
- Data Libraries
Pandas for data manipulation
Numpy for data manipulation
Matplotib for data visualization
Seaborn for data visualization
- Database Management
SQL (Structured Query Language) for managing and retrieving data from relational databases using MySQL.SQL
MYSQL
- Version Control
Using Git to track code changes and GitHub for repository management and collaborative development.
Git
Github
1.4 Give me your interest data science domain knowledge
In Data Science, domain knowledge refers to understanding how data can be applied within a specific industry to create value. My primary interest lies in business-oriented data science, particularly in Business & E-commerce and Finance.
In Business & E-commerce, I am interested in analyzing customer behavior and churn rates to improve customer retention and increase profitability.
In Finance, I am interested in developing fraud detection systems and risk analysis models to support secure and data-driven financial decision-making.
I am especially interested in how data-driven insights can help companies make strategic decisions and strengthen their competitive advantage.