Data Science Programing

Assigment ~week 02~

Khafizatun Nisa
Data Science Undergraduate Student

R Programming Data Science Statistics

1 Question and Answer

1.1 What is the main purpose of our study?

The main purpose of this study is to develop proficiency in Data Science Programming, focusing on how to write structured and logical instructions for computers to process, analyze, and interpret data effectively. This study aims to build computational thinking and algorithmic problem-solving skills that transform raw data into meaningful and actionable insights.

1.2 Why do we learn about it?

Kita mempelajari Data Science Programming karena programming merupakan dasar dari analisis data modern. Programming membantu kita mengotomatisasi pekerjaan yang berulang dan memproses data dalam jumlah besar yang tidak bisa ditangani secara efisien oleh tools seperti Excel.

Melalui programming, kita juga dapat membangun model prediksi menggunakan machine learning untuk memperkirakan tren dan mendukung pengambilan keputusan. Selain itu, programming memastikan bahwa analisis yang kita lakukan akurat, konsisten, dan dapat direproduksi kembali.

1.3 What tools to have to expert about?

To become an expert, one must master a combination of languages, libraries, and specialized software:

  • Programming Languages

    Python

    (versatile and popular)

    R Programming

    (for advanced statistical analysis)

  • Integrated Development Environments (IDEs):

    Jupyter Notebook Interactive Coding

    RStudio Statistical Computing IDE

    Visual Studio Code Code Editor

  • Data Libraries

    Pandas for data manipulation

    Numpy for data manipulation

    Matplotib for data visualization

    Seaborn for data visualization

  • Database Management
    SQL (Structured Query Language) for managing and retrieving data from relational databases using MySQL.

    SQL

    MYSQL

  • Version Control
    Using Git to track code changes and GitHub for repository management and collaborative development.

    Git

    Github

1.4 Give me your interest data science domain knowledge

In Data Science, domain knowledge refers to understanding how data can be applied within a specific industry to create value. My primary interest lies in business-oriented data science, particularly in Business & E-commerce and Finance.

In Business & E-commerce, I am interested in analyzing customer behavior and churn rates to improve customer retention and increase profitability.

In Finance, I am interested in developing fraud detection systems and risk analysis models to support secure and data-driven financial decision-making.

I am especially interested in how data-driven insights can help companies make strategic decisions and strengthen their competitive advantage.