Assignment Week2~Data Science Programming I

Logo

Chandra Rizal Alamsyah

Student Majoring in Data Science at ITSB

NIM: 52250068

Email:

R Programming Data Science Statistics

1 What is the main purpose of data science programming?

The main purpose of data science programming is to process, analyze, and extract meaningful insights from data in order to support data-driven decision making.

More specifically, it aims to:

  • Collect and clean raw data (data collection and preprocessing)

  • Perform exploratory data analysis (EDA)

  • Build statistical and machine learning models

  • Generate predictions (predictive modeling)

  • Automate analytical processes

  • Communicate findings through visualization and reports

In essence, data science programming transforms raw data into valuable information that can guide strategic decisions.

2 Why do we learn data science?

There are several fundamental reasons why learning data science is important:

  1. The modern world is data-driven

Major companies such as Google, Amazon, and Netflix rely heavily on data to:

  • Recommend products and content

  • Optimize advertisements

  • Predict customer behavior

  1. High career demand

Professions such as:

  • Data Scientist

  • Data Analyst

  • Machine Learning Engineer

are in high global demand and offer competitive compensation.

  1. Objective decision-making

Data science enables organizations to make decisions based on statistical evidence and quantitative analysis rather than intuition alone.

  1. Broad applicability

Data science is relevant in nearly every sector, including finance, healthcare, education, government, technology, and sports.

3 What tools should be mastered to become an expert?

To become proficient in data science, the following tools and skills are essential:

  1. Programming Languages
  • Python (most widely used)

  • R (strong in statistical analysis)

  1. Python Libraries
  • NumPy (numerical computing)

  • Pandas (data manipulation)

  • Matplotlib / Seaborn (data visualization)

  • Scikit-learn (machine learning)

  • TensorFlow / PyTorch (deep learning)

  1. Databases
  • SQL

  • PostgreSQL / MySQL

  1. Big Data Technologies (advanced level)
  • Apache Spark

  • Hadoop

  1. Supporting Tools
  • Jupyter Notebook

  • Git & GitHub

  • Microsoft Excel (for initial analysis)

  • Tableau / Power BI (business visualization tools)

In addition, a strong foundation in:

  • Statistics

  • Probability

  • Linear Algebra

  • Basic Calculus

is crucial for achieving expertise.

4 What are the main domains in data science?

Data science is a broad interdisciplinary field. Some key domains include:

🔹 Machine Learning

Developing predictive models that learn from data.

🔹 Artificial Intelligence (AI)

Building systems that simulate human intelligence.

🔹 Computer Vision

Analyzing and interpreting visual data such as images and videos.

🔹 Natural Language Processing (NLP)

Enabling computers to understand and process human language.

🔹 Data Engineering

Designing and maintaining data pipelines and infrastructure.

🔹 Business Intelligence

Using data analytics to support strategic business decisions.

🔹 Financial Analytics

Risk modeling and market forecasting.

🔹 Healthcare Analytics

Disease prediction and medical data analysis.

5 Reference List

  • Davenport, T. H., & Patil, D. J. (2012). Data scientist: The sexiest job of the 21st century. Harvard Business Review, 90(10), 70–76.

  • Provost, F., & Fawcett, T. (2013). Data science for business: What you need to know about data mining and data-analytic thinking. O’Reilly Media.

  • James, G., Witten, D., Hastie, T., & Tibshirani, R. (2021). An introduction to statistical learning: With applications in Python (2nd ed.). Springer.

  • Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning (2nd ed.). Springer.

  • VanderPlas, J. (2016). Python data science handbook. O’Reilly Media.

  • McKinney, W. (2017). Python for data analysis (2nd ed.). O’Reilly Media.

  • Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press.

  • Kelleher, J. D., Mac Namee, B., & D’Arcy, A. (2020). Fundamentals of machine learning for predictive data analytics (2nd ed.). MIT Press.

LS0tDQp0aXRsZTogIiINCmF1dGhvcjogIkNoYW5kcmEgUml6YWwgQWxhbXN5YWggKDUyMjUwMDY4KSINCmRhdGU6ICJgciBmb3JtYXQoU3lzLkRhdGUoKSwgJyVkICVCICVZJylgIg0Kb3V0cHV0Og0KICBybWRmb3JtYXRzOjpyZWFkdGhlZG93bjoNCiAgICBzZWxmX2NvbnRhaW5lZDogdHJ1ZSANCiAgICBjc3M6IGNzcyBwbHVzIGh0bWwuY3NzDQogICAgdGh1bWJuYWlsczogdHJ1ZSAgICANCiAgICBsaWdodGJveDogdHJ1ZQ0KICAgIGdhbGxlcnk6IHRydWUNCiAgICBudW1iZXJfc2VjdGlvbnM6IHRydWUNCiAgICBsaWJfZGlyOiBsaWJzDQogICAgZGZfcHJpbnQ6ICJwYWdlZCINCiAgICBjb2RlX2ZvbGRpbmc6ICJzaG93Ig0KICAgIGNvZGVfZG93bmxvYWQ6IHllcw0KICAgIA0KLS0tDQo8c3R5bGU+DQovKiA2LiBNZW1wZXJiYWlraSB0YW1waWxhbiB0YWJlbCBqaWthIGFkYSAqLw0KICB0YWJsZSB7DQogICAgYmFja2dyb3VuZC1jb2xvcjogIzI1MjUyNSAhaW1wb3J0YW50Ow0KICAgIGJvcmRlcjogMXB4IHNvbGlkICM0NDQgIWltcG9ydGFudDsNCiAgfQ0KICB0aCB7DQogICAgYmFja2dyb3VuZC1jb2xvcjogICMyZDhjZmYgIWltcG9ydGFudDsNCiAgfQ0KPC9zdHlsZT4NCjxoMSBjbGFzcz0iaGVhZGVyLXRpdGxlIj5Bc3NpZ25tZW50IFdlZWsyfkRhdGEgU2NpZW5jZSBQcm9ncmFtbWluZyBJPC9oMT4NCiAgDQogIDxkaXYgY2xhc3M9InByb2ZpbGUtY2FyZCI+DQogIDxkaXYgY2xhc3M9InByb2ZpbGUtaW1hZ2UiPg0KICA8aW1nIGlkPSJGb3RvIiBzcmM9Imh0dHBzOi8vcmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbS9jaGFuZHJhMjQwMjA1LXN1ZG8vQ2hhbmRyYTMvbWFpbi9HYW50ZW5nLmpwZyIgYWx0PSJMb2dvIiBzdHlsZT0id2lkdGg6MjAwcHg7IGRpc3BsYXk6IGJsb2NrOyBtYXJnaW46IGF1dG87Ij4NCiAgPC9kaXY+DQogIA0KICA8ZGl2IGNsYXNzPSJwcm9maWxlLWluZm8iPg0KICA8aDI+Q2hhbmRyYSBSaXphbCBBbGFtc3lhaDwvaDI+DQogIDxwPlN0dWRlbnQgTWFqb3JpbmcgaW4gRGF0YSBTY2llbmNlIGF0IElUU0I8L3A+DQogIDxwPioqTklNKio6IDUyMjUwMDY4PC9QPg0KICA8UD4qKkVtYWlsKio6IGNoYW5kcmEyNDAyMDVAZ21haWwuY29tPC9wPg0KICANCiAgPGRpdiBjbGFzcz0iYmFkZ2VzIj4NCiAgPHNwYW4gY2xhc3M9ImJhZGdlIGJhZGdlLWJsdWUiPlIgUHJvZ3JhbW1pbmc8L3NwYW4+DQogIDxzcGFuIGNsYXNzPSJiYWRnZSBiYWRnZS1yZWQiPkRhdGEgU2NpZW5jZTwvc3Bhbj4NCiAgPHNwYW4gY2xhc3M9ImJhZGdlIGJhZGdlLWdyZWVuIj5TdGF0aXN0aWNzPC9zcGFuPg0KICA8L2Rpdj4NCiAgPC9kaXY+DQogIDwvZGl2Pg0KLS0tICANCg0KIyBXaGF0IGlzIHRoZSBtYWluIHB1cnBvc2Ugb2YgZGF0YSBzY2llbmNlIHByb2dyYW1taW5nPw0KPGRpdiBjbGFzcz0iaW5mby1ib3giPg0KDQpUaGUgbWFpbiBwdXJwb3NlIG9mIGRhdGEgc2NpZW5jZSBwcm9ncmFtbWluZyBpcyB0byBwcm9jZXNzLCBhbmFseXplLCBhbmQgZXh0cmFjdCBtZWFuaW5nZnVsIGluc2lnaHRzIGZyb20gZGF0YSBpbiBvcmRlciB0byBzdXBwb3J0IGRhdGEtZHJpdmVuIGRlY2lzaW9uIG1ha2luZy4NCg0KTW9yZSBzcGVjaWZpY2FsbHksIGl0IGFpbXMgdG86DQoNCiogQ29sbGVjdCBhbmQgY2xlYW4gcmF3IGRhdGEgKGRhdGEgY29sbGVjdGlvbiBhbmQgcHJlcHJvY2Vzc2luZykNCg0KKiBQZXJmb3JtIGV4cGxvcmF0b3J5IGRhdGEgYW5hbHlzaXMgKEVEQSkNCg0KKiBCdWlsZCBzdGF0aXN0aWNhbCBhbmQgbWFjaGluZSBsZWFybmluZyBtb2RlbHMNCg0KKiBHZW5lcmF0ZSBwcmVkaWN0aW9ucyAocHJlZGljdGl2ZSBtb2RlbGluZykNCg0KKiBBdXRvbWF0ZSBhbmFseXRpY2FsIHByb2Nlc3Nlcw0KDQoqIENvbW11bmljYXRlIGZpbmRpbmdzIHRocm91Z2ggdmlzdWFsaXphdGlvbiBhbmQgcmVwb3J0cw0KDQpJbiBlc3NlbmNlLCBkYXRhIHNjaWVuY2UgcHJvZ3JhbW1pbmcgdHJhbnNmb3JtcyByYXcgZGF0YSBpbnRvIHZhbHVhYmxlIGluZm9ybWF0aW9uIHRoYXQgY2FuIGd1aWRlIHN0cmF0ZWdpYyBkZWNpc2lvbnMuDQoNCjwvZGl2Pg0KDQojIFdoeSBkbyB3ZSBsZWFybiBkYXRhIHNjaWVuY2U/DQo8ZGl2IGNsYXNzPSJpbmZvLWJveCI+DQpUaGVyZSBhcmUgc2V2ZXJhbCBmdW5kYW1lbnRhbCByZWFzb25zIHdoeSBsZWFybmluZyBkYXRhIHNjaWVuY2UgaXMgaW1wb3J0YW50Og0KDQoxLiBUaGUgbW9kZXJuIHdvcmxkIGlzIGRhdGEtZHJpdmVuDQoNCk1ham9yIGNvbXBhbmllcyBzdWNoIGFzIEdvb2dsZSwgQW1hem9uLCBhbmQgTmV0ZmxpeCByZWx5IGhlYXZpbHkgb24gZGF0YSB0bzoNCg0KKiBSZWNvbW1lbmQgcHJvZHVjdHMgYW5kIGNvbnRlbnQNCg0KKiBPcHRpbWl6ZSBhZHZlcnRpc2VtZW50cw0KDQoqIFByZWRpY3QgY3VzdG9tZXIgYmVoYXZpb3INCg0KMi4gSGlnaCBjYXJlZXIgZGVtYW5kDQoNClByb2Zlc3Npb25zIHN1Y2ggYXM6DQoNCiogRGF0YSBTY2llbnRpc3QNCg0KKiBEYXRhIEFuYWx5c3QNCg0KKiBNYWNoaW5lIExlYXJuaW5nIEVuZ2luZWVyDQoNCmFyZSBpbiBoaWdoIGdsb2JhbCBkZW1hbmQgYW5kIG9mZmVyIGNvbXBldGl0aXZlIGNvbXBlbnNhdGlvbi4NCg0KMy4gT2JqZWN0aXZlIGRlY2lzaW9uLW1ha2luZw0KDQpEYXRhIHNjaWVuY2UgZW5hYmxlcyBvcmdhbml6YXRpb25zIHRvIG1ha2UgZGVjaXNpb25zIGJhc2VkIG9uIHN0YXRpc3RpY2FsIGV2aWRlbmNlIGFuZCBxdWFudGl0YXRpdmUgYW5hbHlzaXMgcmF0aGVyIHRoYW4gaW50dWl0aW9uIGFsb25lLg0KDQo0LiBCcm9hZCBhcHBsaWNhYmlsaXR5DQoNCkRhdGEgc2NpZW5jZSBpcyByZWxldmFudCBpbiBuZWFybHkgZXZlcnkgc2VjdG9yLCBpbmNsdWRpbmcgZmluYW5jZSwgaGVhbHRoY2FyZSwgZWR1Y2F0aW9uLCBnb3Zlcm5tZW50LCB0ZWNobm9sb2d5LCBhbmQgc3BvcnRzLg0KDQo8L2Rpdj4NCg0KIyBXaGF0IHRvb2xzIHNob3VsZCBiZSBtYXN0ZXJlZCB0byBiZWNvbWUgYW4gZXhwZXJ0Pw0KPGRpdiBjbGFzcz0iaW5mby1ib3giPg0KVG8gYmVjb21lIHByb2ZpY2llbnQgaW4gZGF0YSBzY2llbmNlLCB0aGUgZm9sbG93aW5nIHRvb2xzIGFuZCBza2lsbHMgYXJlIGVzc2VudGlhbDoNCg0KMS4gUHJvZ3JhbW1pbmcgTGFuZ3VhZ2VzDQoNCiogUHl0aG9uIChtb3N0IHdpZGVseSB1c2VkKQ0KDQoqIFIgKHN0cm9uZyBpbiBzdGF0aXN0aWNhbCBhbmFseXNpcykNCg0KMi4gUHl0aG9uIExpYnJhcmllcw0KDQoqIE51bVB5IChudW1lcmljYWwgY29tcHV0aW5nKQ0KDQoqIFBhbmRhcyAoZGF0YSBtYW5pcHVsYXRpb24pDQoNCiogTWF0cGxvdGxpYiAvIFNlYWJvcm4gKGRhdGEgdmlzdWFsaXphdGlvbikNCg0KKiBTY2lraXQtbGVhcm4gKG1hY2hpbmUgbGVhcm5pbmcpDQoNCiogVGVuc29yRmxvdyAvIFB5VG9yY2ggKGRlZXAgbGVhcm5pbmcpDQoNCjMuIERhdGFiYXNlcw0KDQoqIFNRTA0KDQoqIFBvc3RncmVTUUwgLyBNeVNRTA0KDQo0LiBCaWcgRGF0YSBUZWNobm9sb2dpZXMgKGFkdmFuY2VkIGxldmVsKQ0KDQoqIEFwYWNoZSBTcGFyaw0KDQoqIEhhZG9vcA0KDQo1LiBTdXBwb3J0aW5nIFRvb2xzDQoNCiogSnVweXRlciBOb3RlYm9vaw0KDQoqIEdpdCAmIEdpdEh1Yg0KDQoqIE1pY3Jvc29mdCBFeGNlbCAoZm9yIGluaXRpYWwgYW5hbHlzaXMpDQoNCiogVGFibGVhdSAvIFBvd2VyIEJJIChidXNpbmVzcyB2aXN1YWxpemF0aW9uIHRvb2xzKQ0KDQpJbiBhZGRpdGlvbiwgYSBzdHJvbmcgZm91bmRhdGlvbiBpbjoNCg0KKiBTdGF0aXN0aWNzDQoNCiogUHJvYmFiaWxpdHkNCg0KKiBMaW5lYXIgQWxnZWJyYQ0KDQoqIEJhc2ljIENhbGN1bHVzDQoNCmlzIGNydWNpYWwgZm9yIGFjaGlldmluZyBleHBlcnRpc2UuDQo8L2Rpdj4NCg0KIyBXaGF0IGFyZSB0aGUgbWFpbiBkb21haW5zIGluIGRhdGEgc2NpZW5jZT8NCjxkaXYgY2xhc3M9ImluZm8tYm94Ij4NCkRhdGEgc2NpZW5jZSBpcyBhIGJyb2FkIGludGVyZGlzY2lwbGluYXJ5IGZpZWxkLiBTb21lIGtleSBkb21haW5zIGluY2x1ZGU6DQoNCvCflLkgTWFjaGluZSBMZWFybmluZw0KDQpEZXZlbG9waW5nIHByZWRpY3RpdmUgbW9kZWxzIHRoYXQgbGVhcm4gZnJvbSBkYXRhLg0KDQrwn5S5IEFydGlmaWNpYWwgSW50ZWxsaWdlbmNlIChBSSkNCg0KQnVpbGRpbmcgc3lzdGVtcyB0aGF0IHNpbXVsYXRlIGh1bWFuIGludGVsbGlnZW5jZS4NCg0K8J+UuSBDb21wdXRlciBWaXNpb24NCg0KQW5hbHl6aW5nIGFuZCBpbnRlcnByZXRpbmcgdmlzdWFsIGRhdGEgc3VjaCBhcyBpbWFnZXMgYW5kIHZpZGVvcy4NCg0K8J+UuSBOYXR1cmFsIExhbmd1YWdlIFByb2Nlc3NpbmcgKE5MUCkNCg0KRW5hYmxpbmcgY29tcHV0ZXJzIHRvIHVuZGVyc3RhbmQgYW5kIHByb2Nlc3MgaHVtYW4gbGFuZ3VhZ2UuDQoNCvCflLkgRGF0YSBFbmdpbmVlcmluZw0KDQpEZXNpZ25pbmcgYW5kIG1haW50YWluaW5nIGRhdGEgcGlwZWxpbmVzIGFuZCBpbmZyYXN0cnVjdHVyZS4NCg0K8J+UuSBCdXNpbmVzcyBJbnRlbGxpZ2VuY2UNCg0KVXNpbmcgZGF0YSBhbmFseXRpY3MgdG8gc3VwcG9ydCBzdHJhdGVnaWMgYnVzaW5lc3MgZGVjaXNpb25zLg0KDQrwn5S5IEZpbmFuY2lhbCBBbmFseXRpY3MNCg0KUmlzayBtb2RlbGluZyBhbmQgbWFya2V0IGZvcmVjYXN0aW5nLg0KDQrwn5S5IEhlYWx0aGNhcmUgQW5hbHl0aWNzDQoNCkRpc2Vhc2UgcHJlZGljdGlvbiBhbmQgbWVkaWNhbCBkYXRhIGFuYWx5c2lzLg0KPC9kaXY+DQoNCiMgUmVmZXJlbmNlIExpc3QNCjxkaXYgY2xhc3M9ImluZm8tYm94Ij4NCiogRGF2ZW5wb3J0LCBULiBILiwgJiBQYXRpbCwgRC4gSi4gKDIwMTIpLiBEYXRhIHNjaWVudGlzdDogVGhlIHNleGllc3Qgam9iIG9mIHRoZSAyMXN0IGNlbnR1cnkuIEhhcnZhcmQgQnVzaW5lc3MgUmV2aWV3LCA5MCgxMCksIDcw4oCTNzYuDQoNCiogUHJvdm9zdCwgRi4sICYgRmF3Y2V0dCwgVC4gKDIwMTMpLiBEYXRhIHNjaWVuY2UgZm9yIGJ1c2luZXNzOiBXaGF0IHlvdSBuZWVkIHRvIGtub3cgYWJvdXQgZGF0YSBtaW5pbmcgYW5kIGRhdGEtYW5hbHl0aWMgdGhpbmtpbmcuIE/igJlSZWlsbHkgTWVkaWEuDQoNCiogSmFtZXMsIEcuLCBXaXR0ZW4sIEQuLCBIYXN0aWUsIFQuLCAmIFRpYnNoaXJhbmksIFIuICgyMDIxKS4gQW4gaW50cm9kdWN0aW9uIHRvIHN0YXRpc3RpY2FsIGxlYXJuaW5nOiBXaXRoIGFwcGxpY2F0aW9ucyBpbiBQeXRob24gKDJuZCBlZC4pLiBTcHJpbmdlci4NCg0KKiBIYXN0aWUsIFQuLCBUaWJzaGlyYW5pLCBSLiwgJiBGcmllZG1hbiwgSi4gKDIwMDkpLiBUaGUgZWxlbWVudHMgb2Ygc3RhdGlzdGljYWwgbGVhcm5pbmcgKDJuZCBlZC4pLiBTcHJpbmdlci4NCg0KKiBWYW5kZXJQbGFzLCBKLiAoMjAxNikuIFB5dGhvbiBkYXRhIHNjaWVuY2UgaGFuZGJvb2suIE/igJlSZWlsbHkgTWVkaWEuDQoNCiogTWNLaW5uZXksIFcuICgyMDE3KS4gUHl0aG9uIGZvciBkYXRhIGFuYWx5c2lzICgybmQgZWQuKS4gT+KAmVJlaWxseSBNZWRpYS4NCg0KKiBHb29kZmVsbG93LCBJLiwgQmVuZ2lvLCBZLiwgJiBDb3VydmlsbGUsIEEuICgyMDE2KS4gRGVlcCBsZWFybmluZy4gTUlUIFByZXNzLg0KDQoqIEtlbGxlaGVyLCBKLiBELiwgTWFjIE5hbWVlLCBCLiwgJiBE4oCZQXJjeSwgQS4gKDIwMjApLiBGdW5kYW1lbnRhbHMgb2YgbWFjaGluZSBsZWFybmluZyBmb3IgcHJlZGljdGl2ZSBkYXRhIGFuYWx5dGljcyAoMm5kIGVkLikuIE1JVCBQcmVzcy4NCg0KPC9kaXY+