Unit 1: Introduction
2025-07-12
## Course Details {.scrollable}
Su | Mn | Tu | We | Th | Fr | Sa |
---|---|---|---|---|---|---|
1 | h | h | 4 | [5] | 6 | 7 |
8 | [9] | 10 | 11 | 12 | 13 | [14]+ |
15 | [16] | 17 | 18 | [19]+ | 20 | 21 |
22 | [23]+ | 24 | 25 | 26 | [[27]] | 28 |
29 | [30]+ |
Su | Mn | Tu | We | Th | Fr | Sa |
---|---|---|---|---|---|---|
1 | 2 | – | – | – | ||
6 | – | – | – | h | h | – |
13 | – | – | – | – | 18 | [19]+ |
20 | [21]+ | 22 | 23 | 24 | [[25]] | 26 |
27 | 28 | 29 | 30 | 31 |
\[\small\begin{matrix}{}^{100}&&{}^{80}&&{}^{75} && {}^{70} && {}^{65} && {}^{60} && {}^{55} && {}^{50} && {}^{0}\\ |&A&|&B^+&|&B&|&C^+&|&C&|&D^+&|&D&|& F&|\\ \end{matrix}\]
Late penalty
Days late | 0 | 1 | 2 | 3 |
---|---|---|---|---|
Grade reduced | 0% | 25% | 50% | 75% |
Automatic point deductions may apply for:
Note: Suspected unoriginal work may be subject to academic integrity procedures outlined in the course syllabus.
https://www.facebook.com/share/r/16ZzjQbE7D/
Data Mining: The process of discovering patterns, insights, and anomalies from large datasets. The information extracted can be used for dataset development, decision-making, prediction, and/or understanding.
Data Science: An interdisciplinary field that combines statistical methods, data manipulation and analysis and domain expertise to extract knowledge and insights from data. Data scientists are involved in the entire data lifecycle, from collection and cleaning to analysis, modeling, and communication of results.
Artificial Intelligence (AI): A broad field of computer science that focuses on creating intelligent agents, to assess the situation and take actions to advance towards achieving defined goals. The goal of AI is to enable machines to simulate human intelligence.
Machine Learning (ML): A subfield of Artificial Intelligence that focuses on developing algorithms that allow computers to “learn” from data without being explicitly programmed. Instead of following rigid rules, ML models learn patterns and make predictions or decisions based on the data they are trained on. AI can be either supervised, unsupervised, and reinforcement learning.
Deep Learning (DL): A subfield of Machine Learning that uses artificial neural networks with multiple layers. Deep learning excels at learning complex patterns from large datasets, used in areas like image recognition, speech recognition, and natural language processing.
Big Data: Refers to extremely large datasets that may be analyzed computationally to reveal subtle patterns, trends, and associations, especially relating to human behavior and interactions. These datasets often exceed the size and speed of memory.
Business Intelligence (BI): A set of strategies and technologies used for the data analysis of business information to provide actionable insights that help organizations make better business decisions. Reports and dashboards used to focused on descriptive analytics (what happened) rather than predictive or prescriptive analytics.
Natural Language Processing (NLP): A subfield of AI that focuses on enabling computers to understand, interpret, and generate human language. NLP techniques are used in applications like spam detection, sentiment analysis, machine translation, and chatbots.
Computer Vision: A subfield of AI that enables computers to “see” and interpret visual information such as images and videos. Applications include facial recognition, object detection, autonomous vehicles, and medical image analysis.
Predictive Analytics: A type of data analytics that uses historical data, statistical algorithms, and machine learning techniques to identify the likelihood of future outcomes based on historical data. Its goal is to predict what will happen.
Prescriptive Analytics: The next step beyond predictive analytics, focusing on determining the best course of action for a given situation. It not only predicts what will happen but also suggests why it will happen and recommends actions to take to achieve a desired outcome or mitigate a risk.
Reinforcement Learning (RL): A type of machine learning where an agent learns to make decisions by performing actions in an environment and receiving rewards or penalties. The goal is to maximize the cumulative reward over time. It’s often used in robotics, game playing (like AlphaGo), and autonomous systems.
Data Engineering: The aspect of data science that focuses on the practical applications of data collection, storage, processing, and transformation. Data engineers build and maintain data pipelines, databases, and data warehouses to allow convenient access and efficient use.
Ethics in AI/Data Science: Moral implications, societal impact, responsible development and deployment of AI and data-driven systems in attempt to address concerns like bias, fairness, privacy, transparency, and accountability.
Package manager to facilitate loading and updating software libraries
Extensive collection of modules and packages for a wide range of functions (maps, data manipulation, etc.)
Active support and continued development from academic and corporate users community
Integrated Development Environment and Data Workbook
Feature | R | Python |
---|---|---|
Overview | R is a language and environment for statistical programming which includes statistical computing and data graphics. | Python is a general-purpose programming language for data analysis, scientific computing and application development. Simplify program complexity using common approaches. |
Design Objective | Designed by statisticians for data analysis, modelling and representation for both batch computation and interactive websites. Designed for simplifying complex mathematics and statistics. | Designed by engineers and computer scientists to develop GUI, web and embedded hardware applications |
Key applications | Forecasting, Data Visualization, Machine Learning | Data collection, Computer Vision, Data machines learning |
[]
**by Robert Batzinger, Emeritus Instructor,
\[\begin{equation}N = \left\lfloor D \times \pi \right\rceil\rlap{\qquad\qquad (1)}\end{equation}\] Rational estimate of Pi is calculated as
\[\begin{equation}\pi_{est} = \frac{N}{D}\rlap{\qquad\qquad (2)}\end{equation}\]
\[\begin{equation}\chi ^ 2 = \frac{(\pi_{est}-\pi)^2}{\pi}\rlap{\qquad\qquad (3)}\end{equation}\] \[\begin{equation}\epsilon = \left|\pi_{est} - \pi\right|\rlap{\qquad\qquad (4)}\end{equation}\]
IT408