graph TD
A[Machine Learning] --> B[Supervised]
A --> C[Unsupervised]
A --> D[Semi-Supervised]
B --> B1[Regression]
B --> B2[Classification]
2025-07-26
AI (Artificial Intelligence) is a branch of computer science that focuses on creating systems or machines that can perform tasks that normally require human intelligence.
ML (Machine Learning) is a subset of Artificial Intelligence (AI) that focuses on building systems that can learn from data and improve their performance without being explicitly programmed.
\[ Y_i = \beta_0 + \beta_1 X_{i1} + \beta_2 X_{i2} + \dots + \beta_p X_{ip} + \epsilon_i \]
Where:
- \(Y_i\) – dependent variable
- \(X_{ij}\) – predictor variables
- \(\beta_0\) – intercept
- \(\beta_j\) – regression coefficients
- \(\epsilon_i\) – error term
\[ \mathbf{Y} = \mathbf{X}\boldsymbol{\beta} + \boldsymbol{\epsilon} \]
Where:
- \(\mathbf{Y}\) : \(n \times 1\) response vector
- \(\mathbf{X}\) : \(n \times (p+1)\) design matrix
- \(\boldsymbol{\beta}\) : coefficients vector
- \(\boldsymbol{\epsilon}\) : error vector
Model:
\[ Y = X\beta + \epsilon \]
Goal: Minimize
\[ SSE = (Y - X\beta)^\top (Y - X\beta) \]
Normal Equations:
\[ X^\top X \hat{\beta} = X^\top Y \]
Solution:
\[ \hat{\beta} = (X^\top X)^{-1} X^\top Y \]
Predictions & Residuals:
\[ \hat{Y} = X\hat{\beta}, \quad e = Y - \hat{Y} \]
graph TD
A[Machine Learning] --> B[Supervised]
A --> C[Unsupervised]
A --> D[Semi-Supervised]
B --> B1[Regression]
B --> B2[Classification]
Supervised Learning is a type of machine learning where a model is trained on a labeled dataset (input data with correct outputs) so that it can predict outcomes for new, unseen data.
Unsupervised Learning is a type of machine learning where the model is trained on unlabeled data and tries to find hidden patterns or structures.
One common approach is clustering, where the algorithm groups similar data points so that points in the same cluster are more similar to each other than to those in other clusters.
Examples: Customer segmentation, grouping news articles, market basket analysis.
Semi-Supervised Learning is a machine learning approach that uses both labeled and unlabeled data for training.
Typically, a small portion of labeled data is combined with a large amount of unlabeled data.
It helps when labeling data is costly or time-consuming, but unlabeled data is abundant.
Examples:
Supervised learning uses various algorithms to learn from labeled data. These tools are mainly divided into Regression and Classification methods.
Unsupervised learning works on unlabeled data to find patterns or structures.
The main approaches are clustering and dimensionality reduction.
Generative AI is a branch of Artificial Intelligence that focuses on creating new content—such as text, images, audio, video, or code—based on patterns learned from existing data.
It uses generative models (like GPT, DALL·E, Stable Diffusion) to produce outputs that resemble human-created data.
Generative AI has several branches based on the type of content it generates: