Machine Learning: process of turning data into actionable knowledge for task support and decision making

Define problem → Model data → Evaluate → Deploy

  1. Basic Math for computational data analysis
  2. Unsupervised learning(EDA: exclusive data analysis) for data exploration
  3. Supervised learning for predictive analysis

Entire dataset = Training dataset

These matrices are the raw data → we create a ML model using this training dataset

Unsupervised Learning:

Clustering Analysis

K-means

Gaussian mixture model

Hierarchical clustering

Density-based clustering

Evaluation of clustering algorithms

Dimension Reduction: reduce the dimensions of the dataset b/c data collection with multiple features cost heavily