Notes: Fundamentals of Machine Learning

Posts

February 22, 2025

I am upskilling on AI and machine learning on Datacamp, a learning platform for data, analytics, and AI skills. These notes help me to understand and remember what I’m learning.

Machine Learning

A set of tools for making inferences and predictions from data

Reinforcement Learning Models

Supervised Learning Models

Training data contains labeled Target Variables and as many observations with relevant Features as possible
Workflow: extract features; split data to test and training datasets; train the model; evaluate the model

Classification models

assigns a category

Support vector machine - linear classifier
Support vector machine - polynomial classifier: used when data is not linearly separable. Uses a “Polynomial Kernel” trick that implicitly comutes transformations into higher-dimensional spaces, without explicitly performing the transformation: $$ K(x, y) = (\gamma \cdot x^T y + r)^d $$ where x and y rare input vectors; γ is a scaling parameter; r is a constant term; d is the degree of the polynomial.

Regression models

Assigns a continuous variable

Unsupervised Learning Models

Training data does not contained labeled data as a target variable: only features: useful for clustering and anomaly detection. Use cases:

Clustering

K Means: specify number of clusters
Density-based spacial clustering of applications with noise (DBSCAN): specify meaning of a cluster

Association

find relationships between observations

Anomaly detection

find outliers, which is tough in higher dimensions

Performance Improvement

Dimensionality reduction

reduce features, removing features that are correlated or irrelevant

hyperparameter tuning

Various SVM algorithm options: kernel (linear, poly); C; degree; gamma; shrinking; coef0; tol; …

Ensemble methods

Use multiple models

Deep Learning

Inspired Neural Networks; a subset of Machine Learning, requires vast amounts of data

Computer Vision

Thanks, deepfakes come in here.

Natural Language Processing (NLP)

Bag of words: count word frequency into n-grams; doesn’t consider similar meanings
word embeddings: mathematical meanings for groups of similar words

Updated on February 22, 2025

A garden plan for 2025 Host a human-centered project kickoff for project health and happiness