500 - AI, Machine Learning, and LLM

500 - AI Concepts and Common Tools

AI kinds
- Symbolic AI - The collection of all methods in artificial intelligence research that are based on high-level symbolic (human-readable) representations of problems, logic and search
- Generative AI - A subset of artificial intelligence that uses generative models to produce text, images, videos, or other forms of data
- Causal AI - A technique in artificial intelligence that builds a causal model and can thereby make inferences using causality rather than just correlation
Data/AI tools
- DVC - Data Version Control
- Mojo - The programming language for all AI developers
Data/AI frameworks
- Streamlit - A faster way to build and share data apps
- Chainlit - An open-source Python package to build production ready Conversational AI
Data/AI Platforms
- OpenWebUI - An extensible, feature-rich, and user-friendly self-hosted AI platform designed to operate entirely offline
- Dify - An open-source LLM app development platform
Supporting Services
- Firecrawl - An API service that takes a URL, crawls it, and converts it into clean markdown or structured data
- Tavily Search - A search engine optimized for LLMs, aimed at efficient, quick and persistent search results

520 - Natural Language Processing

Foundational Linguistics Fields
Core NLP Concepts & Techniques
Vector Representations (Embeddings)
- Word embedding
  - Word2vec
  - fastText - Library for efficient text classification and representation learning
  - GloVe - Global Vectors for Word Representation
- Sentence embedding
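
Embeddings are usually compared by the angle between their vectors. The sketch below shows cosine similarity in plain Python. The three-dimensional vectors in `toy_vectors` are invented for illustration and do not come from Word2vec, fastText, or GloVe; real models produce vectors with hundreds of dimensions.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical toy "embeddings", chosen by hand for this example only.
toy_vectors = {
    "king": [0.9, 0.8, 0.1],
    "queen": [0.85, 0.75, 0.2],
    "apple": [0.1, 0.2, 0.9],
}

royal = cosine_similarity(toy_vectors["king"], toy_vectors["queen"])
fruit = cosine_similarity(toy_vectors["king"], toy_vectors["apple"])
print(royal > fruit)
```

A value near 1 means the vectors point in nearly the same direction; a value near 0 means they are close to orthogonal.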
Libraries & tools
- General Purpose
  - Natural Language Toolkit - A leading platform for building Python programs to work with human language data
  - Gensim - A free open-source Python library for representing documents as semantic vectors
  - wego - The implementations from scratch for word embeddings (a.k.a word representation) models in Go
- Morphological Analyzers / Tokenizers
  - Kuromoji - An open source Japanese morphological analyzer written in Java
  - Kagome - An open source Japanese morphological analyzer written in pure Go
  - mecab-python3 - A Python wrapper for the MeCab morphological analyzer for Japanese text
  - jieba - A Python module for Chinese text segmentation

530 - Machine Learning

Paradigms
- Supervised learning - A paradigm in machine learning where algorithms learn from labeled data
  - Decision tree learning - The method using a decision tree as a predictive model to go from observations about an item to conclusions about the item's target value
  - Ensemble learning - The method using multiple learning algorithms to obtain better predictive performance than could be obtained from any of the constituent learning algorithms alone
    - Random forest - An ensemble learning method for classification, regression and other tasks that operates by constructing a multitude of decision trees at training time
  - Support vector machine - The supervised learning models with associated learning algorithms that analyze data for classification and regression analysis
  - Classification - The problem of identifying which of a set of categories (sub-populations) a new observation belongs to, on the basis of a training set of data containing observations
    - Logistic regression - A statistical model that models the probability of an event taking place by having the log-odds for the event be a linear combination of one or more independent variables
    - ROC curve - A graphical plot that illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied
    - Naive Bayes classifier - A family of simple probabilistic classifiers based on applying Bayes' theorem with strong (naive) independence assumptions between the features
  - Regression - A set of statistical processes for estimating the relationships between a dependent variable and one or more independent variables
    - Ordinary least squares - A type of linear least squares method for choosing the unknown parameters in a linear regression model
    - Generalized linear model - A flexible generalization of ordinary least squares regression
    - ARIMA model - A generalization of an autoregressive moving average (ARMA) model, fitted to time series data either to better understand the data or to predict future points in the series
- Unsupervised learning - A type of machine learning in which models find patterns in an unlabeled dataset without being given target outputs
  - K-means clustering - A method of vector quantization that aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean
- Reinforcement learning - An area of machine learning concerned with how intelligent agents ought to take actions in an environment in order to maximize the notion of cumulative reward
  - Markov decision process - The mathematical framework for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker
  - Multi-armed bandit - A problem in which a fixed limited set of resources must be allocated between competing (alternative) choices in a way that maximizes their expected gain
  - Value function - A function used in mathematical optimization and reinforcement learning that assigns a measure of desirability to states or actions
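
As one concrete instance of the paradigms above, K-means clustering can be sketched with Lloyd's algorithm: assign each observation to its nearest mean, then recompute the means. This is a minimal sketch for 2-D points, assuming the caller supplies the initial centroids; the function name `kmeans` and the sample data are illustrative, and library implementations such as scikit-learn's add smarter initialization and convergence checks.

```python
def kmeans(points, centroids, iterations=10):
    """Lloyd's algorithm on 2-D points with caller-supplied initial centroids."""
    centroids = [tuple(c) for c in centroids]
    assignments = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid
        # (squared Euclidean distance).
        assignments = [
            min(
                range(len(centroids)),
                key=lambda k: (p[0] - centroids[k][0]) ** 2
                + (p[1] - centroids[k][1]) ** 2,
            )
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for k, old in enumerate(centroids):
            members = [p for p, a in zip(points, assignments) if a == k]
            if not members:
                new_centroids.append(old)  # leave an empty cluster where it is
                continue
            new_centroids.append((
                sum(p[0] for p in members) / len(members),
                sum(p[1] for p in members) / len(members),
            ))
        if new_centroids == centroids:
            break  # no centroid moved, so the assignments are stable
        centroids = new_centroids
    return centroids, assignments


# Illustrative data: two well-separated groups of three points each.
points = [(1.0, 1.0), (1.0, 2.0), (2.0, 1.0), (8.0, 8.0), (9.0, 8.0), (8.0, 9.0)]
centroids, assignments = kmeans(points, [(0.0, 0.0), (10.0, 10.0)])
print(assignments)
```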
Concepts & Techniques
- Hyperparameter - A parameter whose value is used to control the learning process
- Hyperparameter optimization - The problem of choosing a set of optimal hyperparameters for a learning algorithm
- Embedding - A representation learning technique that maps complex, high-dimensional data into a lower-dimensional vector space of numerical vectors
- Early stopping - A form of regularization used to avoid overfitting when training a learner with an iterative method, such as gradient descent
- Cross-validation - Any of various similar model validation techniques for assessing how the results of a statistical analysis will generalize to an independent data set
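
To make cross-validation concrete, the sketch below generates the index splits for k-fold cross-validation. It assumes contiguous folds with no shuffling, which is a simplification; the helper name `k_fold_indices` is hypothetical and not taken from any library.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for each of k contiguous folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    # Spread the remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        stop = start + size
        validation = list(range(start, stop))
        train = list(range(0, start)) + list(range(stop, n_samples))
        yield train, validation
        start = stop


folds = list(k_fold_indices(10, 3))
for train, validation in folds:
    print(len(train), validation)
```

Each sample lands in the validation set exactly once, so averaging the k validation scores gives an estimate of how the model generalizes to data it was not trained on.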
Applications & Problem Domains
- Anomaly detection - The identification of rare items, events or observations which raise suspicions by differing significantly from the majority of the data
  - One-class classification - The technique trying to identify objects of a specific class amongst all objects, by primarily learning from a training set containing only the objects of that class
- Recommender system - An information filtering system that seeks to predict the 'rating' or 'preference' a user would give to an item
Related Fields
- Mathematical model - An abstract description of a concrete system using mathematical concepts and language
- Mathematical optimization - The selection of a best element, with regard to some criteria, from some set of available alternatives
Frameworks, Platforms & Tools
- scikit-learn - A free software machine learning library for the Python programming language
  - libsvm - A Library for Support Vector Machines
- ML.NET - An open-source, cross-platform machine learning framework for .NET developers
- Crab - A Python library for building recommender systems
- Gradio - The fastest way to demo your machine learning model with a friendly web interface so that anyone can use it, anywhere
- Cloud Platforms
  - Azure Machine Learning - An enterprise-grade machine learning service to build and deploy models faster
  - Amazon SageMaker - The service to build, train, and deploy machine learning (ML) models for any use case with fully managed infrastructure, tools, and workflows
- MLOps
  - CML - An open-source tool for implementing continuous integration & delivery (CI/CD) in machine learning projects
  - MLflow - An open source platform to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry
  - Kubeflow - The Machine Learning Toolkit for Kubernetes, dedicated to making deployments of ML workflows on Kubernetes simple, portable and scalable

540 - Deep Neural Networks

Neural network - The computational models used in machine learning for finding patterns in data
- Tensor - The mathematical objects represented as multidimensional arrays used in machine learning
  - Sigmoid function - A mathematical function having a characteristic 'S'-shaped curve or sigmoid curve
  - Softmax function - A function that converts a vector of K real numbers into a probability distribution of K possible outcomes
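- Backpropagation - A widely used algorithm for training feedforward neural networks that computes the gradient of the loss with respect to each weight by applying the chain rule backward through the layers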
- Autoencoder - A type of artificial neural network used to learn efficient codings of unlabeled data (unsupervised learning)
- Vanishing gradient problem - The difficulty encountered when training artificial neural networks with gradient-based learning methods and backpropagation, where gradients shrink as they back-propagate
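
The sigmoid and softmax functions listed above are short enough to write out directly. This is a plain-Python sketch, assuming scalar and list inputs; deep learning frameworks provide vectorized versions that operate on tensors.

```python
import math


def sigmoid(x):
    """Logistic sigmoid 1 / (1 + e^-x), arranged to avoid overflow for large |x|."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)  # x is negative here, so exp(x) cannot overflow
    return z / (1.0 + z)


def softmax(values):
    """Map K real numbers to K probabilities that sum to 1."""
    shift = max(values)  # subtracting the max keeps exp() from overflowing
    exps = [math.exp(v - shift) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


probs = softmax([1.0, 2.0, 3.0])
print(sigmoid(0.0), probs)
```

Subtracting the maximum before exponentiating does not change the result, because softmax is invariant to adding a constant to every input.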
Deep Learning - A part of a broader family of machine learning methods based on artificial neural networks with representation learning
- Stochastic gradient descent - An iterative method for optimizing an objective function with suitable smoothness properties
- Fine tuning - An approach to transfer learning in which the weights of a pre-trained model are trained on new data
- Recurrent neural network - A class of artificial neural networks where connections between nodes can create cycles, allowing output from some nodes to affect subsequent input to the same nodes
  - LSTM - Long short-term memory, a recurrent neural network architecture that uses gated memory cells to mitigate the vanishing gradient problem and learn long-range dependencies
- Attention - A technique in the context of neural networks that mimics cognitive attention, enhancing the important parts of the input data and fading out the rest
  - Transformer - A deep learning architecture based on the multi-head attention mechanism
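
The attention entry above can be illustrated with scaled dot-product attention for a single query, the building block that Transformers repeat across multiple heads. This is a minimal sketch using plain lists; the function name `attention` and the toy vectors are assumptions for illustration, and the learned projection matrices and multiple heads are omitted.

```python
import math


def softmax(values):
    """Map real-valued scores to weights that sum to 1."""
    shift = max(values)
    exps = [math.exp(v - shift) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def attention(query, keys, values):
    """Scaled dot-product attention for one query vector.

    Returns (output, weights), where weights[i] is how strongly the
    query attends to position i.
    """
    scale = math.sqrt(len(query))
    # Similarity of the query to every key, scaled by sqrt(dimension).
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    # The output is the weighted average of the value vectors.
    dim = len(values[0])
    output = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]
    return output, weights


# Toy example: the query matches the first key more closely than the second.
output, weights = attention(
    query=[1.0, 0.0],
    keys=[[1.0, 0.0], [0.0, 1.0]],
    values=[[10.0, 0.0], [0.0, 10.0]],
)
print(weights, output)
```

The weights play the role described in the entry: positions whose keys resemble the query contribute more to the output, and the rest are faded out.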
Frameworks
- TensorFlow - An end-to-end open source platform for machine learning
  - TFDS - The collection of datasets ready to use with TensorFlow or other Python ML frameworks like Jax
  - Keras - The Python Deep Learning API designed for human beings, not machines
- PyTorch - An open source machine learning framework that accelerates the path from research prototyping to production deployment
Textbooks
- Neural Networks and Deep Learning - A free online book explaining the core ideas behind neural networks and deep learning
- Deep Learning, MIT Press - The textbook intended to help students and practitioners enter the field of machine learning in general and deep learning in particular

550 - Large Language Models and Agents

Model Providers
- Anthropic - The API providing access to Anthropic's Claude models
- OpenAI - The platform for building applications with OpenAI's models
- Gemini Developer API - The API that gives you access to the latest Gemini models from Google
Hosting Platforms & Aggregators
- Vertex AI - A machine learning (ML) platform for training and deploying ML models and AI applications
- Amazon Bedrock - A fully managed service offering a choice of high-performing foundation models
- Azure OpenAI Service - The service providing REST API access to OpenAI's powerful language models
- Hugging Face Serverless Inference API - The API allowing inference on models hosted on the Hugging Face Hub
- OpenRouter - A unified interface for LLMs
Local LLM Deployment
- Ollama - A tool designed for deploying and managing large language models (LLMs) locally
- LM Studio - A desktop app for developing and experimenting with LLMs locally on your computer
- LocalAI - The free, Open Source OpenAI alternative
Open Models
- Llama - The open-source AI models you can fine-tune, distill and deploy anywhere
- Gemma - A family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models
- Mistral - A family of open-source and commercial generative AI models
- OLMo - A state-of-the-art, truly open language model and framework to build and study the science of language models
Standards
- Model Formats
  - GGUF - A file format for storing models for inference with GGML and executors based on GGML
  - ONNX - An open format built to represent machine learning models
  - Safetensors - A simple format for storing tensors safely
- Protocols
  - Model Context Protocol (MCP) - An open protocol that standardizes how applications provide context to LLMs
  - A2A Protocol - An open protocol (Agent2Agent) for communication and interoperability between AI agents built on different frameworks
Techniques
- Retrieval-augmented generation (RAG)
SDKs
- Go OpenAI - The Go client libraries for OpenAI API
- Ruby OpenAI - A Ruby wrapper for the OpenAI API
- Google Gen AI SDK - The Python SDK for Google's generative AI models
- OmniAI - A minimalist library for interfacing with LLMs
- LiteLLM - A Python SDK and Proxy Server to call over 100 LLM APIs using the OpenAI format
Platforms and Tools
- OpenHands - A platform for software development agents powered by AI
- LangChain - A framework for developing applications powered by language models
  - LangGraph - A library for building stateful, multi-actor applications with LLMs
- Semantic Kernel - A lightweight, open-source development kit that lets you easily build AI agents and integrate the latest AI models
- LLM - A CLI utility and Python library for interacting with Large Language Models
- FastMCP v2
Evaluation and Visualization
- SWE-bench - A benchmark for evaluating large language models on real world software issues collected from GitHub
- Chatbot Arena - A crowdsourced open platform for evaluating LLMs
- AttentionViz - A Global View of Transformer Attention
- BertViz - A tool for visualizing Attention in NLP Models
Prompt Engineering
- ReAct Prompting - A prompting technique synergizing reasoning and acting in language models
- Zero-shot and Few-shot Prompting
- Chain-of-Thought (CoT) Prompting
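
Zero-shot and few-shot prompting differ only in whether worked examples precede the query. The sketch below assembles such a prompt as plain text. The `Input:` / `Output:` layout and the helper name `build_few_shot_prompt` are assumptions for illustration, not a format required by any provider.

```python
def build_few_shot_prompt(instruction, examples, query):
    """Assemble a plain-text prompt; with no examples it is a zero-shot prompt."""
    lines = [instruction, ""]
    for text, label in examples:
        lines += [f"Input: {text}", f"Output: {label}", ""]
    # The final block is left open so the model completes the output.
    lines += [f"Input: {query}", "Output:"]
    return "\n".join(lines)


zero_shot = build_few_shot_prompt(
    "Classify the sentiment as positive or negative.", [], "I loved it."
)
few_shot = build_few_shot_prompt(
    "Classify the sentiment as positive or negative.",
    [("Great service.", "positive"), ("Never again.", "negative")],
    "I loved it.",
)
print(few_shot)
```

The resulting string can be sent as the user message through any of the SDKs listed above.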

570 - Computer Vision (WIP)

Core Concepts
- Vision Language Models (VLM) - A class of multimodal models that take both images and text as input
- Convolutional neural network (CNN) - A class of artificial neural network, most commonly applied to analyze visual imagery
Software, Libraries and Tools
- General computer vision
  - OpenCV - An open source computer vision and machine learning software library
    - GoCV - A package for the Go programming language with bindings for OpenCV 4
- Optical Character Recognition (OCR)
  - Tesseract OCR - An open source text recognition (OCR) Engine
    - gosseract OCR - A Go package for OCR (Optical Character Recognition), by using Tesseract C++ library
  - EasyOCR - A ready-to-use OCR with 80+ supported languages and all popular writing scripts
  - OCRmyPDF - A tool to add a searchable OCR text layer to PDF files
