Comprehensive Inventory of Algorithms, Tools & Math Concepts Used in Modern AI Agentic Systems
A. Machine Learning (ML)
Algorithms
Foundational:
o Linear Regression, Logistic Regression
o k-Nearest Neighbors (k-NN)
o Naive Bayes
o Decision Trees, Random Forests
o Support Vector Machines (SVM)
o Principal Component Analysis (PCA)
Advanced/Emerging:
o Gradient Boosting Machines (XGBoost, LightGBM, CatBoost)
o Stochastic Gradient Descent
o Ensemble Methods (Bagging, Stacking, Blending)
o AutoML (Auto-sklearn, TPOT, H2O AutoML)
o Bayesian Optimization for hyperparameter tuning
o Active Learning
Programming Tools & Libraries
Python: Scikit-learn, XGBoost, LightGBM, CatBoost, H2O, TPOT, Dask-ML
R: caret, mlr, randomForest
Enterprise: DataRobot, Azure ML, IBM Watson ML
Mathematical Concepts
Probability Theory, Bayesian Inference
Linear Algebra, Matrix Factorization
Calculus (Optimization, Gradients)
Convex Optimization
Information Theory (Entropy, KL-divergence)
Statistical Learning Theory
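The information-theory quantities listed above can be computed directly for small discrete distributions. The following is a minimal sketch, assuming distributions are given as plain lists of probabilities; the function names are illustrative and not taken from any library.

```python
import math

def entropy(p):
    """Shannon entropy in bits of a discrete distribution p (probabilities summing to 1)."""
    return -sum(pi * math.log2(pi) for pi in p if pi > 0)

def kl_divergence(p, q):
    """KL(p || q) in bits. Assumes q[i] > 0 wherever p[i] > 0."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

fair = [0.5, 0.5]      # fair coin
biased = [0.9, 0.1]    # heavily biased coin
print(entropy(fair))                 # a fair coin carries exactly 1 bit
print(kl_divergence(biased, fair))   # positive: the two distributions differ
print(kl_divergence(fair, fair))     # zero: a distribution compared with itself
```

KL-divergence is not symmetric, so kl_divergence(p, q) and kl_divergence(q, p) generally differ.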
B. Deep Learning
Algorithms
Foundational:
o Feedforward Neural Networks (MLP)
o Convolutional Neural Networks (CNN)
o Recurrent Neural Networks (RNN, LSTM, GRU)
Advanced/Emerging:
o Transformers (BERT, GPT, ViT, T5)
o Graph Neural Networks (GCN, GAT, GraphSAGE)
o Self-Supervised Learning (SimCLR, MoCo)
o Neural Architecture Search (NAS)
o Diffusion Models, GANs (StyleGAN, BigGAN)
o Attention Mechanisms
Programming Tools & Libraries
TensorFlow, Keras, PyTorch, JAX, MXNet, PaddlePaddle
Hugging Face Transformers, FastAI, DeepSpeed, ONNX
Nvidia CUDA/cuDNN, TensorRT (for GPU acceleration)
Mathematical Concepts
Backpropagation, Chain Rule
Stochastic Optimization (Adam, RMSProp, SGD)
Loss Functions (Cross-entropy, MSE, Huber)
Non-linear Activation Functions (ReLU, GELU, Swish)
Regularization (Dropout, Weight Decay, BatchNorm)
Manifold Learning
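The three loss functions named above differ mainly in how they penalize large errors. A minimal pure-Python sketch follows, assuming targets and predictions are plain lists of numbers; deep learning frameworks ship their own batched, differentiable versions.

```python
import math

def mse(y_true, y_pred):
    """Mean squared error: penalizes large residuals quadratically."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def huber(y_true, y_pred, delta=1.0):
    """Huber loss: quadratic for residuals up to delta, linear beyond it."""
    total = 0.0
    for t, p in zip(y_true, y_pred):
        r = abs(t - p)
        total += 0.5 * r * r if r <= delta else delta * (r - 0.5 * delta)
    return total / len(y_true)

def cross_entropy(probs, target_index):
    """Negative log-likelihood of the true class given predicted class probabilities."""
    return -math.log(probs[target_index])
```

For example, with a residual of 3 and delta=1.0, the squared error contributes 9 while Huber contributes 2.5, which is why Huber is less sensitive to outliers.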
C. Natural Language Processing (NLP)
Algorithms/Models
Bag-of-Words, TF-IDF
Word Embeddings (Word2Vec, GloVe, FastText)
Sequence-to-Sequence Models (Seq2Seq, Attention)
Transformer-based Models (BERT, GPT, T5, XLNet, RoBERTa, LLaMA, Mistral)
Summarization (Extractive, Abstractive)
Named Entity Recognition (NER), POS Tagging
Text Classification, Sentiment Analysis
Question Answering, Machine Translation
Programming Tools & Libraries
Hugging Face Transformers, spaCy, NLTK, Stanford CoreNLP
OpenAI GPT API, LlamaIndex, LangChain
Flair, Gensim, AllenNLP
Mathematical Concepts
Tokenization, Vectorization
Sequence Modeling, Markov Models
Embedding Spaces, Similarity Metrics (Cosine, Jaccard)
Information Retrieval (BM25, TF-IDF)
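The two similarity metrics named above work on different representations: cosine similarity compares dense vectors (such as embeddings), while Jaccard compares sets (such as token sets). A minimal sketch, assuming inputs are plain Python sequences:

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def jaccard_similarity(a, b):
    """Size of the intersection divided by size of the union of two collections."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 1.0

# Token-level Jaccard on whitespace-split text (a deliberately naive tokenizer).
print(jaccard_similarity("the cat sat".split(), "the cat ran".split()))
```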
D. Computer Vision (CV)
Algorithms
Image Classification (CNNs: ResNet, EfficientNet, MobileNet)
Object Detection (YOLO, SSD, Faster R-CNN)
Semantic/Instance Segmentation (UNet, Mask R-CNN, DeepLab)
Image Generation (GANs, Diffusion Models, VQ-VAE)
Image Captioning, Visual Question Answering
Video Analytics (3D CNN, SlowFast Networks)
Self-supervised & Contrastive Learning for Vision
Programming Tools & Libraries
OpenCV, Pillow
PyTorch/TensorFlow/Keras (Vision APIs)
Detectron2, MMDetection, Albumentations
Nvidia DALI, DeepStream, TorchVision
OpenVINO, TensorRT
Mathematical Concepts
Linear Algebra (Convolutions, Pooling, Transformations)
Signal Processing (Fourier, Wavelet)
Image Statistics, Color Spaces
Geometric Transformations, Homographies
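A homography maps a 2D point through a 3x3 matrix in homogeneous coordinates, followed by division by the third component. The sketch below assumes the matrix is given as nested lists in row-major order; in practice a library such as OpenCV would estimate and apply it.

```python
def apply_homography(H, point):
    """Map a 2D point (x, y) through a 3x3 homography H (row-major nested lists)."""
    x, y = point
    xh = H[0][0] * x + H[0][1] * y + H[0][2]
    yh = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    if w == 0:
        raise ValueError("point maps to infinity under this homography")
    # Divide by w to return from homogeneous to Cartesian coordinates.
    return (xh / w, yh / w)

# A pure translation by (2, 3) is a special case of a homography.
translate = [[1, 0, 2], [0, 1, 3], [0, 0, 1]]
print(apply_homography(translate, (1, 1)))
```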
E. Reinforcement Learning (RL)
Algorithms
Q-Learning, SARSA
Deep Q-Networks (DQN), Double DQN
Policy Gradient and Actor-Critic (REINFORCE, A2C, PPO, DDPG, SAC, TD3)
Multi-Agent RL (MADDPG, QMIX)
Model-Based RL (MBPO, Dreamer)
Programming Tools & Libraries
Gymnasium (successor to OpenAI Gym), PettingZoo (Multi-Agent RL)
Stable Baselines3, RLlib (Ray), TF-Agents
Unity ML-Agents, Dopamine
Mathematical Concepts
Markov Decision Processes (MDP), Bellman Equations
Temporal Difference Learning
Exploration-Exploitation Trade-off
Game Theory, Nash Equilibrium
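Temporal difference learning and the Bellman equation meet in the tabular Q-learning update. A minimal sketch of one update step, assuming the Q-table is a dict of dicts keyed by state and action (a hypothetical layout chosen for readability):

```python
def q_learning_update(Q, state, action, reward, next_state, alpha=0.1, gamma=0.99):
    """Apply one Q-learning step in place and return the TD error.

    Q maps state -> {action: value}. A next_state missing from Q (or with no
    actions) is treated as terminal, with future value 0.
    """
    best_next = max(Q[next_state].values()) if Q.get(next_state) else 0.0
    td_target = reward + gamma * best_next       # Bellman optimality backup
    td_error = td_target - Q[state][action]      # temporal difference error
    Q[state][action] += alpha * td_error
    return td_error

Q = {
    "s0": {"left": 0.0, "right": 0.0},
    "s1": {"left": 0.0, "right": 1.0},
}
q_learning_update(Q, "s0", "right", reward=0.0, next_state="s1", alpha=0.5, gamma=0.9)
print(Q["s0"]["right"])
```

SARSA differs only in the target: it uses the value of the action actually taken in next_state rather than the maximum.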
F. Multi-Agent Systems
Algorithms
Decentralized RL, Multi-agent Communication Protocols
Distributed Constraint Optimization
Cooperative/Competitive Multi-Agent Planning
Swarm Intelligence (PSO, ACO)
Programming Tools & Libraries
PettingZoo, MAgent
Mesa, SUMO (for simulations)
Mathematical Concepts
Game Theory, Mechanism Design
Network Theory, Graph Theory
Agent-Based Modeling
G. Causal Inference & Explainability
Algorithms
Causal Graphical Models, Structural Equation Modeling
Counterfactual Reasoning
Granger Causality, Do-Calculus
SHAP, LIME (Explainable AI)
Programming Tools & Libraries
DoWhy, CausalML, EconML
SHAP, LIME, Alibi Explain
Mathematical Concepts
Bayesian Networks, Directed Acyclic Graphs (DAGs)
Propensity Score Matching
Statistical Testing (Hypothesis Tests, Permutation Tests)
H. Generative Modeling
Algorithms
Variational Autoencoders (VAE)
Generative Adversarial Networks (GAN, WGAN, StyleGAN)
Diffusion Models (Stable Diffusion, Denoising Diffusion)
Normalizing Flows
Programming Tools & Libraries
PyTorch, TensorFlow, Keras (for custom models)
Hugging Face Diffusers
NVIDIA StyleGAN repo
Mathematical Concepts
Latent Variable Models
KL-Divergence, ELBO
Probability Distributions, Sampling Theory
I. Robotics and Embodied AI
Algorithms
Simultaneous Localization and Mapping (SLAM)
Motion Planning (RRT, A*, D* Lite)
Control Algorithms (PID, LQR, MPC)
Sensor Fusion, Visual Servoing
Reinforcement Learning for Robotics
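Of the control algorithms above, PID is the simplest to write down. The following is a minimal discrete-time sketch, assuming a fixed gain set and a caller-supplied time step; it omits practical refinements such as integral anti-windup and derivative filtering.

```python
class PID:
    """Textbook discrete PID controller (no anti-windup, no derivative filter)."""

    def __init__(self, kp, ki, kd):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0
        self.prev_error = None

    def step(self, setpoint, measurement, dt):
        """Return the control output for one time step of length dt."""
        error = setpoint - measurement
        self.integral += error * dt
        # No derivative term on the first call, since there is no previous error.
        derivative = 0.0 if self.prev_error is None else (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative

controller = PID(kp=1.0, ki=1.0, kd=1.0)
print(controller.step(setpoint=1.0, measurement=0.0, dt=1.0))
print(controller.step(setpoint=1.0, measurement=0.5, dt=1.0))
```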
Programming Tools & Libraries
ROS (Robot Operating System), Gazebo
MoveIt!, Open3D, PyBullet, MuJoCo
Nvidia Isaac
Mathematical Concepts
Kinematics/Dynamics
Control Theory
Bayesian Filters (Kalman, Particle Filters)
Trajectory Optimization
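The Kalman filter listed above alternates a predict step with a measurement update. A minimal one-dimensional sketch, assuming a constant-state model (the true value does not change between steps) and scalar noise variances:

```python
def kalman_1d(measurements, process_var, meas_var, x0=0.0, p0=1.0):
    """Filter a sequence of scalar measurements; returns the estimate after each one.

    x is the state estimate, p its variance. process_var and meas_var are the
    assumed process and measurement noise variances.
    """
    x, p = x0, p0
    estimates = []
    for z in measurements:
        # Predict: the state stays the same, but uncertainty grows.
        p = p + process_var
        # Update: blend prediction and measurement using the Kalman gain.
        k = p / (p + meas_var)
        x = x + k * (z - x)
        p = (1 - k) * p
        estimates.append(x)
    return estimates

print(kalman_1d([1.0, 1.0], process_var=0.0, meas_var=1.0))
```

With equal prior and measurement variance, the first estimate lands halfway between the prior mean and the measurement; later measurements move it less as the estimate's variance shrinks.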
J. Other Key Areas
Optimization
Convex & Non-Convex Optimization
Linear Programming, Quadratic Programming, Integer Programming
Evolutionary Algorithms (Genetic, Differential Evolution)
Simulated Annealing
Large-scale Systems
Distributed ML (Horovod, Ray, PySpark MLlib)
Model Serving (TensorFlow Serving, TorchServe, Seldon Core, Triton Inference Server)
MLOps Tools (MLflow, Kubeflow, DVC, Airflow, Metaflow, TFX, Weights & Biases, Neptune)
Mathematical Concepts (Cross-cutting)
Information Theory, Entropy, Mutual Information
Probability & Statistics
Matrix Factorization, SVD
Time Series Analysis (ARIMA, Prophet, LSTM-based)
Graph Theory (for Graph ML)
Signal Processing
Differential Equations (Neural ODEs)
2. AI Aggregator Platform: Technical Reference for Product Development
A. Architecture & Use Case Considerations
Core Use Cases:
o Aggregating outputs from multiple AI/LLM/ML models
o Summarizing, ranking, or classifying multi-source AI results
o Federated or ensemble inference
o Dynamic model routing (model selection based on input/query)
o Unified interface for user queries and results presentation
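Dynamic model routing, in its simplest form, can be a keyword lookup that picks a backend per query. The sketch below is illustrative only: the model names and keyword sets are hypothetical, and a production router would more likely use a classifier or embedding similarity.

```python
def route_query(query, routes, default):
    """Return the name of the first model whose keywords overlap the query's words."""
    words = set(query.lower().split())
    for model_name, keywords in routes.items():
        if words & keywords:
            return model_name
    return default

# Hypothetical routing table: model names and keywords are placeholders.
ROUTES = {
    "code-model": {"python", "bug", "function"},
    "vision-model": {"image", "photo"},
}

print(route_query("Fix this Python bug", ROUTES, default="general-model"))
print(route_query("What is the capital of France", ROUTES, default="general-model"))
```

Routes are checked in insertion order, so more specific models should be listed first.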
High-Level Architecture:
o Modular, microservices-based backend
o API Gateway for unified access and authentication
o Orchestrator (workflow engine or scheduler)
o Scalable, async task management (Celery, RabbitMQ, Kafka)
o Model Registry and Metadata Store
o Frontend UI/dashboard for user interaction, results, analytics
B. Key Algorithms & Models
Aggregation & Summarization:
o Ensemble Voting (hard/soft) for model outputs
o Weighted Averaging, Rank Aggregation
o Text Summarization Models (Extractive: TextRank; Abstractive: BART, T5, GPT)
o Clustering (k-means, Agglomerative) for output grouping
o Outlier/Anomaly Detection (Isolation Forest, LOF)
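Hard and soft voting, listed above, can be sketched in a few lines. This assumes each model returns either a single label (hard voting) or a dict mapping labels to probabilities (soft voting); the optional weights turn soft voting into weighted averaging.

```python
from collections import Counter

def hard_vote(labels):
    """Majority label; ties go to the label that appeared first."""
    return Counter(labels).most_common(1)[0][0]

def soft_vote(prob_dicts, weights=None):
    """Weighted average of per-model class probabilities.

    Returns (winning_label, averaged_probabilities).
    """
    weights = weights or [1.0] * len(prob_dicts)
    totals = {}
    for probs, w in zip(prob_dicts, weights):
        for label, p in probs.items():
            totals[label] = totals.get(label, 0.0) + w * p
    norm = sum(weights)
    averaged = {label: s / norm for label, s in totals.items()}
    return max(averaged, key=averaged.get), averaged

outputs = [
    {"spam": 0.6, "ham": 0.4},
    {"spam": 0.3, "ham": 0.7},
    {"spam": 0.2, "ham": 0.8},
]
print(hard_vote(["spam", "ham", "ham"]))
print(soft_vote(outputs)[0])
```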
Classification & Prioritization:
o Meta-models (stacking, blending)
o Relevance Scoring (BM25, semantic similarity)
o Zero-shot/Multitask Classification (via LLMs like GPT-4, LLaMA-3, Gemini)
o Topic Modeling (LDA, BERTopic)
o Multi-criteria decision analysis (MCDA, AHP)
Ranking & Filtering:
o Cosine Similarity, Jaccard Index for vector-based filtering
o Learning to Rank (LTR: LambdaMART, RankNet)
o Rule-based and AI-powered query rewriting for context handling
C. Frameworks, APIs, & Model Integration Tools
APIs for Model Integration:
o OpenAI API, Hugging Face Inference Endpoints
o Google Vertex AI, AWS SageMaker Endpoints
o REST/gRPC API wrappers for custom models
o LangChain, LlamaIndex, Haystack for LLM orchestration
o FastAPI, Flask, Django for REST services
o Celery, Airflow, Prefect for task orchestration
Scheduling, Orchestration, and Inference:
o Apache Airflow, Prefect (for pipelines)
o KServe, Seldon Core, BentoML (for model serving)
o Kubernetes (for scaling and orchestration)
o Ray Serve (for distributed inference and scheduling)
o Redis, RabbitMQ, Kafka (for queues and pub/sub)
Observability & Monitoring:
o Prometheus, Grafana (metrics)
o Sentry (error tracking), ELK Stack (logs)
o MLflow, Weights & Biases (experiment tracking)
o OpenTelemetry (distributed tracing)
D. Frontend & Backend Stack Recommendations
Frontend:
o React, [Link], [Link] (SPA frameworks)
o Chakra UI, Material UI, Ant Design (UI libraries)
o [Link], [Link], Plotly (for data visualization)
o WebSocket support for real-time updates
o Auth0 or custom OAuth2 for authentication
Backend:
o Python (FastAPI, Flask, Django), [Link] (Express, NestJS)
o Model servers: TorchServe, TensorFlow Serving, Triton Inference Server
o Containerization: Docker, Podman
o Orchestration: Kubernetes, Docker Compose
o Database: PostgreSQL, MongoDB, Redis (caching), Elasticsearch (search)
o Task/Job Queues: Celery, Redis Queue, RabbitMQ
E. Mathematical and Data Processing Components
Similarity & Ranking:
o Cosine Similarity, Euclidean/Manhattan Distance
o KL-Divergence, Jaccard Similarity
o Semantic Embeddings (BERT, SBERT, Universal Sentence Encoder)
o Dimensionality Reduction (PCA, t-SNE, UMAP)
Summarization & Aggregation:
o Weighted averaging, rank aggregation formulas
o Clustering algorithms for grouping outputs
o Time series analysis for tracking updates or scheduling
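One common rank aggregation formula is the Borda count, in which each source awards points by position and the totals determine the combined order. The sketch below uses Borda as an example choice; the source does not name a specific formula.

```python
def borda_aggregate(rankings):
    """Combine ranked lists with a Borda count.

    An item at position i (0-based) in a list of n items earns n - i points.
    Items missing from a list earn nothing from it. Ties in total score are
    broken by the items' natural sort order.
    """
    scores = {}
    for ranking in rankings:
        n = len(ranking)
        for i, item in enumerate(ranking):
            scores[item] = scores.get(item, 0) + (n - i)
    return sorted(scores, key=lambda item: (-scores[item], item))

# Three models each rank the same three candidate answers.
print(borda_aggregate([["a", "b", "c"], ["b", "a", "c"], ["a", "c", "b"]]))
```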
Optimization & Model Selection:
o Bayesian Optimization, Grid/Random Search
o Reinforcement Learning for dynamic model selection (optional)
F. Deployment, Security, and Scalability Considerations
Deployment:
o CI/CD: GitHub Actions, GitLab CI, Jenkins
o Infrastructure as Code: Terraform, Ansible
o Multi-cloud deployment (AWS, GCP, Azure), or hybrid/on-prem
o GPU/TPU support for heavy inference
o Blue/Green or Canary deployments for updates
Security:
o OAuth2, OpenID Connect for user authentication
o API security (rate limiting, input validation, RBAC)
o Data encryption (in transit: TLS; at rest: AES-256)
o Regular vulnerability scanning (Snyk, Dependabot)
Scalability & Resilience:
o Auto-scaling with Kubernetes HPA
o Distributed caching (Redis, Memcached)
o Failover and retry strategies
o Load balancers (NGINX, HAProxy, AWS ALB)
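A retry strategy for flaky model endpoints is often implemented as exponential backoff. A minimal sketch, assuming the call is wrapped in a zero-argument function; the sleep function is injectable so the delays can be observed or skipped in tests.

```python
import time

def call_with_retry(fn, max_attempts=3, base_delay=0.1, sleep=time.sleep):
    """Call fn(), retrying on any exception with exponentially growing delays.

    Delays are base_delay, 2 * base_delay, 4 * base_delay, and so on.
    The last exception is re-raised once max_attempts is exhausted.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts:
                raise
            sleep(base_delay * 2 ** (attempt - 1))
```

In production, catching a narrower exception type (for example, timeouts only) and adding random jitter to the delay are usually advisable, so that many clients do not retry in lockstep.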
G. Real-World and Enterprise Integration
Enterprise Connectors:
o Microsoft Azure Cognitive Services, Google AI Hub, IBM Watson
o Salesforce Einstein, SAP AI Core
o Integration with enterprise identity providers (SAML, LDAP)
Compliance:
o GDPR/CCPA for data handling
o Audit logging, user consent management
o Data retention policies
Use Cases by AI Field
A. Machine Learning (ML)
1. Credit Risk Scoring in Banking
Description: Banks use ML to predict the probability of loan default.
Algorithms Used:
o Logistic Regression, Random Forests, XGBoost, Support Vector Machines (SVM)
Tools/Libraries:
o Scikit-learn, XGBoost, LightGBM
Mathematical Concepts:
o Probability Theory (risk prediction),
o Optimization (for parameter tuning),
o ROC-AUC (evaluation metric),
o Statistical Learning Theory
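ROC-AUC has a direct probabilistic reading: it is the chance that a randomly chosen defaulter receives a higher risk score than a randomly chosen non-defaulter. A minimal sketch of that pairwise definition, assuming binary labels (1 = default) and quadratic cost that is acceptable only for small samples:

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Two non-defaulters and two defaulters with model risk scores.
print(roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))  # 3 of 4 pairs ordered correctly: 0.75
```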
B. Deep Learning
2. Automated Medical Image Diagnosis
Description: Classifying X-ray, CT, or MRI images for disease detection.
Algorithms Used:
o Convolutional Neural Networks (CNNs: ResNet, EfficientNet),
o Transformers (Vision Transformer/ViT)
Tools/Libraries:
o PyTorch, TensorFlow, Keras, FastAI
Mathematical Concepts:
o Linear Algebra (convolutions),
o Backpropagation,
o Loss Functions (Cross-entropy),
o Regularization (Dropout, BatchNorm)
C. Natural Language Processing (NLP)
3. Customer Support Chatbots
Description: AI chatbots for answering customer queries in real time.
Algorithms Used:
o Transformer-based Models (GPT-4, BERT, T5),
o Sequence-to-Sequence (Seq2Seq)
Tools/Libraries:
o Hugging Face Transformers, spaCy, OpenAI API, LangChain
Mathematical Concepts:
o Tokenization,
o Embedding Spaces (Cosine Similarity for semantic matching),
o Cross-entropy Loss
D. Computer Vision (CV)
4. Real-Time Object Detection for Autonomous Vehicles
Description: Cars detect pedestrians, traffic signs, and obstacles.
Algorithms Used:
o YOLO, SSD, Faster R-CNN (Object Detection),
o Instance Segmentation (Mask R-CNN)
Tools/Libraries:
o OpenCV, Detectron2, TensorFlow/Keras, PyTorch
Mathematical Concepts:
o Convolution Operations,
o Non-Maximum Suppression,
o Geometric Transformations
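Non-Maximum Suppression, listed above, removes duplicate detections of the same object by keeping the highest-scoring box and discarding boxes that overlap it too much. A minimal greedy sketch, assuming boxes are (x1, y1, x2, y2) tuples and a single object class:

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def non_max_suppression(boxes, scores, iou_threshold=0.5):
    """Return indices of kept boxes, in order of decreasing score."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Keep a box only if it does not overlap any already-kept box too much.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep

boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(non_max_suppression(boxes, scores))  # the second box overlaps the first and is dropped
```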
E. Reinforcement Learning (RL)
5. Game-Playing AI (e.g., AlphaGo, OpenAI Five)
Description: RL agents learn to play games at superhuman levels.
Algorithms Used:
o Deep Q-Networks (DQN),
o Policy Gradients (A2C, PPO),
o Monte Carlo Tree Search (AlphaGo)
Tools/Libraries:
o Gymnasium (successor to OpenAI Gym), Stable Baselines3, RLlib
Mathematical Concepts:
o Markov Decision Processes (MDP),
o Bellman Equations,
o Temporal Difference Learning
F. Multi-Agent Systems
6. Smart Grid Optimization
Description: Decentralized agents (homes, businesses) optimize power consumption.
Algorithms Used:
o Multi-agent RL (MADDPG),
o Distributed Constraint Optimization,
o Swarm Intelligence (Particle Swarm Optimization)
Tools/Libraries:
o PettingZoo, Ray RLlib, Mesa (simulation)
Mathematical Concepts:
o Game Theory,
o Network Theory,
o Agent-Based Modeling
G. Causal Inference & Explainability
7. Healthcare Policy Impact Analysis
Description: Measuring the causal effect of an intervention (e.g., new drug) on patient outcomes.
Algorithms Used:
o Causal Graphical Models,
o Propensity Score Matching,
o SHAP/LIME for explanation
Tools/Libraries:
o DoWhy, CausalML, SHAP, LIME
Mathematical Concepts:
o Bayesian Networks,
o Directed Acyclic Graphs (DAGs),
o Hypothesis Testing
H. Generative Modeling
8. AI Art & Synthetic Media Generation
Description: Creating novel images, videos, or music with AI.
Algorithms Used:
o GANs (StyleGAN, BigGAN),
o Diffusion Models (Stable Diffusion)
Tools/Libraries:
o PyTorch, TensorFlow, Hugging Face Diffusers
Mathematical Concepts:
o Latent Variable Models,
o KL-Divergence,
o Sampling Theory
I. Robotics and Embodied AI
9. Autonomous Warehouse Robots
Description: Robots navigating and manipulating objects for warehouse automation.
Algorithms Used:
o SLAM,
o Path Planning (A*, RRT),
o Control Algorithms (PID, MPC),
o Reinforcement Learning for fine-tuning
Tools/Libraries:
o ROS, MoveIt!, PyBullet, Gazebo
Mathematical Concepts:
o Kinematics/Dynamics,
o Control Theory,
o Bayesian Filters (Kalman Filter)
J. Optimization & Large-Scale Systems
10. Large-Scale Recommendation Systems (e.g., Netflix, Amazon)
Description: Personalizing content recommendations to millions of users.
Algorithms Used:
o Matrix Factorization (SVD),
o Deep Learning (Wide & Deep, Neural Collaborative Filtering),
o Bandit Algorithms for exploration
Tools/Libraries:
o PySpark MLlib, TensorFlow, Scikit-learn, Airflow (pipeline), MLflow (tracking)
Mathematical Concepts:
o Linear Algebra (Matrix Factorization),
o Probability (for user-item interaction modeling),
o Convex Optimization
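The bandit algorithms mentioned under Algorithms Used balance showing items known to perform well against trying items with little data. The sketch below uses UCB1 as one example of such an algorithm (the source does not name a specific one), assuming rewards in [0, 1] such as click or no click.

```python
import math

def ucb1_select(counts, rewards):
    """Pick an arm (e.g., a candidate item) with the UCB1 rule.

    counts[i] is how often arm i was shown; rewards[i] is its summed reward.
    Any arm never shown is tried first.
    """
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    total = sum(counts)

    def score(arm):
        mean = rewards[arm] / counts[arm]
        bonus = math.sqrt(2 * math.log(total) / counts[arm])  # exploration bonus
        return mean + bonus

    return max(range(len(counts)), key=score)

# Arm 0 has a good track record, but arm 1 has barely been tried,
# so its exploration bonus dominates and it is selected.
print(ucb1_select(counts=[100, 1], rewards=[60.0, 0.0]))
```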