Supervised Learning Models:
---------------------------
1. Linear Regression Regression
2. Logistic Regression Classification
3. Decision Tree Classification/Regression
4. Random Forest Classification/Regression
5. Support Vector Machine Classification
6. K-Nearest Neighbors Classification/Regression
7. Gradient Boosting Classification/Regression
8. XGBoost, LightGBM Classification/Regression
Unsupervised Learning Models:
-----------------------------
1. K-Means Clustering Clustering
2. DBSCAN Clustering
3. Hierarchical Clustering Clustering
4. PCA Dimensionality Reduction
5. t-SNE Visualization
Model Evaluation Metrics:
--------------------------
Classification:
- Accuracy
- Precision, Recall, F1 Score
- ROC AUC
- Confusion Matrix
Regression:
- MAE (Mean Absolute Error)
- MSE (Mean Squared Error)
- RMSE (Root Mean Squared Error)
- R² Score
Cross Validation Techniques:
-----------------------------
- Hold-Out Validation
- K-Fold Cross Validation
- Stratified K-Fold
- Leave-One-Out (LOO)
- Time Series Split (for time series)
Model Selection & Optimization:
-------------------------------
- GridSearchCV
- RandomizedSearchCV
- Bayesian Optimization
- Cross_val_score (for quick CV evaluation)
Best Practices:
---------------
- Feature Scaling: StandardScaler / MinMaxScaler
- Handling Imbalance: SMOTE, Class Weights
- Feature Selection: RFE, SelectKBest
- Overfitting Control: Regularization (L1/L2), Pruning, Early Stopping