{"@attributes":{"version":"2.0"},"channel":{"title":"PyVideo.org - model evaluation","link":"https:\/\/pyvideo.org\/","description":{},"lastBuildDate":"Fri, 23 Oct 2015 00:00:00 +0000","item":[{"title":"How to evaluate a classifier in scikit-learn","link":"https:\/\/pyvideo.org\/data-school\/scikit-learn-09-evaluating-classification-models.html","description":"<h3>Description<\/h3><p>In this video, you'll learn how to properly evaluate a classification model using a variety of common tools and metrics, as well as how to adjust the performance of a classifier to best match your business objectives. I'll start by demonstrating the weaknesses of classification accuracy as an evaluation metric. I'll then discuss the confusion matrix, the ROC curve and AUC, and metrics such as sensitivity, specificity, and precision. By the end of the video, you will have a solid foundation for intelligently evaluating your own classification model.<\/p>\n<p>This is the ninth video in the series, <a class=\"reference external\" href=\"http:\/\/www.dataschool.io\/machine-learning-with-scikit-learn\/\">Introduction to machine learning with scikit-learn<\/a>. The notebook and resources shown in the video are available on <a class=\"reference external\" href=\"https:\/\/github.com\/justmarkham\/scikit-learn-videos\">GitHub<\/a>.<\/p>\n","pubDate":"Fri, 23 Oct 2015 00:00:00 +0000","guid":"tag:pyvideo.org,2015-10-23:\/data-school\/scikit-learn-09-evaluating-classification-models.html","category":["Data School","machine learning","data science","scikit-learn","tutorial","Data School","model evaluation","classification","confusion matrix","ROC curve","AUC"]},{"title":"How to find the best model parameters in scikit-learn","link":"https:\/\/pyvideo.org\/data-school\/scikit-learn-08-parameter-tuning-with-grid-search.html","description":"<h3>Description<\/h3><p>In this video, you'll learn how to efficiently search for the optimal tuning parameters (or &quot;hyperparameters&quot;) for your machine learning model in order to maximize its performance. I'll start by demonstrating an exhaustive &quot;grid search&quot; process using scikit-learn's GridSearchCV class, and then I'll compare it with RandomizedSearchCV, which can often achieve similar results in far less time.<\/p>\n<p>This is the eighth video in the series, <a class=\"reference external\" href=\"http:\/\/www.dataschool.io\/machine-learning-with-scikit-learn\/\">Introduction to machine learning with scikit-learn<\/a>. The notebook and resources shown in the video are available on <a class=\"reference external\" href=\"https:\/\/github.com\/justmarkham\/scikit-learn-videos\">GitHub<\/a>.<\/p>\n","pubDate":"Wed, 15 Jul 2015 00:00:00 +0000","guid":"tag:pyvideo.org,2015-07-15:\/data-school\/scikit-learn-08-parameter-tuning-with-grid-search.html","category":["Data School","machine learning","data science","scikit-learn","tutorial","Data School","cross-validation","model evaluation","parameter tuning","grid search"]},{"title":"Selecting the best model in scikit-learn using cross-validation","link":"https:\/\/pyvideo.org\/data-school\/scikit-learn-07-model-evaluation-with-cross-validation.html","description":"<h3>Description<\/h3><p>In this video, we'll learn about K-fold cross-validation and how it can be used for selecting optimal tuning parameters, choosing between models, and selecting features. We'll compare cross-validation with the train\/test split procedure, and we'll also discuss some variations of cross-validation that can result in more accurate estimates of model performance.<\/p>\n<p>This is the seventh video in the series, <a class=\"reference external\" href=\"http:\/\/www.dataschool.io\/machine-learning-with-scikit-learn\/\">Introduction to machine learning with scikit-learn<\/a>. The notebook and resources shown in the video are available on <a class=\"reference external\" href=\"https:\/\/github.com\/justmarkham\/scikit-learn-videos\">GitHub<\/a>.<\/p>\n","pubDate":"Sun, 28 Jun 2015 00:00:00 +0000","guid":"tag:pyvideo.org,2015-06-28:\/data-school\/scikit-learn-07-model-evaluation-with-cross-validation.html","category":["Data School","machine learning","data science","scikit-learn","tutorial","Data School","cross-validation","model evaluation","feature selection","parameter tuning"]},{"title":"Data science in Python: pandas, seaborn, scikit-learn","link":"https:\/\/pyvideo.org\/data-school\/scikit-learn-06-data-science-pipeline.html","description":"<h3>Description<\/h3><p>In this video, we'll cover the data science pipeline from data ingestion (with pandas) to data visualization (with seaborn) to machine learning (with scikit-learn). We'll learn how to train and interpret a linear regression model, and then compare three possible evaluation metrics for regression problems. Finally, we'll apply the train\/test split procedure to decide which features to include in our model.<\/p>\n<p>This is the sixth video in the series, <a class=\"reference external\" href=\"http:\/\/www.dataschool.io\/machine-learning-with-scikit-learn\/\">Introduction to machine learning with scikit-learn<\/a>. The notebook and resources shown in the video are available on <a class=\"reference external\" href=\"https:\/\/github.com\/justmarkham\/scikit-learn-videos\">GitHub<\/a>.<\/p>\n","pubDate":"Thu, 28 May 2015 00:00:00 +0000","guid":"tag:pyvideo.org,2015-05-28:\/data-school\/scikit-learn-06-data-science-pipeline.html","category":["Data School","machine learning","data science","scikit-learn","tutorial","Data School","pandas","seaborn","linear regression","model evaluation","feature selection","visualization"]},{"title":"Comparing machine learning models in scikit-learn","link":"https:\/\/pyvideo.org\/data-school\/scikit-learn-05-comparing-machine-learning-models.html","description":"<h3>Description<\/h3><p>We've learned how to train different machine learning models and make predictions, but how do we actually choose which model is &quot;best&quot;? We'll cover the train\/test split process for model evaluation, which allows you to avoid &quot;overfitting&quot; by estimating how well a model is likely to perform on new data. We'll use that same process to locate optimal tuning parameters for a KNN model, and then we'll re-train our model so that it's ready to make real predictions.<\/p>\n<p>This is the fifth video in the series, <a class=\"reference external\" href=\"http:\/\/www.dataschool.io\/machine-learning-with-scikit-learn\/\">Introduction to machine learning with scikit-learn<\/a>. The notebook and resources shown in the video are available on <a class=\"reference external\" href=\"https:\/\/github.com\/justmarkham\/scikit-learn-videos\">GitHub<\/a>.<\/p>\n","pubDate":"Thu, 14 May 2015 00:00:00 +0000","guid":"tag:pyvideo.org,2015-05-14:\/data-school\/scikit-learn-05-comparing-machine-learning-models.html","category":["Data School","machine learning","data science","scikit-learn","tutorial","Data School","model evaluation","overfitting"]}]}}