 Version 0.20 (under development)
 ================================

-As well as a plethora of new features and enhancements, this release is the
-first to be accompanied by a :ref:`glossary` developed by `Joel Nothman`_. The
-glossary is a reference resource to help users and contributors become familiar
-with the terminology and conventions used in Scikit-learn.
+This release packs in a mountain of bug fixes, features and enhancements for
+the Scikit-learn library, and improvements to the documentation and examples.
+Thanks to our many contributors!
+
+Highlights
+----------
+
+We have tried to improve our support for common data-science use-cases
+including missing values, categorical variables, heterogeneous data, and
+features/targets with unusual distributions.
+
+Missing values in features, represented by NaNs, are now accepted in
+column-wise preprocessing such as scalers. Each feature is fitted disregarding
+NaNs, and data containing NaNs can be transformed. The new :mod:`impute`
+module provides estimators for learning despite missing data.
+
+:class:`~compose.ColumnTransformer` handles the case where different features
+or columns of a pandas.DataFrame need different preprocessing.
+String or pandas Categorical columns can now be encoded with
+:class:`~preprocessing.OneHotEncoder` or
+:class:`~preprocessing.OrdinalEncoder`.
+
+This release is also the first to be accompanied by a :ref:`glossary` developed
+by `Joel Nothman`_. The glossary is a reference resource to help users and
+contributors become familiar with the terminology and conventions used in
+Scikit-learn.

 Changed models
 --------------
@@ -254,8 +276,8 @@ Model evaluation and meta-estimators
   :issue:`9304` by :user:`Breno Freitas <brenolf>`.

 - Add `return_estimator` parameter in :func:`model_selection.cross_validate` to
-  return estimators fitted on each split. :issue:`9686` by :user:`Aurélien Bellet
-  <bellet>`.
+  return estimators fitted on each split.
+  :issue:`9686` by :user:`Aurélien Bellet <bellet>`.

 - New ``refit_time_`` attribute will be stored in
   :class:`model_selection.GridSearchCV` and
@@ -273,13 +295,15 @@ Decomposition and manifold learning
 Metrics

 - :func:`metrics.roc_auc_score` now supports binary ``y_true`` other than
-  ``{0, 1}`` or ``{-1, 1}``. :issue:`9828` by :user:`Hanmin Qin <qinhanmin2014>`.
+  ``{0, 1}`` or ``{-1, 1}``.
+  :issue:`9828` by :user:`Hanmin Qin <qinhanmin2014>`.

-- :func:`metrics.label_ranking_average_precision_score` now supports vector ``sample_weight``.
+- :func:`metrics.label_ranking_average_precision_score` now supports vector
+  ``sample_weight``.
   :issue:`10845` by :user:`Jose Perez-Parras Toledano <jopepato>`.

-- Add ``dense_output`` parameter to :func:`metrics.pairwise.linear_kernel`. When
-  False and both inputs are sparse, will return a sparse matrix.
+- Add ``dense_output`` parameter to :func:`metrics.pairwise.linear_kernel`.
+  When False and both inputs are sparse, will return a sparse matrix.
   :issue:`10999` by :user:`Taylor G Smith <tgsmith61591>`.

 - :func:`metrics.cluster.silhouette_score` and
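As a rough illustration of the highlights added in the diff above, the sketch below scales a numeric column that contains a NaN and one-hot encodes a string column through ``ColumnTransformer``. It is not part of the changelog itself. The toy DataFrame, its column names (``age``, ``city``) and the step names (``num``, ``cat``) are assumptions made up for this example, and ``sparse_threshold=0`` is set only so the result is a dense array that is easy to inspect.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Toy data (invented for illustration): a numeric column with one missing
# value and a string-valued categorical column.
df = pd.DataFrame({
    "age": [20.0, np.nan, 40.0],
    "city": ["Paris", "Tokyo", "Paris"],
})

# The scaler is fitted disregarding the NaN; on transform the NaN is kept.
scaler = StandardScaler()
scaled_age = scaler.fit_transform(df[["age"]])

# Different columns get different preprocessing. The string column is
# encoded directly by OneHotEncoder.
preprocess = ColumnTransformer(
    [
        ("num", StandardScaler(), ["age"]),
        ("cat", OneHotEncoder(), ["city"]),
    ],
    sparse_threshold=0,  # always return a dense array
)
features = preprocess.fit_transform(df)

print(scaled_age.ravel())
print(features)
```

The missing value stays NaN after scaling, so a downstream estimator that cannot handle NaNs would still need an imputation step, for example from the ``impute`` module mentioned in the highlights.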