Skip to content

Commit e555893

Browse files
committed
DOC highlights in what's new
And other minor changes
1 parent 6ce497c commit e555893

File tree

1 file changed

+34
-10
lines changed

1 file changed

+34
-10
lines changed

doc/whats_new/v0.20.rst

Lines changed: 34 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -7,10 +7,32 @@
77
Version 0.20 (under development)
88
================================
99

10-
As well as a plethora of new features and enhancements, this release is the
11-
first to be accompanied by a :ref:`glossary` developed by `Joel Nothman`_. The
12-
glossary is a reference resource to help users and contributors become familiar
13-
with the terminology and conventions used in Scikit-learn.
10+
This release packs in a mountain of bug fixes, features and enhancements for
11+
the Scikit-learn library, and improvements to the documentation and examples.
12+
Thanks to our many contributors!
13+
14+
Highlights
15+
----------
16+
17+
We have tried to improve our support for common data-science use-cases
18+
including missing values, categorical variables, heterogeneous data, and
19+
features/targets with unusual distributions.
20+
21+
Missing values in features, represented by NaNs, are now accepted in
22+
column-wise preprocessing such as scalers. Each feature is fitted disregarding
23+
NaNs, and data containing NaNs can be transformed. The new :mod:`impute`
24+
module provides estimators for learning despite missing data.
25+
26+
:class:`~compose.ColumnTransformer` handles the case where different features
27+
or columns of a pandas.DataFrame need different preprocessing.
28+
String or pandas Categorical columns can now be encoded with
29+
:class:`~preprocessing.OneHotEncoder` or
30+
:class:`~preprocessing.OrdinalEncoder`.
31+
32+
This release is also the first to be accompanied by a :ref:`glossary` developed
33+
by `Joel Nothman`_. The glossary is a reference resource to help users and
34+
contributors become familiar with the terminology and conventions used in
35+
Scikit-learn.
1436

1537
Changed models
1638
--------------
@@ -254,8 +276,8 @@ Model evaluation and meta-estimators
254276
:issue:`9304` by :user:`Breno Freitas <brenolf>`.
255277

256278
- Add `return_estimator` parameter in :func:`model_selection.cross_validate` to
257-
return estimators fitted on each split. :issue:`9686` by :user:`Aurélien Bellet
258-
<bellet>`.
279+
return estimators fitted on each split.
280+
:issue:`9686` by :user:`Aurélien Bellet <bellet>`.
259281

260282
- New ``refit_time_`` attribute will be stored in
261283
:class:`model_selection.GridSearchCV` and
@@ -273,13 +295,15 @@ Decomposition and manifold learning
273295
Metrics
274296

275297
- :func:`metrics.roc_auc_score` now supports binary ``y_true`` other than
276-
``{0, 1}`` or ``{-1, 1}``. :issue:`9828` by :user:`Hanmin Qin <qinhanmin2014>`.
298+
``{0, 1}`` or ``{-1, 1}``.
299+
:issue:`9828` by :user:`Hanmin Qin <qinhanmin2014>`.
277300

278-
- :func:`metrics.label_ranking_average_precision_score` now supports vector ``sample_weight``.
301+
- :func:`metrics.label_ranking_average_precision_score` now supports vector
302+
``sample_weight``.
279303
:issue:`10845` by :user:`Jose Perez-Parras Toledano <jopepato>`.
280304

281-
- Add ``dense_output`` parameter to :func:`metrics.pairwise.linear_kernel`. When
282-
False and both inputs are sparse, will return a sparse matrix.
305+
- Add ``dense_output`` parameter to :func:`metrics.pairwise.linear_kernel`.
306+
When False and both inputs are sparse, will return a sparse matrix.
283307
:issue:`10999` by :user:`Taylor G Smith <tgsmith61591>`.
284308

285309
- :func:`metrics.cluster.silhouette_score` and

0 commit comments

Comments
 (0)