Documentation for CountVectorizer suggests strange bounds for max_df param #18803

@nschmeller

Describe the issue linked to the documentation

In the CountVectorizer documentation, the description of the stop_words parameter suggests using max_df as a replacement for a stop word list, and says max_df can be set to a value in the range [0.7, 1.0).

However, the documentation for the max_df parameter itself allows any float from 0 to 1.

This seems a little strange! Is the 0.7 lower bound suggested to stop the user from doing crazy things with max_df? Or is it outdated?
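To make the question concrete, here is a minimal sketch of what a float max_df threshold does: terms whose document frequency is strictly higher than max_df times the number of documents are dropped. This is not scikit-learn's implementation. The helper name filter_by_max_df, the whitespace tokenization, and the toy corpus are all assumptions made for illustration.

```python
from collections import Counter


def filter_by_max_df(docs, max_df):
    """Hypothetical helper (not part of scikit-learn).

    Returns the sorted vocabulary kept after dropping terms whose document
    frequency is strictly higher than max_df * number of documents.
    """
    n_docs = len(docs)
    # For each term, count the number of documents that contain it.
    # Assumption: simple lowercase whitespace tokenization.
    doc_freq = Counter(term for doc in docs for term in set(doc.lower().split()))
    max_doc_count = max_df * n_docs
    return sorted(term for term, df in doc_freq.items() if df <= max_doc_count)


# Toy corpus, made up for this example.
docs = [
    "the cat sat",
    "the dog ran",
    "the cat ran",
    "the bird sang",
]

print(filter_by_max_df(docs, 1.0))  # nothing is filtered
print(filter_by_max_df(docs, 0.7))  # drops "the", which appears in every document
print(filter_by_max_df(docs, 0.3))  # also drops "cat" and "ran" (2 of 4 documents)
```

Under these assumptions, a value below 0.7 is still valid. It just filters more aggressively, removing terms that are merely common rather than near-universal.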

Suggest a potential alternative/fix

Should the two descriptions be made consistent?
