Skip to content

Conversation

@KKcorps
Copy link
Contributor

@KKcorps KKcorps commented Sep 14, 2022

Following new configurations are being added to csv reader :

  • skipHeader - Set it to true to skip header line while processing CSV data.

  • ignoreEmptyLines - Set it to true to ignore empty lines, otherwise send them as empty string and let the processing logic take care of it.

  • ignoreSurroundingSpaces - Set to true to ignore the spaces around values.

  • recordSeparator - Character being used to distinguising between multiple lines.

  • quoteCharacter - single character that is being used for quotes in CSV files

  • quoteMode -

    /**
     * Quotes all fields.
     */
    ALL,

    /**
     * Quotes fields which contain special characters such as a delimiter, quote character or any of the characters in
     * line separator.
     */
    MINIMAL,

    /**
     * Quotes all non-numeric fields.
     */
    NON_NUMERIC,

    /**
     * Never quotes fields. When the delimiter occurs in data, it is preceded by the current escape character. If the
     * escape character is not set, printing will throw an exception if any characters that require escaping are
     * encountered.
     */
    NONE

public static final String NULL_STRING_VALUE = "nullStringValue";
public static final String SKIP_HEADER = "skipHeader";
public static final String IGNORE_EMPTY_LINES = "ignoreEmptyLines";
public static final String IGNORE_SURROUNDING_SPACE = "ignoreSurroundingSpace";
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
public static final String IGNORE_SURROUNDING_SPACE = "ignoreSurroundingSpace";
public static final String IGNORE_SURROUNDING_SPACES = "ignoreSurroundingSpaces";

@Jackie-Jiang Jackie-Jiang added release-notes Referenced by PRs that need attention when compiling the next release notes Configuration Config changes (addition/deletion/change in behavior) labels Sep 15, 2022
@Jackie-Jiang
Copy link
Contributor

Can you please update the PR with the new added config keys, and the pinot doc?

@KKcorps KKcorps merged commit 3633495 into apache:master Sep 16, 2022
61yao pushed a commit to 61yao/pinot that referenced this pull request Oct 3, 2022
* Add missing properties in CSV plugin

* Fix linting

Co-authored-by: Kartik Khare <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Configuration Config changes (addition/deletion/change in behavior) release-notes Referenced by PRs that need attention when compiling the next release notes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants