[C++][Python] Support for pandas Categoricals with Intervals #30119

@asfimport

Description

Trying to write a pandas categorical created from pd.cut fails with the following error:

df_test = pd.DataFrame({"cat": pd.cut([1, 2, 3, 4, 5], bins=[2, 4])})
df_test.to_parquet("test.parquet")
File "pyarrow/error.pxi", line 120, in pyarrow.lib.check_status
pyarrow.lib.ArrowNotImplementedError: Unsupported cast from dictionary<values=extension<pandas.interval<ArrowIntervalType>>, indices=int8, ordered=1> to extension<pandas.interval<ArrowIntervalType>> (no available cast function for target type)

The problem seems to be the interval dtype of the categories. Specifying the labels keyword in pd.cut solved the problem.
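A minimal sketch of the workaround described above, assuming only pandas is installed. The label text "(2, 4]" is a hypothetical choice made for illustration; any list with one label per bin should do. The sketch only compares the category dtypes of the two frames and does not write a Parquet file.

```python
import pandas as pd

values = [1, 2, 3, 4, 5]
bins = [2, 4]

# Default behaviour: the categories are pandas Interval objects.
# This is the case that the report says fails in to_parquet.
df_interval = pd.DataFrame({"cat": pd.cut(values, bins=bins)})

# Workaround from the report: pass explicit labels, so the categories
# are plain strings instead of intervals.
labels = ["(2, 4]"]  # hypothetical label text, one entry per bin
df_labeled = pd.DataFrame({"cat": pd.cut(values, bins=bins, labels=labels)})

# Inspect the dtype of the categories in each frame.
print(df_interval["cat"].cat.categories.dtype)
print(df_labeled["cat"].cat.categories.dtype)
```

According to the report, a frame built like df_labeled can then be written with to_parquet, while df_interval raises ArrowNotImplementedError. Whether that still holds may depend on the installed pandas and pyarrow versions.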

Reporter: Alberto Torres

Note: This issue was originally created as ARROW-14569. Please see the migration documentation for further details.
