Skip to content

Conversation

@shchur
Copy link
Contributor

@shchur shchur commented Aug 27, 2025

Issue #, if available:

Description of changes:

  • Fix circular dependencies caused by __version__ stored in __init__.py
  • Make known_dynamic_columns, past_dynamic_columns, static_columns explicit properties of the Task instead of automatically inferring them from data.
  • Rename target_column -> target to make the name more consistent with str | list[str] typing.
  • Rename Task.target_columns_list -> target_columns.

To do

  • Update tutorials in docs/
  • Handle deprecated field names

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@shchur shchur requested a review from abdulfatir August 27, 2025 09:01
Copy link
Collaborator

@abdulfatir abdulfatir left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @shchur! This will make the task definitions more transparent and easy to check which columns are actually used by a model.

)
future_known = future_data.remove_columns(self.target_columns_list + self.past_dynamic_columns)
test = future_data.select_columns([self.id_column, self.timestamp_column] + self.target_columns_list)
future_known = future_data.remove_columns(self.target_columns + self.past_dynamic_columns)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we specifically only select known future columns (+ id and timestamp) here after removing past and target_columns?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It looks like all other columns are removed during data loading, so maybe this is not needed?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have updated load_full_dataset to not remove any columns, and to only perform column removal before the split. This makes it easy to check what columns are available in the dataset using public API.

@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@shchur shchur merged commit 2592a84 into pre-v1.0.0 Aug 27, 2025
@shchur shchur deleted the explicit-columns-names branch August 27, 2025 13:50
shchur added a commit that referenced this pull request Sep 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants