Skip to content

Conversation

@n-poulsen
Copy link
Contributor

@n-poulsen n-poulsen commented Nov 25, 2024

This pull requests adds the implementation for RTMPose: Real-Time Multi-Person Pose Estimation.

Full changes:

  • New mixin for backbones with pretrained weights on HuggingFace : HuggingFaceWeightsMixin
  • New backbone: CSPNeXt
  • New head: RTMCCHead
  • New target generator: SimCCGenerator
  • New predictor: SimCCPredictor
  • New weight initializer: Rtmpose
  • New RandomBBoxTransform augmentation for top-down models (jitters ground truth bounding boxes)
  • Configs for CSPNeXt backbones: cspnext_s, cspnext_m, cspnext_x
  • Configs for RTMPose models: rtmpose_s, rtmpose_m, rtmpose_x
  • Updated way to scheduler configurations, to allow for the use of SequentialLR (to match the RTMPose training schedulers)
  • Added documentation: using a SequentialLR scheduler for warm-up epochs
  • Updated the top-down crop to function with no-square crops (and re-implemented with numpy)
    • Added top_down_crop_size: (crop_w, crop_h) and top_down_crop_margin in the pytorch_config.yaml file for an easy way to control the size of the crops for top-down models.
  • Bug fix: bbox_from_keypoints when there's more than 1 individual (bug had no impact as method was always called in a way it wouldn't fail)

DEMO

ski-dlc.mov

@MMathisLab MMathisLab requested a review from AlexEMG December 5, 2024 10:58
Copy link
Contributor

@maximpavliv maximpavliv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great!!

@n-poulsen n-poulsen merged commit 1d26325 into pytorch_dlc Dec 20, 2024
1 check passed
@n-poulsen n-poulsen deleted the niels/rtmpose branch December 20, 2024 15:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants