The pipeline detects the object in the image and creates a mask using isaac_ros_rtdetr.
This mask is used by FoundationPose to start iterating on the pose estimation.
A final pose estimation is provided by FoundationPose.
AFAIK, the models used in step 1 are only valid for objects that fall into certain categories (e.g. those covered by SyntheticaDETR or the YCB set). Also, the API indicates that the pose estimation node subscribes to the /segmentation topic, which must be published by the object detection nodes.
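To make sure I'm reading the wiring correctly, below is a stripped-down sketch of how I understand the tutorial launch files connect the detection output to FoundationPose. The plugin names, parameter names, and topic remappings are my best guess from the documentation, not a verified configuration, so please correct me if any of them are wrong.

```python
# Stripped-down sketch of the detection -> mask -> pose wiring. Plugin,
# parameter and topic names below are my best guess from the docs, not a
# verified configuration.
from launch import LaunchDescription
from launch_ros.actions import ComposableNodeContainer
from launch_ros.descriptions import ComposableNode


def generate_launch_description():
    # Step 1 output (isaac_ros_rtdetr) is a 2D bounding-box detection; this
    # node rasterizes it into the binary segmentation mask FoundationPose needs.
    detection2d_to_mask = ComposableNode(
        package='isaac_ros_foundationpose',
        plugin='nvidia::isaac_ros::foundationpose::Detection2DToMask',
        parameters=[{'mask_width': 640, 'mask_height': 480}],     # guessed parameter names
        remappings=[('detection2_d_array', 'detections_output'),  # RT-DETR output (guessed)
                    ('segmentation', 'segmentation')])            # mask consumed below

    # FoundationPose: takes the CAD mesh/texture of the object plus RGB, depth
    # and the segmentation mask, and publishes the estimated pose.
    foundationpose = ComposableNode(
        package='isaac_ros_foundationpose',
        plugin='nvidia::isaac_ros::foundationpose::FoundationPoseNode',  # guessed plugin name
        parameters=[{
            'mesh_file_path': '/workspace/models/my_object/my_object.obj',  # the CAD data
            'texture_path': '/workspace/models/my_object/texture.png',
        }])

    return LaunchDescription([
        ComposableNodeContainer(
            name='foundationpose_container',
            namespace='',
            package='rclcpp_components',
            executable='component_container_mt',
            composable_node_descriptions=[detection2d_to_mask, foundationpose]),
    ])
```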
With the above in mind, I'd appreciate it if someone could clarify the following questions:
How can I use isaac_ros_foundationpose on a novel, custom object that does not fall into any category of the DetectNet, RT-DETR or YOLOv8 object detection models?
Is it possible to use only CAD data, without any retraining, for this custom, novel object? The documentation states: "FoundationPose is designed to perform pose estimation on previously unseen objects without model retraining."
Yes, a 2D object detection model has to be trained for 3D pose estimation with FoundationPose to work. isaac_ros_foundationpose expects a segmentation mask as one of its inputs. In our tutorials we use SyntheticaDETR for 2D object detection and convert that detection into a segmentation mask using nvidia::isaac_ros::foundationpose::Detection2DToMask.
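Conceptually, that conversion just rasterizes the detected bounding box into a binary mask image at the camera resolution. A rough standalone sketch of the idea (not the actual node implementation, and with the detection fields simplified to plain numbers) would be:

```python
import numpy as np


def bbox_to_mask(center_x, center_y, size_x, size_y, width, height):
    """Rasterize one 2D bounding box (pixel units) into a binary mask image."""
    mask = np.zeros((height, width), dtype=np.uint8)
    x0 = max(int(center_x - size_x / 2), 0)
    y0 = max(int(center_y - size_y / 2), 0)
    x1 = min(int(center_x + size_x / 2), width)
    y1 = min(int(center_y + size_y / 2), height)
    mask[y0:y1, x0:x1] = 255  # pixels inside the box count as "object"
    return mask


# Example: a 200x100 px detection centered at (320, 240) in a 640x480 image.
mask = bbox_to_mask(320, 240, 200, 100, 640, 480)
print(mask.shape, int(mask.sum() / 255))  # (480, 640) 20000
```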
You will have to train a 2D object detection model. Someone else from our team can get back to you on whether/how to do that with Isaac ROS.
The CAD model and a 2D object detection model/segmentation mask are both required. "Without model retraining" refers to not retraining the 3D pose estimation model, i.e. FoundationPose.
I think that should be explicitly stated, at least in the Isaac ROS Pose Estimation overview. I had checked quite a few resources but was not able to find a clear statement on whether retraining was needed.
Here’s an example for DOPE
Thanks! Should I look into TAO to train on objects that don’t fall into the DOPE categories?