Releases: Genera1Z/SmoothSA

archive-spot

21 Oct 20:24
0745690

Here are the model checkpoints for the baseline SPOT.
DINOv2 ViT-S/14 is used as the encoder.
Models are trained on the ClevrTex, COCO and VOC datasets, with random seeds 42, 43 and 44.
Input resolution is 256x256 (224x224).

archive-smoothsa

21 Oct 20:24
0745690

Here are the model checkpoints for our SmoothSA.
DINOv2 ViT-S/14 is used as the encoder.
Models are trained on the ClevrTex, COCO, VOC, MOVi-C/D and YTVIS (high quality) datasets, with random seeds 42, 43 and 44.
Input resolution is 256x256 (224x224).

archive-recogn

22 Oct 08:22
0745690

Here are the model checkpoints for the object recognition models powered by SmoothSA and SPOT.
DINOv2 ViT-S/14 is used as the encoder.
Models are trained on the COCO and YTVIS (high quality) datasets, with random seeds 42, 43 and 44.
Input resolution is 256x256 (224x224).
The slot matching threshold is 1e-3@IoU for COCO and 1e-1@IoU for YTVIS.
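
To illustrate what an IoU-based slot matching threshold could mean in practice, here is a minimal sketch. It is not taken from the SmoothSA code base: the helper names `mask_iou` and `match_slots`, the greedy one-to-one matching strategy, and the "keep a pair if IoU >= threshold" rule are all assumptions made for illustration. The repository may well use a different procedure (for example Hungarian matching).

```python
from typing import List, Tuple

Mask = List[List[int]]  # binary mask as nested lists of 0/1 (hypothetical format)


def mask_iou(a: Mask, b: Mask) -> float:
    """Intersection over union of two same-sized binary masks."""
    inter = 0
    union = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            if pa and pb:
                inter += 1
            if pa or pb:
                union += 1
    return inter / union if union else 0.0


def match_slots(
    slot_masks: List[Mask], gt_masks: List[Mask], threshold: float
) -> List[Tuple[int, int, float]]:
    """Greedily pair slots with ground-truth objects, best IoU first.

    Assumption: a pair counts as a match only if its IoU is >= threshold.
    Returns (slot_index, gt_index, iou) triples, one per matched pair.
    """
    candidates = sorted(
        (
            (mask_iou(s, g), i, j)
            for i, s in enumerate(slot_masks)
            for j, g in enumerate(gt_masks)
        ),
        key=lambda t: t[0],
        reverse=True,
    )
    used_slots, used_gts, matches = set(), set(), []
    for iou, i, j in candidates:
        if iou < threshold:
            break  # all remaining candidates have lower IoU
        if i in used_slots or j in used_gts:
            continue  # keep the matching one-to-one
        used_slots.add(i)
        used_gts.add(j)
        matches.append((i, j, iou))
    return matches


# Toy example with 2x2 masks: three slots, two ground-truth objects.
slots = [[[1, 1], [0, 0]], [[0, 0], [0, 1]], [[0, 0], [0, 0]]]
gts = [[[0, 0], [1, 1]], [[1, 1], [0, 0]]]
matches_loose = match_slots(slots, gts, threshold=1e-1)
matches_strict = match_slots(slots, gts, threshold=0.6)
```

With a threshold of 1e-1, both non-empty slots are matched in this toy example. Raising it to 0.6 drops the slot that only partially overlaps its object. Under this reading, a very low threshold such as 1e-3 would accept almost any overlapping pair.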