archive-spot
Here are model checkpoints for baseline SPOT.
The encoder is DINOv2 ViT-S/14.
Models are trained on the ClevrTex, COCO and VOC datasets, each with random seeds 42, 43 and 44.
Input resolution is 256x256 (224x224).
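As a rough sketch of what the description above implies, the snippet below enumerates the dataset and seed combinations and checks the patch grid for a patch size of 14. It assumes one checkpoint per dataset/seed pair and that 224x224 is the size the encoder actually sees; neither is stated here. The names `runs` and `patch_grid` are hypothetical helpers, not part of this archive.

```python
from itertools import product

DATASETS = ["ClevrTex", "COCO", "VOC"]
SEEDS = [42, 43, 44]
PATCH_SIZE = 14  # the "/14" in S/14


def runs():
    """All (dataset, seed) pairs, assuming one checkpoint per pair."""
    return list(product(DATASETS, SEEDS))


def patch_grid(side, patch=PATCH_SIZE):
    """Number of patches along one side of a square input image."""
    if side % patch != 0:
        raise ValueError(f"{side} is not divisible by patch size {patch}")
    return side // patch


if __name__ == "__main__":
    for dataset, seed in runs():
        print(dataset, seed)
    side = patch_grid(224)
    print("patches per side:", side, "tokens per image:", side * side)
```

Note that 256 is not a multiple of 14 while 224 is, which may be why both resolutions are listed.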