📄 Paper
Extending implicit differentiation to other Riemannian bilevel optimization tasks is nontrivial because it requires much expert involvement for case-by-case derivations. In this article, we propose a Riemannian implicit differentiation method that provides a unified expression for outer gradients, leading to flexible application to other tasks with less expert involvement.
TOP-1 ERROR (%) ON CIFAR-LT-10/CIFAR-LT-100
| Method | CIFAR-LT-10 (200) | CIFAR-LT-10 (100) | CIFAR-LT-10 (50) | CIFAR-LT-10 (20) | CIFAR-LT-100 (200) | CIFAR-LT-100 (100) | CIFAR-LT-100 (50) | CIFAR-LT-100 (20) |
|---|---|---|---|---|---|---|---|---|
| Cross-entropy training | 34.32 | 29.63 | 25.19 | 17.77 | 65.16 | 61.68 | 56.15 | 48.86 |
| Class-balanced cross-entropy loss [65] | 31.11 | 27.63 | 21.95 | 15.64 | 64.30 | 61.44 | 55.45 | 42.88 |
| Class-balanced fine-tuning [66] | 33.76 | 28.66 | 22.56 | 16.78 | 61.34 | 58.5 | 53.78 | 47.70 |
| L2RW [67] | 33.75 | 27.77 | 23.55 | 18.65 | 67.00 | 61.10 | 56.83 | 49.25 |
| Meta-weight net [68] | 32.8 | 26.43 | 20.9 | 15.55 | 63.38 | 58.39 | 54.34 | 46.96 |
| Two-component weighting [69] | 29.34 | 23.59 | 19.49 | 13.54 | 60.69 | 56.65 | 51.47 | 44.38 |
| Divide and Retain [70] | - | - | - | - | 59.47 | 55.21 | 50.68 | - |
| Ours | 20.4 | 17.49 | 14.74 | 11.54 | 57.45 | 52.07 | 47.64 | 41.25 |
ACCURACY (%) ON THE MINI-IMAGENET DATASET
| Method | Backbone | 1-shot 5-way | 5-shot 5-way |
|---|---|---|---|
| MAML [72] | ResNet12 | 51.03 ± 0.50 | 68.26 ± 0.47 |
| L2F [73] | ResNet12 | 57.48 ± 0.49 | 74.68 ± 0.43 |
| CAML [74] | ResNet12 | 59.23 ± 0.99 | 72.35 ± 0.71 |
| ALFA [75] | ResNet12 | 60.06 ± 0.49 | 77.42 ± 0.42 |
| MetaOptNet [76] | ResNet12 | 62.64 ± 0.61 | 78.63 ± 0.46 |
| MetaFun [77] | ResNet12 | 62.12 ± 0.30 | 78.20 ± 0.16 |
| DSN [78] | ResNet12 | 62.64 ± 0.66 | 78.83 ± 0.45 |
| Chen et al. [79] | ResNet12 | 63.17 ± 0.23 | 79.26 ± 0.17 |
| MeTAL [80] | ResNet12 | 59.64 ± 0.38 | 76.20 ± 0.19 |
| LEO [81] | WRN-28-10 | 61.76 ± 0.08 | 77.59 ± 0.12 |
| Con-MetaReg [82] | ResNet12 | 53.68 ± 0.50 | 66.88 ± 0.42 |
| Hyper ProtoNet [4] | ResNet18 | 59.47 ± 0.20 | 76.84 ± 0.14 |
| Hyperbolic kernel [83] | ResNet18 | 61.04 ± 0.21 | 77.33 ± 0.15 |
| CurAML [84] | ResNet12 | 63.13 ± 0.41 | 81.04 ± 0.39 |
| Poincaré radial kernel [85] | ResNet18 | 62.15 ± 0.20 | 77.81 ± 0.15 |
| Ours | ResNet12 | 64.5 ± 0.23 | 82.1 ± 0.15 |
ACCURACY (%) ON THE TIERED-IMAGENET DATASET
| Method | Backbone | 1-shot 5-way | 5-shot 5-way |
|---|---|---|---|
| ProtoNet [86] | ResNet12 | 53.51 |
72.69 |
| MAML [72] | ResNet12 | 58.58 |
71.24 |
| L2F [73] | ResNet12 | 63.94 |
77.61 |
| ALFA [75] | ResNet12 | 64.43 |
81.77 |
| DSN [78] | ResNet12 | 66.22 |
82.79 |
| MetaOptNet [76] | ResNet12 | 65.99 |
83.28 |
| MetaFun [77] | ResNet12 | 67.72 |
78.20 |
| Chen et al. [79] | ResNet12 | 68.62 |
83.74 |
| MeTAL [80] | ResNet12 | 63.89 |
80.14 |
| LEO [81] | WRN-28-10 | 66.33 |
81.44 |
| Con-MetaReg [82] | ResNet12 | 54.41 |
68.23 |
| Hyper ProtoNet [4] | ResNet18 | 54.44 |
71.96 |
| Hyperbolic kernel [83] | ResNet18 | 57.78 |
76.48 |
| CurAML [84] | ResNet12 | 68.46 |
83.84 |
| Poincaré radial kernel [85] | ResNet18 | 65.33 |
77.48 |
| Ours | ResNet12 | 71.56 |
85.75 |
Run the script below to train your model with our method.
python Grassmann_pca/train/train.py
Evaluate the trained model using the following code.
python Grassmann_pca/test/test.py
Run the script below to train your model with our method.
bash SPD_clustering/train/train.py
Evaluate the trained model using the following code.
python SPD_clustering/test/evaluation.py
Run the script below to train and test your model with our method.
bash Stiefel_C-LT/vali_20.sh
bash Stiefel_C-LT/vali_50.sh
bash Stiefel_C-LT/vali_100.sh
bash Stiefel_C-LT/vali_200.sh
Run the script below to train and test your model with our method.
bash Hyperbolic_few-shot/miniimagenet/miniimagenet_shot1.sh
bash Hyperbolic_few-shot/miniimagenet/miniimagenet_shot5.sh
bash Hyperbolic_few-shot/tieredimagenet/tieredimagenet_shot1.sh
bash Hyperbolic_few-shot/tieredimagenet/tieredimagenet_shot1.sh
If you find our work helpful, please consider cite our paper 📝 and star us ⭐️!
@ARTICLE{11247945,
author={Fan, Xiaomeng and Wu, Yuwei and Gao, Zhi and Lu, Zhipeng and Li, Feng and Harandi, Mehrtash and Jia, Yunde},
journal={IEEE Transactions on Neural Networks and Learning Systems},
title={Riemannian Implicit Differentiation via a Fixed-Point Equation for Riemannian Bilevel Optimization},
year={2025},
volume={},
number={},
pages={1-15},
doi={10.1109/TNNLS.2025.3624316}}

