George Dasoulas, Kevin Scaman, Aladin Virmaux: Lipschitz normalization for self-attention layers with application to graph neural networks. ICML 2021: 2456-2466