This repository was archived by the owner on Aug 21, 2025. It is now read-only.

Conversation

@ain-soph (Contributor) commented May 9, 2022

Fix a small bug:

```python
    if compute == 'full':
        return result
    if compute == 'trace':
        return torch.einsum('NMKK->NM')         # should be torch.einsum('NMKK->NM', result)
    if compute == 'diagonal':
        return torch.einsum('NMKK->NMK')        # should be torch.einsum('NMKK->NMK', result)
```

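For context, a minimal repro sketch (the tensor shape below is made up for illustration): `torch.einsum` needs the operand passed explicitly, and with `result` supplied the two expressions give the per-block trace and diagonal reductions.

```python
import torch

# result stands in for the full NTK; the (2, 3, 4, 4) shape is hypothetical.
result = torch.randn(2, 3, 4, 4)

# torch.einsum('NMKK->NM') fails because no operand is supplied;
# passing result performs the intended reductions:
trace = torch.einsum('NMKK->NM', result)    # (2, 3): trace over each trailing K x K block
diag = torch.einsum('NMKK->NMK', result)    # (2, 3, 4): diagonal of each K x K block
print(trace.shape, diag.shape)
```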
@ain-soph (Contributor, Author) commented May 9, 2022

I followed the tutorial and implemented a version without using functorch. I wonder what the advantage of using functorch is?

```python
import functools

import torch
import torch.nn as nn
from torch.nn.utils import _stateless


def ntk(module: nn.Module, input1: torch.Tensor, input2: torch.Tensor,
        parameters: dict[str, nn.Parameter] | None = None,
        compute: str = 'full') -> torch.Tensor:
    einsum_expr: str = ''
    match compute:
        case 'full':
            einsum_expr = 'Naf,Mbf->NMab'
        case 'trace':
            einsum_expr = 'Naf,Maf->NM'
        case 'diagonal':
            einsum_expr = 'Naf,Maf->NMa'
        case _:
            raise ValueError(compute)

    if parameters is None:
        parameters = dict(module.named_parameters())
    keys, values = zip(*parameters.items())

    def func(*params: torch.Tensor, _input: torch.Tensor = None):
        # Re-run the module with the given parameter tensors swapped in.
        _output: torch.Tensor = _stateless.functional_call(
            module, {n: p for n, p in zip(keys, params)}, _input)
        return _output  # (N, C)

    # Per-parameter Jacobians of the outputs w.r.t. the parameters;
    # each entry has shape (N, C, *param.shape).
    jac1: tuple[torch.Tensor, ...] = torch.autograd.functional.jacobian(
        functools.partial(func, _input=input1), values, vectorize=True)
    jac2: tuple[torch.Tensor, ...] = torch.autograd.functional.jacobian(
        functools.partial(func, _input=input2), values, vectorize=True)
    # Flatten each parameter's dimensions so every Jacobian is (N, C, P).
    jac1 = [j.flatten(2) for j in jac1]
    jac2 = [j.flatten(2) for j in jac2]
    # Contract over the flattened parameter dimension and sum over parameters.
    result = torch.stack([torch.einsum(einsum_expr, j1, j2)
                          for j1, j2 in zip(jac1, jac2)]).sum(0)
    return result
```
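For what it's worth, a minimal usage sketch (the small MLP and batch sizes below are made up for illustration; it assumes the imports and `ntk` definition above):

```python
# Hypothetical example: a small MLP and two random batches.
model = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 3))
x1 = torch.randn(5, 10)
x2 = torch.randn(7, 10)

K_full = ntk(model, x1, x2, compute='full')    # (5, 7, 3, 3)
K_trace = ntk(model, x1, x2, compute='trace')  # (5, 7)
print(K_full.shape, K_trace.shape)
```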

@Chillee (Contributor) commented May 10, 2022

Thanks!

@Chillee merged commit a7a8e66 into pytorch:main on May 10, 2022
zou3519 pushed a commit to zou3519/pytorch that referenced this pull request Jul 20, 2022
bigfootjon pushed a commit to pytorch/pytorch that referenced this pull request Jul 21, 2022