-
Notifications
You must be signed in to change notification settings - Fork 24
Description
Thank you very much for your work!
I have a question about the required number of backward passes needed to compute the FIM. In the original KFAC paper (https://arxiv.org/pdf/1503.05671) and in other implementations (https://openreview.net/pdf?id=BJlrF24twB), it seems like the FIM can be computed in a single backward pass. In the first paper, this is due to a single MC sample, while in the second, it seems possible to compute the exact FIM in a single pass.
In nngeometry, when sampling a single MC example, we still explicitly compute the Jacobian with "for i in range(n_output)". It seems to me that this approach is in contrast with the two papers, and can be avoided by doing a standard backpropagation from the loss computed on the new sampled target.
What am I getting wrong?
In particular, I am interested in using nngeometry for a regression problem with MANY output units, but the computation time is very high.
Thanks again.