Blog

May 28, 2024

Paper page — LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters

Posted by in category: robotics/AI

LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters.

https://huggingface.co/papers/2405.

A good initialization of deep learning models is essential since it can help them converge better and faster.


Join the discussion on this paper page.

Leave a reply