DeepMind makes big jump toward interpreting LLMs with sparse autoencoders

New research from Google DeepMind shows how sparse autoencoders (SAEs) with a special JumpReLU activation can help interpret the inner workings of large language models (LLMs).
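
As a rough illustration of the activation named above, here is a minimal sketch assuming the usual definition of JumpReLU: a value passes through unchanged if it exceeds a threshold and is set to zero otherwise. The function names, the sample inputs, and the threshold value 0.3 are hypothetical and chosen only for illustration; in a real SAE the threshold would be learned during training rather than fixed by hand.

```python
def relu(pre_activations):
    """Standard ReLU: negative values become zero."""
    return [max(z, 0.0) for z in pre_activations]


def jump_relu(pre_activations, threshold):
    """JumpReLU sketch: keep values strictly above the threshold, zero the rest."""
    return [z if z > threshold else 0.0 for z in pre_activations]


# Hypothetical encoder pre-activations for four features.
pre_acts = [-0.5, 0.1, 0.4, 2.0]
theta = 0.3  # assumed threshold, for illustration only

print(relu(pre_acts))              # small positive values survive
print(jump_relu(pre_acts, theta))  # small positive values are zeroed as well
```

Compared with plain ReLU, the threshold also removes weak positive activations, so fewer features remain active for a given input, which is the sparsity that SAEs aim for.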

DeepMind makes big jump toward interpreting LLMs with sparse autoencoders venturebeat.com

RSS Hunter • 29.7.2024