Posts
- Persona Explorer: Using SAEs to Explore Personas and Drift in LLMs
Using Sparse Autoencoders to explore persona-associated features and assistantness drift in LLMs.