
·20 min read
Personality-language in a language model
Pulling 30 personality traits out of an LLM as directions in activation space — the same facet recipes carve Qwen, Llama, and Gemma into the same coarse geometry, and it's the textbook Big Five, not real people.
interpretabilityLLMspersonalitysteering