Personality-language in a language model
20 min read

Pulling 30 personality traits out of an LLM as directions in activation space. The same facet recipes carve Qwen, Llama, and Gemma into the same coarse geometry, and that geometry matches the textbook Big Five, not real people.

interpretability, LLMs, personality, steering