Jim Lai
grimjim
AI & ML interests
Experimenting primarily with 7B-12B parameter text completion models. Not all models are intended for direct end use, but aim for research and/or educational purposes.
Recent Contributions: stabilized refusal direction ablation via Gram-Schmidt orthonormalization and norm-preserving interventions; confirmed reasoning transfer via model merger.
Recent Activity
published an article about 1 month ago
ORBA: Orthogonal Reflection Bounded Ablation — A Geometrically Exact Detour in Directional Activation Editing updated a model about 1 month ago
grimjim/gemma-3-12b-it-orthogonal-reflection-bounded-ablation-v4-12B published a model about 1 month ago
grimjim/gemma-3-12b-it-orthogonal-reflection-bounded-ablation-v4-12B