AI & ML interests

AI security & privacy, algorithmic bias, foundations of ML

Recent Activity

tomg-group-umd 's collections 13

Refusal Token Models
This collection contains models described in the refusal token paper published in COLM 2025.
LoRI Adapters
LoRI adapters for natural language understanding, code generation, mathematical reasoning, and safety alignment, based on LLaMA-3-8B and Mistral-7B.
Recurrent Models
These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space.
Gemstone Models
Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80.
Goldfish Loss: Mitigating Memorization in LLMs
This collection contains artifacts from our paper titled: "Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs."
Refusal Token Models
This collection contains models described in the refusal token paper published in COLM 2025.
LoRI Adapters
LoRI adapters for natural language understanding, code generation, mathematical reasoning, and safety alignment, based on LLaMA-3-8B and Mistral-7B.
Gemstone Models
Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80.
Recurrent Models
These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space.
Goldfish Loss: Mitigating Memorization in LLMs
This collection contains artifacts from our paper titled: "Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs."