Tom Goldstein's Lab at University of Maryland, College Park

university

http://www.cs.umd.edu/~tomg/

tomgoldsteincs

AI & ML interests

AI security & privacy, algorithmic bias, foundations of ML

Recent Activity

smcleish new activity 2 days ago

tomg-group-umd/Gemstone-512x13:Update README.md

ehartz published a dataset 6 days ago

tomg-group-umd/IDE_code_retrival

smcleish authored a paper about 2 months ago

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Papers

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Gemstones: A Model Suite for Multi-Faceted Scaling Laws

tomg-group-umd's collections (13)

Retrofitting Recurrence

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 16

Refusal Token Models

This collection contains the models described in the refusal token paper, published at COLM 2025.

tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast

8B • Updated Jul 22, 2025 • 3
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-multiple-tokens

8B • Updated Jul 22, 2025 • 5 • 1
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-single-token

8B • Updated Jul 22, 2025 • 1 • 1
tomg-group-umd/zephyr-llama3-8b-sft-no-refusal-messages

8B • Updated Jul 22, 2025 • 5

LoRI Adapters

LoRI adapters for natural language understanding, code generation, mathematical reasoning, and safety alignment, based on LLaMA-3-8B and Mistral-7B.

tomg-group-umd/LoRI-S_safety_mistral7b_rank_64

Text Generation • Updated Apr 14, 2025 • 4 • 1
tomg-group-umd/LoRI-S_safety_mistral7b_rank_32

Text Generation • Updated Apr 14, 2025 • 2
tomg-group-umd/LoRI-S_safety_llama3_rank_64

Text Generation • Updated Aug 13, 2025 • 3
tomg-group-umd/LoRI-S_safety_llama3_rank_32

Text Generation • Updated Apr 14, 2025 • 3 • 1

Recurrent Models

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space.

tomg-group-umd/huginn-0125

Text Generation • 4B • Updated Jul 29, 2025 • 757 • 290
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 151
tomg-group-umd/huginn_swa_100_10_avg_0.9_merge

Text Generation • 4B • Updated Jul 17, 2025 • 9
tomg-group-umd/step-00010752-recurrence_full_512_0

Text Generation • 4B • Updated Jul 17, 2025 • 5

GenQA

tomg-group-umd/GenQA

Viewer • Updated Jun 21, 2024 • 11.1M • 496 • 54
tomg-group-umd/GenQA_raw

Viewer • Updated Jun 13, 2024 • 11.1M • 210
tomg-group-umd/GenQA_rebalanced

Viewer • Updated Jun 13, 2024 • 6.47M • 76 • 3
tomg-group-umd/GenQA-Subset-llama-3

Text Generation • 8B • Updated Jun 17, 2024 • 2

PixelProse

From Pixels to Prose: A Large Dataset of Dense Image Captions

Paper • 2406.10328 • Published Jun 14, 2024 • 18
tomg-group-umd/pixelprose

Viewer • Updated 26 days ago • 15.6M • 738 • 161
pixelprose/pixelprose-shards

Viewer • Updated 25 days ago • 7.66M • 1.2k • 1
pixelprose/pixelprose-jsons

Preview • Updated Jul 3, 2025 • 144

Zero-Shot Grafting

Zero-Shot Vision Encoder Grafting via LLM Surrogates

Paper • 2505.22664 • Published May 28, 2025 • 7
tomg-group-umd/zero-model-checkpoints

Image-Text-to-Text • Updated Aug 5, 2025 • 2

DynaGuard

https://arxiv.org/abs/2509.02563

tomg-group-umd/DynaGuard-8B

Text Generation • 8B • Updated Sep 3, 2025 • 72 • 14
tomg-group-umd/DynaGuard-4B

Text Generation • 4B • Updated Sep 3, 2025 • 76 • 2
tomg-group-umd/DynaGuard-1.7B

Text Generation • 2B • Updated Sep 3, 2025 • 112 • 3
DynaGuard/DynaBench

Viewer • Updated Nov 22, 2025 • 140k • 346 • 4

FictionalQA

tomg-group-umd/fictionalqa

Viewer • Updated Jun 9, 2025 • 31.7k • 227 • 2
tomg-group-umd/fictionalqa_training_splits

Viewer • Updated Jun 9, 2025 • 107k • 154
tomg-group-umd/fictionalqa_reformatted_triviaqa

Viewer • Updated Jun 9, 2025 • 16.4k • 38

Gemstone Models

Our 22 open-source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80.

tomg-group-umd/Gemstone-768x45

Text Generation • 0.5B • Updated Feb 9, 2025 • 7
tomg-group-umd/Gemstone-1280x15

Text Generation • 0.5B • Updated Feb 6, 2025 • 10
tomg-group-umd/Gemstone-512x13

Text Generation • 0.1B • Updated Feb 6, 2025 • 5
tomg-group-umd/Gemstone-1536x50

Text Generation • 2B • Updated Feb 7, 2025 • 5
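
The Gemstone repository names listed above appear to encode each model's shape as `<width>x<depth>`. That reading is an assumption drawn from the width and depth ranges in the collection description, not something this page states outright. Under that assumption, a minimal Python sketch for recovering the shape from a repository ID could look like the following; the helper `parse_gemstone_name` and the `GemstoneShape` type are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class GemstoneShape(NamedTuple):
    """Hypothetical container for the shape encoded in a repository name."""
    width: int
    depth: int


# Assumed naming convention: "Gemstone-<width>x<depth>" at the end of the repo ID.
_NAME_PATTERN = re.compile(r"Gemstone-(\d+)x(\d+)$")


def parse_gemstone_name(repo_id: str) -> GemstoneShape:
    """Extract (width, depth) from a repo ID such as 'tomg-group-umd/Gemstone-768x45'."""
    match = _NAME_PATTERN.search(repo_id)
    if match is None:
        raise ValueError(f"not a Gemstone-style repository name: {repo_id!r}")
    return GemstoneShape(width=int(match.group(1)), depth=int(match.group(2)))


# The four repository IDs shown in this collection preview.
repo_ids = [
    "tomg-group-umd/Gemstone-768x45",
    "tomg-group-umd/Gemstone-1280x15",
    "tomg-group-umd/Gemstone-512x13",
    "tomg-group-umd/Gemstone-1536x50",
]

shapes = {repo_id: parse_gemstone_name(repo_id) for repo_id in repo_ids}

for repo_id, shape in shapes.items():
    print(f"{repo_id}: width={shape.width}, depth={shape.depth}")
```

Parsing the names this way makes it easy to group or sort the checkpoints by width or depth without downloading any model files.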

Style Descriptors

How to extract style from images: the model, the dataset, and the paper.

Measuring Style Similarity in Diffusion Models

Paper • 2404.01292 • Published Apr 1, 2024 • 17
tomg-group-umd/CSD-ViT-L

Image Feature Extraction • Updated Sep 4, 2024 • 28 • 5
tomg-group-umd/ContraStyles

Viewer • Updated Jul 31, 2024 • 498k • 66 • 5

CLRS-Text

Hugging Face collection for all things CLRS-Text

The CLRS-Text Algorithmic Reasoning Language Benchmark

Paper • 2406.04229 • Published Jun 6, 2024 • 4
tomg-group-umd/CLRS-Text-train

Viewer • Updated Jul 14, 2024 • 2.15M • 215 • 2
tomg-group-umd/CLRS-Text-test

Viewer • Updated Jul 10, 2024 • 503k • 211

Goldfish Loss: Mitigating Memorization in LLMs

This collection contains artifacts from our paper "Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs."

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Paper • 2406.10209 • Published Jun 14, 2024 • 8
tomg-group-umd/3-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024 • 4
tomg-group-umd/4-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024 • 6
tomg-group-umd/8-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024 • 4