Center for Language and Speech Processing @ JHU

university

https://www.clsp.jhu.edu/

jhuclsp

JHU-CLSP

Recent Activity

TaiMingLu authored a paper 5 days ago

Stronger Normalization-Free Transformers

orionweller new activity 6 days ago

jhu-clsp/mmBERT-decay-data: Update README: Fix TiQuAD's language name to Tigrinya

TaiMingLu authored a paper about 1 month ago

World-in-World: World Models in a Closed-Loop World

Papers

Genomic Next-Token Predictors are In-Context Learners

Controlled Generation for Private Synthetic Text

Collections 3

Spaces 1

Science Hierarchography

Explore academic paper hierarchies and details

Models 53

jhu-clsp/mmBERT-small

Fill-Mask • Updated Oct 17 • 9.48k • 56

jhu-clsp/mmBERT-base

Fill-Mask • Updated Oct 7 • 353k • 169

jhu-clsp/mmBERT-checkpoints

Updated Sep 9 • 3

jhu-clsp/ettin-decoder-1b

Fill-Mask • Updated Jul 21 • 179 • 4

jhu-clsp/ettin-decoder-32m

Text Generation • Updated Jul 18 • 166

jhu-clsp/ettin-encoder-1b

Feature Extraction • Updated Jul 18 • 740 • 21

jhu-clsp/ettin-encoder-68m

Fill-Mask • Updated Jul 18 • 571 • 3

jhu-clsp/ettin-dec-from-enc-32m

Text Generation • Updated Jul 18 • 15

jhu-clsp/ettin-encoder-150m

Fill-Mask • Updated Jul 18 • 18.5k • 8

jhu-clsp/ettin-decoder-400m

Text Generation • Updated Jul 18 • 116 • 2

Datasets 38

jhu-clsp/mmBERT-decay-data

Updated 6 days ago • 15.7k • 3

jhu-clsp/mmBERT-midtraining-data

Updated Oct 13 • 40.5k • 1

jhu-clsp/megawika-2

Updated Sep 3 • 7.05k • 2

jhu-clsp/ettin-pretraining-data

Updated Jul 18 • 14.1k • 8

jhu-clsp/ettin-decay-data

Updated Jul 18 • 671 • 1

jhu-clsp/astro-llms-benchmark-dataset

Viewer • Updated Jul 16 • 40 • 27

jhu-clsp/astro-llms-full-query-data

Viewer • Updated Jul 16 • 368 • 51

jhu-clsp/ettin-extension-data

Updated Jul 16 • 219

jhu-clsp/ettin-data-order

Viewer • Updated Jul 16 • 3B • 5 • 1

jhu-clsp/rank1-R1-MSMARCO

Viewer • Updated Feb 26 • 635k • 65 • 2
