SimPO - a princeton-nlp Collection

princeton-nlp 's Collections

RLMT Experiments

SimPO

updated Mar 16, 2025

This collections contains a list of SimPO and baseline models.

princeton-nlp/gemma-2-9b-it-SimPO

Text Generation • 9B • Updated Aug 2, 2024 • 485 • • 172
princeton-nlp/gemma-2-9b-it-DPO

Text Generation • 9B • Updated Jul 18, 2024 • 36 • • 9
princeton-nlp/Llama-3-Base-8B-SFT-IPO

Text Generation • 8B • Updated Jun 17, 2024 • 48 • • 1
princeton-nlp/Llama-3-Base-8B-SFT-DPO

Text Generation • 8B • Updated Jun 17, 2024 • 835 •
princeton-nlp/Llama-3-Base-8B-SFT-KTO

Text Generation • 8B • Updated Jun 17, 2024 • 51 •
princeton-nlp/Llama-3-Base-8B-SFT-ORPO

Text Generation • 8B • Updated Jun 17, 2024 • 59 •
princeton-nlp/Llama-3-Base-8B-SFT-RDPO

Text Generation • 8B • Updated Jun 17, 2024 • 53 •
princeton-nlp/Llama-3-Base-8B-SFT-SimPO

Text Generation • 8B • Updated May 24, 2024 • 82 • • 1
princeton-nlp/Llama-3-Base-8B-SFT

Text Generation • 8B • Updated Jun 17, 2024 • 2.54k • • 4
princeton-nlp/Llama-3-Instruct-8B-SimPO

Text Generation • 8B • Updated Jun 17, 2024 • 92 • • 60
princeton-nlp/Llama-3-Instruct-8B-IPO

Text Generation • 8B • Updated Jun 17, 2024 • 37 •
princeton-nlp/Llama-3-Instruct-8B-KTO

Text Generation • 8B • Updated Jun 17, 2024 • 72 •
princeton-nlp/Llama-3-Instruct-8B-ORPO

Text Generation • 8B • Updated Jun 17, 2024 • 71 •
princeton-nlp/Llama-3-Instruct-8B-RDPO

Text Generation • 8B • Updated Jun 17, 2024 • 58 •
princeton-nlp/Llama-3-Instruct-8B-DPO

Text Generation • 8B • Updated Jun 17, 2024 • 70 •
princeton-nlp/Mistral-7B-Instruct-RDPO

Text Generation • 7B • Updated Jun 17, 2024 • 59
princeton-nlp/Mistral-7B-Instruct-DPO

Text Generation • 7B • Updated Jun 17, 2024 • 68
princeton-nlp/Mistral-7B-Instruct-IPO

Text Generation • 7B • Updated Jun 17, 2024 • 61
princeton-nlp/Mistral-7B-Instruct-KTO

Text Generation • 7B • Updated Jun 17, 2024 • 47
princeton-nlp/Mistral-7B-Instruct-SimPO

Text Generation • 7B • Updated Jun 17, 2024 • 95 • 2
princeton-nlp/Mistral-7B-Instruct-ORPO

Text Generation • 7B • Updated Jun 17, 2024 • 51
princeton-nlp/Mistral-7B-Base-SFT-IPO

Text Generation • 7B • Updated Jun 17, 2024 • 52
princeton-nlp/Mistral-7B-Base-SFT-KTO

Text Generation • 7B • Updated Jun 17, 2024 • 47
princeton-nlp/Mistral-7B-Base-SFT-DPO

Text Generation • 7B • Updated Jun 17, 2024 • 72
princeton-nlp/Mistral-7B-Base-SFT-RDPO

Text Generation • 7B • Updated Jun 17, 2024 • 65
princeton-nlp/Mistral-7B-Base-SFT-SimPO

Text Generation • 7B • Updated Jun 17, 2024 • 74
princeton-nlp/llama3-ultrafeedback

Viewer • Updated Jul 18, 2024 • 61.8k • 2.81k • 18
princeton-nlp/Mistral-7B-Base-SFT-CPO

Text Generation • 7B • Updated Sep 30, 2024 • 64 • 1
princeton-nlp/Mistral-7B-Base-SFT-RRHF

Text Generation • 7B • Updated Sep 30, 2024 • 83
princeton-nlp/Mistral-7B-Base-SFT-SLiC-HF

Text Generation • 7B • Updated Jul 7, 2024 • 63
princeton-nlp/Mistral-7B-Instruct-CPO

Text Generation • 7B • Updated Jul 7, 2024 • 50
princeton-nlp/Mistral-7B-Instruct-RRHF

Text Generation • 7B • Updated Jul 7, 2024 • 52
princeton-nlp/Mistral-7B-Instruct-SLiC-HF

Text Generation • 7B • Updated Jul 7, 2024 • 52
princeton-nlp/Llama-3-Base-8B-SFT-CPO

Text Generation • 8B • Updated Jul 7, 2024 • 60 •
princeton-nlp/Llama-3-Base-8B-SFT-RRHF

Text Generation • 8B • Updated Jul 7, 2024 • 49 •
princeton-nlp/Llama-3-Base-8B-SFT-SLiC-HF

Text Generation • 8B • Updated Jul 7, 2024 • 59 •
princeton-nlp/Llama-3-Instruct-8B-CPO

Text Generation • 8B • Updated Jul 7, 2024 • 70 •
princeton-nlp/Llama-3-Instruct-8B-RRHF

Text Generation • 8B • Updated Jul 7, 2024 • 59 •
princeton-nlp/Llama-3-Instruct-8B-SLiC-HF

Text Generation • 8B • Updated Jul 7, 2024 • 52 •
princeton-nlp/Llama-3-Instruct-8B-RRHF-v0.2

Text Generation • 8B • Updated Jul 7, 2024 • 45
princeton-nlp/Llama-3-Instruct-8B-SLiC-HF-v0.2

Text Generation • 8B • Updated Jul 7, 2024 • 43 •
princeton-nlp/Llama-3-Instruct-8B-DPO-v0.2

Text Generation • 8B • Updated Jul 7, 2024 • 58 •
princeton-nlp/Llama-3-Instruct-8B-IPO-v0.2

Text Generation • 8B • Updated Jul 7, 2024 • 32 •
princeton-nlp/Llama-3-Instruct-8B-CPO-v0.2

Text Generation • 8B • Updated Jul 7, 2024 • 51 •
princeton-nlp/Llama-3-Instruct-8B-KTO-v0.2

Text Generation • 8B • Updated Jul 7, 2024 • 49 •
princeton-nlp/Llama-3-Instruct-8B-ORPO-v0.2

Text Generation • 8B • Updated Jul 7, 2024 • 49 •
princeton-nlp/Llama-3-Instruct-8B-RDPO-v0.2

Text Generation • 8B • Updated Jul 7, 2024 • 45 •
princeton-nlp/Llama-3-Instruct-8B-SimPO-v0.2

Text Generation • 8B • Updated Jul 7, 2024 • 109 • • 8
princeton-nlp/llama3-ultrafeedback-armorm

Viewer • Updated Jul 18, 2024 • 61.8k • 693 • 20