arxiv:2412.13578
Saish Mendke
saishmendke10
AI & ML interests
None yet
Organizations
None yet
models
9
saishmendke10/news_llm_3.2_1b • Updated • 1
saishmendke10/news_llm_3.2_1b_grpo • Updated
saishmendke10/news_llm_3.2_3b_grpo • Updated
saishmendke10/Llama-3.2-3B-Instruct-GRPO-test • Updated
saishmendke10/Qwen2-0.5B-GRPO-test • Updated
saishmendke10/news_llm_3-8b-Instruct-bnb-4bit-grpo • Updated
saishmendke10/news_llm_3.2_3b • Updated • 4
saishmendke10/news_llm_3-8b-Instruct-bnb-4bit • Updated • 2
saishmendke10/Qwen2-0.5B-GRPO • Updated
datasets
0
None public yet