AI & ML interests
None defined yet.
Recent Activity
rlsamplingJF/evolm-4B-160BT-finemath_part1_part2-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-cc0.01-ls0-step180
4B
•
Updated
•
15
rlsamplingJF/Qwen2.5-7B-Instruct-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs8-gc1.0-cc0.01-ls0.1-step45
7B
•
Updated
•
18
rlsamplingJF/Llama-3.2-3B-finemath_part1_part2-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-cc0.01-ls0-step375
3B
•
Updated
•
40
rlsamplingJF/Llama-3.2-3B-finemath_part1_part2-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-step345
3B
•
Updated
•
19
rlsamplingJF/evolm-4B-160BT-finemath_part1_part2-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-cc0.01-ls0-initial
4B
•
Updated
•
20
rlsamplingJF/evolm-4B-160BT-finemath_part1_part2-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-cc0.01-ls0-step120
4B
•
Updated
•
24
rlsamplingJF/Llama-3.2-1B-finemath_part1_part2-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-cc0.01-ls0.1-step150
1B
•
Updated
•
15
rlsamplingJF/Llama-3.2-1B-finemath_part1_part2-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-cc0-ls0-initial
1B
•
Updated
•
21
rlsamplingJF/Llama-3.2-1B-finemath_part1_part2-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-cc0-ls0-step405
1B
•
Updated
•
18
rlsamplingJF/Qwen2.5-7B-Instruct-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-cc0.01-ls0.1-step30
7B
•
Updated
•
28
rlsamplingJF/Llama-3.2-3B-finemath_part1_part2-rm-lr1e-5-constant-warmup_0.05-bs44-gc1.0-step105
3B
•
Updated
•
17
rlsamplingJF/Llama-3.2-3B-finemath_part1_part2-rm-lr1e-6-constant-warmup_0.05-bs32-gc1.0-step195
3B
•
Updated
•
27
rlsamplingJF/Qwen2.5-7B-Instruct-finemath_part1-rm-lr5e-7-constant-warmup_0.05-bs8-gc1.0-cc0.0-ls0.0-initial
7B
•
Updated
•
80
rlsamplingJF/Qwen2.5-7B-Instruct-finemath_part1-rm-lr5e-7-constant-warmup_0.05-bs8-gc1.0-cc0.0-ls0.0-step105
7B
•
Updated
•
23
rlsamplingJF/Qwen2.5-7B-Instruct-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs8-gc1.0-step60
7B
•
Updated
•
113
rlsamplingJF/Llama-3.2-3B-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs32-gc1.0-cc0.01-ls0-initial
3B
•
Updated
•
32
rlsamplingJF/Llama-3.2-3B-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs32-gc1.0-cc0.01-ls0-step109
3B
•
Updated
•
23
rlsamplingJF/Llama-3.2-3B-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-initial
3B
•
Updated
•
17
rlsamplingJF/Llama-3.2-3B-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-step220
3B
•
Updated
•
19
rlsamplingJF/Qwen2.5-3B-Instruct-finemath-highquality-part1-seed2028-initial
3B
•
Updated
•
16
rlsamplingJF/Qwen2.5-3B-Instruct-finemath-highquality-part1-seed2028
3B
•
Updated
•
14
rlsamplingJF/myllama-1B-20BT-finemath-highquality-part1-seed2026-initial
0.9B
•
Updated
•
15
rlsamplingJF/myllama-1B-20BT-finemath-highquality-part1-seed2026
0.9B
•
Updated
•
15
rlsamplingJF/Llama-3.2-3B-finemath-highquality-part1-seed2025-initial
3B
•
Updated
•
16
rlsamplingJF/Llama-3.2-3B-finemath-highquality-part1-seed2025
3B
•
Updated
•
15
rlsamplingJF/Llama-3.2-3B-finemath-highquality-rm-run2-lr3e-5-cosine-bs32-gc1.0-initial
3B
•
Updated
•
15
rlsamplingJF/Llama-3.2-3B-finemath-highquality-rm-run2-lr3e-5-cosine-bs32-gc1.0
3B
•
Updated
•
17
rlsamplingJF/posttraining_sentence_Qwen2.5-7B-Instruct-finemath-rm-run1-lr1e-6-constant-bs8-gc10.0-step84
7B
•
Updated
•
6
rlsamplingJF/posttraining_sentence_Qwen2.5-7B-Instruct-finemath-rm-run1-lr1e-6-constant-bs8-gc10.0-step36
7B
•
Updated
•
22
rlsamplingJF/posttraining_sentence_Qwen2.5-7B-Instruct-finemath-rm-run1-lr1e-6-constant-bs8-gc10.0-initial
7B
•
Updated
•
23