Hugging Face
Anna4242
3 followers · 4 following
AI & ML interests
None yet
Organizations
Anna4242's activity
updated a model 27 days ago
Anna4242/qwen25-7b-multihop-grpo-checkpoint-200
8B • Updated 27 days ago • 9

published a model 27 days ago
Anna4242/qwen25-7b-multihop-grpo-checkpoint-200
8B • Updated 27 days ago • 9

updated a model 27 days ago
Anna4242/qwen25-7b-singlehop-grpo-checkpoint-200
8B • Updated 27 days ago • 9

published a model 27 days ago
Anna4242/qwen25-7b-singlehop-grpo-checkpoint-200
8B • Updated 27 days ago • 9

updated a model 30 days ago
Anna4242/qwen25-3b-instruct-grpo-merged
3B • Updated 30 days ago • 3

published a model 30 days ago
Anna4242/qwen25-3b-instruct-grpo-merged
3B • Updated 30 days ago • 3

updated a model 30 days ago
Anna4242/qwen25-3b-base-grpo
Text Generation • Updated 30 days ago

published a model about 1 month ago
Anna4242/qwen25-3b-base-grpo
Text Generation • Updated 30 days ago

updated a dataset about 1 month ago
Anna4242/grpo-training-plots
Viewer • Updated about 1 month ago • 1.41k • 30

published a dataset about 1 month ago
Anna4242/grpo-training-plots
Viewer • Updated about 1 month ago • 1.41k • 30

updated a model about 1 month ago
Anna4242/qwen25-7b-full-sft-multihop
8B • Updated about 1 month ago • 3

published a model about 1 month ago
Anna4242/qwen25-7b-full-sft-multihop
8B • Updated about 1 month ago • 3

updated a model about 1 month ago
Anna4242/qwen25-3b-full-sft-multihop
3B • Updated about 1 month ago • 7

published a model about 1 month ago
Anna4242/qwen25-3b-full-sft-multihop
3B • Updated about 1 month ago • 7

updated a model about 1 month ago
Anna4242/qwen25-7b-sft-grpo-checkpoint-200
Reinforcement Learning • Updated about 1 month ago

published a model about 1 month ago
Anna4242/qwen25-7b-sft-grpo-checkpoint-200
Reinforcement Learning • Updated about 1 month ago

updated a model about 1 month ago
Anna4242/qwen25-3b-original-sft-ep1-grpo-checkpoint-200
Text Generation • Updated Nov 27

published a model about 1 month ago
Anna4242/qwen25-3b-original-sft-ep1-grpo-checkpoint-200
Text Generation • Updated Nov 27

updated a model about 1 month ago
Anna4242/Qwen2.5-7B-Instruct-onlyrl-step-1000
8B • Updated Nov 26 • 2

published a model about 1 month ago
Anna4242/Qwen2.5-7B-Instruct-onlyrl-step-1000
8B • Updated Nov 26 • 2