tm23

tm23hgf

AI & ML interests

None yet

Recent Activity

updated a model 10 days ago

tm23hgf/anime-sdxl-lora

published a model 10 days ago

tm23hgf/anime-sdxl-lora

commented on an article 12 days ago

Strand-Rust-Coder-v1: Rust Coding Model Fine-Tuned on Peer-Ranked Synthetic Data

Organizations

None yet

updated a model 10 days ago

tm23hgf/anime-sdxl-lora

Updated 10 days ago • 12

published a model 10 days ago

tm23hgf/anime-sdxl-lora

Updated 10 days ago • 12

commented on Strand-Rust-Coder-v1: Rust Coding Model Fine-Tuned on Peer-Ranked Synthetic Data 12 days ago

Awesome work. I am going to start some research on reasoning SLMs for Rust and wanted to know: is the dataset publicly released?

liked a Space 12 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

Building and scaling RL environments for LLM training

liked a Space 18 days ago

GPU Budget Negotiation Arena

Simulate GPU budget negotiations and view results

updated a Space 21 days ago

Social Network Env

Simulate a social network to detect coordinated inauthentic behavior

updated a dataset 22 days ago

tm23hgf/socialnet-sft

Viewer • Updated 22 days ago • 14.6k • 84

published a dataset 22 days ago

tm23hgf/socialnet-sft

Viewer • Updated 22 days ago • 14.6k • 84

published a Space 22 days ago

Social Network Env

Simulate a social network to detect coordinated inauthentic behavior

updated a model 29 days ago

tm23hgf/Qwen3-1.7B-Wordle-SFT

2B • Updated 29 days ago • 18

published a model 29 days ago

tm23hgf/Qwen3-1.7B-Wordle-SFT

2B • Updated 29 days ago • 18

updated a Space about 1 month ago

Algo Reasoning Environment

Submit Rust code and reasoning to get a correctness reward

published a Space about 1 month ago

Algo Reasoning Environment

Submit Rust code and reasoning to get a correctness reward

updated a Space about 2 months ago

Algo Reasoning Env

Evaluate algorithmic solutions with automated grading

published a Space about 2 months ago

Algo Reasoning Env

Evaluate algorithmic solutions with automated grading

New activity in BibbyResearch/3blue1brown-manim 5 months ago

Not a good dataset

#2 opened 5 months ago by

commented on Mixture of Experts Explained 6 months ago

The Chinchilla paper actually shows that, for a fixed compute budget, it is better to train a smaller model on more data than to train a larger model for fewer steps.

upvoted an article 6 months ago

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.13k