THUDM

slime

slime is an LLM post-training framework for RL Scaling.

Trusted Project Python
6.5k stars 195 stars today via trending-python
View on GitHub
← Back to Trending Repos