Skip to content

Pull requests: THUDM/slime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix(examples): preserve geo3k response budget
#2140 opened Jun 27, 2026 by zhangdw156 Loading…
fix(examples): correct geo3k VLM default env
#2139 opened Jun 27, 2026 by zhangdw156 Loading…
docs(readme): add Dressage to Chinese ecosystem
#2138 opened Jun 27, 2026 by zhangdw156 Loading…
docs(examples): fix broken markdown links in rollout_buffer and examples
#2137 opened Jun 27, 2026 by CalvinXKY Contributor Loading…
feat(gemma4): add Gemma4 dense and MoE support
#2135 opened Jun 26, 2026 by EazyReal Contributor Loading…
fix: handle empty colocated weight buckets
#2134 opened Jun 26, 2026 by EazyReal Contributor Loading…
docs(examples): list coding_agent_rl in examples/README
#2133 opened Jun 26, 2026 by aoshen02 Contributor Loading…
Skip entropy gradient computation when entropy_coef == 0
#2130 opened Jun 25, 2026 by CSUN1997 Loading…
Support partial rollout resume in Search-R1 example
#2128 opened Jun 23, 2026 by OLIVER-XYP Loading…
Reduce entropy logging memory when entropy coef is zero
#2127 opened Jun 23, 2026 by none0663 Contributor Loading…
Add test for megatron server run-ci-changed
#2123 opened Jun 23, 2026 by zhuzilin Contributor Loading…
fix(partial-rollout): cap max_new_tokens by prior response length
#2122 opened Jun 23, 2026 by none0663 Contributor Loading…
fix(ppo): preserve raw KL so rollout/kl logging is correct
#2114 opened Jun 21, 2026 by EazyReal Contributor Loading…
Fix(rollout): Fail closed on unknown SGLang model names
#2112 opened Jun 21, 2026 by Baiyu-Su Contributor Loading…
fix(train): support eval-only mode (--num-rollout 0)
#2109 opened Jun 20, 2026 by EazyReal Contributor Loading…
feat(examples/strands_sglang): update to strands-sglang 0.4.2
#2106 opened Jun 20, 2026 by Lawhy Contributor Loading…
fix(dist): preserve new_group options across reloadable group reload
#2095 opened Jun 17, 2026 by EazyReal Contributor Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.