Pull requests: google/tunix
Pull requests list
instrument environment interactions for perf metrics
#1303 opened Mar 25, 2026 by copybara-service (bot)
add logging of example rewards with snipped output, controlled under debug=True flag from GRPOConfig
#1298 opened Mar 25, 2026 by copybara-service (bot)
[Tunix] Add special handling for math answer grading.
#1297 opened Mar 25, 2026 by copybara-service (bot)
[Tunix] Hide internal helper functions for 1p only in reshard.py.
#1291 opened Mar 24, 2026 by copybara-service (bot)
Add valid_traj to AgenticRL TrainExample.
#1283 opened Mar 23, 2026 by copybara-service (bot)
[Tunix] Update DeepScaler solve metrics to use mean aggregation.
#1282 opened Mar 23, 2026 by copybara-service (bot)
security: replace eval() with safe AST math evaluator in calculate reward
#1279 opened Mar 23, 2026 by RaymondSeven
remove dummy reward fns for agentic learner.
#1275 opened Mar 20, 2026 by copybara-service (bot)
[Tunix] Snapshot gradient norm and log the clipped gradient norm during training.
#1264 opened Mar 19, 2026 by copybara-service (bot)
fix: guard against division by zero in masked_var Bessel correction
#1262 opened Mar 18, 2026 by kbhujbal
6 tasks done
[Tunix] Update qwix dependency and TPU test image.
#1260 opened Mar 18, 2026 by copybara-service (bot)
[Tunix] add advantage/mean and advantage/std metrics logging and match grpo_learner implementation with agentic_grpo_learner
#1258 opened Mar 17, 2026 by copybara-service (bot)
Add support for dynamically setting the number of steps for GRPO.
#1257 opened Mar 17, 2026 by niting
6 tasks done
add support for repeating kv head tensors during weight sync.
#1251 opened Mar 16, 2026 by NicoGrande
6 tasks done
allow passing eval_ds parameters through tunix cli, plus bugfixes
#1238 opened Mar 12, 2026 by andytwigg