Content tagged "data difficulty distribution adjustment"

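ByteDance, with HKU and Fudan, Releases and Open-Sources POLARIS, a Reinforcement Learning Method That Improves Small Models' Mathematical Reasoning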
AI妹, 1 month ago

Recently, the Seed team at ByteDance has collaborated with the University of Hong Kong and Fudan U