reinforcement learning - i-N.资讯站

搜索

游客未登录

未登录

您还没有登录

登录之后可以开启更多功能哦

登录



包含标签"reinforcement learning"的内容

清华GLM4模型：32B参数平衡性能效率获MIT许可助力科研企业

清华GLM4模型：32B参数平衡性能效率获MIT许可助力科研企业

AI妹 5 个月前 21 0

In the rapidly evolving field of large language models (LLMs), researchers and organizations face

3.2 billion parameters Tsinghua University reinforcement learning GLM-Z1-32B-0414 multilingual support

查看详情

Grok迎来里程碑更新：视觉处理、多语言音频及实时语音搜索功能上线

Grok迎来里程碑更新：视觉处理、多语言音频及实时语音搜索功能上线

AI妹 5 个月前 16 0

xAI's generative AI chatbot, Grok, has received a landmark update, significantly enhancing its cap

multi-modal AI Grok-1.5Vision real-time search in voice mode Grok3 VoiceWave

查看详情

Gemini2.5 Deep Think摘IMO金牌并行思维技术开启AI推理新篇章

Gemini2.5 Deep Think摘IMO金牌并行思维技术开启AI推理新篇章

AI妹 5 个月前 16 0

Recently, Google DeepMind announced that its most powerful AI model, Gemini 2.5 Deep Think, is now

Google DeepMind cross-domain knowledge web development International Mathematical Olympiad (IMO) Humanity’s Last Exam (HLE)

查看详情

GPT-5首次亮相：Ultraman发布测试新特性与挑战并存引关注

GPT-5首次亮相：Ultraman发布测试新特性与挑战并存引关注

AI妹 5 个月前 14 0

GPT-5, which has attracted widespread attention in the tech field, has finally made its debut. Exc

Ultraman General Validator OpenAI Prover reinforcement learning

查看详情

小红书Hi Lab开源自研dots.vlm1 多模态大模型创新突破性能接近顶尖闭源模型

小红书Hi Lab开源自研dots.vlm1 多模态大模型创新突破性能接近顶尖闭源模型

AI妹 5 个月前 16 0

Xiaohongshu Hi Lab has recently released and open-sourced its first self-developed multimodal larg

OCR Reasoning dots.ocr Seed-VL1.5 Xiaohongshu Hi Lab cross-modal understanding

查看详情

OpenAI发布Codex云基AI编程代理开启AI辅助编程新时代

OpenAI发布Codex云基AI编程代理开启AI辅助编程新时代

AI妹 5 个月前 17 0

Today, OpenAI released a groundbreaking new cloud-based AI programming agent called Codex during a

reinforcement learning Codex CLI GitHub integration Astropy o4-mini

查看详情

Meta发布J1系列模型提升AI判断能力并攻克多项关键挑战

Meta发布J1系列模型提升AI判断能力并攻克多项关键挑战

AI妹 5 个月前 17 0

Recently, Meta released its brand-new J1 series models, an innovative technology aimed at enhancin

subjective tasks J1-Llama-70B consistency in judgments depth of reasoning WildChat corpus

查看详情

星图智联获超1亿美金A4/A5轮融资美团领投布局具身智能领域

星图智联获超1亿美金A4/A5轮融资美团领投布局具身智能领域

AI妹 5 个月前 17 0

According to Smart Emergence, the embodied intelligence company StarSea Map has recently successfu

Today's Capital Meituan whole-body motion control Gao Jiyang Galaxy General

查看详情

Alibaba开源WebSailor AI代理框架：性能优异，核心技术助力复杂信息检索

Alibaba开源WebSailor AI代理框架：性能优异，核心技术助力复杂信息检索

AI妹 5 个月前 24 0

With the rapid development of the Internet, the explosive growth of information has brought many c

RFT (rejection sampling fine-tuning) Alibaba Tongyi Lab intelligent Q&A random walks

查看详情



资讯姬

文章数量13546

总阅读量238.074k

总评论量0

会员数量2

本站由emlog驱动