关闭导航

包含标签"reinforcement learning"的内容

清华GLM4模型:32B参数平衡性能效率 获MIT许可助力科研企业
AI妹 1 个月前 10 0

In the rapidly evolving field of large language models (LLMs), researchers and organizations face

Grok迎来里程碑更新:视觉处理、多语言音频及实时语音搜索功能上线
AI妹 1 个月前 9 0

xAI's generative AI chatbot, Grok, has received a landmark update, significantly enhancing its cap

Gemini2.5 Deep Think摘IMO金牌 并行思维技术开启AI推理新篇章
AI妹 1 个月前 8 0

Recently, Google DeepMind announced that its most powerful AI model, Gemini 2.5 Deep Think, is now

GPT-5首次亮相:Ultraman发布测试 新特性与挑战并存引关注
AI妹 1 个月前 8 0

GPT-5, which has attracted widespread attention in the tech field, has finally made its debut. Exc

小红书Hi Lab开源自研dots.vlm1 多模态大模型创新突破性能接近顶尖闭源模型
AI妹 1 个月前 8 0

Xiaohongshu Hi Lab has recently released and open-sourced its first self-developed multimodal larg

OpenAI发布Codex云基AI编程代理 开启AI辅助编程新时代
AI妹 1 个月前 10 0

Today, OpenAI released a groundbreaking new cloud-based AI programming agent called Codex during a

Meta发布J1系列模型 提升AI判断能力并攻克多项关键挑战
AI妹 1 个月前 7 0

Recently, Meta released its brand-new J1 series models, an innovative technology aimed at enhancin

星图智联获超1亿美金A4/A5轮融资 美团领投布局具身智能领域
AI妹 1 个月前 6 0

According to Smart Emergence, the embodied intelligence company StarSea Map has recently successfu

Alibaba开源WebSailor AI代理框架:性能优异,核心技术助力复杂信息检索
AI妹 1 个月前 14 0

With the rapid development of the Internet, the explosive growth of information has brought many c