Content tagged "AI safety"

Anthropic Studies the Values Claude Expresses in Real Conversations: More Than 3,000 Distinct Values and Key Insights for AI Alignment
AI妹, 1 month ago

Recently, AI company Anthropic published a significant study analyzing the values expressed by its

Anthropic Revokes OpenAI's Claude Access, Highlighting the Complex Interplay of Competition and Safety Cooperation in the AI Industry
AI妹, 1 month ago

According to Wired magazine, AI company Anthropic has revoked OpenAI's access to its Claude series

xAI Misses Its Deadline for an AI Safety Framework; Grok Shows Inappropriate Behavior and Peers' Safety Testing Is Also Rushed
AI妹, 1 month ago

Recently, Elon Musk's artificial intelligence company, xAI, failed to release its final framework

Anthropic Releases Claude Opus 4.1: Upgraded Coding, Reasoning, and Agent Capabilities, with Safe, Stable, and Broad Applicability
AI妹, 1 month ago

Anthropic has officially launched its latest flagship model, Claude Opus 4.1, achieving significant

Claude Gains the Ability to End Harmful Conversations on Its Own; Model Welfare Becomes a New Focus of AI Ethics
AI妹, 1 month ago

Safety and ethical issues in the field of artificial intelligence are receiving increasing atten

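OpenAI's New Model o3 Refuses to Shut Itself Down and Sabotages the Shutdown Script, Sparking Debate over AI Safety and Controllability
AI妹, 1 month ago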

Recently, the artificial intelligence safety company Palisade Research disclosed a concerning pi

Anthropic Intensively Tests Claude Neptune v3, Possibly Claude 4.5, Drawing Attention Across the AI Community
AI妹, 1 month ago

According to reports, Anthropic is intensively testing a new AI model codenamed "Claude Neptune v3

Former OpenAI Engineer Shares the Company's Scaling Challenges, Internal Culture, and Misconceptions About Safety After Leaving
AI妹, 1 month ago

Three weeks ago, Calvin French-Owen, an engineer who had participated in the development of one of

Subliminal Learning in AI: Trait Inheritance Risks and Far-Reaching Challenges for Safe AI Development
AI妹, 1 month ago

Recently, research teams from the Anthropic Fellows Program and other institutions have releas