关闭导航

包含标签" Carnegie Mellon University"的内容

卡内基梅隆等名校研究发现大语言模型过度预训练引发灾难性过训练
AI妹 1 个月前 7 0

Researchers from Carnegie Mellon University, Stanford University, Harvard University, and Princeto

递归模型挑战Transformer主导 新训练干预显著提升其长序列泛化能力
AI妹 1 个月前 8 0

In the field of deep learning, recurrent neural networks (RNNs) and Transformer models each have t