Language Modelling - Search News

2 days ago

Qwen3-Coder-Next offers vibe coders a powerful open source, ultra-sparse model with 10x ...

On SWE-Bench Verified, the model achieved a score of 70.6%. This performance is notably competitive when placed alongside ...

Microsoft

Detecting backdoored language models at scale

Learn how Microsoft research uncovers backdoor risks in language models and introduces a practical scanner to detect tampering and strengthen AI security.

2 days ago

Are Large Language Models A Dead End Or Simply Incomplete?

Once a model is deployed, its internal structure is effectively frozen. Any real learning happens elsewhere: through retraining cycles, fine-tuning jobs or external memory systems layered on top. The ...

InfoQ

Google DeepMind Introduces ATLAS Scaling Laws for Multilingual Language Models

Google DeepMind researchers have introduced ATLAS, a set of scaling laws for multilingual language models that formalize how ...

The Information

ByteDance, Alibaba to Launch New Models in Race for AI Supremacy in China

The battle for AI dominance in China is reaching new heights. Tech giants ByteDance and Alibaba Group are both poised to ...

TechCrunch

Mistral AI makes its first large language model free for everyone

The most popular language models out there may be accessed via API, but open models — as far as that term can be taken seriously — are gaining ground. Mistral, a French AI startup that raised a huge ...

EurekAlert!

A survey on multilingual large language models: corpora, alignment, and bias

Multilingual Large Language Models (MLLMs) have achieved remarkable success in advancing multilingual natural language ...

5 days ago

World Models Like Google’s Project Genie May Enable Future Hyperwar

Google's Project Genie may prove that world models matter more than LLMs for defense. The military that masters physics ...

The Verge

Meta’s powerful AI language model has leaked online — what happens now?

Meta’s LLaMA model was created to help researchers but leaked on 4chan a week after it was announced. Some worry ...
