Tencent Releases Largest Open-Source MoE Model: 389B Parameters, Free for Commercial Use, Outperforms Llama 3.1
Tencent's 389B MoE is out. Free for business. Beats Llama 3.1? I'll believe it when the unit economics work in the field.
Browse AI news across models, agents, media, industry, and compute policy.
Tencent's 389B MoE is out. Free for business. Beats Llama 3.1? I'll believe it when the unit economics work in the field.
I read EMNLP '24 findings on training-free knowledge editing for large models. It promises efficient new data absorption without retraining. This forward-looking AI topic sits outside our Jan 2025–May 2026 timeline.
I read about this Chengdu student's overlooked AI impact. Does the narrative hold up against standard historical records?
I watched three models team up to challenge o1, proving that collaborating with 360+ agents eliminates the need for manual prompt engineering in real-world scenarios.
Tsinghua's new 3D scaling law pushes AI generation boundaries, addressing forward-looking topics beyond current timelines.
I read the claims: 200% efficiency gains and vLLM parity in usability for this domestic framework. The source notes it sits outside the Jan 2025–May 2026 timeline, raising questions about its origins.
TuSimple claims pivoting to AIGC games is vital for survival amid self-driving rumors. This strategic shift raises questions about resource allocation and core competency in a volatile market.
Tencent's GameGen-O generates 'Black Myth'-style videos with one click. I note this lacks reproducibility benchmarks; hype often outpaces actual generative fidelity in early releases.
I read the hype around this viral mobile AI coder. Two-minute app generation sounds like a lab demo, not production-ready code for on-call engineers.
I read this paper on multimodal segmentation from Renmin U, BUPT, and Shanghai AI Lab. It aims to ground AI in physical reality. Field read: Labs love demos; units need economics.