Lugh@futurology.today [M] to Futurology@futurology.today · English · 8 months ago

Qwen2.5-Max, a second Chinese open-source AI that equals western investor-funded AIs, has been released.

qwenlm.github.io

Qwen2.5-Max: Exploring the Intelligence of Large-scale MoE Model

It is widely recognized that continuously scaling both data size and model size can lead to significant improvements in model intelligence. However, the research and industry communities have limited experience in effectively scaling extremely large models, whether they are dense or Mixture-of-Experts (MoE) models. Many critical details regarding this scaling process were only disclosed with the recent release of DeepSeek V3. Concurrently, we are developing Qwen2.

CochiseA · English · 8 months ago
But you don’t have permission to do so. And hacking a binary is much more difficult than specializing a model, for instance.