how does deepseek r1's mixture of experts (moe) architecture enhance its performancedeepseek open source aiGo deepseek r1 vs llama 3