joe_mamba
today at 6:23 PM
>weird training biases that were required by the Chinese government
What is "weird training biases" to us might not be weird to them and vice versa. Just ask the Chinese what they think about LGBTQ+, Japanese, pride parades, Islam and colored minorities.
Every nation has its own biases injected in its domestic LLMs at this point. Otherwise they risk getting in trouble for hate speech/disinformation in the jurisdiction where they operate.
Same how Google Maps cleverly biases the lines of disputed borders based on where you are viewing it from. Or how Google maps switched 'Gulf of Mexico' to 'Gulf of America' in an instant when the orange man signed the paper. Google won't want to anger the US administration the same way how Mistral won't want to anger France and the EU, so Mistral will have all the EU prime directives injected into its LLMs no matter if they're ludicrous or not. The law is the law whether you agree with it or not. Companies want to survive and will pander to whatever the whims the regime they live under are at the current moment regardless of what is right or wrong.
But if I'm using a LLM for personal projects or generating a photorealistic choreographed fight between Tom Cruise and Brad Pitt, I don't care what its political biases are, I care if it solves my problem better and cheaper than the competition, and here the Chinese models could end up winning the consumer market, which is why you see Mistral and other EU alternatives focusing exclusive on B-2-B corporate market.
> What is "weird training biases" to us might not be weird to them and vice versa.
I agree. That's why I think European companies might prefer a European model.
joe_mamba
today at 6:38 PM
Except there's no such thing as the "European model" similar how Europe is not a country.
Mistral is mostly French and tends to have mostly French speaking customers, like BNP PAribas in Belgium. Germany will want its own domestic AI champions, maybe in partnership with Switzerland and Austria, similar to how Denmark already has invested in LLMs focused on the Nordic languages with money from Norway.
The biggest mistake is treating Europe like a single homogenous country/market.
The original question was "Yeah but why use mistral on premises instead of Qwen?". I think you and I agree on the answer.
I for one would love to see more country-specific models. There was a story here the other day about Norway’s National Library developing a LLM specialized in Norwegian: https://news.ycombinator.com/item?id=48270770