Machine translators have made it easier than ever to create error-plagued Wikipedia articles in obscure languages. What happens when AI models get trained on junk pages?
Is it even getting misused? Spreading knowledge via machine translation where there are no human translators available, had to be better than not translating. As long as there is transparency so people can judge the results ……
And ai training trusting everything it reads is a larger systemic issue, not limited to this niche.
Perhaps part of the solution is machine readable citations. Maybe a search engine or ai could provide better results if it knew what was human generated vs machine generated. But even then you have huge gaps on one side with untrustworthy humans (like comedy) and on the other side with machine generated facts such as from a database
Is it even getting misused? Spreading knowledge via machine translation where there are no human translators available, had to be better than not translating. As long as there is transparency so people can judge the results
Sure there are limitations. The point still stands: an imperfect machine translation is better than no translation, as long as people understand it is.
Can we afford to allow a high bad deprive people of knowledge just because of the language they speak?
The article complains about the affect on languages of poor machine translations, but the affect of no translations is worse. Yes those Greenlanders should be able to read all of Wikipedia without learning English and even if the project has no human translators
Is it even getting misused? Spreading knowledge via machine translation where there are no human translators available, had to be better than not translating. As long as there is transparency so people can judge the results ……
And ai training trusting everything it reads is a larger systemic issue, not limited to this niche.
Perhaps part of the solution is machine readable citations. Maybe a search engine or ai could provide better results if it knew what was human generated vs machine generated. But even then you have huge gaps on one side with untrustworthy humans (like comedy) and on the other side with machine generated facts such as from a database
Assumes the AI is accurate, which is debatable
Also how do you do citations on a translation?
Its an interpretation, not a fact
Sure there are limitations. The point still stands: an imperfect machine translation is better than no translation, as long as people understand it is.
Can we afford to allow a high bad deprive people of knowledge just because of the language they speak?
The article complains about the affect on languages of poor machine translations, but the affect of no translations is worse. Yes those Greenlanders should be able to read all of Wikipedia without learning English and even if the project has no human translators
Wikipedia already has a button where you can go to another language’s version of that page where you can then machine translate it yourself.
I didn’t know that. I guess my “English privilege” is showing
Chauvinism is the term you’re seeking. And we all in the West suffer some degree of it.