[Mediawiki-i18n] Please view and comment CAPTCHA images in 154 languages

praveenp me.praveen at gmail.com
Sun Mar 30 14:11:06 UTC 2014


In this style, many of the Malayalam captchas are too difficult to read; 
in fact, only some of the images are readable 
(image_deb406cc_f8419aa5c2a1d891.png, 
image_7a8d523a_6b8546bbc5dc3608.png and 
image_d3e539c0_f856b4c90b2ceeeb.png are some of the difficult 
ones). The image image_5dbc3fc3_0e1a119f02c122b8.png (ാനിതംബം) 
uses a vowel sign at the beginning of the captcha, which is not common 
and is almost impossible to type with most transliteration keyboards. The 
word നിതംബം, which follows the vowel sign ാ, means 'buttocks'. BTW, the 
word ലിംഗം, which means 'sexual organ', is repeated a couple of times in 
the images.

In the image image_b5d2be0d_7223dc2282b35e15.png, the readable part is 
problematic (the last letter is not recognizable). In the readable part, 
the letters appear as ച െവി, where two different vowel signs are applied 
to the same letter (typing this is probably not possible). This may be a 
rendering error, which is very common in many rendering engines. The 
actual word may be ചെവി. The same kind of problem occurs in 
image_077ebd23_d890a7083e967d92.png, where two vowel symbols appear 
together. It appears that the vowel sign ാ was used independently to 
create these captchas, which should be avoided.
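
As a rough illustration of how such words could be filtered out of a word 
list before the images are generated, here is a minimal Python sketch. It 
is not the captcha generator's actual code: the function names and the 
rule itself (a dependent vowel sign must directly follow a consonant) are 
my own assumptions, and the rule is only a heuristic for Malayalam.

```python
import unicodedata

# Malayalam dependent vowel signs, listed by code point.
# Assumption for this sketch: these are the only signs we need to check.
VOWEL_SIGNS = {chr(cp) for cp in (
    *range(0x0D3E, 0x0D45),  # U+0D3E .. U+0D44
    *range(0x0D46, 0x0D49),  # U+0D46 .. U+0D48
    *range(0x0D4A, 0x0D4D),  # U+0D4A .. U+0D4C
    0x0D57,                  # AU length mark
)}


def is_consonant(ch):
    """True if ch is in the Malayalam consonant range U+0D15 .. U+0D3A."""
    return 0x0D15 <= ord(ch) <= 0x0D3A


def has_stray_vowel_sign(word):
    """Heuristic: flag a vowel sign that does not directly follow a consonant.

    This catches a sign at the start of the word, after a space,
    or stacked on top of another vowel sign.
    """
    # NFC first, so that two-part vowels typed as two code points are
    # combined into one before the check.
    word = unicodedata.normalize("NFC", word)
    prev = ""
    for ch in word:
        if ch in VOWEL_SIGNS and not (prev and is_consonant(prev)):
            return True
        prev = ch
    return False


def clean_word_list(words):
    """Keep only the words that pass the heuristic above."""
    return [w for w in words if not has_stray_vowel_sign(w)]
```

With this check, a word that begins with ാ would be dropped from the list, 
while ചെവി would be kept.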

https://en.wikipedia.org/wiki/Malayalam_alphabet

Praveen

On Sunday 30 March 2014 06:00:53 PM IST, Federico Leva (Nemo) wrote:
> Yes, Wiktionary is not perfect. People unhappy with it are encouraged
> to edit. :) Alternatively, if someone produces suitable word lists in
> 150+ languages from higher quality sources, I'll be happy to use them.
>
> Filip Maljković, 30/03/2014 13:09:
>> Serbian (sr) in some cases uses a combination of Cyrillic and Latin
>> scripts, which is a bit awkward.
>
> Indeed, the situation of Serbian, Croatian and Serbo-Croatian on
> Wiktionary is super-confusing. If it's too hard to get consistent
> output we'll just skip them.
>
>> CAPTCHA should be in only one script
>
> True. This is easy to do in PHP with the ICU libraries but I've yet to
> find a python interface (as with
> <https://github.com/mitsuhiko/babel/issues/89>), anyway we'll add a
> sanitisation of the word lists in a way or another.
>
> Nemo
>
> _______________________________________________
> Mediawiki-i18n mailing list
> Mediawiki-i18n at lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/mediawiki-i18n


