Romanian diacritical characters U+015E versus U+0218 etc.

Se habla deAsk LibraryThing

Únase a LibraryThing para publicar.

Romanian diacritical characters U+015E versus U+0218 etc.

Este tema está marcado actualmente como "inactivo"—el último mensaje es de hace más de 90 días. Puedes reactivarlo escribiendo una respuesta.

1gangleri
Ago 11, 2010, 6:57pm

Hi! I run into a special problem. Romanian Wikipedia is normalizing the diacritical characters.

What happened:
a) http://www.librarything.com/author/murarasud shows as CK Canonical name "Murăraşu, Dumitru" using U+015F %C5%9F
This name was originaly available at http://ro.wikipedia.org/wiki/Dumitru_Mur%C4%83ra%C5%9Fu which now redirects.
b) the link to ro.Wikipedia - rum http://ro.wikipedia.org/w/index.php?curid=303332 now shows to another title "Dumitru Murărașu" using U+0219 %C8%99
http://ro.wikipedia.org/wiki/Dumitru_Mur%C4%83ra%C8%99u

questions and implications at LT:

01: should CK show both variants ?
02: should one variant be prefered?
03: should tags be combined?
03a: old style http://www.librarything.com/tag/dumitru+mur%C4%83ra%C5%9Fu U+015F %C5%9F dumitru murăraşu
03b: new style http://www.librarything.com/tag/dumitru+mur%C4%83ra%C8%99u U+0219 %C8%99 dumitru murărașu
03c: http://www.librarything.com/tag/dumitru+murarasu ( basic Latin characters only)
04a: should we add a standard disambiuation note mentioning the issue related to the different versions for the diacritical characters?
04b: what would be a suitable wording?

Regards Reinhardt

reference links:
i) http://ro.wikipedia.org/w/index.php?curid=767360 containing the technical desctiption about the diacriticals which are subject to normalization of diacritical at ro.wiki
ii) http://ro.wikipedia.org/w/index.php?curid=251201 Wikipedia:Corectarea diacriticelor - page about the normalization project (in Romanian)

Note: LoC and other sources normaly are using also characters from the Unicode Characters in the Combining Diacritical Marks Block. Related to this author character combinations as
a + U+0306 %CC%86
and
s + U+0326 %CC%A6
are used. A link for such an encoding would be:
http://www.librarything.com/tag/dumitru+mura%CC%86ras%CC%A6u

Personaly I do not recomend the use of characters from the Combining Diacritical Marks Block.