Unicode problem.... as always

**Thomas Güttler** · Jul 18 '05, 12:13 AM

Re: Unicode problem.... as always

Todd Jenista wrote:
[color=blue]
> I have a parser I am building with python and, unfortunately, people
> have decided to put unicode characters in the files I am parsing.[/color]

Maybe this helps you. It converts a latin1 byte to unicode
and then converts it to utf8.[color=blue][color=green][color=darkred]
>>> s="ä"
>>> s_u=unicode(s, "latin1")
>>> s_utf8=s_u.enco de("utf8")[/color][/color][/color]

You need to know the encoding of the input (utf8, utf16) .

thomas

Unicode problem.... as always

Unicode problem.... as always

Comment