On 7.10.2010 15:17, Thomas Krichel wrote:
> Ere Maijala writes
>
>> # Fix non-UTF-8 characters with two highest bits set (we assume they
>> are actually ISO-8859-1)
>
> What about
>
> use Encode::Guess qw/latin-1/;
> $decoded=decode("Guess", $dodgy_input);
>
> $decoded then should be a utf-8 string with utf8 flag on.
Would that work for a predominantly proper utf-8 input with some
"mistakes" thrown in?
--Ere
|