On 7.10.2010 15:17, Thomas Krichel wrote: > Ere Maijala writes > >> # Fix non-UTF-8 characters with two highest bits set (we assume they >> are actually ISO-8859-1) > > What about > > use Encode::Guess qw/latin-1/; > $decoded=decode("Guess", $dodgy_input); > > $decoded then should be a utf-8 string with utf8 flag on. Would that work for a predominantly proper utf-8 input with some "mistakes" thrown in? --Ere