I agree with Godmar: it looks like (some) change happened to tictocs
between my original wget download and the one I downloaded after I
changed my browser settings.
It appears Godmar is not going nuts (or at least this issue is not due
to him going nuts!) ;-)
Viewing the file http://cuvier.cisti.nrc.ca/~gnewton/tictoc.txt
with my newly installed firefox 3.5.6 I see mangled characters:
221 Acta Ortop \u0192 dica Brasileira http://www.scielo.br/rss.php?pid=1413-7852&lang=en 1413-7852
And my browser default encodings is: UTF-8.
So ignore most of my "solution"! :-)
-glen
PS. I am contemplating trademarking "I see mangled characters" !! :-)
On Mon, Dec 21, 2009 at 2:09 PM, Glen Newton <[log in to unmask]> wrote:
>
> The file I got with wget is:
> http://cuvier.cisti.nrc.ca/~gnewton/tictoc.txt
>
(Just to convince myself I'm not going nuts...) - this file, which
Glen downloaded with wget, appears double-encoded:
# curl -s http://cuvier.cisti.nrc.ca/~gnewton/tictoc.txt | od -a -t x1
| head -1082 | tail -4
0020660 - 3 6 8 2 nl 2 2 1 ht A c t a sp O
2d 33 36 38 32 0a 32 32 31 09 41 63 74 61 20 4f
0020700 r t o p C etx B ) d i c a sp B r a
72 74 6f 70 c3 83 c2 a9 64 69 63 61 20 42 72 61
- Godmar
|