Thanks to everyone to drawing our attention to this issue. A couple of days ago the ticTOCs service moved to a new server where the data is stored as UTF-8 (which it wasn't before). We'd forgotten to remove the UFT-8 conversion in text.php so we were serving double-encoded content (UTF-8 encoded as UTF-8) until our developer put it right in the middle of the discussion on this list (which started at 5pm our time!) You should find the problem is fixed now. Terry Terry Bucknell Electronic Resources Manager Sydney Jones Library University of Liverpool Chatham St, PO Box 123 Liverpool, L69 3DA, UK Tel: +44 (0)151 794 2692 Fax: +44 (0)151 794 2681 -----Original Message----- From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of Glen Newton Sent: 21 December 2009 17:52 To: [log in to unmask] Subject: [CODE4LIB] Character problems with tictoc [I realise there was a recent related 'Character-sets for dummies'[1] discussion recently] I am using tictocs[2] list of journal RSS feeds, and I am getting gibberish in places for diacritics. Below is an example: in emacs: 221 Acta Ortop dica Brasileira http://www.scielo.br/rss.php?pid=1413-7852&lang=en 1413-7852 in Firefox: 221 Acta Ortop dica Brasileira http://www.scielo.br/rss.php?pid=1413-7852&lang=en 1413-7852 Note that the emacs view is both of a save of the Firefox, and from a direct download using 'wget'. Is this something on my end, or are the tictocs people not serving proper UTF-8? The HTTP header from wget claims UTF-8: > wget -S http://www.tictocs.ac.uk/text.php > --2009-12-21 12:47:59-- http://www.tictocs.ac.uk/text.php > Resolving www.tictocs.ac.uk... 130.88.101.131 > Connecting to www.tictocs.ac.uk|130.88.101.131|:80... connected. > HTTP request sent, awaiting response... > HTTP/1.1 200 OK > Date: Mon, 21 Dec 2009 17:42:05 GMT > Server: Apache/2.2.13 (Unix) mod_ssl/2.2.13 OpenSSL/0.9.8k PHP/5.3.0 DAV/2 > X-Powered-By: PHP/5.3.0 > Content-Type: text/plain; charset=utf-8 > Connection: close > Length: unspecified [text/plain] ><....stuff removed> Can someone validate if they are also experiencing this issue? Thanks, Glen [1]https://listserv.nd.edu/cgi-bin/wa?S2=CODE4LIB&q=&s=character-sets+for+dummies&f=&a=&b= [2]http://www.tictocs.ac.uk/text.php -- Glen Newton | [log in to unmask] Researcher, Information Science, CISTI Research & NRC W3C Advisory Committee Representative http://tinyurl.com/yvchmu tel/t l: 613-990-9163 | facsimile/t l copieur 613-952-8246 Canada Institute for Scientific and Technical Information (CISTI) National Research Council Canada (NRC)| M-55, 1200 Montreal Road http://www.nrc-cnrc.gc.ca/ Institut canadien de l'information scientifique et technique (ICIST) Conseil national de recherches Canada | M-55, 1200 chemin Montr al Ottawa, Ontario K1A 0R6 Government of Canada | Gouvernement du Canada --