Ken Irwin wrote: > Hi all, > > I'm looking for a good source to help me understand character sets and how to use them. I pretty much know nothing about this - the whole world of Unicode, ASCII, octal, UTF-8, etc. is baffling to me. Other people have recommended a whole lot of fabulous resources, so I won't cover ground they already have. If, however, you need to deal with characters which don't qualify for inclusion in Unicode (or which do qualify but which haven't yet been assigned code points). I recommend tei:glyph: http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-glyph.html We use this to represent typographically interesting but short-lived approaches to the representation of Māori in printed works. See for example the 'wh' ligature (which looks like a 'vh' and is pronounced in modern usage like 'f') in the following text: http://www.nzetc.org/tm/scholarly/tei-Auc1911NgaM-t1-body-d4.html for the underlying TEI XML representation see: http://www.nzetc.org/tei-source/Auc1911NgaM.xml cheers stuart -- Stuart Yeates http://www.nzetc.org/ New Zealand Electronic Text Centre http://researcharchive.vuw.ac.nz/ Institutional Repository