README 15 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370
  1. Unicode versions of the X11 "misc-fixed-*" fonts
  2. ------------------------------------------------
  3. Markus Kuhn <http://www.cl.cam.ac.uk/~mgk25/> -- 2008-04-21
  4. This package contains the X Window System bitmap fonts
  5. -Misc-Fixed-*-*-*--*-*-*-*-C-*-ISO10646-1
  6. These are Unicode (ISO 10646-1) extensions of the classic ISO 8859-1
  7. X11 terminal fonts that are widely used with many X11 applications
  8. such as xterm, emacs, etc.
  9. COVERAGE
  10. --------
  11. None of these fonts covers Unicode completely. Complete coverage
  12. simply would not make much sense here. Unicode 5.1 contains over
  13. 100000 characters, and the large majority of them are
  14. Chinese/Japanese/Korean Han ideographs (~70000) and Korean Hangul
  15. Syllables (~11000) that cannot adequately be displayed in the small
  16. pixel sizes of the fixed fonts. Similarly, Arabic characters are
  17. difficult to fit nicely together with European characters into the
  18. fixed character cells and X11 lacks the ligature substitution
  19. mechanisms required for using Indic scripts.
  20. Therefore these fonts primarily attempt to cover Unicode subsets that
  21. fit together with European scripts. This includes the Latin, Greek,
  22. Cyrillic, Armenian, Georgian, and Hebrew scripts, plus a lot of
  23. linguistic, technical and mathematical symbols. Some of the fixed
  24. fonts now also cover Arabic, Thai, Ethiopian, halfwidth Katakana, and
  25. some other non-European scripts.
  26. We have defined 3 different target character repertoires (ISO 10646-1
  27. subsets) that the various fonts were checked against for minimal
  28. guaranteed coverage:
  29. TARGET1 617 characters
  30. Covers all characters of ISO 8859 part 1-5,7-10,13-16,
  31. CEN MES-1, ISO 6937, Microsoft CP1251/CP1252, DEC VT100
  32. graphics symbols, and the replacement and default
  33. character. It is intended for small bold, italic, and
  34. proportional fonts, for which adding block graphics
  35. characters would make little sense. This repertoire
  36. covers the following ISO 10646-1:2000 collections
  37. completely: 1-3, 8, 12.
  38. TARGET2 886 characters
  39. Adds to TARGET1 the characters of the Adobe/Microsoft
  40. Windows Glyph List 4 (WGL4), plus a selected set of
  41. mathematical characters (covering most of ISO 31-11
  42. high-school level math symbols) and some combining
  43. characters. It is intended to be covered by all normal
  44. "fixed" fonts and covers all European IBM, Microsoft, and
  45. Macintosh character sets. This repertoire covers the
  46. following ISO 10646-1:2000 (including Amd 1:2002)
  47. collections completely: 1-3, 8, 12, 33, 45.
  48. TARGET3 3282 characters
  49. Adds to TARGET2 all characters of all European scripts
  50. (Latin, Greek, Cyrillic, Armenian, Georgian), all
  51. phonetic alphabet symbols, many mathematical symbols
  52. (including all those available in LaTeX), all typographic
  53. punctuation, all box-drawing characters, control code
  54. pictures, graphical shapes and some more that you would
  55. expect in a very comprehensive Unicode 4.0 font for
  56. European users. It is intended for some of the more
  57. useful and more widely used normal "fixed" fonts. This
  58. repertoire is, with two exceptions, a superset of all
  59. graphical characters in CEN MES-3A and covers the
  60. following ISO 10646-1:2000 (including Amd 1:2002)
  61. collections completely: 1-12, 27, 30-31, 32 (only
  62. graphical characters), 33-42, 44-47, 63, 65, 70 (only
  63. graphical characters).
  64. [The two MES-3A characters deliberately omitted are the
  65. angle bracket characters U+2329 and U+232A. ISO and CEN
  66. appears to have included these into collection 40 and
  67. MES-3A by accident, because there they are the only
  68. characters in the Unicode EastAsianWidth "wide" class.]
  69. CURRENT STATUS:
  70. 6x13.bdf 8x13.bdf 9x15.bdf 9x18.bdf 10x20.bdf:
  71. Complete (TARGET3 reached and checked)
  72. 5x7.bdf 5x8.bdf 6x9.bdf 6x10.bdf 6x12.bdf 7x13.bdf 7x14.bdf clR6x12.bdf:
  73. Complete (TARGET2 reached and checked)
  74. 6x13B.bdf 7x13B.bdf 7x14B.bdf 8x13B.bdf 9x15B.bdf 9x18B.bdf:
  75. Complete (TARGET1 reached and checked)
  76. 6x13O.bdf 7x13O.bdf 8x13O.bdf
  77. Complete (TARGET1 minus Hebrew and block graphics)
  78. [None of the above fonts contains any character that has in Unicode
  79. the East Asian Width Property "W" or "F" assigned. This way, the
  80. desired combination of "half-width" and "full-width" glyphs can be
  81. achieved easily. Most font mechanisms display a character that is not
  82. covered in a font by using a glyph from another font that appears
  83. later in a priority list, which can be arranged to be a "full-width"
  84. font.]
  85. The supplement package
  86. http://www.cl.cam.ac.uk/~mgk25/download/ucs-fonts-asian.tar.gz
  87. contains the following additional square fonts with Han characters for
  88. East Asian users:
  89. 12x13ja.bdf:
  90. Covers TARGET2, JIS X 0208, Hangul, and a few more. This font is
  91. primarily intended to provide Japanese full-width Hiragana,
  92. Katakana, and Kanji for applications that take the remaining
  93. ("halfwidth") characters from 6x13.bdf. The Greek lowercase
  94. characters in it are still a bit ugly and will need some work.
  95. 18x18ja.bdf:
  96. Covers all JIS X 0208, JIS X 0212, GB 2312-80, KS X 1001:1992,
  97. ISO 8859-1,2,3,4,5,7,9,10,15, CP437, CP850 and CP1252 characters,
  98. plus a few more, where priority was given to Japanese han style
  99. variants. This font should have everything needed to cover the
  100. full ISO-2022-JP-2 (RFC 1554) repertoire. This font is primarily
  101. intended to provide Japanese full-width Hiragana, Katakana, and
  102. Kanji for applications that take the remaining ("halfwidth")
  103. characters from 9x18.bdf.
  104. 18x18ko.bdf:
  105. Covers the same repertoire as 18x18ja plus full coverage of all
  106. Hangul syllables and priority was given to Hanja glyphs in the
  107. unified CJK area as they are used for writing Korean.
  108. The 9x18 and 6x12 fonts are recommended for use with overstriking
  109. combining characters.
  110. Bug reports, suggestions for improvement, and especially contributed
  111. extensions are very welcome!
  112. INSTALLATION
  113. ------------
  114. You install the fonts under Unix roughly like this (details depending
  115. on your system of course):
  116. System-wide installation (root access required):
  117. cd submission/
  118. make
  119. su
  120. mv -b *.pcf.gz /usr/lib/X11/fonts/misc/
  121. cd /usr/lib/X11/fonts/misc/
  122. mkfontdir
  123. xset fp rehash
  124. Alternative: Installation in your private user directory:
  125. cd submission/
  126. make
  127. mkdir -p ~/local/lib/X11/fonts/
  128. mv *.pcf.gz ~/local/lib/X11/fonts/
  129. cd ~/local/lib/X11/fonts/
  130. mkfontdir
  131. xset +fp ~/local/lib/X11/fonts (put this last line also in ~/.xinitrc)
  132. Now you can have a look at say the 6x13 font with the command
  133. xfd -fn '-misc-fixed-medium-r-semicondensed--13-120-75-75-c-60-iso10646-1'
  134. If you want to have short names for the Unicode fonts, you can also
  135. append the fonts.alias file to that in the directory where you install
  136. the fonts, call "mkfontdir" and "xset fp rehash" again, and then you
  137. can also write
  138. xfd -fn 6x13U
  139. Note: If you use an old version of xfontsel, you might notice that it
  140. treats every font that contains characters >0x00ff as a Japanese JIS
  141. font and therefore selects inappropriate sample characters for display
  142. of ISO 10646-1 fonts. An updated xfontsel version with this bug fixed
  143. comes with XFree86 4.0 / X11R6.8 or newer.
  144. If you use the Exceed X server on Microsoft Windows, then you will
  145. have to convert the BDF files into Microsoft FON files using the
  146. "Compile Fonts" function of Exceed xconfig. See the file exceed.txt
  147. for more information.
  148. There is one significant efficiency problem that X11R6 has with the
  149. sparsely populated ISO10646-1 fonts. X11 transmits and allocates 12
  150. bytes with the XFontStruct data structure for the difference between
  151. the lowest and the highest code value found in a font, no matter
  152. whether the code positions in between are used for characters or not.
  153. Even a tiny font that contains only two glyphs at positions 0x0000 and
  154. 0xfffd causes 12 bytes * 65534 codes = 786 kbytes to be requested and
  155. stored by the client. Since all the ISO10646-1 BDF files provided in
  156. this package contain characters in the U+00xx (ASCII) and U+ffxx
  157. (ligatures, etc.) range, all of them would result in 786 kbyte large
  158. XCharStruct arrays in the per_char array of the corresponding
  159. XFontStruct (even for CharCell fonts!) when loaded by an X client.
  160. Until this problem is fixed by extending the X11 font protocol and
  161. implementation, non-CJK ISO10646-1 fonts that lack the (anyway not
  162. very interesting) characters above U+31FF seem to be the best
  163. compromise. The bdftruncate.pl program in this package can be used to
  164. deactivate any glyphs above a threshold code value in BDF files. This
  165. way, we get relatively memory-economic ISO10646-1 fonts that cause
  166. "only" 150 kbyte large XCharStruct arrays to be allocated. The
  167. deactivated glyphs are still present in the BDF files, but with an
  168. encoding value of -1 that causes them to be ignored.
  169. The ISO10646-1 fonts can not only be used directly by Unicode aware
  170. software, they can also be used to create any 8-bit font. The
  171. ucs2any.pl Perl script converts a ISO10646-1 BDF font into a BDF font
  172. file with some different encoding. For instance the command
  173. perl ucs2any.pl 6x13.bdf MAPPINGS/8859-7.TXT ISO8859-7
  174. will generate the file 6x13-ISO8859-7.bdf according to the 8859-7.TXT
  175. Latin/Greek mapping table, which available from
  176. <ftp://ftp.unicode.org/Public/MAPPINGS/>. [The shell script
  177. ./map_fonts automatically generates a subdirectory derived-fonts/ with
  178. many *.bdf and *.pcf.gz 8-bit versions of all the
  179. -misc-fixed-*-iso10646-1 fonts.]
  180. When you do a "make" in the submission/ subdirectory as suggested in
  181. the installation instructions above, this will generate exactly the
  182. set of fonts that have been submitted to the XFree86 project for
  183. inclusion into XFree86 4.0. These consists of all the ISO10646-1 fonts
  184. processed with "bdftruncate.pl U+3200" plus a selected set of derived
  185. 8-bit fonts generated with ucs2any.pl.
  186. Every font comes with a *.repertoire-utf8 file that lists all the
  187. characters in this font.
  188. CONTRIBUTING
  189. ------------
  190. If you want to help me in extending or improving the fonts, or if you
  191. want to start your own ISO 10646-1 font project, you will have to edit
  192. BDF font files. This is most comfortably done with the gbdfed font
  193. editor (version 1.3 or higher), which is available from
  194. http://crl.nmsu.edu/~mleisher/gbdfed.html
  195. Once you are familiar with gbdfed, you will notice that it is no
  196. problem to design up to 100 nice characters per hour (even more if
  197. only placing accents is involved).
  198. Information about other X11 font tools and Unicode fonts for X11 in
  199. general can be found on
  200. http://www.cl.cam.ac.uk/~mgk25/ucs-fonts.html
  201. The latest version of this package is available from
  202. http://www.cl.cam.ac.uk/~mgk25/download/ucs-fonts.tar.gz
  203. If you want to contribute, then get the very latest version of this
  204. package, check which glyphs are still missing or inappropriate for
  205. your needs, and send me whatever you had the time to add and fix. Just
  206. email me the extended BDF-files back, or even better, send me a patch
  207. file of what you changed. The best way of preparing a patch file is
  208. ./touch_id newfile.bdf
  209. diff -d -u -F STARTCHAR oldfile.bdf newfile.bdf >file.diff
  210. which ensures that the patch file preserves information about which
  211. exact version you worked on and what character each "hunk" changes.
  212. I will try to update this packet on a daily basis. By sending me
  213. extensions to these fonts, you agree that the resulting improved font
  214. files will remain in the public domain for everyone's free use. Always
  215. make sure to load the very latest version of the package immediately
  216. before your start, and send me your results as soon as you are done,
  217. in order to avoid revision overlaps with other contributors.
  218. Please try to be careful with the glyphs you generate:
  219. - Always look first at existing similar characters in order to
  220. preserve a consistent look and feel for the entire font and
  221. within the font family. For block graphics characters and geometric
  222. symbols, take care of correct alignment.
  223. - Read issues.txt, which contains some design hints for certain
  224. characters.
  225. - All characters of CharCell (C) fonts must strictly fit into
  226. the pixel matrix and absolutely no out-of-box ink is allowed.
  227. - The character cells will be displayed directly next to each other,
  228. without any additional pixels in between. Therefore, always make
  229. sure that at least the rightmost pixel column remains white, as
  230. otherwise letters will stick together, except of course for
  231. characters -- like Arabic or block graphics -- that are supposed to
  232. stick together.
  233. - Place accents as low as possible on the Latin characters.
  234. - Try to keep the shape of accents consistent among each other and
  235. with the combining characters in the U+03xx range.
  236. - Use gbdfed only to edit the BDF file directly and do not import
  237. the font that you want to edit from the X server. Use gbdfed 1.3
  238. or higher.
  239. - The glyph names should be the Adobe names for Unicode characters
  240. defined at
  241. http://www.adobe.com/devnet/opentype/archives/glyph.html
  242. which gbdfed can set automatically. To make the Edit/Rename Glyphs/
  243. Adobe Names function work, you have to download the file
  244. http://www.adobe.com/devnet/opentype/archives/glyphlist.txt
  245. and configure its location either in Edit/Preferences/Editing Options/
  246. Adobe Glyph List, or as "adobe_name_file" in "~/.gbdfed".
  247. - Be careful to not change the FONTBOUNDINGBOX box accidentally in
  248. a patch.
  249. You should have a copy of the ISO 10646 standard
  250. ISO/IEC 10646:2003, Information technology -- Universal
  251. Multiple-Octet Coded Character Set (UCS),
  252. International Organization for Standardization, Geneva, 2003.
  253. http://standards.iso.org/ittf/PubliclyAvailableStandards/
  254. and/or the Unicode 5.0 book:
  255. The Unicode Consortium: The Unicode Standard, Version 5.0,
  256. Reading, MA, Addison-Wesley, 2006,
  257. ISBN 9780321480910.
  258. http://www.amazon.com/exec/obidos/ASIN/0321480910/mgk25
  259. All these fonts are from time to time resubmitted to the X.Org
  260. project, XFree86 (they have been in there since XFree86 4.0), and to
  261. other X server developers for inclusion into their normal X11
  262. distributions.
  263. Starting with XFree86 4.0, xterm has included UTF-8 support. This
  264. version is also available from
  265. http://dickey.his.com/xterm/xterm.html
  266. Please make the developer of your favourite software aware of the
  267. UTF-8 definition in RFC 2279 and of the existence of this font
  268. collection. For more information on how to use UTF-8, please check out
  269. http://www.cl.cam.ac.uk/~mgk25/unicode.html
  270. ftp://ftp.ilog.fr/pub/Users/haible/utf8/Unicode-HOWTO.html
  271. where you will also find information on joining the
  272. linux-utf8@nl.linux.org mailing list.
  273. A number of UTF-8 example text files can be found in the examples/
  274. subdirectory or on
  275. http://www.cl.cam.ac.uk/~mgk25/ucs/examples/