IBMPC-charsets.announce 3.6 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475
  1. Summary
  2. =======
  3. This document is primarily for people who will be using Lynx
  4. on a remote UNIX or VMS system via an MS-DOS based terminal program.
  5. General Information
  6. ===================
  7. Lynx comes with built-in translation tables to map the 8-bit character codes or
  8. character entities coming in from an HTML document to their equivalent codes,
  9. where possible, for various character sets.
  10. IMPORTANT: you should choose display character set in Lynx Options Menu
  11. according to your font installed locally. Probably it would be cpXXX. Please
  12. contact lynx-dev mailing list if you want any new codepage not listed there.
  13. Note that all points of the connection between the display at your end and Lynx
  14. at the remote end must be 8-bit clean. If the high bit is being stripped at
  15. any point in between, the only character set you can use (effectively) in Lynx
  16. will be "7 bit approximations". More on that later.
  17. MS-DOS character set weirdness
  18. ==============================
  19. MS-DOS uses a bass-ackwards character set in which half the normal characters
  20. have been replaced by pseudo-graphic line and box-drawing characters, and in
  21. which almost all of the international characters are mapped to nonstandard
  22. numbers. It also contains Greek letters.
  23. Further confusing matters, there is more than one MS-DOS character set. The
  24. character sets are referred to as "codepages," each of which has a unique
  25. number. IBM PCs and compatibles come with one hardware-based default codepage
  26. and a keyboard to match. In the US market the hardware codepage is 437. PCs
  27. destined for other regions of the world often have a different default codepage
  28. which contains characters for other languages and keyboards. Under MS-DOS, one
  29. can load different codepages into memory and use one of them instead of the
  30. hardware default.
  31. If you are using Lynx through an MS-DOS based terminal program or telnet
  32. client, you should use an appropriate DOS codepage in Lynx and you need not any
  33. translation within terminal program (this is different from old-style behavior
  34. and works better because of superior Lynx translation support).
  35. Check your display by accessing Martin Ramsch's ISO-8859-1 table
  36. (iso8859-1.html in the Lynx distribution's test subdirectory).
  37. Ramsch's table describes each entity and shows examples of each. It should be
  38. immediately obvious that you are either seeing what you are supposed to, or
  39. you're not. If you see box and line-drawing characters and mismatched letters
  40. and so on, you are likely displaying 7 bit data, not 8. Ensure that all points
  41. of your connection are 8-bit clean:
  42. On any remote UNIX systems you must pass through, do
  43. 'stty cs8 -istrip' or 'stty pass8'. 'stty -a' should list
  44. your settings.
  45. On any remote VMS systems, do 'set terminal /eightbit'.
  46. Make sure your terminal program or telnet client is not filtering
  47. 8-bit data. You may found the choice between "VT-100 strict"
  48. and "VT-100 relaxed" emulation mode - use relaxed.
  49. Note: Procomm for DOS has a confusing "Use 7 bit or 8 bit
  50. ANSI" setting -- this has to do with ANSI sequences. If set to
  51. 8 bit, some 8-bit character sequences, including those passed
  52. by Lynx as well as those which are for your terminal type
  53. (vt100, etc.) will be processed by Procomm as ANSI screen
  54. control codes and will most likely result in a garbled display.
  55. Set it to 7 bit.
  56. If going through a dialup terminal server, you may have to set the
  57. terminal server itself to pass 8 bit data. How to do this
  58. varies with the make of the server, and in some cases only a
  59. system admin in charge of the box will have the authorization
  60. to do that.
  61. SLIP or PPP connections should already be 8-bit clean.