  1. <?xml version="1.0" encoding="utf-8"?>
  2. <!--
  3. h t t :: / / t /
  4. h t t :: // // t //
  5. h ttttt ttttt ppppp sssss // // y y sssss ttttt //
  6. hhhh t t p p s // // y y s t //
  7. h hh t t ppppp sssss // // yyyyy sssss t //
  8. h h t t p s :: / / y .. s t .. /
  9. h h t t p sssss :: / / yyyyy .. sssss t .. /
  10. <https://y.st./>
  11. Copyright © 2015 Alex Yst <mailto:copyright@y.st>
  12. This program is free software: you can redistribute it and/or modify
  13. it under the terms of the GNU General Public License as published by
  14. the Free Software Foundation, either version 3 of the License, or
  15. (at your option) any later version.
  16. This program is distributed in the hope that it will be useful,
  17. but WITHOUT ANY WARRANTY; without even the implied warranty of
  18. MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
  19. GNU General Public License for more details.
  20. You should have received a copy of the GNU General Public License
  21. along with this program. If not, see <https://www.gnu.org./licenses/>.
  22. -->
  23. <!DOCTYPE html>
  24. <html xmlns="http://www.w3.org/1999/xhtml">
  25. <head>
  26. <base href="https://y.st./en/weblog/2015/11-November/21.xhtml" />
  27. <title>More word lists for eschalot &lt;https://y.st./en/weblog/2015/11-November/21.xhtml&gt;</title>
  28. <link rel="icon" type="image/png" href="/link/CC_BY-SA_4.0/y.st./icon.png" />
  29. <link rel="stylesheet" type="text/css" href="/link/basic.css" />
  30. <link rel="stylesheet" type="text/css" href="/link/site-specific.css" />
  31. <script type="text/javascript" src="/script/javascript.js" />
  32. <meta name="viewport" content="width=device-width" />
  33. </head>
  34. <body>
  35. <nav>
  36. <p>
  37. <a href="/en/">Home</a> |
  38. <a href="/en/a/about.xhtml">About</a> |
  39. <a href="/en/a/contact.xhtml">Contact</a> |
  40. <a href="/a/canary.txt">Canary</a> |
  41. <a href="/en/URI_research/"><abbr title="Uniform Resource Identifier">URI</abbr> research</a> |
  42. <a href="/en/opinion/">Opinions</a> |
  43. <a href="/en/coursework/">Coursework</a> |
  44. <a href="/en/law/">Law</a> |
  45. <a href="/en/a/links.xhtml">Links</a> |
  46. <a href="/en/weblog/2015/11-November/21.xhtml.asc">{this page}.asc</a>
  47. </p>
  48. <hr/>
  49. <p>
  50. Weblog index:
  51. <a href="/en/weblog/"><abbr title="American Standard Code for Information Interchange">ASCII</abbr> calendars</a> |
  52. <a href="/en/weblog/index_ol_ascending.xhtml">Ascending list</a> |
  53. <a href="/en/weblog/index_ol_descending.xhtml">Descending list</a>
  54. </p>
  55. <hr/>
  56. <p>
  57. Jump to entry:
  58. <a href="/en/weblog/2015/03-March/07.xhtml">&lt;&lt;First</a>
  59. <a rel="prev" href="/en/weblog/2015/11-November/20.xhtml">&lt;Previous</a>
  60. <a rel="next" href="/en/weblog/2015/11-November/22.xhtml">Next&gt;</a>
  61. <a href="/en/weblog/latest.xhtml">Latest&gt;&gt;</a>
  62. </p>
  63. <hr/>
  64. </nav>
  65. <header>
  66. <h1>More word lists for eschalot</h1>
  67. <p>Day 00259: Saturday, 2015 November 21</p>
  68. </header>
  69. <p>
  70. I spent most of today cleaning, just like yesterday.
  71. </p>
  72. <p>
  73. I took a small break because I had a new idea.
  74. I should combine the dictionaries provided by Debian for different languages and create a multilingual dictionary to use with eschalot.
  75. With only a little bit of searching, I found that the relevant packages to install were <a href="apt:miscfiles"><code>miscfiles</code></a>, <a href="apt:wamerican"><code>wamerican</code></a>, <a href="apt:wamerican-huge"><code>wamerican-huge</code></a>, <a href="apt:wamerican-insane"><code>wamerican-insane</code></a>, <a href="apt:wamerican-large"><code>wamerican-large</code></a>, <a href="apt:wamerican-small"><code>wamerican-small</code></a>, <a href="apt:wbrazilian"><code>wbrazilian</code></a>, <a href="apt:wbritish"><code>wbritish</code></a>, <a href="apt:wbritish-huge"><code>wbritish-huge</code></a>, <a href="apt:wbritish-insane"><code>wbritish-insane</code></a>, <a href="apt:wbritish-large"><code>wbritish-large</code></a>, <a href="apt:wbritish-small"><code>wbritish-small</code></a>, <a href="apt:wbulgarian"><code>wbulgarian</code></a>, <a href="apt:wcanadian"><code>wcanadian</code></a>, <a href="apt:wcanadian-huge"><code>wcanadian-huge</code></a>, <a href="apt:wcanadian-insane"><code>wcanadian-insane</code></a>, <a href="apt:wcanadian-large"><code>wcanadian-large</code></a>, <a href="apt:wcanadian-small"><code>wcanadian-small</code></a>, <a href="apt:wcatalan"><code>wcatalan</code></a>, <a href="apt:wdanish"><code>wdanish</code></a>, <a href="apt:wdutch"><code>wdutch</code></a>, <a href="apt:wfaroese"><code>wfaroese</code></a>, <a href="apt:wfrench"><code>wfrench</code></a>, <a href="apt:wgalician-minimos"><code>wgalician-minimos</code></a>, <a href="apt:wgerman-medical"><code>wgerman-medical</code></a>, <a href="apt:witalian"><code>witalian</code></a>, <a href="apt:wngerman"><code>wngerman</code></a>, <a href="apt:wnorwegian"><code>wnorwegian</code></a>, <a href="apt:wogerman"><code>wogerman</code></a>, <a href="apt:wpolish"><code>wpolish</code></a>, <a href="apt:wportuguese"><code>wportuguese</code></a>, <a href="apt:wspanish"><code>wspanish</code></a>, <a href="apt:wswedish"><code>wswedish</code></a>, <a href="apt:wswiss"><code>wswiss</code></a>, and <a href="apt:wukrainian"><code>wukrainian</code></a>.
  76. After installing them, I ran into a bit of an issue, one that even affects the dictionary installed on the system by default.
  77. Namely, the word lists contain several non-<abbr title="American Standard Code for Information Interchange">ASCII</abbr> characters, which cannot be used in onion addresses.
  78. I do not want to simply leave out these words, and internationalization is not possible in onion addresses: internationalized names rely on the <code>xn--</code> prefix, and onion addresses have no dash character.
  79. While trying to make a list of unavailable characters that I would have to account for, I found that, according to <a href="file:///usr/share/dict/dutch"><code>/usr/share/dict/dutch</code></a>, some Dutch words contain digits, spaces, periods, slashes, and/or plus signs.
  80. Furthermore, &quot;06&quot; is supposedly a word, and a few words that seem to relate to carbon dioxide contain a subscript &quot;2&quot;.
  81. I am unsure how reliable this file really is.
  82. <a href="file:///usr/share/dict/ukrainian"><code>/usr/share/dict/ukrainian</code></a> predictably contains no Latin characters, instead consisting of Cyrillic letters.
  83. Using these words would mean not just making small changes here and there, but completely butchering every word.
  84. For that reason, there is a strong chance that I will not use the words in this file at all.
  85. Doing <strong>*any*</strong> sort of transformation of the words on these lists is a project for the distant future, though.
  86. </p>
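<p>
The idea of merging the word lists and setting aside unusable entries could be sketched roughly as follows.
This is only an illustration under stated assumptions, not a script from this weblog: the names <code>fold_word</code> and <code>merge_word_lists</code> are hypothetical, and stripping accents is just one possible policy for keeping non-<abbr title="American Standard Code for Information Interchange">ASCII</abbr> words.
It assumes that a usable word consists solely of the base32 characters that appear in onion addresses (<code>a</code> to <code>z</code> and <code>2</code> to <code>7</code>).
</p>

```python
import string
import unicodedata

# Assumption: only the base32 characters used in onion addresses are allowed,
# that is, the letters a-z and the digits 2-7.
ONION_ALPHABET = frozenset(string.ascii_lowercase + "234567")


def fold_word(word):
    """Lowercase a word and strip combining accents (a hypothetical policy)."""
    decomposed = unicodedata.normalize("NFKD", word.strip().lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def merge_word_lists(lists, fold=False):
    """Merge several iterables of words into one sorted, de-duplicated list.

    Words containing any character outside ONION_ALPHABET are dropped.
    With fold=True, accents are stripped first, so more words survive.
    """
    merged = set()
    for words in lists:
        for word in words:
            candidate = fold_word(word) if fold else word.strip().lower()
            if candidate and ONION_ALPHABET.issuperset(candidate):
                merged.add(candidate)
    return sorted(merged)
```

<p>
Open file objects can be passed in place of the inner lists, since iterating over a file yields one line at a time; the encoding of each file under <code>/usr/share/dict/</code> would need to be checked first.
Even with <code>fold=True</code>, Cyrillic words are still dropped, because stripping accents does not transliterate them.
</p>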
  87. <p>
  88. My <a href="/a/canary.txt">canary</a> still sings the tune of freedom and transparency.
  89. </p>
  90. <hr/>
  91. <p>
  92. Copyright © 2015 Alex Yst;
  93. You may modify and/or redistribute this document under the terms of the <a rel="license" href="/license/gpl-3.0-standalone.xhtml"><abbr title="GNU&apos;s Not Unix">GNU</abbr> <abbr title="General Public License version Three or later">GPLv3+</abbr></a>.
  94. If for some reason you would prefer to modify and/or distribute this document under other free copyleft terms, please ask me via email.
  95. My address is in the source comments near the top of this document.
  96. This license also applies to embedded content such as images.
  97. For more information on that, see <a href="/en/a/licensing.xhtml">licensing</a>.
  98. </p>
  99. <p>
  100. <abbr title="World Wide Web Consortium">W3C</abbr> standards are important.
  101. This document conforms to the <a href="https://validator.w3.org./nu/?doc=https%3A%2F%2Fy.st.%2Fen%2Fweblog%2F2015%2F11-November%2F21.xhtml"><abbr title="Extensible Hypertext Markup Language">XHTML</abbr> 5.1</a> specification and uses style sheets that conform to the <a href="http://jigsaw.w3.org./css-validator/validator?uri=https%3A%2F%2Fy.st.%2Fen%2Fweblog%2F2015%2F11-November%2F21.xhtml"><abbr title="Cascading Style Sheets">CSS</abbr>3</a> specification.
  102. </p>
  103. </body>
  104. </html>