13.xhtml 5.4 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104
  1. <?xml version="1.0" encoding="utf-8"?>
  2. <!--
  3. h t t :: / / t /
  4. h t t :: // // t //
  5. h ttttt ttttt ppppp sssss // // y y sssss ttttt //
  6. hhhh t t p p s // // y y s t //
  7. h hh t t ppppp sssss // // yyyyy sssss t //
  8. h h t t p s :: / / y .. s t .. /
  9. h h t t p sssss :: / / yyyyy .. sssss t .. /
  10. <https://y.st./>
  11. Copyright © 2016 Alex Yst <mailto:copyright@y.st>
  12. This program is free software: you can redistribute it and/or modify
  13. it under the terms of the GNU General Public License as published by
  14. the Free Software Foundation, either version 3 of the License, or
  15. (at your option) any later version.
  16. This program is distributed in the hope that it will be useful,
  17. but WITHOUT ANY WARRANTY; without even the implied warranty of
  18. MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
  19. GNU General Public License for more details.
  20. You should have received a copy of the GNU General Public License
  21. along with this program. If not, see <https://www.gnu.org./licenses/>.
  22. -->
  23. <!DOCTYPE html>
  24. <html xmlns="http://www.w3.org/1999/xhtml">
  25. <head>
  26. <base href="https://y.st./en/weblog/2016/01-January/13.xhtml" />
  27. <title>A major overhaul &lt;https://y.st./en/weblog/2016/01-January/13.xhtml&gt;</title>
  28. <link rel="icon" type="image/png" href="/link/CC_BY-SA_4.0/y.st./icon.png" />
  29. <link rel="stylesheet" type="text/css" href="/link/basic.css" />
  30. <link rel="stylesheet" type="text/css" href="/link/site-specific.css" />
  31. <script type="text/javascript" src="/script/javascript.js" />
  32. <meta name="viewport" content="width=device-width" />
  33. </head>
  34. <body>
  35. <nav>
  36. <p>
  37. <a href="/en/">Home</a> |
  38. <a href="/en/a/about.xhtml">About</a> |
  39. <a href="/en/a/contact.xhtml">Contact</a> |
  40. <a href="/a/canary.txt">Canary</a> |
  41. <a href="/en/URI_research/"><abbr title="Uniform Resource Identifier">URI</abbr> research</a> |
  42. <a href="/en/opinion/">Opinions</a> |
  43. <a href="/en/coursework/">Coursework</a> |
  44. <a href="/en/law/">Law</a> |
  45. <a href="/en/a/links.xhtml">Links</a> |
  46. <a href="/en/weblog/2016/01-January/13.xhtml.asc">{this page}.asc</a>
  47. </p>
  48. <hr/>
  49. <p>
  50. Weblog index:
  51. <a href="/en/weblog/"><abbr title="American Standard Code for Information Interchange">ASCII</abbr> calendars</a> |
  52. <a href="/en/weblog/index_ol_ascending.xhtml">Ascending list</a> |
  53. <a href="/en/weblog/index_ol_descending.xhtml">Descending list</a>
  54. </p>
  55. <hr/>
  56. <p>
  57. Jump to entry:
  58. <a href="/en/weblog/2015/03-March/07.xhtml">&lt;&lt;First</a>
  59. <a rel="prev" href="/en/weblog/2016/01-January/12.xhtml">&lt;Previous</a>
  60. <a rel="next" href="/en/weblog/2016/01-January/14.xhtml">Next&gt;</a>
  61. <a href="/en/weblog/latest.xhtml">Latest&gt;&gt;</a>
  62. </p>
  63. <hr/>
  64. </nav>
  65. <header>
  66. <h1>A major overhaul</h1>
  67. <p>Day 00312: Wednesday, 2016 January 13</p>
  68. </header>
  69. <p>
  70. I spent today working on a major overhaul of the spider.
  71. It now stores much less data in the database, but as a consequence, it has to crawl depth-first instead of breadth-first.
  72. This makes it much slower to find onion domains, as it will take a while before it can tap into one of the existing onion address lists.
  73. I think that I have worked the major bugs out of the spider though.
  74. Additionally, Gopher support is now available and <abbr title="Uniform Resource Identifier">URI</abbr>s that should not be crawled (due to having a scheme that makes the site unlikely to hold information on other sites) are no longer crawled.
  75. With this release, the newly-named <a href="https://notabug.org/y.st./include.d">include.d</a> is also necessary.
  76. </p>
  77. <p>
  78. I should have more to say today, but I really do not.
  79. </p>
  80. <p>
  81. My <a href="/a/canary.txt">canary</a> still sings the tune of freedom and transparency.
  82. </p>
  83. <hr/>
  84. <p>
  85. Copyright © 2016 Alex Yst;
  86. You may modify and/or redistribute this document under the terms of the <a rel="license" href="/license/gpl-3.0-standalone.xhtml"><abbr title="GNU&apos;s Not Unix">GNU</abbr> <abbr title="General Public License version Three or later">GPLv3+</abbr></a>.
  87. If for some reason you would prefer to modify and/or distribute this document under other free copyleft terms, please ask me via email.
  88. My address is in the source comments near the top of this document.
  89. This license also applies to embedded content such as images.
  90. For more information on that, see <a href="/en/a/licensing.xhtml">licensing</a>.
  91. </p>
  92. <p>
  93. <abbr title="World Wide Web Consortium">W3C</abbr> standards are important.
  94. This document conforms to the <a href="https://validator.w3.org./nu/?doc=https%3A%2F%2Fy.st.%2Fen%2Fweblog%2F2016%2F01-January%2F13.xhtml"><abbr title="Extensible Hypertext Markup Language">XHTML</abbr> 5.1</a> specification and uses style sheets that conform to the <a href="http://jigsaw.w3.org./css-validator/validator?uri=https%3A%2F%2Fy.st.%2Fen%2Fweblog%2F2016%2F01-January%2F13.xhtml"><abbr title="Cascading Style Sheets">CSS</abbr>3</a> specification.
  95. </p>
  96. </body>
  97. </html>