list-update-200803 6.4 KB

From: whoever sends it
To: debian-devel-announce@lists.debian.org
Subject: Bits from the listmaster team
Mail-Followup-To: debian-project@lists.debian.org

Hi everyone,

the listmaster team is constantly trying to improve the setup of our
listserver. Thus, quite a few things have happened since our last update
in September of last year. Here are some highlights:

lists.debian.org moved to a new hosting location
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

lists.debian.org has been moved to man-da[2] to avoid several problems
with DNSBLs listing our list server. We'd like to thank Brainfood[1]
for hosting lists.debian.org and dealing with the insanity of spam
reporters for so long. After the move of the list service to the new
machine, we also decided to move the list archives to that machine
(which means the list archives are on the same machine as the MX, and
consequently suffer fewer delays).

If you haven't already, please add lists.d.o's new IP address,
82.195.75.100, to your whitelists.

New list archive search engine
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

With the move of lists.debian.org to the new hosting, we took the
opportunity to deploy a new search[3] based on Xapian Omega. The
index comprises some 3.5 million messages, approximately 100k of which
are estimated to be spam. In order to improve your search experience,
we have prepared the indexing software to benefit from our spam removal
plan (see below).

As before, searching by author and list is supported, but the new
search should be more language- and encoding-aware. Work is in
progress to submit our adaptations upstream and to implement
improvements based on our experience and the hints we got from the
friendly people at Xapian.

Config cleanup
~~~~~~~~~~~~~~

The config cleanup is another big project which seems to be turning
into an ongoing task. Since the last update we decided to unify some
global files for all lists, and to move all list-specific config to
extra files. (This follows the layout the inventors of smartlist, our
mailing list software, had in mind.)

We also want to move some information, like moderation status or maximum
mail size per message, to a global file, which is also used by the
list archive and some more informational or statistical tools.

To check whether lists are configured correctly, we subscribed an
address to all 182 mailing lists and checked back a month later for the
ham/spam ratio and other anomalies. We found some wrong spam rules,
which led to false positives, and some 'backdoors' which bypassed some
of our spam rules and led to false negatives. We also found lists which
are supposed to carry only informational mails from an automatic
system. For those lists we could tighten the rules and, at the same
time, drop the usual spam filters, so distribution gets faster and we
need fewer CPU/memory resources to get one mail through.

We also implemented the usual 'Precedence' and 'List-*' headers on all
lists (we had some lists where those were missing) and automatic
responses, so we are now a little more net-friendly with our service.

While reviewing things we found that our bounce handling had some
issues; see the next section for information about that.

Better bounce handling
~~~~~~~~~~~~~~~~~~~~~~

We checked our bounce handling because we have more than 500 bounces
for some lists, and in the process found that we didn't have working
bounce handling for other lists (other-*, deity, *-digest,
debian-private). There were also problems in handling and recognizing
mail addresses containing = or ! characters.

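To illustrate the = and ! problem, here is a minimal Python sketch of
an address matcher that accepts both characters in the local part. It
is not the code of our bounce handler; the name ADDRESS_RE, the helper
find_addresses and the example addresses are made up for this
illustration. The character set is the "atext" set from RFC 5322,
which includes both = and !.

```python
import re

# Characters allowed in an unquoted local part ("atext" in RFC 5322).
# Note that both "=" and "!" are in this set.
ATEXT = r"A-Za-z0-9!#$%&'*+/=?^_`{|}~-"

# Hypothetical pattern for illustration only: dot-separated atoms,
# an "@", then a dotted host name. Quoted local parts and address
# literals are deliberately not handled.
ADDRESS_RE = re.compile(
    rf"[{ATEXT}]+(?:\.[{ATEXT}]+)*@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+"
)


def find_addresses(text: str) -> list[str]:
    """Return every address-like string found in a bounce message body."""
    return ADDRESS_RE.findall(text)


if __name__ == "__main__":
    bounce_text = (
        "Final-Recipient: rfc822; user=example.org@bounces.example.net\n"
        "Original-Recipient: rfc822; host!user@example.org\n"
    )
    for address in find_addresses(bounce_text):
        print(address)
```

A pattern that only allows letters, digits, dots and hyphens before
the @ would cut both example addresses short, which is the kind of
recognition problem described above.
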
Bounces of debian-private subscriptions are still handled manually by
the listmasters, but we now address these issues and forward such
addresses to da-manager@debian.org.

To address the other mailing lists we rewrote some parts of our
bounce handler.

While analysing the bounces streaming in, we found that a lot of bounces
are caused by content filters which reject list mail back to us (which
violates the RFC). Even worse: the majority of those are false
positives.

To let those people know, we'll implement a notification system which
will notify users about bounces and remind forcibly removed users about
their unsubscription.

This is a service for people with a temporarily unavailable or broken
mailbox, so that they can see that they (or their provider) have a
broken mail setup, or resubscribe to all lists once their mail address
is functional again. These notifications will be sent out at most once
a week, up to a month after the last unsubscription happened.

Both notification systems are in testing now and will be activated
shortly after this mail.

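The reminder schedule described above (at most once a week, up to a
month after the last unsubscription) can be sketched in Python as
follows. This is only an illustration, not the notification code we
run: the function name should_notify and both constants are
hypothetical, and "a month" is assumed to mean 30 days.

```python
from datetime import datetime, timedelta
from typing import Optional

# Assumptions for this sketch, not values from our setup:
MIN_INTERVAL = timedelta(weeks=1)   # "at most once a week"
NOTIFY_WINDOW = timedelta(days=30)  # "up to a month", taken as 30 days


def should_notify(
    now: datetime,
    last_unsubscribed: datetime,
    last_notified: Optional[datetime],
) -> bool:
    """Decide whether a reminder may be sent to a removed subscriber."""
    # Stop reminding once the window after the last unsubscription is over.
    if now - last_unsubscribed > NOTIFY_WINDOW:
        return False
    # Never send two reminders less than a week apart.
    if last_notified is not None and now - last_notified < MIN_INTERVAL:
        return False
    return True


if __name__ == "__main__":
    unsubscribed = datetime(2008, 3, 1)
    print(should_notify(datetime(2008, 3, 2), unsubscribed, None))
    print(should_notify(datetime(2008, 3, 5), unsubscribed, datetime(2008, 3, 2)))
```

Keeping the rule in one small function means the two limits can be
adjusted without touching the code that actually sends the mail.
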
List archive spam
~~~~~~~~~~~~~~~~~

As avid followers of debian-project will know, we have implemented
support to weed spam out of the web list archives. While we want to
get rid of as much spam as possible, our paramount objective in this
effort is to preserve the integrity of the archive (e.g. keeping URLs
constant for past messages and avoiding removal of non-spam mail). This
means that the submissions we receive from users clicking on the
spam-report button of the list archive must be verified manually, and
each nomination has to be checked independently by several people.

Some 1000 spam messages have been deleted from the archives of
debian-java, debian-project, debian-python, and debian-vote.

To help out or learn more, please visit our wiki page[4].

How to help
~~~~~~~~~~~

You can help us in a few important areas:

 * Spam rules -- If you notice spam getting through the spam filters,
   and have ideas for improving our filters, we accept patches to our
   rule sets, which are publicly available via svn.[5]

 * Encoding issues -- If you notice encoding problems of messages sent
   after November 2007 in the archive, please contact
   listmaster@lists.debian.org with a link to the problematic message
   and an explanation of the problem.

 * Avoid bouncing spam -- If you don't want your MTA to accept spam,
   please just discard it instead of 550'ing, at least when a message
   comes from liszt.debian.org.

 * Troubleshooting -- If you notice a problem with a message that
   you've sent to a mailing list which hasn't arrived, please provide
   us with as much information as possible, including Date/Time (UTC),
   From, To, Message-Id, delivering IP, and the logfile entries from
   the delivering host.

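For the troubleshooting item, the header fields we ask for can be
pulled out of a saved copy of the message with a few lines of Python.
This is a minimal sketch using only the standard library; the function
name report_fields and the sample message are made up for
illustration. The delivering IP and the logfile entries still have to
come from your mail server.

```python
from datetime import timezone
from email import message_from_string
from email.utils import parsedate_to_datetime


def report_fields(raw_message: str) -> dict:
    """Collect the header fields useful in a listmaster problem report."""
    msg = message_from_string(raw_message)
    date_utc = None
    date_header = msg.get("Date")
    if date_header:
        # Convert the Date header to UTC. A header without a usable
        # timezone is interpreted as local time by astimezone().
        parsed = parsedate_to_datetime(date_header)
        date_utc = parsed.astimezone(timezone.utc).isoformat()
    return {
        "Date (UTC)": date_utc,
        "From": msg.get("From"),
        "To": msg.get("To"),
        "Message-Id": msg.get("Message-Id"),
    }


if __name__ == "__main__":
    sample = (
        "From: Jane Doe <jane@example.org>\n"
        "To: debian-user@lists.debian.org\n"
        "Date: Mon, 03 Mar 2008 12:00:00 +0100\n"
        "Message-Id: <20080303110000.GA1234@example.org>\n"
        "Subject: test\n"
        "\n"
        "body\n"
    )
    for name, value in report_fields(sample).items():
        print(f"{name}: {value}")
```
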
[1] http://www.brainfood.com
[2] http://www.man-da.de
[3] http://lists.debian.org/search.html
[4] http://wiki.debian.org/Teams/ListMaster/ListArchiveSpam
[5] svn://svn.debian.org/svn/pkg-listmaster/trunk/spamassassin_config