sept_2007_bits 7.6 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179
  1. From: whoever sends this
  2. Subject: Bits from the listmasters
  3. Hi everyone,
  4. As you might have already read from several posts on debian-devel or
  5. debian-user, lists.debian.org has a new spam filter setup. This was
  6. done during the listmaster@ + owner@bugs meeting in Essen, Germany.
  7. But to review everything in a single message:
  8. Internals
  9. ---------
  10. * Amavis-Setup
  11. The new spam filter setup of lists.debian.org includes the use of
  12. amavisd-new. We are using a feature called policy-banks, where we
  13. have grouped all 180 mailing lists into the following policy banks
  14. plus a few more administrative ones:
  15. * bug * lang-greek
  16. * en-ht * lang-hu-fi
  17. * en-lt * lang-indic
  18. * lang-arabic * lang-indonesic
  19. * lang-asian * lang-romanic
  20. * lang-esperanto * lang-scandinavic
  21. * lang-french * lang-slavic
  22. * lang-germanic
  23. Each policy bank has its own spam filtering setup. Most of it can be
  24. looked at, as it is checked into svn[1]. To find out to which policy
  25. bank a list belongs, look for the X-Virus-Scanned header in the email.
  26. The advantage of this new setup is that now we can distinguse between
  27. different list types, and can set filters and scorings for each
  28. list type on its own.
  29. Mails to each list can be "ham", "maybe-spam" and "spam". For
  30. borderline messages (maybe-spam) we are currently implementing a
  31. queueing mechanism, which allows us to delay these mails for a while
  32. and on recheck them after a defined time has passed.
  33. Gandalf
  34. ~~~~~~~
  35. Don Amstrong is currently implementing a new greylisting daemon we
  36. want to use on lists.debian.org. You now might ask, why another
  37. greylisting daemon? We were inspired by the sort of postfix-weight is
  38. working, but think it has some design flaws. Also we consider some
  39. reporting mechanism back from spamassassin back into the greylisting
  40. daemon quite helpful. Stay tuned, as we want this feature going live
  41. rather soon.
  42. Considering of lurker as webfrontend
  43. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  44. The listmaster team currently evaluating lurker as official
  45. additional webfrontend for the mailing list archive. A few
  46. show-stoppers have been found and documented in [2]. These have been
  47. forwarded to the lurker upstream who is also Debian developer. We
  48. hope to have these changes implemented rather soon, so we can also
  49. offer lurker as an official web archive.
  50. SVN on Alioth
  51. ~~~~~~~~~~~~~
  52. We have moved some non-confidential configuration files to an SVN
  53. repository on Alioth, including our SpamAssassin and amavisd-new
  54. configuration. It can be viewed here[1]. If you want to help us with
  55. spam filtering, see if you can improve the SpamAssassin rule files.
  56. Send patches to them to listmaster@lists.debian.org.
  57. Team members
  58. ~~~~~~~~~~~~
  59. There has been quite a bit of restructuring of listmaster team
  60. members, since we last sent out an official bits from the
  61. listmasters. New members were added, some old members left the team.
  62. All of them did tremendous good work as listmasters and we want to
  63. thank them. Our thanks goes to Jaakko Niemi, Anand Kumria, Frans
  64. Pop, Robert McQueen and Cesar Mendoza.
  65. Recently we have also added three more team members, Don Armstrong,
  66. David Moreno Garza and Thomas Viehmann, last one doing listarchives
  67. only.
  68. Clean up
  69. ~~~~~~~~
  70. We use smartlist[3] for running the lists. Smartlist consists of a
  71. series of C programs, procmail and shellscripts. The setup was
  72. deployed originally sometime in 1998 (judging from some file
  73. timestamps) and since then it has evolved. Currently we have 180
  74. lists and each of it has more than 30 files that define how it works
  75. (maxsize, moderation, ...). That sums up to more than 6000 files we
  76. have to maintain.
  77. So now we have been cleaning up and linking identical files
  78. together, reducing the differing configuration files to ideally one
  79. file per list.
  80. During the listmaster meeting this progress started, and about 1000
  81. files have been linked together. The process of simplifying and
  82. unifying configurations is still in process.
  83. There are also some spam filter remnants in these configurations,
  84. that are also being moved into the spamassassin-config.
  85. whitelist
  86. ~~~~~~~~~
  87. While it is possible to post with an address which isn't subscribed
  88. to the lists, we recommend that you subscribe to our white-list
  89. (http://lists.debian.org/whitelist/) so that our system recognizes
  90. you. This will reduce the risk of false positives causing your mail
  91. to be dropped.
  92. Cooperation between bugs and lists
  93. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  94. During the meeting in Essen Don Amstrong and Robert Blarson from
  95. bugs.debian.org team were present. This helped quite a lot, as we were
  96. able to ease the configuration of the spamfilters on both sides, so we
  97. are now using mostly the same SpamAssassin config, which should
  98. improve the spam situation on both sides.
  99. How to help listmasters against spam
  100. ------------------------------------
  101. * If you notice a spam in the list archives, press the 'Report As
  102. Spam'-Button.
  103. * If you run some spam-protecting mechanisms like
  104. + greylisting
  105. + tdma (challenge-response system)
  106. + virus scanner
  107. + spamscanner
  108. make sure that it doesn't reject mails from murphy.debian.org (and
  109. master.debian.org), as our bounce detection software is likely to
  110. unsubscribe you. From the figures above you can see that we filter
  111. a lot of spam and malware, but as we run a
  112. posting-is-open-for-everyone policy, there will always new kinds of
  113. junk that will pass our filters.
  114. * Do not ever report spam received through our lists to third parties
  115. services. They are likely to blacklist us or complain to our ISP,
  116. both of which result in degraded performance for yourself and
  117. others. This is also likely to cause tension between us, our
  118. sponsors and their ISPs.
  119. * Report spam that gets to you through our filters to
  120. report-listspam@lists.debian.org. Please leave all the headers
  121. untouched. The best method is to bounce (as in mutt) them. There is a
  122. plugin for thunder^Wicesomething to do that at
  123. http://mailredirect.mozdev.org/ . DO NOT do this automagically. If
  124. you want to help us, you must make personally sure that the things
  125. you report are REALLY spam.
  126. * If you receive lots of spam and know how to stop it through
  127. procmail or spamassassin, send us (listmaster@lists.debian.org) a
  128. note with the recipe, or contact us in OFTC #debian-lists
  129. * If you really want to use some kind of auto-responder, make sure
  130. that it is sane, and interprets the Lists and Precedence-Headers
  131. correctly so it ignores our mails. If we find that your mail
  132. address issues automatic responses to the list or subscribers,
  133. we'll unsubscribe you from all lists.
  134. * Don't subscribe to our lists with a forwarding mail address, if something
  135. goes wrong with the mail address you are forwarding to, it will be harder
  136. for us to find out exactly which address we have to drop. Instead, please
  137. subscribe with the address on which you will be reading the mail. You are
  138. free to send responses with another address, so your receiving
  139. address isn't published.
  140. * Please keep in mind that the Mails to our public lists are
  141. publically archived at lists.debian.org and many other services
  142. on the net. This means that everything in your mail is public,
  143. including your sending mail address.
  144. [1] http://svn.debian.org/wsvn/pkg-listmaster/
  145. [2] http://wiki.debian.org/Teams/ListMaster/LurkerProblems
  146. [3] http://packages.debian.org/smartlist
  147. # vim: tw=72