940 commits

Author SHA1 Message Date
Brion Vibber bde084c272 Some polishing and refactoring on this monstrosity, it's been allowed to grow without some good snipping in a while. :)
* Handle bad regexes more gracefully:
 - The batched regexes are tested for validity, and if one is bad, the lines from that source are broken out line-by-line. This is slower, but the other lines in that source will still be applied correctly.
 - Suppress warnings and be more verbose in the debug log.
 - Check for bad regexes when a local blacklist page is edited, and prompt the user to fix the bad lines.
* Caching issues:
 - Cache the full regexes per-DB instead of per-site; this should be friendlier to shared environments where not every wiki has the same configuration.
 - Hopefully improve the recaching of local pages, which looked like it would preemptively apply the being-edited text to the cache during the filter callback, even though something else might stop the page from being saved. Now just clearing the cache after save is complete, letting it re-load later.
* Split out some of the regex batch functions for clarity.

There are probably still issues with caching of HTTP bits, and in general the local DB loading looks verrrry fragile.
Test this a bit more before syncing. :)
2007-07-20 21:13:26 +00:00
Ivan Lanin 57f84def12 Bug #10623: id translations for Spam-Blacklist extension (patch by Borgx) 2007-07-18 02:15:04 +00:00
Brion Vibber 72ca079b97 Add a local blacklist at MediaWiki:Spam-blacklist which can always be used, just as the local whitelist at MediaWiki:Spam-whitelist.
Should save some trouble for annoyed people. :)
The regular message cache behavior is used for this message, so it'll also update immediately, without waiting for the shared caches to time out.
Additionally, added a fix for configurations which don't hardcode the PHP include_path by using $IP in an include for HttpFunctions.php.
2007-07-07 17:21:49 +00:00
Raimond Spekking 48689b2bca Make the long existing message 'spam-whitelist' available to [[Special:Allmessages]]. 2007-07-04 11:28:52 +00:00
Aryeh Gregor 740736ecd9 Extensions too! 2007-06-29 01:36:09 +00:00
Raimond Spekking 1676db1b55 Update/Add extension credits URLs 2007-06-26 10:38:04 +00:00
Brion Vibber 20b1b3c108 add a compatibility note 2007-03-05 19:56:56 +00:00
Brion Vibber 8285ddcd0f * (bug 8375) Reduce spamblacklist's regex size quite a bit; the actual limit seems very hard to predict and may vary based on version, os, architecture, or phase of the moon. Now breaking at 4096 bytes rather than the previous 20000; this makes 12 regexes for the current Wikimedia set. 2007-01-13 05:24:09 +00:00
Antoine Musso 6aa7b3badc Add descriptions for some extensions used on wikipedia 2007-01-07 13:57:35 +00:00
Antoine Musso 55fbcdc9a6 remove some ending whitespaces 2007-01-06 20:56:46 +00:00
Brion Vibber 9bb2bc11fa Split giant regexes so PCRE stops screaming about them.
Haven't tested cleanup.php
2006-09-18 09:56:57 +00:00
Antoine Musso c475409997 credits + url 2006-06-23 22:22:09 +00:00
Brion Vibber 18bd5bf9ef Apply pre-save transform for more thorough checks 2006-06-22 21:12:18 +00:00
Brion Vibber 5eb474a2f7 Run text through the parser and get the actual links recorded instead of trying to second-guess behavior 2006-06-22 20:35:49 +00:00
Brion Vibber 9036c0242b Add a local whitelist, editable by admins at [[MediaWiki:Spam-whitelist]] 2006-06-22 19:59:43 +00:00
Antoine Musso c92ee8cc03 allow '-' in database name 2006-05-21 11:04:56 +00:00
Rob Church 641a3f7bee (reopened bug 5185) Match on two or more slashes on the protocol to prevent another blacklist workaround 2006-04-28 23:18:47 +00:00
Rob Church 992a1ac684 (bug 5185) Strip out SGML comments before scanning the text for matches so some nutter can't circumvent the lot with a well placed <!-- --> 2006-04-12 04:59:27 +00:00
Brion Vibber 3f5145d746 Live fix: improved reporting 2006-04-02 03:50:06 +00:00
Tim Starling f3219927ae Updated DB: for the 1.5 schema, fixed a few bugs 2006-01-23 01:35:39 +00:00
Tim Starling 233eeb2262 some tweaks 2006-01-21 23:27:39 +00:00
Tim Starling 9c1a3d9cb9 Script to quickly and aggressively clean spam off a wiki. Well, reasonably quickly, it needs to do a linear search of the entire text which might take a while. But there is no human interaction required. 2006-01-19 17:17:03 +00:00
Tim Starling 05a1bf5f1f split the regex fetching part of the filter into its own function 2006-01-19 17:14:10 +00:00
Tim Starling 25eaa74056 fixed blank line at end of file 2006-01-19 07:24:25 +00:00
Brion Vibber 0e53200a91 * (bug 3934) Check _ in hostname prefixes; it's illegal but seems to be accepted by browsers 2005-11-16 09:56:13 +00:00
Tim Starling 76e139b595 bug #2598: only one blacklist file is parsed by SpamBlacklist extension 2005-11-01 07:25:33 +00:00
Tim Starling 6b7d1dc8c3 load_lists is no longer required 2005-10-31 05:31:57 +00:00
Ævar Arnfjörð Bjarmason ff1e275a8c * Support for $wgExtensionCredits 2005-08-26 14:33:40 +00:00
Tim Starling f9b43ca259 fixed empty regex check 2005-07-13 22:41:14 +00:00
Tim Starling cd68beb218 More configuration settings, fixed URL 2005-07-08 16:29:22 +00:00
Tim Starling 0c5c457080 forgot to commit this 2005-07-02 09:03:53 +00:00
Tim Starling 31e30af1b2 Support for HTTP, including working default, to load text from meta once per hour. Special attention paid to reducing load on meta, of course. 2005-06-25 15:49:21 +00:00
Tim Starling 3aaededb3b and title 2005-03-09 14:20:35 +00:00
Tim Starling 588955efbd fixed DB name and table name 2005-03-09 14:17:22 +00:00
River Tarnell e6d30146eb and here 2005-02-20 09:09:29 +00:00
River Tarnell 7d68ef6e01 preg_match keys start at 1, not 0 2005-02-20 09:03:13 +00:00
River Tarnell 8d531d0726 discard backslashes prior to slash 2005-02-20 08:02:18 +00:00
Tim Starling f6b886cd80 Detailed documentation 2005-01-20 07:04:19 +00:00
Tim Starling 6d61a06c88 typo 2004-12-11 11:12:00 +00:00
Tim Starling 7b9d0425d5 from phase3/extensions 2004-12-11 09:59:06 +00:00