wikimedia/mediawiki-extensions-AbuseFilter

mirror of https://gerrit.wikimedia.org/r/mediawiki/extensions/AbuseFilter.git synced 2024-11-25 06:26:03 +00:00

Author	SHA1	Message	Date
Umherirrender	4fca77068c	Clean up line indent with mixed tabs and whitespaces Change-Id: Icc418130ad34e5f169bfc51bb13b58a7806bd636	2022-07-31 16:34:07 +02:00
Umherirrender	da4bc8643a	Use UserIdentity in VariableGenerator::addEditVars Change-Id: If0a65d7a86de776e6499d43949bfb217f20d9b07	2022-07-29 12:55:52 +02:00
jenkins-bot	c3c70f7fa0	Merge "FilterProfiler: use WRStats"	2022-07-06 00:05:15 +00:00
Tim Starling	cdf2f474e8	FilterProfiler: use WRStats A new core facility written for this use case. Bug: T310662 Depends-On: I26b1cdba0a06ad16ad8bb71b455e1b6180924d17 Change-Id: I2b902d034a8c3308c0ba9878b69e873ca8fbda52	2022-07-06 09:35:08 +10:00
Matěj Suchánek	799e1db093	Convert remaining permissions checks to use Authority Change-Id: I5e996cac37bc806db6c3d7ad5c666a606cd79236	2022-07-02 14:49:47 +02:00
DannyS712	139ca18efe	Migrate AbuseFilterPermissionManager to authority Almost all callers already provide an Authority in the form of a User object, so mostly just need to change the typehints Depends-On: I58661943c7e1acb6ff09798ee1a30be0fde3f459 Change-Id: I2ad86859c8194c14d7331f58db62b7cff4698085	2022-07-01 06:58:17 +00:00
jenkins-bot	8d4c5d4d33	Merge "Use LinkTarget in ConsequencesExecutor"	2022-06-29 08:52:37 +00:00
Umherirrender	32a97e8d15	tests: MWTimestamp::setFakeTime is reset by core It is in MediaWikiTestCaseTrait since 438b392 Change-Id: Ib89406fdbad0c9fecada50c8f1ee45e27d17c522	2022-06-28 20:48:31 +00:00
Umherirrender	20fd8f7b07	Use LinkTarget in ConsequencesExecutor The Parameters class already only needs a LinkTarget Change-Id: I4e8e1d7c92f41502a084be3359b97e0d434f08c0	2022-06-28 19:46:50 +02:00
Umherirrender	30fefb75bf	Use UserIdentity in ConsequencesExecutor Change-Id: I281a30610595ed3e984f43aa747eff37abe72939	2022-06-27 22:05:18 +02:00
Daimona Eaytoy	f33bc5868c	Set the 'timestamp' var in addGenericVars This was most definitely my intention when I introduced the concept of "generic vars", so it's a bit surprising to discover, 3.5 years later, that the timestamp isn't computed there. Also make the timestamp always be a string for consistency, since that's the type documented on mw.org. I've manually checked all filters on Wikimedia wikis using the timestamp variable, and added explicit int casts where needed (although I think they'd still work due to implicit casts). Change-Id: Ib6e15225dd95c2eead7e48c200d203d6918e0c18	2022-06-26 14:49:40 +02:00
Umherirrender	3d3c45f348	tests: Mock WikiPage in unit test Bug: T297688 Change-Id: Ic1655141564f02530b1ae6b625a1d3e261a00304	2022-06-24 22:22:24 +02:00
Matěj Suchánek	40564ca635	Remove $info argument from ReversibleConsequence::revert It was a temporary catch-all variable, but we can replace it (and probably won't need it). Change-Id: Ie1a64455c47445050bd83c853b3cafd283d5d020	2022-06-08 11:59:18 +02:00
jenkins-bot	1a6985469b	Merge "Inline/simplify smaller pieces of duplicate/complex PHP code"	2022-06-03 20:38:22 +00:00
Thiemo Kreuz	bbded6231c	Inline/simplify smaller pieces of duplicate/complex PHP code Change-Id: I59d0f17b77c8c3d47bc532bdefd9d8c0883f180b	2022-06-03 21:04:38 +02:00
jenkins-bot	bb94c0914c	Merge "Add support for regex string replacements."	2022-05-31 14:54:33 +00:00
Daimona Eaytoy	a46db47bd5	Fix validation for ip_in_ranges We want to make sure that all parameters are valid regardless of whether there's a match. Also make the minimum number of parameters = 2, so it's easier to switch between this function and ip_in_range. Change-Id: I141558a7ef4533485e315b3d93ea9b64f0959db7	2022-05-21 15:39:21 +02:00
fossifer	b1739a588f	Add ip_in_ranges function Added support for ip_in_ranges which allow multiple ranges to be checked at the same time. If the IP is in any of the ranges, the function returns true. Bug: T305017 Change-Id: Ic75c87ecd4cacf47ce2ff1b04173405230ff81d0	2022-05-11 12:27:16 +08:00
proc	1d1215bafb	Add support for regex string replacements. Bug: T285468 Change-Id: I25f8ad1b58cc10f4c6f6ef5ebab99fe58ec71b1e	2022-04-20 18:38:24 +01:00
Matěj Suchánek	686d7ea88c	Use RestrictionStore instead of deprecated method Also restructure the unit test a bit. Change-Id: If5ce26f1bc4efdb29653aed3fc47335dddc1e44c	2022-03-29 16:11:55 +02:00
Daimona Eaytoy	8ee9a21750	Clean up test files Convert a few integration tests to unit tests now that it's possible, split the AbuseFilterSaveTest file into three different classes. Change-Id: Ia2c0d7ab878b20a89324336a532abdc44f1e6b74	2022-03-20 17:40:49 +00:00
Daimona Eaytoy	2de5fce177	Refactor ConsequencesExecutor to process consequences in more steps Introduce shorter methods, one for each steps, so that it's easier to understand what the code is doing and figure out if the order makes sense. The ConsequencesExecutor test is now a proper unit test. Also simplify AbuseFilterConsequencesTest, removing old/wrong logic and fixing two expected values that were actually wrong (but worked because of the aforementioned wrong logic). The only functional changes should be: - We pick the longest block after checking the ConsequenceDisabler consequences, so e.g. if a filter has a long block + warn and another filter has a shorter block, we still keep the second one if warn will disable the block. - Remove disallow in presence of dangerous actions after checking ConsequenceDisabler's and deduplicating blocks. Otherwise we may remove disallow for filters where block (etc.) doesn't end up being disabled. We may also want to consider not removing disallow at all, now that messages are customizable. Bug: T303059 Change-Id: If00adbf2056758222eaaea70b16d3b4f89502c20	2022-03-19 15:49:36 +00:00
Alexander Vorwerk	4aedfe8d91	Use updated ObjectFactory namespace Change-Id: I99c5e5664d2401c36a9890f148eba7c25e6e8324	2022-03-09 22:17:07 +00:00
jenkins-bot	dad1fff238	Merge "Overhaul throttle identifiers"	2022-03-06 13:50:43 +00:00
Daimona Eaytoy	a0fd0bae01	Overhaul throttle identifiers - Use a /64 range for IPv6 instead of /16. - Fix a curious and serious bug for IPv6, where grouping by range would only use the first (!) number of the IP address, due to the 'v6-' prefix returned by IP::toHex. - Fail hard if the identifier is unknown -- it's not something that's supposed to happen. - Include the type name in each identifier, instead of prefixing all type names to all identifiers. This makes it easier to understand the parts of the key. - Test the whole lot. Bug: T211101 Change-Id: I54c4209f2f0d5a4c5e7b81bed240ca3e28a2ded7	2022-03-06 13:31:06 +00:00
daniel	a512ed31a7	Rename private assertion method assertStatusMessage is being added to MediaWikiTestCaseTrait, rename a method of the same name in FilterValidatorTest to avoid conflicts. Change-Id: I642a3b620ab4d8ad620f7a1253fed98d6796883d NeededBy: Ic01715b9a55444d3df6b5d4097e78cb8ac082b3e	2022-03-05 21:48:18 +00:00
Daimona Eaytoy	167f6cb642	Introduce ActionSpecifier This is a plain value object that represents the action being filtered, replacing associative arrays that were being used up to this point. We should now check whether it's possible to make it not require an accountname (which complicates things), and then use it in related classes as well, e.g. Parameters. Change-Id: I9550c14819b600c97c46b632cc1c2d447972d69c	2022-02-18 11:30:56 +00:00
Daimona Eaytoy	e8471a717c	Add method to properly check visibility of AbuseLog entries This replaces the previous pattern of callers having to use RevisionLookup if the result was 'implicit'. Also, in some cases where we were just hiding things if the visibility was !== true, properly handle the implicit case by using the new method. Make the new method return string constants rather than bool\|string. The new method also fixes some potential info leaks which happened when the row was hidden, the user could view suppressed AbuseLog entries, but the associated revision was also deleted and the user couldn't see it (this shouldn't be relevant for WMF wikis since AF deletion is oversight-level). Also add a bunch of tests for the various cases to ensure we don't regress again. Bug: T261532 Change-Id: I929f865acf5d207b739cb3af043f70cb59243ee0	2021-09-25 00:08:33 +00:00
Daimona Eaytoy	b2dc2c4dd8	Refactor ParserStatus ParserStatus is now more lightweight, and doesn't know about "result" and "from cache". Instead, it has an isValid() method which is merely a shorthand for checking whether getException() is null. Introduce a child class, RuleCheckerStatus, which knows about result and cache and can be (un)serialized. This removes the ambiguity of the $result field, and helps the transition to a new RuleChecker class. Change-Id: I0dac7ab4febbfdabe72596631db630411d967ab5	2021-09-17 11:25:54 +00:00
jenkins-bot	5475cae543	Merge "Rename AbuseFilterVariableGeneratorTest"	2021-09-15 17:10:27 +00:00
Matěj Suchánek	3ffbfb63f2	Rename AbuseFilterVariableGeneratorTest We don't need the AbuseFilter prefix anymore. Change-Id: Ia54016000895fd22dec5f397ab2d42d20bfd1816	2021-09-15 18:17:36 +02:00
Daimona Eaytoy	7c26c4b8d5	More cleanup for parser-related classes Change-Id: I6a2bbf519e1d5c6fe2778f69624bd80b9ea1ef86	2021-09-10 12:50:20 +00:00
Daimona Eaytoy	a722dfe1a4	Rename ParserFactory -> RuleCheckerFactory The old parser now has the correct name "Evaluator", so the ParserFactory name was outdated. Additionally, the plan is to create a new RuleChecker class, acting as a facade for the different parsing-related stages (lexer, parser, evaluator, etc.), which is what most if not all callers should use. The RuleCheckerFactory still returns a FilterEvaluator for now. Also, "Parser" is a specific term defining how things happen internally, whereas "RuleChecker" describes what callers should expect from the new class. Change-Id: I25b47a162d933c1e385175aae715ca38872b1442	2021-09-08 21:59:34 +02:00
Daimona Eaytoy	357ddd498c	Clean up / simplify parser-related classes Remove unnecessary setters, injecting everything in the constructor. These were leftovers from before the introduction of ParserFactory. Remove public access to the conds used, include the information inside the returned ParserStatus instead, and consequently simplify callers. Change-Id: I0a30e044877c6c858af3ff73f819d5ec7c4cc769	2021-09-08 13:41:52 +02:00
Daimona Eaytoy	f8e9ac7e2a	Rename AbuseFilterCachingParser -> FilterEvaluator It's an evaluator, not a parser. Change-Id: Ib6d33e8423ea72709cf5a33f4397ba33e352ea80	2021-09-08 13:40:47 +02:00
Daimona Eaytoy	6684ea6450	Remove AFPTransitionBase Also cleanup the mPos hack in the CachingParser. Change-Id: Ib5693802a3ceb80cb736880ed65e27340abef689	2021-09-06 19:33:48 +00:00
jenkins-bot	199cf1edf8	Merge "Add a static analyzer for the filter language"	2021-09-03 19:51:58 +00:00
Matěj Suchánek	0af21948fc	Replace WikiPage::factory in non-test code Change-Id: I1442ca6603ce5151b98fc88cd84c25af0f34e4f6	2021-09-01 04:55:25 +00:00
Daimona Eaytoy	86257d825c	tests: Use DBConnRef, not IDatabase, as retval of getConnectionRef So that the method can be typehinted in core. Also add phan-var to fix broken master build due to typehint additions in core. Change-Id: I4a072e00ffeeb437753fc3d3c1f15de9929df510	2021-08-31 21:45:10 +02:00
Sorawee Porncharoenwase	320e3d696f	Add a static analyzer for the filter language This commit adds a class AFPSyntaxChecker which can statically analyze a filter code to detect the following errors: - unbound variables (which comes in two modes: conservative and liberal, default to conservative) - unused variables (disabled by default for compatibilty) - assignment on built-in identifiers - function application's arity mismatch - function application's invalid function name - non-string literal in the first argument of set / set_var The existing parser and evaluator are modified as follows: - The new (caching) evaluator no longer needs to perform variable hoisting at runtime. - Note that for array assignment, this changes the semantics. - The new parser is more lenient, reducing parsing errors. The static analyzer will catch these errors instead, allowing us to give a much better error message and reduces the complexity of the parser. * The parser now allows function name to be any identifier. * The parser now allows arity mismatch to occur. * The parser now allows the first argument of set to be any expression. Concretely, obvious changes that users will see are: 1. a := [1]; false & (a[] := 2); a[0] === 1 would evaluate to true, while it used to evaluate to the undefined value due to hoisting 2. f(1) will now error with 'f is not a valid function' as opposed to 'Unexpected "T_BRACE"' 3. length will now error with 'Illegal use of built-in identifier "length"' as opposed to 'Expected a (' Appendix: conservative and liberal mode The conservative mode is completely compatible with the current evaluator. That is, false & (a := 1); a will not deem `a` as unbound, though this is actually undesirable because `a` would then be bound to the troublesome undefined value. The liberal mode rejects the above pattern by deeming `a` as unbound. However, it also rejects true & (a := 1); a even though (a := 1) is always executed. Since there are several filters in Wikimedia projects that rely on this behavior, we default the mode to conservative for now. Note that even the liberal mode doesn't really respect lexical scope appeared in some other programming languages (see also T234690). For instance: (if true then (a := 1) else (a := 2) end); a would be accepted by the liberal checker, even though under lexical scope, `a` would be unbound. However, it is unlikely that lexical scope will be suitable for the filter language, as most filters in Wikimedia projects that have user-defined variable do violate lexical scope. Bug: T260903 Bug: T238709 Bug: T237610 Bug: T234690 Bug: T231536 Change-Id: Ic6d030503e554933f8d220c6f87b680505918ae2	2021-08-31 03:28:24 +02:00
Daimona Eaytoy	704364a5e7	Move parser exceptions to specific namespace and rename them Create a dedicated "Exception" sub-namespace and remove the "AFP" prefix, a leftover from the pre-namespace era. Change-Id: I7e5fded9316d8b7d1628bc1a6ba8b1879ac901e1	2021-08-29 23:38:31 +00:00
libraryupgrader	5377ebe819	build: Updating dependencies composer: * mediawiki/mediawiki-codesniffer: 36.0.0 → 37.0.0 npm: * postcss: 7.0.35 → 7.0.36 * https://npmjs.com/advisories/1693 (CVE-2021-23368) Change-Id: I2b382f3bb236fb44eb24c6a257b13b8fd886541c	2021-07-21 18:51:18 +00:00
daniel	54285fe984	User mock must return Block instance from getBlock. Change-Id: I569e91dd07b8f89af42344b6d6df87560dcb6bbe Needed-By: T271494	2021-06-08 17:12:48 +02:00
Umherirrender	1fa7a83f60	Use static closures where safe to use Created by I25a17fb22b6b669e817317a0f45051ae9c608208 Change-Id: I533690311ca559685de8a4bf123348c9bcfa5931	2021-04-30 20:55:35 +02:00
jenkins-bot	5cd39a51fa	Merge "Remove the old parser"	2021-04-17 15:21:54 +00:00
Daimona Eaytoy	f67c2d5434	Remove deprecated $wgAbuseFilterCustomActionsHandlers Extensions should now specify custom actions using the AbuseFilterCustomActions hook. Change-Id: Id21640d406b18c627eedff39d3f246cf21e042b3	2021-04-11 14:49:50 +00:00
Daimona Eaytoy	f8438a4647	Remove the old parser All methods were moved to the new parser. Tests and other pieces were adjusted to expect just a single parser. There are still some TODOs (remove AFPTransitionBase, remove $this->mCur), but these are left for another commit. Note that the new parser was not renamed: this is because the names are wrong anyway (CachingParser is more of an Evaluator than a Parser, and AFPTreeParser is the real parser, and should be renamed as well). NOTE to reviewers: this patch looks quite big, but if you diff the old parser with the new version of the CachingParser, you'll notice that the diff is actually small, since everything was basically copied verbatim. Bug: T239990 Change-Id: Ie914ef64c70503a201b4d2dec698ca2fa8e69b10	2021-04-09 13:23:07 +00:00
Daimona Eaytoy	3e2153b86b	Update userCanViewRev to use Authority Change-Id: Ia10acf499ce33af03eeea45e34779a00e6628fe1	2021-04-07 13:55:10 +02:00
DannyS712	5d8ac68310	Convert AbuseFilterDBTest to pure unit tests No integration needed, use a mock user. Change-Id: I206d019aec626e6e4c16de10ecf30a29d5ab12e5	2021-04-06 16:28:35 +00:00
daniel	65c5fd6b51	Construct UserIdentityValue without actorId The actorId parameter to the UserIdentityValue constructor has been deprecated. Change-Id: I4a22e761276a9fefa15c7b1554a0d03980d0c663 Needed-By: I9925906d11e47efaec3c1f48d5cb3f9896a982c1	2021-03-26 11:00:56 +01:00

1 2 3 4 5

211 commits