wikimedia/mediawiki-extensions-AbuseFilter

mirror of https://gerrit.wikimedia.org/r/mediawiki/extensions/AbuseFilter.git synced 2024-11-28 16:00:28 +00:00

Author	SHA1	Message	Date
jenkins-bot	6196801178	Merge "Log more empty operands"	2019-08-24 20:53:01 +00:00
Daimona Eaytoy	2d031d0bee	Log more empty operands And fix a couple of minor bugs. Bug: T156096 Depends-On: I3b85087677607573f4fa68681735dc35348dcd87 Change-Id: Ia4c713a1d45827f6a8bc5566a8d8835c49f8108a	2019-08-24 19:59:53 +00:00
jenkins-bot	47838715fa	Merge "Allow if without else"	2019-08-20 20:12:19 +00:00
jenkins-bot	5e605aaa62	Merge "Even better handling of DUNDEFINED"	2019-08-20 20:00:52 +00:00
jenkins-bot	bf8ccccade	Merge "Fix a bug in the return value of the CachingParser"	2019-08-20 19:58:38 +00:00
Daimona Eaytoy	af7744781f	Allow if without else Bug: T230727 Depends-On: I8e7f7710b8cb37ada8531b631456a3ce7b27ee45 Change-Id: I3b85087677607573f4fa68681735dc35348dcd87	2019-08-20 19:36:14 +00:00
Daimona Eaytoy	963221ad6d	Even better handling of DUNDEFINED Ensure that the variable isn't set before marking it as DUNDEFINED: that's only for when we cannot use a default, but if the variable is set we already have one. Most notably, this fixes conditionals handling: right now, if you have a conditional with an assignment in both branches, the variable will be undefined. That's obviously wrong, so it's fixed in this patch. Plus: catch only AFPExceptions in a test to avoid unintentionally catching the assert exception; simplify some assignments using wfSetVar. Depends-On: I446a307e5395ea8cc8ec5ca5d5390b074bea2f24 Change-Id: I8e7f7710b8cb37ada8531b631456a3ce7b27ee45	2019-08-20 19:17:30 +00:00
Daimona Eaytoy	fa76405ea7	Fix a bug in the return value of the CachingParser This has always been wrong, and remained unnoticed. Also added a typehint for added safety. Change-Id: I8a3c31e7385283d95b4712d457784016239a0b3b	2019-08-20 20:54:19 +02:00
Daimona Eaytoy	aa867bd370	Better handling of function params in CachingParser This patch includes various fixes to how func arguments are handled in CachingParser: - Add a comment about a future improvement of checkSyntax, which we could limit to try building the AST. - Having enough args for each function is now also checked when building the AST. This allows implementing the previous point without stopping to report notenoughargs at syntaxcheck-time (otherwise it'd be a runtime error). And it also ensure that we check for the params count inside skipped branches, e.g. inside if/else: these were already only discovered at runtime in CachingParser. The old parser is not affected by this change, because when checking syntax it will always execute all branches, and at runtime it will skip braces altogether. - Fix arg count for CachingParser, which previously added a bogus param in case of a function called without parameters. This was fixed for the other parser in I484fe2994292970276150d2e417801453339e540, and I just ported the updated fix. Also note that the CachingParser was already failing for e.g. `count()`, but instead of complaining about missing arguments, it failed hard when trying to pass NULL to evalNode. - Fixed some tests not to use setExpectedException, which caused the previous point to remain unnoticed: calling that method prevents the loop from continuing, and thus only the AbuseFilterParser part was being executed. The new implementation checks the exception ID and is thus more future-proof if the i18n message changes. - Fixed some function names in error reporting for the old parser. - The arg count is now checked outside of the function handlers, thus it's no more necessary to call checkEnoughArguments at the beginning of each handler. This also produces clearer error messages in case of aliases (e.g. set/set_var). - Check the args count even if some of the args are DUNDEFINED. This is much easier now that the check is outside of the handler. This will make syntax check fail for e.g. `contains_any(added_lines)`. Bug: T156095 Change-Id: I446a307e5395ea8cc8ec5ca5d5390b074bea2f24	2019-08-20 15:32:02 +00:00
jenkins-bot	1f45336157	Merge "Move keywords handlers to the Parser"	2019-08-20 14:16:10 +00:00
jenkins-bot	f18d0814e2	Merge "Make several AFPData functions non-static"	2019-08-20 14:06:02 +00:00
Daimona Eaytoy	430ba818d0	Add test for multiple conditions inside conditionals The regression itself was fixed in I980aec3481a52ecc35f1811a366014a5581a7cdb, so this patch only adds a test for it. Also remove a comment about CachingParser failures: we don't want to encourage people to remove it from tests anymore. Bug: T152281 Change-Id: I3ad49050ea49bf45d3226878e091da3c8dbefdb1	2019-08-12 18:18:05 +02:00
Daimona Eaytoy	3f171dc0a5	Move keywords handlers to the Parser Just like we do for functions, it doesn't really make sense to have keywords separately, in AFPData. Change-Id: I208a9b1ce2bd12038e9fbcc515c48d604ec80eb8	2019-08-12 14:29:56 +02:00
Daimona Eaytoy	2fdf091eb9	Make several AFPData functions non-static The keywords-related ones will be handled in a subsequent patch. Change-Id: Ifcfad438023ef136dc6f2cd5529e867df9b23789	2019-08-12 14:12:16 +02:00
Daimona Eaytoy	69ad23da98	Ban variable variables As explained on phab, it's not worth the effort of keeping this feature. Bug: T229947 Change-Id: Ic6067cab8e1ede98545e704888c99e2ed9a004e4	2019-08-11 01:47:35 +00:00
jenkins-bot	8ee442234f	Merge "Move "block-autopromote" key from $wgMainStash to 'db-replicated'"	2019-08-07 23:07:45 +00:00
jenkins-bot	1fa5eef94c	Merge "Overhaul Blockautopromote action"	2019-08-07 23:03:08 +00:00
Aaron Schulz	9e44f1a9e9	Move "block-autopromote" key from $wgMainStash to 'db-replicated' Keep the key mutation methods in the AbuseFilter class Bug: T227376 Change-Id: I03feb05218789a3b73a31c9a94216daafcb7c145	2019-08-07 01:09:13 +00:00
jenkins-bot	5a067f7237	Merge "Add tests for empty operand logging"	2019-08-06 17:38:31 +00:00
Daimona Eaytoy	b91db1d7be	Add tests for empty operand logging Follow-up `5f4491f9aa`. Change-Id: I80ca8c3c75f7de23cf9ab16aa66a240e9981c395	2019-08-06 17:17:27 +00:00
Daimona Eaytoy	2ed6272bb2	Partly handle set and set_var in shortcircuit This is more complicated than the := operator, because the var name could be a complicated expression, and we have to handle a function call. This patch only covers the case where the variable name is a literal, which is enough for WMF production. Bug: T214674 Change-Id: I6c0f8e95663919a0235b5ccf0c88ad0a539315a7	2019-08-06 16:14:34 +02:00
Daimona Eaytoy	2bdb44d58b	Overhaul Blockautopromote action As for all mostly unused consequences, blockautopromote has a couple of major problems: first, it blocked the status for a random time between 3 and 7 days, which to me makes no sense at all (is it some sort of casino?), and this patch fixes it to 5 days. Second, nothing was logged, not the blocking nor the unblocking. Here I'm adding a LogHandler for two new sub-actions of 'rights' to keep track of both action. Bug: T49412 Change-Id: If48a48f5b8baaf9e77c0826466f5d03bb7f691d0	2019-08-05 22:27:49 -04:00
jenkins-bot	19182606c1	Merge "Merge global profiling keys"	2019-08-04 18:40:14 +00:00
rarohde	d022377578	Merge global profiling keys The last step of the profiling overhaul. See T53294 for the original description by Dragons flight. Note: Here I'm adding a FixMe for a problem which already exists in the code and the child patch will fix it. Bug: T53294 Depends-On: I2d8c8f8278073a9420e3eb373fb89a655925618a Change-Id: Ib12e072a245fcad93c6c6bd452041f3441f68bb7	2019-08-04 17:59:58 +00:00
Daimona Eaytoy	517919fca8	Allow accessing offsets of built-in variables I5ec4ab44c4e88aaf18c0d7b73355d27050beeda7 almost fixed this bug, but we also have to make it possible to access builtin variables as arrays. This will only make sense for a few variables (e.g. added_lines and removed_lines), but I don't think we should validate it when checking syntax. Bug: T198531 Change-Id: I417e1b8d4802bbfccd091ce5c7617659cfd1e4ea	2019-08-04 17:14:44 +00:00
jenkins-bot	c0b6267022	Merge "Use milliseconds for time profiling"	2019-08-04 16:12:59 +00:00
jenkins-bot	f7fd6a6daf	Merge "Move per-filter matches profiling to per-filter data"	2019-08-04 16:07:58 +00:00
Daimona Eaytoy	9049be3609	Specialize empty AFPData types As described in T156096#5389655. Change-Id: Ifbf95a6b72a280cd77db6affbd8d642499bbfedc	2019-08-04 15:26:57 +00:00
Daimona Eaytoy	c3db63714e	Use milliseconds for time profiling Instead of seconds, and round the average condition at 1dp instead of 0. Split from child patch by Dragons flight. Depends-On: I2d8c8f8278073a9420e3eb373fb89a655925618a Change-Id: I339aed5f8c1d49714e7927ce49286f9ce6c839f5	2019-08-03 23:24:46 +00:00
Daimona Eaytoy	0b7902fe6e	Move per-filter matches profiling to per-filter data They're currently stored separately, so move matches count together with other per-filter data to keep it consistent. This also removes a parameter from filterMatchesKey, as it's not needed anymore. Split from child patch by Dragons flight. Bug: T53294 Depends-On: I8f47beb73cfc1b63c4b3c809fc6d65a1e66ee334 Change-Id: I2d8c8f8278073a9420e3eb373fb89a655925618a	2019-08-03 23:22:20 +00:00
Daimona Eaytoy	a85e1ccc59	Make AbuseFilterParser::$funcCache non-static Change-Id: I312efe3ce4d1f06e697aa4564aeec1bacbaf97d3	2019-08-03 09:19:49 +00:00
Daimona Eaytoy	09d0254172	Better handling of DNONE This patch includes: * Making it possible to access offsets of a DNONE (returning a DNONE) * Initializing user-defined variables as DNONE inside short-circuited branches * Make DNONE propagate with other operators * Make DNONE count as false for logic operators * Remove a now-outaded bit in doLevelAtom. In case of shortcircuit, $result is now DNONE instead of DNULL, and thus it's possible to access offsets of it. Performance++! * Don't allow modifying or adding an element of a DNONE as if it were an array (to avoid inconsistencies) This re-applies Id85c673337fa90a3782fd22eb9690cd996967111 with several fixes. NOTE: Haven't tested locally, although I'm pretty confident thanks to the amount of tests added. Bug: T214674 Bug: T228677 Change-Id: I5ec4ab44c4e88aaf18c0d7b73355d27050beeda7	2019-08-02 21:05:08 +00:00
jenkins-bot	e3e157361d	Merge "Revert "Initialize user-defined variables during shortcircuit""	2019-07-29 23:30:50 +00:00
Daimona Eaytoy	13cdb86dd2	Revert "Initialize user-defined variables during shortcircuit" Reason for revert: T214674#5374806 This reverts commit `56e6117afd`. Bug: T214674 Change-Id: Iccce248d2693cd9877a740b74e72a577e730435e	2019-07-29 23:06:23 +00:00
Daimona Eaytoy	4720c97530	Add a new class for methods related to running filters Currently we strongly abuse (pardon the pun) the AbuseFilter class: its purpose should be to hold static functions intended as generic utility functions (e.g. to format messages, determine whether a filter is global etc.), but we actually use it for all methods related to running filters. This patch creates a new class, AbuseFilterRunner, containing all such methods, which have been made non-static. This leads to several improvements (also for related methods and the parser), and opens the way to further improve the code. Aside from making the code prettier, less global and easier to test, this patch could also produce a performance improvement, although I don't have tools to measure that. Also note that many public methods have been removed, and almost any of them has been made protected; a couple of them (the ones used from outside) are left for back-compat, and will be removed in the future. Change-Id: I2eab2e50356eeb5224446ee2d0df9c787ae95b80	2019-07-23 19:06:27 +00:00
Daimona Eaytoy	56e6117afd	Initialize user-defined variables during shortcircuit Bug: T214674 Depends-On: I5a14d4b2bc3ffd9caaaa095f16f36b9b6009db05 Change-Id: Id85c673337fa90a3782fd22eb9690cd996967111	2019-07-23 12:20:53 +00:00
Daimona Eaytoy	9937f8b050	Remove extra file from parser tests Added in I5a14d4b2bc3ffd9caaaa095f16f36b9b6009db05, but .r files aren't used anymore since I6c06e596587750c4ebaabafbd277bc75eeb436a5, and I forgot to remove the file upon rebasing. Change-Id: Id688d215b1136bd0a04b8c0d8d8d16de5da1295e	2019-07-15 12:22:09 +02:00
Daimona Eaytoy	18d7d2ed62	Start using AFPData::DNONE This should allow more flexibility when checking syntax, and a saner behaviour overall. Aside from not throwing exception in certain cases, the results should be almost equal to the ones you would get without this patch. However, there are still a few things to improve (which for convenience I wrote inside the parser test) and many to test. Bug: T204654 Depends-On: I69bfec45c76509fb1112641393f78e8d8834adcd Change-Id: I5a14d4b2bc3ffd9caaaa095f16f36b9b6009db05	2019-07-14 08:48:47 +00:00
Daimona Eaytoy	7bc566e635	Fix the regex for numbers, start deprecation of non-decimal numbers Aside from the 14 thingy reported in the task, this syntax is awful! The fix to the regex should only be intended as a temporary stopgap. A proper fix would be to introduce a new syntax, like for instance the one used in PHP. Bug: T212726 Change-Id: Idc37a17ce539e6c63d67fc07d47d812569debe0e	2019-07-10 13:26:36 +00:00
jenkins-bot	6f0905541a	Merge "Make AbuseFilterVariableHolder::mVars private"	2019-07-09 08:42:16 +00:00
jenkins-bot	69bebbb4ff	Merge "Simplify action arrays"	2019-07-08 23:07:26 +00:00
Daimona Eaytoy	304b58d46a	Make AbuseFilterVariableHolder::mVars private This property is meant to be private, since it has all kinds of getters/setters, aside from one which is introduced in this patch. Change-Id: I217b1e22cabd3c0468c84b1d6a69a6ed3c6fa8e6	2019-07-08 16:25:10 +02:00
Daimona Eaytoy	d8d4750e6a	Simplify action arrays The current form is awkward. They're all like [ actionname => [ 'action' => actionname, 'parameters' => params ] ] This is greatly confusing since adds a nesting level, and just duplicates the actionname information (also, we actually never retrieve it from the internal array). Instead, change all of them to be [ actionname => params ] which is a lot shorter and clearer (and easier to handle). A similar case is handled in I8134ecc41fbecdbed99faf406e9e3ca91b6123b9 (see PS 8..10). Change-Id: I34c040dbeb3ab01158fb3db22496def6ccaf72d9	2019-07-05 10:00:48 +02:00
Aaron Schulz	2cf7b58434	Convert wfGetDB() calls to using getConnectionRef() This handles the logic of calling reuseConnection() automatically Change-Id: I9328e709fe5d81099338a31deef24d34db22d784	2019-07-04 15:09:32 -07:00
Daimona Eaytoy	7398730563	Disallow consecutive comparisons As explained on phabricator, they don't work with shortcircuit, so they already fail for all filters using them. Plus IMHO it's an unnecessary deviation from PHP's behaviour, given that this syntax doesn't do what users may expect. Bug: T218906 Change-Id: If9e7545e14044c8dc3b4163bb6fca8ab0683b9fa	2019-07-04 19:15:07 +02:00
Daimona Eaytoy	e86d4bc124	Simplify code for stashedEdits tests Using the new PageEditStash class allows to simplify a bit the integration tests for edit stashing. As I wrote in a ToDo, it may be enough to manually run the hook, but that's left to do as a follow-up. Change-Id: I3389a6961b4f39ecd980be2f429c23f8b7706a15	2019-06-24 11:13:59 +02:00
Daimona Eaytoy	382751a707	Move conditions-related stuff inside AbuseFilterParser Instead of relying on static methods and members in the AbuseFilter class, move everything related to conditions inside the Parser, as the amount of used conditions is something pertaining a single AbuseFilter(Caching)Parser instance. This change requires changing some signatures and adding parameters, but will make introducing the new AbuseFilterRunner class easier (and that will clean signatures, too). Depends-On: I5b29ff556eca45fe59d15e2e3df4d06f1f6b3934 Change-Id: I7c1ea17adf7f42cf9260d416906bfbf3b8a20688	2019-06-19 15:14:17 +00:00
petarpetkovic	c02590f555	Fix "succesful" typo Change-Id: Ibd92f6de8b03098e7bdc8c4fc5e3f6cfaba95bdf	2019-06-14 03:08:41 +03:00
Daimona Eaytoy	e7cd4b2a98	Rewrite AbuseFilter::decodeGlobalName Now it returns an array with a bit more info, and has a different name to reflect the fact that its input is now split in two parts. Plus, make it throw whenever it gets an unexpected input, and add a bunch of test cases for it. Depends-On: Ib5fdeb75c1324f672b4ded39681f006fde34b4d1 Change-Id: Ie550889495232b534c0f9aec31039cf21b2135b1	2019-06-12 23:56:25 +00:00
Thalia	22ceae7e23	Use MediaWiki\Block\DatabaseBlock instead of Block This follows the rename of the Block class in I6d96b63ca0. Change-Id: I44cf9eb68c23a8299316effa4dee7f732486dd84	2019-05-31 16:08:19 +01:00

1 2 3 4 5

207 commits