mediawiki-extensions-Linter

mirror of https://gerrit.wikimedia.org/r/mediawiki/extensions/Linter synced 2024-12-18 02:40:41 +00:00

Author	SHA1	Message	Date
Arlo Breault	c04b075858	Stop constructing Database with a page id Instead, pass the page id when using methods for a page. The change avoids constructing Database a dummy page id when those methods aren't going to be used. getFromId doesn't seem like it needs a page id, since the linter id is the primary key. Also, a namespace id should no longer optional to setForPage. The LinterWriteNamespaceColumnStage option already gates whether to include it in the row. Follows-Up: I9fd6e7724dcf33be0b1feb19ec8eb448738cab09 Change-Id: Ib3d3622144b670ebe1a4ce04e6db6811584d42c8	2024-04-10 21:07:08 -04:00
Arlo Breault	1c53684200	Construct services with ServiceOptions And addresses some other cleanup from review comments. Follows-Up: I9fd6e7724dcf33be0b1feb19ec8eb448738cab09 Change-Id: If87b0bf91930f0f8d89ed046d18aadb8f346f9aa	2024-04-10 12:34:05 -04:00
C. Scott Ananian	24f771a6a3	[DI] Make CategoryManager and Database injectable services Change-Id: I9fd6e7724dcf33be0b1feb19ec8eb448738cab09	2024-04-09 18:33:13 -04:00
Arlo Breault	8d49b68ba5	Move Database::updateStats to TotalsLookup Database::updateStats moved to Database from RecordLintJob in I2610b9b16d4032b0e18b3537cc9ed51bfdaff299 for reuse in Hooks but seems better placed on TotalsLookup. Change-Id: I600853e5cfc9e8abae9c6b07cee4c2adc37ef464	2024-04-02 17:12:24 -04:00
Arlo Breault	397b36e8e3	Don't include hidden category counts in page info Bug: T337275 Bug: T334527 Change-Id: I6439df894c06fc5592422e72dac04150591f4033	2024-04-02 15:12:19 -04:00
Umherirrender	9555edce6c	build: Upgrade mediawiki/mediawiki-codesniffer to v43.0.0 Change-Id: I0a022343428d4b41d2c3c00a6f48eeb237c76fac	2024-03-11 20:44:04 +01:00
Isabelle Hurbain-Palatin	24155905cd	Migrate Database.php to use QueryBuilders everywhere Bug: T350977 Change-Id: I864c1e4880876b329ba0e8102e6c0f335a149491	2023-12-15 16:04:37 +01:00
sbailey	dd0836d232	Implement multiple namespace selection for Linter filters * Using namespacesmultiselect type in HTMLForm element to provide multiple namespace selection criteria in reports. * New namespace URL encoding implemented which matches namespace parameters against active namespaces to ensure parameter security and validation. * Test system updated to use new URL namespace encoding Bug: T231161 Change-Id: Ic3190cffe259aecdea429c10e35122eabdbe10d4	2023-10-26 10:39:57 -07:00
gerritbot	f6f734ef42	Migrate ILB::getConnectionRef() calls to ILB::getConnection() Deprecated since 1.39 (I6e7544763bd) Bug: T343277 Change-Id: I6921060a38a1956c198f7a1316bed45c0eafa307	2023-08-04 13:03:06 +00:00
gerritbot	e1542b880a	Update moved class WikiMap See T321882. Moved in I60cf4b9ef02b9d5 Bug: T321681 Change-Id: I2d9e34bf9849c3ae05761f83031c806987b5991f	2023-04-25 09:54:08 +00:00
sbailey	bef1042e69	Replace use of select or selectRow with QueryBuilder * All database methods using old access methods converted to use queryBuilder except row counters Bug: T312434 Change-Id: I8796adf63179c4505fbb76ab6fd71cf3775c9b89	2023-03-25 15:13:59 +00:00
sbailey	d12bf639f6	Change linter maintenance scripts to use existing config varaibles * Having separate config variables to enable the maintenance migrateNamespace and migrateTagTemplate scripts is duplicitous and should be shared with the write enable config variables. Bug: T329342 Change-Id: I4cb453fc0678b065cb42a2ca59863da1ab9cdbe4	2023-02-14 09:43:54 -08:00
sbailey	2768a70218	Fix migrate data error when params has excessively long strings * The linter migrate code for linter_tag field and linter_template field are constrained by the database schema to 32 characters for the tag field and 255 characters for the template field. In some anomalous circumstances parsoid can report tag and or template fields in the linter_params object that exceed those character limits. This code truncates these excessively long strings to protect the database migrate update code from a length exceeded error. Bug: T329113 Change-Id: I8af7c44759f172eae77d3519a6eac47110e9b1e7	2023-02-09 18:20:46 +00:00
sbailey	07046457f0	Fix write error when linter_params has excessively long strings * The linter write code for linter_tag field and linter_template field are constrained by the database schema to 32 characters for the tag field and 255 characters for the template field. In some anomalous circumstances parsoid can report tag and or template fields in the linter_params object that exceed those character limits. This code truncates these anomalous strings to protect the database update code from a length exceeded error. Bug: T328979 Change-Id: I057ae2e32a9e1a7735b5300409e5693e8db5c764	2023-02-08 10:40:12 -08:00
sbailey	350d677c5b	Phase 3 of T175177: Migrate linter_params into new fields * The migrate code is designed to perform a one-time update of linter_params JSON encoded template and tag information into the new discrete template and tag text fields for use as additional search criteria. The function can be restarted if it is interrupted. * It now uses configurable batching and sleep times between batches to allow the database to do other work and replication to occur without stressing infrastructure. * The migrate code is only called by test code and needs to be called one-time from a maintenance script. Bug: T175177 Change-Id: Idc4ca88d4762bc7a3bcbc4e66c0f275562083867	2022-12-09 12:01:06 -08:00
sbailey	702ce215d0	Phase 3 migrate code for namespace column add to Linter table * Migrates namespace info from the page tables page_namespace field to the new linter table field linter_namespace. This duplication of the namespace value was requested to greatly reduce the amount of database activity required by the linter search and reporting code. * This patch has been prepared as a dark launch patch enabled with config value LinterMigrateNamespaceStage and assumes that the Linter table has had the linter_namespace column added to it, and recording of the namespace field is already enabled and is populating the namespace column. * The migrate code now runnable from Linter/maintenance directory, using migrateNamespace.php, which will be deployed in a separate patch. The maintenance code creates an appropriate environment to call migrateNamespace( in Database.php. Bug: T299612 Change-Id: I73cb80729d6a5a8716fe93164ad1e42e6958d672	2022-11-28 08:07:54 -08:00
Reedy	89d3f6152b	Minor cleanup Change-Id: I0b8abdbeaece73fe8759ee220b9a3aefce240e68	2022-09-07 02:48:18 +01:00
sbailey	b358b20dca	Second phase of T175177: Adds template and tag to RecordLintJob Bug: T175177 Change-Id: I59be7cabb80ace98da3c7f6f36a0d3d4f6b17d23	2022-08-22 12:47:01 -07:00
Arlo Breault	0f46ed3dbc	Get config from services, not globals If908c4dc99c966cde2981f9a03be38a577406a4e introduced the wgLinterWriteNamespaceColumnStage global but didn't add it to the extension.json Change-Id: I8ee5849f67ddfd894a25425582b59404eb52aef2	2022-08-03 13:01:42 -04:00
Arlo Breault	f6607e8818	Stop using wfGetDB This is pulled out of I59be7cabb80ace98da3c7f6f36a0d3d4f6b17d23 Change-Id: Iab6a47320995e9adb1666cd0bb728f516a2fde69	2022-08-02 14:40:30 -04:00
sbailey	e306f5f681	Add namespace column and new index to Linter table - part 2 * Writes namespaceID to new "linter_namespace" field if global "wgLinterWriteNamespaceColumnStage" is present Bug: T299612 Change-Id: If908c4dc99c966cde2981f9a03be38a577406a4e	2022-07-01 06:51:29 -07:00
Arlo Breault	fc8c39baa5	Fix lint error updating The article id of the title is set to 0 when the page is deleted so, although the lint job from the hook runs, it doesn't remove anything. Reverts most of I06b821b65f65609ddac8ed4e7c662336082d8266 Bug: T298782 Bug: T170313 Change-Id: I2610b9b16d4032b0e18b3537cc9ed51bfdaff299	2022-01-10 16:24:00 -05:00
Kunal Mehta	4f4b700fbd	Fix off-by-one error around MAX_ACCURATE_COUNT Currently we select 20 rows, and return the accurate count if it's less than that, so up to 19 rows. Since we want to return an accurate count if it's 20 rows or less, select one more row, 21, so we can differentiate between only having 20 result rows or hitting the limit. This is the same technique used in MediaWiki's Pager system. Change-Id: I50fa96238eb4c7178414ee92c53799fd69520926	2021-08-06 13:05:29 -07:00
libraryupgrader	577a074b69	build: Updating composer dependencies * mediawiki/mediawiki-codesniffer: 35.0.0 → 36.0.0 * php-parallel-lint/php-parallel-lint: 1.2.0 → 1.3.0 Change-Id: Ib1e2319da19d8c5589d1d41d3c0fe8f882792721	2021-05-05 06:09:03 +00:00
sbailey	201b47e01d	Make Linter category counts more accurate when counts are low * The code now produces an accurate count if the number of errors for a category is below the threshold set by a public constant MAX_ACCURATE_COUNT (currently 20). The database record count limit was originally set to 1, to determine accurately, if there were actually 0 errors in a category as the estimate code would never report 0. If not 0, it would use the estimated count which does not produce an accurate count for any other number of errors. For low error counts this is annoying to editors and unnecessary. The additional CPU/disk activity to accurately check for low error counts is not significantly more than checking for 0 or 1, as checking for 0 likely requires a complete table scan which is probably expensive compared to a low count that early outs when it hits to record limit. * An improvement to consider is recording the accurate count in a separate tiny table, and maintaining an accurate count there which is used in preference to doing the select with row limit based on say a 30 second TTL, to prevent a stampede of requests from doing extraneous database operations. * Added unit test coverage for accurately counting low error conditions that are lower than the threshold and also verify that the estimate is inaccurate beyond the error count threshold. Bug: T194872 Change-Id: I4f74cfe3bf9601baa0dc8fa6464a68030ac2bc4b	2021-04-27 10:38:24 -07:00
Reedy	c647f7c80e	Fix PSR12.Properties.ConstantVisibility.NotFound Bug: T253169 Change-Id: I9bd5f82ed62bb01b80b13507832935ac7a673804	2020-09-19 17:07:37 +00:00
C. Scott Ananian	551a1fb398	Allow Parsoid to provide category ID hints This eases deployment dependencies by allowing Parsoid to supply an appropriate database category ID so that new lint categories can be appropriately stored during the interval between adding a new lint category to Parsoid and deploying an Extension:Linter patch to describe it. Change-Id: Ib7b2342168fa53ca2abac7d5f54fe313be341eb7	2019-12-03 23:26:34 -05:00
Kunal Mehta	db5e5e9003	Use estimateRowCount() instead of actually counting everything On large wikis with lots of lint errors, counting the entire table can be problematic from a performance perspective, sometimes taking minutes. Instead, use Database::estimateRowCount(), which uses EXPLAIN SELECT COUNT(*) to get an approximate value for the number of rows. We do make sure that if the category actually has no rows, that it will return 0. This should be considered a temporary solution, and we should look into doing something like the SiteStats incremental updates table in the long run. Bug: T184280 Change-Id: I2d4dcc615477fd60e41dfed4a3d1a3ad52a9f4af	2018-02-01 13:15:06 -08:00
Kunal Mehta	2c08143ed0	Improve logging for non-existent categories in the database Suggested by Chad in the review for `3a8d3b9e0`. Bug: T179423 Change-Id: I9286ae33bdb3b0b50aa6f1619402caa5486682e3	2017-11-22 23:05:04 -08:00
Kunal Mehta	3a8d3b9e03	Handle non-existent categories in the database better If a newer version of MediaWiki gets rolled back, it's possible for there to be lint entries in the database that don't exist according to CategoryManager. Instead of showing an error to the user, just silently hide those rows. All callers to CategoryManager::getCategoryId() already check the category exists. The callers for CategoryManager::getCategoryName() will catch the MissingCategoryException, and log it if necessary. Notably LinterError::makeLintError() will return false on invalid rows, and all callers have been updated to handle that. Bug: T179423 Change-Id: Ia5f56f18a51fa871511b02410222a6079efbfff6	2017-10-31 10:42:07 -07:00
libraryupgrader	773cc2e8b2	build: Updating mediawiki/mediawiki-codesniffer to 13.0.0 Change-Id: I09c3533b611b98169868f724ac929b20dbe7e43a	2017-09-24 03:51:18 +00:00
Kunal Mehta	eebd04aa00	Add caching to looking up totals The query itself is too expensive to be run on large Wikimedia wikis. So put it behind WAN cache and touch the check keys for each category whenever those have errors added or deleted from them. If this happens to get out of sync, it will get fully refreshed regularly when the totals are sent to statsd. WANObjectCache's 'lockTSE' feature will help avoid cache stampedes that made this query expensive in the past. Change-Id: I3774103a29fa0f29d36283950f136259fa71bffe	2017-05-29 07:33:41 -07:00
Kunal Mehta	5c606fca03	Display count of lint errors on ?action=info Change-Id: Ifcdfcb365e5ff6106b521d58a06df8c006772473	2017-01-20 11:26:44 -08:00
Kunal Mehta	782313088d	Show error counts on Special:LintErrors Change-Id: Ib49f2e391ca4b9d1eef5443c011d54a42921ce4e	2016-12-08 16:52:31 -08:00
Kunal Mehta	8e2d4e42ee	Use INSERT IGNORE when putting new lint errors in the database The most likely scenario of duplicate key errors is that it's the exact same lint error and there's just a race condition when calculating which new errors need to be inserted, so just ignore them. Follows-up `419610bcdb`. Change-Id: I84749ab221bbd517b474be8875bb6a59e4f3258e	2016-12-02 15:54:02 -08:00
Kunal Mehta	14b53d6281	Add integration tests for Database class These tests insert variations of fake lint errors into the database, and then read out of the database to check they round-trip properly. And while we're at it, improve the setForPage() return value. These tests can be run with something like: php tests/phpunit/phpunit.php extensions/Linter/tests/phpunit/ Change-Id: Ifdba8a8a104d218a822f909bc5d7b3512aca499d	2016-11-30 21:17:51 -08:00
Kunal Mehta	419610bcdb	Enforce category/page/position uniqueness constraint in the database Move location to two separate columns in the database: linter_start and linter_end. This allows us to have the database enforce the uniqueness of those fields, instead of just relying upon the PHP code to do so, which could be bypassed since we have multiple servers and concurrent processes. Change-Id: I3e67ce1b7cb3c93866a388ec3248af4cff2a81e0	2016-11-30 18:55:19 -08:00
Kunal Mehta	9550db450d	Fix inserting errors if none exist for that page If no errors existed for the page, inserting new ones would fail with a database error since $errors was key-ed by unique id, which the database wrapper interpreted as field names, causing issues. Using array_values() gets rid of the keys, fixing the issue. Change-Id: I7645de4e5d3ac4462d7980374c8ef8be6280442b	2016-11-30 18:17:36 -08:00
Kunal Mehta	29379edb0b	De-duplicate errors and trim excessive errors in the same category It's possible to have duplicate, identical lint errors if the same exact error is repeated in a template transclusion (e.g. {{1x\|<b/> <b/>}}) since the position via dsr is the same. In this case, just de-duplicate the errors since we can't differentiate them. At the same time, trim excessive errors on the same page in the same category. It's most likely that if a page has that many of the same errors, the editor or bot will just fix all of them at the same time, so we don't need to include all of them in the database. 20 is kind of a low value, but we can always increase it later on as necessary. Change-Id: I9cded720169870d0eea574e1a930ce4e9b190ac0	2016-11-23 19:47:04 -08:00
Kunal Mehta	b41e74ce8b	Hardcode category ids Instead of having a complex auto-increment category manager that had additional caching, just hardcode the (currently 6) ids for each category, which allows us to simplify a lot of code. If Parsoid sends a lint error in a category that we don't know about, it is silently dropped. Bug: T151287 Change-Id: Ice6edf1b7985390aa0c1c410d357bc565bb69108	2016-11-22 18:31:17 -08:00
Kunal Mehta	ec6f4722aa	Store linter_cat names in a separate table linter_cat is now an int ID that points to a row in the new lint_categories table. The mapping between category names and ids is all handled in PHP, and cached in APC. Note that you will need to drop the `linter` table manually and re-run update.php for this to take effect. Change-Id: I369d9b4d8d08289b4a20d1cd29a2e327bad28ef8	2016-11-03 14:51:10 -07:00
Kunal Mehta	bce5b31616	Initial commit This configures a MediaWiki extension to recieve Parsoid's lint errors and expose them to users. Change-Id: Ie0776aecf145eb1c87c2a539ddf3ea8d35a899f5	2016-10-17 16:02:53 -07:00

42 commits