wikimedia/mediawiki-extensions-VisualEditor

mirror of https://gerrit.wikimedia.org/r/mediawiki/extensions/VisualEditor synced 2024-11-26 23:31:02 +00:00

Author	SHA1	Message	Date
Gabriel Wicke	0b8d1b0387	* Add custom toString methods for tokens to aid debugging * Convert all attributes into strings in Sanitizer * Use strict comparison against empty string in tokenizer * Add very simple sitename parserfunction * 138 tests passing	2012-02-13 17:02:23 +00:00
Gabriel Wicke	025f9cddb3	Prefix all internal data- attributes with data-mw- and adjust the whitelist and test output normalization accordingly. 235 tests passing.	2012-02-13 13:54:07 +00:00
Gabriel Wicke	b1617b1d71	Add some support for ideographic spaces in external links, support the int: namespace alias and perform some normalization on the MediaWiki namespace prefix.	2012-02-13 13:35:46 +00:00
Gabriel Wicke	a122e51eec	Move data-* annotations into separate object on tokens, that is then serialized into a single data-mw-rt attribute if present. Update parserTests to ignore this attribute for comparisons with expected parser output. A few more tweaks and notes are thrown into this commit too. 233 tests are passing now.	2012-02-11 16:43:25 +00:00
Gabriel Wicke	aff30be131	Some comments and reshuffling in the grammar, and a typo in the AttributeExpander.	2012-02-09 22:27:45 +00:00
Gabriel Wicke	6e33255503	Improve support for preprocessor functionality in attributes; Support multi-line xmlish tags with preprocessor stuff in attributes.	2012-02-09 16:36:29 +00:00
Gabriel Wicke	16ded7d955	Fix a bug in wikilink with trail tokenization.	2012-02-09 14:06:35 +00:00
Gabriel Wicke	3f7c1499cd	Enable support for general preprocessor functionality in attribute keys and values. This includes comments, templates and template arguments. This also replaces the specialized expansion logic in the TemplateHandler. The removal of link validation lets one more parser test fail for now. External link target validation will need to be implemented in the token stream handler for links. This is noted as TODO in https://www.mediawiki.org/wiki/Future/Parser_development#Token_stream_transforms.	2012-02-08 15:10:30 +00:00
Gabriel Wicke	1f6db903e9	Pluck a few low-hanging fruit in external link tokenization, and add a simple localurl parser function implementation. 230 parser tests now passing.	2012-02-07 10:28:23 +00:00
Gabriel Wicke	cf8b7bf45d	External links don't nest.	2012-02-07 09:38:28 +00:00
Gabriel Wicke	53bf4f2bd0	Temporarily disable the sanitizer and start to support preprocessor functionality (comments, templates, template arguments) in arbitrary attributes. The grammar for this is still quite rough, will need to consolidate that area.	2012-02-06 19:15:44 +00:00
Gabriel Wicke	0bea9fdfbb	Fix nowiki tokenization regression introduced r110495	2012-02-03 13:10:04 +00:00
Gabriel Wicke	8c75aa1a7a	Remove type attribute for tag tokens.	2012-02-01 18:37:48 +00:00
Gabriel Wicke	a5cc10a06b	Change token format to plain strings for text tokens, and specific objects for other tokens. This is only the first half of the conversion. The next step is to drop the type attribute on most tokens and match on the constructor in the token transform machinery.	2012-02-01 16:30:43 +00:00
Gabriel Wicke	14a8a13678	A few more debug helpers including a --trace mode for light debugging. Some improvements to parser functions on the way to support the cite extensions. Preparation for generic template and template arg in attribute support. 222 parser tests now passing.	2012-01-31 16:50:16 +00:00
Gabriel Wicke	7cd94df47d	A few minor tweaks to reduce memory usage	2012-01-27 13:32:44 +00:00
Gabriel Wicke	4e6a54560a	* Emit token chunks for top-level block elements by patching the source of the tokenizer * Fix a bug uncovered by this * Increase the number of outstanding listeners on a single download to 10000	2012-01-22 23:21:53 +00:00
Gabriel Wicke	785a4af76f	Implement a few parser functions. 220 parser tests now passing.	2012-01-21 20:38:13 +00:00
Gabriel Wicke	1a6546fbca	Support empty template arguments and default values in arg expansion	2012-01-21 03:03:33 +00:00
Gabriel Wicke	fdd048b3b2	Remove a few stray debug prints and disable debugging in parse.js	2012-01-20 22:21:33 +00:00
Gabriel Wicke	145df2655c	* NoInclude and IncludeOnly improvements * Tokenizer support for templates and template args in template arguments and titles * Async attribute expansion fixes	2012-01-20 22:02:23 +00:00
Gabriel Wicke	336be4f617	Eat '[[[' as plain text token, makes it 212 passing.	2012-01-18 00:23:17 +00:00
Gabriel Wicke	178adbc342	Accept IPv6 (and IPv4) addresses in the tokenizer, so another test passes.	2012-01-18 00:00:47 +00:00
Gabriel Wicke	e7381da5b8	Trim whitespace off template titles and argument names. 209 parser tests now passing.	2012-01-17 23:18:33 +00:00
Gabriel Wicke	f50fecf1e3	Fix template argument expansion. 200 parser tests now passing.	2012-01-17 22:29:26 +00:00
Gabriel Wicke	34025251a3	Clean up 'END' token handling a bit.	2012-01-17 20:01:21 +00:00
Gabriel Wicke	6bd7ca1e75	Misc improvements, now 196 parser tests passing. * Add handler for post-expand paragraph wrapping on token stream, to handle things like comments on its own line post-expand * Add general Util module * Fix self-closing tag handling in HTML5 tree builder	2012-01-17 18:22:10 +00:00
Gabriel Wicke	f4081bef08	First template expansion tests start working, and a bug fix in DOMPostProcessor paragraph wrapper. 187 parser tests now passing.	2012-01-14 00:58:20 +00:00
Gabriel Wicke	32c9bccd7c	Results of early template expansion debugging. Still disabled by default, but getting closer.	2012-01-11 19:48:49 +00:00
Gabriel Wicke	6601c544e6	Handle default for template arg expansion, add template fetch functionality and tweak a few minor things in the grammar and QuoteTransformer.	2012-01-06 17:19:14 +00:00
Gabriel Wicke	f0c844f28f	Add template expansion handler skeleton, not yet functional. Also note improvements needed in the tokenizer template handling.	2012-01-06 14:30:55 +00:00
Gabriel Wicke	bd98eb4c5a	Land big TokenTransformDispatcher and eventization refactoring. The TokenTransformDispatcher now actually implements an asynchronous, phased token transformation framework as described in https://www.mediawiki.org/wiki/Future/Parser_development/Token_stream_transformations. Additionally, the parser pipeline is now mostly held together using events. The tokenizer still emits a lame single events with all tokens, as block-level emission failed with scoping issues specific to the PEGJS parser generator. All stages clean up when receiving the end tokens, so that the full pipeline can be used for repeated parsing. The QuoteTransformer is not yet 100% fixed to work with the new interface, and the Cite extension is disabled for now pending adaptation. Bold-italic related tests are failing currently.	2012-01-03 18:44:31 +00:00
Gabriel Wicke	8e00a72d0a	Improvements to link trail handling, and two tweaks to the whitelist. 182 tests now passing. Link trails depend on language-dependent positive character classes in the PHP parser. These classes all seem to disallow punctuation implicitly and list differing plain text characters instead, so it might be possible to get away with identifying a common class of non-trail punctuation instead. This would help to keep the tokenizer independent of configurations, which is very desirable for caching and simplified external parsing.	2011-12-30 12:47:06 +00:00
Gabriel Wicke	11ece76b7b	Fix suffix handling for wiki links.	2011-12-30 09:35:57 +00:00
Gabriel Wicke	33e60dd4d9	Update comments a bit.	2011-12-22 12:37:24 +00:00
Gabriel Wicke	9ee0e660ec	Fix regression introduced by r107060 for regular table cells. Good to have a test suite ;)	2011-12-22 12:09:25 +00:00
Gabriel Wicke	a94d0ec10c	Re-add support for row-only tables.	2011-12-22 11:58:32 +00:00
Gabriel Wicke	1c7fe0eb34	Refactor table productions to support table fragments in templates (table start / row / table end). The old productions are not deleted yet to make it easy to compare the output on more complex articles. 181 tests passing after adding two table tests with whitespace-only differences to the whitelist.	2011-12-22 11:43:55 +00:00
Gabriel Wicke	2845ba9552	Handle noinclude and includeonly at start of line, so that syntax after it still matches as if it actually was preceded by a newline.	2011-12-21 11:38:50 +00:00
Gabriel Wicke	cc06551f2e	Rename table_header production to table_heading. Those non-natives strike again.	2011-12-16 19:24:59 +00:00
Gabriel Wicke	605ed23fd2	Fix attributes in table headings.	2011-12-16 19:22:13 +00:00
Gabriel Wicke	a04744b2ec	Add some more attribute remapping capabilities to the DOMConverter, and clean up some grammar formatting.	2011-12-15 17:33:07 +00:00
Gabriel Wicke	3585bd9c8e	Accept row-only tables. The parser now eats [[en:Barack Obama]] as-is. Hooray!	2011-12-15 00:39:28 +00:00
Gabriel Wicke	6df94a34a1	Less lust for urls	2011-12-15 00:26:22 +00:00
Gabriel Wicke	ce2ee067f7	Minor tweak to wiki link production	2011-12-15 00:12:58 +00:00
Gabriel Wicke	574abd9774	A collection of small bug fixes to the grammar, Cite, the Token format converter and the HTML DOM -> WikiDom converter. The tokenizer now digests all parserTests.	2011-12-14 23:38:46 +00:00
Gabriel Wicke	dc77d73ad5	Add ability to pass through JSON data to WikiDom in data-json-* attributes, and fix parser to actually parse the Barack Obama article except for one table with nested templates at the start-of-line.	2011-12-14 17:25:09 +00:00
Gabriel Wicke	feee9ded9f	Convert the Cite extension to a token stream transformer. This required a few further additions to the TokenTransformDispatcher. In particular, there is now an 'any' token match whose callbacks are executed before more specific callbacks. This is used by the Cite extension to eat all tokens between ref and /ref tags. This need is very common, so should be broken out to an intermediate layer in the future. In general, the requirements for the TokenTransformDispatcher API are now clearer, and the API should likely be cleaned up / simplified.	2011-12-13 14:48:47 +00:00
Gabriel Wicke	a8fa9433c4	Convert quote handling (italic/bold) to a core extension operating on the token stream. This is the first token transformation exercising the TokenTransformer class as its dispatcher. Template expansions, wiki link formatting, tag sanitation and extensions should be able to use the same dispatcher by registering for specific token types. The parser performance is very slightly improved as the token stream is only traversed once.	2011-12-12 20:53:14 +00:00
Gabriel Wicke	d616f07a79	Don't re-build the wiki tokenizer for each test. This speeds up the full parserTests.js run slightly from 7-8 minutes to about 14 seconds ;) A few very minor tweaks to the grammar are also thrown into this commit.	2011-12-12 10:47:42 +00:00
Gabriel Wicke	c2b69e2486	Clean up newline handling. Emit a NEWLINE token for each non-{comment,pre,nowiki} newline.	2011-12-08 14:34:18 +00:00
Gabriel Wicke	abc2254110	A bit of comment clean-up and wrapping of tree building into try/catch block to actually count failures.	2011-12-08 11:40:59 +00:00
Gabriel Wicke	92fdf99384	Further renaming, this time from pegParser to pegTokenizer.	2011-12-08 10:59:44 +00:00

1 2 3

103 commits