token_parser 1.1.0 changelog

1.1.0 #

TODO:

Added:

anyUntil(pattern) top-level lexeme, matches any character until the pattern is matched
pattern.until(pattern) lexeme extension, matches the pattern until the pattern is matched
pattern.repeat(min, [ max ]) lexeme extension, matches the pattern between min and max times
start, end top-level lexemes, matches the start and end of the input
startLine, endLine top-level lexemes, matches the start and end of the line

Fixed:

Lexeme's Regex string, it was using '$pattern' within itself instead of '${ pattern.regexString }'

Changed:

BREAKING CHANGES:

Token Parser was refactored to be able to throw lexical syntax errors, using LexicalSyntaxError. This change means that tokenization is mandatory to return a token. If there's no match, it will throw an error suggesting where it went wrong. If you with to have optional tokenization, use the grammar.optionalParse() and lexeme.optionalTokenize() methods instead.
Every lexeme type was reworked, so they might have different behavior than before.

Added:

Tokenization error LexicalSyntaxError, is thrown when a token is not matched
CharacterLexeme to match single characters
Grammar debugging, using the grammar class DebugGrammar
Grammar debugging methods, grammar.tokenizing(), called before lexemes start tokenizing
.length property to token, which returns the length of value matched
empty() top-level lexeme extension, same as Lexeme.empty()
spacing top-level lexeme, matches any conventional spacing
pattern.pad() method surrounds the lexeme with another lexeme, optionally
pattern.spaced lexeme extension, same as pattern.pad(spacing)
- operator to exclude patterns, same as pattern.not.character
~ operator to pad lexeme with spacing around it, same as pattern.spaced

Fixed:

Changed:

Reorganized the documentation
Moved the utils directory inside the source directory
Renamed grammar.lemexes to grammar.rules
Moved toString() into regexString getter, toString() now displays similar to displayName

Initial release: Token Parser