text_analysis 0.11.0 changelog

0.11.0 #

BREAKING CHANGES

This version sees numerous breaking changes, including the re-naming of the primary interfaces of the library.

Breaking Changes:

Renamed TextAnalyzer interface to TextTokenizer.
Renamed TextAnalyzerConfiguration interface to TextAnalyzer.
Added SentenceSplitter get sentenceSplitter to TextAnalyzer interface.
Added ParagraphSplitter get paragraphSplitter to TextAnalyzer interface.
Added SyllableCounter get syllableCounter to TextAnalyzer interface.
Added List<String> paragraphs(SourceText source) to ITextTokenizer interface.
Moved class TextTokenizer to a private implementation class _TextTokenizerImpl and renamed ITextTokenizer interface to TextTokenizer.

New:

Added mixin class TextTokenizerMixin.
Added object model TextDocument.
Added typedef SyllableCounter.
Added unnamed factory constructor to TextTokenizer that initializes a _TextTokenizerImpl.
Added SentenceSplitter get sentenceSplitter to English class.
Added ParagraphSplitter get paragraphSplitter to English class.
Added SyllableCounter get syllableCounter to English class.
Added TextDocument interface.
Added TextDocumentMixin mixin class.
Added TextDocument unnamed factory with private implementation class.
Added TextDocument.analyze factory constructor.
Added TextDocument.analyzeJson factory constructor.
Added extension on String double lengthDistance(Term other).
Added extension on String double lengthSimilarity(Term other).
Added extension on String Map<Term, double> lengthSimilarityMap(Iterable<Term> terms).

Updated dependencies, tests, examples and documentation. Re-organized code repository

0.10.0 #

BREAKING CHANGES

This version sees numerous breaking changes, including the re-naming of the primary interfaces of the library.

Breaking Changes:

Renamed TextAnalyzer interface to TextTokenizer.
Renamed TextAnalyzerConfiguration interface to TextAnalyzer.
Added SentenceSplitter get sentenceSplitter to TextAnalyzer interface.
Added ParagraphSplitter get paragraphSplitter to TextAnalyzer interface.
Added SyllableCounter get syllableCounter to TextAnalyzer interface.
Added List<String> paragraphs(SourceText source) to ITextTokenizer interface.
Moved class TextTokenizer to a private implementation class _TextTokenizerImpl and renamed ITextTokenizer interface to TextTokenizer.

New:

Added mixin class TextTokenizerMixin.
Added object model TextDocument.
Added typedef SyllableCounter.
Added unnamed factory constructor to TextTokenizer that initializes a _TextTokenizerImpl.
Added SentenceSplitter get sentenceSplitter to English class.
Added ParagraphSplitter get paragraphSplitter to English class.
Added SyllableCounter get syllableCounter to English class.
Added TextDocument interface.
Added TextDocumentMixin mixin class.
Added TextDocument unnamed factory with private implementation class.
Added TextDocument.analyze factory constructor.
Added TextDocument.analyzeJson factory constructor.
Added extension on String double lengthDistance(Term other).
Added extension on String double lengthSimilarity(Term other).
Added extension on String Map<Term, double> lengthSimilarityMap(Iterable<Term> terms).

Updated dependencies, tests, examples and documentation. Re-organized code repository

0.9.1 #

New:

Added extension on String double jaccardSimilarity(Term other, [int k = 3]).
Added extension on String double jaccardSimilarity(Term other, [int k = 3]).

Updated dependencies, tests, examples and documentation.

0.9.0 #

BREAKING CHANGES

Breaking Changes:

Removed class TextSource.
Removed class Sentence.
Removed class TermPair.
Removed TextAnalyzer.sentenceSplitter from TextAnalyzer interface.
Changed TextTokenizer.tokenize return value to List<Token>.
Changed TextTokenizer.tokenizeJson return value to List<Token>.

0.8.1 (PRE-RELEASE, BUG FIX) #

Bug Fixes:

Fixed TextTokenizerBase.tokenizeJson would not tokenize documents if Iterable<Zone> zones parameter is empty.

Non-breaking Changes:

TextTokenizerBase.tokenizeJson required non-nullable parameter Iterable<Zone> zones to optional nullable [Iterable<Zone>? zones].

0.8.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

Added type definitions for kGram and Trigram.
New extension method Set<kGram> Term.kGrams([int k = 3]).
New extension method Set<kGram> Iterable<Token>.kGrams([int k = 3]).

Updated dependencies, tests, examples and documentation.

0.7.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

Renamed FieldName type alias to Zone.
Renamed parameter FieldName? field to Zone? zone wherever it is used.

New:

Type alias IdFt.
Type alias Ft.
Type alias ZoneWeightMap.

Updated dependencies, tests, examples and documentation.

0.6.5+1 (PRE-RELEASE) #

Minor bug fixes, updated dependencies, tests, examples and documentation.

0.6.5 (PRE-RELEASE) #

New:

Added custom implementation of TermPair.toString().

Updated dependencies, tests, examples and documentation.

0.6.4 (PRE-RELEASE) #

New:

Added == operator and hashCode getter to TermPair.

Updated dependencies, tests, examples and documentation.

0.6.3 (PRE-RELEASE) #

New:

Added object model TermPair.

Updated dependencies, tests, examples and documentation.

0.6.2 (PRE-RELEASE) #

New:

Added extension getter List<String> get allTerms on Iterable<Token>.

Updated dependencies, tests, examples and documentation.

0.6.1 (PRE-RELEASE) #

Added type aliases to improve code readability.

Updated dependencies, tests, examples and documentation.

0.6.0+1 (PRE-RELEASE) #

Codebase formatted.

Updated dependencies, tests, examples and documentation.

0.6.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

Changed parameters for JsonTokenizer type defintion.

Updated dependencies, tests, examples and documentation.

0.5.0 (PRE-RELEASE) #

New:

Added JsonTokenizer type defintion.

Updated dependencies, tests, examples and documentation.

0.4.1 (PRE-RELEASE) #

New:

Added optional, nullable FieldName? field optional parameter to Tokenizer definition.

Updated dependencies, tests, examples and documentation.

0.4.0+1 (PRE-RELEASE) #

Updated documentation.

0.4.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

Added Token.field property to token, breaks default generative constructor.
Added FieldName? field optional parameter to TextTokenizer.tokenize method.
Removed deprecated property Token.index, use Token.termPosition instead.
Removed deprecated property Token.position, use Token.termPosition instead.
Removed deprecated extension method Iterable<Token>.maxIndex, use Iterable<Token>.Iterable
Removed extension method Iterable<Token>.minIndex, use Iterable<Token>.Iterable

New:

Added new method ITextAnalyser,tokenizeJson.
Added new tests.
Added new examples.

Updated dependencies, tests, examples and documentation.

0.3.1 (PRE-RELEASE) #

Updated dependencies, tests, examples and documentation.

0.3.0+1 (PRE-RELEASE) #

Updated documentation.

0.3.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

TextAnalyzer.characterFilter changed to non-nullable. Use (phrase) => phrase if no characterFilter is required.
TextAnalyzer.termFilter changed to non-nullable. Use (phrase) => [phrase] if no termFilter is required.

New:

Added porter_2_stemmer package export so it does not need to be imported separately.
Updated documentation.

0.2.0+1 (PRE-RELEASE) #

Updated dependencies, tests, examples and documentation.

0.2.0 (PRE-RELEASE) #

New:

Added abstract class TextTokenizerBase.
Updated documentation.

0.1.0+1 (PRE-RELEASE) #

Updated dependencies, tests, examples and documentation.

0.1.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

Added Token.termPosition property to token, breaks default generative constructor.

Deprecated:

Property Token.index, use Token.termPosition instead.
PropertyToken.position, use Token.termPosition instead.
Extension method Iterable<Token>.maxIndex.

1.0.0+1 (PRE-RELEASE) #

Updated dependencies, tests, examples and documentation.

1.0.0 (PRE-RELEASE) #

Updated dependencies, tests, examples and documentation.

0.0.12+1 (PRE-RELEASE) #

Updated documentation.

0.0.12 (PRE-RELEASE) #

New:

Added == operator to Token, Sentence and TextSource.

Updated dependencies, tests, examples and documentation.

0.0.11+build.1.e8af2efb #

PRE-RELEASE

Updated dependencies, tests, examples and documentation.

0.0.11 (PRE-RELEASE) #

Updated dependencies, tests, examples and documentation.

0.0.11-beta.1 (PRE-RELEASE) #

Updated dependencies, tests, examples and documentation.

0.0.10 (PRE-RELEASE) #

Updated dependencies, tests, examples and documentation.

0.0.9-beta.1 #

Breaking Changes:

Changed definition of Token.position.

Updated dependencies, tests, examples and documentation.

0.0.8 (PRE-RELEASE, BREAKING CHANGES) #

BREAKING CHANGE: removed relevance extension method from TokenCollectionExtension.

text_analysis 0.11.0 text_analysis: ^0.11.0 copied to clipboard

Metadata

0.11.0 #

Breaking Changes:

New:

0.10.0 #

Breaking Changes:

New:

0.9.1 #

New:

0.9.0 #

Breaking Changes:

0.8.1 (PRE-RELEASE, BUG FIX) #

Bug Fixes:

Non-breaking Changes:

0.8.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

0.7.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

New:

0.6.5+1 (PRE-RELEASE) #

0.6.5 (PRE-RELEASE) #

New:

0.6.4 (PRE-RELEASE) #

New:

0.6.3 (PRE-RELEASE) #

New:

0.6.2 (PRE-RELEASE) #

New:

0.6.1 (PRE-RELEASE) #

0.6.0+1 (PRE-RELEASE) #

0.6.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

0.5.0 (PRE-RELEASE) #

New:

0.4.1 (PRE-RELEASE) #

New:

0.4.0+1 (PRE-RELEASE) #

0.4.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

New:

0.3.1 (PRE-RELEASE) #

0.3.0+1 (PRE-RELEASE) #

0.3.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

New:

0.2.0+1 (PRE-RELEASE) #

0.2.0 (PRE-RELEASE) #

New:

0.1.0+1 (PRE-RELEASE) #

0.1.0 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

Deprecated:

1.0.0+1 (PRE-RELEASE) #

1.0.0 (PRE-RELEASE) #

0.0.12+1 (PRE-RELEASE) #

0.0.12 (PRE-RELEASE) #

New:

0.0.11+build.1.e8af2efb #

0.0.11 (PRE-RELEASE) #

0.0.11-beta.1 (PRE-RELEASE) #

0.0.10 (PRE-RELEASE) #

0.0.9-beta.1 #

Breaking Changes:

0.0.8 (PRE-RELEASE, BREAKING CHANGES) #

0.0.7 (PRE-RELEASE) #

0.0.6 (PRE-RELEASE, BREAKING CHANGES) #

New:

0.0.5 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

0.0.4 (PRE-RELEASE, BREAKING CHANGES) #

New:

0.0.3 (PRE-RELEASE, BREAKING CHANGES) #

Breaking Changes:

0.0.2 (PRE-RELEASE, BREAKING CHANGES) #

0.0.1-beta.1 (PRE-RELEASE) #

← Metadata

Publisher

Metadata

License

text_analysis 0.11.0
text_analysis: ^0.11.0 copied to clipboard