text_analysis 0.8.0
Text analyzer that extracts tokens from text for use in full-text search queries and indexes.
0.8.0 (PRE-RELEASE, BREAKING CHANGES) #
New:
- Added type definitions for `kGram` and `Trigram`.
- New extension method `Set<kGram> Term.kGrams([int k = 3])`.
- New extension method `Set<kGram> Iterable<Token>.kGrams([int k = 3])`.
Updated dependencies, tests, examples and documentation.
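For readers unfamiliar with k-grams, the general idea can be sketched in plain Dart as below. This is a standalone illustration only: the helper name `kGramsOf` is hypothetical, it returns plain strings rather than the package's `kGram` type, and the package's own `kGrams` extension methods may pad or normalise terms differently.

```dart
/// Returns the set of k-grams (substrings of length [k]) found in [term].
///
/// Illustrative only: this hypothetical helper is not the package's `kGrams`
/// extension, which may treat term boundaries differently.
Set<String> kGramsOf(String term, [int k = 3]) {
  if (k < 1) {
    throw ArgumentError.value(k, 'k', 'must be at least 1');
  }
  final grams = <String>{};
  // A non-empty term shorter than k is kept whole so it still yields one entry.
  if (term.isNotEmpty && term.length < k) {
    grams.add(term);
    return grams;
  }
  // Slide a window of width k across the term, one character at a time.
  for (var i = 0; i + k <= term.length; i++) {
    grams.add(term.substring(i, i + k));
  }
  return grams;
}
```

With the default `k = 3`, calling `kGramsOf('search')` yields the trigrams `sea`, `ear`, `arc` and `rch`. Returning a `Set` means repeated k-grams, such as the second `ana` in `banana`, are stored once.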
0.7.0 (PRE-RELEASE, BREAKING CHANGES) #
Breaking Changes:
- Renamed `FieldName` type alias to `Zone`.
- Renamed parameter `FieldName? field` to `Zone? zone` wherever it is used.
New:
- Type alias `IdFt`.
- Type alias `Ft`.
- Type alias `ZoneWeightMap`.
Updated dependencies, tests, examples and documentation.
0.6.5+1 (PRE-RELEASE) #
Minor bug fixes, updated dependencies, tests, examples and documentation.
0.6.5 (PRE-RELEASE) #
New:
- Added custom implementation of `TermPair.toString()`.
Updated dependencies, tests, examples and documentation.
0.6.4 (PRE-RELEASE) #
New:
- Added `==` operator and `hashCode` getter to `TermPair`.
Updated dependencies, tests, examples and documentation.
0.6.3 (PRE-RELEASE) #
New:
- Added object model `TermPair`.
Updated dependencies, tests, examples and documentation.
0.6.2 (PRE-RELEASE) #
New:
- Added extension getter `List<String> get allTerms` on `Iterable<Token>`.
Updated dependencies, tests, examples and documentation.
0.6.1 (PRE-RELEASE) #
- Added type aliases to improve code readability.
Updated dependencies, tests, examples and documentation.
0.6.0+1 (PRE-RELEASE) #
- Codebase formatted.
Updated dependencies, tests, examples and documentation.
0.6.0 (PRE-RELEASE, BREAKING CHANGES) #
Breaking Changes:
- Changed parameters for `JsonTokenizer` type definition.
Updated dependencies, tests, examples and documentation.
0.5.0 (PRE-RELEASE) #
New:
- Added `JsonTokenizer` type definition.
Updated dependencies, tests, examples and documentation.
0.4.1 (PRE-RELEASE) #
New:
- Added optional, nullable `FieldName? field` parameter to `Tokenizer` definition.
Updated dependencies, tests, examples and documentation.
0.4.0+1 (PRE-RELEASE) #
- Updated documentation.
0.4.0 (PRE-RELEASE, BREAKING CHANGES) #
Breaking Changes:
- Added `Token.field` property to token, breaks default generative constructor.
- Added `FieldName? field` optional parameter to `TextAnalyzer.tokenize` method.
- Removed deprecated property `Token.index`, use `Token.termPosition` instead.
- Removed deprecated property `Token.position`, use `Token.termPosition` instead.
- Removed deprecated extension method `Iterable<Token>.maxIndex`, use `Iterable<Token>.` Iterable
- Removed extension method `Iterable<Token>.minIndex`, use `Iterable<Token>.` Iterable
New:
- Added new method `ITextAnalyser.tokenizeJson`.
- Added new tests.
- Added new examples.
Updated dependencies, tests, examples and documentation.
0.3.1 (PRE-RELEASE) #
Updated dependencies, tests, examples and documentation.
0.3.0+1 (PRE-RELEASE) #
- Updated documentation.
0.3.0 (PRE-RELEASE, BREAKING CHANGES) #
Breaking Changes:
- `TextAnalyzerConfiguration.characterFilter` changed to non-nullable. Use `(phrase) => phrase` if no `characterFilter` is required.
- `TextAnalyzerConfiguration.termFilter` changed to non-nullable. Use `(phrase) => [phrase]` if no `termFilter` is required.
New:
- Added `porter_2_stemmer` package export so it does not need to be imported separately.
- Updated documentation.
0.2.0+1 (PRE-RELEASE) #
Updated dependencies, tests, examples and documentation.
0.2.0 (PRE-RELEASE) #
New:
- Added abstract class `TextAnalyzerBase`.
- Updated documentation.
0.1.0+1 (PRE-RELEASE) #
Updated dependencies, tests, examples and documentation.
0.1.0 (PRE-RELEASE, BREAKING CHANGES) #
Breaking Changes:
- Added `Token.termPosition` property to token, breaks default generative constructor.
Deprecated:
- Property `Token.index`, use `Token.termPosition` instead.
- Property `Token.position`, use `Token.termPosition` instead.
- Extension method `Iterable<Token>.maxIndex`.
1.0.0+1 (PRE-RELEASE) #
Updated dependencies, tests, examples and documentation.
1.0.0 (PRE-RELEASE) #
Updated dependencies, tests, examples and documentation.
0.0.12+1 (PRE-RELEASE) #
- Updated documentation.
0.0.12 (PRE-RELEASE) #
New:
- Added == operator to `Token`, `Sentence` and `TextSource`.
Updated dependencies, tests, examples and documentation.
0.0.11+build.1.e8af2efb #
- PRE-RELEASE
Updated dependencies, tests, examples and documentation.
0.0.11 (PRE-RELEASE) #
Updated dependencies, tests, examples and documentation.
0.0.11-beta.1 (PRE-RELEASE) #
Updated dependencies, tests, examples and documentation.
0.0.10 (PRE-RELEASE) #
Updated dependencies, tests, examples and documentation.
0.0.9-beta.1 #
Breaking Changes:
- Changed definition of `Token.position`.
Updated dependencies, tests, examples and documentation.
0.0.8 (PRE-RELEASE, BREAKING CHANGES) #
- BREAKING CHANGE: removed `relevance` extension method from `TokenCollectionExtension`.
0.0.7 (PRE-RELEASE) #
- Updated documentation.
0.0.6 (PRE-RELEASE, BREAKING CHANGES) #
New:
- Added `TokenCollectionExtension` on `Iterable<Token>`.
0.0.5 (PRE-RELEASE, BREAKING CHANGES) #
Breaking Changes:
- Added `position` property to `Token` class.
0.0.4 (PRE-RELEASE, BREAKING CHANGES) #
New:
- Added `Tokenizer` type definition.
0.0.3 (PRE-RELEASE, BREAKING CHANGES) #
Breaking Changes:
- Stemmer removed from English configuration.
- Stemmer incorporated into default tokenFilter for `TextAnalyzer`.
0.0.2 (PRE-RELEASE, BREAKING CHANGES) #
- Updated documentation.
0.0.1-beta.1 (PRE-RELEASE) #
- Initial version.