text_indexing 0.14.4 text_indexing: ^0.14.4 copied to clipboard
Dart library for creating an inverted index on a collection of text documents.
THIS PACKAGE IS PRE-RELEASE, IN ACTIVE DEVELOPMENT AND SUBJECT TO DAILY BREAKING CHANGES.
0.14.4 #
Upgraded dependencies.
0.14.3+1 #
Updated documentation.
0.14.3 #
Updated dependencies, tests, examples and documentation.
0.14.2 #
Updated dependencies, tests, examples and documentation.
0.14.1+1 #
Updated documentation.
0.14.1 #
New:
- Added extension property
KGramIndex.terms
.
Updated documentation.
0.14.0+1 #
Updated documentation.
0.14.0 #
Breaking Changes:
-
Removed class
TextSource
. -
Removed class
Sentence
. -
Removed class
TermPair
. -
Removed
TextAnalyzerConfiguration.sentenceSplitter
fromTextAnalyzerConfiguration
interface. -
Changed
TextTokenizer.tokenize
return value toList<Token>
. -
Changed
TextTokenizer.tokenizeJson
return value toList<Token>
. -
Re-structured codebase.
-
Updated dependencies, tests, examples and documentation.
0.13.0+3 #
Updated documentation.
0.13.0+2 #
Updated documentation.
0.13.0+1 #
Updated documentation.
0.13.0 #
BREAKING CHANGES
Breaking changes:
-
Removed
TextIndexer.async
,TextIndexer.index
andTextIndexer.inMemory
factory constructors. -
Added a new unnamed factory constructor for
TextIndexer
. -
Re-structured codebase.
-
Updated dependencies, tests, examples and documentation.
0.12.0+1 #
Updated dependencies and documentation.
0.12.0 #
BREAKING CHANGES
Breaking changes:
- Added method
InvertedIndex.getKGramIndex
toInvertedIndex
interface. - Added method
InvertedIndex.upsertKGramIndex
toInvertedIndex
interface. - Added field
InvertedIndex.k
toInvertedIndex
interface. - Removed field
TextIndexer.postingsStream
. - Renamed method
TextIndexer.emit
toTextIndexer.updateIndexes
. - Added
AsyncIndex.k
,AsyncIndex.kGramIndexLoader
andAsyncIndex.kGramIndexUpdater
final fields and parameters toAsyncIndex
class. - Added
InMemoryIndex.k
, andInMemoryIndex.kGramIndex
final fields and parameters toInMemoryIndex
class.
New:
- Type alias
KGramIndex
. - Type alias
KGramIndexLoader
. - Type alias
KGramIndexUpdater
. - Extension method
void KGramIndex.addTermKGrams(Term term, Iterable<KGram> kGrams)
.
Updated dependencies, tests, examples and documentation.
0.11.0 #
New:
- Mixin class
AsyncCallbackIndexMixin
. - Mixin class
InMemoryIndexMixin
.
Updated dependencies, tests, examples and documentation.
0.10.0 #
Breaking changes:
TextIndexerBase
default generative constructor is no longer markedconst
as it has a method body that initializes listeners toTextIndexer.documentStream
andTextIndexer.collectionStream
.
New:
- Input stream fields
TextIndexer.documentStream
andTextIndexer.collectionStream
added toTextIndexer
interface.- - Optional named parameter
Stream<Map<String, Map<String, dynamic>>>? collectionStream
added to added toTextIndexer.async
,TextIndexer.inMemory
andTextIndexer.index
factory contructors.
Updated dependencies, tests, examples and documentation.
0.9.0 #
Breaking changes:
- Renamed
InvertedPositionalZoneIndex
interface toInvertedIndex
. - Renamed
TextIndexer.instance
factory toTextIndexer.index
. - Parameter
dictionaryLengthLoader
added toAsynCallbackIndex
constructor; - Parameter
dictionaryLengthLoader
added toAsyncIndexer
constructor; - Parameter
dictionaryLengthLoader
added toTextIndexer.async
factory constructor; - Removed class
InMemoryIndexer
, use factory constructorTextIndexer.inMemory
in stead. - Removed class
AsyncIndexer
, use factory constructorTextIndexer.async
in stead.
New:
- Type definition
FtdPostings
. - Type definition
IdFtIndex
. - Type definition
IdFt
. - Type definition
ZoneWeightMap
. - Field getter
Future<int> InvertedIndex.vocabularyLength
. - Field getter
Future<int> Function() AsynCallbackIndex.dictionaryLengthLoader
; - Field getter
int InvertedIndex.phraseLength
. - Field getter
ZoneWeightMap InvertedIndex.zones
. - Optional named parameter
ZoneWeightMap zones
added toTextIndexer.async
factory. - Optional named parameter
ZoneWeightMap zones
added toTextIndexer.inMemory
factory. - Method
Future<FtdPostings> InvertedIndex.getFtdPostings(Iterable<Term>, int)
. - Method
Future<IdFtIndex> InvertedIndex.getIdFtIndex(Iterable<Term>)
. - Method
Future<Dictionary> InvertedIndex.getTfIndex(Iterable<Term>)
.
Updated dependencies, tests, examples and documentation.
0.8.0+1 #
Updated dependencies
0.8.0 #
BREAKING CHANGES
Breaking changes:
- Implementation of
TextIndexer.indexText
changed to also insert postings for every pair of terms in the source text.
Updated dependencies, tests, examples and documentation.
0.7.2+1 #
Updated dependencies
0.7.2 #
Updated dependencies
0.7.1 #
Updated dependencies
0.7.0 #
BREAKING CHANGES
Breaking changes:
- Renamed
Postings.documents
extension method toPostings.docIds
.
New:
- Extension method
Set<DocId> containsAll(Iterable<Term>)
- Extension method
Set<DocId> containsAny(Iterable<Term>)
Updated dependencies, tests, examples and documentation.
0.6.0 #
BREAKING CHANGES
Breaking changes:
- Changed signature of extension method
Postings.termPostingsList(Term)
toPostings.termPostingsList([Iterable<Term>?])
. - Removed field
InMemoryIndexer.dictionary
. UseInMemoryIndexer.index.dictionary
instead. - Removed field
InMemoryIndexer.postings
. UseInMemoryIndexer.index.postings
instead. - Removed method
TextIndexer.upsertDictionary
. UseTextIndexer.index.upsertDictionary
instead; - Removed method
TextIndexer.getDictionary
. UseTextIndexer.index.getDictionary
instead; - Removed method
TextIndexer.getPostings
. UseTextIndexer.index.getPostings
instead; - Removed method
TextIndexer.upsertPostings
. UseTextIndexer.index.upsertPostings
instead. - Removed field
InMemoryIndexer.dictionary
. Useindex.dictionary
instead. - Removed field
InMemoryIndexer.postings
. Useindex.postings
instead. - Added new field
InvertedIndex.analyzer
, changing the signatures of factory constructorsTextIndexer.inMemory
and 'TextIndexer.async'.
Updated dependencies, tests, examples and documentation.
0.6.0-2 #
BREAKING CHANGES
Breaking changes:
- Changed signature of extension method
Postings.termPostingsList(Term)
toPostings.termPostingsList([Iterable<Term>?])
.
Updated dependencies, tests, examples and documentation.
0.6.0-1 #
BREAKING CHANGES
Updated dependencies, tests, examples and documentation.
0.5.0 #
BREAKING CHANGES
Deprecated:
- Field
InMemoryIndexer.dictionary
is deprecated. Useindex.dictionary
instead. - Field
InMemoryIndexer.postings
is deprecated. Useindex.postings
instead.
Updated dependencies, tests, examples and documentation.
0.4.0 #
BREAKING CHANGES
Breaking changes:
- Renamed method
TextIndexer.index
toTextIndexer.indexText
. - Renamed class
PersistedIndexer
toAsyncIndexer
.
New:
InvertedIndex
interface and implementation.TextIndexer.index
field getter.TextIndexer.index
factory constructor.TextIndexer.async
factory constructor.TextIndexer.inMemory
factory constructor.
Deprecated:
- Method
TextIndexer.upsertDictionary
is deprecated. UseTextIndexer.index.upsertDictionary
instead; - Method
TextIndexer.getDictionary
is deprecated. UseTextIndexer.index.getDictionary
instead; - Method
TextIndexer.getPostings
is deprecated. UseTextIndexer.index.getPostings
instead; - Method
TextIndexer.upsertPostings
is deprecated. UseTextIndexer.index.upsertPostings
instead. - Field
InMemoryIndexer.dictionary
is deprecated. Useindex.dictionary
instead. - Field
InMemoryIndexer.postings
is deprecated. Useindex.postings
instead.
Updated dependencies, tests, examples and documentation.
0.3.2 #
New:
JSON
andJsonCollection
type aliases.TextIndexer.indexCollection
method.PostingsExtension.documents
getter.
Updated dependencies, tests, examples and documentation.
0.3.1 #
New:
JSON
andJsonCollection
type aliases.- implemented
TextIndexer.indexCollection
method.
Updated dependencies, tests, examples and documentation.
0.3.0 #
BREAKING CHANGES
Breaking changes:
- Removed interface
Document
.
0.2.0 #
BREAKING CHANGES
New:
ZonePostings
,DocumentPostings
, andFieldPostingsEntry
type definitions.Ft
,Pt
,TermPositions
andDocId
type aliases.- interface
Document
.
Breaking changes:
- Replaced object-model class
PostingsEntry
with typedefPostingsEntry
. - Replaced object-model class
DocumentPostingsEntry
with typedefDocumentPostingsEntry
. - Replaced object-model class
DictionaryEntry
with typedefDictionaryEntry
.
Restructured and simplified the codebase.
Updated dependencies, tests, examples and documentation.
0.1.0 #
New:
ITextIndexer.indexJson
method.
Updated dependencies, tests, examples and documentation.
0.0.2+1 #
Updated dependencies, tests, examples and documentation.
0.0.2 #
New:
text_analysis
package to exports.
Updated dependencies, tests, examples and documentation.
0.0.1+10 #
Updated dependencies, tests, examples and documentation.
0.0.1+9 #
Updated dependencies, tests, examples and documentation.
0.0.1+8 #
Updated dependencies, tests, examples and documentation.
0.0.1+7 #
Updated dependencies, tests, examples and documentation.
0.0.1+6 #
Re-worked private methods to suit changes in text_analysis package.
Updated dependencies, tests, examples and documentation.
0.0.1+5 #
- Updated dependencies.
0.0.1+4 #
Updated dependencies, tests, examples and documentation.
0.0.1+3 #
Updated dependencies, tests, examples and documentation.
0.0.1+2 #
Updated documentation.
0.0.1+1 #
Updated dependencies, tests, examples and documentation.
0.0.1 #
BREAKING CHANGES
Interfaces finalized (see breaking changes)
Breaking changes:
TermDictionary
renamedDictionary
.DocumentPostingsEntry
renamedPostings
.PostingsMapEntry
renamedPostingsEntry
.Term
renamedDictionaryEntry
.TermPositions
renamedDocumentPostingsEntry
.AsyncIndexer
implementation.TextIndexerBase
implementation.InMemoryIndexer
implementation.
Updated dependencies, tests, examples and documentation.
0.0.1-beta.4 #
Updated dependencies, tests, examples and documentation.
0.0.1-beta.3 #
BREAKING CHANGES
New:
AsyncIndexer
class.
Breaking changes:
TextIndexerBase
implementation.InMemoryIndexer
implementation.
Updated dependencies, tests, examples and documentation.
0.0.1-beta.2 #
New:
TextIndexerBase
.InMemoryIndexer
.
Updated dependencies, tests, examples and documentation.
0.0.1-beta.1 #
Initial version.