Skip to main content

vector

Lightweight vector module for direct collection management. Agents create named collections, index documents with automatic chunking, and search semantically or with hybrid keyword + vector. Shares the FastEmbed singleton with context_builder and rag - zero extra memory cost when all three are loaded.

PropertyValue
Module idvector
Version1.0.0
Typeuser
Pip depsfastembed, qdrant-client

Use vector for raw vector ops; use rag for full RAG pipelines (knowledge bases, hybrid retrieval, RRF fusion, citations, semantic cache, Text2SQL). rag reference →

The 14 actions

Mostly risk_level: low (reads), medium for inserts, high for delete_collection.

ToolSourcePurpose
vector.create_collectionCreate a named collection.
vector.delete_collectionDelete a collection + all its docs.
vector.list_collectionsList user collections + counts.
vector.addAdd raw text documents (chunks + embeds).
vector.add_fileRead + chunk + embed + add a file.
vector.searchSemantic search.
vector.hybrid_searchSemantic + keyword fusion.
vector.getRetrieve docs by id.
vector.deleteDelete docs by id (or filter).
vector.update_metadataPatch metadata on existing docs.
vector.countCount docs (with optional filter).
vector.collection_statsDoc count, dimensions, storage info.
vector.add_directoryWalk a directory tree, chunk + embed each file. Skips unchanged files (content-hash dedup).
vector.search_multiFan-out semantic search across many collections, merged + re-ranked.

Chunking strategies

StrategySplits onDefault sizeBest for
fixedN characters + overlap500 charsStructured data, code.
sentenceSentence boundaries (. ! ?)500 charsNatural prose.
paragraphDouble newlines (\n\n)1000 charsArticles, docs.
recursive (default)\n\n\n. → char500 charsUniversal.

Each chunk carries {text, index, start_char, end_char, metadata}.

Embedding model

PropertyValue
Default modelsentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 (minilm-l12).
Dimensions384.
Languages~50.
SourceFastEmbed (ONNX, CPU).
SharingSingleton with context_builder + rag.

Override via config.embedding_model (any FastEmbed-supported HuggingFace id).

Collection isolation

Naming convention: user collections are stored as user_{app_id}_{name} and are kept separate from the system tools collection used by tool discovery. Two apps that pick the same nominal collection name do not clash.

Configuration

tools:
modules:
vector:
config:
embedding_model: null # null = use context_builder's singleton
default_chunk_size: 500
default_overlap: 50
persistence_dir: null # null = in-memory; otherwise Qdrant on-disk path

Aliases (FR / EN)

vector ships aliases for international agents. A few examples:

ActionAliases
vector.create_collectioncreer_collection, new_collection
vector.addajouter, indexer, embed, insert
vector.searchrechercher, chercher, query, find_similar
vector.hybrid_searchrecherche_hybride
vector.deletesupprimer_documents, remove

Full list per action via @action(aliases=[...]) in

Cross-references