Weaviate · JSON Structure

Weaviate Tokenizer User Dict Config Structure

A list of pairs of strings that should be replaced with another string during tokenization.

Type: object Properties: 2
Vector Database, AI, Machine Learning, Semantic Search, Open Source, GraphQL, Kubernetes

Weaviate Tokenizer User Dict Config Structure is a JSON Structure definition published by Weaviate, describing 2 properties. It conforms to the https://json-structure.org/meta/extended/v0/# meta-schema.

Properties

tokenizer
replacements

Meta-schema: https://json-structure.org/meta/extended/v0/#

JSON Structure

{
  "$schema": "https://json-structure.org/meta/extended/v0/#",
  "$id": "https://api-evangelist.github.io/weaviate/json-structure/weaviate-tokenizer-user-dict-config-structure.json",
  "title": "TokenizerUserDictConfig",
  "description": "A list of pairs of strings that should be replaced with another string during tokenization.",
  "type": "object",
  "properties": {
    "tokenizer": {
      "type": "string",
      "description": "The tokenizer to which the user dictionary should be applied. Currently, only the `kagome` ja and kr tokenizers support user dictionaries."
    },
    "replacements": {
      "type": "array"
    }
  }
}
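
As a rough illustration of how an instance of this structure might be checked, the Python sketch below tests only what the definition above actually states: the value is an object, `tokenizer` (if present) is a string, and `replacements` (if present) is an array. The definition does not describe the shape of each replacement entry, so the `source` and `target` keys in the example instance are assumptions made for illustration, as are the `kagome_kr` tokenizer name and the helper name `check_user_dict_config`.

```python
import json


def check_user_dict_config(config):
    """Return a list of problems found in a TokenizerUserDictConfig-like value.

    Only the constraints visible in the structure definition are checked.
    Neither property is declared as required, so both are treated as optional.
    """
    if not isinstance(config, dict):
        return ["config must be a JSON object"]

    problems = []

    tokenizer = config.get("tokenizer")
    if tokenizer is not None and not isinstance(tokenizer, str):
        problems.append("tokenizer must be a string")

    replacements = config.get("replacements")
    if replacements is not None and not isinstance(replacements, list):
        problems.append("replacements must be an array")

    return problems


# Hypothetical instance: the tokenizer name and the "source"/"target" keys
# are assumptions, not taken from the structure definition.
example = json.loads("""
{
  "tokenizer": "kagome_kr",
  "replacements": [
    {"source": "Weaviate", "target": "We Aviate"}
  ]
}
""")

print(check_user_dict_config(example))  # prints []
```

Because the definition leaves the array items unconstrained, any stricter check on individual replacement entries would have to come from Weaviate's own API documentation rather than from this structure.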