Elasticsearch bulk insert error: illegal_state_exception, Too many cached tokens (> 100)
September 26, 2019

When I bulk-insert data, I get the following error:

"status" : 500,
"error" : {
  "type" : "illegal_state_exception",
  "reason" : "Too many cached tokens (> 100)"
}
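
For reference, a minimal sketch of how such a failure could be pulled out of a full bulk response. It assumes the fragment above is one entry of the response's `items` array, which is where the bulk API reports per-document status and errors. The `sample` response, including the successful `ID_23` entry, is made up for illustration.

```python
def failed_items(bulk_response):
    """Return (action, _id, error type, reason) for every failed bulk item."""
    failures = []
    for item in bulk_response.get("items", []):
        # Each item has a single key naming the action: index, create, update, or delete.
        for action, result in item.items():
            error = result.get("error")
            if error:
                failures.append(
                    (action, result.get("_id"), error.get("type"), error.get("reason"))
                )
    return failures


# Hypothetical bulk response; only the error object is taken from the question.
sample = {
    "took": 30,
    "errors": True,
    "items": [
        {"index": {"_index": "ind1", "_id": "ID_22", "status": 500,
                   "error": {"type": "illegal_state_exception",
                             "reason": "Too many cached tokens (> 100)"}}},
        {"index": {"_index": "ind1", "_id": "ID_23", "status": 201,
                   "result": "created"}},
    ],
}

for failure in failed_items(sample):
    print(failure)
```

Checking the top-level `errors` flag first avoids walking `items` when every document succeeded.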

The data that triggers the error:

{"index":{"_index":"ind1","_id":"ID_22"}}
{"from":"ADD","id":"ID_22","system":"Photo","title":["Elasticsearch is a search engine based on the Lucene library"],"detail":["<p>It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents. Elasticsearch is developed in Java."," <a href='https://www.google.com' target='_blank'>Google</a>","  Following an open-core business model, <br>parts of the software are licensed under various open-source licenses</p>"],"a":[{"title":"Pro","subtitle":"sub titles","url":"https://www.google.com","img":"Aenean"}]}
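
The two lines above follow the bulk API's newline-delimited JSON layout: an action line, then the document source, each on its own line, with a trailing newline after the last one. A minimal sketch of building such a body in Python follows; the helper name `build_bulk_body` and the shortened document are assumptions for illustration, not part of the original setup.

```python
import json


def build_bulk_body(docs, index_name):
    """Serialize docs as NDJSON: one action line plus one source line per doc."""
    lines = []
    for doc in docs:
        action = {"index": {"_index": index_name, "_id": doc["id"]}}
        # Compact separators keep each JSON object on a single line.
        lines.append(json.dumps(action, separators=(",", ":")))
        lines.append(json.dumps(doc, separators=(",", ":")))
    # The bulk API requires the body to end with a newline.
    return "\n".join(lines) + "\n"


# Shortened, illustrative version of the document from the question.
docs = [{
    "from": "ADD",
    "id": "ID_22",
    "system": "Photo",
    "title": ["Elasticsearch is a search engine based on the Lucene library"],
}]

body = build_bulk_body(docs, "ind1")
print(body, end="")
```

The resulting string would be sent to the `_bulk` endpoint with the `Content-Type: application/x-ndjson` header.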

I used the following mappings and settings:

{
  "settings": {
    "analysis": {
      "filter": {
        "english_stemmer": {
          "type": "stemmer",
          "language": "english"
        },
        "english_stop": {
          "type": "stop",
          "stopwords": "_english_"
        },
        "synonym": {
          "format": "wordnet",
          "type": "synonym",
          "synonyms_path": "analysis/wn_s.pl.txt",
          "lenient": "true"
        }
      },
      "analyzer": {
        "synonym": {
          "filter": [
            "english_stop",
            "english_stemmer",
            "synonym",
            "lowercase"
          ],
          "tokenizer": "standard"
        }
      }
    }
  },
  "mappings": {
    "properties": {
      "system": {
        "type": "keyword"
      },
      "title": {
        "type": "search_as_you_type",
        "analyzer": "synonym"
      }
    }
  }
}
...
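
One way to inspect what the `synonym` analyzer emits for a given title is the index `_analyze` API (for example `GET /ind1/_analyze`), which returns a `tokens` array where each token carries a `position`. The sketch below only builds the request body and summarizes a response by counting tokens per position. The function names and the sample response are hypothetical, and the sketch is a debugging aid, not a confirmed diagnosis of the error.

```python
from collections import Counter


def analyze_request(text, analyzer="synonym"):
    """Build the JSON body for an _analyze call against the index."""
    return {"analyzer": analyzer, "text": text}


def tokens_per_position(analyze_response):
    """Count how many tokens the analyzer emitted at each position."""
    counts = Counter(tok["position"] for tok in analyze_response.get("tokens", []))
    return dict(sorted(counts.items()))


request_body = analyze_request(
    "Elasticsearch is a search engine based on the Lucene library"
)

# Made-up response: three tokens stacked at position 0, one at position 1.
sample_response = {
    "tokens": [
        {"token": "search", "start_offset": 0, "end_offset": 6, "type": "<ALPHANUM>", "position": 0},
        {"token": "hunt", "start_offset": 0, "end_offset": 6, "type": "SYNONYM", "position": 0},
        {"token": "look", "start_offset": 0, "end_offset": 6, "type": "SYNONYM", "position": 0},
        {"token": "engin", "start_offset": 7, "end_offset": 13, "type": "<ALPHANUM>", "position": 1},
    ]
}

print(request_body)
print(tokens_per_position(sample_response))
```

A position with many stacked tokens indicates heavy synonym expansion at that point in the text.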