OK, then there's something pretty simple you can do with a small shell script (see below). The idea is not to edit your file manually, but to let Python do it and create another file whose format complies with what the _bulk endpoint expects. The script does the following:
- First, we declare a little Python script that reads your JSON file and creates a new one in the file format required by the _bulk endpoint (see the example just after this list)
- Then, we run that Python script and store the resulting bulk file
- Finally, we send the file created in step 2 to the _bulk endpoint using a simple curl command
- There you go, you now have a new ES index containing your documents
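To make the expected bulk format concrete, here is a small, purely illustrative example (the document fields are invented, not taken from your data). If file.json contains a plain JSON array like this:
[
  {"name": "doc1"},
  {"name": "doc2"}
]
then the generated bulk.json will contain one action line followed by one source line per document:
{"index": {}}
{"name": "doc1"}
{"index": {}}
{"name": "doc2"}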
bulk.sh:
#!/bin/sh
# 0. Some constants to re-define to match your environment
ES_HOST=localhost:9200
JSON_FILE_IN=/path/to/your/file.json
JSON_FILE_OUT=/path/to/your/bulk.json
# 1. Python code to transform your JSON file
PYTHON="import json,sys;
out = open('$JSON_FILE_OUT', 'w');
with open('$JSON_FILE_IN') as json_in:
docs = json.loads(json_in.read());
for doc in docs:
out.write('%s
' % json.dumps({'index': {}}));
out.write('%s
' % json.dumps(doc, indent=0).replace('
', ''));
"
# 2. run the Python script from step 1
python -c "$PYTHON"
# 3. use the output file from step 2 in the curl command
curl -s -XPOST $ES_HOST/index/type/_bulk --data-binary @$JSON_FILE_OUT
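Note that index and type in the URL above are placeholders for your actual index and mapping type names. Also, depending on your Elasticsearch version (6.x and later enforce this), the bulk request may be rejected without an explicit Content-Type header, in which case the curl command would look like:
curl -s -XPOST $ES_HOST/index/type/_bulk -H 'Content-Type: application/x-ndjson' --data-binary @$JSON_FILE_OUT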
You need to:
- save the above script in the bulk.sh file and make it executable (i.e. chmod u+x bulk.sh)
- modify the three variables at the top (step 0) in order to match your environment
- run it using ./bulk.sh
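Once the script has run, you can quickly verify that the documents made it in by asking Elasticsearch for a count (again assuming the placeholder index name used in the curl command above):
curl -s $ES_HOST/index/_count
The returned count should match the number of documents in your original JSON array; if it doesn't, look at the response body of the _bulk call itself, which reports an error for every item that failed.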