Convert data into formats that are easy to ingest into Elasticsearch
A personal learning project aimed at building a better understanding of how to manipulate data formats, e.g.
- SQL -> JSON
- geocoding JSON data by adding latitude and longitude derived from its address fields via an external API (see the sketch after the bulk example below)
- JSON -> NDJSON -> import file to Elasticsearch
- JSON -> NDJSON -> Elasticsearch API `POST /_bulk` command, e.g.

```
POST alpacas/_bulk
{"index":{"_id":1}}
{"country":"NO","alpacaId":9876543210,"keeper":"0123456789","gender":"SEX_FEMALE","alpacaShortName":"ANITA IS COOL","name":"Anita's Alpacas"}
{"index":{"_id":2}}
{"country":"NO","alpacaId":9999543210,"keeper":"0123456789","gender":"SEX_MALE","alpacaShortName":"THOR IS COOL","name":"Anita's Alpacas"}
```
- Clone the repo and navigate into it
- Run `npm install`
- See config.js and override any non-sensitive values in the corresponding environment files, e.g. config.test.json (an illustrative override follows this list)
- A `.env` file in the project root should contain keys for sensitive values, e.g.

```
# Elasticsearch
ELASTIC_CLOUD_ID="UPDATE-ME"
ELASTIC_USERNAME="UPDATE-ME"
ELASTIC_PASSWORD="UPDATE-ME"

# Google API
GOOGLE_MAPS_API_KEY="UPDATE-ME"

# MySQL
MYSQL_PASSWORD="YOUR PASSWORD GOES HERE"
```
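As a purely illustrative example of the environment override mentioned above, a config.test.json might look like the snippet below. Only `db.ssl_ca` is referenced elsewhere in this README; every other key is a hypothetical placeholder, so check config.js for the real structure.

```json
{
  "db": {
    "host": "localhost",
    "database": "alpacas_test",
    "ssl_ca": "./data/your-azure-ssl-cert.pem"
  },
  "elastic": {
    "index": "alpacas_test"
  }
}
```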
MySQL on Azure portal
- Get the certificate from https://portal.azure.com/ under MySQL flexible server > Settings > Networking > Download SSL Certificate
- Put it in the ./data folder, which must be listed in `.gitignore`
- Update the filename to match the config `db.ssl_ca` value (see the connection sketch below)
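For reference, here is a sketch of how that `db.ssl_ca` value would typically be consumed when opening the connection, assuming the `mysql2` package; the repo's actual driver and config loading may differ.

```js
// Sketch: open a MySQL connection using the downloaded Azure CA certificate.
// Assumes mysql2; config keys other than db.ssl_ca are hypothetical.
const fs = require("fs");
const mysql = require("mysql2/promise");
const config = require("./config"); // however the resolved config is exposed

async function connect() {
  return mysql.createConnection({
    host: config.db.host,
    user: config.db.user,
    password: process.env.MYSQL_PASSWORD,
    database: config.db.database,
    ssl: { ca: fs.readFileSync(config.db.ssl_ca) }, // Azure flexible server requires TLS
  });
}

module.exports = { connect };
```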
Pre-conditions
- The `.env` file contains the correct overrides for sensitive values for the current environment
- Create a local folder ./data and store the .sql dump file there
- Local MySQL is running and the database is populated. If not, follow the steps in pre-requisistes.md
- Local env: `npm run sql_to_elastic`, or test env: `npm run sql_to_elastic_test`
- Local env: `npm run sql_to_json`, or test env: `npm run sql_to_json_test` (the SQL -> JSON step is sketched below)
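Conceptually, the SQL -> JSON step behind `npm run sql_to_json` boils down to something like the sketch below; the table name and output filename are illustrative, and the real script takes them from config.

```js
// Sketch: read rows from MySQL and write them out as a JSON array on disk.
const fs = require("fs");

async function sqlToJson(connection) {
  // `connection` is a mysql2 connection like the one sketched above
  const [rows] = await connection.execute("SELECT * FROM alpacas"); // hypothetical table
  fs.writeFileSync("./data/alpacas.json", JSON.stringify(rows, null, 2)); // hypothetical filename
}

module.exports = { sqlToJson };
```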
Automate with Elasticsearch client
- Create the index in Elasticsearch from an existing JSON file: `node json_to_elastic` (a sketch of this step follows the list; edit the JSON filename as needed — TODO: automate getting this from the SQL -> JSON step)
- Verify the index was created in Elasticsearch Dev Tools: `GET alpacas/_search`
- Note that it uses an alias that is updated: `GET _alias/alpacas`
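A rough sketch of what json_to_elastic might do with the official `@elastic/elasticsearch` client is shown below. The dated-index naming, the input filename and the alias handling are illustrative guesses, not the repo's exact logic.

```js
// Sketch: bulk-index a JSON file into a fresh index, then point the `alpacas`
// alias at it. Index naming and input filename are hypothetical.
require("dotenv").config();
const { Client } = require("@elastic/elasticsearch");
const records = require("./data/alpacas.json"); // hypothetical filename

const client = new Client({
  cloud: { id: process.env.ELASTIC_CLOUD_ID },
  auth: { username: process.env.ELASTIC_USERNAME, password: process.env.ELASTIC_PASSWORD },
});

async function run() {
  const index = `alpacas-${Date.now()}`; // new physical index per run
  await client.helpers.bulk({
    datasource: records,
    onDocument: () => ({ index: { _index: index } }),
  });
  // Switch the `alpacas` alias to the new index so GET alpacas/_search keeps working.
  // A real script would also remove the alias from the previous index.
  await client.indices.updateAliases({
    actions: [{ add: { index, alias: "alpacas" } }],
  });
}

run().catch(console.error);
```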
Generate NDJSON file to import manually into Elasticsearch
- Edit the JSON filename to read from in json_to_ndjson.js and save the file
- Run `node json_to_ndjson`
- Look for the generated file in the directory
- Import this file into Elasticsearch (the conversion is sketched below)
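The conversion itself amounts to writing one bulk action line plus one source line per record, matching the `POST /_bulk` example above. A minimal sketch, with illustrative filenames:

```js
// Sketch: turn a JSON array into an NDJSON bulk payload.
const fs = require("fs");

const records = JSON.parse(fs.readFileSync("./data/alpacas.json", "utf8"));
const ndjson =
  records
    .map((doc, i) => JSON.stringify({ index: { _id: i + 1 } }) + "\n" + JSON.stringify(doc))
    .join("\n") + "\n"; // a bulk payload must end with a newline
fs.writeFileSync("./data/alpacas.ndjson", ndjson);
```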
Run the tests: `npm run test`
- Edit the file farm_category.js to contain the actual categories for farms (a hypothetical shape is sketched below)
- The values in this file override values in the database, which are not up to date
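The shape of farm_category.js is not shown in this README; as a purely hypothetical illustration, it could be a simple lookup keyed by farm name:

```js
// Hypothetical shape: map each farm to its correct category, overriding the
// stale value read from the database. Names and category values are examples only.
module.exports = {
  "Anita's Alpacas": "BREEDER",
  // "Another Farm": "HOBBY",
};
```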
Format the code
`npm run prettier`
- Location data from Google Maps
The work is under exclusive copyright by default.