See geomesa-tools
... for community maintained readmes, simple feature type configs, and converter configs
GeoMesa Data (gm-data) provides several built-in SimpleFeatureType (SFT) and converter definitions to help ingest common data formats from across the web. Each module contains a `reference.conf` file that defines a named SFT and converter, which can be referenced from the GeoMesa tools or the GeoMesa NiFi processor once the module is installed on the classpath.
The `gm-data-all` module packages all submodules into a shaded jar and concatenates the `reference.conf` files found in the other modules, providing a single deployable artifact.
Build the GeoMesa convert module (`geomesa-convert-all`) first, then build gm-data:

    cd gm-data
    mvn clean install
| Data source | SFT name (`-s`) | Converter name (`-C`) |
|---|---|---|
| GDELT | `gdelt` | `gdelt` |
| GeoLife | `geolife` | `geolife` |
| Global Terrorism Database | `gtd` | `gtd` |
| New York City Taxi | `nyctaxi`<sup>1</sup>, `nyctaxi-single` | `nyctaxi`, `nyctaxi-drop`, `nyctaxi-single` |
| Open Street Map GPX | `osm` | `osm` |
| GeoNames | `geonames` | `geonames` |
| T-Drive | `tdrive` | `tdrive` |
| Twitter<sup>2</sup> | `twitter` | `twitter`, `twitter-place-centroid`<sup>3</sup> |
1. NYC Taxi data includes two points: the pickup and the dropoff. The `nyctaxi` feature type contains one point and an indicator of whether it is the pickup or dropoff. The `nyctaxi-single` feature type has both geometries.
2. The converter expects one tweet per line in the file.
3. The `twitter` converter applies the precise point geometry if available; `twitter-place-centroid` extracts the centroid from the bounding box of the tweet's place attribute.
For GeoMesa Tools, copy the `gm-data-all-{version}.jar` file into the `$GEOMESA_HOME/lib` directory and run `geomesa ingest`. Specify the simple feature type by name with the `--spec` or `-s` option, and the converter config by name with the `--converter` or `-C` (capital C) option.
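As an illustration of the options described above, the following sketch assembles an ingest command using the GDELT names from the table. The input path is a hypothetical placeholder, and any data store connection or catalog options your GeoMesa distribution requires are deliberately left out, so the script prints the command instead of running it.

```shell
#!/bin/sh
# Sketch only: assembles a `geomesa ingest` command line and prints it.
# The SFT and converter names are taken from the table of definitions;
# the input path is a hypothetical placeholder.
SFT_NAME="gdelt"
CONVERTER_NAME="gdelt"
INPUT_FILE="/data/gdelt/events.tsv"   # hypothetical path

# --spec (-s) selects the SFT by name; --converter (-C) selects the converter by name.
# Data store connection options are omitted; add the ones your setup needs.
INGEST_CMD="geomesa ingest --spec ${SFT_NAME} --converter ${CONVERTER_NAME} ${INPUT_FILE}"

# Print the command for review rather than executing it.
echo "${INGEST_CMD}"
```

Printing the command first makes it easy to confirm that the SFT and converter names match before adding connection options and running the ingest for real.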
For Apache NiFi, copy the `gm-data-all-{version}.jar` file into the `$NIFI_HOME/lib` directory and restart NiFi.
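The copy step is the same for both installations, so it could be wrapped in a small helper. This is a minimal sketch, not part of gm-data: the function name `install_gm_data` is hypothetical, and the location of the built jar is left as a placeholder.

```shell
#!/bin/sh
# Sketch only: copies a gm-data-all jar into the lib directory of an install root.
# `install_gm_data` is a hypothetical helper, not something shipped with gm-data.
# Usage: install_gm_data <jar-file> <install-root>
install_gm_data() {
    jar_file="$1"
    install_root="$2"

    # Refuse to continue if the jar has not been built or the path is wrong.
    if [ ! -f "${jar_file}" ]; then
        echo "jar not found: ${jar_file}" >&2
        return 1
    fi

    # The install root is expected to already contain a lib directory.
    if [ ! -d "${install_root}/lib" ]; then
        echo "no lib directory under: ${install_root}" >&2
        return 1
    fi

    cp "${jar_file}" "${install_root}/lib/"
}

# Example calls (commented out; replace the placeholder path with your jar):
# install_gm_data "path/to/gm-data-all-{version}.jar" "${GEOMESA_HOME}"   # GeoMesa Tools
# install_gm_data "path/to/gm-data-all-{version}.jar" "${NIFI_HOME}"      # then restart NiFi
```

The helper only copies the file; restarting NiFi remains a separate manual step.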