Skip to content

Latest commit

 

History

History
155 lines (107 loc) · 7.21 KB

03_Introduction_into_Metafacture-Fix.md

File metadata and controls

155 lines (107 loc) · 7.21 KB

Lesson 3: Introduction into Metafacture Fix

In the last session we learned about Flux-Moduls. Flux-Moduls can do a lot of things. They configure the the "high-level" transformation pipeline.

But the main transformation of incoming data at record-, elemenet- and value-level is usually done by the transformation moduls: fix or morph as one step in the pipeline.

What do we mean when we talk about transformation, e.g.:

  • Manipulating element-names and element-values
  • Change hierachies and structures of records
  • Lookup values in concordance list.

But not changing serialization that is part of encoding and decoding.

In this tutorial we focus on Fix. If you want to learn about Morph have a look at https://slides.lobid.org/metafacture-2020/#/

Metafacture Fix and Fix Functions

So let's dive into Metafacture Fix and get back to the Playground. Clear it if needed and paste the following Flux in the Flux-File area.

"https://weather-proxy.freecodecamp.rocks/api/current?lat=50.93414&lon=6.93147"
| open-http
| as-lines
| decode-json
| fix ("retain('name')")
| encode-yaml
| print
;

You should end up with something like:

---
name: Cologne
...

The fix module in Metafacture is used to manipulate the input data filtering fields we would like to see. Only one fix-function was used: retain, which throws away all the data from the input except the ‘name’ field.

Also add the info that is written in main

"https://weather-proxy.freecodecamp.rocks/api/current?lat=50.93414&lon=6.93147"
| open-http
| as-lines
| decode-json
| fix ("retain('name', 'main.temp')")
| encode-yaml
| print
;

You should now see something like this:

---
main:
  temp: "15.99"
name: "Cologne"

When manipulating data you often need to create many fixes to process a data file in the format and structure you need. With a text editor you can write all fix functions in a singe separate fix-file.

The playground has an transformationFile-content area that can be used as if the fix is in a separate file. In the playground we use the variable transformationFile to adress the fix file in the playground.

Like this.

image

Write the following Fix into the transformationFile-content area:

retain("name", "main.temp")

With this separate fix-file it will be a bit easier to write many fix-functions and it does not overcrowd the flux-workflow.

To add more fixes we can again edit the fix file. And add these lines in front of the retain function:

move_field("main.temp", "temp")

With move_field you can change the name and the position of an element in a record. Here instead of temp being a subfield of main, it tempis now a top leve element.

Also change the retain function, that you only keep "name" and "temp" and not the not existing "main.temp" any more.

move_field("main.temp","temp")
retain("name", "temp")

The output should be something like this:

---
name: "Cologne"
temp: "16.29"

So with move_field we moved and renamed an existing element. As next step add the following function before the retain function.

prepend("temp","The temperature is ")
append("temp"," degrees Kelvin")

If you execute your last workflow with the Process-Button again, you should now see as ouput:

---
name: "Cologne"
temp: "The temperature is 16.29 degrees Kelvin"

See the example in the playground.

Metafacture contains many fix function to manipulate data. Also there are many flux commands/modules that can be used.

Check the documentation to get a complete list of flux command and fix functions. This post only presented a short introduction into Metafacture. In the next posts we will go deeper into its capabilities.

Have a look at the fix functions and try to solve the following excercises:

Excercise

Add a field with todays date with add_field or use timestamp

Answer [See here](https://metafacture.org/playground/?flux=%22https%3A//weather-proxy.freecodecamp.rocks/api/current%3Flat%3D50.93414%26lon%3D6.93147%22%0A%7C+open-http%0A%7C+as-lines%0A%7C+decode-json%0A%7C+fix+%28transformationFile%29%0A%7C+encode-yaml%0A%7C+print%0A%3B&transformation=move_field%28%22main.temp%22%2C%22temp%22%29%0Aprepend%28%22temp%22%2C%22The+temperature+is+%22%29%0Aappend%28%22temp%22%2C%22+degrees+Kelvin%22%29%0Aadd_field%28%22time%22%2C%222023-11-11%22%29%0Aretain%28%22name%22%2C+%22temp%22%2C%22time%22%29)

Use the function timestamp instead of add_field.

Answer [See here](https://metafacture.org/playground/?flux=%22https%3A//weather-proxy.freecodecamp.rocks/api/current%3Flat%3D50.93414%26lon%3D6.93147%22%0A%7C+open-http%0A%7C+as-lines%0A%7C+decode-json%0A%7C+fix+%28transformationFile%29%0A%7C+encode-yaml%0A%7C+print%0A%3B&transformation=move_field%28%22main.temp%22%2C%22temp%22%29%0Aprepend%28%22temp%22%2C%22The+temperature+is+%22%29%0Aappend%28%22temp%22%2C%22+degrees+Kelvin%22%29%0Atimestamp%28%22time%22%2Cformat%3A%22yyyy-MM-dd%27T%27HH%3Amm%3Ass%22%2C+timezone%3A%22Europe/Berlin%22%29%0Aretain%28%22name%22%2C+%22temp%22%2C%22time%22%29)

Upcase the name element

Answer [See here](https://metafacture.org/playground/?flux=%22https%3A//weather-proxy.freecodecamp.rocks/api/current%3Flat%3D50.93414%26lon%3D6.93147%22%0A%7C+open-http%0A%7C+as-lines%0A%7C+decode-json%0A%7C+fix+%28transformationFile%29%0A%7C+encode-yaml%0A%7C+print%0A%3B&transformation=move_field%28%22main.temp%22%2C%22temp%22%29%0Aprepend%28%22temp%22%2C%22The+temperature+is+%22%29%0Aappend%28%22temp%22%2C%22+degrees+Kelvin%22%29%0Atimestamp%28%22time%22%2Cformat%3A%22yyyy-MM-dd%27T%27HH%3Amm%3Ass%22%2C+timezone%3A%22Europe/Berlin%22%29%0Aretain%28%22name%22%2C+%22temp%22%2C%22time%22%29)

Next lesson: 04 Fix Path