Signal, Action and Policy System #5

mostafa · 2023-12-29T22:00:47Z

Problem

When proxying requests and responses between the client(s) and the database in the proxy object, the traffic hooks are called. These hooks allow the plugins to inspect and possibly modify the traffic, provided that they register to those hooks. The return value of the hook functions contain two essential fields (key-value) in the req.Fields or resp.Fields:

request (bytes): the binary request from the client(s).
response (bytes): the binary response from the database or injected by the plugin(s).

The plugins can inspect and return a modified request and/or response. Then, GatewayD decides what to do with the modified request and/or response.

Plugins can also influence the traffic flow. To enable this, they should return an extra field to signal GatewayD to terminate the request. The terminate (boolean) is a "signal" that dictates an "action" to be taken by GatewayD.

GatewayD detects the terminate signal and decides whether to act on the signal or not based on a policy, called the termination policy. The termination policy takes either of the following values:

stop (default): terminates the request and returns the response injected into the request.Fields by the plugin to the client.
continue: disregards the signal and continues the traffic flow, thus forwarding the packet to the database.

The terminate signal should be injected into the request.Fields and returned by the OnTrafficFromClient hook function. And only the OnTrafficFromClient can return this signal (plus a normal or error response) to influence the traffic flow.

Signal	Policy	Action
`terminate`	Termination policy (stop/continue)	terminate the request by returning a response

Solution

An action system should be developed to account for actions other than just termination of the request:

Flowchart

flowchart LR
    subgraph GatewayD
        direction LR
        subgraph Proxy
            PassThroughToServer
        end
        PassThroughToServer -- calls --> h
        subgraph PX["Plugin X"]
            h["OnTrafficFromClient"] --returns--> v1S
        end
        subgraph v1S["*v1.Struct"]
            rFBt["req.Fields['signals'] = []Signal{
                Terminate({'value': true}),
                Log({'level':'debug','msg':'...'}),
                Call({'method':'get','url':'...'})}"]
        end
        rFBt --are--> Signals
        subgraph PE["Policy Engine"]
            subgraph TP["Terminate policy"]
                IT{"Signal.terminate == true"} -->|true| ST
                IT -->|false| sendToS["Send request to server"]
                ST{"Policy.terminate == 'stop'"} -->|true| sendToC["Send error/response to client"]
                ST -->|false| sendToS
            end
            subgraph LP["Log policy"]
                IL{"'log' in Signal"} -->|true| PL
                IL -->|false| X["Discard log signal"]
                PL{"Policy.log == true"} -->|true| Log
                PL -->|false| X
            end
            subgraph CP["Call policy"]
                IC{"'call' in Signal"} -->|true| PC
                IC -->|false| W["Discard call signal"]
                PC{"Policy.call == true"} -->|true| Y["Call an API"]
                PC -->|false| W
            end
        end
        Signals -. Signal .-> Actions
        Log -.- Async
        X -.- Async
        sendToC -.- Sync
        sendToS -.- Sync
        W -.- Async
        W -.- Sync
        Y -.- Async
        Signals -- passes through --> IT
        Signals -- passes through --> IL
        Signals -- passes through --> IC
        Actions --> Sync
        Actions --> Async
        Async --> Queue
        subgraph JQ["Job Queue"]
            Queue --> Consumer
            Consumer --> Worker
            Worker --> Result
        end
        Sync -->|sync| Worker
        Result -->|response| h
        h -->|response| PassThroughToServer
    end
    style Actions fill:#fff,color:black
    style Log fill:#fff,color:black
    style X fill:#fff,color:black
    style sendToC fill:#fff,color:black
    style sendToS fill:#fff,color:black
    style W fill:#fff,color:black
    style Y fill:#fff,color:black
    style GatewayD fill:#fff,color:black
    style PX fill:#fff,color:black
    style v1S fill:#fff,color:black
    style JQ fill:#fff,color:black

Sequence diagram

sequenceDiagram
    Client ->> GatewayD: sends a query
    GatewayD ->> Plugins: calls a hook (onTrafficFromClient)
    Plugins ->> GatewayD: return single or multiple non-contradicting signal(s)
    GatewayD ->> GatewayD: signal is mapped to an action
    GatewayD ->> GatewayD: policies control whether to run or discard the action
    par runs sync action(s) (e.g. terminate)
        GatewayD ->> GatewayD: decides what to do with the request or response
        GatewayD -->> Client: terminates traffic (decision is final)
    and queues async action(s) (e.g. call a webhook)
        GatewayD ->> Queue: queue job for calling a webhook
        Queue ->> GatewayD: queued or failed and logged
    end

Actions

The following is the list of built-in and custom actions.

GatewayD sync actions

Signal	Policy	Action
`terminate` (bool)	`signal.terminate == true && policy.terminate == "stop"`	Terminate request (current) (change to `drop`?)
`disconnect`		Terminate connection
`reject`		Drop the request and reset the TCP connection
`transmit`/`tx`		Bypass plugins and just relay the request/response
`allow`		Allow request/response
`deny`		Deny request/response
`block`		Block client (time-based or permanent)
`discard`		Discard a request/response
`reset`		Reset the connection either way (`disconnect`?)
`fallthough`		No action (is it needed?)
`route`		Route traffic to a specific server
`forward`		Forward traffic to a specific server
`upgrade`		Upgrade connection to TLS
`police`		Apply rate limits to the traffic
`set`		Set/update session parameter
`noset`		Prevent session parameter update
`create`		create a new connection to the database (`connect`?)
`limit`		Limit client or request/response (should it be a custom action?)
`error`		Return error response (should it be a `response`?)
`request`		Return a (modified) request
`response`		Return a (modified) response

GatewayD async actions

Signal	Policy	Action
`log`		Log audit trail (or request/response)
`metric`		Record a metric
`queue`		Queue request/response
`conntrack`		Connection tracking
`mirror`		Mirror traffic to another server
`inspect`		Store traffic for inspection (`submit` for inspection?)
`record`		Store all traffic for inspection
`quarantine`		Like record, but for malicious packets (?)
`call`		Call an API or webhook (HTTP request?)

Plugin or custom actions

Signal	Policy	Action
`cache`		Cache request/response
`alert`		Log and/or trigger an alert
`notify`		Notify a plugin of something (or call a service?)
`run`		Run an action (should it be a custom action?)
`webhook`		Call a webhook
`rotate`		Rotate keys and secrets
`reauth`		Reauthenticate the client either way
`ebpf`		Run eBPF program to block a client in the kernel
`wasm`		Run WebAssembly filter
`publish`/`produce`		Publish a (batch of) message(s) (to Kafka or any other streaming/messaging system)
`subscribe`/`consume`		Consume a (batch of) message(s) (to Kafka or any other streaming/messaging system)

Custom debugger actions

Signal	Policy	Action
`debug`		Start debugging
`breakpoint`		Set a breakpoint on an action/step
`pause`		Pause for user input
`step`		Run a single step after user input (maybe `next`?)
`continue`		Continue processing
`abort`		Abort processing

Action storage/channel

Currently everything is passed around at the root level of the request.Fields. The action can be passed through in the context (metadata) to avoid clashing with the keys injected by HandleClientMessage (specifically the Terminate message) from the PostgreSQL wire protocol parser. This needs to be abstracted away in something like gRPC metadata. The metadata is passed around in the context object, which is unidirectional from the client (GatewayD) to the server (plugins). However, the plugins can only pass data to GatewayD using the returned request.Fields (and the error, which is not designed for this task). This means that a custom field should be created to handle the action(s).

Policies and actions

Note to self: Policies dictate actions. For example, a simple policy would be equality to a value (query.where.id == 1). Policies can be stored anywhere, including a policy engine like OPA.

TODO

Consider simplifying actions (too many seemingly duplicate names: terminate, reject, block, deny, etc.)
Create a new ticket for custom debugger actions
Create a new ticket for adding more capabilities to the plugin system (and plugins) like extending actions (in addition to hooks)

Resources

The text was updated successfully, but these errors were encountered:

This giant PR adds the very first version of the Act system that was proposed in [this proposal](gatewayd-io/proposals#5). The old way of signaling was static and only supported a single signal: `terminate`. The new system support more signals, adds proper policies that can be easily controlled by the users and the actions are executed in sync and async mode. The Act system consists of these components: 1. **Act Registry**: takes care of registering signals, policies and actions. It also applies policies to signals to produce outputs for actions and runs actions using those outputs. 2. **Signals**: plugins' hooks can return signal(s) as part of their request/response. These signals tell GatewayD what to do. 3. **Policies**: signals pass through predefined policies that will decide whether GatewayD should react to the signal or not. 4. **Actions**: actions run in sync or async mode and perform a function. Sync actions are used to control traffic (passthrough, terminate, etc.) and other parts of the system, and async actions can other things (log, publish a message to Kafka, etc.). 5. **Plugin Registry**: after running a hook on each plugin, the signals are extracted and the policies are applied to those signals. The output of those policy evaluations are returned to the caller, which knows how to run action and use its results. And the code spans over two projects: 1. **GatewayD**: all the above components of the Act system are in GatewayD. 2. **SDK**: types and helper functions for creating and exporting signals are in the SDK. ### Breaking changes The old way of terminating requests don't work anymore, as it was refactored in #442 and all the plugins are updated to pick up the changes.

mostafa added the enhancement New feature or request label Dec 29, 2023

This was referenced Dec 29, 2023

Refactor notification hooks gatewayd-io/gatewayd#397

Closed

Routing gatewayd-io/gatewayd#399

Open

mostafa mentioned this issue Jan 6, 2024

Refactor proxy gatewayd-io/gatewayd#337

Open

9 tasks

mostafa self-assigned this Jan 21, 2024

mostafa changed the title ~~Refactor and extend the action system~~ The signal, action and policy system Jan 28, 2024

mostafa changed the title ~~The signal, action and policy system~~ The Signal, Action and Policy System Jan 28, 2024

mostafa transferred this issue from gatewayd-io/gatewayd Jan 28, 2024

mostafa changed the title ~~The Signal, Action and Policy System~~ Signal, Action and Policy System Jan 28, 2024

mostafa mentioned this issue Jan 29, 2024

Implement signal, action and policy system gatewayd-io/gatewayd#422

Closed

9 tasks

mostafa moved this from 🚧 In progress to 📋 Backlog in GatewayD Core Public Roadmap Feb 18, 2024

mostafa mentioned this issue Feb 24, 2024

Act system gatewayd-io/gatewayd#451

Merged

13 tasks

mostafa mentioned this issue Mar 2, 2024

Act system gatewayd-io/docs#46

Merged

mostafa added the core Anything related to the gatewayd core project label Mar 2, 2024

mostafa mentioned this issue Oct 7, 2024

Use the result of notification hooks gatewayd-io/gatewayd#616

Merged

14 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Signal, Action and Policy System #5

Signal, Action and Policy System #5

mostafa commented Dec 29, 2023 •

edited

Loading

Signal, Action and Policy System #5

Signal, Action and Policy System #5

Comments

mostafa commented Dec 29, 2023 • edited Loading

Problem

Solution

Actions

Action storage/channel

Policies and actions

TODO

Related

Resources

mostafa commented Dec 29, 2023 •

edited

Loading