Collection of graph partitioning algorithms implemented in Python for composing cloud-native applications from standalone serverless functions in a cost-efficient and latency-constrained manner.
In the context of serverless computing, function fusion is a novel, high-level approach to improve performance and at the same time reduce the operational cost of serverless applications consisting of stateless, ephemeral functions. This is achieved by grouping, encompassing, and assembling connected FaaS functions into separate composite components representing the deployable software artifacts that are provisioned by the serverless frameworks in the same way as other single functions. In addition, different user-defined Quality of Service (QoS) constraints should be also taken into account, e.g., overall response time of the application or an end-to-end latency constraint on the critical path in the application's call graph.
Under the hood, this problem can be formalized as the partitioning of the application call graph (DAG) into disjoint, connected subgraphs in a cost-efficient manner, while specific requirements imposed by the user and the platform (flavors) itself need to be satisfied.
In this package, we designed, implemented, and collected various partitioning algorithms tailored to tree-shape serverless applications with different runtime complexity, considering communication parameters and requirements. Our main goal is to find the cost-optimal grouping of functions concerning node and edge-weighted trees and cost/memory/latency models based on public cloud frameworks, whereas each flavor imposes an upper limit on the available operative memory. Moreover, a user-given latency constraint has to be fulfilled on the tree's critical path, which is defined as the subchain between the first/front-end function and a predefined leaf node.
Our implementations require Python3.10 or above. The following code snippet can be used to set up the latest Python environment on Ubuntu.
sudo add-apt-repository -y 'ppa:deadsnakes/ppa' && sudo apt update
sudo apt install python3.11 python3.11-dev
sudo curl -sS https://bootstrap.pypa.io/get-pip.py | sudo python3.11
The easiest way to get our algorithms collected in SLAMBUC is to install the package from PyPI repository.
python3.11 -m pip install slambuc
However, for the latest changes, it can be installed directly from GitHub as follows.
python3.11 -m pip install --no-cache-dir git+https://github.com/hsnlab/SLAMBUC.git
Tree plotting relies on networkx's internal plotting feature that generates a layout based on the graphviz tool and its python frontend. Thus, in that case, the related dependencies must be installed first.
sudo apt-get install graphviz graphviz-dev
python3.11 -m pip install pygraphviz
External solvers can also be used in LP-based algorithms that require the given solver packages to be preinstalled and available for the PuLP frontend. Currently, the following solvers are tested.
- CBC (default, packaged with PuLP)
- GLPK (see installation here)
- CPLEX (installation and setup)
It is worth noting that CPLEX's python wrapper docplex (as a replacement for PuLP) is left behind the latest Python version. For using this API, requirements are prepared separately for Python3.10.
python3.10 -m pip install -U -r requirements_py3.10.txt
For solving constrained shortest path problems (CSP), we apply solution methods from cspy.
Our repository contains separate test scripts under the tests folder for validating the input/output formats and call parameters. These codes also serve as examples for using the different implementations of our package.
For comparative analyses, we also implemented a test harness under validation to automatize test executions with generated test input graphs from validation/data and monitor elapsed time and memory demands of tested algorithms initiated as separate subprocesses.
To install additional dependencies, run the following commands.
python3.11 -m pip install slambuc[tests] # For executing tests
python3.11 -m pip install slambuc[validation] # For using our test harness framework
Refer to the wiki for formats, execution parameters, examples, and API documentation.
from slambuc.alg.tree.serial.pseudo import pseudo_ltree_partitioning
from slambuc.misc.random import get_random_tree
# Get input parameters
tree = get_random_tree(nodes=10) # Assuming random memory demands are in GB
params = dict(tree=tree,
root=1, # Root node ID
M=6, # Memory upper limit
L=450, # Latency upper limit
cp_end=10, # Critical path: [root -> cp_end]
delay=10 # Platform delay in ms
)
# Partitioning
res = pseudo_ltree_partitioning(**params)
print(f"Part: {res[0]}, opt. cost: {params['M'] * (res[1] / 1000)} GBs, latency: {res[2]} ms")
"Part: [[1, 2], [3, 4], [5, 6, 7, 8], [9], [10]], opt. cost: 7.512 GBs, latency: 449 ms"
Validation results of a subset of our algorithms with a fully serialized block execution model, which are executed with our validation script using different configurations and a random-generated input call graph of size 10.
Used algorithmic parameters (if applicable):
- Root node ID (root): 1
- Memory limit (M): 6
- Available vCPU count (N): 1
- Critical path's end node ID (cp_end): 10
- Latency limit: (L): 500
- Platform delay: (delay): 10
- Bidirectional elimination (bidirectional): True
- Cost approximation ratio (Epsilon): 0.0
- Latency violation ratio (Lambda): 0.0
Exact algorithms are configured to yield all optimal solutions (if exists) with the numerating
format {alg}_{num}
.
Execution results:
Algorithm | Partitioning | Cost | Latency | Time (s) |
---|---|---|---|---|
GREEDY_0 | [[1, 2, 3], [4, 5, 6, 8, 9, 10], [7]] | 858 | 443 | 0.0235749 |
GREEDY_1 | [[1, 3, 4, 5], [2], [6, 8, 9, 10], [7]] | 858 | 474 | 0.0235749 |
GREEDY_2 | [[1, 3, 4, 5], [2], [6, 7, 8, 9], [10]] | 858 | 471 | 0.0235749 |
ILP_CFG_HYBRID | [[1, 3, 4, 5], [2], [6, 8, 9, 10], [7]] | 858 | 474 | 0.0167496 |
ILP_MTX | [[1, 3, 4, 5], [2], [6, 8, 9, 10], [7]] | 858 | 474 | 0.0197985 |
PSEUDO_B | [[1, 2, 3], [4, 5, 6, 8, 9, 10], [7]] | 858 | 443 | 0.00047041 |
PSEUDO_L | [[1, 3, 4, 5], [2], [6, 7, 8, 9], [10]] | 858 | 471 | 0.00083811 |
BIFPTAS_L | [[1, 3, 4, 5], [2], [6, 7, 8, 9], [10]] | 858 | 471 | 0.00082326 |
BASELINE_NO_PART | [[1], [2], [3], [4], [5], [6], [7], [8], [9], [10]] | 1090 | 472 | 9.38e-05 |
BASELINE_SINGLE | [[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]] | 822 | 686 | 6.718e-05 |
If you would like to contribute, add a feature, or just play with the implementations, the development environment can be set up with the following commands.
git clone https://github.com/hsnlab/SLAMBUC.git
python3.11 -m pip install -U -r SLAMBUC/requirements.txt
python3.11 -m pip install --no-deps -e SLAMBUC/
# OR
cd SLAMBUC && make install-req && make dev-install
## Remove editing-mode package outside of repo root
python3.11 -m pip uninstall slambuc
# OR
make uninstall
If you use one of our algorithms published in this package or our test harness, please consider citing one of our related works.
J. Czentye and B. Sonkoly, "Serverless application composition leveraging function fusion: Theory and algorithms," Future Generation Computer Systems 153 pp. 403–418., 16 p. (2024), doi: 10.1016/j.future.2023.12.010.
@ARTICLE{Czentye2024fgcs,
author = {J{\'{a}}nos Czentye and Bal{\'{a}}zs Sonkoly},
title = {{Serverless application composition leveraging function fusion: Theory and algorithms}},
journal = {{Future Generation Computer Systems}},
volume = {153},
pages = {403--418},
year = {2024},
issn = {0167-739X},
doi = {https://doi.org/10.1016/j.future.2023.12.010}
}
J. Czentye, I. Pelle and B. Sonkoly, "Cost-optimal Operation of Latency Constrained Serverless Applications: From Theory to Practice," NOMS 2023-2023 IEEE/IFIP Network Operations and Management Symposium, Miami, FL, USA, 2023, pp. 1-10, doi: 10.1109/NOMS56928.2023.10154412.
@INPROCEEDINGS{Czentye2022noms,
author = {J{\'{a}}nos Czentye and Istv{\'{a}}n Pelle and Bal{\'{a}}zs Sonkoly},
booktitle = {{NOMS 2023-2023 IEEE/IFIP Network Operations and Management Symposium}},
title = {{Cost-optimal Operation of Latency Constrained Serverless Applications: From Theory to Practice}},
publisher = {{IEEE}},
year = {2023},
month = may,
pages = {1--10},
doi = {10.1109/NOMS56928.2023.10154412}
}
I. Pelle, J. Czentye, J. Dóka, A. Kern, B. P. Gerő and B. Sonkoly, "Operating Latency Sensitive Applications on Public Serverless Edge Cloud Platforms," in IEEE Internet of Things Journal, vol. 8, no. 10, pp. 7954–7972, 15 May, 2021, doi: 10.1109/JIOT.2020.3042428.
@ARTICLE{Pelle2021jiot,
author = {Pelle, Istv{\'{a}}n and Czentye, J{\'{a}}nos and D{\'{o}}ka, J{\'{a}}nos and Kern, Andr{\'{a}}s and Ger{\H{o}}, Bal{\'{a}}zs P. and Sonkoly, Bal{\'{a}}zs},
journal = {{IEEE Internet of Things Journal}},
title = {{Operating Latency Sensitive Applications on Public Serverless Edge Cloud Platforms}},
publisher = {Institute of Electrical and Electronics Engineers ({IEEE})},
year = {2021},
month = may,
volume = {8},
number = {10},
pages = {7954--7972},
doi = {10.1109/JIOT.2020.3042428}
}
J. Czentye, I. Pelle, A. Kern, B. P. Gero, L. Toka and B. Sonkoly, "Optimizing Latency Sensitive Applications for Amazon's Public Cloud Platform," 2019 IEEE Global Communications Conference (GLOBECOM), Waikoloa, HI, USA, 2019, pp. 1-7, doi: 10.1109/GLOBECOM38437.2019.9013988.
@INPROCEEDINGS{Czentye2019globecom,
author = {Czentye, Janos and Pelle, Istvan and Kern, Andras and Gero, Balazs Peter and Toka, Laszlo and Sonkoly, Balazs},
booktitle = {{2019 IEEE Global Communications Conference (GLOBECOM)}},
title = {{Optimizing Latency Sensitive Applications for Amazon's Public Cloud Platform}},
publisher = {{IEEE}},
year = {2019},
month = dec,
pages = {1--7},
doi = {10.1109/GLOBECOM38437.2019.9013988}
}
SLAMBUC is an open-source software licensed under Apache 2.0.