Implement Memory-mapped MLModel #275

OctoberChang · 2024-01-07T00:05:00Z

Issue #, if available:

Description of changes:
Implement Memory-mapped MLModel class for both C and Python Interface.

significantly reduce loading time
significantly reduce real-time inference latency

Usage
User needs to have a MLModel saved on disk (in original .npz format),
and manually compile into mmap format by calling compile_mmap_model:

from pecos.xmc import MLModel
npz_model_path = f"/path/to/xlinear/pecos-models/"
mmap_model_path = f"/path/to/xlinear/mmap-models/"
MLModel.compile_mmap_model(npz_model_folder, mmap_model_folder)

Then user can load the memory-mapped model and do inference:

from pecos.utils import smat_util
from pecos.xmc import MLModel
Xt = smat_util.load_matrix(f"/test/data/validation/X.npz")
model = MLModel.load(mmap_model_path, lazy_load=True)
Yp = model.predict(Xt)

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

weiliw-amz · 2024-01-09T19:07:10Z

test/pecos/xmc/xlinear/test_xlinear.py

@@ -1168,3 +1168,36 @@ def test_mmap(tmpdir):
        py_pred = py_model.predict(Xt, **kwargs).todense()
        mmap_pred = mmap_model.predict(Xt, **kwargs).todense()
        assert mmap_pred == approx(py_pred, abs=1e-6), f"post_processor:{pp}"
+
+
+def test_mmap_mlmodel(tmpdir):


I think this test should be put in https://github.com/amzn/pecos/blob/mainline/test/pecos/xmc/test_xmc.py since the functionalities added are for the xmc/base.py class.

Fixed. See latest Revision.

weiliw-amz · 2024-01-09T19:08:54Z

pecos/core/libpecos.cpp

-    // ==== C Interface of XMC Models ====
+    // ==== C Interface of MLModels ====
+    // Only implemented for w_matrix_t = pecos::csc_t
+    //typedef pecos::bin_search_chunked_matrix_t MLMODEL_MAT_T;


Remove this unused comment.

Fixed. See latest Revision.

OctoberChang requested review from weiliw-amz and jiong-zhang January 7, 2024 00:05

OctoberChang force-pushed the mmap-mlmodel branch 2 times, most recently from 37437c5 to 5c06b9e Compare January 8, 2024 21:25

weiliw-amz reviewed Jan 9, 2024

View reviewed changes

Implement Memory-mapped MLModel

8f397c0

OctoberChang force-pushed the mmap-mlmodel branch from 5c06b9e to 8f397c0 Compare January 9, 2024 20:30

weiliw-amz approved these changes Feb 8, 2024

View reviewed changes

Merge branch 'mainline' into mmap-mlmodel

a0ce0e9

OctoberChang merged commit c181496 into amzn:mainline Feb 8, 2024
25 checks passed

OctoberChang deleted the mmap-mlmodel branch February 8, 2024 18:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Memory-mapped MLModel #275

Implement Memory-mapped MLModel #275

OctoberChang commented Jan 7, 2024

weiliw-amz Jan 9, 2024

OctoberChang Jan 9, 2024

weiliw-amz Jan 9, 2024

OctoberChang Jan 9, 2024

Implement Memory-mapped MLModel #275

Implement Memory-mapped MLModel #275

Conversation

OctoberChang commented Jan 7, 2024

weiliw-amz Jan 9, 2024

Choose a reason for hiding this comment

OctoberChang Jan 9, 2024

Choose a reason for hiding this comment

weiliw-amz Jan 9, 2024

Choose a reason for hiding this comment

OctoberChang Jan 9, 2024

Choose a reason for hiding this comment