0.33.1-alpha (2024-12-03)
- release: release v0.33.1-alpha (7ad830f)
0.33.0-alpha (2024-11-05)
- minio: import updated minio package and add tag on file upload (#704) (7d03354)
- run: write data for usage by model trigger chart record api (#705) (fababc9)
- migration: migrate unsupported task to custom (#709) (f44afd6)
- run: add metadata retention handler (#707) (43a6a61)
- run: add namespace id in response (#708) (2940b1c)
0.32.0-alpha (2024-10-23)
0.31.0-alpha (2024-10-22)
- run: run logging data list by requester API (#695) (b118026)
- service: standardize the tag naming convention (#700) (e08be0d)
- ray: fix upscale deployment failure (#698) (96563c9)
- run: fix run logging update function (#699) (a753fe4)
- run: not return minio error in list model run (#696) (cbeea05)
0.30.2-alpha (2024-10-08)
- minio: fix nil logger causing panic (ef9645e)
- test: switch to minimock and add tests (#687) (141da9d)
0.30.1-alpha (2024-09-13)
0.30.0-alpha (2024-09-10)
0.29.0-alpha (2024-08-29)
- minio: added minio connection (#651) (d17770f)
- minio: added minio upload file bytes function (#656) (1f7b403)
- minio: upgraded to minio v7 and return url and file size (#654) (4c34bf4)
- model: support model statistics in model message (#660) (b460778)
- model: support num_of_runs in list filter (#661) (b161512)
- ray: add separate state for scaling (#659) (f519c82)
- ray: support triggering upscale (#665) (441af13)
- redis: replace I/O data retrieval from redis with minio (#667) (fc46ef8)
- run: fix run logging owner view permission (#668) (5c07c9a)
- ray: fix upscale request timeout (#675) (d24eea5)
- redis: fix input value overwrite (#658) (a383380)
- workflow: fix duplicate key when activity retry (#669) (97d9cd2)
- workflow: fix state check (#673) (0514cae)
0.28.0-alpha (2024-07-31)
- mgmt: use explicit user_id and organization_id in mgmt request (#638) (4af37e6)
- model: implement namespace endpoints (#639) (8c9acac)
- pkg: implement fuzzy search for namespace and model ID (#642) (30b11a8)
- tag: allow custom tags for model (#643) (9c49db6)
- version: embed version names in model struct (#645) (88f107b)
- repository: fix missing tag query (#644) (f1f3229)
- transpiler: fix expression (#649) (4680c92)
- worker: added tests for trigger model activity (#630) (abe562e)
0.27.0-alpha (2024-07-16)
- credit: INS-5058 added check and collect for model trigger (#619) (5d55b5b)
- credit: refactored check and collect for model trigger (#629) (0a14f9e)
- ray: sync deployment configs when starting up (#631) (4a2fc8a)
- service: check trigger permissions when impersonating an org (#635) (427fc45)
0.26.0-alpha (2024-07-02)
- model: support model version deletion (#616) (2dca40b)
- repository: support case-insensitive search models (#621) (26c76b2)
0.25.1-alpha (2024-06-20)
- schema: use camelCase for schema fields (5629b6a)
0.25.0-alpha (2024-06-18)
- endpoints: use camelCase for filter query string (#603) (23955e9)
- handler: use camelCase for HTTP body (#599) (70f6d9a)
- model: support model tag (#600) (ef87bc9)
0.24.0-alpha (2024-06-06)
- model: adopt containerized model serving (#542)
- handler: implement get latest operation (#589) (33d2395)
- handler: support listing available regions for model deployment (#561) (52c2172)
- handler: support model profile image (#566) (0c8dbba)
- model: add permission field in model object (#576) (2d36a58)
- model: add task schema in model struct (#578) (647069d)
- model: adopt containerized model serving (#542) (3c80f39)
- model: embed sample input/output in model proto message (#558) (5fba538)
- model: support latest model version trigger (#580) (47cb36c)
- model: support resource spec in model definition (#557) (fee6e4b)
- model: support search/filter with list endpoints (#559) (7b17393)
- model: support watch latest model and order_by for list endpoints (#586) (1a5e48c)
- prediction: implement sync/async prediction records (#555) (8d58eda)
- ray: support containerized model deployment (#529) (4dcab05)
- ray: support custom accelerator type (#547) (f0cc0d7)
- acl: fix wrong type name (#560) (89d09a5)
- dockerfile: update deploy config yaml path (#590) (ee369e0)
- model: fix missing package in test models (#552) (a28a21b)
- ray: check CDI availability for model container (#538) (28bad42)
- server: add missing message size option (#597) (d0a0aac)
- service: fix list model version pagination (#569) (d8fb04a)
- service: fix list model version return list size (#556) (9b69f9c)
0.23.0-alpha (2024-03-09)
- handler,ray: fix reconciliation model status and namespace (#525) (62a30b6)
- redis: delete redis key when errored (#526) (bb4e18d)
0.22.0-alpha (2024-02-20)
- triton: deprecate triton inference server (#512)
- cmd,pkg: refactor codebase to align with golangci-lint checks (#506) (b213812)
- handler: fix multipart request (352a4ae)
- pkg: fix isError and set maxBatchSize to 0 (2adfe5b)
- pkg: fix org model namespace (#510) (f4be09c)
- service: fix workflow retry when deleting (adcbde5)
- service: remove org subscription check (76cd66f)
- usage: add missing org usage collection (239d3f4)
- worker: fix temporal cloud namespace init (#513) (17c5d68)
0.21.3-alpha (2024-01-30)
- model: fix indexing error in text2img and img2img postprocessing (#501) (0ba505b)
- model: fix missing field in ray while serving img2img task (#496) (f572f18)
- payload: fix wrong form data key (#503) (4d69e5e)
0.21.2-alpha (2024-01-25)
- main: fix misused return statement (5cbfc3d)
0.21.1-alpha (2024-01-02)
- release: release v0.21.1-alpha (bd320b0)
0.21.0-alpha (2023-12-14)
- model: refactoring AI Tasks for Consistency Across Text and Image Generation (#461) (e827130)
- redis: use redis for model state caching (#472) (3b6b977)
0.20.0-alpha (2023-11-30)
- model: Enhancements for Image Input in Text 2 Image Task (#457) (eb604a1)
- ray: use shared python executable (#455) (db9658b)
- model: fix deployment reconciliation (#459) (bac1961)
- ray: fix model file extension (#453) (424d632)
0.19.0-alpha (2023-11-11)
- model: Support New Fields for Multi-Modal Model In Text Generation Task and Refactor Existing Ones (#448) (49bdf5b)
- ray: add ray serve as model serving backend (#445) (a9b4005)
- predeploy: fix predeploy model missing triton models reference (3f296cd)
- ray: fix model healthcheck causing scaling loop (#450) (4d8cdbf)
- ray: fix unzipping ray model (ca79411)
- service: fix fail model deletion in state error (#449) (91125c0)
0.18.0-alpha (2023-10-26)
- model: Enhancements for Llava Model Support and Model Hub File Movement (#434) (58cb97c)
- model: Support for LLM-like models in TRITON Inference Server (#432) (590eb0b)
- Dockerfile: fix Python 3.11 using Debian base image (#438) (2ace6eb)
- payload: fix incorrect conversion between integer types (#440) (32bffea)
0.17.2-alpha (2023-10-13)
- model: fix init model namespace (77a35b3)
0.17.1-alpha (2023-09-30)
0.17.0-alpha (2023-09-13)
- release: release v0.17.0-alpha (70172a2)
0.16.11-alpha (2023-08-19)
- release: release v0.16.11-alpha (5aba1ce)
0.16.10-alpha (2023-08-03)
- release: release v0.16.10-alpha (1cd7990)
0.16.9-alpha (2023-07-20)
- release: release v0.16.9-alpha (485a9fd)
0.16.8-alpha (2023-07-09)
- release: release v0.16.8-alpha (8251037)
0.16.7-alpha (2023-06-20)
- release: release 0.16.7-alpha (c8ef5c4)
0.16.6-alpha (2023-06-11)
- release: release v0.16.6-alpha (c1f57a9)
0.16.5-alpha (2023-06-02)
- release: release v0.16.5-alpha (b8ba368)
0.16.4-alpha (2023-05-11)
- release: release v0.16.4-alpha (ab8cf12)
0.16.3-alpha (2023-05-06)
0.16.2-alpha (2023-04-25)
- release: release v0.16.2-alpha (b735b17)
0.16.1-alpha (2023-04-24)
- release 0.16.1-alpha (1d497e0)
0.16.0-alpha (2023-04-15)
0.15.0-alpha (2023-04-07)
- controller: add model state monitoring with controller (#323) (4397826)
- remove model instance (#320) (15e1b62)
- support model caching (#317) (d15ffba)
0.14.0-alpha (2023-03-26)
- config: use private port for mgmt-backend (#307) (3264e2b)
- list models and model instances pagination (#304) (1f19ed4)
0.13.1-alpha (2023-02-26)
- create a subfolder in model-repository if needed (#290) (7f8d78b)
- fix creating subfolder (105a11a)
- fix subfolder creation (#292) (0b6ec3f)
- fix variable name (#293) (a7995dd)
0.13.0-alpha (2023-02-23)
0.12.1-alpha (2023-02-12)
0.12.0-alpha (2023-02-10)
0.11.1-alpha (2023-01-20)
0.11.0-alpha (2023-01-14)
- release 0.11.0-alpha (d592acb)
0.10.0-alpha (2022-12-23)
- support async deploy and undeploy model instance (#192) (ed36dc7)
- support semantic segmentation (#203) (f22262c)
- model instance state update to unspecified state (#206) (14c87d5)
- panic error with nil object (#208) (a342113)
0.9.1-alpha (2022-11-28)
- HuggingFace batching bug in preprocess model (b1582e8)
0.9.0-alpha (2022-10-19)
0.8.1-alpha (2022-09-19)
0.8.0-alpha (2022-09-14)
0.7.3-alpha (2022-09-07)
- release 0.7.3-alpha (9033c50)
0.7.2-alpha (2022-08-22)
- release 0.7.2-alpha (17529d6)
0.7.1-alpha (2022-08-21)
- post process ocr task (e387154)
0.7.0-alpha (2022-08-17)
0.6.3-alpha (2022-07-19)
- fix client stream server recv wrong file length interval (#143) (0e06f7c)
- post process for unspecified task output (ad88068)
- trigger image with 4 channel (#141) (7445f5f)
0.6.2-alpha (2022-07-12)
- release v0.6.2-alpha (4365f32)
0.6.1-alpha (2022-07-11)
- release v0.6.1-alpha (f18dc30)
0.6.0-alpha (2022-07-06)
0.5.1-alpha (2022-06-27)
- release v0.5.1-alpha (895056d)
0.5.0-alpha (2022-06-26)
- bug usage storage (#103) (975fdc1)
- fix duration configuration bug (ee4a310)
- init config before logger (9d3fb4a)
- status code when deploy model error (#111) (31d3f11)
- update model definitions and tasks in usage collection (#100) (c593087)
- wrong logic when checking user account and service account (7058db6)
0.4.2-alpha (2022-05-31)
- release 0.4.2-alpha (fc5a14a)
0.4.1-alpha (2022-05-19)
- add writeonly to description (f59d98f)
- clone repository and make folder (ac79386)
- model configuration response in integration test (0225c1e)
- refactor JSON schema (f24db48)
0.4.0-alpha (2022-05-13)
0.3.2-alpha (2022-03-22)
- release 0.3.2-alpha (9f8cd91)
0.3.1-alpha (2022-03-21)
- fix unload model issue causing Triton server OOM (#42) (fb4d1d1)
- update version order when get model version list (#38) (83c054a)
0.3.0-alpha (2022-02-24)
- correct version when making inference (#31) (c918e77)
- update docker compose file for building dev image (#29) (83cba09)
0.2.0-alpha (2022-02-19)
- add gRPC Gateway and GetModel API (#7) (bff6fc9)
- support model name when creating model (#25) (7d799b7)
0.1.0-alpha (2022-02-12)
- add codebase for model grpc service (4defa3e)
- add link for guideline create Conda environment file (7ee8e06)
- logic when ensemble or not (ab8e7c1)
- postgres host (a322165)
- return list of models in list method (b88ebd7)
- update db schema, protobuf generated files and create model, version in upload api (7573e54)
- update predict for ensemble model (016f11c)