Skip to content

Latest commit

 

History

History
735 lines (368 loc) · 47.4 KB

CHANGELOG.md

File metadata and controls

735 lines (368 loc) · 47.4 KB

Changelog

0.33.1-alpha (2024-12-03)

Miscellaneous Chores

  • release: release v0.33.1-alpha (7ad830f)

0.33.0-alpha (2024-11-05)

Features

  • minio: import updated minio package and add tag on file upload (#704) (7d03354)
  • run: write data for usage by model trigger chart record api (#705) (fababc9)

Bug Fixes

  • migration: migrate unsupported task to custom (#709) (f44afd6)
  • run: add metadata retention handler (#707) (43a6a61)
  • run: add namespace id in response (#708) (2940b1c)

0.32.0-alpha (2024-10-23)

Features

  • run: add requester id in list model run response (#701) (8c22eec)

0.31.0-alpha (2024-10-22)

Features

  • run: run logging data list by requester API (#695) (b118026)
  • service: standardize the tag naming convention (#700) (e08be0d)

Bug Fixes

  • ray: fix upscale deployment failure (#698) (96563c9)
  • run: fix run logging update function (#699) (a753fe4)
  • run: not return minio error in list model run (#696) (cbeea05)

0.30.2-alpha (2024-10-08)

Bug Fixes

  • minio: fix nil logger causing panic (ef9645e)
  • test: switch to minimock and add tests (#687) (141da9d)

0.30.1-alpha (2024-09-13)

Bug Fixes

0.30.0-alpha (2024-09-10)

Features

0.29.0-alpha (2024-08-29)

Features

  • minio: added minio connection (#651) (d17770f)
  • minio: added minio upload file bytes function (#656) (1f7b403)
  • minio: upgraded to minio v7 and return url and file size (#654) (4c34bf4)
  • model: support model statistics in model message (#660) (b460778)
  • model: support num_of_runs in list filter (#661) (b161512)
  • ray: add separate state for scaling (#659) (f519c82)
  • ray: support triggering upscale (#665) (441af13)
  • redis: replace I/O data retrieval from redis with minio (#667) (fc46ef8)
  • run: fix run logging owner view permission (#668) (5c07c9a)

Bug Fixes

0.28.0-alpha (2024-07-31)

Features

  • mgmt: use explicit user_id and organization_id in mgmt request (#638) (4af37e6)
  • model: implement namespace endpoints (#639) (8c9acac)
  • pkg: implement fuzze search for namespace and model ID (#642) (30b11a8)
  • tag: allow custom tags for model (#643) (9c49db6)
  • version: embed version names in model struct (#645) (88f107b)

Bug Fixes

0.27.0-alpha (2024-07-16)

Features

  • credit: INS-5058 added check and collect for model trigger (#619) (5d55b5b)
  • credit: refactored check and collect for model trigger (#629) (0a14f9e)
  • ray: sync deployment configs when starting up (#631) (4a2fc8a)
  • service: check trigger permissions when impersonating an org (#635) (427fc45)

Bug Fixes

  • worker: unit test for TriggerModelActivity (#624) (dec7a60)

0.26.0-alpha (2024-07-02)

Features

  • model: support model version deletion (#616) (2dca40b)
  • repository: support case-insensitive search models (#621) (26c76b2)

Bug Fixes

  • redis: fix misconfigured ttl (f5da795)
  • worker: fix mishandled workflow not found (0189dd8)

0.25.1-alpha (2024-06-20)

Bug Fixes

  • schema: use camelCase for schema fields (5629b6a)

0.25.0-alpha (2024-06-18)

Features

0.24.0-alpha (2024-06-06)

⚠ BREAKING CHANGES

  • model: adopt containerized model serving (#542)

Features

  • handler: implement get latest operation (#589) (33d2395)
  • handler: support listing available regions for model deployment (#561) (52c2172)
  • handler: support model profile image (#566) (0c8dbba)
  • model: add permission field in model object (#576) (2d36a58)
  • model: add task schema in model struct (#578) (647069d)
  • model: adopt containerized model serving (#542) (3c80f39)
  • model: embed sample input/output in model proto message (#558) (5fba538)
  • model: support latest model version trigger (#580) (47cb36c)
  • model: support resource spec in model definition (#557) (fee6e4b)
  • model: support search/filter with list endpoints (#559) (7b17393)
  • model: support watch latest model and order_by for list endpoints (#586) (1a5e48c)
  • prediction: implement sync/async prediction records (#555) (8d58eda)
  • ray: support containerized model deployment (#529) (4dcab05)
  • ray: support custom accelerator type (#547) (f0cc0d7)

Bug Fixes

  • acl: fix wrong type name (#560) (89d09a5)
  • dockerfile: update deploy config yaml path (#590) (ee369e0)
  • model: fix missing package in test models (#552) (a28a21b)
  • ray: check CDI availability for model container (#538) (28bad42)
  • server: add missing message size option (#597) (d0a0aac)
  • service: fix list model version pagination (#569) (d8fb04a)
  • service: fix list model version return list size (#556) (9b69f9c)

0.23.0-alpha (2024-03-09)

Features

  • pkg: use mgmtPB.Owner to embed the owner information (#523) (37d5708)

Bug Fixes

  • handler,ray: fix reconciliation model status and namespace (#525) (62a30b6)
  • redis: delete redis key when errored (#526) (bb4e18d)

0.22.0-alpha (2024-02-20)

⚠ BREAKING CHANGES

  • triton: deprecate triton inference server (#512)

Features

  • acl,org: adopt ACL and add organization endpoints (#504) (13a1650)

Bug Fixes

  • cmd,pkg: refactor codebase to align with golanci-linter checks (#506) (b213812)
  • handler: fix multipart request (352a4ae)
  • pkg: fix isError and set maxBatchSize to 0 (2adfe5b)
  • pkg: fix org model namespace (#510) (f4be09c)
  • service: fix workflow retry when deleting (adcbde5)
  • service: remove org subscription check (76cd66f)
  • usage: add missing org usage collection (239d3f4)
  • worker: fix temporal cloud namespace init (#513) (17c5d68)

Code Refactoring

  • triton: deprecate triton inference server (#512) (f8a277d)

0.21.3-alpha (2024-01-30)

Bug Fixes

  • model: fix indexing error in text2img and img2img postprocessing (#501) (0ba505b)
  • model: fix missing field in ray while serving img2img task (#496) (f572f18)
  • payload: fix wrong form data key (#503) (4d69e5e)

0.21.2-alpha (2024-01-25)

Bug Fixes

  • main: fix misused return statement (5cbfc3d)

0.21.1-alpha (2024-01-02)

Features

  • service: support basic github pat to avoid rate-limit (#477) (45931ca)

Miscellaneous Chores

  • release: release v0.21.1-alpha (bd320b0)

0.21.0-alpha (2023-12-14)

Features

  • model: refactoring AI Tasks for Consistency Across Text and Image Generation (#461) (e827130)
  • redis: use redis for model state caching (#472) (3b6b977)

Bug Fixes

  • model: fix grpc message size limit issue (#474) (1ec7ae1)

0.20.0-alpha (2023-11-30)

Features

  • model: Enhancements for Image Inpu in Text 2 Image Task (#457) (eb604a1)
  • ray: use shared python executable (#455) (db9658b)

Bug Fixes

0.19.0-alpha (2023-11-11)

Features

  • model: Support New Fields for Multi-Modal Model In Text Generation Task and Refactor Existing Ones (#448) (49bdf5b)
  • ray: add ray serve as model serving backend (#445) (a9b4005)

Bug Fixes

  • predeploy: fix predeploy model missing triton models reference (3f296cd)
  • ray: fix model healthcheck causing scaling loop (#450) (4d8cdbf)
  • ray: fix unziping ray model (ca79411)
  • service: fix fail model deletion in state error (#449) (91125c0)

0.18.0-alpha (2023-10-26)

Features

  • model: Enhancements for Llava Model Support and Model Hub File Movement (#434) (58cb97c)
  • model: Support for LLM-like models in TRITON Inference Server (#432) (590eb0b)

Bug Fixes

  • Dockerfile: fix Python 3.11 using Debian base image (#438) (2ace6eb)
  • payload: fix incorrect conversion between integer types (#440) (32bffea)

0.17.2-alpha (2023-10-13)

Bug Fixes

  • model: fix init model namespace (77a35b3)

0.17.1-alpha (2023-09-30)

Bug Fixes

  • main: fix namespace error when deploying model (#423) (dd5badf)

0.17.0-alpha (2023-09-13)

Miscellaneous Chores

  • release: release v0.17.0-alpha (70172a2)

0.16.11-alpha (2023-08-19)

Miscellaneous Chores

  • release: release v0.16.11-alpha (5aba1ce)

0.16.10-alpha (2023-08-03)

Miscellaneous Chores

  • release: release v0.16.10-alpha (1cd7990)

0.16.9-alpha (2023-07-20)

Miscellaneous Chores

  • release: release v0.16.9-alpha (485a9fd)

0.16.8-alpha (2023-07-09)

Miscellaneous Chores

  • release: release v0.16.8-alpha (8251037)

0.16.7-alpha (2023-06-20)

Miscellaneous Chores

  • release: release 0.16.7-alpha (c8ef5c4)

0.16.6-alpha (2023-06-11)

Miscellaneous Chores

  • release: release v0.16.6-alpha (c1f57a9)

0.16.5-alpha (2023-06-02)

Miscellaneous Chores

  • release: release v0.16.5-alpha (b8ba368)

0.16.4-alpha (2023-05-11)

Miscellaneous Chores

  • release: release v0.16.4-alpha (ab8cf12)

0.16.3-alpha (2023-05-06)

Bug Fixes

0.16.2-alpha (2023-04-25)

Miscellaneous Chores

  • release: release v0.16.2-alpha (b735b17)

0.16.1-alpha (2023-04-24)

Bug Fixes

  • pass the context between package layers (#345) (e6e7f2f)

Miscellaneous Chores

0.16.0-alpha (2023-04-15)

Features

0.15.0-alpha (2023-04-07)

Features

0.14.0-alpha (2023-03-26)

Features

  • add private endpoint and gRPC test cases (#306) (bb3c193)

Bug Fixes

  • config: use private port for mgmt-backend (#307) (3264e2b)
  • list models and model instances pagination (#304) (1f19ed4)

0.13.1-alpha (2023-02-26)

Bug Fixes

0.13.0-alpha (2023-02-23)

Features

  • add support for text generation tasks (#252) (767ec45)

Bug Fixes

  • keep format for empty inference output (#258) (e2a2e48)

0.12.1-alpha (2023-02-12)

Bug Fixes

0.12.0-alpha (2023-02-10)

Features

Bug Fixes

  • fix usage client nil issue when mgmt-backend not ready (#241) (4290159)

0.11.1-alpha (2023-01-20)

Bug Fixes

0.11.0-alpha (2023-01-14)

Miscellaneous Chores

0.10.0-alpha (2022-12-23)

Features

  • support async deploy and undeploy model instance (#192) (ed36dc7)
  • support semantic segmentation (#203) (f22262c)

Bug Fixes

  • model instance state update to unspecified state (#206) (14c87d5)
  • panic error with nil object (#208) (a342113)

0.9.1-alpha (2022-11-28)

Bug Fixes

  • HuggingFace batching bug in preprocess model (b1582e8)

0.9.0-alpha (2022-10-19)

Features

Bug Fixes

  • allow updating emtpy description for a model (#177) (100ec84)

0.8.1-alpha (2022-09-19)

Bug Fixes

  • update description for GitHub model from user input (#173) (821dab3)

0.8.0-alpha (2022-09-14)

Features

0.7.3-alpha (2022-09-07)

Features

Miscellaneous Chores

0.7.2-alpha (2022-08-22)

Miscellaneous Chores

0.7.1-alpha (2022-08-21)

Bug Fixes

0.7.0-alpha (2022-08-17)

Features

0.6.3-alpha (2022-07-19)

Bug Fixes

  • fix client stream server recv wrong file length interval (#143) (0e06f7c)
  • post process for unspecified task output (ad88068)
  • trigger image with 4 channel (#141) (7445f5f)

0.6.2-alpha (2022-07-12)

Miscellaneous Chores

0.6.1-alpha (2022-07-11)

Miscellaneous Chores

0.6.0-alpha (2022-07-06)

Features

  • support creating a HuggingFace model (#113) (1577d87)

Bug Fixes

  • model definition in list model and missing zero in output (#121) (a90072d)

0.5.1-alpha (2022-06-27)

Miscellaneous Chores

0.5.0-alpha (2022-06-26)

Features

Bug Fixes

  • bug usage storage (#103) (975fdc1)
  • fix duration configuration bug (ee4a310)
  • init config before logger (9d3fb4a)
  • status code when deploy model error (#111) (31d3f11)
  • update model definitions and tasks in usage collection (#100) (c593087)
  • wrong logic when checking user account and service account (7058db6)

0.4.2-alpha (2022-05-31)

Bug Fixes

  • fix config path (a8cf2c0)
  • regexp zap logger with new protobuf package (8b9c463)

Miscellaneous Chores

0.4.1-alpha (2022-05-19)

Bug Fixes

  • add writeonly to description (f59d98f)
  • clone repository and make folder (ac79386)
  • model configuration response in integration test (0225c1e)
  • refactor JSON schema (f24db48)

0.4.0-alpha (2022-05-13)

Features

Bug Fixes

  • refactor model definition and model JSON schema (#73) (0cce154)

0.3.2-alpha (2022-03-22)

Miscellaneous Chores

0.3.1-alpha (2022-03-21)

Bug Fixes

  • fix unload model issue causing Triton server OOM (#42) (fb4d1d1)
  • update version order when get model version list (#38) (83c054a)

0.3.0-alpha (2022-02-24)

Features

  • support url/base64 content prediction (#34) (a88ddfd)

Bug Fixes

  • correct version when making inference (#31) (c918e77)
  • update docker compose file for building dev image (#29) (83cba09)

0.2.0-alpha (2022-02-19)

Features

  • add gRPC Gateway and GetModel API (#7) (bff6fc9)
  • support model name when creating model (#25) (7d799b7)

Bug Fixes

0.1.0-alpha (2022-02-12)

Features

  • add codebase for model grpc service (4defa3e)

Bug Fixes

  • add link for guideline create Conda environment file (7ee8e06)
  • logic when essemble or not (ab8e7c1)
  • postgres host (a322165)
  • return list of models in list method (b88ebd7)
  • update db schema, protobuf generated files and create model, version in upload api (7573e54)
  • update predict for essemble model (016f11c)