!LuUSGaeArTeoOgUpwk:matrix.org

kubeflow-kfserving

433 Members
2 Servers

Sender Message Time
11 May 2022
@_slack_kubeflow_U0104H1616Z:matrix.orgiamlovingit Hi Diego Kiner, the community plans to support A/B/n testing using the new inference graph feature, which is currently under review. You can try it if you are interested. 01:00:11
@_slack_kubeflow_U02JYD39G57:matrix.org_slack_kubeflow_U02JYD39G57 changed their display name from _slack_kubeflow_U02JYD39G57 to Zoltán R. Jánki.11:24:06
@_slack_kubeflow_U02JYD39G57:matrix.org_slack_kubeflow_U02JYD39G57 set a profile picture.11:24:08
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun We are cancelling today's community meeting as many folks are not available. Also, a reminder that KubeCon EU is next week and we have quite a few contributors giving KServe talks there! 12:51:04
@_slack_kubeflow_U03D067RTJN:matrix.orgRyan McCaffrey joined the room.19:01:26
@_slack_kubeflow_U9UFLSBM4:matrix.org_slack_kubeflow_U9UFLSBM4 I'm trying out the PVC example with kserve-raw on OpenShift. I have the model on my PV, but when I try to spin up the InferenceService, I get the following from the storage-initializer: https://paste.centos.org/view/e5848b4b Has anyone run into something similar or, better yet, solved it? 20:23:11
@_slack_kubeflow_U9UFLSBM4:matrix.org_slack_kubeflow_U9UFLSBM4 Here is the example I'm working with: https://kserve.github.io/website/modelserving/storage/pvc/pvc/ 20:29:32
@_slack_kubeflow_U9UFLSBM4:matrix.org_slack_kubeflow_U9UFLSBM4 Might be an issue with my storage class being set to WaitForFirstConsumer. Tweaking that to Immediate seems to maybe get me rolling again. 21:08:57
12 May 2022
@_slack_kubeflow_U01T25HRREK:matrix.orgMark Winter Seems like maybe it can't find the file in the PVC? Is your model file called model.joblib like it expects? /mnt/pvc/model.joblib 03:09:17
@_slack_kubeflow_U01T25HRREK:matrix.orgMark Winter It seems scikit-learn model serving is hardcoded to model.joblib file name at the moment. https://github.com/kserve/kserve/issues/2079 03:27:21
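A quick way to rule out the file-name problem described above is to list the mounted directory and check for the expected name. This is a minimal sketch using only the Python standard library; the helper name `check_model_dir` is hypothetical, and the expected file name `model.joblib` and the path `/mnt/pvc` are taken from the messages above rather than from the KServe source.

```python
from pathlib import Path

# Assumption: the file name the scikit-learn server looks for,
# as reported in the messages above.
EXPECTED_MODEL_FILE = "model.joblib"


def check_model_dir(model_dir, expected_name=EXPECTED_MODEL_FILE):
    """Return (ok, files).

    ok is True when a file called expected_name sits directly inside
    model_dir; files is the sorted list of file names found there.
    """
    root = Path(model_dir)
    if not root.is_dir():
        # The mount is missing entirely, so nothing can be found.
        return False, []
    files = sorted(p.name for p in root.iterdir() if p.is_file())
    return expected_name in files, files
```

Calling `check_model_dir("/mnt/pvc")` from a pod that mounts the same PVC would show whether the model is present under the name the server expects, or under a different name that needs renaming.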
@_slack_kubeflow_U03CN7QAHN3:matrix.orgzorba(손주형) Does KServe not support TensorRT? I thought it was possible because of Triton, but TensorRT is not in the guide. 06:14:48
@_slack_kubeflow_U01T25HRREK:matrix.orgMark Winter When you use Triton with KServe you get just a normal Triton server. So you can use TensorRT with Triton as you would normally 06:20:31
@_slack_kubeflow_U01T25HRREK:matrix.orgMark Winter https://velog.io/@pjs102793/Triton-Inference-Server%EC%97%90%EC%84%9C-TensorRT-Engine-Inference 06:22:44
@_slack_kubeflow_U01T25HRREK:matrix.orgMark Winter To run a Triton server, you must create a Model Repository that contains the model files and the configurations for those model files. 06:25:05
@_slack_kubeflow_U03CN7QAHN3:matrix.orgzorba(손주형) Oh, I see. It's just the same as using Triton. Thanks! 06:25:36
@_slack_kubeflow_U01T25HRREK:matrix.orgMark Winter Yeah, just set modelFormat to tensorrt and you will get a Triton server 06:26:49
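The suggestion above can be sketched as an InferenceService manifest built in Python and printed as JSON, which `kubectl apply -f` accepts alongside YAML. This is an illustration under assumptions, not a tested deployment: the service name `my-trt-model` and the storage URI `pvc://my-pvc/model-repository` are hypothetical placeholders, and the storage location is assumed to hold a Triton model repository as described earlier in the thread.

```python
import json


def tensorrt_inference_service(name, storage_uri):
    """Build an InferenceService manifest that requests the tensorrt model format."""
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "predictor": {
                "model": {
                    # Per the message above, this format is served by Triton.
                    "modelFormat": {"name": "tensorrt"},
                    # Hypothetical location of the Triton model repository.
                    "storageUri": storage_uri,
                }
            }
        },
    }


# Hypothetical names, for illustration only.
manifest = tensorrt_inference_service("my-trt-model", "pvc://my-pvc/model-repository")
print(json.dumps(manifest, indent=2))
```

Redirecting the printed JSON to a file and applying it would be one way to try this; whether the Triton runtime in a given cluster lists `tensorrt` among its supported formats is worth checking first.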
@_slack_kubeflow_U01T25HRREK:matrix.orgMark Winter Would there be any issues with having serverless kserve and modelmesh in the same cluster? 06:31:32
13 May 2022
@_slack_kubeflow_U03FCED8QTE:matrix.org_slack_kubeflow_U03FCED8QTE joined the room.05:32:17
@_slack_kubeflow_U032CM1LH3N:matrix.orgSaurabh Agarwal Shri Javadekar ^ 12:20:47
@_slack_kubeflow_U0315UY2WRM:matrix.orgShri Javadekar • What version of KServe are you using? I'm using KServe 0.7 and there is a :predict suffix on the endpoint URL (http://my-model-a.kserve-test.svc.cluster.local/v1/models/my-model-a:predict) • When you say that the URL isn't working, what error are you seeing? 20:56:58
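The endpoint shape mentioned above, with the `:predict` suffix, can be sketched as follows. This is a minimal example using only the Python standard library; the helper names `predict_url` and `build_request` are hypothetical, the host and model name come from the message above, and the `{"instances": [...]}` body assumes the v1 prediction protocol. Nothing is sent until the request is passed to `urllib.request.urlopen`.

```python
import json
import urllib.request


def predict_url(host, model_name):
    """Return the v1 predict endpoint, including the :predict suffix."""
    return f"http://{host}/v1/models/{model_name}:predict"


def build_request(host, model_name, instances):
    """Prepare (but do not send) a POST request for the predict endpoint."""
    body = json.dumps({"instances": instances}).encode("utf-8")
    return urllib.request.Request(
        predict_url(host, model_name),
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

For example, `build_request("my-model-a.kserve-test.svc.cluster.local", "my-model-a", [[1.0, 2.0]])` targets the URL quoted above; the input values are placeholders and depend on the model.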
14 May 2022
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun It should work fine. That is the point of all the work we have been doing to unify on the InferenceService API for deploying single and multiple models. 20:06:34
15 May 2022
@_slack_kubeflow_U03DGNK5EBT:matrix.orgAmanjeet Sahu joined the room.08:38:19
16 May 2022
@_slack_kubeflow_U02S0UGJWCV:matrix.orgwenyang zhou [Image attachment: 截屏2022-05-16 上午11.04.47.png] 03:06:39
@_slack_kubeflow_U02S0UGJWCV:matrix.orgwenyang zhou There seems to be a lock. 03:06:39
@_slack_kubeflow_U02S0UGJWCV:matrix.orgwenyang zhou I have released several revisions over a period of time. 03:09:02
@_slack_kubeflow_U02S0UGJWCV:matrix.orgwenyang zhou The old revision cannot terminate until it finishes pending. 03:09:47
@_slack_kubeflow_U02S0UGJWCV:matrix.orgwenyang zhou And the new revision cannot allocate enough resources until the old revision releases its resources. 03:10:33
@_slack_kubeflow_U02S0UGJWCV:matrix.orgwenyang zhou Could we improve this? 03:10:54
Room Version: 6