10 May 2022 |
Timos | Download image.png | 19:27:43 |
Timos | Redacted or Malformed Event | 19:27:44 |
11 May 2022 |
iamlovingit | Hi, Diego Kiner community plans to supports ABN test by using the new feature inference graph , which is in reviewing progress now, you can try this if you are interesting. | 01:00:11 |
| _slack_kubeflow_U02JYD39G57 changed their display name from _slack_kubeflow_U02JYD39G57 to Zoltán R. Jánki. | 11:24:06 |
| _slack_kubeflow_U02JYD39G57 set a profile picture. | 11:24:08 |
Dan Sun | We are cancelling today's community as many folks are not available, also remind that Kubecon EU is next week and we have quite a few contributors giving KServe talks there! | 12:51:04 |
| Ryan McCaffrey joined the room. | 19:01:26 |
_slack_kubeflow_U9UFLSBM4 | I'm trying out the pvc example for with kserve-raw on OpenShift. I have the model on my PV, but when I try to spin-up the inferenceservice, I get the following from the storage-initializer: https://paste.centos.org/view/e5848b4b Has anyone ran into something similar or better yet, solved it? | 20:23:11 |
_slack_kubeflow_U9UFLSBM4 | Here is the example I'm working with: https://kserve.github.io/website/modelserving/storage/pvc/pvc/ | 20:29:32 |
_slack_kubeflow_U9UFLSBM4 | Might be an issue with my storage class being set to WaitForFirstConsumer. Tweaking that to Immediate seems to maybe get me rolling again. | 21:08:57 |
12 May 2022 |
Mark Winter | Seems like maybe it can't find the file in the PVC? Is your model file called model.joblib like it expects? /mnt/pvc/model.joblib | 03:09:17 |
Mark Winter | It seems scikit-learn model serving is hardcoded to model.joblib file name at the moment. https://github.com/kserve/kserve/issues/2079 | 03:27:21 |
zorba(손주형) | Is kserve not support tensorRT?? I thought it possible because of triton but tensorRT is not in the guide. | 06:14:48 |
Mark Winter | When you use Triton with KServe you get just a normal Triton server. So you can use TensorRT with Triton as you would normally | 06:20:31 |
Mark Winter | https://velog.io/@pjs102793/Triton-Inference-Server%EC%97%90%EC%84%9C-TensorRT-Engine-Inference | 06:22:44 |
Mark Winter | Triton 서버를 동작시키기 위해서는 반드시 모델 파일과 모델 파일에 대한 Configuration 들을 포함하고 있는 Model Repository 를 생성해야 합니다. | 06:25:05 |
zorba(손주형) | oh i see. it’s just same as usning triton. Thanks! | 06:25:36 |
Mark Winter | Yer, just set modelFormat to tensorrt and you will get a Triton server | 06:26:49 |
Mark Winter | Would there be any issues with having serverless kserve and modelmesh in the same cluster? | 06:31:32 |
13 May 2022 |
| _slack_kubeflow_U03FCED8QTE joined the room. | 05:32:17 |
Saurabh Agarwal | Shri Javadekar ^ | 12:20:47 |
Alexandre Brown | Redacted or Malformed Event | 17:36:35 |
Shri Javadekar | • What version of Kserve are you using? I'm using kserve 0.7 and there is a :predict suffix to the endpoint url (http://my-model-a.kserve-test.svc.cluster.local/v1/models/my-model-a:predict )
• When you say that the url isn't working, what's the error you are seeing? | 20:56:58 |
14 May 2022 |
Dan Sun | Should work fine, that’s all the work we have doing to unifying on the inference service api to deploy single and multi models | 20:06:34 |
15 May 2022 |
| Amanjeet Sahu joined the room. | 08:38:19 |
16 May 2022 |
wenyang zhou | Download 截屏2022-05-16 上午11.04.47.png | 03:06:39 |
wenyang zhou | There seems to be a lock | 03:06:39 |
wenyang zhou | I have release several revision in a period of time. | 03:09:02 |
wenyang zhou | The old revision can not terminate before it finish pending . | 03:09:47 |
wenyang zhou | And the new revision can not allocate enough resource before the old revision release resource. | 03:10:33 |