!LuUSGaeArTeoOgUpwk:matrix.org

kubeflow-kfserving

433 Members
2 Servers

Load older messages


SenderMessageTime
3 Jun 2022
@_slack_kubeflow_U03HW39JNLV:matrix.orgChristian Lehre Yes, i already checked that. Should i use the workload identity namespace somehow? 09:57:42
@_slack_kubeflow_UM56LA7N3:matrix.orgBenjamin Tan Workload identity is based on the google service account which has access to GCS 09:58:37
@_slack_kubeflow_UM56LA7N3:matrix.orgBenjamin Tan U then bind it with the kubernetes service account In ur nameapace of your choosing 09:58:57
@_slack_kubeflow_U03HW39JNLV:matrix.orgChristian Lehre Benjamin Tan no success.. Im binding to a principal with list access to cloud storage (kubeflow-user@ project .iam.gserviceaccount.com), but it says that the storage.objectViewer role is not supported for this resource 11:02:47
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun https://github.com/kserve/kserve/blob/master/docs/samples/v1beta1/advanced/timeout.yaml#L7 11:15:49
@_slack_kubeflow_U03CN7QAHN3:matrix.orgzorba(손주형) yeah but this is not about terminationGracePeriodSecond. it’s timeout of calling predictor service (In seconds). As in the document.
apiVersion: "serving.kserve.io/v1beta1"
kind: "InferenceService"
metadata:
  name: "pytorch-cifar10"
spec:
  predictor:
    timeout: 60
    minReplicas: 1
    batcher:
      maxBatchSize: 32
      maxLatency: 5000
    pytorch:
      storageUri: " gs://kfserving-examples/models/torchserve/image-classifier "
maxBatchSize: the max batch size for triggering a prediction. • maxLatency: the max latency for triggering a prediction (In milliseconds). • timeout: timeout of calling predictor service (In seconds).
11:39:20
@_slack_kubeflow_U03HW39JNLV:matrix.orgChristian Lehre Nevermind, I figured it out. Had to annotate the service account in the namespace with the gcp service principal. Thank you so much for your help, you really guided me into the right track 😄 11:54:12
@_slack_kubeflow_UM56LA7N3:matrix.orgBenjamin Tan Yayyyyy 11:57:14
@_slack_kubeflow_UM56LA7N3:matrix.orgBenjamin Tan Awesome 👌 👏 👍 11:57:19
@californiatl:matrix.org@californiatl:matrix.orgI'll help 10 individuals how to earn $20,000 in just 72 hours from the crypto market. But you will pay me 10% commission when you receive your profit. if interested send me a direct message via WhatsApp by asking me HOW for more details on how to get started +1 (229778188111:58:06
@_slack_kubeflow_U01N2J7JDAL:matrix.orgSebastian Lehrig any admin here to get rid of this spam? 12:01:49
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun The termination grace period is controlled by timeout because you need to wait for the request to drain and finish processing before shutting down the pod. 12:36:22
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun That’s why it is not allowed setting termination grace period directly 12:36:50
@_slack_kubeflow_U03CN7QAHN3:matrix.orgzorba(손주형) very nice. thanks 13:24:57
@_slack_kubeflow_UR0H6TE3A:matrix.orgDemetrios joined the room.14:41:41
@_slack_kubeflow_U022U7KG24W:matrix.org_slack_kubeflow_U022U7KG24W Shri Javadekar: I followed this https://kserve.github.io/website/0.8/developer/developer/#install-knative-on-a-kubernetes-cluster to test it out. You can checkout the branch for PR https://github.com/kserve/kserve/pull/1910 and follow this https://kserve.github.io/website/0.8/developer/developer/#deploy-kserve-with-your-own-version 15:58:41
@_slack_kubeflow_U039JEMLJ1W:matrix.orgRobert Irvine hi, I use kserve batching which is great but it means you cannot pass in parameters others than the inference text. For debugging I want to be able to bypass this and just call with parameters directly (e.g diff temperature values for nlp model). Is this possible? 16:19:18
@_slack_kubeflow_U039JEMLJ1W:matrix.orgRobert Irvine if its not possible then I have to deploy a second instance without batching just for debugging with additional parameters (which kinda sucks) 16:20:47
@_slack_kubeflow_UL38EU0BG:matrix.orgThomas Ounnas Kenneth Koski it seems you installed this app, could you remove this spams ? 17:50:09
@_slack_kubeflow_U0278DYPCJE:matrix.orgPradeep Thalasta joined the room.18:39:20
@_slack_kubeflow_U0278DYPCJE:matrix.orgPradeep ThalastaRedacted or Malformed Event18:39:26
@_slack_kubeflow_UL38EU0BG:matrix.orgThomas Ounnas Hello 👋 , Just a quick feedback, it may have change in new KServe version, but for those on Kubeflow v1.4 (KFServing/KServe v0.6.1) like me, I had an issue when trying to deploy an sklearn inference service, and got a 404...🤔 Everything was ok (pod, authpolicy, iam role service account, etc..) I kinda went crazy since 404 means Not Found but I had the logs of the pod with the requests’ hit arriving... It turns out... hidden in the logs, I found an error with the load of the model, indeed, KServe v0.6.1 rely on sklearn 0.20.3 and there was a version mismatch.... 🤦‍♂️ Kinda silly error, that I would have found more quickly if the status was a 502 (since I would have look at the right place, server side error) 😅 20:07:04
@_slack_kubeflow_U02LE3KB53M:matrix.orgVivian Pan Shri Javadekar I already have the v0.8.0 version of kserve installed in a cluster, then I built and pushed the latest version of the kserve controller and router from the graph feature branch to a registry. Then reconfigure the existing kserve set up from v0.8.0 images to the feature branch images. You should be able to test out the inference graph features. Otherwise you can do the local dev setup that is documented above 21:11:35
@californiatl:matrix.org@californiatl:matrix.orgI'll help 10 individuals how to earn $20,000 in just 72 hours from the crypto market. But you will pay me 10% commission when you receive your profit. if interested send me a direct message via WhatsApp by asking me HOW for more details on how to get started +1 (229778188121:18:44
@_slack_kubeflow_U0315UY2WRM:matrix.orgShri Javadekar Got it.. thanks for the reply! 21:18:51
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun Where can you see the installed app? 22:26:50
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun It means that the model is not found, I think in later version we actually failed the start up in case model is not loaded 22:38:53
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun Mathew Wicks can you help remove the spam app? 22:41:50
4 Jun 2022
@_slack_kubeflow_U013UG6SQAJ:matrix.orgMathew Wicks James Wu can you please remove the "element bridge" slack app ASAP? It is spamming this channel with commercial ads. 01:21:12
@californiatl:matrix.org@californiatl:matrix.org I'll help 10 individuals how to earn $20,000 in just 72 hours from the crypto market. But you will pay me 10% commission when you receive your profit. if interested send me a direct message via WhatsApp by asking me HOW for more details on how to get started > +1 (2297781881 09:49:01

Show newer messages


Back to Room ListRoom Version: 6