!LuUSGaeArTeoOgUpwk:matrix.org

kubeflow-kfserving

434 Members
3 Servers

Load older messages


SenderMessageTime
17 May 2022
@_slack_kubeflow_U036ZCFAFLP:matrix.orgJonny Browning changed their display name from _slack_kubeflow_U036ZCFAFLP to Jonny Browning.22:20:28
@_slack_kubeflow_U036ZCFAFLP:matrix.orgJonny Browning set a profile picture.22:20:33
@_slack_kubeflow_U022U7KG24W:matrix.orgRachit Chauhan (commenting to follow the thread) 22:58:55
18 May 2022
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun Diego Kiner We create graph orchestrator as a service to chain the requests, so the entry point is the graph orchestrator service 00:27:36
@_slack_kubeflow_U02PHBULPDZ:matrix.orgDiego Kiner Ok, we're trying to work off of the changes in the PR to get it working - how is the orchestrator created? 00:28:32
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun what’s the use case of having two containers on transformer? 00:29:57
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun there is a docker file on the PR to build the graph orchestrator image 00:32:10
@_slack_kubeflow_U02PHBULPDZ:matrix.orgDiego Kiner I see the router.Dockerfile - is this what you mean? But even if built, how is it meant to be deployed? 00:35:01
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun just run make deploy-dev that will deploy the graph orchestrator image and when you create the inference graph it will expose the endpoint for you 00:42:46
@_slack_kubeflow_U02PHBULPDZ:matrix.orgDiego Kiner Ok thanks! will give it a try 00:43:05
@_slack_kubeflow_U02PHBULPDZ:matrix.orgDiego Kiner ``kustomize build config/overlays/development , kubectl apply -f - Error: trouble configuring builtin PatchStrategicMergeTransformer with config: paths: - configmap/inferenceservice_patch.yaml - manager_image_patch.yaml : evalsymlink failure on '.../kserve/config/overlays/development/configmap/inferenceservice_patch.yaml' : lstat .../kserve/config/overlays/development/configmap/inferenceservice_patch.yaml: no such file or directory`` 01:28:59
@_slack_kubeflow_U03CN7QAHN3:matrix.orgzorba(손주형) Dan Sun I don’t know.. but it’s possible since containers yaml input list List of container 02:40:04
@_slack_kubeflow_U0315UY2WRM:matrix.orgShri Javadekar I can imagine one container for the predictions and a sidecar container for logs/metrics, etc. 04:18:29
@_slack_kubeflow_U03CN7QAHN3:matrix.orgzorba(손주형) Shri Javadekar I thought that too. However how kserve know which container is transformer and which is sidecar? 05:24:27
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun well kserve already can inject a model agent sidecar, if any features need to be added I recommend contributing there 07:42:13
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun KServe introduction talk on Kubecon AI Days is now on youtube ! https://www.youtube.com/watch?v=FX6naJLaq2Y&list=PLj6h78yzYM2PJdsIBxtDOyiFqP3wIbOcc&index=9 08:25:56
@_slack_kubeflow_UM56LA7N3:matrix.orgBenjamin Tan Hurray! Was looking forward to this ❤️ 09:38:53
@_slack_kubeflow_U03CN7QAHN3:matrix.orgzorba(손주형) oh it will be in 0.9.0 11:20:03
@_slack_kubeflow_U02UYNBU951:matrix.orgAlexa Griffith changed their display name from _slack_kubeflow_U02UYNBU951 to Alexa Griffith.13:40:58
@_slack_kubeflow_U02UYNBU951:matrix.orgAlexa Griffith set a profile picture.13:41:01
@_slack_kubeflow_U02UYNBU951:matrix.orgAlexa Griffith Yay!! Had so much fun 13:41:12
@_slack_kubeflow_U02LE3KB53M:matrix.orgVivian Pan We tried the ensemble model, but the inference graph doesn’t seem to spin up the pod and can’t see any errors with the inference graph https://github.com/yuzisun/kserve/tree/graph/docs/samples/graph#34-ensemble
❯ kubectl -n kubeflow-demo get ig
NAME             URL   READY   AGE
ensemble-model                 23m
❯ k -n kubeflow-demo get isvc
NAME                       URL                                                                      READY   PREV   LATEST   PREVROLLEDOUTREVISION                              LATESTREADYREVISION                                AGE
xgboost-iris               http://xgboost-iris.kubeflow-demo. host-name                True           100                                                         xgboost-iris-predictor-default-00001               22h
sklearn-iris               http://sklearn-iris.kubeflow-demo. host-name                True           100                                                         sklearn-iris-predictor-default-00001               22h
❯ k -n kubeflow-demo get pod
NAME                                                              READY   STATUS    RESTARTS   AGE
sklearn-iris-predictor-default-00001-deployment-67df87c8ffddn8t   3/3     Running   0          22h
xgboost-iris-predictor-default-00001-deployment-7f88bf596crkdtt   3/3     Running   0          22h
17:41:11
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun what’s the output for kubectl get ig ? 19:38:44
@_slack_kubeflow_U02LE3KB53M:matrix.orgVivian Pan
❯ k -n kubeflow-demo get ig -o yaml
apiVersion: v1
items:
- apiVersion: serving.kserve.io/v1alpha1
  kind: InferenceGraph
  metadata:
    annotations:
      kubectl.kubernetes.io/last-applied-configuration: ,
        {"apiVersion":"serving.kserve.io/v1alpha1","kind":"InferenceGraph","metadata":{"annotations":{},"name":"ensemble-model","namespace":"kubeflow-demo"},"spec":{"nodes":{"ensembleModel":{"routerType":"Ensemble","routes":[{"service":"sklearn-iris"},{"service":"xgboost-iris"}]}}}}
    creationTimestamp: "2022-05-18T16:47:14Z"
    generation: 1
    managedFields:
    - apiVersion: serving.kserve.io/v1alpha1
      fieldsType: FieldsV1
      fieldsV1:
        f:metadata:
          f:annotations:
            .: {}
            f:kubectl.kubernetes.io/last-applied-configuration: {}
        f:spec:
          .: {}
          f:nodes:
            .: {}
            f:ensembleModel:
              .: {}
              f:routerType: {}
              f:routes: {}
      manager: kubectl-client-side-apply
      operation: Update
      time: "2022-05-18T16:47:14Z"
    name: ensemble-model
    namespace: kubeflow-demo
    resourceVersion: "347617836"
    uid: 6ae8f825-5624-47c7-b70d-f20d76eaf2be
  spec:
    nodes:
      ensembleModel:
        routerType: Ensemble
        routes:
        - service: sklearn-iris
        - service: xgboost-iris
kind: List
metadata:
  resourceVersion: ""
  selfLink: ""
19:39:46
@_slack_kubeflow_U02LE3KB53M:matrix.orgVivian Pan no events listed
❯ k -n kubeflow-demo describe inferencegraph ensemble-model
Name:         ensemble-model
Namespace:    kubeflow-demo
Labels:        none 
Annotations:   none 
API Version:  serving.kserve.io/v1alpha1
Kind:         InferenceGraph
Metadata:
  Creation Timestamp:  2022-05-18T16:47:14Z
  Generation:          1
  Managed Fields:
    API Version:  serving.kserve.io/v1alpha1
    Fields Type:  FieldsV1
    fieldsV1:
      f:metadata:
        f:annotations:
          .:
          f:kubectl.kubernetes.io/last-applied-configuration:
      f:spec:
        .:
        f:nodes:
          .:
          f:ensembleModel:
            .:
            f:routerType:
            f:routes:
    Manager:         kubectl-client-side-apply
    Operation:       Update
    Time:            2022-05-18T16:47:14Z
  Resource Version:  347617836
  UID:               6ae8f825-5624-47c7-b70d-f20d76eaf2be
Spec:
  Nodes:
    Ensemble Model:
      Router Type:  Ensemble
      Routes:
        Service:  sklearn-iris
        Service:  xgboost-iris
Events:            none 
19:40:18
@_slack_kubeflow_UERAV8XJ8:matrix.orgshotarok joined the room.20:51:34
@_slack_kubeflow_U02LE3KB53M:matrix.orgVivian Pan i managed to resolve the issue, the validating webhook was not configured properly. But the router image seems unable right now. I’m guessing this is not pushed to the public container registry yet?
Revision "ensemble-model-00001" failed with message: Unable to fetch image "kserve/router:latest":
22:53:36
@_slack_kubeflow_UFVUV2UFP:matrix.orgDan Sun if you run make deploy-dev that will deploy the development image, the latest image will be published once the PR is merged 23:11:19
@_slack_kubeflow_U02LE3KB53M:matrix.orgVivian Pan I’m running your setup in our test cluster with a slightly different setup than what the make deploy-dev would generate. I build your image and pushed it to our container registry, I can see the inference graph is ready and pod is healthy
❯ k -n kubeflow-demo get ig
NAME             URL                                                            READY   AGE
ensemble-model   http://ensemble-model.kubeflow-demo.iap.test. unity-host    True    26m

❯ k -n kubeflow-demo get pods
NAME                                                              READY   STATUS    RESTARTS   AGE
ensemble-model-00001-deployment-86b996545f-c9n48                  3/3     Running   0          16m
Checking the virtualservice that gets created from this. Does the alpha version allow for usage of a custom gateway, other than the knative one?
❯ k -n kubeflow-demo get vs
NAME                                                 GATEWAYS                                                                              HOSTS
ensemble-model-ingress                               ["knative-serving/knative-ingress-gateway","knative-serving/knative-local-gateway"]   ["ensemble-model.kubeflow-demo","ensemble-model.kubeflow-demo.iap.test. unity-host ","ensemble-model.kubeflow-demo.svc","ensemble-model.kubeflow-demo.svc.cluster.local"]                                                                                                                                                                                                                                                                                                                                                                                                                                  17m
ensemble-model-mesh                                  ["mesh"]                                                                              ["ensemble-model.kubeflow-demo","ensemble-model.kubeflow-demo.svc","ensemble-model.kubeflow-demo.svc.cluster.local"]
23:41:20
@_slack_kubeflow_U02LE3KB53M:matrix.orgVivian Pan similar to what the inference virtualservice has?
❯ k -n kubeflow-demo get vs
NAME                                                 GATEWAYS                                                                              HOSTS
sklearn-iris                                         ["knative-serving/knative-local-gateway","istio-system/gateway-iap"]                  ["sklearn-iris.kubeflow-demo.svc.cluster.local","sklearn-iris.kubeflow-demo.iap.test. unity-host "]                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        28h
sklearn-iris-predictor-default-ingress               ["knative-serving/knative-ingress-gateway","knative-serving/knative-local-gateway"]   ["sklearn-iris-predictor-default.kubeflow-demo","sklearn-iris-predictor-default.kubeflow-demo.iap.test. unity-host ","sklearn-iris-predictor-default.kubeflow-demo.svc","sklearn-iris-predictor-default.kubeflow-demo.svc.cluster.local"]                                                                                                                                                                                                                                                                                                                                                                  28h
sklearn-iris-predictor-default-mesh                  ["mesh"]                                                                              ["sklearn-iris-predictor-default.kubeflow-demo","sklearn-iris-predictor-default.kubeflow-demo.svc","sklearn-iris-predictor-default.kubeflow-demo.svc.cluster.local"]
23:44:07

Show newer messages


Back to Room ListRoom Version: 6