refactor pytest e2e on top of k8s manifests

Open tarilabs opened this issue 1 year ago • 1 comments

Following the great work and refactor from @isinyaaa where the pytest are now being leveraged for e2e-testing,

I would propose to move away from the use of docker-compose.yaml, and rebase the e2e on top of the KF manifests, with the following proposal:

Before all e2e tests

This is where we typically launch the docker compose up.

kind create cluster
kubectl create namespace kubeflow
from the manifests directory, kubectl apply -k overlays/db
wait for Deployments like we do here
launch kubectl port-forward svc/model-registry-service -n kubeflow 8080:8080 in the background, maybe noting down pid
maybe await a couple seconds for the port fwd like we do here

After each test

This is where we typically delete the sqlite file, and restart the mlmd service in docker compose.

Seems to me this:

kubectl exec -n kubeflow -it $(kubectl get pods -l component=db -o jsonpath="{.items[0].metadata.name}" -n kubeflow) -- mysql -u root -ptest -D metadb -e "START TRANSACTION; DELETE FROM Artifact; DELETE FROM ArtifactProperty; DELETE FROM Association; DELETE FROM Attribution; DELETE FROM Context; DELETE FROM ContextProperty; DELETE FROM Event; DELETE FROM EventPath; DELETE FROM Execution; DELETE FROM ExecutionProperty; DELETE FROM ParentContext; COMMIT;"

should do equivalently (yes it's a 1-liner).

I have run a couple scripts locally, for the same model name and version, which is passing (wouldn't otherwise) but not the full e2e test suite. I prefer to mention this in full transparency, because I don't have full confirmation beforehand this will be enough.

After all e2e tests

This is where we typically do the docker compose down.

Should be enough to:

stop ps of port fwd from the previous step
kubectl delete cluster

Additional considerations

Currently we re-generate the image and we let docker compose pick-it-up, but for kind (or minikube, etc) we actually need the image as part of the "cluster"'s registry: maybe we could repurpose the make target to do these operations, such as regenerating the image, starting kind, kind load docker-image ..., etc and preparing the cluster for the e2e testing from pytest.

Sep 12 '24 19:09 tarilabs

I fully agree with this proposal and vouch for this change. It will bring us closer to real-world scenarios and help identify problems that could arise in an actual cluster.

Sep 20 '24 14:09 Al-Pragliola