Prerequisites
Before you begin, ensure you have configured Tensorkube on your AWS account. If you haven’t done that yet, follow the Getting Started guide.Deploying SAM2 with Tensorfuse
Each tensorkube deployment requires two things - your code and your environment (as a Dockerfile). While deploying machine learning models, it is beneficial if your model is also a part of your container image. This reduces cold-start times by a significant margin. To enable this, along with the Fast API app, we will download the model weights and make them part of the DockerfileCode files
We will write a small FastAPI app that takes image as input and outputs predicted labels. The FastAPI app will have three endpoints -/readiness
, /
, and /segment
. Remember that the /readiness
endpoint is used by Tensorkube to check the health of your deployments.
main.py
Environment files (Dockerfile)
Next, create a Dockerfile for your FastAPI app. Given below is a simple Dockerfile that you can use:Dockerfile