1. Learning ObjectivesΒΆ
GENAIOPS: Deployment is part of the Production stage. (click to view figure)
In the previous sections we completed model selection, exploration and customization:
- We learned to use the model catalog and benchmarks to "select" a model
- We learned to use model deployment and chat playground to "explore" the model
- We learned to add search services and data to "ground" responses with RAG
We can test the grounded model in the playground, but now we want to get a sense for what a real-world user experience and interaction looks like. Let's deploy the grounded chat model to Azure, then add a web app to interact with it! This is what model integration for real-world use, looks like.
By the end of this section you should know how to:
- Deploy ground chat prototype as a hosted endpoint (chat API)
- Deploy a web based application to interact with it (chat UI)
- Use the deployed application (with history) or update it