
3.3 Retrieve Related Products

1. Add Docs Retrieval Script

Let's copy over the get_product_documents.py script into our application source folder.

cp src.sample/api/get_product_documents.py  src/api/.

2. Understand Docs Retrieval

Let's start with a sample user query like this:

 I need a new tent for 4 people, what would you recommend?

Different users will phrase this question in different ways, with different levels of detail. But we need to map all of these queries to a search query that works against the product database. How do we do that? We use a 3-step process:

  1. We teach an AI model to extract the user's intent from the input text query
  2. We teach the model to map that intent to a search query over the product index
  3. We use the search index to retrieve the product documents matching that query.

Let's see how we do this.
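
For example, the intent mapping step might condense the conversational question above into a terse search query. The exact output depends on the model and the prompt template, so this is only an illustration:

User query:   I need a new tent for 4 people, what would you recommend?
Search query: tent for 4 people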


3. Create AI Project Client

  1. Create an Azure AI Project client using the connection string
  2. Use the client to retrieve a chat_completions inference client
  3. Use the client to retrieve an embeddings inference client
  4. Use the client to set up a search_client using the search connection
src/api/get_product_documents.py - Part 1
import os
from pathlib import Path
from opentelemetry import trace
from azure.ai.projects import AIProjectClient
from azure.ai.projects.models import ConnectionType
from azure.identity import DefaultAzureCredential
from azure.core.credentials import AzureKeyCredential
from azure.search.documents import SearchClient
from config import ASSET_PATH, get_logger

# initialize logging and tracing objects
logger = get_logger(__name__)
tracer = trace.get_tracer(__name__)

# create a project client using environment variables loaded from the .env file
project = AIProjectClient.from_connection_string(
    conn_str=os.environ["AIPROJECT_CONNECTION_STRING"], credential=DefaultAzureCredential()
)

# create chat and embeddings inference clients from the project
# (chat is used for intent mapping, embeddings for vectorizing search queries)
chat = project.inference.get_chat_completions_client()
embeddings = project.inference.get_embeddings_client()

# use the project client to get the default search connection
search_connection = project.connections.get_default(
    connection_type=ConnectionType.AZURE_AI_SEARCH, include_credentials=True
)

# Create a search client using the search connection
# This client will be used to query the product search index
search_client = SearchClient(
    index_name=os.environ["AISEARCH_INDEX_NAME"],
    endpoint=search_connection.endpoint_url,
    credential=AzureKeyCredential(key=search_connection.key),
)
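
The script above expects a few environment variables (loaded from the .env file, as noted in the code). A minimal sketch with placeholder values, assuming the variable names used in the script; substitute your own project connection string, index name, and model deployment names:

AIPROJECT_CONNECTION_STRING="<your-ai-project-connection-string>"
AISEARCH_INDEX_NAME="<your-search-index-name>"
INTENT_MAPPING_MODEL="<your-chat-model-deployment>"
EMBEDDINGS_MODEL="<your-embeddings-model-deployment>"

Note that DefaultAzureCredential also needs a signed-in Azure identity (for example, via az login) so the clients can authenticate.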

4. Get Docs For User Intent

  1. First, receive the input text string (user query)
  2. Then, map the user query text into a clear intent (search query)
  3. Then, vectorize the search query (to support retrieval)
  4. Then, search the product index for matches (by cosine similarity)
  5. Then, for each matching product, retrieve its document (content)
  6. Finally, return the collection of documents to the caller.
src/api/get_product_documents.py - Part 2
from azure.ai.inference.prompts import PromptTemplate
from azure.search.documents.models import VectorizedQuery

@tracer.start_as_current_span(name="get_product_documents")
def get_product_documents(messages: list, context: dict = None) -> list:
    if context is None:
        context = {}

    overrides = context.get("overrides", {})
    top = overrides.get("top", 5)

    # generate a search query from the chat messages
    intent_prompty = PromptTemplate.from_prompty(Path(ASSET_PATH) / "intent_mapping.prompty")

    intent_mapping_response = chat.complete(
        model=os.environ["INTENT_MAPPING_MODEL"],
        messages=intent_prompty.create_messages(conversation=messages),
        **intent_prompty.parameters,
    )

    search_query = intent_mapping_response.choices[0].message.content
    logger.debug(f"🧠 Intent mapping: {search_query}")

    # generate a vector representation of the search query
    embedding = embeddings.embed(model=os.environ["EMBEDDINGS_MODEL"], input=search_query)
    search_vector = embedding.data[0].embedding

    # search the index for products matching the search query
    vector_query = VectorizedQuery(vector=search_vector, k_nearest_neighbors=top, fields="contentVector")

    search_results = search_client.search(
        search_text=search_query, vector_queries=[vector_query], select=["id", "content", "filepath", "title", "url"]
    )

    documents = [
        {
            "id": result["id"],
            "content": result["content"],
            "filepath": result["filepath"],
            "title": result["title"],
            "url": result["url"],
        }
        for result in search_results
    ]

    # add the generated search query to the context as a "thought" so the caller can inspect it
    if "thoughts" not in context:
        context["thoughts"] = []

    context["thoughts"].append(
        {
            "title": "Generated search query",
            "description": search_query,
        }
    )

    # add the retrieved documents to the context as grounding data for the caller
    if "grounding_data" not in context:
        context["grounding_data"] = []
    context["grounding_data"].append(documents)

    logger.debug(f"📄 {len(documents)} documents retrieved: {documents}")
    return documents
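
Once the intent mapping template from the next step exists (and your search index is populated), you can sanity-check the function by calling it directly. A minimal, hypothetical usage sketch; the message content and the top value are just examples:

if __name__ == "__main__":
    # hypothetical quick test: a single-turn conversation, return the top 3 matches
    query = "I need a new tent for 4 people, what would you recommend?"
    documents = get_product_documents(
        messages=[{"role": "user", "content": query}],
        context={"overrides": {"top": 3}},
    )
    for doc in documents:
        print(doc["title"], "->", doc["filepath"])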

5. Run Docs Retrieval Script

Before we can run this script, we need to create the intent mapping prompt template (intent_mapping.prompty) used in step 2. Let's do that next.