March 13, 2023

Build intelligence in your eCommerce applications using Azure Open AI

Overview

Whenever I shop online, checking the reviews of the products is a tedious task. Sometimes the low-rated reviews are based on delivery, packaging… whereas a few high-rated reviews might be on cost but not on quality. Generally, reviews/ratings are displayed without any context which results in customers needing additional time and effort to get the most value out of it. Hence, we built this prototype using Azure Open AI to present the consolidated information from all the reviews of a product.

Azure Open AI

Prerequisites

  • Azure Subscription with permissions to create the required resources.
  • Azure Open AI service access – check the registration process.
  • Python 3.9 or later

Review Intelligence

We fed all the reviews of a specific product to GPT-3 text-davinci-003 model with a prompt to classify the text based on the three categories – Cost, Quality, and Delivery as shown below. Prompt is where user provide the English text command which the API endpoint of the model will use to generate the completions.

classify_prompt = f'Classify the following review into 1 of the following categories:  categories: [Cost, Quality, Delivery]\n\unreview: {s}\n\nClassified category:'
response = openai.Completion.create(
    engine="text-davinci-003",
    prompt=classify_prompt,
    temperature=0,
    max_tokens=100,
    top_p=1,
    frequency_penalty=0,
    presence_penalty=0,
    best_of=1,
    stop=None)

Similarly, we classified the sentiment of the text using the same model on three categories - Positive, Neutral, and Negative.

sentiment_prompt = f'Identify the sentiment in the following review into 1 of the following categories:  categories: [Positive, Neutral, Negative]\n\nreview: {s}\n\nClassified category:'
response = openai.Completion.create(
    engine="text-davinci-003",
    prompt=sentiment_prompt,
    temperature=0,
    max_tokens=100,
    top_p=1,
    frequency_penalty=0,
    presence_penalty=0,
    best_of=1,
    stop=None)

Here is the sample result after the category classification and sentiment detection of the review text. Data frame sample

As you notice, we are using the pre-trained model available in Azure Open AI service as is above. There are three main approaches for in-context learning: Few-shot, one-shot and zero-shot. In this case, we didn’t provide any examples to the model and only the task request is provided hence it’s zero-shot learning. Based on your requirement or pilot testing, you shall consider fine tuning the model using one of the available models for fine tuning.

Above category classification and sentiment detection job needs execution only once per review text and store the results for future use. Existing data shall be analyzed using one batch processing design options and real time events shall be analyzed using one of real-time processing design options in Azure Architecture Center.

Now when a user requests for review intelligence, make an API call to Azure Open AI service to summarize the reviews per category as shown in the following command.

response = openai.Completion.create(
    engine="text-davinci-003",
    prompt=review_text,
    temperature=0.7,
    max_tokens=100,
    top_p=1,
    frequency_penalty=0,
    presence_penalty=0,
    best_of=1,
    stop=None)

Also consolidate the results from sentiment column per category to show the review intelligence summary below.

Review Intelligence Output

Another use case where Open AI service can bring value is on product search. First, get the embeddings of the product description text using get_embedding API with test-davinci-search-doc-001 model and store the result in a separate column named davinci_search.

df['davinci_search'] = df['product_description'].apply(
    lambda x : get_embedding(x, engine = 'text-davinci-search-doc-1'))

Below is the sample data frame after including the embeddings.

Search Embeddings Data Frame

Now using ordinary questions like text and using cosine similarity and text-search-davinci-query-001 model, we will retrieve the top four most relevant items as shown below.

def search_docs(df, user_query, top_n=3, to_print=True):
    embedding = get_embedding(
        user_query,
        engine='text-davinci-search-1'
    )
    df['similarities'] = df.davinci_search.apply(lambda x: cosine_similarity(x, embedding))

    res = (
        df.sort_values('similarities', ascending=False)
        .head(top_n)
    )
    return res

res = search_docs(df, "feel good ear phone with good sound quality", top_n=4)

The above search request will return the results like the one shown below.

Search Results

Jupyter Notebook for this demo is available in this GitHub repo.

Below is an approach to extract data from your own knowledge base, create embeddings and use prompts to find the best answer to your customers.
Semantic Search using Open AI Design

Conclusion

In this post we have seen a demo of how easy it is to use Azure Open AI service to bring intelligence to your eCommerce Portal and provide better experience to your customers.

Citation

In this demo, we used the dataset published as part of the following paper.
Justifying recommendations using distantly-labeled reviews and fined-grained aspects
Jianmo Ni, Jiacheng Li, Julian McAuley
Empirical Methods in Natural Language Processing (EMNLP), 2019

This post was originally published at Azure Developer Community Blog.

© Prakash P 2015 - 2023

Powered by Hugo & Kiss.