{"id":60368,"date":"2025-04-15T20:27:37","date_gmt":"2025-04-15T20:27:37","guid":{"rendered":"https:\/\/www.bloomreach.com\/?post_type=library&#038;p=60368"},"modified":"2025-04-15T20:27:38","modified_gmt":"2025-04-15T20:27:38","slug":"using-the-power-of-nvidia-to-build-a-groundbreaking-conversational-agent","status":"publish","type":"library","link":"https:\/\/www.bloomreach.com\/en\/blog\/using-the-power-of-nvidia-to-build-a-groundbreaking-conversational-agent","title":{"rendered":"Using the Power of NVIDIA To Build a Groundbreaking Conversational Agent"},"content":{"rendered":"\n<p>The latest technological advancements in generative AI have led to a boom in improving customer ecommerce experience by delivering the right products from search results and enabling conversational product discovery. This is where <a href=\"https:\/\/www.bloomreach.com\/en\/products\/clarity\">Bloomreach Clarity<\/a> makes a big impact on brands. Our AI conversational shopping agent can deliver conversational experiences that convert across a brand\u2019s entire site.&nbsp;<\/p>\n\n\n\n<p>To maximize the use of these foundational technologies, Bloomreach has partnered with NVIDIA to utilize its leading-edge models and software frameworks to optimize and customize these experiences. In this post, I will take a closer look at how both Bloomreach Clarity and <a href=\"https:\/\/www.bloomreach.com\/en\/products\/genai-product-discovery\">Discovery<\/a> make use of NVIDIA models and frameworks to enable highly accurate, personalized conversations.\u00a0<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Delivering a Conversational Experience&nbsp;<\/h2>\n\n\n\n<p>To ground this discussion, we must first understand what conversational discovery means in the context of ecommerce. When we say \u201cconversational,\u201d the first thing many users will think of is a chatbot. However, in ecommerce specifically, users also typically associate chatbots with customer support, which then turns them away from wanting to interact with one.&nbsp;<\/p>\n\n\n\n<p>To address this, we knew we had to embed conversations in new ways across the on-site experience:&nbsp;<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Behaviorally triggered conversations<\/strong>. Instead of having Clarity pop up in a chat window too early (and then getting ignored), we can set triggers so that Clarity doesn\u2019t engage shoppers until the right time. For example, if someone searches for \u201cjacket\u201d and views at least two pages of results, then Clarity could pop up and ask if they need assistance.\u00a0<\/li>\n\n\n\n<li><strong>Conversation starters<\/strong>. We can surface conversations in the form of clickable buttons with relevant questions as people shop. These buttons can appear on category pages, product listing pages, the checkout page, and more. For example, someone looking at a specific mattress might see questions like, \u201cHow does the cooling gel in this mattress compare to regular memory foam?\u201d or \u201cWhat is the return policy for this mattress?\u201d Once a user clicks on a question, it\u2019ll start a conversation with Clarity, and they can continue to interact with the shopping agent from there.\u00a0<\/li>\n\n\n\n<li><strong>Search bar<\/strong>. One way to bring conversations to search is through autogenerated questions as they type their search query (which I previously wrote about <a href=\"https:\/\/www.bloomreach.com\/en\/blog\/balancing-search-tradition-and-transformation-with-bloomreach-clarity\">here<\/a>). Another option is to show an embedded box on the search results page showing the user\u2019s query and offering help to refine the search. For example, a search for \u201cchair\u201d might result in a box that asks what material the shopper would like.\u00a0<\/li>\n<\/ul>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1024\" height=\"630\" src=\"https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Clarity-and-NVIDIA_search-result-clarification-1024x630.jpg\" alt=\"Bloomreach Clarity engaging a customer after their search query with a conversational prompt\" class=\"wp-image-60369\" srcset=\"https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Clarity-and-NVIDIA_search-result-clarification-1024x630.jpg 1024w, https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Clarity-and-NVIDIA_search-result-clarification-300x185.jpg 300w, https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Clarity-and-NVIDIA_search-result-clarification-768x473.jpg 768w, https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Clarity-and-NVIDIA_search-result-clarification.jpg 1462w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>In this way, we can engage customers at just the right moments, whether they\u2019re searching for a specific type of product or just browsing.&nbsp;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">A Closer Look at the Clarity Architecture&nbsp;<\/h2>\n\n\n\n<p>So, how do we actually deliver these conversational experiences? At its core, Clarity is an agentic, RAG-based platform \u2014 more specifically, it\u2019s a limited economy agentic platform with guardrails and latency constraints in place to boost performance.&nbsp;<\/p>\n\n\n\n<p>The architecture consists of a conversation engine and embedded conversational experiences, and it can also utilize Discovery\u2019s hybrid product search engine.<\/p>\n\n\n\n<p>Unlike a traditional search query, which is a one-off experience (i.e., you search for a query and get results), conversational shopping is more of a journey. To that end, we use <a href=\"https:\/\/www.bloomreach.com\/en\/blog\/balancing-search-tradition-and-transformation-with-bloomreach-clarity\">pre-search, search, and post-search LLM calls<\/a> to ensure Clarity is delivering value throughout the entire conversation.&nbsp;<\/p>\n\n\n\n<p>We have found that a bottleneck for conversation quality is catalog quality. Many catalogs were designed for website navigation vs. a conversational experience. To clear that bottleneck, we will also use LLMs to generate our own taxonomy and attributes that work better for conversational purposes.&nbsp;<\/p>\n\n\n\n<p>This is where we have deployed NVIDIA\u2019s NIM-hosted Llama models to enable cheaper, faster generative AI calls that augment the product catalog with information that helps the quality of the conversation and search.&nbsp;<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/lh7-rt.googleusercontent.com\/docsz\/AD_4nXc8gYv3sk43sG-AesGlFJ38dQ9CEORtUdgPE2FL1jVAn54khK1eEtHAjnO8ddRDcEQJOwuYyxn9FCjgcoyw9UqarUVSpxJBHmKvZMERb8Ohr7ph8HXP1iwDzTjSKYp1eyhXwlDnAA?key=2KUKp6O_ZMGS7KUTn_XbmlHt\" alt=\"Bloomreach Clarity asking for more details to help a shopper find the right results for a dining table\"\/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\">Balancing Precision and Recall&nbsp;<\/h3>\n\n\n\n<p>We use a mix of embedding models for Clarity, with one of the more effective ones being an open-source arctic-embed model running on NVIDIA NeMo Retriever Embedding Microservice (NREM). This model features state-of-the-art text embedding, which means better natural language processing and understanding capabilities. Additionally, the model is optimized for fine-tuning and customizations so we can further improve on the performance for our use cases, and most importantly, features a lower latency (average of 12ms) for a faster search experience.&nbsp;<\/p>\n\n\n\n<p>To fine-tune our models, we classify queries as either a positive document (relevant) or a negative document (irrelevant). Within these documents, we train using \u201chard negatives\u201d (high similarity to the query) and \u201csoft negatives\u201d (low similarity to the query).&nbsp;&nbsp;<\/p>\n\n\n\n<p>One of our key learnings was that using more than two hard negatives during training could lead to degradation in understanding. To solve for this, we started using one hard negative and six soft negatives as part of the dataset, allowing for significant improvement on F-scores and relevancy.&nbsp;<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Further Improving Accuracy&nbsp;<\/h3>\n\n\n\n<p>The contrastive loss function is a standard procedure for bringing positive pairs closer and pushing negative pairs away. However, we modified the contrastive loss function so that it additionally pushes away negative products and positive products from <em>other<\/em> queries, which greatly improves accuracy.&nbsp;<\/p>\n\n\n\n<p>With these modifications, we saw no change in Vertex AI, but saw substantial improvements in the open-source models. In the NVIDIA-powered arctic-embed, the improved contrastive loss function led to a 13-point gain in precision and a 6-point gain in coverage compared to the base model, while gte-large saw an 8-point gain in precision.\u00a0<\/p>\n\n\n\n<p>By prioritizing precision (with f0.5, meaning precision was considered twice as important as coverage), arctic-embed saw similar gains \u2014 13 points in precision and 8 points in coverage \u2014 while gte-large saw a 4-point gain in precision and a 5-point gain in coverage.&nbsp;<\/p>\n\n\n\n<p>To put this into context, let\u2019s look at a sample search. For the search query \u201cCuisinart toaster,\u201d the base open-source arctic-embed model eventually ran out of Cuisinart-brand toasters to show and opted to show other Cuisinart products (that had nothing to do with toasters). On the other hand, our fine-tuned embedding model started showing other brands\u2019 toasters after showing all the Cuisinart ones. In other words, fine-tuning helped our model add a sense of attribute importance (i.e., prioritize the product type over the brand).&nbsp;<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1024\" height=\"913\" src=\"https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Clarity-and-NVIDIA_fine-tuned-model-1024x913.jpg\" alt=\"Comparing search results between a base open-source model from NVIDIA and Bloomreach's fine-tuned model \" class=\"wp-image-60372\" srcset=\"https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Clarity-and-NVIDIA_fine-tuned-model-1024x913.jpg 1024w, https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Clarity-and-NVIDIA_fine-tuned-model-300x267.jpg 300w, https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Clarity-and-NVIDIA_fine-tuned-model-768x684.jpg 768w, https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Clarity-and-NVIDIA_fine-tuned-model.jpg 1462w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">The Power of Conversational Shopping With Clarity<\/h2>\n\n\n\n<p>With the power of NVIDIA\u2019s models and software frameworks, we\u2019ve been able to both enrich the product catalog and fine-tune our embedding models to deliver truly personalized conversations across your entire site through Clarity and Discovery. With the right datasets and an <a href=\"https:\/\/www.bloomreach.com\/en\/blog\/bloomreach-ai-development-principles\">intentional approach<\/a>, we have created a conversational agent that can drive <a href=\"https:\/\/www.bloomreach.com\/en\/case-studies\/tfg-boosts-online-conversion-rate-with-bloomreach-clarity\">meaningful business results<\/a>.<\/p>\n\n\n\n<p>To get even more detail into our Clarity architecture, <a href=\"https:\/\/www.nvidia.com\/en-us\/on-demand\/session\/gtc25-s71086\/\" target=\"_blank\" rel=\"noreferrer noopener\">watch our session<\/a> from NVIDIA GTC featuring Vikas Jha, the VP of Engineering, Conversational Commerce, and Chirayu Samarth, the Software Engineering Manager, Search.\u00a0<\/p>\n\n\n\n<p>And, Clarity is only one of the agentic innovations we\u2019re developing. To see what else we have in store, be sure to <a href=\"https:\/\/visit.bloomreach.com\/innovation-fest-2025\" target=\"_blank\" rel=\"noopener\">register for Innovation Fest<\/a>, taking place on May 7.&nbsp;<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><a href=\"https:\/\/visit.bloomreach.com\/innovation-fest-2025\" target=\"_blank\" rel=\"noopener\"><img decoding=\"async\" width=\"1024\" height=\"967\" src=\"https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Innovation-Fest-2025-banner-1024x967.jpg\" alt=\"\" class=\"wp-image-60375\" srcset=\"https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Innovation-Fest-2025-banner-1024x967.jpg 1024w, https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Innovation-Fest-2025-banner-300x283.jpg 300w, https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Innovation-Fest-2025-banner-768x725.jpg 768w, https:\/\/www.bloomreach.com\/wp-content\/uploads\/2025\/04\/Innovation-Fest-2025-banner.jpg 1462w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/a><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>The latest technological advancements in generative AI have led to a boom in improving customer ecommerce experience by delivering the right products from search results and enabling conversational product discovery. This is where Bloomreach Clarity makes a big impact on brands. Our AI conversational shopping agent can deliver conversational experiences that convert across a brand\u2019s [&hellip;]<\/p>\n","protected":false},"author":127,"featured_media":60378,"template":"","ew-regions":[],"ew-solutions":[],"library_type":[513],"library_blog_tag":[362,368],"industry":[],"channel":[276,277,278],"topic":[283,546],"class_list":["post-60368","library","type-library","status-publish","has-post-thumbnail","hentry","library_type-blog","library_blog_tag-ai-and-innovation","library_blog_tag-conversational-shopping","channel-category-pages","channel-product-pages","channel-results-pages","topic-ai","topic-personalization"],"acf":{"library_blog_banner_content":"","library_blog_banner_cta1_text":"","library_blog_banner_cta1_href":"","library_blog_banner_cta1_new_tab":false,"library_blog_banner_cta2_text":"","library_blog_banner_cta2_href":"","library_blog_banner_cta2_new_tab":false,"library_blog_banner_bg_color":"#EAF7FE","library_blog_banner_cta_text_color":"#FFF","library_blog_banner_cta_bg_color":"#019ACE","library_blog_banner_cta2_text_color":"#000","library_blog_banner_cta2_bg_color":"#FFF","library_blog_chatgpt_content":"","library_blog_chatgpt_cta_href":"","library_blog_chatgpt_cta_text":"Ask ChatGPT"},"_links":{"self":[{"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/library\/60368","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/library"}],"about":[{"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/types\/library"}],"author":[{"embeddable":true,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/users\/127"}],"version-history":[{"count":2,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/library\/60368\/revisions"}],"predecessor-version":[{"id":60386,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/library\/60368\/revisions\/60386"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/media\/60378"}],"wp:attachment":[{"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/media?parent=60368"}],"wp:term":[{"taxonomy":"ew_regions","embeddable":true,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/ew-regions?post=60368"},{"taxonomy":"ew_solutions","embeddable":true,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/ew-solutions?post=60368"},{"taxonomy":"library_type","embeddable":true,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/library_type?post=60368"},{"taxonomy":"library_blog_tag","embeddable":true,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/library_blog_tag?post=60368"},{"taxonomy":"industry","embeddable":true,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/industry?post=60368"},{"taxonomy":"channel","embeddable":true,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/channel?post=60368"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/www.bloomreach.com\/en\/wp-json\/wp\/v2\/topic?post=60368"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}