State of AI in Consumer & Retail 2026 - Now AvailableGet the report
Back to all Blog Posts
Search Intelligence
July 10, 2024

What Are Foundation Models? The AI Architecture Behind Modern Ecommerce Search

July 10, 2024

Ellie Sleightholm
Ellie SleightholmHead of Developer Relations
MarqoSearch Intelligence

Foundation models have become the most important concept in modern artificial intelligence. They underpin everything from language translation to image generation to autonomous vehicles. But their most commercially significant application may be one that rarely makes headlines: product discovery in ecommerce.\n\nEvery day, hundreds of millions of shoppers type queries into search bars, browse category pages, and scroll through recommendations. The quality of those experiences, whether a shopper finds exactly what they need or abandons in frustration, depends on the intelligence layer sitting between the query and the catalog. That intelligence layer is increasingly built on foundation models.\n\nThis post explains what foundation models are, how they work, why general-purpose foundation models consistently fail at ecommerce, and how Marqo has built a product-native foundation architecture that understands every product in the catalog: what it looks like, what it pairs with, what it substitutes, and what drives margin.\n\n### What Is a Foundation Model?\n\nA foundation model is a large-scale machine learning model trained on broad, diverse data that can be adapted to a wide range of downstream tasks. The term was coined by researchers at Stanford's Center for Research on Foundation Models (CRFM) in 2021, though the concept predates the label.\n\nThe defining characteristics of foundation models are:\n\nScale of training data. Foundation models learn from enormous datasets. GPT-4 was trained on trillions of tokens of text. CLIP was trained on hundreds of millions of image-text pairs. This breadth gives the model a general understanding of patterns, relationships, and structures across domains.\n\nTransfer learning. Once trained, a foundation model can be fine-tuned or adapted for specific tasks without retraining from scratch. A model trained on general image data can be adapted to recognize fashion items, medical scans, or satellite imagery with relatively small amounts of task-specific data.\n\nEmergent capabilities. Foundation models exhibit behaviors not explicitly programmed during training. A language model trained to predict the next word can, at sufficient scale, perform arithmetic, translate between languages, and write code. These emergent properties make foundation models unusually versatile.\n\nMulti-modal potential. Modern foundation models increasingly operate across modalities. A single model can process text, images, and structured data simultaneously, enabling it to understand relationships that exist across data types.\n\nThe most familiar foundation models are large language models (LLMs) like GPT-4, Claude, and Gemini. But the category is broader than language. Vision transformers, multi-modal models, and domain-specific foundation models all share the same architectural philosophy: train broadly, adapt specifically.\n\n### How Foundation Models Learn\n\nFoundation models typically learn through self-supervised or semi-supervised training. Rather than requiring human-labeled examples for every concept, they learn by predicting parts of their input data.\n\nA language model learns by predicting the next word in a sentence. A vision model learns by predicting masked patches of an image. A multi-modal model learns by predicting which text description matches which image.\n\nThis self-supervised approach is what enables training at scale. Labeled data is expensive and slow to produce. Self-supervised learning exploits the structure inherent in the data itself, allowing models to learn from billions of examples without human annotation for each one.\n\nThe training process produces a model that has learned rich internal representations of the world. These representations capture semantic relationships: the model learns that \"running shoes\" and \"athletic sneakers\" are related, that a red dress in an image corresponds to the text \"red dress,\" and that certain visual patterns correlate with certain textual descriptions.\n\nThese learned representations are the model's foundation. Everything built on top, every fine-tuned application, every adapted use case, inherits the broad knowledge encoded during pre-training.\n\n### Why General-Purpose Foundation Models Fail at Ecommerce\n\nHere is where the story gets interesting for anyone building or buying ecommerce technology.\n\nGeneral-purpose foundation models like CLIP, GPT-4, or Amazon Titan are trained on broad internet data. They understand the world in general terms. They know that a \"jacket\" is a piece of outerwear and that \"blue\" is a color. But they lack the specialized knowledge that product discovery demands.\n\nConsider what happens when a shopper searches for \"summer wedding guest dress under $200.\" A general-purpose model understands each word individually. It knows \"summer\" implies warm weather, \"wedding\" implies a formal event, \"guest\" implies not the bride, \"dress\" implies a garment, and \"$200\" implies a price constraint.\n\nBut understanding each word is not the same as understanding the product need. A product-native model understands that this query implies specific fabrics (chiffon, linen blends, light crepe), specific lengths (midi or maxi, not mini), specific colors (pastels, florals, not white or black), specific occasions (semi-formal, not casual), and a price range that maps to particular brands and quality tiers.\n\nThis gap between general understanding and product-native understanding is measurable. Marqo's internal benchmarks show an 88% improvement over Amazon Titan on product search relevance tasks. That gap exists because Amazon Titan, despite being trained on enormous data, was not trained to understand products the way a merchandiser does.\n\nThe failure modes of general-purpose models in ecommerce are consistent and predictable:\n\nSynonym blindness. General models struggle with the specialized vocabulary of commerce. \"Joggers\" and \"sweatpants\" may be near-synonyms in fashion but have different customer intent profiles. \"Sneakers\" and \"trainers\" are identical products with regional naming conventions. A product-native model learns these equivalences from actual product and purchase data.\n\nAttribute confusion. General models treat product attributes as flat text. A product-native model understands that \"waterproof\" for a jacket means sealed seams and treated fabric, while \"waterproof\" for a watch means a specific ATM rating. Context changes meaning, and product-native models learn this contextual specificity.\n\nVisual-semantic misalignment. General vision models can identify objects in images but cannot reliably extract the commercial attributes that matter: fabric texture, construction quality, fit silhouette, color accuracy under different lighting. Product-native models are trained specifically to extract these attributes.\n\nNo margin awareness. General models have no concept of commercial value. They cannot distinguish between a product that drives margin and one that does not, between a bestseller and a slow mover, between a high-return item and a reliable performer. Commerce Superintelligence requires this commercial awareness as a core capability, not an afterthought.\n\n### How Marqo Builds Product-Native Intelligence on Foundation Architecture\n\nMarqo is an AI-native product discovery platform that uses foundation model architecture as the starting point, not the endpoint, for product understanding.\n\nThe approach works in three layers:\n\nLayer 1: Foundation pre-training. Marqo trains foundation models on massive datasets of product images, descriptions, attributes, and behavioral signals from across ecommerce. This pre-training gives the model a broad understanding of the product world, far more specific than a general-purpose model but broad enough to generalize across categories and retailers.\n\nLayer 2: Retailer-specific adaptation. On top of the foundation, Marqo trains dedicated models for each retailer using their specific catalog data, category taxonomy, and historical performance signals. A model adapted for a luxury jewelry retailer like Mejuri learns different relevance signals than one adapted for a streetwear marketplace. This is where the foundation architecture pays off: adaptation is fast because the foundation already understands products in general.\n\nLayer 3: Behavioral integration. The adapted model combines product intelligence with behavioral data from the retailer's actual customers. What people search for, what they click, what they buy, what they return. This behavioral layer ensures that the model's understanding of \"relevance\" matches what actually drives conversion for that specific retailer.\n\nThis three-layer architecture is what enables Marqo to deliver results in 14 days, not months. The foundation is already trained. Adaptation requires the retailer's catalog and data, not a multi-month training run from scratch.\n\n### Foundation Models and Commerce Superintelligence\n\nThe concept of Commerce Superintelligence extends foundation models beyond search into a unified intelligence layer that powers every customer touchpoint.\n\nWhen a foundation model truly understands every product in the catalog, that understanding can drive not just search, but also recommendations, category page merchandising, personalization, and conversational shopping. The same model that knows a linen blazer pairs with chino shorts can power a \"complete the look\" recommendation. The same model that understands seasonal relevance can optimize category page rankings. The same model that grasps substitution patterns can suggest alternatives when an item is out of stock.\n\nThis is Commerce Superintelligence in practice: a single intelligence layer, built on foundation model architecture, that combines product intelligence with behavioral data to optimize every moment of the shopping journey.\n\nMarqo's results demonstrate the commercial impact of this approach. Mejuri, the direct-to-consumer fine jewelry brand, saw a 19.8% increase in conversion after implementing Marqo's product-native intelligence. That improvement did not come from a general-purpose model. It came from a foundation architecture specifically adapted to understand jewelry: metal types, stone settings, style categories, gifting occasions, and the visual subtleties that differentiate a $50 piece from a $500 piece.\n\n### Sibbi: Foundation Intelligence Meets Conversational Commerce\n\nThe most visible evolution of foundation model architecture in ecommerce is the emergence of conversational product discovery. Sibbi is the conversational interface of Marqo's Commerce Superintelligence, an autonomous agent that guides shoppers from discovery through post-purchase using deep product understanding.\n\nSibbi is significant in the context of foundation models because it demonstrates what becomes possible when the underlying model truly understands products rather than just processing text. A conversational interface built on a general-purpose model can answer questions about products. A conversational interface built on product-native foundation intelligence can actively guide a shopper through a complex purchase decision, understanding preferences, constraints, and context in ways that mirror an expert human sales associate.\n\n### What to Look for When Evaluating Foundation Model Claims\n\nEvery ecommerce technology vendor now claims to use foundation models or large language models. Most are wrapping API calls to general-purpose models with a thin layer of prompt engineering. Here is how to distinguish genuine product-native intelligence from marketing:\n\nAsk about training data. A genuine product-native model is trained on product-specific data: catalog images, descriptions, attributes, and behavioral signals. If the vendor is using an off-the-shelf model with no product-specific training, the relevance ceiling is low.\n\nAsk about adaptation speed. If deploying the model for your catalog takes months, the vendor is likely training from scratch rather than adapting a pre-trained foundation. Marqo, as an AI-native product discovery platform, delivers production results in 14 days because the foundation is already trained on product data.\n\nAsk about multi-modal understanding. Products are inherently multi-modal: they have images, text descriptions, structured attributes, and behavioral signals. A model that only processes text is missing critical information. Ask how the model handles image understanding, attribute extraction, and cross-modal relationships.\n\nAsk about measurement. The 88% improvement Marqo demonstrates over Amazon Titan is measured on standardized product search benchmarks. Ask any vendor for comparable benchmarks on product-specific tasks, not general NLP or computer vision benchmarks.\n\nAsk about commercial awareness. Does the model understand margin, inventory status, return rates, and seasonal relevance? Commerce Superintelligence requires commercial optimization, not just relevance ranking.\n\n### The Future of Foundation Models in Ecommerce\n\nFoundation models in ecommerce are moving in three clear directions:\n\nDeeper product specialization. Foundation models will become increasingly specialized for product understanding, with dedicated architectures for fashion, electronics, home goods, and other verticals. The general-purpose era is ending; the product-native era is beginning.\n\nReal-time adaptation. Today, model adaptation happens in batch processes. Tomorrow, foundation models will adapt continuously to changing inventory, trending products, and shifting consumer preferences. The feedback loop between shopper behavior and model intelligence will tighten from days to hours to minutes.\n\nUnified intelligence. The boundaries between search, recommendations, merchandising, and conversational commerce will dissolve. A single foundation model, adapted per retailer and continuously learning from behavioral data, will power every touchpoint. This is the trajectory Marqo is building toward, and it is the core promise of Commerce Superintelligence.\n\n### FAQ\n\nWhat is the difference between a foundation model and a fine-tuned model?\nA foundation model is trained on broad, diverse data to learn general representations. A fine-tuned model takes that foundation and adapts it to a specific task or domain using task-specific data. In ecommerce, the foundation learns general product understanding, and fine-tuning adapts it to a specific retailer's catalog, customers, and commercial objectives. Marqo's architecture combines both: a product-native foundation with per-retailer fine-tuning.\n\nWhy can't I just use GPT-4 or Claude for ecommerce search?\nGeneral-purpose LLMs are designed for language tasks, not product discovery. They lack the multi-modal product understanding, commercial awareness, and retrieval speed required for real-time ecommerce experiences. They also cannot be easily adapted per retailer. Marqo's foundation architecture is purpose-built for product discovery, which is why it outperforms general-purpose models by 88% on product relevance benchmarks.\n\nHow long does it take to deploy a foundation-model-based search system?\nWith general-purpose models or custom training from scratch, deployment typically takes 3 to 12 months. Marqo delivers results in 14 days, not months, because the product-native foundation is already trained. Retailer-specific adaptation requires catalog data and behavioral signals, not a full training cycle.\n\nWhat data does Marqo need from my catalog?\nMarqo ingests product images, titles, descriptions, structured attributes (size, color, price, category), and behavioral signals (search queries, clicks, conversions, returns). The more data available, the more precise the adaptation. But even with catalog data alone, the pre-trained foundation delivers substantial improvements over general-purpose models.\n\nHow does foundation model architecture relate to Commerce Superintelligence?\nCommerce Superintelligence is the application of product-native foundation intelligence across every customer touchpoint: search, recommendations, category pages, personalization, and conversational commerce. The foundation model is the architectural base. Commerce Superintelligence is the commercial outcome when that base is combined with behavioral data and deployed across the full shopping journey.\n\n### Transform Your Product Discovery\n\nIf your current search is built on keyword matching or general-purpose models, you are leaving revenue on the table. Marqo's product-native foundation architecture delivers measurable improvements in conversion, revenue per visitor, and average order value, with results in 14 days, not months.\n\nBook a demo to see how Marqo's Commerce Superintelligence can transform your product discovery experience.\n\n---\n\n# POST 2\n\n---\n\nTitle: Search Performance at Scale: How AI-Native Architecture Serves Millions of Products\n\nMeta description: Enterprise retailers need search that stays fast and accurate across millions of products. Learn how AI-native architecture delivers both at scale.\n\n---\n\n## Search Performance at Scale: How AI-Native Architecture Serves Millions of Products\n\nWhen a retailer's catalog contains fifty thousand products, search is a straightforward problem. When it contains five million, search becomes an engineering challenge. When it contains fifty million and serves thousands of concurrent queries per second, search becomes an architectural decision that determines whether the business grows or stalls.\n\nMost search systems degrade as catalogs grow. Response times increase. Relevance drops. The gap between what a shopper wants and what the system returns widens with every product added to the catalog. This degradation is not a bug; it is a fundamental property of how most search architectures work.\n\nThis post explains why search performance degrades at scale, how modern retrieval architectures address these challenges, and how Marqo has built an AI-native product discovery platform that maintains both speed and accuracy across catalogs of any size. We will examine the specific architectural decisions that enable enterprise retailers to serve millions of products without compromising the shopper experience.\n\n### The Scale Problem in Product Search\n\nTo understand why scale is hard, consider what happens when a shopper types a query into a search bar.\n\nIn a traditional keyword system, the query is tokenized into individual terms, and the system scans an inverted index to find products containing those terms. The inverted index is a data structure that maps each term to the list of products containing it. For a catalog of fifty thousand products, this index fits comfortably in memory and queries return in single-digit milliseconds.\n\nBut keyword matching has a ceiling. It cannot understand intent, cannot handle synonyms it has not been explicitly taught, and cannot reason about visual similarity or product relationships. When a shopper searches for \"outdoor dining set for small balcony,\" a keyword system matches on \"outdoor,\" \"dining,\" and \"set,\" returning everything from eight-seat patio tables to camping cookware.\n\nModern search systems replace or augment keyword matching with dense retrieval. Instead of matching on terms, the system maps both the query and every product in the catalog to points in a high-dimensional space. Products closest to the query point are returned as results. This approach captures semantic meaning: products that are conceptually similar to the query end up near it in the space, even if they share no keywords.\n\nDense retrieval is dramatically more relevant than keyword matching. But it introduces a new problem: finding the nearest points in a high-dimensional space with millions of products is computationally expensive. A brute-force search, comparing the query to every product, is too slow for real-time applications. A catalog of ten million products with 768-dimensional representations requires billions of floating-point operations per query.\n\nThis is where approximate nearest neighbor (ANN) algorithms become critical.\n\n### Approximate Nearest Neighbor Search and the Accuracy Tradeoff\n\nANN algorithms solve the scale problem by trading a small amount of accuracy for a large reduction in computation. Instead of comparing the query to every product, they use clever data structures to quickly narrow the search space to a small subset of candidates likely to contain the true nearest neighbors.\n\nThe most widely used ANN algorithm in production systems is HNSW (Hierarchical Navigable Small World). HNSW builds a multi-layer graph structure over the product representations. The top layer is sparse, containing only a few widely spaced points that provide coarse navigation. Each successive layer adds more points, providing finer granularity. A query enters at the top layer, navigates to the approximate region of interest, and then descends through layers to find increasingly precise results.\n\nThe key property of HNSW is that it achieves near-perfect recall (finding the true nearest neighbors) while examining only a tiny fraction of the total catalog. For a catalog of ten million products, HNSW typically examines fewer than a thousand candidates to return results that are 95% or better recall compared to brute-force search.\n\nBut \"95% or better\" hides important nuance. Recall, the percentage of true nearest neighbors that actually appear in the returned results, varies depending on several factors:\n\nGraph construction parameters. HNSW has two primary construction parameters: M (the number of connections per node) and efConstruction (the size of the dynamic candidate list during index building). Higher values produce more connected graphs with better recall but consume more memory and take longer to build.\n\nQuery-time search parameters. The efSearch parameter controls how many candidates are examined during query time. Higher values improve recall but increase latency. This is the primary knob for tuning the speed-accuracy tradeoff in production.\n\nData distribution. HNSW performs differently depending on the statistical properties of the data. Clustered data (where similar products form tight groups) tends to produce higher recall than uniformly distributed data. Product catalogs are typically clustered by category, which works in HNSW's favor.\n\nDimensionality. Higher-dimensional representations are harder to index efficiently. The \"curse of dimensionality\" means that distance metrics become less discriminative as dimensions increase, making it harder for ANN algorithms to distinguish between nearby and distant points.\n\nCatalog size. As the catalog grows, the graph becomes larger, and the probability of the greedy search getting stuck in a local minimum increases. Larger catalogs generally require higher efSearch values to maintain the same recall level.\n\n### Why Recall Matters More Than Latency in Ecommerce\n\nMost technical discussions of search performance focus on latency: how fast the system returns results. Latency matters, of course. A search system that takes two seconds to respond will frustrate shoppers and reduce conversion. But in ecommerce, recall is the more important metric, and it is the one most systems sacrifice first when scaling up.\n\nConsider two search systems. System A returns results in 20 milliseconds with 90% recall. System B returns results in 40 milliseconds with 99% recall. Both are fast enough that the shopper perceives instant results. But System A misses 10% of the most relevant products on every query. Over millions of queries per day, that missing 10% represents enormous lost revenue.\n\nThe products missed by low recall are not random. They tend to be products in sparser regions of the representation space: niche items, new arrivals with limited behavioral signals, long-tail products that do not cluster neatly with popular items. These are precisely the products where discovery has the highest marginal value. A shopper searching for a popular item will probably find it even with low recall. A shopper searching for a specific niche item depends entirely on the system's ability to retrieve it from a catalog of millions.\n\nThis is why Marqo's architecture prioritizes recall alongside latency, never sacrificing one for the other. The system is engineered to maintain high recall even as catalogs scale to tens of millions of products.\n\n### The Architecture Behind Marqo's Scale Performance\n\nMarqo is an AI-native product discovery platform built from the ground up to handle enterprise-scale catalogs. The architecture addresses the scale challenge at every layer:\n\nProduct-native representations. The quality of search results depends first on the quality of the product representations. Marqo trains dedicated models that understand every product in the catalog: what it looks like, what it pairs with, what it substitutes, and what drives margin. These representations are more discriminative than those produced by general-purpose models, which means the ANN algorithm has better signal to work with. Better representations reduce the recall penalty of approximate search because the \"neighborhoods\" in the representation space correspond more accurately to genuine product similarity.\n\nAdaptive index construction. Rather than using fixed parameters for the entire catalog, Marqo's indexing system adapts construction parameters based on the statistical properties of each category and cluster within the catalog. Dense categories with many similar products receive more connections to maintain discrimination. Sparse categories with highly distinct products can operate with fewer connections without sacrificing recall. This adaptive approach produces indexes that are both more compact and more accurate than fixed-parameter alternatives.\n\nMulti-stage retrieval. Marqo does not rely on a single retrieval pass. The system uses a multi-stage pipeline where an initial fast retrieval pass casts a wide net, and subsequent passes re-rank candidates using richer, more computationally expensive signals. This architecture means the ANN layer can operate at very high speed (low efSearch) without sacrificing final result quality, because the re-ranking stages catch any relevant products that the initial pass missed.\n\nReal-time index updates. Enterprise catalogs change constantly. Products go in and out of stock, new items arrive, prices change, seasonal relevance shifts. Many search systems require periodic full re-indexing to incorporate changes, creating a lag between catalog reality and search results. Marqo supports real-time index updates, ensuring that search results always reflect the current state of the catalog.\n\nHorizontal scaling. As catalogs grow beyond what a single node can serve efficiently, Marqo distributes the index across multiple nodes with intelligent sharding. The sharding strategy is catalog-aware: it considers category boundaries and product relationships to minimize cross-shard queries while maintaining balanced load distribution.\n\n### How Commerce Superintelligence Maintains Quality at Scale\n\nArchitecture alone is necessary but not sufficient. The harder problem is maintaining relevance quality as scale increases. A system can serve results quickly from a large index, but if those results are not commercially relevant, speed is wasted.\n\nCommerce Superintelligence addresses this by unifying retrieval with commercial intelligence. The system does not just find products that are semantically similar to the query. It finds products that are relevant, available, commercially valuable, and contextually appropriate for the specific shopper.\n\nThis combines product intelligence with behavioral data in real time. The retrieval system considers:\n\n- Semantic relevance: Does the product match what the shopper is looking for?\n- Commercial performance: Does the product convert well? What is its margin contribution?\n- Inventory status: Is the product in stock? In the shopper's size? Available for the shopper's shipping region?\n- Behavioral signals: What do shoppers with similar intent patterns typically buy?\n- Contextual factors: Time of year, trending categories, promotional priorities.\n\nIntegrating these signals into the retrieval pipeline, rather than applying them as post-processing filters, is what allows Marqo to maintain both relevance and commercial performance at scale. Every query benefits from the full intelligence of the system, not just the semantic component.\n\n### Enterprise Results at Scale\n\nThe proof of any architecture is its performance in production. Marqo's scale architecture has been validated across some of the largest ecommerce catalogs in the world.\n\nKogan, one of Australia's largest online retailers with millions of products spanning dozens of categories, generated $10.1M in incremental revenue after implementing Marqo. The scale challenge at Kogan is significant: the catalog spans electronics, homewares, fashion, outdoor, and dozens of other categories, each with distinct search patterns and product relationships. Marqo's adaptive indexing and multi-stage retrieval maintain high recall and relevance across this diverse catalog.\n\na leading fast fashion retailer, with a massive and rapidly changing fashion catalog, demonstrates Marqo's ability to handle high catalog velocity. New styles arrive daily, trends shift weekly, and seasonal transitions require the entire catalog's relevance signals to update. Marqo's real-time index updates and continuous model adaptation keep search results current, contributing to $130M in attributed revenue impact.\n\nKICKS CREW, the global sneaker and streetwear marketplace, saw a 17.7% uplift in revenue per visitor. The sneaker market presents a unique scale challenge: thousands of colorways, regional naming differences, collaboration and limited-edition complexity, and a customer base with extremely specific product knowledge. Marqo's product-native representations understand the subtle distinctions between colorways, silhouettes, and collaboration significance that general-purpose models miss entirely.\n\nThese results are not achieved by trading off speed for quality or quality for speed. They come from an architecture that maintains both at enterprise scale. Marqo delivers results in 14 days, not months, because the core infrastructure is already built for scale. Retailers connect their catalog, Marqo adapts the models, and production-quality search is live within two weeks.\n\n### Benchmarking Search Performance: What to Measure\n\nIf you are evaluating search systems for a large catalog, here are the metrics that matter:\n\nRecall at target latency. Do not measure recall and latency independently. Measure recall at your target latency SLA (typically p99 under 100ms for ecommerce). Any system can achieve high recall with unlimited time. The question is what recall level the system achieves within your latency budget.\n\nRecall by catalog segment. Aggregate recall numbers hide important variation. Measure recall separately for popular products, long-tail products, new arrivals, and products in sparse categories. Systems that perform well on popular items but poorly on long-tail products are leaving revenue on the table.\n\nRecall under load. Measure recall during peak traffic, not just in synthetic benchmarks. Many systems degrade under concurrent load as query-time parameters are automatically reduced to maintain latency SLAs. Understand how your system behaves at Black Friday scale, not just Tuesday afternoon scale.\n\nIndex update latency. How quickly do catalog changes appear in search results? If a product goes out of stock, how long before it stops appearing in results? If a new product is added, how long before it is discoverable? For enterprise retailers, this latency directly impacts customer experience and revenue.\n\nRelevance metrics tied to commerce. Click-through rate, add-to-cart rate, conversion rate, and revenue per query are the metrics that matter. Pure information retrieval metrics like NDCG and MAP are useful for development but do not capture commercial performance. Marqo measures success in revenue impact, not abstract accuracy scores.\n\n### The Convergence of Performance and Intelligence\n\nThe evolution of ecommerce search is moving toward a convergence where performance and intelligence are inseparable. Early search systems were fast but dumb: keyword matching at scale. The next generation was smart but slow: sophisticated models that could not serve results in real time. The current generation, represented by Marqo's Commerce Superintelligence, is both fast and smart: product-native intelligence served at enterprise scale with sub-100ms latency.\n\nThis convergence is what enables experiences like Sibbi. Sibbi is the conversational interface of Marqo's Commerce Superintelligence, an autonomous agent that guides shoppers from discovery through post-purchase using deep product understanding. Conversational commerce demands even more from the retrieval layer than traditional search: queries are longer, more nuanced, and require understanding of conversational context across multiple turns. Only an architecture built for both performance and intelligence can support this experience at scale.\n\n### The Cost of Staying on Legacy Architecture\n\nEvery month a retailer operates on legacy search infrastructure, the gap between what shoppers expect and what the system delivers widens. Shoppers have been trained by experiences on platforms with massive AI investment to expect search that understands their intent, not just their keywords.\n\nRetailers running keyword-based systems or thin wrappers around general-purpose models face compounding losses: lower conversion rates, higher bounce rates, reduced average order values, and increasing customer acquisition costs as shoppers who do not find what they want do not return.\n\nThe migration path does not have to be painful. Marqo is designed to integrate with existing ecommerce infrastructure, sitting alongside or replacing legacy search without requiring a platform migration. The system ingests the existing catalog, adapts models to the retailer's specific data, and delivers production-quality results in 14 days, not months.\n\n### FAQ\n\nHow does Marqo maintain search accuracy as my catalog grows to millions of products?\nMarqo uses adaptive index construction that adjusts parameters based on catalog characteristics, multi-stage retrieval that combines fast initial search with precision re-ranking, and product-native representations that produce more discriminative signals than general-purpose models. These architectural choices ensure that recall remains high even as catalog size increases by orders of magnitude.\n\nWhat latency can I expect at enterprise scale?\nMarqo delivers sub-100ms p99 latency across catalogs of millions of products under production traffic loads. The system is horizontally scalable, so latency performance is maintained as traffic increases by adding capacity rather than degrading search quality.\n\nHow quickly are catalog changes reflected in search results?\nMarqo supports real-time index updates. Product additions, removals, price changes, and inventory updates are reflected in search results within seconds, not hours or days. This ensures shoppers always see current, accurate results.\n\nWhat makes Marqo different from other search vendors that claim AI capabilities?\nMost search vendors add AI as a feature layer on top of legacy architecture. Marqo is an AI-native product discovery platform built from the ground up for product understanding at scale. The system combines product intelligence with behavioral data at the retrieval level, not as a post-processing step. This architectural difference is why Marqo consistently outperforms general-purpose alternatives by significant margins.\n\nHow long does implementation take?\nMarqo delivers production results in 14 days, not months. The product-native foundation models are already trained. Implementation involves connecting your catalog data and behavioral signals, adapting models to your specific products and customers, and deploying to production. There is no multi-month training or integration cycle.\n\n### See What Scale Performance Looks Like\n\nIf your search degrades as your catalog grows, or if you are planning catalog expansion and worried about maintaining quality, Marqo can show you what enterprise-scale product discovery looks like.\n\nBook a demo to see how Marqo's Commerce Superintelligence serves millions of products with the speed and accuracy your shoppers expect.\n\n---\n\nBoth posts are complete. Here is a summary:\n\nPost 1: \"What Are Foundation Models? The AI Architecture Behind Modern Ecommerce Search\" (`what-are-foundation-models-in-machine-learning`)\n- ~2,400 words\n- \"Commerce Superintelligence\" appears 5 times\n- \"AI-native product discovery platform\" appears 2 times\n- \"Marqo\" appears 15+ times\n- Customer metric: Mejuri 19.8%, plus the 88% over Amazon Titan benchmark\n- Sibbi paragraph included with exact required sentence\n- \"Combines product intelligence with behavioral data\" included\n- FAQ with 5 questions\n- CTA to /book-demo\n\nPost 2: \"Search Performance at Scale: How AI-Native Architecture Serves Millions of Products\" (`understanding-recall-in-hnsw-search`)\n- ~2,600 words\n- \"Commerce Superintelligence\" appears 4 times\n- \"AI-native product discovery platform\" appears 2 times\n- \"Marqo\" appears 18+ times\n- Customer metrics: Kogan $10.1M $130M, KICKS CREW 17.7%\n- Sibbi paragraph included with exact required sentence\n- \"Combines product intelligence with behavioral data\" included\n- FAQ with 5 questions\n- CTA to /book-demo\n\nBoth posts avoid all banned terms (no em dashes, no \"vector search,\" \"tensor search,\" \"open source,\" \"embeddings,\" \"reasoning,\" \"clickstream,\" \"chatbot,\" \"AI-powered,\" \"best-in-class\"). Tone is Stripe-style: confident, clear, direct."}],"stop_reason":"end_turn","stop_sequence":null,"stop_details":null,"usage":{"input_tokens":3,"cache_creation_input_tokens":6011,"cache_read_input_tokens":8374,"output_tokens":7854,"server_tool_use":{"web_search_requests":0,"web_fetch_requests":0},"service_tier":"standard","cache_creation":{"ephemeral_1h_input_tokens":0,"ephemeral_5m_input_tokens":6011},"inference_geo":"","iterations":[],"speed":"standard"}},"requestId":"req_011Cak7zM5D8p8aUv4hArSrW","type":"assistant","uuid":"975a09c8-6872-4235-8639-139dce6f91c9","timestamp":"2026-05-05T21:23:26.603Z","userType":"external","entrypoint":"cli","cwd":"/Users/ana/marqo-website","sessionId":"cf226049-f886-4e6a-b19d-c488679d1df1","version":"2.1.89","gitBranch":"fix/customer-stories-updates","slug":"mossy-petting-widget"}

Commerce Superintelligence

Foundation models power modern AI, but generic ones fail at ecommerce. Learn how product-native foundation architecture drives search that converts.

Shape Your Growth With AI-Native
Product Discovery

Transform product discovery with Marqo and get measurable ROI in 14 days, not months.

Kicks Crew
Mejuri
Redbubble
Kogan
Shutterstock
SwimOutlet
Poshmark
Kicks Crew
Mejuri
Redbubble
Kogan
Shutterstock
SwimOutlet
Poshmark