Semantic Search

Semantic search is an advanced information retrieval technique that enhances search accuracy by understanding the user's intent and the contextual meaning of query terms. Unlike traditional keyword-based searches, semantic search delves into the broader context, considering the relationships between words and the meanings of phrases. Instead of relying on metadata and keyword matching, it uses vector-based search, allowing deeper query comprehension. It uses advanced language models to generate precise and reliable answers.

Semantic search eliminates the need for manual curation or keyword extraction. It uses the user’s intent and generates natural, contextually accurate responses. By leveraging vector embeddings, semantic search can identify and retrieve relevant information more effectively, leading to significant improvements in accuracy. It achieves this by using the Retrieval Augmented Generation (RAG) method. RAG enriches the query with information, ensuring that the responses are based on the knowledge base and improving the overall user experience by providing more relevant and accurate information.

Key Features of Semantic Search

Focus on User Intent: Semantic search prioritizes understanding the user's intent behind the query, leading to more relevant and accurate results. Instead of matching exact keywords, it interprets the meaning and context, offering a more intuitive search experience.

No Manual Curation: Unlike keyword search, Semantic search does not require manual curation or keyword extraction due to no dependency on metadata and summary. The zero-touch approach streamlines the curation process, reducing the need for extensive human intervention.

Contextual Understanding: Luma considers the relationships between words and the meanings of phrases, thereby understanding the broader context of the query. This results in more meaningful and accurate search outcomes, even for complex or ambiguous queries.

High Accuracy: The transition to vector embedding-based search has led to significant improvements in accuracy.

Natural Responses: Semantic search responses are natural and contextually appropriate. Instead of merely fetching answers from articles, it leverages advanced language models to construct precise answers, enhancing the user experience.

Retrieval Augmented Generation (RAG): The RAG concept integrates retrieval mechanisms and large language models to generate accurate responses. It ensures that answers are generated based only on the trusted knowledge base rather than on external sources.

Multiple Knowledge Sources: Precise answers from multiple knowledge articles, ensuring that the most relevant and accurate information is provided. This comprehensive approach improves the reliability of the search results.

Flexible Configuration for Tenants: For new tenants, semantic search is enabled automatically. Existing tenants can transition to the new system through a non-reversible process managed by the operations team, ensuring compatibility with semantic search configurations.

Semantic Search vs. Metadata Search

Metadata Search

Semantic Search

Metadata Search

Semantic Search

  • Keyword-Based: Relies on keywords or metadata like subject, topic, Action and Keywords.

  • Requires Curation: Needs manual extraction of keywords and metadata fine-tuning.

  • Limited Context Understanding: May not fully grasp the user's intent or the relationships between words.

  • Artifact summary is mandatory

  • Generates answer only from one article

  • Based on User Intent: Understands the context and meaning of the query.

  • Vector Embedding-Based: Uses Vector Embeddings for better accuracy. With RAG searches information only from trusted knowledge base.

  • No Manual Curation: Operates without the need for keyword extraction or metadata fine-tuning.

  • Generates Natural Responses: Provides contextually accurate answers using LLMs from multiple knowledge articles.

How Does Semantic Search Work?

image-20240805-081329.png

Semantic Search involves two main steps:

Data Ingestion: When a user provides documents such as PDFs, web pages, and Word files, the text from these documents is extracted and chunked into manageable pieces. These chunks are processed by an embedding model that converts the text into vectors, which are subsequently stored in a vector database.

Query/Data Retrieval: During the Query/Data Retrieval phase, the user query is processed by the system to identify the relevant chunks from the vector database. These chunks, along with the user’s query, are then used to generate a precise response using a large language model (LLM), which is then delivered back to the user.

This process highlights the integration of text extraction, vectorization, and advanced AI models to enhance the accuracy and relevance of search results.

Semantic Search is enabled by default for all new tenants provisioned. To migrate existing tenants to Semantic Search, get in touch with the Operations team. This would require migrating the artifact structure to support semantic search i.e. reprocessing of all the Artifacts available in the Tenant. This is a non-reversible process. Once migrated, cannot be reverted.