- Use cases: Gen AI, Content Management
- Industries: Insurance, Financial Services, Manufacturing and Mobility, Retail
- Products: Atlas, Vector Search
- Partners: SuperDuperDB, OpenAI, FastAPI
Retrieval-augmented generation (RAG) applications are a game changer for insurance companies, enabling them to harness the power of unstructured data while promoting accessibility and flexibility. Special attention goes to PDFs, which are ubiquitous yet difficult to search, leading claim adjusters and underwriters to spend hours reviewing contracts, claims, and guidelines in this common format. RAG for PDF search brings efficiency and accuracy to this historically cumbersome task. Now, users can simply type a question in natural language and the app will sift through the company data, provide an answer, summarize the content of the documents, and indicate the source of the information, including the page and paragraph where it was found.
In this GitHub repo, you will find detailed, step-by-step instructions on how to build the PDF search application combining MongoDB, SuperDuperDB, and LLMs. Our use case for this solution focuses on a claim adjuster or an underwriter handling a specific case. Analyzing the guidelines PDF associated with a specific customer helps determine the loss amount in the event of an accident or the new premium in the case of a policy renewal. The app assists by answering questions and displaying the relevant sections of the document.
Insurance firms rely heavily on data processing. To make investment decisions or handle claims, they leverage vast amounts of data, mostly unstructured. Underwriters and claim adjusters need to comb through numerous pages of guidelines, contracts, and reports, typically in PDF format. Manually finding and reviewing every piece of information is time-consuming and can easily lead to expensive mistakes, such as incorrect risk estimations. Quickly finding and accessing relevant content is essential. Combining Atlas Vector Search and LLMs to build RAG apps can directly impact the bottom line of an insurance company.
Combining MongoDB and SuperDuperDB allows you to build an information retrieval system with ease. Let’s break down the process:
Each customer has a guidelines PDF associated with their account based on country of residency. When the user selects a customer and asks a question, the system runs a vector search query only on that particular document, seamlessly filtering out the non-relevant ones. This is made possible by the pre-filtering (see code snippets below) field included in the index and in the search query.
Atlas Vector Search also takes advantage of MongoDB’s new Search Nodes dedicated architecture, enabling better optimization for the right level of resourcing for specific workload needs. Search Nodes provide dedicated infrastructure for Atlas Search and Vector Search workloads, allowing you to optimize compute resources and fully scale search needs independent of the database. Search Nodes provide better performance at scale, delivering workload isolation, higher availability, and the ability to better optimize resource usage.
SuperDuperDB is an open-source Python framework for integrating AI models and workflows directly with and across major databases for more flexible and scalable custom enterprise AI solutions. It enables developers to build, deploy, and manage AI on their existing data infrastructure and data, while using their preferred tools, eliminating data migration and duplication.
With SuperDuperDB, developers can:
SuperDuperDB provides an array of sample use cases and notebooks that developers can use to get started, including vector search with MongoDB, embedding generation, multimodal search, RAG, transfer learning, and many more. The demo showcased in this solution is adapted from an app previously developed by SuperDuperDB.
Build the solution following the instructions in this Github repo. It is important to note that the solution is made of two logical steps:
Replicate this demo by following the instructions and associated models in this solution’s repository.
Learn how MongoDB’s developer data platform supports a wide range of use cases in the insurance industry.
MongoDB and Nomic equip organizations with a powerful AI-powered search solution for large PDF collections.
Learn how insurers are using AI to transform operations and how MongoDB underpins this innovation.