- Use cases: Gen AI, Fraud Prevention
- Industries: Financial Services, Insurance
- Products and tools: Atlas, Atlas App Services, Atlas Charts, Data Federation, Amazon S3, Amazon SageMaker Canvas
- Partners: AWS
Financial services organizations face growing risks from cybercriminals. High-profile hacks and fraudulent transactions undermine faith in the industry. As technology evolves, so do the techniques employed by these perpetrators, making the battle against fraud a perpetual challenge. Existing fraud detection systems often grapple with a critical limitation: relying on stale data. The newest tactics often can be seen in the data. That's where the power of operational data comes into play.
By harnessing real-time data, fraud detection models can be trained on the most accurate and relevant clues available. MongoDB Atlas, a highly scalable and flexible developer data platform, coupled with Amazon SageMaker Canvas, an advanced machine learning tool, presents a groundbreaking opportunity to revolutionize fraud detection. By harnessing operational data and leveraging the power of real-time insights, financial institutions can fortify their defenses against cybercriminals who seek to exploit vulnerabilities for illicit gains. MongoDB Atlas proves its strength as an operational data store, accommodating high-volume transactional data with exceptional performance and flexibility. Meanwhile, Amazon SageMaker Canvas empowers business analysts to leverage AI/ML solutions effortlessly, providing a no-code platform that brings the power of advanced analytics to their fingertips.
Below, you will find the architecture used to build this fraud solution. The architecture includes an end-to-end solution for detecting different types of fraud in the banking sector, including card fraud detection, identity theft detection, account takeover detection, money laundering detection, consumer fraud detection, insider fraud detection, and mobile banking fraud detection to name a few.
The architecture diagram illustrates model training and near real-time inference. The operational data stored in MongoDB Atlas is written to the Amazon S3 bucket using the Triggers feature in Atlas Application Services. Thus stored, data is used to create and train the model in Amazon SageMaker Canvas. The SageMaker Canvas stores the metadata for the model in the S3 bucket and exposes the model endpoint for inference.
The data is divided into two separate files: one containing identity information and the other containing transaction data. These files are connected through the TransactionID. It's important to note that not every transaction includes associated identity details.
Based on the above two datasets, we prepare a test join on the TransactionID, adding the target column as Fraud.
Data courtesy of Kaggle.
The detailed step-by-step guide to build this solution can be found in this Github repo. Below you will find an overview of those steps taken:
The MongoDB Atlas developer data platform is an integrated suite of data services centered on a cloud database designed to accelerate and simplify how developers build with data. Its ability to handle large amounts of data in a flexible schema empowers financial institutions to effortlessly capture, store, and process high-volume transactional data in real-time. This means that every transaction, every interaction, and every piece of operational data can be seamlessly integrated into the fraud detection pipeline, ensuring that the models are continuously trained on the most current and relevant information available. With MongoDB Atlas, financial institutions gain an unrivaled advantage in their fight against fraud, unleashing the full potential of operational data to create a robust and proactive defense system.
Amazon SageMaker Canvas revolutionizes the way business analysts leverage AI/ML solutions by offering a powerful no-code platform. Traditionally, implementing AI/ML models required specialized technical expertise, making it inaccessible for many business analysts. However, SageMaker Canvas eliminates this barrier by providing a visual point-and-click interface to generate accurate ML predictions for classification, regression, forecasting, natural language processing (NLP), and computer vision (CV). SageMaker Canvas empowers business analysts to unlock valuable insights, make data-driven decisions, and harness the power of AI without being hindered by technical complexities. It boosts collaboration between business analysts and data scientists by sharing, reviewing, and updating ML models across tools. It brings the realm of AI/ML within reach, allowing analysts to explore new frontiers and drive innovation within their organizations.
Create this demo for yourself by following the instructions and associated models in this solution’s repository.
Work your way through this e-workshop designed to show you the steps required for building this solution.
Analyze and detect fraud in real time with MongoDB.