An open banking dataset is a collection of banking-related data made available for research, development, or analysis. These datasets range from publicly available bank information (ATM locations, product rates) to synthetic transaction data for machine learning, to sandbox environments for API development.
Whether you're building a fintech application, training ML models for fraud detection, or researching banking trends, this guide covers the best sources to find and download open banking data.
Free, no-authentication APIs providing public banking data
Official Open Banking Open Data APIs provide publicly available, non-personal banking information. These APIs require no authentication and are free to access.
Official UK standard APIs for ATM locations, branch data, personal/business current accounts, loans, and credit cards.
Free access to Santander UK ATM, branch, and product data with unlimited rate limits.
EU banking supervision data including risk indicators, transparency exercises, and statistical reports.
Developer environments with realistic test data
Sandbox environments provide test data for developing and testing open banking integrations without accessing real customer accounts. Most offer free developer access.
550+ API endpoints with sandbox environment. Create test banks, accounts, and transactions for development.
Official UK Open Banking conformance testing environment with standardized test data.
Test Plaid API integration with simulated bank connections and transaction data.
European-focused sandbox for testing account aggregation and payment initiation.
Free datasets for machine learning and research
Public banking datasets on platforms like Kaggle provide transaction data, bank statements, and financial records for machine learning training and research analysis.
| Dataset | Use Case | Format | Size |
|---|---|---|---|
| UK Open Banking API Data (GitHub) Daily snapshots of UK banks' Open Data APIs since 2019. Includes ATM locations, branches, and product data from major UK banks. | Time-series analysis, research | JSON | Daily updates |
| Bank Transaction Data (Kaggle) Synthetic bank transaction dataset for fraud detection and transaction analysis. | Fraud detection, ML training | CSV | ~1M transactions |
| Bank Statements Dataset (Kaggle) Sample bank statement data for document parsing and financial analysis. | Document processing, OCR | CSV | Various |
| Credit Card Transactions (Kaggle) Credit card transaction data for spending pattern analysis and categorization. | Spending analysis, categorization | CSV | ~1.8M transactions |
| IEEE-CIS Fraud Detection Large-scale transaction dataset from Vesta Corporation for fraud detection models. | Fraud detection competition | CSV | 590K transactions |
Note: Public datasets typically contain synthetic or anonymized data. For real transaction data, you'll need to use authorized API aggregators with customer consent.
Industry statistics, reports, and data archives
Official UK Open Banking usage statistics and adoption data. Includes monthly reports on API calls, user numbers, and payment volumes.
Historical panel dataset of UK Open Banking Open Data APIs. Daily snapshots from Barclays, HSBC, Lloyds, Natwest, Santander, and more.
Commercial banking API and provider data
Open Banking Tracker maintains a comprehensive dataset of 2,000+ banking API providers, aggregators, and financial institutions with 80+ data attributes including:
Use the Open Banking Open Data APIs directly—no registration required. Simply make HTTP requests to the documented endpoints for ATM, branch, and product data.
Register for a free sandbox account with Open Bank Project or your preferred aggregator. You'll receive API credentials and access to realistic test data.
Download datasets from Kaggle (free account required) or GitHub. Most datasets are in CSV format and ready for analysis or model training.
Real customer data requires authorization. Partner with an API aggregator and implement proper consent flows per PSD2/Open Banking regulations.
You can download open banking data from several sources: official Open Banking APIs (UK Open Data APIs for ATM/branch/product data), public datasets on Kaggle and GitHub, sandbox environments like Open Bank Project, and commercial providers. Most official Open Banking data APIs require no authentication for public datasets like ATM locations and product information.
Much open banking data is free. Official Open Data APIs from UK Open Banking provide free access to ATM locations, branch information, and product data. Public datasets on Kaggle are free to download. Sandbox environments offer free test data. However, real customer transaction data requires proper authorization and typically comes from commercial API aggregators.
Open banking data is typically available in JSON format via REST APIs. Public datasets may also be available in CSV, Excel, or Parquet formats. The UK Open Banking APIs follow standardized JSON schemas documented in the Open Data API Specification. Kaggle datasets are commonly available in CSV format for easy analysis.
Yes, open banking datasets are widely used for machine learning projects including fraud detection, transaction categorization, credit scoring, and spending analysis. Kaggle hosts several banking transaction datasets suitable for ML. For production ML models, you would typically use data from API aggregators like Plaid or TrueLayer with proper customer consent.
Open Banking Open Data refers to publicly available, non-personal banking information that financial institutions must share via APIs. This includes ATM locations, branch details, product information (interest rates, fees), and service availability. Unlike Account Information Services (AIS), Open Data requires no customer authentication as it contains no personal financial information.
To access Open Banking sandbox data, register for a developer account with providers like Open Bank Project, UK Open Banking sandbox, or aggregators like Plaid and TrueLayer. Sandboxes provide realistic test data for development without accessing real customer accounts. Most sandboxes are free and provide API credentials immediately upon registration.
Connect to 30+ accounting platforms through a single unified API with Apideck.
Explore Apideck Accounting API