Amazon Data Exchange
AWS Data Exchange makes it easy to find, subscribe to, and use third-party data in the cloud. Qualified data providers can publish data products consisting of data sets with versioned revisions and assets including S3 snapshots, Redshift data shares, API Gateway APIs, and Lake Formation permissions. Subscribers can find and subscribe to data products directly in the AWS Management Console and use the Data Exchange API to load data into Amazon S3 for analysis with AWS analytics and machine learning services.
APIs
AWS Data Exchange API
The AWS Data Exchange API enables programmatic access to find, subscribe to, and use third-party data products. It supports managing data sets, revisions, assets, jobs, and even...
Capabilities
Features
Create, update, and manage data sets containing versioned collections of data available for subscription and distribution in the marketplace.
Organize data into versioned revisions with comments, then finalize and publish them to make data available to subscribers automatically.
Support for S3 snapshots, Redshift data shares, API Gateway APIs, Lake Formation permissions, and S3 data access as asset types.
Asynchronous import/export jobs for transferring data between external sources (S3, Redshift) and Data Exchange revisions at scale.
Configurable event actions that automatically export revision data to S3 when a new revision is published, eliminating manual downloads.
Seamlessly list and sell data products in AWS Marketplace with built-in billing, subscription management, and entitlement enforcement.
Control access to data products using AWS IAM policies and resource- level permissions with ARN-based resource identification.
Use Cases
Subscribe to curated third-party datasets from financial data providers, healthcare data aggregators, weather services, and market research firms.
Publish and sell proprietary datasets to other AWS customers via the marketplace with automated billing and subscription management.
Configure event actions to automatically deliver new data revisions to S3, enabling downstream analytics pipelines to process fresh data.
Access high-quality labeled datasets and specialized data products from Data Exchange to train and improve machine learning models.
Subscribe to compliance reference data including sanctions lists, legal entity identifiers, and regulatory taxonomies via Data Exchange.
Integrations
Primary storage integration for Data Exchange assets — used as both source for imports and destination for exports via job operations.
Share Redshift tables and views directly through Data Exchange without copying data, enabling live query access for subscribers.
Distribute Lake Formation data permissions through Data Exchange, giving subscribers governed access to data lake resources.
Expose API-based data products through Data Exchange, allowing subscribers to call APIs for real-time data access.
Catalog and transform Data Exchange S3 datasets using AWS Glue for ETL and data lake integration workflows.
Query Data Exchange S3 snapshot data directly with SQL using Athena for serverless analytics on subscribed datasets.
Use third-party datasets from Data Exchange as training data for machine learning models in Amazon SageMaker.