Data Download
Download and Explore
The OpenData dataset is available for immediate download. The dataset is released quarterly in multiple formats, designed for seamless integration into your data pipelines and AI applications. For the data dictionary or schema reference, please consult the Documentation page.
Understanding the Data Structure
The OpenData.org dataset is organized by release quarter and data format, and there are separate data files for each entity type. This allows you to download only the data you need:
Entity Types:
organizations— Companies, non-profits, universitieslegal-entities— Jurisdictional legal constructslocations— Physical places of business or activityaddresses— Normalized physical address recordspeople— Legally and ethically appropriate person records
Example Path Structure:
s3://opendata-project/release/Q12025.1/csv/organizations.csv
Download Methods
1. AWS CLI (Recommended for Large Downloads)
Install the AWS CLI and configure your credentials.
Configure your AWS credentials (one-time setup):
aws configure
# Enter your Access Key ID (provided in approval email)
# Enter your Secret Access Key (provided in approval email)
# Default region: us-west-2
# Default output format: csv
Download all files in CSV format:
aws s3 cp --recursive \
s3://opendata-project/release/2025-Q1/csv/*
Download specific entity type (Senzing JSON):
aws s3 cp --recursive \
s3://opendata-project/release/2025-Q1/json/organizations.json
Download entire release:
aws s3 sync \
s3://opendata-project/release/2025-Q1/
2. Python (Programmatic Access)
Use the AWS SDK (boto3) to download data programmatically with your credentials:
import boto3
# Credentials automatically loaded from AWS CLI configuration
# Or explicitly provide them:
s3 = boto3.client(
's3',
aws_access_key_id='YOUR_ACCESS_KEY',
aws_secret_access_key='YOUR_SECRET_KEY',
region_name='us-west-2'
)
# Download a specific file
s3.download_file(
'opendata-project',
'release/2025-Q1/csv/organizations.csv'
)
3. Web Console Access
After receiving credentials, you can also access files through the AWS S3 Console:
- Log into the AWS Console and open S3.
- Navigate to the
opendata-projectbucket. - Browse to your desired entity type and download files directly.
License
All OpenData.org datasets are released under the Community Data License Agreement - Permissive (CDLA-Permissive), allowing:
- Free commercial and non-commercial use
- Modification and redistribution
- No attribution requirements (though attribution is appreciated)
