Getting USAspending Data

Note: The FY 2017 3rd Quarter submission window is currently open. Submissions are due August 14, 2017.

After an agency certifies a submission on broker.usaspending.gov, they are presented with the option to publish their certified submission to beta.usaspending.gov. After the submission is chosen to be published in the broker:

The submission ID (unique to each submission) is queued for loading into a clone of the database used by beta.usaspending.gov
If other submissions are currently in the queue, they are loaded first
An individual submission can take up to 24 hours to load
When a new build of usaspending-api is deployed (every two weeks), the cloned database replaces the old one used by beta.usaspending.gov
Data will also be published to beta.usaspending.gov the day following the quarterly submission deadline
Any data certified before the close of the quarterly submission window will be available on beta.usaspending.gov the following day

The current beta.usaspending.gov database is publicly available on Amazon's Relational Database Service: https://aws.amazon.com/public-datasets/usaspending/

Loading USAspending Data (for Developers)

Data can be loaded to the USAspending API via a series of commands that are run from a terminal.

Note: You must set the DATABASE_URL environment variable in order to run any of these loader commands.
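
A wrapper script can fail fast when the variable is missing instead of letting a loader stop partway through. The following is a minimal sketch; `require_env` is a hypothetical helper name and is not part of usaspending-api, and the connection string is a placeholder.

```python
import os


def require_env(name, environ=None):
    """Return the value of an environment variable, or raise if it is unset or empty."""
    # Hypothetical helper: defaults to the real process environment.
    environ = os.environ if environ is None else environ
    value = environ.get(name, "")
    if not value:
        raise RuntimeError(f"{name} must be set before running the loader commands")
    return value


# Example with an explicit mapping instead of the real environment.
example_env = {"DATABASE_URL": "postgres://user:password@localhost:5432/usaspending"}
print(require_env("DATABASE_URL", example_env))
```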

There are two types of data loads. Both require USAspending reference data to already be loaded (see below):

Certified DATA Act submission data from broker.usaspending.gov
Historic USAspending data

Reference Data (Required):

Populates lookup tables like country codes, agency names, and other fairly static information. Developers typically only need to load this information once, when first setting up their environment.

To load in the reference data, from the same directory as manage.py:

python manage.py load_reference_data

DATA Act Certified Submission Data:

To load certified submission data from the broker, you will need a read-only (or higher) connection string to the broker PostgreSQL database. If not running locally, you will also need to ensure your IP address has been whitelisted in the appropriate AWS Security Groups. Set this environment variable before running the load_submission command:

DATA_BROKER_DATABASE_URL=postgres://user:password@url:5432/data_broker
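
Before starting a load, it can help to confirm that the connection string has the expected parts. The sketch below only parses the URL with the Python standard library and does not connect to anything; `describe_connection_string` is a hypothetical helper, not part of usaspending-api. The password is deliberately left out of the result so it is not printed.

```python
from urllib.parse import urlsplit


def describe_connection_string(url):
    """Split a postgres:// URL into its parts without opening a connection."""
    parts = urlsplit(url)
    return {
        "scheme": parts.scheme,
        "user": parts.username,
        "host": parts.hostname,
        "port": parts.port,
        "database": parts.path.lstrip("/"),
    }


# Uses the placeholder connection string from this document.
info = describe_connection_string("postgres://user:password@url:5432/data_broker")
print(info)
```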

To load a submission from the data broker database:

python manage.py load_submission [broker_submission_id]

This will load data into your USAspending database in the following order:

File A (Appropriation data)
File B (Program activity object class data)
File D2 (Award financial assistance data)
File D1 (Award procurement data)
File C (Award financial data)
- This is matched against award records created after loading D1 and D2
Any subaward data
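
The ordering matters because File C rows can only be matched once the D1 and D2 award records exist. The toy sketch below illustrates that dependency with plain dictionaries. It is not the loader's actual matching logic: the `award_key` field and both function names are hypothetical, and the source does not say what the real loader does with File C rows that have no matching award.

```python
def load_awards(d1_rows, d2_rows):
    """Build an award lookup from D2 (financial assistance) and D1 (procurement) rows."""
    awards = {}
    # D2 first, then D1, mirroring the order listed above.
    for row in list(d2_rows) + list(d1_rows):
        awards[row["award_key"]] = {"source": row["source"], "financial_records": []}
    return awards


def attach_file_c(awards, c_rows):
    """Attach File C rows to existing awards and return the rows with no match."""
    unmatched = []
    for row in c_rows:
        award = awards.get(row["award_key"])
        if award is None:
            unmatched.append(row)
        else:
            award["financial_records"].append(row)
    return unmatched


# Made-up sample rows, for illustration only.
d1 = [{"award_key": "A1", "source": "D1"}]
d2 = [{"award_key": "B7", "source": "D2"}]
file_c = [{"award_key": "A1", "amount": 100}, {"award_key": "Z9", "amount": 5}]

awards = load_awards(d1, d2)
unmatched = attach_file_c(awards, file_c)
print(len(awards["A1"]["financial_records"]), len(unmatched))
```

If `attach_file_c` ran against an empty award lookup, every File C row would come back unmatched, which is why the award files are loaded first.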

Historic USAspending Data

This section needs to be updated. The commands listed below may differ from those in the current version of the USAspending API.

Award data from the current USAspending site comes in two different formats:

Contracts
Assistance awards (which include grants, loans, and "other" awards)

Note: Current USAspending loaders are insert only (not update). Flush any existing data before running these loaders.

Loading Historic Contract Awards

Go to the current USAspending Data Download page.
Choose Contracts under "2. Select the Spending Type"
Select any other download parameters you'd like (agency, date range, etc.)
Make sure csv is selected under "5. Select Type of File"
Click Submit to download the contract award file
Once the file is downloaded, start the load by running:

python manage.py load_usaspending_contracts [path-to-contracts-file.csv]

Loading Historic Financial Assistance Awards

Go to the current USAspending Data Download page.
Choose Grants, Loans, or Other Financial Assistance under "2. Select the Spending Type"
Select any other download parameters you'd like (agency, date range, etc.)
Make sure csv is selected under "5. Select Type of File"
Click Submit to download the financial assistance award file
Once the file is downloaded, start the load by running:

python manage.py load_usaspending_assistance [path-to-assistance-file.csv]
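
Because the two historic loaders take the same kind of argument, a small helper can pick the right command for a downloaded file and reject anything that is not a csv. This is a sketch only: `historic_load_command` and `HISTORIC_LOADERS` are hypothetical names, the file name in the example is made up, and the command names are the ones listed in this document, which may have changed in newer versions of the USAspending API.

```python
from pathlib import Path

# Command names as listed in this document; they may be out of date.
HISTORIC_LOADERS = {
    "contracts": "load_usaspending_contracts",
    "assistance": "load_usaspending_assistance",
}


def historic_load_command(award_type, csv_path):
    """Build the manage.py argument list for a historic award load."""
    if award_type not in HISTORIC_LOADERS:
        raise ValueError(f"unknown award type: {award_type!r}")
    path = Path(csv_path)
    if path.suffix.lower() != ".csv":
        raise ValueError(f"expected a .csv file, got: {path.name}")
    return ["python", "manage.py", HISTORIC_LOADERS[award_type], str(path)]


# Hypothetical file name, for illustration only.
print(historic_load_command("assistance", "assistance_awards.csv"))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` from the directory that contains manage.py. Since the loaders are insert only, existing data still needs to be flushed first.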

Miscellaneous Data Loading Commands

python manage.py update_location_usage_flags - Updates all locations to have proper usage flags. This should be run after any set of submission loads to ensure the flags are properly set.
python manage.py load_executive_compensation --all - Loads executive compensation data for any currently loaded submissions. For more information on other options for this command, reference the command's help text.
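
When several submissions are loaded in one sitting, the follow-up commands above can be appended to the same batch so they are not forgotten. The sketch below builds that command sequence; `batch_load_commands` and `run_all` are hypothetical helpers, the submission IDs are made up, and running the executive compensation load last is an assumption rather than a documented requirement.

```python
import subprocess


def batch_load_commands(submission_ids):
    """Return manage.py argument lists: one load per submission, then the follow-ups."""
    commands = [
        ["python", "manage.py", "load_submission", str(submission_id)]
        for submission_id in submission_ids
    ]
    # Run once after the whole set of submission loads, as described above.
    commands.append(["python", "manage.py", "update_location_usage_flags"])
    # Assumed ordering: executive compensation covers the currently loaded submissions.
    commands.append(["python", "manage.py", "load_executive_compensation", "--all"])
    return commands


def run_all(commands, runner=subprocess.run):
    """Run each command in order, stopping at the first failure."""
    for command in commands:
        runner(command, check=True)


# Print the planned commands without executing them (IDs are hypothetical).
for command in batch_load_commands([1234, 1235]):
    print(" ".join(command))
```

Calling `run_all(batch_load_commands(ids))` from the directory that contains manage.py would execute the batch; the required environment variables must already be set.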
