BBP Flat File Format

From SCECpedia
Revision as of 22:09, 16 August 2017 by Fsilva (talk | contribs)

This page contains information regarding the BBP Flat File Format, used to collect data from BBP simulations.

The sample flat file above contains data for 50 realizations of the 1994 Northridge earthquake. It uses the GP method and includes data for all stations.

Export Script

The current script only works for simulations run on the cluster; a small change is required before it can generate a flat file for a single BBP simulation. Here's the current usage:

$ export_bbp_cluster_simulation.py -i <cluster_top_level_input_directory> -o <output_directory> [-c]

The top-level cluster directory is the same directory provided to the scripts that generated the cluster simulation (bbp_hpcc_validation.py). The output directory will be created if needed and will contain the output flat file (currently named bbl-summary-file.csv). The optional -c parameter causes the script to copy the time series from the various realizations into subdirectories inside the output directory. The result is a package, containing both the simulation parameters and the time series, that can be distributed.
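For anyone driving the export from another Python script, the usage above can be sketched as a small helper that assembles the argument list and the expected location of the flat file. This is only an illustration: the helper names (build_export_command, flat_file_path) and the example paths are hypothetical, and only the -i, -o and -c options and the bbl-summary-file.csv name come from this page.

```python
import os

# Script and output file names as given on this page.
SCRIPT = "export_bbp_cluster_simulation.py"
FLAT_FILE_NAME = "bbl-summary-file.csv"


def build_export_command(input_dir, output_dir, copy_timeseries=False):
    """Return the argument list for the export script (hypothetical helper)."""
    cmd = [SCRIPT, "-i", input_dir, "-o", output_dir]
    if copy_timeseries:
        # -c asks the script to copy the time series into the output directory.
        cmd.append("-c")
    return cmd


def flat_file_path(output_dir):
    """Return where the flat file is expected to be written."""
    return os.path.join(output_dir, FLAT_FILE_NAME)


if __name__ == "__main__":
    # Example paths are placeholders, not real locations.
    cmd = build_export_command("/path/to/cluster_sim", "/path/to/export",
                               copy_timeseries=True)
    print(" ".join(cmd))
    print(flat_file_path("/path/to/export"))
```

The returned list could be handed to subprocess.run(cmd, check=True), assuming the export script is executable and on the PATH.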

Assumptions

In the produced flat file, linked above, the following abbreviations were used:

  • NA - Not Available
  • TBC - To Be Calculated
  • TBD - To Be Determined

Also, the following assumptions were made about these fields:

  • Simulation Workflow Description: BBP modules used in the workflow, separated by a '/'
  • Site Effects Model: Currently GP2014 or None
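As a rough sketch of how these two fields could be filled in, the snippet below joins a list of module names with '/' and restricts the site effects value to GP2014 or None. The function names and the module names in the example ("ModuleA", "ModuleB", "ModuleC") are placeholders for illustration, not actual BBP identifiers.

```python
# Site effects values currently listed on this page.
KNOWN_SITE_EFFECTS = ("GP2014",)


def workflow_description(modules):
    """Join the BBP module names used in a workflow with '/'."""
    return "/".join(modules)


def site_effects_model(name=None):
    """Return the Site Effects Model field: 'GP2014' or 'None'."""
    if name is None:
        return "None"
    if name not in KNOWN_SITE_EFFECTS:
        raise ValueError("unknown site effects model: %s" % name)
    return name


if __name__ == "__main__":
    # Placeholder module names, not real BBP modules.
    print(workflow_description(["ModuleA", "ModuleB", "ModuleC"]))
    print(site_effects_model("GP2014"))
    print(site_effects_model())
```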


Additionally:

  • The information still missing from the General category (Fault ID, Fault Name, etc.) could be provided either as a separate file that would serve as input to the flat file generation script, or as additional keys in the SRC file. In the second case, the BBP would ignore the extra keys, and the script would check for their presence before writing "TBD" or "NA" to the file.
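The second option could look roughly like the sketch below, which checks for an extra key and falls back to a placeholder when it is absent. It assumes the SRC file consists of "KEY = value" lines; the key names FAULT_NAME and FAULT_ID and both function names are hypothetical and do not describe the current export script.

```python
def parse_src_text(text):
    """Parse 'KEY = value' lines into a dict (assumed SRC file layout)."""
    keys = {}
    for line in text.splitlines():
        if "=" not in line:
            continue  # skip blank or malformed lines
        key, value = line.split("=", 1)
        keys[key.strip().upper()] = value.strip()
    return keys


def general_field(src_keys, key, placeholder="TBD"):
    """Return the value for key if present and non-empty, else the placeholder."""
    value = src_keys.get(key.upper(), "")
    return value if value else placeholder


if __name__ == "__main__":
    # FAULT_NAME is a hypothetical extra key added for the flat file.
    sample = "MAGNITUDE = 6.73\nFAULT_NAME = Northridge\n"
    src_keys = parse_src_text(sample)
    print(general_field(src_keys, "FAULT_NAME"))      # key present
    print(general_field(src_keys, "FAULT_ID"))        # key missing
    print(general_field(src_keys, "FAULT_ID", "NA"))  # alternate placeholder
```

Keeping the placeholder as a parameter lets the caller decide per field whether a missing value should be reported as "TBD" or "NA".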