Standard Event Format for Seismic Event Data

Folder structure:

  • /SeismicEventData/
    • data folder:
      • ci38443183.h5
      • ...
    • phase_picks folder:
      • ci38443183.csv
      • ...
    • stations.json
    • catalog.csv
    • meta_info.txt
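
The layout above pairs each waveform file with a pick file of the same name. A minimal sketch of a consistency check follows, assuming the two subfolders are literally named `data` and `phase_picks`; the function name `find_unpaired_events` is hypothetical and not part of the format.

```python
from pathlib import Path


def find_unpaired_events(root):
    """Return (waveform IDs without picks, pick IDs without waveforms)."""
    root = Path(root)
    # Assumed layout: <root>/data/<event_id>.h5 and <root>/phase_picks/<event_id>.csv
    waveform_ids = {p.stem for p in (root / "data").glob("*.h5")}
    pick_ids = {p.stem for p in (root / "phase_picks").glob("*.csv")}
    return sorted(waveform_ids - pick_ids), sorted(pick_ids - waveform_ids)
```

Calling `find_unpaired_events("/SeismicEventData")` on a complete dataset should return two empty lists.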

Waveform format in the data folder:

For simplicity, we use the M6.4 Ridgecrest earthquake as an example. We recommend storing raw data without preprocessing such as filtering. In the dataset we are using, we place the first P pick among all stations at 30 s and cut a 120 s window. This convention can be changed to suit different seismic networks.
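
The windowing convention can be sketched as follows. This is only an illustration: the helper name `window_bounds`, its parameter names, and the sample P pick time are assumptions, and the 30 s offset and 120 s length are defaults that a different network may change.

```python
from datetime import datetime, timedelta


def window_bounds(first_p_time, p_offset_s=30.0, window_s=120.0):
    """Return (begin_time, end_time) so the first P pick sits p_offset_s into the window."""
    first_p = datetime.fromisoformat(first_p_time)
    begin = first_p - timedelta(seconds=p_offset_s)
    end = begin + timedelta(seconds=window_s)
    # Keep microsecond precision so the strings have a fixed width.
    return (begin.isoformat(timespec="microseconds"),
            end.isoformat(timespec="microseconds"))


# Illustrative first P pick time (not taken from a real pick file).
begin_time, end_time = window_bounds("2019-07-04T17:33:52.000000+00:00")
```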

  • File name: ci38443183.h5
    • "data": (group)
    • "data".attrs:
      • "event_id": ci38443183 (str)
      • "event_time": 2019-07-04T17:33:49.000000+00:00 (str)
      • "event_time_index": 2518 (int) [1]
      • "begin_time": 2019-07-04T17:33:19.000000+00:00 (str)
      • "end_time": 2019-07-04T17:35:19.000000+00:00 (str)
      • "latitude": 35.705 (float)
      • "longitude": -117.504 (float)
      • "depth_km": 10.5 (float)
      • "magnitude": 6.4 (float)
      • "magnitude_type": Mw (str)
      • "source": CI (str)
    • "data/CI.RJOB..EH": (dataset; shape: 3×nt; unit: μm/s; float32)
    • "data/CI.RJOB..EH".attrs:
      • "network": CI (str)
      • "station": RJOB (str)
      • "location": "" (str)
      • "latitude": 35.705 (float)
      • "longitude": -117.504 (float)
      • "elevation_m": 10.0 (float)
      • "local_depth_m": -3.0 (float) [2]
      • "component": [E,N,Z] (list of str)
      • "distance_km": 19.2 (float32)
      • "takeoff_angle": 12.0 (float32)
      • "azimuth": 35.3 (float32)
      • "back_azimuth": 152.1 (float32)
      • "dt_s": 0.01 (float)
      • "unit": 1e-6 m/s (str)
      • "snr": [1.1, 2.3, 2.0] (list of float)
      • "phase_type": [P, S, …] (list of str)
      • "phase_index": [3000, 3023, …] (list of int) [3]
      • "phase_score": [1.0, 0.9, …] (list of float)
      • "phase_time": [2022-04-26T13:50:65.160000+00:00, … ] (list of str)
      • "phase_polarity": [U, D, N, …] (list of str) [4]
      • "event_id": [ci38443183, ci38443183, ...] (list of str; multiple events in a window)
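
The index attributes ("event_time_index", "phase_index") and the time attributes are related through "begin_time" and "dt_s". A minimal sketch of the conversion, assuming index 0 corresponds to "begin_time" and samples are evenly spaced; the function names are hypothetical.

```python
from datetime import datetime, timedelta


def time_to_index(time_str, begin_time_str, dt_s):
    """Sample index of an ISO 8601 timestamp, counted from begin_time."""
    elapsed = datetime.fromisoformat(time_str) - datetime.fromisoformat(begin_time_str)
    # Round to the nearest sample to absorb floating-point error.
    return round(elapsed.total_seconds() / dt_s)


def index_to_time(index, begin_time_str, dt_s):
    """ISO 8601 timestamp of a sample index, counted from begin_time."""
    t = datetime.fromisoformat(begin_time_str) + timedelta(seconds=index * dt_s)
    return t.isoformat(timespec="microseconds")
```

With "dt_s" = 0.01 and the 30 s convention, a pick at index 3000 lies exactly 30 s after "begin_time".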

Phase pick format in the phase_picks folder:

The file name should match that of the corresponding HDF5 file. We recommend using a comma (,) as the delimiter of the CSV file.

  • File name: ci38443183.csv
  • Headers: station_id,phase_index,phase_time,phase_score,phase_type,phase_polarity
  • dtype: str,int32,str,float32,str,str
  • e.g.: CI.RJOB..EH,3000,2019-07-04T17:33:52.000000+00:00,0.98,P,U
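
A pick file in this format can be read with Python's standard `csv` module. The sketch below assumes the first row of the file is the header row listed above; `PICK_DTYPES` and `read_picks` are hypothetical names, and the sample row is illustrative.

```python
import csv
import io

# Column types from the dtype list above (int32/float32 map to Python int/float).
PICK_DTYPES = {
    "station_id": str,
    "phase_index": int,
    "phase_time": str,
    "phase_score": float,
    "phase_type": str,
    "phase_polarity": str,
}


def read_picks(handle):
    """Parse an open pick file into a list of dicts with typed values."""
    reader = csv.DictReader(handle, delimiter=",")
    return [{name: PICK_DTYPES[name](value) for name, value in row.items()}
            for row in reader]


# In-memory stand-in for phase_picks/ci38443183.csv.
sample = io.StringIO(
    "station_id,phase_index,phase_time,phase_score,phase_type,phase_polarity\n"
    "CI.RJOB..EH,3000,2019-07-04T17:33:52.000000+00:00,0.98,P,U\n"
)
picks = read_picks(sample)
```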

stations.json format:

The stations.json file contains station location information.

{
    "CI.CCC..BH": {
        "longitude": -117.36453,
        "latitude": 35.52495,
        "elevation_m": 670,
        "local_depth_m": -3,
        "component": ["E","N","Z"],
        "sensitivity": [627368000.0,627368000.0,627368000.0],
        "unit": "m/s"
    },
    .... (next station)
}
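
The station keys appear to follow a network.station.location.channel pattern, matching the "network", "station", and "location" attributes in the waveform file. A minimal sketch of loading the file and splitting a key, assuming that pattern holds; `split_station_id` and the field name `channel_prefix` are hypothetical.

```python
import json


def split_station_id(station_id):
    """Split an ID such as 'CI.CCC..BH' into its four dot-separated parts."""
    network, station, location, channel_prefix = station_id.split(".")
    return {
        "network": network,
        "station": station,
        "location": location,  # may be an empty string
        "channel_prefix": channel_prefix,  # channel code without the component letter
    }


# In-memory stand-in for stations.json with a single station.
stations = json.loads("""
{
    "CI.CCC..BH": {
        "longitude": -117.36453,
        "latitude": 35.52495,
        "elevation_m": 670,
        "local_depth_m": -3,
        "component": ["E", "N", "Z"],
        "sensitivity": [627368000.0, 627368000.0, 627368000.0],
        "unit": "m/s"
    }
}
""")
parts = {sid: split_station_id(sid) for sid in stations}
```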

catalog.csv format:

The catalog.csv file contains earthquake event information.

  • Headers: event_id,time,latitude,longitude,depth_km,magnitude,magnitude_type,source
  • dtype: str,str,float,float,float,float,str,str
  • e.g.: ci38443183,2019-07-04T17:33:49.000000+00:00,35.705,-117.504,10.5,6.4,Mw,CI
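
Writing a catalog in this column order can be sketched with `csv.DictWriter`. The names `CATALOG_HEADERS` and `write_catalog` are hypothetical, and the example writes to an in-memory buffer rather than to catalog.csv.

```python
import csv
import io

CATALOG_HEADERS = ["event_id", "time", "latitude", "longitude",
                   "depth_km", "magnitude", "magnitude_type", "source"]


def write_catalog(handle, events):
    """Write event dicts as comma-delimited rows in the header order above."""
    writer = csv.DictWriter(handle, fieldnames=CATALOG_HEADERS,
                            delimiter=",", lineterminator="\n")
    writer.writeheader()
    writer.writerows(events)


buffer = io.StringIO()
write_catalog(buffer, [{
    "event_id": "ci38443183",
    "time": "2019-07-04T17:33:49.000000+00:00",
    "latitude": 35.705,
    "longitude": -117.504,
    "depth_km": 10.5,
    "magnitude": 6.4,
    "magnitude_type": "Mw",
    "source": "CI",
}])
catalog_text = buffer.getvalue()
```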

meta_info.txt format:

This file contains other useful information about the dataset.

e.g.:

Earthquake number: 145483

Time range: 2019-06-01T00:00:00.000000+00:00 - 2020-06-01T00:00:00.000000+00:00

Spatial range: (min_longitude, max_longitude, min_latitude, max_latitude) = (-122, -112, 30, 40)

Magnitude range: (-1.0, 8.0)


Notes:

  1. The index of the data point at the event origin time.

  2. The sensor depth for borehole data.

  3. The index of the data point at the picked phase time.

  4. U: upgoing; D: downgoing; N: unknown.