GEOH5 Format


About

The GEOH5 format aims to provide a robust means of handling large quantities of diverse data required in geoscience. The file structure builds on the generic qualities of the Geoscience ANALYST data model, and attempts to maintain a certain level of simplicity and consistency throughout. It is based entirely on free and open HDF5 technology. Given that this specification is public, the file format could, with further investment and involvement, become a useful exchange format for the broader geoscientific community.

Why GEOH5?

  • Leverages properties of HDF5.

    Fast I/O, compression, and cross-platform support.

  • Content readable and writeable by third-party software.

    We recommend using HDFView, along with Geoscience ANALYST, when learning the format.

  • Easily extensible to new data types.

    Geoscience ANALYST is intended to preserve data it does not understand, and to be generally tolerant of missing information, when loading and saving geoh5 files. This will allow third parties to write to this format fairly easily, and to include additional information not covered by this spec for their own purposes. In the current implementation, however, Geoscience ANALYST automatically removes unnecessary information on save.

Definition

The following sections define the structure and components making up the GEOH5 file format.

Standards

General notes on formatting.

  • All text data and attributes are variable-length and use UTF-8 encoding

  • All numeric data uses Intel PC native types (that is, little-endian byte order)

  • Boolean values are stored using char (0:false, 1:true)

  • Anything found in a geoh5 v1.0 file that is not mentioned in this document is optional information
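
The conventions above can be illustrated at the byte level. The following is a minimal sketch using only the Python standard library, not an HDF5 writer and not part of the specification. The helper names (encode_text, encode_bool, encode_int32, encode_float64, decode_bool) are hypothetical, and "INTEL PC native types" is assumed here to mean little-endian byte order with 32-bit integers and 64-bit IEEE 754 doubles.

```python
import struct


def encode_text(value: str) -> bytes:
    """Encode text as UTF-8, the encoding used for all text data and attributes."""
    return value.encode("utf-8")


def encode_bool(flag: bool) -> bytes:
    """Store a boolean as a single char: 0 for false, 1 for true."""
    return struct.pack("<b", 1 if flag else 0)


def encode_int32(value: int) -> bytes:
    """Pack a 32-bit signed integer, little-endian (assumed 'Intel PC native')."""
    return struct.pack("<i", value)


def encode_float64(value: float) -> bytes:
    """Pack a 64-bit IEEE 754 double, little-endian (assumed 'Intel PC native')."""
    return struct.pack("<d", value)


def decode_bool(raw: bytes) -> bool:
    """Read back a char-encoded boolean; any non-zero value is treated as true."""
    return struct.unpack("<b", raw)[0] != 0


if __name__ == "__main__":
    print(encode_text("Géologie"))      # UTF-8 bytes; length varies with content
    print(encode_bool(True).hex())      # single byte holding 1
    print(encode_int32(1).hex())        # least significant byte first
    print(encode_float64(1.0).hex())    # least significant byte first
```

In practice an HDF5 library handles these encodings when the matching datatypes are selected; the sketch only shows what the stored values look like under the stated assumptions.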