netcdftools¶

This module extends the features of module filetools for loading data from and storing data to netCDF4 files, consistent with the NetCDF Climate and Forecast (CF) Metadata Conventions.

Usually, the features implemented in this module are applied only indirectly in three steps:

Call either method open_netcdfreader() or method open_netcdfwriter() of the SequenceManager object available in module pub to prepare a NetCDFInterface object for reading or writing.

Call either the usual reading or writing methods of other HydPy classes (e.g. method load_fluxseries() of class HydPy or method save_stateseries() of class Elements). The prepared NetCDFInterface object collects all read or write requests of those sequences that are supposed to be read from or written to NetCDF files.

Finalize reading or writing by calling either method close_netcdfreader() or method close_netcdfwriter().

Step 2 is a logging process only, telling the NetCDFInterface object which data needs to be read or written. The actual reading from or writing to NetCDF files is triggered by step 3.

During step 2, the NetCDFInterface object, as well as its subobjects, are accessible, allowing to inspect their current state or to modify their behaviour.

The following real code examples show how to perform these three steps both for reading and writing data, based on the example configuration defined by function prepare_io_example_1():

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()

(1) We prepare a NetCDFInterface object for writing data by calling method open_netcdfwriter():

>>> from hydpy import pub
>>> pub.sequencemanager.open_netcdfwriter()

(2) We tell the SequenceManager to read and write all time series data from and to NetCDF files placed within a folder called example (In real cases you would not write the “with TestIO:” line. This code block makes sure we are polluting the IO testing directory instead of our current working directory):

>>> pub.sequencemanager.generalfiletype = "nc"
>>> from hydpy import TestIO
>>> with TestIO():
...     pub.sequencemanager.generaldirpath = "example"

(3) We store all the time series handled by the Node and Element objects of the example dataset by calling save_allseries() of class Nodes and save_allseries() of class Elements:

>>> nodes.save_allseries()
>>> elements.save_allseries()

(4) We again log all sequences, but this time after telling the SequenceManager to average each time series spatially:

>>> pub.sequencemanager.generalaggregation = "mean"
>>> nodes.save_allseries()
>>> elements.save_allseries()

(5) We can now navigate into the details of the logged time series data via the NetCDFInterface object and its subobjects. For example, we can find out which flux sequence objects of type NKor belonging to application model lland_v1 have been logged (those of elements element1 and element2):

>>> writer = pub.sequencemanager.netcdfwriter
>>> writer.lland_v1.flux_nkor.subdevicenames
('element1', 'element2')

(6) In the example discussed here, all sequences are stored within the same folder (example). Storing sequences in separate folders goes hand in hand with storing them in separate NetCDF files, of course. In such cases, you have to include the folder into the attribute name:

>>> writer.foldernames
('example',)
>>> writer.example_lland_v1.flux_nkor.subdevicenames
('element1', 'element2')

(7) We close the NetCDFInterface object, which is the moment where the writing process happens. After that, the interface object is not available anymore:

>>> with TestIO():
...     pub.sequencemanager.close_netcdfwriter()
>>> pub.sequencemanager.netcdfwriter
Traceback (most recent call last):
...
RuntimeError: The sequence file manager does currently handle no NetCDF writer object.

(8) We set the time series values of two test sequences to zero, which serves the purpose to demonstrate that reading the data back in actually works:

>>> nodes.node2.sequences.sim.series = 0.0
>>> elements.element2.model.sequences.fluxes.nkor.series = 0.0

(9) We move up a gear and and prepare a NetCDFInterface object for reading data, log all NodeSequence and ModelSequence objects, and read their time series data from the created NetCDF file (note that we disable option checkseries temporarily, to prevent from raising an exception when reading incomplete data from file):

>>> pub.sequencemanager.open_netcdfreader()
>>> nodes.load_simseries()
>>> elements.load_allseries()
>>> with TestIO():
...     with pub.options.checkseries(False):
...         pub.sequencemanager.close_netcdfreader()

We check if the data is available via the test sequences again:

>>> nodes.node2.sequences.sim.series
InfoArray([ 64.,  65.,  66.,  67.])
>>> elements.element2.model.sequences.fluxes.nkor.series
InfoArray([[ 16.,  17.],
           [ 18.,  19.],
           [ 20.,  21.],
           [ 22.,  23.]])
>>> pub.sequencemanager.netcdfreader
Traceback (most recent call last):
...
RuntimeError: The sequence file manager does currently handle no NetCDF reader object.

(11) The process of spatial aggregation cannot be inverted. Hence reading averaged time series is left for postprocessing tools. To show that writing the averaged series worked, we access both relevant NetCDF files more directly using the underlying NetCDF4 library (note that averaging 1-dimensional time series as those of node sequence Sim is allowed for the sake of consistency):

>>> from hydpy.core.netcdftools import netcdf4
>>> from numpy import array
>>> with TestIO():
...     with netcdf4.Dataset("example/node.nc") as ncfile:
...         array(ncfile["sim_q_mean"][:])
array([[ 60.,  61.,  62.,  63.]])
>>> with TestIO():
...     with netcdf4.Dataset("example/lland_v1.nc") as ncfile:
...         array(ncfile["flux_nkor_mean"][:])[1]
array([ 16.5,  18.5,  20.5,  22.5])

The described workflow is, besides the testing related specialities, more or less standard and can be modified in many ways, which are described in the documentation of the different features implemented in module netcdftools, but also in the documentation on class SequenceManager of module filetools and class IOSequence of module sequencetools.

The examples above give little insight into the resulting/required structure of NetCDF files. One should at least be aware of the optional arguments flatten, isolate, and timeaxis. When “flattening” data, multidimensional time series are handled as a larger number of 1-dimensional time series. When “isolating” data, all IOSequence objects of a specific subclass belong to one single NetCDF file. When selecting the first axis as the time axis (by setting timeaxis to zero), we increase the speed of “spatial access”, but decrease the speed of “time series access”, which is the focus of the default configuration (where timeaxis is one). When reading a NetCDF file, one has to choose the same options used for writing.

The following test shows that both open_netcdfwriter() and open_netcdfreader() pass the mentioned arguments correctly to the constructor of NetCDFInterface:

>>> from unittest.mock import patch
>>> with patch("hydpy.core.netcdftools.NetCDFInterface") as mock:
...     pub.sequencemanager.open_netcdfwriter(
...         flatten=True, isolate=True, timeaxis=0)
...     mock.assert_called_with(flatten=True, isolate=True, timeaxis=0)
...     pub.sequencemanager.open_netcdfreader(
...         flatten=True, isolate=True, timeaxis=0)
...     mock.assert_called_with(flatten=True, isolate=True, timeaxis=0)

Both methods take the current values of the options flattennetcdf, isolatenetcdf, and timeaxisnetcdf as default arguments:

>>> with patch("hydpy.core.netcdftools.NetCDFInterface") as mock:
...     pub.sequencemanager.open_netcdfwriter()
...     mock.assert_called_with(
...         flatten=pub.options.flattennetcdf,
...         isolate=pub.options.isolatenetcdf,
...         timeaxis=pub.options.timeaxisnetcdf)
...     pub.sequencemanager.open_netcdfreader()
...     mock.assert_called_with(
...         flatten=pub.options.flattennetcdf,
...         isolate=pub.options.isolatenetcdf,
...         timeaxis=pub.options.timeaxisnetcdf)

Module netcdftools implements the following members:

IntOrSlice Type variable.

str2chars() Return ndarray containing the byte characters (second axis) of all given strings (first axis).

chars2str() Inversion function of function str2chars().

create_dimension() Add a new dimension with the given name and length to the given NetCDF file.

create_variable() Add a new variable with the given name, datatype, and dimensions to the given NetCDF file.

query_variable() Return the variable with the given name from the given NetCDF file.

query_timegrid() Return the Timegrid defined by the given NetCDF file.

query_array() Return the data of the variable with the given name from the given NetCDF file.

get_filepath() Return the filepath of the given NetCDF file.

NetCDFInterface Interface between SequenceManager and multiple NetCDF files.

NetCDFFile Handles a single NetCDF file.

Subdevice2Index Return type of method query_subdevice2index().

NetCDFVariableBase Base class for NetCDFVariableDeep, NetCDFVariableAgg, and NetCDFVariableFlat.

DeepAndAggMixin Mixin class for NetCDFVariableDeep and NetCDFVariableAgg

AggAndFlatMixin Mixin class for NetCDFVariableAgg and NetCDFVariableFlat.

NetCDFVariableDeep Relates some objects of a specific IOSequence subclass with a single NetCDF variable without modifying dimensionality.

NetCDFVariableAgg Relates some objects of a specific IOSequence subclass with a single NetCDF variable when aggregation of data is required.

NetCDFVariableFlat Relates some objects of a specific IOSequence subclass with a single NetCDF variable when flattening of data is required.

hydpy.core.netcdftools.dimmapping = {'nmb_characters': 'char_leng_name', 'nmb_subdevices': 'stations', 'nmb_timepoints': 'time'}¶

Maps dimension name terms from HydPy terms NetCDF.

You can change this mapping if it does not suit your requirements. For example, change the value of the keyword “nmb_subdevices”, if you prefer to call this dimension “location” instead of “stations” within NetCDF files:

>>> from hydpy.core.netcdftools import dimmapping
>>> dimmapping["nmb_subdevices"] = "location"

hydpy.core.netcdftools.varmapping = {'subdevices': 'station_id', 'timepoints': 'time'}¶

Maps variable name terms from HydPy terms NetCDF.

You can change this mapping if it does not suit your requirements. For example, change the value of the keyword “timepoints”, if you prefer to call this variable “period” instead of “time” within NetCDF files:

>>> from hydpy.core.netcdftools import varmapping
>>> varmapping["timepoints"] = "period"

hydpy.core.netcdftools.fillvalue = nan¶

Default fill value for writing NetCDF files.

You can set another float value before writing a NetCDF file:

>>> from hydpy.core import netcdftools
>>> netcdftools.fillvalue = -777.0

hydpy.core.netcdftools.str2chars(strings) → numpy.ndarray [source]¶

Return ndarray containing the byte characters (second axis) of all given strings (first axis).

>>> from hydpy.core.netcdftools import str2chars
>>> str2chars(["zeros", "ones"])
array([[b'z', b'e', b'r', b'o', b's'],
       [b'o', b'n', b'e', b's', b'']],
      dtype='|S1')

>>> str2chars([])
array([], shape=(0, 0),
      dtype='|S1')

hydpy.core.netcdftools.chars2str(chars) → List[str][source]¶

Inversion function of function str2chars().

>>> from hydpy.core.netcdftools import chars2str

>>> chars2str([[b"z", b"e", b"r", b"o", b"s"],
...            [b"o", b"n", b"e", b"s", b""]])
['zeros', 'ones']

>>> chars2str([])
[]

hydpy.core.netcdftools.create_dimension(ncfile, name, length) → None [source]¶

Add a new dimension with the given name and length to the given NetCDF file.

Essentially, create_dimension() just calls the equally named method of the NetCDF library, but adds information to possible error messages:

>>> from hydpy import TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> with TestIO():
...     ncfile = netcdf4.Dataset("test.nc", "w")
>>> from hydpy.core.netcdftools import create_dimension
>>> create_dimension(ncfile, "dim1", 5)
>>> dim = ncfile.dimensions["dim1"]
>>> dim.size if hasattr(dim, "size") else dim
5

>>> try:
...     create_dimension(ncfile, "dim1", 5)
... except BaseException as exc:
...     print(exc)    
While trying to add dimension `dim1` with length `5` to the NetCDF file `test.nc`, the following error occurred: ...

>>> ncfile.close()

hydpy.core.netcdftools.create_variable(ncfile, name, datatype, dimensions) → None [source]¶

Add a new variable with the given name, datatype, and dimensions to the given NetCDF file.

Essentially, create_variable() just calls the equally named method of the NetCDF library, but adds information to possible error messages:

>>> from hydpy import TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> with TestIO():
...     ncfile = netcdf4.Dataset("test.nc", "w")
>>> from hydpy.core.netcdftools import create_variable
>>> try:
...     create_variable(ncfile, "var1", "f8", ("dim1",))
... except BaseException as exc:
...     print(str(exc).strip('"'))    
While trying to add variable `var1` with datatype `f8` and dimensions `('dim1',)` to the NetCDF file `test.nc`, the following error occurred: ...

>>> from hydpy.core.netcdftools import create_dimension
>>> create_dimension(ncfile, "dim1", 5)
>>> create_variable(ncfile, "var1", "f8", ("dim1",))
>>> import numpy
>>> numpy.array(ncfile["var1"][:])
array([ nan,  nan,  nan,  nan,  nan])

>>> ncfile.close()

hydpy.core.netcdftools.query_variable(ncfile, name) → netCDF4._netCDF4.Variable[source]¶

Return the variable with the given name from the given NetCDF file.

Essentially, query_variable() just performs a key assess via the used NetCDF library, but adds information to possible error messages:

>>> from hydpy.core.netcdftools import query_variable
>>> from hydpy import TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> with TestIO():
...     file_ = netcdf4.Dataset("model.nc", "w")
>>> query_variable(file_, "flux_prec")
Traceback (most recent call last):
...
OSError: NetCDF file `model.nc` does not contain variable `flux_prec`.

>>> from hydpy.core.netcdftools import create_variable
>>> create_variable(file_, "flux_prec", "f8", ())
>>> isinstance(query_variable(file_, "flux_prec"), netcdf4.Variable)
True

>>> file_.close()

hydpy.core.netcdftools.query_timegrid(ncfile) → hydpy.core.timetools.Timegrid [source]¶

Return the Timegrid defined by the given NetCDF file.

>>> from hydpy.examples import prepare_full_example_1
>>> prepare_full_example_1()
>>> from hydpy import TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> from hydpy.core.netcdftools import query_timegrid
>>> filepath = "LahnH/series/input/hland_v1_input_t.nc"
>>> with TestIO():
...     with netcdf4.Dataset(filepath) as ncfile:
...         query_timegrid(ncfile)
Timegrid("1996-01-01 00:00:00",
         "2007-01-01 00:00:00",
         "1d")

hydpy.core.netcdftools.query_array(ncfile, name) → numpy.ndarray [source]¶

Return the data of the variable with the given name from the given NetCDF file.

The following example shows that query_array() returns nan entries to represent missing values even when the respective NetCDF variable defines a different fill value:

>>> from hydpy import TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> from hydpy.core import netcdftools
>>> netcdftools.fillvalue = -999.0
>>> with TestIO():
...     with netcdf4.Dataset("test.nc", "w") as ncfile:
...         netcdftools.create_dimension(ncfile, "dim1", 5)
...         netcdftools.create_variable(ncfile, "var1", "f8", ("dim1",))
...     ncfile = netcdf4.Dataset("test.nc", "r")
>>> netcdftools.query_variable(ncfile, "var1")[:].data
array([-999., -999., -999., -999., -999.])
>>> netcdftools.query_array(ncfile, "var1")
array([ nan,  nan,  nan,  nan,  nan])
>>> import numpy
>>> netcdftools.fillvalue = numpy.nan
>>> ncfile.close()

hydpy.core.netcdftools.get_filepath(ncfile) → str [source]¶

Return the filepath of the given NetCDF file.

>>> from hydpy import TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> from hydpy.core.netcdftools import get_filepath
>>> with TestIO():
...     with netcdf4.Dataset("test.nc", "w") as ncfile:
...         get_filepath(ncfile)
'test.nc'

class hydpy.core.netcdftools.NetCDFInterface(flatten, isolate, timeaxis)[source]¶

Bases: object

Interface between SequenceManager and multiple NetCDF files.

The core task of class NetCDFInterface is to distribute different IOSequence objects on different instances of class NetCDFFile.

(1) We prepare a SequenceManager object and some devices handling different sequences by applying function prepare_io_example_1():

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()

We collect all sequences to be used in the following examples:

>>> sequences = []
>>> for node in nodes:
...     sequences.append(node.sequences.sim)
>>> for element in elements:
...     sequences.append(element.model.sequences.inputs.nied)
...     sequences.append(element.model.sequences.fluxes.nkor)

(3) We prepare a NetCDFInterface object and log and write all test sequences. Due to setting flatten to False, NetCDFInterface initialises one NetCDFFile object for handling the NodeSequence objects, two NetCDFFile objects for handling the InputSequence objects of application models lland_v1 and lland_v2, respectively, and two NetCDFFile objects for handling the FluxSequence objects of application models lland_v1 and lland_v2, respectively. Sequences of a specific type of model and nodes are always handled in separate NetCDF files, to avoid name conflicts. InputSequence and FluxSequence objects can only be stored in the same NetCDF file when one wants to store them in the same folder, of course, which is not the case in the given example. This should become clear when looking at the following attempts to query the NetCDFFile objects related to lland_v2:

>>> from hydpy.core.netcdftools import NetCDFInterface
>>> interface = NetCDFInterface(flatten=False, isolate=False, timeaxis=1)
>>> for sequence in sequences:
...     interface.log(sequence, sequence.series)
...     interface.log(sequence, sequence.average_series())
>>> interface.filenames
('lland_v1', 'lland_v2', 'node')
>>> interface.node.variablenames
('sim_q', 'sim_q_mean', 'sim_t', 'sim_t_mean')
>>> interface.node == interface.nodepath_node
True
>>> interface.lland_v2
Traceback (most recent call last):
...
AttributeError: The current NetCDFInterface object does handle multiple NetCDFFile objects named `lland_v2`.  Please be more specific.
>>> hasattr(interface, "outputpath_lland_v2")
True
>>> "outputpath_lland_v2" in dir(interface)
True
>>> interface.lland_v3
Traceback (most recent call last):
...
AttributeError: The current NetCDFInterface object does neither handle a NetCDFFile object named `lland_v3` nor does it define a member named `lland_v3`.

(4) We store all NetCDF files into the inputpath, outputpath, and nodepath folders of the testing directory, define by prepare_io_example_1():

>>> from hydpy import TestIO
>>> with TestIO():
...     interface.write()

(5) We define a shorter initialisation period and re-activate the time series of the test sequences:

>>> from hydpy import pub
>>> pub.timegrids = "02.01.2000", "04.01.2000", "1d"
>>> for sequence in sequences:
...     sequence.activate_ram()

(6) We again initialise class NetCDFInterface, log all test sequences, and read the test data of the defined subperiod:

>>> from hydpy.core.netcdftools import NetCDFInterface
>>> interface = NetCDFInterface(flatten=False, isolate=False, timeaxis=1)
>>> for sequence in sequences:
...     interface.log(sequence, None)

>>> with TestIO():
...     interface.read()
>>> nodes.node1.sequences.sim.series
InfoArray([ 61.,  62.])
>>> elements.element2.model.sequences.fluxes.nkor.series
InfoArray([[ 18.,  19.],
           [ 20.,  21.]])

(7) We repeat the above steps, except that we set both flatten and isolate to True. The relevant difference is that NetCDFInterface now initialises a new NetCDFFile object for each sequence type, resulting in a larger number separate NetCDF files containing only one NetCDF variable:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()

>>> sequences = []
>>> for node in nodes:
...     sequences.append(node.sequences.sim)
>>> for element in elements:
...     sequences.append(element.model.sequences.inputs.nied)
...     sequences.append(element.model.sequences.fluxes.nkor)

>>> interface = NetCDFInterface(flatten=True, isolate=True, timeaxis=1)
>>> for sequence in sequences:
...     interface.log(sequence, sequence.series)
...     interface.log(sequence, sequence.average_series())
>>> from pprint import pprint
>>> pprint(interface.filenames)
('lland_v1_flux_nkor',
 'lland_v1_flux_nkor_mean',
 'lland_v1_input_nied',
 'lland_v1_input_nied_mean',
 'lland_v2_flux_nkor',
 'lland_v2_flux_nkor_mean',
 'lland_v2_input_nied',
 'lland_v2_input_nied_mean',
 'node_sim_q',
 'node_sim_q_mean',
 'node_sim_t',
 'node_sim_t_mean')
>>> interface.lland_v1_input_nied_mean.variablenames
('input_nied_mean',)

>>> from hydpy import pub, TestIO
>>> with TestIO():
...     pub.sequencemanager.outputpath = ""
...     interface.write()

>>> from hydpy import pub
>>> pub.timegrids = "02.01.2000", "04.01.2000", "1d"
>>> for sequence in sequences:
...     sequence.activate_ram()

>>> interface = NetCDFInterface(flatten=True, isolate=True, timeaxis=1)
>>> for sequence in sequences:
...     interface.log(sequence, None)
>>> with TestIO():
...     interface.read()
>>> nodes.node1.sequences.sim.series
InfoArray([ 61.,  62.])
>>> elements.element2.model.sequences.fluxes.nkor.series
InfoArray([[ 18.,  19.],
           [ 20.,  21.]])

(8) We technically confirm that the isolate and timeaxis arguments are passed to the constructor of class NetCDFFile correctly:

>>> from unittest.mock import patch
>>> with patch("hydpy.core.netcdftools.NetCDFFile") as mock:
...     interface = NetCDFInterface(
...         flatten=True, isolate=False, timeaxis=0)
...     interface.log(sequences[0], sequences[0].series)
...     mock.assert_called_once_with(
...         name="node",
...         flatten=True, isolate=False, timeaxis=0,
...         dirpath="nodepath")

log(sequence, infoarray) → None [source]¶: Prepare a NetCDFFile object suitable for the given IOSequence object, when necessary, and pass the given arguments to its log() method.

read() → None [source]¶: Call method read() of all handled NetCDFFile objects.

write() → None [source]¶: Call method write() of all handled NetCDFFile objects.

property foldernames¶: A tuple of names of all folders the sequences shall be read from or written to.

property filenames¶: A tuple of names of all handled NetCDFFile objects.

class hydpy.core.netcdftools.NetCDFFile(name: str, flatten, isolate, timeaxis, dirpath)[source]¶

Bases: object

Handles a single NetCDF file.

The core task of class NetCDFFile is to distribute different IOSequence objects on different instances of NetCDFVariableBase subclasses. The documentation on the method log() explains this in detail. Here we focus on how a NetCDFFile object triggers the reading and writing functionalities of its subobjects.

(1) We prepare a SequenceManager object and some devices handling different sequences by applying function prepare_io_example_1():

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, (element1, element2, element3) = prepare_io_example_1()

(2) We define two shortcuts for the sequences used in the following examples:

>>> nied = element1.model.sequences.inputs.nied
>>> nkor = element1.model.sequences.fluxes.nkor

(3) We prepare a NetCDFFile object and log the Nied sequence:

>>> from hydpy.core.netcdftools import NetCDFFile
>>> ncfile = NetCDFFile(
...     "model", flatten=False, isolate=False, timeaxis=1, dirpath="")
>>> ncfile.log(nied, nied.series)

We store the NetCDF file directly into the testing directory:

>>> from hydpy import pub, TestIO
>>> with TestIO():
...     init = pub.timegrids.init
...     ncfile.write(timeunit=init.firstdate.to_cfunits("hours"),
...                  timepoints=init.to_timepoints("hours"))

(5) We set the time series values of the test sequence to zero, log the sequence to a new NetCDFFile instance, read the data from the NetCDF file, and check that test sequence nied in fact contains the read data:

>>> nied.series = 0.0
>>> ncfile = NetCDFFile(
...     "model", flatten=True, isolate=False, timeaxis=1, dirpath="")
>>> ncfile.log(nied, nied.series)
>>> with TestIO():
...     ncfile.read()
>>> nied.series
InfoArray([ 0.,  1.,  2.,  3.])

(6) We show that IO errors and trying to access variables we have not logged so far should result in clear error messages:

>>> ncfile.log(nkor, nkor.series)
>>> with TestIO():
...     ncfile.read()
Traceback (most recent call last):
...
OSError: While trying to read data from NetCDF file `model.nc`, the following error occurred: NetCDF file `model.nc` does not contain variable `flux_nkor`.

>>> "flux_nkor" in dir(ncfile)
True
>>> ncfile.flux_nkor.name
'flux_nkor'
>>> 'state_bowa' in dir(ncfile)
False
>>> ncfile.state_bowa
Traceback (most recent call last):
...
AttributeError: The NetCDFFile object `model` does neither handle a NetCDF variable named `state_bowa` nor does it define a member named `state_bowa`.

log(sequence, infoarray) → None [source]¶

Pass the given IOSequence to a suitable instance of a NetCDFVariableBase subclass.

When writing data, the second argument should be an InfoArray. When reading data, this argument is ignored. Simply pass None.

(1) We prepare some devices handling some sequences by applying function prepare_io_example_1(). We limit our attention to the returned elements, which handle the more diverse sequences:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, (element1, element2, element3) = prepare_io_example_1()

(2) We define some shortcuts for the sequences used in the following examples:

>>> nied1 = element1.model.sequences.inputs.nied
>>> nied2 = element2.model.sequences.inputs.nied
>>> nkor2 = element2.model.sequences.fluxes.nkor
>>> nkor3 = element3.model.sequences.fluxes.nkor

(3) We define a function that logs these example sequences to a given NetCDFFile object and prints some information about the resulting object structure. Note that sequence nkor2 is logged twice, the first time with its original time series data, the second time with averaged values:

>>> from hydpy import classname
>>> def test(ncfile):
...     ncfile.log(nied1, nied1.series)
...     ncfile.log(nied2, nied2.series)
...     ncfile.log(nkor2, nkor2.series)
...     ncfile.log(nkor2, nkor2.average_series())
...     ncfile.log(nkor3, nkor3.average_series())
...     for name, variable in ncfile.variables.items():
...         print(name, classname(variable), variable.subdevicenames)

(4) We prepare a NetCDFFile object with both options flatten and isolate being disabled:

>>> from hydpy.core.netcdftools import NetCDFFile
>>> ncfile = NetCDFFile(
...     "model", flatten=False, isolate=False, timeaxis=1, dirpath="")

(5) We log all test sequences results in two NetCDFVariableDeep and one NetCDFVariableAgg objects. To keep both NetCDF variables related to NKor distinguishable, the name flux_nkor_mean includes information about the kind of aggregation performed:

>>> test(ncfile)
input_nied NetCDFVariableDeep ('element1', 'element2')
flux_nkor NetCDFVariableDeep ('element2',)
flux_nkor_mean NetCDFVariableAgg ('element2', 'element3')

(6) We confirm that the NetCDFVariableBase objects received the required information:

>>> ncfile.flux_nkor.element2.sequence.descr_device
'element2'
>>> ncfile.flux_nkor.element2.array
InfoArray([[ 16.,  17.],
           [ 18.,  19.],
           [ 20.,  21.],
           [ 22.,  23.]])
>>> ncfile.flux_nkor_mean.element2.sequence.descr_device
'element2'
>>> ncfile.flux_nkor_mean.element2.array
InfoArray([ 16.5,  18.5,  20.5,  22.5])

(7) We again prepare a NetCDFFile object, but now with both options flatten and isolate being enabled. To log test sequences with their original time series data does now trigger the initialisation of class NetCDFVariableFlat. When passing aggregated data, nothing changes:

>>> ncfile = NetCDFFile(
...     "model", flatten=True, isolate=True, timeaxis=1, dirpath="")
>>> test(ncfile)
input_nied NetCDFVariableFlat ('element1', 'element2')
flux_nkor NetCDFVariableFlat ('element2_0', 'element2_1')
flux_nkor_mean NetCDFVariableAgg ('element2', 'element3')
>>> ncfile.flux_nkor.element2.sequence.descr_device
'element2'
>>> ncfile.flux_nkor.element2.array
InfoArray([[ 16.,  17.],
           [ 18.,  19.],
           [ 20.,  21.],
           [ 22.,  23.]])
>>> ncfile.flux_nkor_mean.element2.sequence.descr_device
'element2'
>>> ncfile.flux_nkor_mean.element2.array
InfoArray([ 16.5,  18.5,  20.5,  22.5])

(8) We technically confirm that the isolate argument is passed to the constructor of subclasses of NetCDFVariableBase correctly:

>>> from unittest.mock import patch
>>> with patch("hydpy.core.netcdftools.NetCDFVariableFlat") as mock:
...     ncfile = NetCDFFile(
...         "model", flatten=True, isolate=False, timeaxis=0,
...         dirpath="")
...     ncfile.log(nied1, nied1.series)
...     mock.assert_called_once_with(
...         name="input_nied", timeaxis=0, isolate=False)

property filepath¶: The NetCDF file path.

read() → None [source]¶: Open an existing NetCDF file temporarily and call method read() of all handled NetCDFVariableBase objects.

write(timeunit, timepoints) → None [source]¶: Open a new NetCDF file temporarily and call method write() of all handled NetCDFVariableBase objects.

property variablenames¶: The names of all handled IOSequence objects.

class hydpy.core.netcdftools.Subdevice2Index(dict_, name_sequence, name_ncfile)[source]¶

Bases: object

Return type of method query_subdevice2index().

get_index(name_subdevice) → int [source]¶: Item access to the wrapped dict object with a specialized error message.

class hydpy.core.netcdftools.NetCDFVariableBase(name, isolate, timeaxis)[source]¶

Bases: abc.ABC

Base class for NetCDFVariableDeep, NetCDFVariableAgg, and NetCDFVariableFlat.

The initialisation of NetCDFVariableBase subclasses requires the arguments name, isolate, and timeaxis. Only the last one is checked to be valid:

>>> from hydpy.core.netcdftools import NetCDFVariableBase
>>> from hydpy import make_abc_testable
>>> NCVar = make_abc_testable(NetCDFVariableBase)
>>> ncvar = NCVar("flux_nkor", isolate=True, timeaxis=2)
Traceback (most recent call last):
...
ValueError: The argument `timeaxis` must be either `0` (the first axis handles time) or `1` (the second axis handles time), but for variable `flux_nkor` of class NetCDFVariableBase_ the value `2` is given.

log(sequence, infoarray) → None [source]¶

Log the given IOSequence object either for reading or writing data.

The optional array argument allows for passing alternative data in an InfoArray object replacing the series of the IOSequence object, which is useful for writing modified (e.g. spatially averaged) time series.

Logged time series data is available via attribute access:

>>> from hydpy.core.netcdftools import NetCDFVariableBase
>>> from hydpy import make_abc_testable
>>> NCVar = make_abc_testable(NetCDFVariableBase)
>>> ncvar = NCVar("flux_nkor", isolate=True, timeaxis=1)
>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()
>>> nkor = elements.element1.model.sequences.fluxes.nkor
>>> ncvar.log(nkor, nkor.series)
>>> "element1" in dir(ncvar)
True
>>> ncvar.element1.sequence is nkor
True
>>> "element2" in dir(ncvar)
False
>>> ncvar.element2
Traceback (most recent call last):
...
AttributeError: The NetCDFVariable object `flux_nkor` does neither handle time series data under the (sub)device name `element2` nor does it define a member named `element2`.

abstract property subdevicenames¶: A tuple containing the (sub)device names.

abstract property dimensions¶: A tuple containing the dimension names.

abstract property array¶: A ndarray containing the values of all logged sequences.

property prefix¶

A prefix for names of dimensions and associated variables.

“Isolated” variables do not require a prefix:

>>> from hydpy.core.netcdftools import NetCDFVariableBase
>>> from hydpy import make_abc_testable
>>> NetCDFVariableBase_ = make_abc_testable(NetCDFVariableBase)
>>> NetCDFVariableBase_("name", isolate=True, timeaxis=1).prefix
''

When storing different types of sequences in the same NetCDF file, there is a risk of name conflicts. We solve this by using the variables name as a prefix:

>>> NetCDFVariableBase_("name", isolate=False, timeaxis=1).prefix
'name_'

insert_subdevices(ncfile) → None [source]¶

Insert a variable of the names of the (sub)devices of the logged sequences into the given NetCDF file

(1) We prepare a NetCDFVariableBase subclass with fixed (sub)device names:

>>> from hydpy.core.netcdftools import NetCDFVariableBase, chars2str
>>> from hydpy import make_abc_testable, TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> Var = make_abc_testable(NetCDFVariableBase)
>>> Var.subdevicenames = "element1", "element_2"

(2) Without isolating variables, insert_subdevices() prefixes the name of the NetCDFVariableBase object to the name of the inserted variable and its dimensions. The first dimension corresponds to the number of (sub)devices, the second dimension to the number of characters of the longest (sub)device name:

>>> var1 = Var("var1", isolate=False, timeaxis=1)
>>> with TestIO():
...     file1 = netcdf4.Dataset("model1.nc", "w")
>>> var1.insert_subdevices(file1)
>>> file1["var1_station_id"].dimensions
('var1_stations', 'var1_char_leng_name')
>>> file1["var1_station_id"].shape
(2, 9)
>>> chars2str(file1["var1_station_id"][:])
['element1', 'element_2']
>>> file1.close()

When isolating variables, we omit the prefix:

>>> var2 = Var("var2", isolate=True, timeaxis=1)
>>> with TestIO():
...     file2 = netcdf4.Dataset("model2.nc", "w")
>>> var2.insert_subdevices(file2)
>>> file2["station_id"].dimensions
('stations', 'char_leng_name')
>>> file2["station_id"].shape
(2, 9)
>>> chars2str(file2["station_id"][:])
['element1', 'element_2']
>>> file2.close()

query_subdevices(ncfile) → List[str][source]¶

Query the names of the (sub)devices of the logged sequences from the given NetCDF file

(1) We apply function query_subdevices() on an empty NetCDF file. The error message shows that the method tries to query the (sub)device names both under the assumptions that variables have been isolated or not:

>>> from hydpy.core.netcdftools import NetCDFVariableBase
>>> from hydpy import make_abc_testable, TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> with TestIO():
...     ncfile = netcdf4.Dataset("model.nc", "w")
>>> Var = make_abc_testable(NetCDFVariableBase)
>>> Var.subdevicenames = "element1", "element_2"
>>> var = Var("flux_prec", isolate=False, timeaxis=1)
>>> var.query_subdevices(ncfile)
Traceback (most recent call last):
...
OSError: NetCDF file `model.nc` does neither contain a variable named `flux_prec_station_id` nor `station_id` for defining the coordinate locations of variable `flux_prec`.

(2) After inserting the (sub)device name, they can be queried and returned:

>>> var.insert_subdevices(ncfile)
>>> Var("flux_prec", isolate=False, timeaxis=1).query_subdevices(ncfile)
['element1', 'element_2']
>>> Var('flux_prec', isolate=True, timeaxis=1).query_subdevices(ncfile)
['element1', 'element_2']

>>> ncfile.close()

query_subdevice2index(ncfile) → hydpy.core.netcdftools.Subdevice2Index [source]¶

Return a Subdevice2Index that maps the (sub)device names to their position within the given NetCDF file.

Method query_subdevice2index() is based on query_subdevices(). The returned Subdevice2Index object remembers the NetCDF file the (sub)device names stem from, allowing for clear error messages:

>>> from hydpy.core.netcdftools import NetCDFVariableBase, str2chars
>>> from hydpy import make_abc_testable, TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> with TestIO():
...     ncfile = netcdf4.Dataset("model.nc", "w")
>>> Var = make_abc_testable(NetCDFVariableBase)
>>> Var.subdevicenames = ["element3", "element1", "element1_1", "element2"]
>>> var = Var("flux_prec", isolate=True, timeaxis=1)
>>> var.insert_subdevices(ncfile)
>>> subdevice2index = var.query_subdevice2index(ncfile)
>>> subdevice2index.get_index("element1_1")
2
>>> subdevice2index.get_index("element3")
0
>>> subdevice2index.get_index("element5")
Traceback (most recent call last):
...
OSError: No data for sequence `flux_prec` and (sub)device `element5` in NetCDF file `model.nc` available.

Additionally, query_subdevice2index() checks for duplicates:

>>> ncfile["station_id"][:] = str2chars(
...     ["element3", "element1", "element1_1", "element1"])
>>> var.query_subdevice2index(ncfile)
Traceback (most recent call last):
...
OSError: The NetCDF file `model.nc` contains duplicate (sub)device names for variable `flux_prec` (the first found duplicate is `element1`).

>>> ncfile.close()

sort_timeplaceentries(timeentry, placeentry) → Tuple[Any, Any][source]¶

Return a tuple containing the given timeentry and placeentry sorted in agreement with the currently selected timeaxis.

>>> from hydpy.core.netcdftools import NetCDFVariableBase
>>> from hydpy import make_abc_testable
>>> NCVar = make_abc_testable(NetCDFVariableBase)
>>> ncvar = NCVar("flux_nkor", isolate=True, timeaxis=1)
>>> ncvar.sort_timeplaceentries("time", "place")
('place', 'time')
>>> ncvar = NetCDFVariableDeep("test", isolate=False, timeaxis=0)
>>> ncvar.sort_timeplaceentries("time", "place")
('time', 'place')

get_timeplaceslice(placeindex) → Union[Tuple[slice, int], Tuple[int, slice]][source]¶

Return a tuple for indexing a complete time series of a certain location available in array.

>>> from hydpy.core.netcdftools import NetCDFVariableBase
>>> from hydpy import make_abc_testable
>>> NCVar = make_abc_testable(NetCDFVariableBase)
>>> ncvar = NCVar("flux_nkor", isolate=True, timeaxis=1)
>>> ncvar.get_timeplaceslice(2)
(2, slice(None, None, None))
>>> ncvar = NetCDFVariableDeep("test", isolate=False, timeaxis=0)
>>> ncvar.get_timeplaceslice(2)
(slice(None, None, None), 2)

abstract read(ncfile, timegrid_data) → None [source]¶: Read the data from the given NetCDF file.

abstract write(ncfile) → None [source]¶: Write the data to the given NetCDF file.

class hydpy.core.netcdftools.DeepAndAggMixin[source]¶

Bases: object

Mixin class for NetCDFVariableDeep and NetCDFVariableAgg

property subdevicenames¶: A tuple containing the device names.

write(ncfile) → None [source]¶

Write the data to the given NetCDF file.

See the general documentation on classes NetCDFVariableDeep and NetCDFVariableAgg for some examples.

class hydpy.core.netcdftools.AggAndFlatMixin[source]¶

Bases: object

Mixin class for NetCDFVariableAgg and NetCDFVariableFlat.

property dimensions¶

The dimension names of the NetCDF variable.

Usually, the string defined by property descr_sequence prefixes the first dimension name related to the location, which allows storing different sequences types in one NetCDF file:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()
>>> from hydpy.core.netcdftools import NetCDFVariableAgg
>>> ncvar = NetCDFVariableAgg("flux_nkor", isolate=False, timeaxis=1)
>>> ncvar.log(elements.element1.model.sequences.fluxes.nkor, None)
>>> ncvar.dimensions
('flux_nkor_stations', 'time')

But when isolating variables into separate NetCDF files, the variable specific suffix is omitted:

>>> ncvar = NetCDFVariableAgg("flux_nkor", isolate=True, timeaxis=1)
>>> ncvar.log(elements.element1.model.sequences.fluxes.nkor, None)
>>> ncvar.dimensions
('stations', 'time')

When using the first axis as the “timeaxis”, the order of the dimension names turns:

>>> ncvar = NetCDFVariableAgg("flux_nkor", isolate=True, timeaxis=0)
>>> ncvar.log(elements.element1.model.sequences.fluxes.nkor, None)
>>> ncvar.dimensions
('time', 'stations')

class hydpy.core.netcdftools.NetCDFVariableDeep(name, isolate, timeaxis)[source]¶

Bases: hydpy.core.netcdftools.DeepAndAggMixin, hydpy.core.netcdftools.NetCDFVariableBase

Relates some objects of a specific IOSequence subclass with a single NetCDF variable without modifying dimensionality.

Suitable both for reading and writing time series of sequences of arbitrary dimensionality; performs no flattening.

(1) We prepare some devices handling some sequences by applying function prepare_io_example_1(). We limit our attention to the returned elements, which handle the more diverse sequences:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, (element1, element2, element3) = prepare_io_example_1()

(2) We define two NetCDFVariableDeep instances with different array structures and log the Nied and NKor sequences of the first two elements:

>>> from hydpy.core.netcdftools import NetCDFVariableDeep
>>> var_nied = NetCDFVariableDeep("input_nied", isolate=False, timeaxis=1)
>>> var_nkor = NetCDFVariableDeep("flux_nkor", isolate=False, timeaxis=0)
>>> for element in (element1, element2):
...     seqs = element.model.sequences
...     var_nied.log(seqs.inputs.nied, seqs.inputs.nied.series)
...     var_nkor.log(seqs.fluxes.nkor, seqs.fluxes.nkor.series)

(3) We prepare a (nearly) empty NetCDF file. “Nearly”, because all sequences have to be related to the same period, which is why usually a central instance of class NetCDFFile prepares and passes time information:

>>> from hydpy import TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> with TestIO():
...     ncfile = netcdf4.Dataset("model.nc", "w")
>>> from hydpy.core.netcdftools import create_dimension
>>> create_dimension(ncfile, "time", 4)

We store the data of all logged sequences in the NetCDF file:

>>> var_nied.write(ncfile)
>>> var_nkor.write(ncfile)
>>> ncfile.close()

(5) We set all values of the series of both selected sequences to -777 and check that they are in fact different from the original values available via attribute testarray:

>>> seq1 = element1.model.sequences.inputs.nied
>>> seq2 = element2.model.sequences.fluxes.nkor
>>> import numpy
>>> for seq in (seq1, seq2):
...     seq.series = -777.0
...     print(numpy.any(seq.series == seq.testarray))
False
False

(6) We again prepare two NetCDFVariableDeep instances and log the same sequences as above, open the existing NetCDF file for reading, read its data, and confirm that this data has been passed to both test sequences properly:

>>> nied1 = NetCDFVariableDeep("input_nied", isolate=False, timeaxis=1)
>>> nkor1 = NetCDFVariableDeep("flux_nkor", isolate=False, timeaxis=0)
>>> for element in (element1, element2):
...     sequences = element.model.sequences
...     nied1.log(sequences.inputs.nied, None)
...     nkor1.log(sequences.fluxes.nkor, None)
>>> with TestIO():
...     ncfile = netcdf4.Dataset("model.nc", "r")
>>> from hydpy import pub
>>> nied1.read(ncfile, pub.timegrids.init)
>>> nkor1.read(ncfile, pub.timegrids.init)
>>> for seq in (seq1, seq2):
...     print(numpy.all(seq.series == seq.testarray))
True
True

(6) We confirm that trying to read data that has not been stored properly results in error messages like the following one:

>>> nied1.log(element3.model.sequences.inputs.nied, None)
>>> nied1.read(ncfile, pub.timegrids.init)
Traceback (most recent call last):
...
OSError: No data for sequence `input_nied` and (sub)device `element3` in NetCDF file `model.nc` available.

>>> ncfile.close()

(7) We repeat the first few steps, but pass True to the constructor of NetCDFVariableDeep to indicate that we want to write each type of sequence into a separate NetCDF file. Nevertheless, we try to store two different types of sequences into the same NetCDF file, which works for the first sequence (Nied) but not for the second one (NKor):

>>> var_nied = NetCDFVariableDeep("input_nied", isolate=True, timeaxis=1)
>>> var_nkor = NetCDFVariableDeep("flux_nkor", isolate=True, timeaxis=0)
>>> for element in (element1, element2):
...     seqs = element.model.sequences
...     var_nied.log(seqs.inputs.nied, seqs.inputs.nied.series)
...     var_nkor.log(seqs.fluxes.nkor, seqs.fluxes.nkor.series)
>>> with TestIO():
...     ncfile = netcdf4.Dataset("model.nc", "w")
>>> create_dimension(ncfile, "time", 4)
>>> var_nied.write(ncfile)
>>> try:
...     var_nkor.write(ncfile)
... except BaseException as exc:
...     print(exc)   
While trying to add dimension `stations` with length `2` to the NetCDF file `model.nc`, the following error occurred: ...
>>> ncfile.close()
>>> from hydpy import TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> with TestIO():
...     ncfile = netcdf4.Dataset("model.nc", "r")
>>> seq1.series = 0.0
>>> var_nied.read(ncfile, pub.timegrids.init)
>>> seq1.series
InfoArray([ 0.,  1.,  2.,  3.])
>>> ncfile.close()

get_slices(idx, shape) → Tuple[IntOrSlice, …][source]¶

Return a tuple of one int and some slice objects to accesses all values of a certain device within array.

>>> from hydpy.core.netcdftools import NetCDFVariableDeep
>>> ncvar = NetCDFVariableDeep("test", isolate=False, timeaxis=1)
>>> ncvar.get_slices(2, [3])
(2, slice(None, None, None), slice(0, 3, None))
>>> ncvar.get_slices(4, (1, 2))
(4, slice(None, None, None), slice(0, 1, None), slice(0, 2, None))
>>> ncvar = NetCDFVariableDeep("test", isolate=False, timeaxis=0)
>>> ncvar.get_slices(4, (1, 2))
(slice(None, None, None), 4, slice(0, 1, None), slice(0, 2, None))

property shape¶

Required shape of array.

For the default configuration, the first axis corresponds to the number of devices, and the second one to the number of timesteps. We show this for the 0-dimensional input sequence Nied:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()
>>> from hydpy.core.netcdftools import NetCDFVariableDeep
>>> ncvar = NetCDFVariableDeep("input_nied", isolate=False, timeaxis=1)
>>> for element in elements:
...     ncvar.log(element.model.sequences.inputs.nied, None)
>>> ncvar.shape
(3, 4)

For higher dimensional sequences, each new entry corresponds to the maximum number of fields the respective sequences require. In the next example, we select the 1-dimensional sequence NKor. The maximum number 3 (last value of the returned tuple) is due to the third element defining three hydrological response units:

>>> ncvar = NetCDFVariableDeep("flux_nkor", isolate=False, timeaxis=1)
>>> for element in elements:
...     ncvar.log(element.model.sequences.fluxes.nkor, None)
>>> ncvar.shape
(3, 4, 3)

When using the first axis for time (timeaxis=0) the order of the first two tuple entries turns:

>>> ncvar = NetCDFVariableDeep("flux_nkor", isolate=False, timeaxis=0)
>>> for element in elements:
...     ncvar.log(element.model.sequences.fluxes.nkor, None)
>>> ncvar.shape
(4, 3, 3)

property array¶

The series data of all logged IOSequence objects contained in one single ndarray.

The documentation on shape explains how array is structured. The first example confirms that, for the default configuration, the first axis definces the location, while the second one defines time:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()
>>> from hydpy.core.netcdftools import NetCDFVariableDeep
>>> ncvar = NetCDFVariableDeep("input_nied", isolate=False, timeaxis=1)
>>> for element in elements:
...     nied1 = element.model.sequences.inputs.nied
...     ncvar.log(nied1, nied1.series)
>>> ncvar.array
array([[  0.,   1.,   2.,   3.],
       [  4.,   5.,   6.,   7.],
       [  8.,   9.,  10.,  11.]])

For higher dimensional sequences, array can contain missing values. Such missing values show up for some fiels of the second example element, which defines only two hydrological response units instead of three:

>>> ncvar = NetCDFVariableDeep("flux_nkor", isolate=False, timeaxis=1)
>>> for element in elements:
...     nkor1 = element.model.sequences.fluxes.nkor
...     ncvar.log(nkor1, nkor1.series)
>>> ncvar.array[1]
array([[ 16.,  17.,  nan],
       [ 18.,  19.,  nan],
       [ 20.,  21.,  nan],
       [ 22.,  23.,  nan]])

When using the first axis for time (timeaxis=0) the same data can be accessed with slightly different indexing:

>>> ncvar = NetCDFVariableDeep("flux_nkor", isolate=False, timeaxis=0)
>>> for element in elements:
...     nkor1 = element.model.sequences.fluxes.nkor
...     ncvar.log(nkor1, nkor1.series)
>>> ncvar.array[:, 1]
array([[ 16.,  17.,  nan],
       [ 18.,  19.,  nan],
       [ 20.,  21.,  nan],
       [ 22.,  23.,  nan]])

property dimensions¶

The dimension names of the NetCDF variable.

Usually, the string defined by property descr_sequence prefixes all dimension names except the second one related to time, which allows storing different sequences in one NetCDF file:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()
>>> from hydpy.core.netcdftools import NetCDFVariableDeep
>>> ncvar = NetCDFVariableDeep("flux_nkor", isolate=False, timeaxis=1)
>>> ncvar.log(elements.element1.model.sequences.fluxes.nkor, None)
>>> ncvar.dimensions
('flux_nkor_stations', 'time', 'flux_nkor_axis3')

However, when isolating variables into separate NetCDF files, the sequence-specific suffix is omitted:

>>> ncvar = NetCDFVariableDeep("flux_nkor", isolate=True, timeaxis=1)
>>> ncvar.log(elements.element1.model.sequences.fluxes.nkor, None)
>>> ncvar.dimensions
('stations', 'time', 'axis3')

When using the first axis as the “timeaxis”, the order of the first two dimension names turns:

>>> ncvar = NetCDFVariableDeep("flux_nkor", isolate=True, timeaxis=0)
>>> ncvar.log(elements.element1.model.sequences.fluxes.nkor, None)
>>> ncvar.dimensions
('time', 'stations', 'axis3')

read(ncfile, timegrid_data) → None [source]¶

Read the data from the given NetCDF file.

The argument timegrid_data defines the data period of the given NetCDF file.

See the general documentation on class NetCDFVariableDeep for some examples.

class hydpy.core.netcdftools.NetCDFVariableAgg(name, isolate, timeaxis)[source]¶

Bases: hydpy.core.netcdftools.DeepAndAggMixin, hydpy.core.netcdftools.AggAndFlatMixin, hydpy.core.netcdftools.NetCDFVariableBase

Relates some objects of a specific IOSequence subclass with a single NetCDF variable when aggregation of data is required.

Suitable for writing time series data only; performs no flattening.

Essentially, class NetCDFVariableAgg is very similar to class NetCDFVariableDeep but a little bit simpler, as it handles arrays with fixed dimensionality and provides no functionality for reading data from NetCDF files. Hence, the following examples are a selection of the of the more thoroughly explained examples of the documentation on class NetCDFVariableDeep:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, (element1, element2, element3) = prepare_io_example_1()
>>> from hydpy.core.netcdftools import NetCDFVariableAgg
>>> var_nied = NetCDFVariableAgg("input_nied_mean", isolate=False, timeaxis=1)
>>> var_nkor = NetCDFVariableAgg("flux_nkor_mean", isolate=False, timeaxis=0)
>>> for element in (element1, element2):
...     nied1 = element.model.sequences.inputs.nied
...     var_nied.log(nied1, nied1.average_series())
...     nkor1 = element.model.sequences.fluxes.nkor
...     var_nkor.log(nkor1, nkor1.average_series())
>>> from hydpy import TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> with TestIO():
...     ncfile = netcdf4.Dataset("model.nc", "w")
>>> from hydpy.core.netcdftools import create_dimension
>>> create_dimension(ncfile, "time", 4)
>>> var_nied.write(ncfile)
>>> var_nkor.write(ncfile)
>>> ncfile.close()

As NetCDFVariableAgg provides no reading functionality, we show that the aggregated values are readily available by using the external NetCDF4 library directly. Note the different shapes due to using the second axis for time (timeaxis=1, default) and using the first axis for time (timeaxis=0) for Nied and NKor, respectively:

>>> with TestIO():
...     ncfile = netcdf4.Dataset("model.nc", "r")
>>> import numpy
>>> numpy.array(ncfile["input_nied_mean"][:])
array([[ 0.,  1.,  2.,  3.],
       [ 4.,  5.,  6.,  7.]])

>>> numpy.array(ncfile["flux_nkor_mean"][:])
array([[ 12. ,  16.5],
       [ 13. ,  18.5],
       [ 14. ,  20.5],
       [ 15. ,  22.5]])

>>> ncfile.close()

property shape¶

Required shape of array.

For the default configuration, the first axis corresponds to the number of devices, and the second one to the number of timesteps. We show this for the 1-dimensional input sequence NKor:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()
>>> from hydpy.core.netcdftools import NetCDFVariableAgg
>>> ncvar = NetCDFVariableAgg("flux_nkor", isolate=False, timeaxis=1)
>>> for element in elements:
...     ncvar.log(element.model.sequences.fluxes.nkor, None)
>>> ncvar.shape
(3, 4)

When using the first axis as the “timeaxis”, the order of tuple entries turns:

>>> ncvar = NetCDFVariableAgg("flux_nkor", isolate=False, timeaxis=0)
>>> for element in elements:
...     ncvar.log(element.model.sequences.fluxes.nkor, None)
>>> ncvar.shape
(4, 3)

property array¶

The aggregated data of all logged IOSequence objects contained in one single ndarray object.

The documentation on shape explains how array is structured. This first example confirms that, under default configuration (timeaxis=1), the first axis corresponds to the location, while the second one corresponds to time:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()
>>> from hydpy.core.netcdftools import NetCDFVariableAgg
>>> ncvar = NetCDFVariableAgg("flux_nkor", isolate=False, timeaxis=1)
>>> for element in elements:
...     nkor1 = element.model.sequences.fluxes.nkor
...     ncvar.log(nkor1, nkor1.average_series())
>>> ncvar.array
array([[ 12. ,  13. ,  14. ,  15. ],
       [ 16.5,  18.5,  20.5,  22.5],
       [ 25. ,  28. ,  31. ,  34. ]])

When using the first axis as the “timeaxis”, the resulting array is the transposed:

>>> ncvar = NetCDFVariableAgg("flux_nkor", isolate=False, timeaxis=0)
>>> for element in elements:
...     nkor1 = element.model.sequences.fluxes.nkor
...     ncvar.log(nkor1, nkor1.average_series())
>>> ncvar.array
array([[ 12. ,  16.5,  25. ],
       [ 13. ,  18.5,  28. ],
       [ 14. ,  20.5,  31. ],
       [ 15. ,  22.5,  34. ]])

read(ncfile, timegrid_data) → None [source]¶

Raise a RuntimeError in any case.

This method always raises the following exception, to tell users why implementing a real reading functionality is not possible:

>>> from hydpy.core.netcdftools import NetCDFVariableAgg
>>> var_ = NetCDFVariableAgg("flux_nkor", isolate=False, timeaxis=1)
>>> var_.read(None, None)
Traceback (most recent call last):
...
RuntimeError: The process of aggregating values (of sequence `flux_nkor` and other sequences as well) is not invertible.

class hydpy.core.netcdftools.NetCDFVariableFlat(name, isolate, timeaxis)[source]¶

Bases: hydpy.core.netcdftools.AggAndFlatMixin, hydpy.core.netcdftools.NetCDFVariableBase

Relates some objects of a specific IOSequence subclass with a single NetCDF variable when flattening of data is required.

Suitable both for reading and writing time series of sequences of arbitrary dimensionality.

The following examples on the usage of class NetCDFVariableFlat are identical to the ones on the usage of class NetCDFVariableDeep. We repeat the examples for testing purposes but refrain from repeating the explanations. The relevant difference in the NetCDF file structure should become clear when comparing the documentation on the different members of both classes.

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, (element1, element2, element3) = prepare_io_example_1()

>>> from hydpy.core.netcdftools import NetCDFVariableFlat
>>> var_nied = NetCDFVariableFlat("input_nied", isolate=False, timeaxis=1)
>>> var_nkor = NetCDFVariableFlat("flux_nkor", isolate=False, timeaxis=0)
>>> for element in (element1, element2):
...     seqs = element.model.sequences
...     var_nied.log(seqs.inputs.nied, seqs.inputs.nied.series)
...     var_nkor.log(seqs.fluxes.nkor, seqs.fluxes.nkor.series)

>>> from hydpy import TestIO
>>> from hydpy.core.netcdftools import netcdf4
>>> with TestIO():
...     ncfile = netcdf4.Dataset("model.nc", "w")
>>> from hydpy.core.netcdftools import create_dimension
>>> create_dimension(ncfile, "time", 4)

>>> var_nied.write(ncfile)
>>> var_nkor.write(ncfile)
>>> ncfile.close()

>>> seq1 = element1.model.sequences.inputs.nied
>>> seq2 = element2.model.sequences.fluxes.nkor
>>> import numpy
>>> for seq in (seq1, seq2):
...     seq.series = -777.0
...     print(numpy.any(seq.series == seq.testarray))
False
False

>>> nied1 = NetCDFVariableFlat("input_nied", isolate=False, timeaxis=1)
>>> nkor1 = NetCDFVariableFlat("flux_nkor", isolate=False, timeaxis=0)
>>> for element in (element1, element2):
...     sequences = element.model.sequences
...     nied1.log(sequences.inputs.nied, None)
...     nkor1.log(sequences.fluxes.nkor, None)
>>> with TestIO():
...     ncfile = netcdf4.Dataset("model.nc", "r")
>>> from hydpy import pub
>>> nied1.read(ncfile, pub.timegrids.init)
>>> nkor1.read(ncfile, pub.timegrids.init)
>>> for seq in (seq1, seq2):
...     print(numpy.all(seq.series == seq.testarray))
True
True
>>> ncfile.close()

property shape¶

Required shape of array.

For 0-dimensional sequences like Nied and for the default configuration (timeaxis=1), the first axis corresponds to the number of devices, and the second one two the number of timesteps:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()
>>> from hydpy.core.netcdftools import NetCDFVariableFlat
>>> ncvar = NetCDFVariableFlat("input_nied", isolate=False, timeaxis=1)
>>> for element in elements:
...     ncvar.log(element.model.sequences.inputs.nied, None)
>>> ncvar.shape
(3, 4)

For higher dimensional sequences, the first axis corresponds to “subdevices”, e.g. hydrological response units within different elements. The 1-dimensional sequence NKor is logged for three elements with one, two, and three response units respectively, making up a sum of six subdevices:

>>> ncvar = NetCDFVariableFlat("flux_nkor", isolate=False, timeaxis=1)
>>> for element in elements:
...     ncvar.log(element.model.sequences.fluxes.nkor, None)
>>> ncvar.shape
(6, 4)

When using the first axis as the “timeaxis”, the order of tuple entries turns:

>>> ncvar = NetCDFVariableFlat("flux_nkor", isolate=False, timeaxis=0)
>>> for element in elements:
...     ncvar.log(element.model.sequences.fluxes.nkor, None)
>>> ncvar.shape
(4, 6)

property array¶

The series data of all logged IOSequence objects contained in one single ndarray object.

The documentation on shape explains how array is structured. The first example confirms that, under default configuration (timeaxis=1), the first axis corresponds to the location, while the second one corresponds to time:

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()
>>> from hydpy.core.netcdftools import NetCDFVariableFlat
>>> ncvar = NetCDFVariableFlat("input_nied", isolate=False, timeaxis=1)
>>> for element in elements:
...     nied1 = element.model.sequences.inputs.nied
...     ncvar.log(nied1, nied1.series)
>>> ncvar.array
array([[  0.,   1.,   2.,   3.],
       [  4.,   5.,   6.,   7.],
       [  8.,   9.,  10.,  11.]])

Due to the flattening of higher dimensional sequences, their individual time series (e.g. of different hydrological response units) are spread over the rows of the array. For the 1-dimensional sequence NKor, the individual time series of the second element are stored in row two and three:

>>> ncvar = NetCDFVariableFlat("flux_nkor", isolate=False, timeaxis=1)
>>> for element in elements:
...     nkor1 = element.model.sequences.fluxes.nkor
...     ncvar.log(nkor1, nkor1.series)
>>> ncvar.array[1:3]
array([[ 16.,  18.,  20.,  22.],
       [ 17.,  19.,  21.,  23.]])

When using the first axis as the “timeaxis”, the individual time series of the second element are stored in column two and three:

>>> ncvar = NetCDFVariableFlat("flux_nkor", isolate=False, timeaxis=0)
>>> for element in elements:
...     nkor1 = element.model.sequences.fluxes.nkor
...     ncvar.log(nkor1, nkor1.series)
>>> ncvar.array[:, 1:3]
array([[ 16.,  17.],
       [ 18.,  19.],
       [ 20.,  21.],
       [ 22.,  23.]])

property subdevicenames¶

A tuple containing the (sub)device names.

Property subdevicenames clarifies which row of array contains which time series. For 0-dimensional series like Nied, the plain device names are returned

>>> from hydpy.examples import prepare_io_example_1
>>> nodes, elements = prepare_io_example_1()
>>> from hydpy.core.netcdftools import NetCDFVariableFlat
>>> ncvar = NetCDFVariableFlat("input_nied", isolate=False, timeaxis=1)
>>> for element in elements:
...     nied1 = element.model.sequences.inputs.nied
...     ncvar.log(nied1, nied1.series)
>>> ncvar.subdevicenames
('element1', 'element2', 'element3')

For higher dimensional sequences like NKor, an additional suffix defines the index of the respective subdevice. For example contains the third row of array the time series of the first hydrological response unit of the second element:

>>> ncvar = NetCDFVariableFlat("flux_nkor", isolate=False, timeaxis=1)
>>> for element in elements:
...     nkor1 = element.model.sequences.fluxes.nkor
...     ncvar.log(nkor1, nkor1.series)
>>> ncvar.subdevicenames[1:3]
('element2_0', 'element2_1')

read(ncfile, timegrid_data) → None [source]¶

Read the data from the given NetCDF file.

The argument timegrid_data defines the data period of the given NetCDF file.

See the general documentation on class NetCDFVariableFlat for some examples.

write(ncfile) → None [source]¶

Write the data to the given NetCDF file.

See the general documentation on class NetCDFVariableFlat for some examples.

netcdftools¶

Table of Contents

Previous topic

Next topic

This Page

Versions