Skip to content

Commit 5e98526

Browse files
committed
aggregation file -> aggregation dataset
1 parent c8f474d commit 5e98526

File tree

1 file changed

+11
-11
lines changed

1 file changed

+11
-11
lines changed

ch02.adoc

Lines changed: 11 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -276,14 +276,14 @@ If a group attribute is defined in a parent group, and one of the child group re
276276
[[aggregation-variables, Section 2.8, "Aggregation Variables"]]
277277
=== Aggregation Variables
278278

279-
An __aggregation variable__ is a variable which has been formed by combining (i.e. aggregating) multiple __fragments__ that are generally stored in __fragment datasets__ that are external to the file containing the aggregation variable, i.e. the __aggregation file__.
279+
An __aggregation variable__ is a variable which has been formed by combining (i.e. aggregating) multiple __fragments__ that are generally stored in __fragment datasets__ that are external to the dataset containing the aggregation variable, i.e. the __aggregation dataset__.
280280
A fragment contains data with sufficient metadata for it to be correctly interpreted in the context of the aggregation.
281281
The aggregation variable does not contain any actual data, instead it contains instructions on how to create its __aggregated data__ in memory as an aggregation of the data from each fragment.
282-
The aggregated data is identical to that which would be stored in the file if the variable were encoded in usual (i.e. non-aggregated) manner.
282+
The aggregated data is identical to that which would be stored in the dataset if the variable were encoded in usual (i.e. non-aggregated) manner.
283283

284-
Aggregation provides the utility of being able to view, as a single entity, a dataset that has been partitioned across multiple other datasets, whilst taking up very little extra space on disk, since the aggregation file contains no copies of the data in the fragments.
284+
Aggregation provides the utility of being able to view, as a single entity, a dataset that has been partitioned across multiple other datasets, whilst taking up very little extra space on disk, since the aggregation dataset contains no copies of the data in the fragments.
285285
Fragment datasets may be CF-compliant or have any other format, thereby allowing an aggregation variable to act as a CF-compliant view of non-CF datasets.
286-
Aggregations can facilitate a range of activities such as data analysis, by avoiding the computational expense of deriving the aggregation at the time of analysis; archive curation, by acting as a metadata-rich archive index; and the post-processing of model simulation outputs, by spanning multiple files written at run time that together constitute a more cohesive and useful product.
286+
Aggregations can facilitate a range of activities such as data analysis, by avoiding the computational expense of deriving the aggregation at the time of analysis; archive curation, by acting as a metadata-rich archive index; and the post-processing of model simulation outputs, by spanning multiple datasets written at run time that together constitute a more cohesive and useful product.
287287

288288
An aggregation variable must be a scalar (i.e. it has no dimensions).
289289
It acts as a container for all of the usual attributes that describe a variable, with the addition of two special attributes: one that defines its __aggregated dimensions__ (i.e. the dimensions of the aggregated data, which in turn define the aggregated data shape); and one that provides the instructions on how the aggregated data is to be created.
@@ -308,7 +308,7 @@ The details of how to encode and decode aggregation variables are given in this
308308
If a variable has an **`aggregated_dimensions`** attribute then it must be an aggregation variable.
309309
This attribute records the names of the aggregated dimensions as a blank-separated list, in the order of the dimensions of the aggregated data.
310310
If the aggregated data is scalar then there are no aggregated dimensions and the **`aggregated_dimensions`** attribute must be an empty string.
311-
Any aggregated dimensions must exist as dimensions in the aggregation file.
311+
Any aggregated dimensions must exist as dimensions in the aggregation dataset.
312312

313313
The aggregated dimensions are partitioned by the fragments (in their canonical forms, see <<fragment-interpretation>>), and this partitioning is consistent across all of the fragments, i.e. any two fragments either span the same part of a given aggregated dimension, or else do not overlap along that same dimension.
314314
In addition, each fragment data value provides exactly one aggregated data value, and each aggregated data value comes from exactly one fragment.
@@ -375,7 +375,7 @@ See the <<example-aggregation-variable, Example 2.3>> for an encoding of the agg
375375
====
376376

377377
The array of fragments must be defined by an aggregation variable's **`aggregated_data`** attribute.
378-
This attribute must take a string value comprising blank-separated elements of the form "__feature: variable__", where __feature__ is a case-sensitive keyword that specifies a feature of the array of fragments, and __variable__ is a variable in the aggregation file that provides values for that feature.
378+
This attribute must take a string value comprising blank-separated elements of the form "__feature: variable__", where __feature__ is a case-sensitive keyword that specifies a feature of the array of fragments, and __variable__ is a variable in the aggregation dataset that provides values for that feature.
379379
The order of elements in the **`aggregated_data`** attribute is not significant.
380380

381381
The feature keywords must comprise either all three of `map`, `location`, and `identifier`; or else both of `map` and `unique_value`.
@@ -408,11 +408,11 @@ See <<example-L.6, Example L.6>>.
408408

409409
===== location
410410

411-
The string-valued `location` variable defines the locations (i.e. file names) of fragment datasets.
411+
The string-valued `location` variable defines the locations (i.e. dataset names) of fragment datasets.
412412
Its dimensions are those of the array of fragments; and its data provide a location for each fragment.
413413
A fragment dataset is located with a URI (Uniform Resource Identifier) <<URI>> that must be either an __absolute URI__ (a URI that begins with a scheme component followed by a `:` character, such as `\file:///data/file.nc`, `\https:///remote.host/data/file.nc`, `s3:///remote.host/data/file.nc`, or `locally_meaningful_protocol:///UID`), or else a __relative-path URI reference__ (a URI that is not an absolute URI and which does not begin with a `/` or `#` character, such as `file.nc`, `../file.nc`, or `data/file.nc`).
414-
A relative-path URI reference is taken as being relative to the location of the aggregation file.
415-
If the aggregation file is moved to another location, then a fragment dataset identified by an absolute URI will still be accessible, whereas a fragment dataset identified by a relative-path URI reference will also need be moved to preserve the relative reference.
414+
A relative-path URI reference is taken as being relative to the location of the aggregation dataset.
415+
If the aggregation dataset is moved to another location, then a fragment dataset identified by an absolute URI will still be accessible, whereas a fragment dataset identified by a relative-path URI reference will also need be moved to preserve the relative reference.
416416
Not all fragment dataset locations need be of the same URI type.
417417
See <<example-L.1, Example L.1>> and <<example-L.2, Example L.2>>.
418418

@@ -426,11 +426,11 @@ See <<example-L.1, Example L.1>> and <<example-L.4, Example L.4>>.
426426

427427
===== unique_value
428428

429-
When the data values within each fragment are all identical, the `unique_value` variable allows these unique values to be explicitly stored in the aggregation file, rather than by reference to external fragment datasets via the `location` and `identifier` variables.
429+
When the data values within each fragment are all identical, the `unique_value` variable allows these unique values to be explicitly stored in the aggregation dataset, rather than by reference to external fragment datasets via the `location` and `identifier` variables.
430430
The `unique_value` variable dimensions are those of the array of fragments, and the data provide the unique value for each fragment.
431431
The fragment implied by a unique value has dimensions corresponding to the aggregated dimensions, and the fragment shape is defined by the `map` variable.
432432
When a fragment contains wholly missing data, its unique value is specified as any missing value defined by the aggregation variable.
433-
See <<example-L.5, Example L.5>>, which uses an ancillary aggregation variable to make global attributes from the fragment datasets available in the aggregation file.
433+
See <<example-L.5, Example L.5>>, which uses an ancillary aggregation variable to make global attributes from the fragment datasets available in the aggregation dataset.
434434

435435
// Turn section numbering back on
436436
:numbered:

0 commit comments

Comments
 (0)