---
no_site_title: true
title: "Preserves: an Expressive Data Language"
---

Tony Garnock-Jones <tonyg@leastfixedpoint.com>  
{{ site.version_date }}. Version {{ site.version }}.

{% include what-is-preserves.md %}

This document defines the core semantics and data model of Preserves and
presents a handful of examples. Two other core documents define

 - a [human-readable text syntax](preserves-text.html), and
 - a [machine-oriented binary syntax](preserves-binary.html)

for the Preserves data model.

## <a id="semantics"></a><a id="starting-with-semantics"></a>Values

Preserves *values* are given meaning independent of their syntax. We
will write "`Value`" when we mean the set of all Preserves values or an
element of that set.

`Value`s fall into two broad categories: *atomic* and *compound*
data. Every `Value` is finite and non-cyclic. Embedded values, called
`Embedded`s, are a third, special-case category.

{% include value-grammar.md %}

**Total order.**<a name="total-order"></a> As we go, we will
incrementally specify a total order over `Value`s. Two values of the
same kind are compared using kind-specific rules. The ordering among
values of different kinds is essentially arbitrary, but having a total
order is convenient for many tasks, so we define it as
follows:

            (Values)        Atom < Compound < Embedded

            (Compounds)     Record < Sequence < Set < Dictionary

            (Atoms)         Boolean < Float < Double < SignedInteger
                              < String < ByteString < Symbol

**Equivalence.**<a name="equivalence"></a> Two `Value`s are equal if
neither is less than the other according to the total order.
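
As a rough illustration of the kind ordering and of equivalence derived from it, the following Python sketch ranks kinds by their position in the ordering above. The names `KIND_ORDER`, `compare_kinds` and `equivalent` are hypothetical helpers introduced only for this example; they are not part of Preserves or of any implementation.

```python
# Kind names come from the ordering above; the flat list and the helper
# names are illustrative assumptions, not part of the specification.
KIND_ORDER = [
    "Boolean", "Float", "Double", "SignedInteger",
    "String", "ByteString", "Symbol",          # Atoms
    "Record", "Sequence", "Set", "Dictionary",  # Compounds
    "Embedded",
]
KIND_RANK = {kind: rank for rank, kind in enumerate(KIND_ORDER)}


def compare_kinds(a, b):
    """Return -1, 0 or 1 as kind `a` sorts before, with, or after kind `b`."""
    return (KIND_RANK[a] > KIND_RANK[b]) - (KIND_RANK[a] < KIND_RANK[b])


def equivalent(x, y, less_than):
    """Two values are equal when neither is less than the other."""
    return not less_than(x, y) and not less_than(y, x)
```

Values of the same kind would then be compared with the kind-specific rules given in the sections that follow.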

### Signed integers.

A `SignedInteger` is an arbitrarily-large signed integer.
`SignedInteger`s are compared as mathematical integers.

### Unicode strings.

A `String` is a sequence of [Unicode
scalar value](http://www.unicode.org/glossary/#unicode_scalar_value)s.[^nul-permitted]
`String`s are compared lexicographically, scalar value by
scalar value.[^utf8-is-awesome]

  [^utf8-is-awesome]: Happily, the design of UTF-8 is such that this
    gives the same result as a lexicographic byte-by-byte comparison
    of the UTF-8 encoding of a string!

  [^nul-permitted]: All Unicode scalar values are permitted, including NUL
    (scalar value zero). Because scalar values are defined as code points
    *excluding* surrogate code points
    (D800<sub>16</sub>–DFFF<sub>16</sub>), surrogates are *not* permitted
    in Preserves Unicode data.
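
The agreement between scalar-value order and UTF-8 byte order can be checked with a small Python sketch. The sample strings and function names are chosen only for illustration. The UTF-16 key is included as a contrast, since comparing UTF-16 code units does not, in general, give the same order.

```python
def scalar_order_key(s):
    """Sort key: the string's sequence of Unicode scalar values."""
    return [ord(ch) for ch in s]


def utf8_order_key(s):
    """Sort key: the bytes of the string's UTF-8 encoding."""
    return s.encode("utf-8")


def utf16_order_key(s):
    """Sort key: big-endian UTF-16, which sorts like UTF-16 code units."""
    return s.encode("utf-16-be")


# Illustrative samples: ASCII, a BMP character, U+FF5E, and U+1D11E
# (which needs a surrogate pair in UTF-16).
samples = ["z", "水", "\U0001D11E", "\uFF5E", "", "za"]

by_scalar = sorted(samples, key=scalar_order_key)
by_utf8 = sorted(samples, key=utf8_order_key)
by_utf16 = sorted(samples, key=utf16_order_key)

print(by_scalar == by_utf8)   # the two orders agree on these samples
print(by_scalar == by_utf16)  # UTF-16 places U+1D11E before U+FF5E
```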

### Binary data.

A `ByteString` is a sequence of octets. `ByteString`s are compared
lexicographically.

### Symbols.

Programming languages like Lisp and Prolog frequently use string-like
values called *symbols*.[^even-java-has-quasi-symbols] Here, a `Symbol`
is, like a `String`, a sequence of Unicode scalar values representing an
identifier of some kind. `Symbol`s are also compared lexicographically
by scalar value.

[^even-java-has-quasi-symbols]: Even Java has quasi-symbols in the form
    of its "interned strings". A Java Preserves implementation might
    intern Preserves `Symbol`s while leaving Preserves `String`s
    uninterned.

### Booleans.

There are two `Boolean`s, “false” and “true”. The “false” value is
less than the “true” value.

### IEEE floating-point values.

`Float`s and `Double`s are single- and double-precision IEEE 754
floating-point values, respectively. `Float`s, `Double`s and
`SignedInteger`s are disjoint; by the rules [above](#total-order), every
`Float` is less than every `Double`, and every `SignedInteger` is
greater than both. Two `Float`s or two `Double`s are to be ordered by
the `totalOrder` predicate defined in section 5.10 of [IEEE Std
754-2008](https://dx.doi.org/10.1109/IEEESTD.2008.4610935).
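
One common way to obtain an ordering that agrees with `totalOrder` is to reinterpret the bit pattern of the floating-point value as an integer sort key. The Python sketch below does this for `Double`s. It is an illustration under that assumption, not a technique prescribed by this document, and the function names are hypothetical. The same idea would apply to `Float`s with 32-bit patterns.

```python
import struct

SIGN_BIT = 1 << 63
ALL_BITS = (1 << 64) - 1


def total_order_key_from_bits(bits):
    """Map a 64-bit IEEE 754 pattern to an integer that sorts like totalOrder."""
    if bits & SIGN_BIT:
        # Negative values: flip every bit, so larger magnitudes sort earlier.
        return bits ^ ALL_BITS
    # Non-negative values: set the top bit, so they sort above all negatives.
    return bits | SIGN_BIT


def double_bits(x):
    """Return the 64-bit big-endian bit pattern of a Python float."""
    return struct.unpack(">Q", struct.pack(">d", x))[0]


def double_total_order_key(x):
    """Sort key for a Python float, following the bit-pattern approach above."""
    return total_order_key_from_bits(double_bits(x))


values = [1.0, float("inf"), -0.0, 0.0, float("-inf"), -1.0]
ordered = sorted(values, key=double_total_order_key)
print([repr(v) for v in ordered])
```

Under this key, negative zero sorts before positive zero, and NaN bit patterns sort below negative infinity or above positive infinity depending on their sign bit.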

### Records.

A `Record` is a *labelled* tuple of `Value`s, the record's *fields*. A
label can be any `Value`, but is usually a `Symbol`.[^extensibility]
[^iri-labels] `Record`s are ordered first by label, then
lexicographically[^lexicographical-sequences] by field sequence.

  [^extensibility]: The [Racket](https://racket-lang.org/) programming
    language defines
    “[prefab](http://docs.racket-lang.org/guide/define-struct.html#(part._prefab-struct))”
    structure types, which map well to our `Record`s. Racket supports
    record extensibility by encoding record supertypes into record
    labels as specially-formatted lists.

  [^iri-labels]: It is occasionally (but seldom) necessary to
    interpret such `Symbol` labels as IRIs. Where a
    label can be read as a relative IRI, it is notionally interpreted
    with respect to the IRI
    `urn:uuid:6bf094a6-20f1-4887-ada7-46834a9b5b34`; where a label can
    be read as an absolute IRI, it stands for that IRI; and otherwise,
    it cannot be read as an IRI at all, and so the label simply stands
    for itself—for its own `Value`.

  [^lexicographical-sequences]: When comparing sequences of values for
    the total order, [lexicographical
    ordering](https://en.wikipedia.org/wiki/Lexicographic_order) is
    used. Elements are drawn pairwise from the two sequences to be
    compared. If one is smaller than the other according to the total
    order, the sequence it was drawn from is the smaller of the
    sequences. If the end of one sequence is reached, while the other
    sequence has elements remaining, the shorter sequence is considered
    smaller. Otherwise, all the elements compared equal and neither was
    longer than the other, so they compare equal. For example,
      - `[#f]` is ordered before `[foo]` because `Boolean` appears before `Symbol` in the kind ordering;
      - `[x]` before `[x y]` because there is no element remaining to compare against `y`;
      - `[a b]` before `[x]` because `a` is smaller than `x`; and
      - `[x y]` before `[x z]` because `y` is ordered before `z` according to the ordering rules for `Symbol`.

### Sequences.

A `Sequence` is a sequence of `Value`s. `Sequence`s are compared
lexicographically.[^lexicographical-sequences]
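
The lexicographic rule used for `Sequence`s (and for the fields of `Record`s) can be sketched in Python as follows. The element comparison is passed in as a parameter; `compare_ints` is a stand-in used only for this example, where a real implementation would use the full total order on `Value`s.

```python
def compare_lexicographic(xs, ys, compare):
    """Return -1, 0 or 1 for sequences xs and ys under lexicographic order."""
    for x, y in zip(xs, ys):
        c = compare(x, y)
        if c != 0:
            # The first differing pair decides the result.
            return c
    # One sequence is a prefix of the other: the shorter one is smaller.
    return (len(xs) > len(ys)) - (len(xs) < len(ys))


def compare_ints(a, b):
    """Stand-in element comparison for this example."""
    return (a > b) - (a < b)


print(compare_lexicographic([1], [1, 2], compare_ints))
print(compare_lexicographic([1, 2], [3], compare_ints))
```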

### Sets.

A `Set` is an unordered finite set of `Value`s. It contains no
duplicate values, following the [equivalence relation](#equivalence)
induced by the total order on `Value`s. Two `Set`s are compared by
sorting their elements ascending using the [total order](#total-order)
and comparing the resulting `Sequence`s.[^lexicographical-sequences]
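
A minimal Python sketch of this rule, assuming the elements are all of one kind whose Preserves ordering coincides with Python's built-in ordering (integers here). The function name is hypothetical.

```python
def compare_sets(a, b):
    """Compare two sets by sorting each ascending, then comparing as sequences."""
    xs, ys = sorted(a), sorted(b)
    # Python compares lists lexicographically, element by element.
    return (xs > ys) - (xs < ys)


print(compare_sets({1, 3}, {2}))
print(compare_sets({1, 2}, {2, 1}))
```

Because elements are sorted first, the order in which a `Set`'s elements happen to be written or stored has no effect on the comparison.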

### Dictionaries.

A `Dictionary` is an unordered finite collection of pairs of `Value`s.
Each pair comprises a *key* and a *value*. Keys in a `Dictionary` are
pairwise distinct. Instances of `Dictionary` are compared by
lexicographic[^lexicographical-sequences] comparison of the sequences
resulting from ordering each `Dictionary`'s pairs in ascending order by
key.
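
The following Python sketch illustrates one reading of this rule, assuming that each pair is compared key first and then value, and that keys and values are of kinds whose Preserves ordering matches Python's built-in ordering (strings and integers here). The function name is hypothetical.

```python
def compare_dictionaries(a, b):
    """Compare two dicts by their key-sorted sequences of (key, value) pairs."""
    xs = sorted(a.items(), key=lambda pair: pair[0])
    ys = sorted(b.items(), key=lambda pair: pair[0])
    # Lists of tuples compare lexicographically; each tuple compares
    # key first, then value (an assumption of this sketch).
    return (xs > ys) - (xs < ys)


print(compare_dictionaries({"a": 1}, {"a": 2}))
print(compare_dictionaries({"a": 1, "b": 2}, {"b": 2, "a": 1}))
```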

### Embeddeds.

An `Embedded` allows inclusion of *domain-specific*, potentially
*stateful* or *located* data into a `Value`.[^embedded-rationale]
`Embedded`s may be used to denote stateful objects, network services,
object capabilities, file descriptors, Unix processes, or other
possibly-stateful things. Because each `Embedded` is a domain-specific
datum, comparison of two `Embedded`s is done according to
domain-specific rules.

  [^embedded-rationale]: **Rationale.** Why include `Embedded`s as a
    special class, distinct from, say, a specially-labelled `Record`?
    First, a `Record` can only hold other `Value`s: in order to embed
    values such as live pointers to Java objects, some means of
    "escaping" from the `Value` data type must be provided. Second,
    `Embedded`s are meant to be able to denote stateful entities, for
    which comparison by address is appropriate; however, we do not
    wish to place restrictions on the *nature* of these entities: if
    we had used `Record`s instead of distinct `Embedded`s, users would
    have to invent an encoding of domain data into `Record`s that
    reflected domain ordering into `Value` ordering. This is often
    difficult and may not always be possible. Finally, because
    `Embedded`s are intended to be able to represent network and
    memory *locations*, they must be able to be rewritten at network
    and process boundaries. Having a distinct class allows generic
    `Embedded` rewriting without the quotation-related complications
    of encoding references as, say, `Record`s.

*Motivating Examples.* In a Java or Python implementation, an `Embedded` may
denote a reference to a Java or Python object; comparison would be
done via the language's own rules for equivalence and ordering. In a
Unix application, an `Embedded` may denote an open file descriptor or
a process ID. In an HTTP-based application, each `Embedded` might be a
URL, compared according to
[RFC 6943](https://tools.ietf.org/html/rfc6943#section-3.3). When a
`Value` is serialized for storage or transfer, `Embedded`s will
usually be represented as ordinary `Value`s, in which case the
ordinary rules for comparing `Value`s will apply.

## <a id="examples"></a>Appendix. Examples

The definitions above are independent of any particular concrete syntax.
The examples of `Value`s that follow are written using [the Preserves
text syntax](preserves-text.html), and the example encoded byte
sequences use [the Preserves binary encoding](preserves-binary.html).

### Ordering.

The total ordering specified [above](#total-order) means that the following statements are true:

 - `"bzz"` &lt; `"c"` &lt; `"caa"` &lt; `#!"a"`
 - `#t` &lt; `3.0f` &lt; `3.0` &lt; `3` &lt; `"3"` &lt; `|3|` &lt; `[]` &lt; `#!#t`
 - `[#f]` &lt; `[foo]`, because `Boolean` appears before `Symbol` in the kind ordering
 - `[x]` &lt; `[x y]`, because there is no element remaining to compare against `y`
 - `[a b]` &lt; `[x]`, because `a` is smaller than `x`
 - `[x y]` &lt; `[x z]`, because `y` is ordered before `z`

### Simple examples.

<!-- TODO: Give some examples of large and small Preserves, perhaps -->
<!-- translated from various JSON blobs floating around the internet. -->

| Value                                               | Encoded byte sequence                                                           |
|-----------------------------------------------------|---------------------------------------------------------------------------------|
| `<capture <discard>>`                               | B4 B3 07 'c' 'a' 'p' 't' 'u' 'r' 'e' B4 B3 07 'd' 'i' 's' 'c' 'a' 'r' 'd' 84 84 |
| `[1 2 3 4]`                                         | B5 B0 01 01 B0 01 02 B0 01 03 B0 01 04 84                                       |
| `[-2 -1 0 1]`                                       | B5 B0 01 FE B0 01 FF B0 00 B0 01 01 84                                          |
| `"hello"`                                           | B1 05 'h' 'e' 'l' 'l' 'o'                                                       |
| `"z水𝄞"`                                            | B1 08 'z' E6 B0 B4 F0 9D 84 9E                                                  |
| `"z水\uD834\uDD1E"`                                 | B1 08 'z' E6 B0 B4 F0 9D 84 9E                                                  |
| `["a" b #"c" [] #{} #t #f]`                         | B5 B1 01 'a' B3 01 'b' B2 01 'c' B5 84 B6 84 81 80 84                           |
| `-257`                                              | B0 02 FE FF                                                                     |
| `-1`                                                | B0 01 FF                                                                        |
| `0`                                                 | B0 00                                                                           |
| `1`                                                 | B0 01 01                                                                        |
| `255`                                               | B0 02 00 FF                                                                     |
| `1.0f`                                              | 87 04 3F 80 00 00                                                               |
| `1.0`                                               | 87 08 3F F0 00 00 00 00 00 00                                                   |
| `-1.202e300`                                        | 87 08 FE 3C B7 B7 59 BF 04 26                                                   |
| `#xf"7f800000"`, positive `Float` infinity          | 87 04 7F 80 00 00                                                               |
| `#xd"fff0000000000000"`, negative `Double` infinity | 87 08 FF F0 00 00 00 00 00 00                                                   |

The next example uses a non-`Symbol` label for a record.[^extensibility2] The `Record`

    <[titled person 2 thing 1] 101 "Blackwell" <date 1821 2 3> "Dr">

encodes to

    B4                                # Record
      B5                                # Sequence
        B3 06 74 69 74 6C 65 64           # Symbol, "titled"
        B3 06 70 65 72 73 6F 6E           # Symbol, "person"
        B0 01 02                          # SignedInteger, "2"
        B3 05 74 68 69 6E 67              # Symbol, "thing"
        B0 01 01                          # SignedInteger, "1"
      84                                # End (sequence)
      B0 01 65                          # SignedInteger, "101"
      B1 09 42 6C 61 63 6B 77 65 6C 6C  # String, "Blackwell"
      B4                                # Record
        B3 04 64 61 74 65                 # Symbol, "date"
        B0 02 07 1D                       # SignedInteger, "1821"
        B0 01 02                          # SignedInteger, "2"
        B0 01 03                          # SignedInteger, "3"
      84                                # End (record)
      B1 02 44 72                       # String, "Dr"
    84                                # End (record)

  [^extensibility2]: It happens to line up with Racket's
    representation of a record label for an inheritance hierarchy
    where `titled` extends `person` extends `thing`:

        (struct date (year month day) #:prefab)
        (struct thing (id) #:prefab)
        (struct person thing (name date-of-birth) #:prefab)
        (struct titled person (title) #:prefab)

    For more detail on Racket's representations of record labels, see
    [the Racket documentation for `make-prefab-struct`](http://docs.racket-lang.org/reference/structutils.html#%28def._%28%28quote._~23~25kernel%29._make-prefab-struct%29%29).

### JSON examples.

Preserves text syntax is a superset of JSON,[^json-string-caveat] so the
examples from [RFC 8259](https://tools.ietf.org/html/rfc8259#section-13)
read as valid Preserves.

  [^json-string-caveat]: There is one caveat to be aware of. [Section 8.2
    of RFC 8259](https://tools.ietf.org/html/rfc8259#section-8.2)
    explicitly permits unpaired [surrogate code
    point](https://unicode.org/glossary/#surrogate_code_point)s in JSON
    texts without specifying an interpretation for them. Preserves mandates
    UTF-8 in its binary syntax, forbids unpaired surrogates in its text
    syntax, and disallows surrogate code points in `String`s and `Symbol`s,
    meaning that any valid JSON text including an unpaired surrogate will
    not be parseable using the Preserves text syntax rules.

The JSON literals `true`, `false` and `null` all read as `Symbol`s, and
JSON numbers read (unambiguously) either as `SignedInteger`s or as
`Double`s.[^json-superset]

  [^json-superset]: The following [schema](./preserves-schema.html)
    definitions match exactly the JSON subset of a Preserves input:

        version 1 .
        JSON = @string string / @integer int / @double double / @boolean JSONBoolean / @null =null
             / @array [JSON ...] / @object { string: JSON ...:... } .
        JSONBoolean = =true / =false .

The first RFC 8259 example:

    {
      "Image": {
          "Width":  800,
          "Height": 600,
          "Title":  "View from 15th Floor",
          "Thumbnail": {
              "Url":    "http://www.example.com/image/481989943",
              "Height": 125,
              "Width":  100
          },
          "Animated" : false,
          "IDs": [116, 943, 234, 38793]
        }
    }

when read using the Preserves text syntax encodes via the binary syntax
as follows:

    B7
      B1 05 "Image"
      B7
        B1 03 "IDs"      B5
                           B0 01 74
                           B0 02 03 AF
                           B0 02 00 EA
                           B0 03 00 97 89
                         84
        B1 05 "Title"    B1 14 "View from 15th Floor"
        B1 05 "Width"    B0 02 03 20
        B1 06 "Height"   B0 02 02 58
        B1 08 "Animated" B3 05 "false"
        B1 09 "Thumbnail"
          B7
            B1 03 "Url"    B1 26 "http://www.example.com/image/481989943"
            B1 05 "Width"  B0 01 64
            B1 06 "Height" B0 01 7D
          84
      84
    84

The second RFC 8259 example:

    [
      {
         "precision": "zip",
         "Latitude":  37.7668,
         "Longitude": -122.3959,
         "Address":   "",
         "City":      "SAN FRANCISCO",
         "State":     "CA",
         "Zip":       "94107",
         "Country":   "US"
      },
      {
         "precision": "zip",
         "Latitude":  37.371991,
         "Longitude": -122.026020,
         "Address":   "",
         "City":      "SUNNYVALE",
         "State":     "CA",
         "Zip":       "94085",
         "Country":   "US"
      }
    ]

encodes to binary as follows:

    B5
      B7
        B1 03 "Zip"        B1 05 "94107"
        B1 04 "City"       B1 0D "SAN FRANCISCO"
        B1 05 "State"      B1 02 "CA"
        B1 07 "Address"    B1 00
        B1 07 "Country"    B1 02 "US"
        B1 08 "Latitude"   87 08 40 42 E2 26 80 9D 49 52
        B1 09 "Longitude"  87 08 C0 5E 99 56 6C F4 1F 21
        B1 09 "precision"  B1 03 "zip"
      84
      B7
        B1 03 "Zip"        B1 05 "94085"
        B1 04 "City"       B1 09 "SUNNYVALE"
        B1 05 "State"      B1 02 "CA"
        B1 07 "Address"    B1 00
        B1 07 "Country"    B1 02 "US"
        B1 08 "Latitude"   87 08 40 42 AF 9D 66 AD B4 03
        B1 09 "Longitude"  87 08 C0 5E 81 AA 4F CA 42 AF
        B1 09 "precision"  B1 03 "zip"
      84
    84

## Appendix. Merging Values

The *merge* of two `Value`s is a combination of the two values that includes all information
from each that is missing from the other. If the values are incompatible, they have no merge.

 - The merge of two `Atom`s is undefined if they are not [equal](#equivalence); otherwise, it
   is (an arbitrary) one of the two atoms.

 - The merge of two `Embedded`s depends on the interpretation of the embedded values, and so is
   implementation-defined.

 - The merge of two `Compound`s is determined as follows:

   - If both are `Sequence`s, let `n` be the minimum of the lengths of the two sequences. If
     the merge of every pair of corresponding elements in the first `n` positions is defined,
     the result is those merged elements followed by the remaining elements of the longer
     sequence, copied unchanged; otherwise, the merge is undefined.

   - If both are `Record`s, the result is the `Record` whose label is the merge of the two
     input labels and whose fields are the merge of the two input field sequences. If either
     of those merges is undefined, so is the result.

   - If both are `Dictionary`s, and the merge of the values at every key common to both
     inputs is defined, the result has the merged values at the common keys, with entries
     for keys unique to one side copied unchanged; otherwise, the merge is undefined.

 - Otherwise (for example, for two `Set`s, or for values of different kinds), the merge is
   undefined.

**Examples.**

 - `merge [1, [2], 3] [1, [2, 99], 3, 4, 5] = [1, [2, 99], 3, 4, 5]`
 - `merge [1, 2, 3] [1, 5, 3] = ⊥`
 - `merge #{a, b, c} #{a, b, c} = ⊥`
 - `merge {a: 1, b: [2]} {b: [2, 99] c: 3} = {a: 1, b: [2, 99], c: 3}`
 - `merge {a: 1, b: [2]} {a: 5, b: [2]} = ⊥`
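
The rules and examples above can be sketched in Python. This is an illustration under several assumptions rather than a reference implementation: `Sequence`s are modelled as lists, `Dictionary`s as dicts, `Set`s as frozensets, `Record`s as a hypothetical `Record` dataclass, and atoms as plain Python values compared with `==` after a type check (which only approximates Preserves equivalence for floating-point values). `Embedded`s are left out, since their merge is implementation-defined.

```python
from dataclasses import dataclass


class MergeUndefined(Exception):
    """Raised when two values have no merge (written ⊥ in the examples)."""


@dataclass(frozen=True)
class Record:
    label: object
    fields: tuple


COMPOUND_TYPES = (Record, list, dict, frozenset)


def merge(a, b):
    if isinstance(a, Record) and isinstance(b, Record):
        fields = merge(list(a.fields), list(b.fields))
        return Record(merge(a.label, b.label), tuple(fields))
    if isinstance(a, list) and isinstance(b, list):
        n = min(len(a), len(b))
        merged = [merge(x, y) for x, y in zip(a, b)]
        longer = a if len(a) > len(b) else b
        return merged + list(longer[n:])  # copy the tail of the longer input
    if isinstance(a, dict) and isinstance(b, dict):
        out = dict(a)
        for key, value in b.items():
            out[key] = merge(a[key], value) if key in a else value
        return out
    if isinstance(a, COMPOUND_TYPES) or isinstance(b, COMPOUND_TYPES):
        raise MergeUndefined()  # sets, or compounds of differing kinds
    if type(a) is type(b) and a == b:
        return a  # equal atoms merge to (either) one of them
    raise MergeUndefined()


def merge_or_none(a, b):
    """Convenience wrapper for this example: None stands for ⊥."""
    try:
        return merge(a, b)
    except MergeUndefined:
        return None


print(merge_or_none([1, [2], 3], [1, [2, 99], 3, 4, 5]))
print(merge_or_none({"a": 1, "b": [2]}, {"b": [2, 99], "c": 3}))
print(merge_or_none([1, 2, 3], [1, 5, 3]))
```

In the usage lines, Python strings stand in for the `Symbol` keys `a`, `b` and `c` of the examples.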

<!-- Heading to visually offset the footnotes from the main document: -->
## Notes
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
+								---
-												Proper layouting

											
										
										
											2019-08-18 21:08:55 +00:00
+								no_site_title: true
 								title: "Preserves: an Expressive Data Language"
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
+								---
-												Trim and improve

											
										
										
											2018-09-24 11:59:22 +00:00
+								Tony Garnock-Jones <tonyg@leastfixedpoint.com>
-												Split up spec!

											
										
										
											2022-06-18 17:11:08 +00:00
+								{{ site.version_date }}. Version {{ site.version }}.
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
-												Python preserves doctest runner, and mkdocs documentation stubs

											
										
										
											2023-03-16 16:51:19 +00:00
+								{% include what-is-preserves.md %}
-												Trim and improve

											
										
										
											2018-09-24 11:59:22 +00:00
-												Split up spec!

											
										
										
											2022-06-18 17:11:08 +00:00
+								This document defines the core semantics and data model of Preserves and
 								presents a handful of examples. Two other core documents define
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
-												Split up spec!

											
										
										
											2022-06-18 17:11:08 +00:00
+								 - a [human-readable text syntax](preserves-text.html), and
 								 - a [machine-oriented binary syntax](preserves-binary.html)
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
-												Split up spec!

											
										
										
											2022-06-18 17:11:08 +00:00
+								for the Preserves data model.
 								## <a id="semantics"></a><a id="starting-with-semantics"></a>Values
 								Preserves *values* are given meaning independent of their syntax. We
 								will write "`Value`" when we mean the set of all Preserves values or an
 								element of that set.
 								`Value`s fall into two broad categories: *atomic* and *compound*
-												The Great Renaming: Pointer -> Embedded

											
										
										
											2021-05-17 12:54:06 +00:00
+								data. Every `Value` is finite and non-cyclic. Embedded values, called
 								`Embedded`s, are a third, special-case category.
-												Minor print layout tweaks, and minor content fixes

											
										
										
											2018-09-24 15:08:48 +00:00
-												Python preserves doctest runner, and mkdocs documentation stubs

											
										
										
											2023-03-16 16:51:19 +00:00
+								{% include value-grammar.md %}
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
 								**Total order.**<a name="total-order"></a> As we go, we will
 								incrementally specify a total order over `Value`s. Two values of the
 								same kind are compared using kind-specific rules. The ordering among
 								values of different kinds is essentially arbitrary, but having a total
 								order is convenient for many tasks, so we define it as
-												Remove pointless footnote remark

											
										
										
											2019-10-23 21:58:47 +00:00
+								follows:
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
-												The Great Renaming: Pointer -> Embedded

											
										
										
											2021-05-17 12:54:06 +00:00
+								            (Values)        Atom < Compound < Embedded
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
 								            (Compounds)     Record < Sequence < Set < Dictionary
-												Fixes

											
										
										
											2018-09-23 21:44:43 +00:00
+								            (Atoms)         Boolean < Float < Double < SignedInteger
 								                              < String < ByteString < Symbol
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
 								**Equivalence.**<a name="equivalence"></a> Two `Value`s are equal if
 								neither is less than the other according to the total order.
 								### Signed integers.
-												Split up spec!

											
										
										
											2022-06-18 17:11:08 +00:00
+								A `SignedInteger` is an arbitrarily-large signed integer.
-												WIP from the early hours of this morning, adding textual syntax

											
										
										
											2018-09-27 10:42:55 +00:00
+								`SignedInteger`s are compared as mathematical integers.
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
 								### Unicode strings.
-												Preserves really uses Unicode scalar values, not code points.

											
										
										
											2023-10-13 12:01:21 +00:00
+								A `String` is a sequence of [Unicode
 								scalar value](http://www.unicode.org/glossary/#unicode_scalar_value)s.[^nul-permitted]
 								`String`s are compared lexicographically, scalar value by
 								scalar value.[^utf8-is-awesome]
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
 								  [^utf8-is-awesome]: Happily, the design of UTF-8 is such that this
 								    gives the same result as a lexicographic byte-by-byte comparison
 								    of the UTF-8 encoding of a string!
-												Preserves really uses Unicode scalar values, not code points.

											
										
										
											2023-10-13 12:01:21 +00:00
+								  [^nul-permitted]: All Unicode scalar values are permitted, including NUL
 								    (scalar value zero). Because scalar values are defined as code points
 								    *excluding* surrogate code points
 								    (D800<sub>16</sub>–DFFF<sub>16</sub>), surrogates are *not* permitted
 								    in Preserves Unicode data.
-												Split up spec!

											
										
										
											2022-06-18 17:11:08 +00:00
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
+								### Binary data.
-												Delete misleading, incorrect, or unnecessary text

											
										
										
											2018-11-08 12:35:50 +00:00
+								A `ByteString` is a sequence of octets. `ByteString`s are compared
 								lexicographically.
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
-												Minor print layout tweaks, and minor content fixes

											
										
										
											2018-09-24 15:08:48 +00:00
+								### Symbols.
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
 								Programming languages like Lisp and Prolog frequently use string-like
-												Java quasi-Symbols

											
										
										
											2023-10-30 09:46:11 +00:00
+								values called *symbols*.[^even-java-has-quasi-symbols] Here, a `Symbol`
 								is, like a `String`, a sequence of Unicode scalar values representing an
 								identifier of some kind. `Symbol`s are also compared lexicographically
 								by scalar value.
 								[^even-java-has-quasi-symbols]: Even Java has quasi-symbols in the form
 								    of its "interned strings". A Java Preserves implementation might
 								    intern Preserves `Symbol`s while leaving Preserves `String`s
 								    uninterned.
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
 								### Booleans.
-												Delete misleading, incorrect, or unnecessary text

											
										
										
											2018-11-08 12:35:50 +00:00
+								There are two `Boolean`s, “false” and “true”. The “false” value is
 								less-than the “true” value.
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
 								### IEEE floating-point values.
-												Delete misleading, incorrect, or unnecessary text

											
										
										
											2018-11-08 12:35:50 +00:00
+								`Float`s and `Double`s are single- and double-precision IEEE 754
 								floating-point values, respectively. `Float`s, `Double`s and
-												Split up spec!

											
										
										
											2022-06-18 17:11:08 +00:00
+								`SignedInteger`s are disjoint; by the rules [above](#total-order), every
 								`Float` is less than every `Double`, and every `SignedInteger` is
 								greater than both. Two `Float`s or two `Double`s are to be ordered by
 								the `totalOrder` predicate defined in section 5.10 of [IEEE Std
 -2008](https://dx.doi.org/10.1109/IEEESTD.2008.4610935).
-												preserve.md based on codec.md which I'm about to check in

											
										
										
											2018-09-23 13:37:20 +00:00
 								### Records.
-												Delete misleading, incorrect, or unnecessary text

											
										
										
											2018-11-08 12:35:50 +00:00
A `Record` is a *labelled* tuple of `Value`s, the record's *fields*. A
label can be any `Value`, but is usually a `Symbol`.[^extensibility]
[^iri-labels] `Record`s are ordered first by label, then
lexicographically[^lexicographical-sequences] by field sequence.

  [^extensibility]: The [Racket](https://racket-lang.org/) programming
    language defines
    “[prefab](http://docs.racket-lang.org/guide/define-struct.html#(part._prefab-struct))”
    structure types, which map well to our `Record`s. Racket supports
    record extensibility by encoding record supertypes into record
    labels as specially-formatted lists.

  [^iri-labels]: It is occasionally necessary to
    interpret such `Symbol` labels as IRIs. Where a
    label can be read as a relative IRI, it is notionally interpreted
    with respect to the IRI
    `urn:uuid:6bf094a6-20f1-4887-ada7-46834a9b5b34`; where a label can
    be read as an absolute IRI, it stands for that IRI; and otherwise,
    it cannot be read as an IRI at all, and so the label simply stands
    for itself, that is, for its own `Value`.

  [^lexicographical-sequences]: When comparing sequences of values for
    the total order, [lexicographical
    ordering](https://en.wikipedia.org/wiki/Lexicographic_order) is
    used. Elements are drawn pairwise from the two sequences to be
    compared. If one element is smaller than the other according to the
    total order, the sequence it was drawn from is the smaller of the
    sequences. If the end of one sequence is reached while the other
    sequence has elements remaining, the shorter sequence is considered
    smaller. Otherwise, all the elements compared equal and neither
    sequence was longer than the other, so they compare equal. For example,

      - `[#f]` is ordered before `[foo]` because `Boolean` appears before `Symbol` in the kind ordering;
      - `[x]` before `[x y]` because there is no element remaining to compare against `y`;
      - `[a b]` before `[x]` because `a` is smaller than `x`; and
      - `[x y]` before `[x z]` because `y` is ordered before `z` according to the ordering rules for `Symbol`.

### Sequences.

A `Sequence` is a sequence of `Value`s. `Sequence`s are compared
lexicographically.[^lexicographical-sequences]

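The lexicographic rule can be sketched in a few lines of Python. This is an illustrative sketch only: the names `compare_lexicographic`, `compare_element` and `compare_int` are hypothetical, and elements are modelled as plain Python integers standing in for `SignedInteger`s.

```python
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def compare_lexicographic(
    left: Sequence[T],
    right: Sequence[T],
    compare_element: Callable[[T, T], int],
) -> int:
    """Return -1, 0 or 1 as `left` is less than, equal to, or greater than `right`."""
    # Draw elements pairwise; the first unequal pair decides the result.
    for a, b in zip(left, right):
        c = compare_element(a, b)
        if c != 0:
            return c
    # One sequence is a prefix of the other (or both are equal):
    # the shorter sequence is the smaller one.
    return (len(left) > len(right)) - (len(left) < len(right))


def compare_int(a: int, b: int) -> int:
    """Element comparison for integers, standing in for the total order."""
    return (a > b) - (a < b)
```

Passing the element comparison as a parameter keeps the sketch independent of how the total order on individual `Value`s is implemented.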
### Sets.

A `Set` is an unordered finite set of `Value`s. It contains no
duplicate values, following the [equivalence relation](#equivalence)
induced by the total order on `Value`s. Two `Set`s are compared by
sorting their elements ascending using the [total order](#total-order)
and comparing the resulting `Sequence`s.[^lexicographical-sequences]

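As a rough illustration of the rule for `Set`s, the following Python sketch sorts both sets and compares the results. It assumes every element is a Python `int`, so that Python's built-in ordering and list comparison coincide with the `SignedInteger` ordering and the lexicographic rule; the name `compare_sets` is hypothetical.

```python
def compare_sets(left: frozenset, right: frozenset) -> int:
    """Compare two sets of integers by sorting each ascending, then
    comparing the sorted lists lexicographically."""
    a, b = sorted(left), sorted(right)
    # Python compares lists lexicographically, with a proper prefix ordered first.
    return (a > b) - (a < b)
```

A general implementation would sort and compare with the full total order on `Value`s instead of relying on Python's ordering of integers.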
### Dictionaries.

A `Dictionary` is an unordered finite collection of pairs of `Value`s.
Each pair comprises a *key* and a *value*. Keys in a `Dictionary` are
pairwise distinct. Instances of `Dictionary` are compared by
lexicographic[^lexicographical-sequences] comparison of the sequences
resulting from ordering each `Dictionary`'s pairs in ascending order by
key.

### Embeddeds.

An `Embedded` allows inclusion of *domain-specific*, potentially
*stateful* or *located* data into a `Value`.[^embedded-rationale]
`Embedded`s may be used to denote stateful objects, network services,
object capabilities, file descriptors, Unix processes, or other
possibly-stateful things. Because each `Embedded` is a domain-specific
datum, comparison of two `Embedded`s is done according to
domain-specific rules.

  [^embedded-rationale]: **Rationale.** Why include `Embedded`s as a
    special class, distinct from, say, a specially-labeled `Record`?
    First, a `Record` can only hold other `Value`s: in order to embed
    values such as live pointers to Java objects, some means of
    "escaping" from the `Value` data type must be provided. Second,
    `Embedded`s are meant to be able to denote stateful entities, for
    which comparison by address is appropriate; however, we do not
    wish to place restrictions on the *nature* of these entities: if
    we had used `Record`s instead of distinct `Embedded`s, users would
    have to invent an encoding of domain data into `Record`s that
    reflected domain ordering into `Value` ordering. This is often
    difficult and may not always be possible. Finally, because
    `Embedded`s are intended to be able to represent network and
    memory *locations*, they must be able to be rewritten at network
    and process boundaries. Having a distinct class allows generic
    `Embedded` rewriting without the quotation-related complications
    of encoding references as, say, `Record`s.

*Motivating Examples.* In a Java or Python implementation, an `Embedded` may
denote a reference to a Java or Python object; comparison would be
done via the language's own rules for equivalence and ordering. In a
Unix application, an `Embedded` may denote an open file descriptor or
a process ID. In an HTTP-based application, each `Embedded` might be a
URL, compared according to
[RFC 6943](https://tools.ietf.org/html/rfc6943#section-3.3). When a
`Value` is serialized for storage or transfer, `Embedded`s will
usually be represented as ordinary `Value`s, in which case the
ordinary rules for comparing `Value`s will apply.

## <a id="examples"></a>Appendix. Examples

The definitions above are independent of any particular concrete syntax.
The examples of `Value`s that follow are written using [the Preserves
text syntax](preserves-text.html), and the example encoded byte
sequences use [the Preserves binary encoding](preserves-binary.html).

### Ordering.

The total ordering specified [above](#total-order) means that the following statements are true:

 - `"bzz"` &lt; `"c"` &lt; `"caa"` &lt; `#!"a"`
 - `#t` &lt; `3.0f` &lt; `3.0` &lt; `3` &lt; `"3"` &lt; `|3|` &lt; `[]` &lt; `#!#t`
 - `[#f]` &lt; `[foo]`, because `Boolean` appears before `Symbol` in the kind ordering
 - `[x]` &lt; `[x y]`, because there is no element remaining to compare against `y`
 - `[a b]` &lt; `[x]`, because `a` is smaller than `x`
 - `[x y]` &lt; `[x z]`, because `y` is ordered before `z`

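The statements above can be checked mechanically with a small comparison function. The Python sketch below is an assumption-laden model rather than a reference implementation: the `Float`, `Symbol` and `Embedded` wrapper classes and the functions `kind_rank` and `compare` are hypothetical names, `Record`, `Set` and `Dictionary` are omitted, `Symbol`s are assumed to order like their names, floating-point values use Python's own comparison, and comparison between two `Embedded`s is left unimplemented because it is domain-specific.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class Float:      # hypothetical wrapper for single-precision values such as 3.0f
    value: float


@dataclass(frozen=True)
class Symbol:     # hypothetical wrapper for symbols such as foo or |3|
    name: str


@dataclass(frozen=True)
class Embedded:   # hypothetical wrapper for embedded values such as #!#t
    target: Any


def kind_rank(v: Any) -> int:
    """Position of v's kind in the total order (only the kinds modelled here)."""
    if isinstance(v, bool):       # test bool before int: bool subclasses int
        return 0
    if isinstance(v, Float):
        return 1
    if isinstance(v, float):      # a bare Python float models a Double
        return 2
    if isinstance(v, int):
        return 3
    if isinstance(v, str):
        return 4
    if isinstance(v, bytes):
        return 5
    if isinstance(v, Symbol):
        return 6
    if isinstance(v, list):       # Sequence; rank 7 is left free for Record
        return 8
    if isinstance(v, Embedded):   # Embedded sorts after every atom and compound
        return 11
    raise TypeError(f"unsupported value: {v!r}")


def sign(a: Any, b: Any) -> int:
    return (a > b) - (a < b)


def compare(a: Any, b: Any) -> int:
    """Return -1, 0 or 1 according to the total order on the modelled kinds."""
    ra, rb = kind_rank(a), kind_rank(b)
    if ra != rb:
        return sign(ra, rb)       # different kinds: the kind ordering decides
    if isinstance(a, list):       # same kind, Sequence: lexicographic comparison
        for x, y in zip(a, b):
            c = compare(x, y)
            if c != 0:
                return c
        return sign(len(a), len(b))
    if isinstance(a, Float):
        return sign(a.value, b.value)
    if isinstance(a, Symbol):
        return sign(a.name, b.name)
    if isinstance(a, Embedded):
        raise NotImplementedError("Embedded comparison is domain-specific")
    return sign(a, b)             # booleans, doubles, integers, strings, bytes
```

For instance, `compare([Symbol("x")], [Symbol("x"), Symbol("y")])` yields `-1`, matching the statement that `[x]` is ordered before `[x y]`.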
### Simple examples.

<!-- TODO: Give some examples of large and small Preserves, perhaps -->
<!-- translated from various JSON blobs floating around the internet. -->

| Value                                               | Encoded byte sequence                                                           |
|-----------------------------------------------------|---------------------------------------------------------------------------------|
| `<capture <discard>>`                               | B4 B3 07 'c' 'a' 'p' 't' 'u' 'r' 'e' B4 B3 07 'd' 'i' 's' 'c' 'a' 'r' 'd' 84 84 |
| `[1 2 3 4]`                                         | B5 B0 01 01 B0 01 02 B0 01 03 B0 01 04 84                                       |
| `[-2 -1 0 1]`                                       | B5 B0 01 FE B0 01 FF B0 00 B0 01 01 84                                          |
| `"hello"`                                           | B1 05 'h' 'e' 'l' 'l' 'o'                                                       |
| `"z水𝄞"`                                            | B1 08 'z' E6 B0 B4 F0 9D 84 9E                                                  |
| `"z水\uD834\uDD1E"`                                 | B1 08 'z' E6 B0 B4 F0 9D 84 9E                                                  |
| `["a" b #"c" [] #{} #t #f]`                         | B5 B1 01 'a' B3 01 'b' B2 01 'c' B5 84 B6 84 81 80 84                           |
| `-257`                                              | B0 02 FE FF                                                                     |
| `-1`                                                | B0 01 FF                                                                        |
| `0`                                                 | B0 00                                                                           |
| `1`                                                 | B0 01 01                                                                        |
| `255`                                               | B0 02 00 FF                                                                     |
| `1.0f`                                              | 87 04 3F 80 00 00                                                               |
| `1.0`                                               | 87 08 3F F0 00 00 00 00 00 00                                                   |
| `-1.202e300`                                        | 87 08 FE 3C B7 B7 59 BF 04 26                                                   |
| `#xf"7f800000"`, positive `Float` infinity          | 87 04 7F 80 00 00                                                               |
| `#xd"fff0000000000000"`, negative `Double` infinity | 87 08 FF F0 00 00 00 00 00 00                                                   |

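Several rows of the table can be reproduced with a short Python sketch. The tag bytes (`B0` for `SignedInteger`, `B1` for `String`, `87` for floating-point values) and the layout are read off the table itself; the [binary syntax](preserves-binary.html) document remains the authoritative definition. The function names are hypothetical, and the sketch assumes a single-byte length, so it only handles payloads shorter than 128 bytes.

```python
import struct


def encode_length(n: int) -> bytes:
    # Assumption: only single-byte lengths (0..127) are handled in this sketch.
    if not 0 <= n < 128:
        raise ValueError("sketch handles lengths below 128 only")
    return bytes([n])


def encode_signed_integer(n: int) -> bytes:
    """Tag B0, a length, then the shortest big-endian two's-complement form."""
    if n == 0:
        payload = b""             # zero has an empty payload, as in the table
    else:
        # Bits of magnitude, plus one sign bit, rounded up to whole bytes.
        magnitude_bits = n.bit_length() if n > 0 else (~n).bit_length()
        payload = n.to_bytes(magnitude_bits // 8 + 1, "big", signed=True)
    return b"\xB0" + encode_length(len(payload)) + payload


def encode_string(s: str) -> bytes:
    """Tag B1, a length in bytes, then the UTF-8 encoding of the string."""
    payload = s.encode("utf-8")
    return b"\xB1" + encode_length(len(payload)) + payload


def encode_float(x: float) -> bytes:
    """Tag 87, length 04, then a big-endian IEEE 754 single."""
    return b"\x87\x04" + struct.pack(">f", x)


def encode_double(x: float) -> bytes:
    """Tag 87, length 08, then a big-endian IEEE 754 double."""
    return b"\x87\x08" + struct.pack(">d", x)
```

Calling `encode_signed_integer(255).hex(" ").upper()` gives `B0 02 00 FF`: the leading `00` byte is needed so that the value is not read back as negative.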
The next example uses a non-`Symbol` label for a record.[^extensibility2] The `Record`

    <[titled person 2 thing 1] 101 "Blackwell" <date 1821 2 3> "Dr">

encodes to

    B4                                # Record
      B5                                # Sequence
        B3 06 74 69 74 6C 65 64           # Symbol, "titled"
        B3 06 70 65 72 73 6F 6E           # Symbol, "person"
        B0 01 02                          # SignedInteger, "2"
        B3 05 74 68 69 6E 67              # Symbol, "thing"
        B0 01 01                          # SignedInteger, "1"
      84                                # End (sequence)
      B0 01 65                          # SignedInteger, "101"
      B1 09 42 6C 61 63 6B 77 65 6C 6C  # String, "Blackwell"
      B4                                # Record
        B3 04 64 61 74 65                 # Symbol, "date"
        B0 02 07 1D                       # SignedInteger, "1821"
        B0 01 02                          # SignedInteger, "2"
        B0 01 03                          # SignedInteger, "3"
      84                                # End (record)
      B1 02 44 72                       # String, "Dr"
    84                                # End (record)

  [^extensibility2]: It happens to line up with Racket's
    representation of a record label for an inheritance hierarchy
    where `titled` extends `person` extends `thing`:

        (struct date (year month day) #:prefab)
        (struct thing (id) #:prefab)
        (struct person thing (name date-of-birth) #:prefab)
        (struct titled person (title) #:prefab)

    For more detail on Racket's representations of record labels, see
    [the Racket documentation for `make-prefab-struct`](http://docs.racket-lang.org/reference/structutils.html#%28def._%28%28quote._~23~25kernel%29._make-prefab-struct%29%29).

### JSON examples.

Preserves text syntax is a superset of JSON,[^json-string-caveat] so the
examples from [RFC 8259](https://tools.ietf.org/html/rfc8259#section-13)
read as valid Preserves.

  [^json-string-caveat]: There is one caveat to be aware of. [Section 8.2
    of RFC 8259](https://tools.ietf.org/html/rfc8259#section-8.2)
    explicitly permits unpaired [surrogate code
    point](https://unicode.org/glossary/#surrogate_code_point)s in JSON
    texts without specifying an interpretation for them. Preserves mandates
    UTF-8 in its binary syntax, forbids unpaired surrogates in its text
    syntax, and disallows surrogate code points in `String`s and `Symbol`s,
    meaning that any valid JSON text including an unpaired surrogate will
    not be parseable using the Preserves text syntax rules.

The JSON literals `true`, `false` and `null` all read as `Symbol`s, and
JSON numbers read (unambiguously) either as `SignedInteger`s or as
`Double`s.[^json-superset]

  [^json-superset]: The following [schema](./preserves-schema.html)
    definitions match exactly the JSON subset of a Preserves input:

        version 1 .
        JSON = @string string / @integer int / @double double / @boolean JSONBoolean / @null =null
             / @array [JSON ...] / @object { string: JSON ...:... } .
        JSONBoolean = =true / =false .

The first RFC 8259 example:

    {
      "Image": {
          "Width":  800,
          "Height": 600,
          "Title":  "View from 15th Floor",
          "Thumbnail": {
              "Url":    "http://www.example.com/image/481989943",
              "Height": 125,
              "Width":  100
          },
          "Animated" : false,
          "IDs": [116, 943, 234, 38793]
        }
    }

when read using the Preserves text syntax encodes via the binary syntax
as follows:

    B7
      B1 05 "Image"
      B7
        B1 03 "IDs"      B5
                           B0 01 74
                           B0 02 03 AF
                           B0 02 00 EA
                           B0 03 00 97 89
                         84
        B1 05 "Title"    B1 14 "View from 15th Floor"
        B1 05 "Width"    B0 02 03 20
        B1 06 "Height"   B0 02 02 58
        B1 08 "Animated" B3 05 "false"
        B1 09 "Thumbnail"
          B7
            B1 03 "Url"    B1 26 "http://www.example.com/image/481989943"
            B1 05 "Width"  B0 01 64
            B1 06 "Height" B0 01 7D
          84
      84
    84

The second RFC 8259 example:

    [
      {
         "precision": "zip",
         "Latitude":  37.7668,
         "Longitude": -122.3959,
         "Address":   "",
         "City":      "SAN FRANCISCO",
         "State":     "CA",
         "Zip":       "94107",
         "Country":   "US"
      },
      {
         "precision": "zip",
         "Latitude":  37.371991,
         "Longitude": -122.026020,
         "Address":   "",
         "City":      "SUNNYVALE",
         "State":     "CA",
         "Zip":       "94085",
         "Country":   "US"
      }
    ]

encodes to binary as follows:

    B5
      B7
        B1 03 "Zip"        B1 05 "94107"
        B1 04 "City"       B1 0D "SAN FRANCISCO"
        B1 05 "State"      B1 02 "CA"
        B1 07 "Address"    B1 00
        B1 07 "Country"    B1 02 "US"
        B1 08 "Latitude"   87 08 40 42 E2 26 80 9D 49 52
        B1 09 "Longitude"  87 08 C0 5E 99 56 6C F4 1F 21
        B1 09 "precision"  B1 03 "zip"
      84
      B7
        B1 03 "Zip"        B1 05 "94085"
        B1 04 "City"       B1 09 "SUNNYVALE"
        B1 05 "State"      B1 02 "CA"
        B1 07 "Address"    B1 00
        B1 07 "Country"    B1 02 "US"
        B1 08 "Latitude"   87 08 40 42 AF 9D 66 AD B4 03
        B1 09 "Longitude"  87 08 C0 5E 81 AA 4F CA 42 AF
        B1 09 "precision"  B1 03 "zip"
      84
    84

## Appendix. Merging Values

The *merge* of two `Value`s is a combination of the two values that includes all information
from each that is missing from the other. If the values are incompatible, they have no merge.

 - the merge of two `Atom`s is undefined if they are not [equal](#equivalence); otherwise, it
   is equal to (an arbitrary) one of the atoms.
 - the merge of two `Embedded`s depends on the interpretation of the embedded values, and so is
   implementation-defined.
 - the merge of two `Compound`s is:
   - if both are `Sequence`s, let `n` be the minimum of the lengths of the two sequences. If
     every merge of corresponding positions up to `n` in the sequences is defined, the result
     is defined, with elements merged up to position `n` and simply copied from the longer
     sequence from position `n` onward; otherwise, it is undefined.
   - if both are `Record`s, the `Record` with the merge of the two input records' labels as its
     label and the merge of the inputs' field sequences as its fields, if both of those merges
     are defined; otherwise, it is undefined.
   - if both are `Dictionary`s, if every merge of values associated with keys common to both
     inputs is defined, the result is defined, with merged values at common keys and simply
     copied from either side for keys unique to that side; otherwise, it is undefined.
 - otherwise, the merge is undefined.

**Examples.**

 - `merge [1, [2], 3] [1, [2, 99], 3, 4, 5] = [1, [2, 99], 3, 4, 5]`
 - `merge [1, 2, 3] [1, 5, 3] = ⊥`
 - `merge #{a, b, c} #{a, b, c} = ⊥`, because no merge rule is given for `Set`s
 - `merge {a: 1, b: [2]} {b: [2, 99] c: 3} = {a: 1, b: [2, 99], c: 3}`
 - `merge {a: 1, b: [2]} {a: 5, b: [2]} = ⊥`
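
The merge rules can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not a reference implementation: `Sequence`s are modelled as Python lists, `Dictionary`s as Python dicts, atoms as Python booleans, numbers, strings and bytes, and symbols in the examples as plain strings. The `Record` class, the `NoMerge` exception (standing for ⊥) and the helper `is_atom` are hypothetical names. `Embedded`s are left out because their merge is implementation-defined.

```python
from dataclasses import dataclass
from typing import Any


class NoMerge(Exception):
    """Raised when two values have no merge (written ⊥ in the examples)."""


@dataclass(frozen=True)
class Record:          # hypothetical model of a Record: a label plus fields
    label: Any
    fields: tuple


def is_atom(v: Any) -> bool:
    return isinstance(v, (bool, int, float, str, bytes))


def merge(a: Any, b: Any) -> Any:
    if isinstance(a, list) and isinstance(b, list):
        # Merge the common prefix pairwise, then copy the tail of the longer list.
        n = min(len(a), len(b))
        merged = [merge(x, y) for x, y in zip(a, b)]
        longer = a if len(a) > len(b) else b
        return merged + list(longer[n:])
    if isinstance(a, Record) and isinstance(b, Record):
        # Merge labels, and merge the field sequences like Sequences.
        return Record(merge(a.label, b.label),
                      tuple(merge(list(a.fields), list(b.fields))))
    if isinstance(a, dict) and isinstance(b, dict):
        # Merge values at common keys; copy entries unique to either side.
        result = dict(a)
        for key, value in b.items():
            result[key] = merge(a[key], value) if key in a else value
        return result
    if is_atom(a) and is_atom(b):
        # Comparing types as well avoids Python treating 1, 1.0 and True as equal.
        if type(a) is type(b) and a == b:
            return a
        raise NoMerge(f"unequal atoms: {a!r} and {b!r}")
    # Sets, mixed kinds and anything else have no merge rule.
    raise NoMerge(f"no merge rule for {a!r} and {b!r}")
```

With this sketch, `merge([1, [2], 3], [1, [2, 99], 3, 4, 5])` returns `[1, [2, 99], 3, 4, 5]`, while `merge([1, 2, 3], [1, 5, 3])` raises `NoMerge`. Raising an exception rather than returning a sentinel lets a failure deep inside a nested value propagate to the top-level call without extra checks.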

<!-- Heading to visually offset the footnotes from the main document: -->
## Notes