GOD, a language for good ol' data

Data serialization can be better, without being too much.

{
  name = "Will";
  age = 26;
  married = false;

  favorite-movies = [
    {
      title = "Interstellar";
      starring = [
        "Matthew McConaughey"
        "Jessica Chastain"
        "Anne Hathaway"
      ];
      director = "Christopher Nolan";
      year = 2014;
    }
    {
      title = "Kill Bill: Volume 1";
      director = "Quentin Tarantino";
      starring = [
        { actor = "Uma Thurman"; character = "The Bride"; }
        { actor = "Lucy Liu"; character = "O-Ren Ishii"; }
        { actor = "David Carradine"; character = "Bill"; }
      ];
      year = 2003;
    }
    {
      title = "The Witch";
      director = "Robert Eggers";
      starring = [ "Anya Taylor-Joy" "Ralph Ineson" ];
      year = 2015;
    }
  ];
  
  friends = [
    {
      name = "Floyd";
      age = 29;
      married = true;
      favorite-movies = [
        {
          title = "The Departed";
          starring = [ "Leonardo DiCaprio" "Vera Farmiga" "Matt Damon" ];
          director = "Martin Scorsese";
          year = 2006;
        }
        {
          title = "Shutter Island";
          starring = [ "Leonardo DiCaprio" "Mark Ruffalo" ];
          director = "Martin Scorsese";
          year = 2010;
        }
      ];
      friends = [];
    }
  ];
}

Table of Contents

Why?
Background
- Benefits
Implementations
Specification
- Values
  - Strings
  - Numbers
  - Booleans
  - Null
  - Maps
  - Lists
  - Elements
- Structure
  - Document
  - Operators
    - Termination
  - Identifiers
  - Fields
  - Whitespace
    - Comments

Why

As someone who has often needed to write data serialization formats by hand and work with them programmatically, I wanted a better way. I tried many formats: JSON, YAML, TOML, CSV, XML, KDL, Lua tables, Java properties, and others. You name it, I tried it. Most of them had enough nagging issues to motivate me to look for a better format, but I never found one.

"But JSON works fine"

Have you ever had to write JSON by hand, rather than just having a library parse it? If you haven't, then yes, that's a logical conclusion. Personally, I often need to write data manually, and many of the popular formats add more friction to that experience than they should. For those who need a data serialization format but never (or rarely) deal directly with the data in its storage format, this may seem like nit-picking; it looks different once you find yourself writing these formats by hand.

Background

If you feel that GOD syntax is familiar, that's probably because it is. GOD isn't a new syntax; it is derived directly from the Nix programming language. Any valid GOD code can be validated directly by Nix, with nix eval -f file.god. I saw no need to create a new language when I realized Nix had exactly the bones needed to derive a flexible (and easy to understand) data serialization format. GOD is a subset of Nix which omits its programming syntax and features in favor of static data representation. Some of the benefits include:

- It can be validated by Nix
- GOD can be converted to JSON with Nix
- A number of existing tools for working with Nix code can be used:
  - a tree-sitter grammar
  - linters and formatters such as statix and nixfmt
  - language servers such as nixd, nil and rnix-lsp
  - plugins for Nix syntax built into editors and CLI tools
    - bat: bat file.god -l nix
    - (neo)vim modelines: # vim:ft=nix
  - a very thoroughly written Emacs mode

If you would like to see some sample document files, see the examples directory.

Implementations

Guile Scheme: wreedb/guile-god
Tree-sitter grammar: wreedb/tree-sitter-god

Specification

Values

The value types in GOD are intentionally rudimentary, with the goal of being useful to almost any programming language. They are flexible and have few restrictions.

Strings

A standard or regular string is represented by a pair of double quotes with any amount of text inside it.

greeting = "Hello, how are you?";

You can escape a (double) quote in a regular string by placing a backslash (\) before it. The same applies to line feeds, carriage returns and tab characters (\n, \r, \t).

height = "6'2\"\n";
# 6'2"\n
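The escape rules above can be sketched in a few lines of Python. This is only an illustration, not part of any GOD implementation: the name unescape is hypothetical, and two behaviours are assumptions not stated in this document: that \\ yields a single backslash, and that an unlisted escape simply keeps the character after the backslash.

```python
# Hypothetical helper: decode the body of a regular GOD string (the text
# between the double quotes) using the escapes named above.
# Assumption: "\\" also decodes to a single backslash.
ESCAPES = {'"': '"', "n": "\n", "r": "\r", "t": "\t", "\\": "\\"}


def unescape(body: str) -> str:
    out = []
    chars = iter(body)
    for ch in chars:
        if ch != "\\":
            out.append(ch)
            continue
        nxt = next(chars, None)
        if nxt is None:
            raise ValueError("dangling backslash at end of string")
        # Assumption: an unlisted escape keeps the escaped character as-is.
        out.append(ESCAPES.get(nxt, nxt))
    return "".join(out)
```

Given the body of the height example, this should produce the text 6'2" followed by a real line feed.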

Numbers

These can represent 64-bit signed integers and IEEE 754 floating-point numbers.

{
  # integer
  age = 26;
  age-negative = -26;
  # float
  pi = 3.14159;
  pi-negative = -3.14159;
}

As in Nix, floats are stored in 64 bits (double precision), which is enough for most applications. Any number (float or integer) that Nix cannot represent correctly cannot be represented correctly in GOD either.
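As a rough sketch of how a parser might tell the two number types apart, the Python below classifies a token and enforces the 64-bit integer range. The name parse_number is hypothetical, and the accepted spellings (an optional minus sign, digits, and for floats a dot followed by digits) are an assumption that only covers the forms shown in the example above.

```python
import re

INT64_MIN, INT64_MAX = -(2**63), 2**63 - 1


def parse_number(token: str):
    """Parse a GOD number token into an int or float (hypothetical helper)."""
    if re.fullmatch(r"-?[0-9]+", token):
        value = int(token)
        # Integers must fit in a 64-bit signed integer.
        if not INT64_MIN <= value <= INT64_MAX:
            raise ValueError(f"integer out of 64-bit range: {token}")
        return value
    # Assumption: floats are written as digits, a dot, then digits.
    if re.fullmatch(r"-?[0-9]+\.[0-9]+", token):
        return float(token)
    raise ValueError(f"not a number: {token!r}")
```

Python's float is itself a 64-bit IEEE 754 double, so no extra range handling is needed for the float branch in this sketch.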

Booleans

These are represented as the unquoted keywords true and false. They are not reserved keywords, meaning they can be used as identifiers, though this is discouraged for obvious reasons.

{
  happy = true;
  sad = false;

  # discouraged, but technically valid
  false = true;
}

Null

This can correspond to a language's null value, or represent the absence of a value in languages which do not have a null type. In some languages it might instead be represented by a false boolean value, an empty string (""), zero (0) or an empty list ('(), as in Lisp-style languages). However, this is an implementation-specific detail.

{
  name = "Will";
  age = 26;
  friends = [
    {
      name = "Floyd";
      age = 29;
      friends = null;
    }
    {
      name = "Alice";
      age = 29;
      friends = [ { name = "Jada"; age = null; friends = null; } ];
    }
  ];
}

Maps

A data structure known by many names in different languages: Lua tables, Python dictionaries, Perl hashes, JavaScript objects, Lisp and Scheme association lists. What they have in common is an identifier that is assigned a group of fields.

{
  self = {
    name = "Will";
    age = 26;
    married = false;
    favorite-songs = [
      { artist = "Slint"; title = "Nosferatu Man"; }
      { artist = "OutKast"; title = "Hey Ya!"; }
    ];
    best-friend = {
      name = "Floyd";
      age = 29;
      married = true;
      favorite-songs = [
        { artist = "Tool"; title = "Lateralus"; }
        { artist = "Deafheaven"; title = "Dream House"; }
      ];
    };
  };
}

Some languages allow an identifier to be used more than once in their form of a map, with the last occurrence determining its value. This is not valid in Nix and, by extension, not valid here.

{
  self = {
    name = "Will";
    age = 26;
    # This is an ERROR
    age = 25;
  };
}
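An implementation has to reject the duplicate rather than silently keep the last value. A minimal sketch of that check in Python, assuming the parser has already produced a sequence of (identifier, value) pairs for one map; the name build_map and that intermediate representation are hypothetical:

```python
def build_map(pairs):
    """Build a dict from (identifier, value) pairs, rejecting duplicates."""
    result = {}
    for identifier, value in pairs:
        if identifier in result:
            # A repeated identifier is an error, not an override.
            raise ValueError(f"duplicate identifier: {identifier!r}")
        result[identifier] = value
    return result
```

Calling build_map([("name", "Will"), ("age", 26), ("age", 25)]) raises ValueError, mirroring the error in the example above.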

Lists

These are groups of elements. A list may be the value of a field, or be nested within another list as one of its elements. Lists are not strict about their elements' types, meaning they can contain any number of valid values of any type. The contained elements are separated by any combination of whitespace.

{
  favorite-foods = [
    "Tacos"
    "Pasta"
    "Sandwiches"
  ];

  favorite-numbers = [ 1 2 3 ];

  favorite-lists = [
    [ 1 2 3 ]
    [ "four" "five" "six" ]
    [ true false null ]
  ];

  interesting-list = [
    "Hello!"
    1984
    false
    [ 1998 2025 ]
    {
      name = "map";
      message = "I'm inside a list!";
      more = ''
      So I still adhere to the normal
          field termination rules!
      '';
      my-list = [
        "Hi!"
        true
        {
          name = "another-map";
          message = [ { text = "The nesting knows no limit!"; } 10 false ];
        }
        150
      ];
    }
    null
  ];
}

Elements

These are values within lists. They are not associated with an identifier, only implicitly with their index in the list. Elements can be strings, numbers, booleans, null, maps or lists. Elements may NOT be fields. However, maps contained within lists have fields by definition, and those fields still adhere to the normal rules of field termination within their scope.

Structure

Document

Similar to formats like JSON, the top (outermost) level of a GOD file is a pair of opening and closing "curly" braces { }, which we will call the document. Within it, fields are allowed in any order, with any valid values at any depth. Semantically, the document is equivalent to a map-type element.

Operators

Termination operator

Every field must end with a ; (semicolon), which terminates its scope.

map assignments: map = { };
simple assignments: name = "Will";
list assignments: list = [];

NOTE: The document is not a field, and therefore has no field terminator.

Identifiers

Non-quoted string values denoting the name or identity of a field.
These are the rules for identifiers:

Identifiers MAY NOT:

contain the following symbols
- . % $ @ ! ^ & * " ` ~ + = , ? < > \ / ( ) [ ] { } ;
begin with a single quote character (')

Identifiers MAY:

contain and be suffixed by (non-paired) single quote characters
contain and be suffixed by hyphens and underscores

{
  # containing hyphens/underscores
  abc-123 = "fa so la ti do";
  abc_123 = null;
      
  # suffixed by hyphens/underscores
  abc-123- = "fa so la ti do";
  abc_123_ = null;
  
  # impractical; just for demonstrating capability
  a'b'c'1'2'3 = "do re mi";
  a_-_b-'_'-c'1_2-'3' = { crazy = true; };
}
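The rules above translate fairly directly into a validity check. The Python below is a sketch that tests only the rules listed in this section; the name is_valid_identifier is hypothetical, and rejecting whitespace and # is an added assumption (they separate tokens and start comments). Nix itself may be stricter, for example about the first character, so validating with Nix remains the authoritative check.

```python
# The symbols listed under "Identifiers MAY NOT".
FORBIDDEN = set('.%$@!^&*"`~+=,?<>\\/()[]{};')


def is_valid_identifier(name: str) -> bool:
    """Check a candidate identifier against the rules listed above."""
    if not name or name.startswith("'"):
        return False
    # Assumption: whitespace and '#' can never appear inside an identifier.
    if any(ch.isspace() or ch == "#" for ch in name):
        return False
    return not any(ch in FORBIDDEN for ch in name)
```

Every identifier in the example above passes this check, while something like 'abc or a.b does not.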

Fields

These consist of an identifier and its assigned value. Anything that isn't a document, element, operator or whitespace is a field.

name = "Will";
hobbies = [ "programming" "movies" "music" ];
physical = {
  age = 26;
  height = "6'2\"";
};

In this example, the following are all fields, each one comprising the field's identifier and assigned value. Note that the semicolon field terminators are deliberately omitted from this list: they are not part of the field itself; they are an operator.

name = "Will"
hobbies = [ "programming" "movies" "music" ]
physical = { age = 26; height = "6'2\""; }
age = 26
height = "6'2\""

| Identifier | Assigned value |
| --- | --- |
| name | `"Will"` |
| hobbies | `[ "programming" "movies" "music" ]` |
| physical | `{ age = 26; height = "6'2\""; }` |
| age | `26` |
| height | `"6'2\""` |

Whitespace

All of the following are considered "whitespace" in a GOD file:

space characters \x20
tab characters \t
line-feed (LF) \n
carriage-return \r
carriage-return line-feed (CR LF) \r\n
line-feed carriage-return (LF CR) \n\r
comments

Comments

In addition to omitting the programming features of Nix, GOD supports only line comments, which begin with #.

# this is a comment
{
  # this is another comment
  # and another one.
  name = "Will"; # over here too
}

The following comments (which are valid in Nix), are NOT valid in GOD:

{
  name = "Will";
  favorite-things = [ "a" "b" "c" /* invalid inline comment */ 1 2 3 ];
  friends = [
    /* invalid
       multiline
       block comment
    */
    { name = "Floyd"; }
  ];
}

When parsing GOD, encountering an octothorpe # (outside of a string) means the remainder of the line is considered whitespace and is ignored.
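A sketch of that rule in Python: walk the text once, track whether we are inside a regular (double-quoted) string, and drop everything from an unquoted # up to the end of the line. The name strip_comments is hypothetical. As simplifying assumptions, the sketch treats only a line feed as the end of a comment and does not handle the two-single-quote multi-line strings that appear in one of the list examples.

```python
def strip_comments(text: str) -> str:
    """Remove '#' line comments that occur outside regular strings."""
    out = []
    in_string = False   # inside a "regular" double-quoted string
    in_comment = False  # between an unquoted '#' and the next line feed
    i = 0
    while i < len(text):
        ch = text[i]
        if in_comment:
            if ch == "\n":
                in_comment = False
                out.append(ch)  # keep the line break itself
        elif in_string:
            out.append(ch)
            if ch == "\\" and i + 1 < len(text):
                # Copy the escaped character so \" does not end the string.
                out.append(text[i + 1])
                i += 1
            elif ch == '"':
                in_string = False
        elif ch == '"':
            in_string = True
            out.append(ch)
        elif ch == "#":
            in_comment = True
        else:
            out.append(ch)
        i += 1
    return "".join(out)
```

A real parser would more likely skip comments in its tokenizer instead of rewriting the text, but the state tracking would look much the same.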

Reasoning

This simply reduces complexity when implementing the language: nothing that block comments offer can't be achieved with line comments, and inline comments are just... plain silly.

License

GOD (the specification) is licensed under the GNU Free Documentation License, version 1.3. Because this is only the specification, the license places no restrictions on implementing it. Any contributions to this document/repository (or derivatives of the specification, not of the language or its implementations) must also be licensed under the FDL 1.3. The FDL 1.3 terms do not cover using or implementing the GOD language; they apply to this repository and document alone.
