This repository has been archived on 2024-04-08. You can view files and clone it, but cannot push or open issues or pull requests.
dagger/docs/introduction/1005-what_is_cue.md
Richard Jones 8c46a1d2f1 initial work on what is cue
Signed-off-by: Richard Jones <richard@dagger.io>
2021-09-21 11:41:13 -07:00

7.2 KiB
Raw Blame History

sidebar_position slug sidebar_label
2 /1005/what-is-cue/ What is Cue?

What is Cue?

CUE is a powerful configuration language created by Marcel van Lohuizen who co-created the Borg Configuration Language (BCL)—the language used to deploy all applications at Google. It is a superset of JSON, with additional features to make declarative, data-driven programming as pleasant and productive as regular imperative programming.

The Need for a Configuration Language

For decades, developers, engineers, and system administrators alike have used some combination of INI, ENV, YAML, XML, and JSON (as well as custom formats such as for Apache, Nginx, etc) to describe configurations, resources, operations, variables, parameters, state, etc. While these technically work, they are merely data formats and not languages.

As a data format they each lack the ability to express logic and operate on data directly. Simple—yet powerful!—things like if statements, for loops, comprehensions, and string interpolation, among others are just not possible in these languages directly—that is to say not without the use of a secondary process. The result is that variables or parameters must be injected, and any logic handled by a templating language (such as Jinja) or a DSL (Domain-specific Language). Often templating languages and DSLs are used in conjuction.

Again, while this technically works, the results are that we end up with code bases, or even single files, that intersperse templating languages with various DSLs, thereby making the code difficult to maintain, challenging to reason about, brittle, and perhaps worst of all prone to side effects.

A configuration language such as CUE, allows us to both specify data, as well as act on that data with any logic necessary to achieve the desired output. Furthermore, and perhaps most importantly, CUE allows us to not only specify data as concrete values, but also the types those concrete values can or must be. It gives us the ability to define a schema but unlike doing so with say JSON Schema, CUE can both define and enforce the schema, whereas JSON Schema is merely a definition that requires some other process to enforce it.

For a deeper dive on the problems with configuration as code, check out The Configuration Complexity Curse and How Cue Wins.

Understanding Cue

We won't attempt to rewrite the CUE documentation or replicate some excellent CUE tutorials, but instead give you enough understanding of CUE to move through the dagger tutorials.

At least for our purposes here you need to understand a few key concepts, each of which will be explained in more detail below:

  1. Superset of JSON
  2. Types are values
  3. Concrete values, no overrides
  4. Definitions and schemas
  5. Disjunctions, Conjunctions, and Unification

It would be most helpful to install CUE, but if you prefer you can also try these examples in the CUE playground.

A Superset of JSON

What you can express in JSON, you can express in CUE, but not everything in CUE can be expressed in JSON. CUE also supports a "lite" version of JSON where certain characters can be eliminated entirely. Take a look at the following code:

{
  "Person": {
    "Name": "Bob Smith"
    "Age": 42
  }
}

Person: Name: "Bob Smith"

In this example we see that in CUE we have declared a top-level key Person twice: once in a more verbose JSON style with brackets and quotes, and again with a "lite" style without the extra characters; notice also that CUE supports short hand: when you are defining a single key within an object, you don't need the curly braces, you can write it as a colon-separated path. Try it in the CUE playground, and notice the output (you can choose different formats). Person is declared twice, but is only output once because CUE is automatically unifying the two declarations of Person. Its ok to declare the same field multiple times, so long as we provide the same value. See Concrete Values, No Overrides below.

Types are Values

In the previous example we defined the Persons name as the string literal "Bob Smith" which is a concrete value (more on this later). But nothing so far enforces a type. Nothing prevents us from setting the Name field to an int, but if this data were to be fed to an API that expects a string we'd have an error. Let's define some types:

Person: {
  Name: string
  Age: int
}

Person: {
  Name: "Bob Smith"
  Age: 42
}

Here weve defined the Persons Name field as a string and the Age field as an int. CUE will enforce these types, so any attempt to provide say an integer for the Name or a string for the Age will result in an error. Its worth noting here that the output from this example is the result of implicit uinification; well talk about explicit unification later. Try it in the CUE playground.

Concrete Values, No Overrides

CUE is ultimately used to export data, and is most useful when that data has been validated against a strong, well-defined schema. In order for CUE to export anything, we must provide concrete values for all defined fields not marked as optional.

In the previous examples we have provided concrete values: "Bob Smith" as a string, and 42 as an int. Were we to leave a required field simply defined as a type without a concrete value, CUE will return an error. Try removing a concrete value from the previous example and note the error you get. CUE will complain of an incomplete value.

CUE also prevents you from specifying conflicting values. In other words, CUE does not allow one declaration to override any other. The following example would result in an error:


Person: {
  Name: "Bob Smith"
  Age: 42
}

Person: {
  Name: "Bob Something Else"
  Age: 42
}

Definitions and Schemas

In a real-world scenario wed likely need to define more than one person, and ensure that each one satisfies the schema. That's where Definitions come in handy.

#Person: {
  Name: string
  Age: int
  Nickname?: string
}

Bob: #Person & {
  Name: "Bob Smith"
  Age: 42
}

In this example weve declared that #Person is a definition, as denoted by the # sign. By doing so we have constrained the Person object to a specific set of fields, each a specific type. Definitions by default are closed meaning that a #Person cannot contain any fields not specified in the definition. You will also notice that Nickname? contains a ? which denotes that this field is optional.

Definitions themselves are not exported to final output. To get concrete output, weve declared that the field Bob is a #Person, and using the single & we unified the #Person definition with an object whose concrete values satisfy the constraints.