Language
Hi, welcome to Cyphernetes.
Let's get you started, you'll be querying the Kubernetes resource graph like a pro in no time!
Node Patterns
In Cyphernetes, we draw patterns of resources using ASCII-art, using parentheses to denote nodes and arrows to denote relationships.
Let's draw a circle using parenthesis:
()
This is called a node. Nodes are the basic building blocks of the Kubernetes resource graph. Nodes are usually not empty. They tend to look more like this:
(p:Pod)
This node contains a variable (in this example it's called p), followed by a colon, followed by a label (in our case, it's Pod).
We assign variable names to nodes so we can refer to them later in the query. Labels are used to specify the node's Kubernetes resource kind.
💡 Variable names are allowed to be mixed-case and have any length. Labels may also be mixed-case. Unlike labels, variable names are case-sensitive, so
(d:Deployment)and(D:Deployment)are not the same.
Now, imagine a flow chart where each node represents a Kubernetes resource, and the edges (arrows) between them represent the relationships between the resources:
(d:Deployment)->(s:Service)
This is Cyphernetes in a nutshell. You draw a pattern of the resources you want to work with, Cyphernetes will then match all instances of this pattern on the current context, and translate it into the appropriate Kubernetes API calls. The arrow -> is used to express a relationship between two nodes.
This pattern will match all Deployments that are exposed by a Service. It will not match Deployments that are not exposed, or Services that do not expose any Deployments.
This is a key feature of Cypher (and Cyphernetes): We act on patterns - and only select resources that exactly match the patterns we draw.
💡 When specifying the label we can use the resource's singular name, plural name or shortname, just like in kubectl. Unlike kubectl, labels in Cyphernetes are case-insensitive, so
(p:Pod),(p:POD),(p:pod),(p:pods),(p:po)etc. are all legal and mean the same. This document adheres to a convention of using minified, lowercase variable names and CamelCase, singular-name labels i.e.(d:Deployment),(rs:ReplicaSet)- however this is completely up to the user.
Reading Resources from the Graph
To query the Kubernetes resource graph, we use MATCH/RETURN expressions.
In the MATCH clause we draw a pattern or patterns of resources.
RETURN is then used to organize the results. It takes a list of comma-separated JSONPaths, and returns the results in a JSON object, allowing us to easily craft a custom payload that only contains the fields we need.
💡 Note that the names of resources are always returned in the special
namefield, even when not specified in theRETURNclause.
// Comments are supported
MATCH (d:Deployment) RETURN d.spec.replicas
This query will match all Deployments in the current context and return their desired replica count (as well as their name):
{
  "d": [
    {
      "name": "nginx",
      "spec": {
        "replicas": 4
      },
    },
    {
      "name": "nginx-internal",
      "spec": {
        "replicas": 2
      },
    }
  ]
}
Let's do one more:
MATCH (d:Deployment)
RETURN d.metadata.labels,
       d.spec.replicas
This query will match all Deployments in the current context, and return a custom payload containing the fields we asked for:
{
  "d": [
    {
      "metadata": {
        "labels": {
          "app": "nginx",
        },
      },
      "name": "nginx",
      "spec": {
        "replicas": 4
      }
    },
    {
      "metadata": {
        "labels": {
          "app": "nginx",
        },
      },
      "name": "nginx-internal",
      "spec": {
        "replicas": 2
      }
    }
  ]
}
The returned payload is a JSON object that contains a key for every variable defined in the RETURN clause.
Each of these keys' value is a list of Kubernetes resources that matched the respective node pattern in the MATCH clause.
💡 Unlike kubectl, Cyphernetes will always return a list (a JSON array), even if only one or zero resources were matched.
The payload will only include the fields requested in the RETURN clause - as well as the name field, which is always present. To see the full resource, we simply return the variable name:
MATCH (d:Deployment)
RETURN d
Context
💡 Some Cyphernetes programs may allow you to change the default namespace or context using command line arguments and UI elements, but this is beyond the scope of this document which is focused on the Cyphernetes query language itself.
By default, Cyphernetes will query the current context (as defined by kubectl config current-context).
If no namespace is specified in the current context, Cyphernetes will default to using the default namespace, similar to kubectl.
Overriding the Default Namespace
You can override the default namespace per node by specifying the namespace property in the node's properties:
MATCH (d:Deployment {namespace: "staging"})->(s:Service)
RETURN d.metadata.name, s.spec.clusterIP
You can use this language feature to query resources across namespaces:
MATCH (d:Deployment {namespace: "staging"}), (d2:Deployment {namespace: "production"})
RETURN d.spec.replicas, d2.spec.replicas
Querying Multiple Clusters
Cyphernetes supports querying multiple clusters using the IN keyword.
IN staging, production
MATCH (d:Deployment {name: "coredns", namespace: "kube-system"})
RETURN d.spec.replicas
Cyphernetes will run the query for each context in the IN clause, and return the results in a single payload.
The results will be prefixed with the context name, followed by an underscore:
{
  "staging_d": [
    {
      "name": "coredns",
      "spec": {
        "replicas": 2
      }
    }
  ],
  "production_d": [
    {
      "name": "coredns",
      "spec": {
        "replicas": 2
      }
    }
  ]
}
Advanced Pattern Matching
Match by Name and Labels
A node may contain an optional set of properties. Node properties let us query the resource by name or by any of it's labels.
MATCH (d:Deployment {name: "nginx-internal", app: "nginx"})
RETURN d.metadata.labels,
       d.spec.template.spec.containers[0].image
(output)
{
  "d": [
    {
      "metadata": {
        "labels": {
          "app": "nginx",
        },
      },
      "name": "nginx-internal",
      "spec": {
        "template": {
          "spec": {
            "containers[0]": {
              "image": "nginx"
            }
          }
        }
      }
    }
  ]
}
Match by Any Field
Using the WHERE clause, we can filter our results by any field in the Kubernetes resource:
MATCH (d:Deployment {app: "nginx", namespace: "default"})
WHERE d.spec.replicas=4
RETURN d.spec.replicas
(output)
{
  "d": [
    {
      "name": "nginx",
      "spec": {
        "replicas": 4
      }
    }
  ]
}
Escaping Dots in JSONPaths
Cyphernetes supports escaping dots in JSONPaths using a backslash. This is useful when querying resources that have dots in their field names.
MATCH (d:Deployment {name: "nginx-internal"})
WHERE d.metadata.annotations.meta\.cyphernet\.es/foo-bar = "baz"
RETURN d.metadata.annotations.meta\.cyphernet\.es/foo-bar
(output)
{
  "d": [
    {
      "name": "nginx-internal",
      "metadata": {
        "annotations": {
          "meta.cyphernet.es/foo-bar": "baz"
        }
      }
    }
  ]
}
WHERE clauses support the following operators:
- =- equal to
- !=- not equal to
- <- less than
- >- greater than
- <=- less than or equal to
- >=- greater than or equal to
- =~- regex matching
- CONTAINS- partial string matching
Examples:
// Get all deployments with more than 2 replicas
MATCH (d:Deployment)
WHERE d.spec.replicas > 2
RETURN d.metadata.name, d.spec.replicas
// Get all pods that are not running
MATCH (p:Pod)
WHERE p.status.phase != "Running"
RETURN p.metadata.name, p.status.phase
// Find all deployments scaled above zero and set their related ingresses' ingressClassName to "active"
MATCH (d:Deployment)->(s:Service)->(i:Ingress)
WHERE d.spec.replicas >= 1
SET i.spec.ingressClassName = "active"
// Find all deployments that end with "api"
MATCH (d:Deployment)
WHERE d.metadata.name =~ "^.*api$"
RETURN d.spec
Matching Multiple Nodes
Use commas to match two or more nodes:
MATCH (d:Deployment), (s:Service)
RETURN d.spec.replicas, s.spec.clusterIP
(output)
{
  "d": [
    {
      "name": "nginx",
      "spec": {
        "replicas": 4
      }
    },
    {
      "name": "nginx-internal",
      "spec": {
        "replicas": 2
      }
    }
  ],
  "s": [
    {
      "name": "nginx",
      "spec": {
        "clusterIP": "10.96.0.1"
      }
    },
    {
      "name": "nginx-internal",
      "spec": {
        "clusterIP": "10.96.0.2"
      }
    }
  ]
}
Relationships
Relationships are the glue that holds the Kubernetes resource graph together. Cyphernetes understands the relationships between Kubernetes resources, and lets us query them in a natural way.
Relationships are expressed using the -> and <- operators:
MATCH (d:Deployment)->(s:Service)
RETURN d.metadata.service, s.metadata.name
This query returns all Services that expose a Deployment, and the name of the Deployment they expose. Only Deployments and Services that have a relationship between them will be returned.
The relationship's direction is unimportant. (d:Deployment)->(s:Service) is the same as (d:Deployment)<-(s:Service).
If you're familiar with Cypher, you might be wondering about relationship properties. At this time, Cyphernetes does not make use of relationship properties - they are, however, legal - and you may use them if you wish for your own documentation purposes. i.e.
(d:Deployment)->[r:SERVICE_EXPOSE_DEPLOYMENT {"service-type": "kubernetes-internal"}]->(s:Service)is legal Cyphernetes syntax, but does not affect the query's outcome. The variableris not defined in this query, and is not available for use in aRETURNclause or otherwise.
Basic Relationship Match
Cyphernetes understands the relationships between Kubernetes resources:
MATCH (d:Deployment {name: "nginx"})->(s:Service)
RETURN s.spec.ports
(output)
{
  "s": [
    {
      "name": "nginx",
      "spec": {
        "ports": [
          {
            "port": 80,
            "protocol": "TCP",
            "targetPort": 80
          }
        ]
      }
    }
  ]
}
Cyphernetes knows how to find related resources using a set of predefined rules. For example, Cyphernetes knows that a Service exposes a Deployment if the two resources have matching selectors.
Similarly, Cyphernetes knows that a Deployment owns a ReplicaSet if the ReplicaSet's metadata.ownerReferences contains a reference to the Deployment.
Relationships with Multiple Nodes
We can match multiple nodes and relationships in a single MATCH clause. This is useful for working with resources that have multiple owners or with custom resources that Cyphernetes doesn't yet understand.
MATCH (vs:VirtualService),
      (d:Deployment {name: "my-app"})->(s:Service)->(i:Ingress)
WHERE vs.metadata.labels.app="my-app"
RETURN i.spec.rules,
       vs.spec.http.paths
(output)
{
  "i": [
    {
      "name": "my-app",
      "spec": {
        "rules": [
          {
            "host": "my-app.example.com",
            "http": {
              "paths": [
                {
                  "backend": {
                    "serviceName": "my-app",
                    "servicePort": 80
                  },
                  "path": "/"
                }
              ]
            }
          }
        ]
      }
    }
  ]
  "vs": [
    {
      "name": "my-app",
      "spec": {
        "http": {
          "paths": [
            {
              "backend": {
                "serviceName": "my-app",
                "servicePort": 80
              },
              "path": "/"
            }
          ]
        }
      }
    }
  ] 
}
Here we match a Deployment, the Service that exposes it, and through the Service also the Ingress that routes to it. We also match the Istio VirtualService that belongs to the same application. Cyphernetes doesn't yet understand Istio, so we fallback to using the app label.
Kindless Nodes
Sometimes you might want to match or operate on resources connected to another resource without knowing their kind in advance. Cyphernetes supports this through "kindless nodes" - nodes where you omit the kind label:
// Find all resources related to a deployment
MATCH (d:Deployment {name: "nginx"})->(x)
RETURN x.kind
This query will find and return all resources that have a relationship with the "nginx" deployment, such as ReplicaSets and Services. Cyphernetes will automatically expand this query to try all possible kinds that can have a relationship with a Deployment.
Some things to consider when using kindless nodes:
- While kindless nodes are a powerful feature, they should be used judiciously. Being explicit about the kinds of resources you're operating on makes queries more predictable and easier to understand.
- Chaining two kindless nodes (e.g.,
MATCH (x)->(y)) is not supported as it would be ambiguous and potentially expensive to resolve. At least one node in a relationship must have a known kind.- Standalone kindless nodes (e.g.,
MATCH (x)) are not supported. Kindless nodes must be part of a relationship.
Anonymous Nodes
Anonymous nodes are nodes without a variable name. They are useful when you want to express a relationship path but don't want to use the intermediate resources in a subsequent RETURN, SET or DELETE clause.
// Find all configmaps that are related to a pod
// Notice that we don't specify a variable name for the Pod
MATCH (cm:ConfigMap)->(:Pod)
RETURN cm.data
For even more flexibility, you can use nodes that are both kindless and anonymous - nodes without both a variable name and kind:
// Find all pods that are two relationships away from a deployment
MATCH (d:Deployment {name: "nginx"})->()->(p:Pod)
RETURN p.metadata.name
Kindless anonymous nodes are useful when you want to express a relationship path but don't care about the intermediate resources.
Mutating the Graph
Cyphernetes supports creating, updating and deleting resources in the graph using the CREATE, SET and DELETE keywords.
Creating Resources
Cyphernetes supports creating resources using the CREATE statement.
Currently, properties of nodes in CREATE clauses must be valid JSON. This is a temporary limitation that will be removed in the future.
CREATE (k:Kind {"k": "v", "k2": "v2", ...})
Creating a Standalone Resource
CREATE (d:Deployment {
  "name": "nginx",
  "metadata": {
    "labels": {
      "app": "nginx"
    }
  },
  "spec": {
    "replicas": 4,
    "selector": {
      "matchLabels": {
        "app": "nginx"
      }
    },
    "template": {
      "metadata": {
        "labels": {
          "app": "nginx"
        }
      },
      "spec": {
        "containers": [
          {"name": "nginx", "image": "nginx"}
        ]
      }
    }
  }
})
Create expressions may optionally be followed by a RETURN clause:
CREATE (d:Deployment {
  "name": "nginx",
  "metadata": {
    "labels": {
      "app": "nginx"
    }
  },
  "spec": {
    "replicas": 4,
    "selector": {
      "matchLabels": {
        "app": "nginx"
      }
    },
    "template": {
      "metadata": {
        "labels": {
          "app": "nginx"
        }
      },
      "spec": {
        "containers": [
          {"name": "nginx", "image": "nginx"}
        ]
      }
    }
  }
}) RETURN d
Creating Resources by Relationship
Relationships can also appear in CREATE clauses. Currently, only two nodes may be connected by a relationship in a CREATE clause.
In CREATE clause relationships, one side of the relationship must contain a node variable that was previously defined in a MATCH clause.
This node does not require a label, as it's label is inferred from the MATCH clause.
The other side of the relationship is the new node being created. Cyphernetes can infer the created resource's name, labels and other fields from the relationship rule defined between the two nodes' resource kinds.
MATCH (d:Deployment {name: "nginx"})
CREATE (d)->(s:Service)
This query is equivalent to
kubectl expose deployment nginx --type=ClusterIP.
Cyphernetes' relationship rules contain a set default values for the created resource's fields. These defaults can be overridden by specifying properties in the CREATE clause. Default relationship fields should usually be enough for creating a resource by relationship without having to specify any properties on the created node.
Patching Resources
Cyphernetes supports patching resources using the SET clause.
The SET clause is similar to the CREATE clause, but instead of creating a new resource, it updates an existing one. SET clauses take a list of comma-separated key-value pairs, where the key is a jsonPath to the field to update, and the value is the new value to set.
SET clauses may only appear after a MATCH clause. They may also be followed by a RETURN clause.
MATCH (d:Deployment {name: "nginx"})
SET d.spec.replicas=4
RETURN d.spec.replicas
Patch by Relationship
Relationships in MATCH clauses may be used to patch resources that are connected to other resources.
MATCH (d:Deployment {name: "nginx"})->(s:Service)
SET s.spec.ports[0].port=8080
Deleting Resources
Deleting resources is done using the DELETE clause. DELETE clauses may only appear after a MATCH clause.
A DELETE clause takes a list of variables from the MATCH clause - All Kubernetes resources matched by those variables will be deleted.
MATCH (d:Deployment {name: "nginx"}) DELETE d
Delete by Relationship
Relationships in MATCH clauses may be used to delete resources that are connected to other resources.
MATCH (d:Deployment {name: "nginx"})->(s:Service)->(i:Ingress)
DELETE s, i
Aggregations
Cyphernetes supports aggregations in the RETURN clause.
Currently, only the COUNT and SUM functions are supported.
MATCH (d:Deployment)->(rs:ReplicaSet)->(p:Pod)
RETURN COUNT{p} AS TotalPods,
       SUM{d.spec.replicas} AS TotalReplicas
{
  ...
  "aggregate": {
    "TotalPods": 10,
    "TotalReplicas": 20
  },
  ...
}
MATCH (d:deployment {name:"auth-service"})->(s:svc)->(p:pod) 
RETURN SUM { p.spec.containers[*].resources.requests.cpu } AS totalCPUReq, 
       SUM {p.spec.containers[*].resources.requests.memory } AS totalMemReq;
{
  ...
  "aggregate": {
    "totalCPUReq": "5",
    "totalMemReq": "336.0Mi"
  },
  ...
}
Result Ordering and Pagination
Cyphernetes supports ordering, limiting, and paginating query results using ORDER BY, LIMIT, SKIP, and OFFSET clauses.
Ordering Results with ORDER BY
You can sort query results using the ORDER BY clause. It supports sorting by any field accessible via JSONPath or by aliases defined with AS.
// Order deployments by name in ascending order (default)
MATCH (d:Deployment)
RETURN d.metadata.name AS name, d.spec.replicas AS replicas
ORDER BY name
// Order deployments by replica count in descending order
MATCH (d:Deployment)
RETURN d.metadata.name AS name, d.spec.replicas AS replicas
ORDER BY replicas DESC
// Order by JSON path directly
MATCH (d:Deployment)
RETURN d.metadata.name, d.spec.replicas
ORDER BY d.metadata.name DESC
You can specify multiple sort fields:
MATCH (d:Deployment)
RETURN d.metadata.name, d.spec.replicas, d.metadata.namespace
ORDER BY d.metadata.namespace ASC, d.spec.replicas DESC
Limiting Results with LIMIT
Use LIMIT to restrict the number of results returned:
// Get the 5 most recent deployments
MATCH (d:Deployment)
RETURN d.metadata.name, d.metadata.creationTimestamp
ORDER BY d.metadata.creationTimestamp DESC
LIMIT 5
Skipping Results with SKIP and OFFSET
Use SKIP or OFFSET to skip a specified number of results. Both keywords are aliases and work identically:
// Skip the first 10 deployments
MATCH (d:Deployment)
RETURN d.metadata.name
ORDER BY d.metadata.name
SKIP 10
// OFFSET is an alias for SKIP
MATCH (d:Deployment)
RETURN d.metadata.name
ORDER BY d.metadata.name
OFFSET 10
Combining ORDER BY, LIMIT, and SKIP
You can combine these clauses for pagination:
// Get the second page of deployments (items 11-20)
MATCH (d:Deployment)
RETURN d.metadata.name AS name, d.spec.replicas AS replicas
ORDER BY name
SKIP 10
LIMIT 10
The order of LIMIT and SKIP/OFFSET can be reversed:
// This is equivalent to the above query
MATCH (d:Deployment)
RETURN d.metadata.name AS name, d.spec.replicas AS replicas
ORDER BY name
LIMIT 10
OFFSET 10
Relationship Pattern Ordering
Ordering also works with relationship patterns:
// Find deployment->pod relationships, ordered by pod name
MATCH (d:Deployment)->(rs:ReplicaSet)->(p:Pod)
RETURN d.metadata.name AS deployment, p.metadata.name AS pod
ORDER BY pod DESC
LIMIT 5
Temporal Expressions
Cyphernetes supports temporal expressions for filtering resources based on their creation or modification times.
The datetime() function returns the current date and time in ISO 8601 format when called without arguments.
The duration() function returns a duration in ISO 8601 format.
You may use plus (+) and minus (-) operators to add or subtract durations from a datetime:
// Find pods that were created in the last 24 hours
MATCH (p:Pod)
WHERE p.metadata.creationTimestamp > datetime() - duration("PT24H")
RETURN p.metadata.name;
// Delete pods that were created more than 7 days ago
MATCH (p:Pod)
WHERE p.metadata.creationTimestamp < datetime() - duration("P7D")
DELETE p;