[DEV] Add grouplevel key-value pairs to the metadata for use on the node #17

alexlovelltroy · 2024-10-22T19:59:44Z

Short Description
SMD can provide the cloud-init server with the set of group names that an individual node belongs to. It cannot provide the key/value pairs that the node needs at boot time. We need to store key/value pairs in the cloud-init server that are then added to the payload.

For example, we might want a different remote syslog aggregator for each rack of nodes. On the SMD side, the admin can create a group that represents that rack, possibly even named by xname (x3000). When cloud-init retrieves the list of groups for a node, that group will be included.

"meta-data": {
  "groups": ["x3000"]
}

If the admin then also adds a key/value pair to the cloud-init server named for the group, that can be included in the client payload as well.

"meta-data": {
  "groups": [
    { "x3000": { "syslog_aggregator": "192.168.0.1" } }
  ]
}
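To make the intended payload shape concrete, here is a minimal Python sketch of how the group list from SMD and the key/value pairs stored in cloud-init might be combined. The function name `build_meta_data`, the in-memory `group_data` dict, and the choice to emit an empty object for a group with no stored values are all assumptions for illustration, not the server's actual implementation.

```python
import json


def build_meta_data(node_groups, group_data):
    """Return a meta-data dict with one {group: {key: value}} entry per group.

    node_groups: group names the node belongs to (as reported by SMD).
    group_data:  key/value pairs stored in cloud-init, keyed by group name.
    """
    groups = []
    for name in node_groups:
        # Copy the stored values so the payload cannot mutate the store.
        # Assumption: a group with no stored values yields an empty object.
        groups.append({name: dict(group_data.get(name, {}))})
    return {"groups": groups}


# Hypothetical inputs for illustration only.
node_groups = ["x3000"]
group_data = {
    "x3000": {"syslog_aggregator": "192.168.0.1"},
    # A misspelled group is stored without complaint but never matches a node.
    "x300": {"syslog_aggregator": "10.0.0.1"},
}

meta_data = build_meta_data(node_groups, group_data)
print(json.dumps({"meta-data": meta_data}, indent=2))
```

Because membership comes only from `node_groups`, a misspelled group stored in cloud-init is simply never delivered to any node.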

Key/value pairs for a group should be managed through standard CRUD operations, with POST to create and PUT to update.

/cloud-init/groups/ PUT --> {key: value, key: value}
/cloud-init/groups POST --> {key: value, key: value}

There is no need to query SMD in order to add or update a group. This means that admins can put misspelled group names into cloud-init and nothing will stop them.

Require a JWT for changes, and log each operation with information about who made the change, obtained through JWT interrogation.
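A rough Python sketch of the requirements above: CRUD operations on group key/value pairs with no SMD validation, where every change records who made it based on the JWT. The `GroupStore` class, the `jwt_subject` helper, and the use of the `sub` claim are hypothetical choices. Note that this sketch only decodes the token payload and does NOT verify the signature; a real server would need to verify the token before trusting any claim.

```python
import base64
import json
import logging

logger = logging.getLogger("cloud-init.groups")


def jwt_subject(token):
    """Return the 'sub' claim of a JWT WITHOUT verifying its signature."""
    payload_b64 = token.split(".")[1]
    payload_b64 += "=" * (-len(payload_b64) % 4)  # restore stripped padding
    claims = json.loads(base64.urlsafe_b64decode(payload_b64))
    return claims.get("sub", "unknown")


class GroupStore:
    """In-memory group key/value store; group names are not checked against SMD."""

    def __init__(self):
        self._groups = {}
        self.audit = []  # list of {"action", "group", "who"} records

    def _record(self, action, name, token):
        who = jwt_subject(token)
        self.audit.append({"action": action, "group": name, "who": who})
        logger.info("%s group=%s by=%s", action, name, who)

    def create(self, name, values, token):  # POST
        if name in self._groups:
            raise KeyError(f"group {name!r} already exists")
        self._groups[name] = dict(values)
        self._record("create", name, token)

    def read(self, name):  # GET
        return dict(self._groups[name])

    def update(self, name, values, token):  # PUT
        self._groups[name].update(values)
        self._record("update", name, token)

    def delete(self, name, token):  # DELETE
        del self._groups[name]
        self._record("delete", name, token)


def demo_token(sub):
    """Build an unsigned, JWT-shaped token for demonstration purposes only."""
    def seg(obj):
        raw = json.dumps(obj).encode()
        return base64.urlsafe_b64encode(raw).rstrip(b"=").decode()
    return ".".join([seg({"alg": "none"}), seg({"sub": sub}), ""])


store = GroupStore()
token = demo_token("admin@example.org")
store.create("x3000", {"syslog_aggregator": "192.168.0.1"}, token)
store.update("x3000", {"syslog_aggregator": "192.168.0.2"}, token)
print(store.read("x3000"), store.audit)
```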

alexlovelltroy · 2024-10-23T15:40:47Z

The current merge logic that allows any group to specify any portion of the cloud-init payload may be problematic for admins who make a change to one group, expecting it to manifest, but then a merge removes their changes.

A concrete use case that we need to support is a default cloud-init for all nodes in the compute group, with additional information provided because the node is part of the x3000 group representing the rack, as well as a canary_123 group which defines a set of nodes to test rolling out a new image. All three of these may provide information about the rsyslog aggregator. Which one actually gets written to the filesystem, and how? How should configurator be involved?

The current logic allows an admin to target a node directly with a singleton cloud-init payload which will supersede all group related cloud-init information. We should review this from an admin perspective to identify desired behavior when a custom cloud-init for a particular node is no longer useful.

Another thought for consideration is how to handle things like write_files and runcmd. Are they purely additive? Should one group be able to override a different group?
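To make the questions above easier to discuss, here is a Python sketch of one possible merge policy, not a decision: groups are applied in an explicit order, later groups override earlier ones for ordinary keys, and write_files and runcmd are treated as purely additive. The function name, the `ADDITIVE_KEYS` set, the ordering, and the sample addresses are all assumptions.

```python
ADDITIVE_KEYS = {"write_files", "runcmd"}


def merge_user_data(layers):
    """Merge cloud-init dicts in order.

    Later layers win for ordinary keys; list values under ADDITIVE_KEYS
    are concatenated so no group can silently drop another group's entries.
    """
    merged = {}
    for layer in layers:
        for key, value in layer.items():
            if key in ADDITIVE_KEYS:
                merged[key] = merged.get(key, []) + list(value)
            else:
                merged[key] = value
    return merged


# Hypothetical group payloads: all three set the aggregator.
compute = {"syslog_aggregator": "10.0.0.1", "runcmd": ["echo compute"]}
x3000 = {
    "syslog_aggregator": "192.168.0.1",
    "write_files": [{"path": "/etc/hello", "content": "hello world"}],
}
canary_123 = {"syslog_aggregator": "10.0.0.99", "runcmd": ["echo canary"]}

# Assumed precedence: least specific first, most specific last.
result = merge_user_data([compute, x3000, canary_123])
print(result)
```

With this ordering the canary_123 value for the aggregator is the one written, while both runcmd entries survive. A different ordering, or a per-node override applied last, would follow the same pattern.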

davidallendj · 2024-10-23T17:16:56Z

I was able to achieve something similar just using the /cloud-init endpoint running locally with curl http://127.0.0.1:27777/cloud-init -d @data.json. The data.json looks like the following:

{
  "name": "IDENTIFIER",
  "cloud-init": {
    "userdata": {
      "write_files": [
        {
          "content": "hello world",
          "path": "/etc/hello"
        }
      ]
    },
    "metadata": {
      "groups": [
        {
          "x3000": {
            "syslog_aggregator": "192.168.0.1"
          }
        }
      ]
    }
  }
}

Then, doing curl 'http://127.0.0.1:27777/cloud-init/IDENTIFIER/meta-data' gives me this:

groups:
- x3000:
    syslog_aggregator: 192.168.0.1

Is this already doing what is expected? I think we'd still want to do the following:

- address the merge behavior
- remove the SMD query
- add logging
- add a PUT to update (there's already an endpoint for this with /{id})

Other than that, it seems to me that this is already doing the above.
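For anyone who prefers a script over curl, the two requests in this comment can be sketched in Python with only the standard library. The base URL and paths are the ones from the curl commands; the helper names and the explicit JSON Content-Type header are assumptions (curl -d sends a form-encoded content type by default). The calls that actually contact the server are left commented out so the block runs without one.

```python
import json
import urllib.request

BASE_URL = "http://127.0.0.1:27777/cloud-init"

# Same structure as data.json in this comment.
payload = {
    "name": "IDENTIFIER",
    "cloud-init": {
        "userdata": {
            "write_files": [{"content": "hello world", "path": "/etc/hello"}]
        },
        "metadata": {
            "groups": [{"x3000": {"syslog_aggregator": "192.168.0.1"}}]
        },
    },
}


def build_post(data):
    """Build (but do not send) the POST request that uploads the payload."""
    return urllib.request.Request(
        BASE_URL,
        data=json.dumps(data).encode(),
        headers={"Content-Type": "application/json"},  # assumption
        method="POST",
    )


def fetch_meta_data(identifier):
    """GET the rendered meta-data for one identifier; needs a running server."""
    with urllib.request.urlopen(f"{BASE_URL}/{identifier}/meta-data") as resp:
        return resp.read().decode()


request = build_post(payload)
print(request.get_method(), request.full_url)

# With the server running locally:
# urllib.request.urlopen(request)
# print(fetch_meta_data("IDENTIFIER"))
```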

travisbcotton · 2024-10-23T17:23:41Z

Why are we removing the SMD query again?

davidallendj · 2024-10-23T18:28:05Z

> Why are we removing the SMD query again?

I think the idea here is that since group names can be any arbitrary thing without any validation from SMD, there's no need to do a query.

travisbcotton · 2024-10-23T18:34:07Z

The query finds the SMD groups the requesting node is a member of. I don't know how you are going to provide cloud-init data to groups of nodes without making that query.

alexlovelltroy · 2024-10-23T19:05:57Z

When generating the payload to send to the client, the cloud-init server needs to call SMD to find out what groups the node is a part of.

alexlovelltroy assigned davidallendj Oct 22, 2024

davidallendj mentioned this issue Oct 24, 2024

Add API to manage groups in meta-data #19

Merged

davidallendj linked a pull request Oct 30, 2024 that will close this issue

Add API to manage groups in meta-data #19

Merged

davidallendj closed this as completed in #19 Nov 7, 2024
