507 views
owned this note
# CNCF TAG Operational Resilience
## Current initiatives
https://github.com/cncf/toc/issues?q=state%3Aopen%20label%3A%22tag%2Foperational-resilience%22
## Template
---
## Day Month Year
### Host
### Attendees:
### Agenda to be discussed
#### Welcome & Housekeeping
- Code of Conduct
- Intro new joiners
#### Initiative CheckIn
#### Open Discussion
---
------------------------------------------------------
## TAG Operational Resilience 1st April 2026 (AMER)
### Host
- Carol Valencia
### Attendees:
## Agenda to be discussed
## TAG Operational Resilience 4th March 2026 (AMER)
### Host
- Mario Fahland
### Attendees:
- Mario Fahland
- Carol Valencia
- Matt Young
- Raghu Shankar
## Agenda to be discussed
- Initiatives review current status
## TAG Operational Resilience 4th February 2026 (AMER)
### Host
- Rafael Brito
### Attendees:
- Daniel Jiang
- Rafael Brito
- Carol Valencia
- Matt Young
- Mario Fahlandt
- Riaan Kleinhans
- Raffaele Spazzoli
## Agenda to be discussed
- Daniel
- Heads up: Donating velero to CNCF as Sandbox project
- [rafa] do we have an issue for the process? https://github.com/cncf/sandbox/ . See sandbox current board here: https://github.com/orgs/cncf/projects/14/views/1
- https://github.com/vmware-tanzu/velero
- https://github.com/cncf/toc/blob/main/toc_subprojects/project-reviews-subproject/sandbox-review-guide.md
- https://github.com/cncf/toc/blob/main/toc_subprojects/project-reviews-subproject/general-technical-questions.md
- [Initiative]: Reference framework for the levels of Service Reliability Automation:
- https://github.com/cncf/toc/issues/1984
- [Initiative]: CNCF Project Release Guidelines:
- https://github.com/cncf/toc/issues/1849
- [Initiative]: Help wanted - Cloud Native Business Continuity: whitepaper and best practices:
- https://github.com/cncf/toc/issues/1779
- [Initiative]: Cloud Native Observability Personas:
- https://github.com/cncf/toc/issues/2037
- Overview of initiatives: https://github.com/cncf/toc/issues?q=state%3Aopen%20label%3A%22tag%2Foperational-resilience%22
## TAG Operational Resilience 7th January 2026 (AMER)
### Host
- Riaan kleinhans
### Notetakers
Raw notes: Carol Valencia
### Attendees:
- Raffaele Spazzoli
- Carol Valencia
- Matt Young
- Emily Fox
- Raghu Shankar
- Alessandro Vozza
- Victor Lu
- Lionel Yanick Ishola
- Riaan kleinhans
## Agenda to be discussed
- Introduction to the TAG and the initiatives
- current initiatives in the TAG: https://github.com/cncf/toc/issues?q=state%3Aopen%20label%3A%22tag%2Foperational-resilience%22
- [Initiative]: Reference framework for the levels of Service Reliability Automation: https://github.com/cncf/toc/issues/1984
- [Initiative]: CNCF Project Release Guidelines: https://github.com/cncf/toc/issues/1849
- [Initiative]: Cloud Native Business Continuity: whitepaper and best practices: https://github.com/cncf/toc/issues/1779
- Last presentation in Kubecon about the TAG:
- [slides](https://docs.google.com/presentation/d/1e5GIXwMIYj--cqdZwpmhvFGT_ozjYqiUjuA6kjTstr8/edit?slide=id.p1#slide=id.p1)
- [video](https://www.youtube.com/watch?v=I7TuNNsJSZc)
## TAG Operational Resilience 3rd December 2025 (AMER)
### Host
Mario Fahlandt
### Notetakers
Raw notes: Carol Valencia
### Attendees:
- Severin Neumann
- Mario Fahlandt
- Carol Valencia
- Alexis Gonzalez
- Mark
### Agenda to be discussed
- [Initiative Proposal](https://github.com/cncf/toc/issues/1984) of whitepaper of Service Reliability Automation by Severin Neumann -
- What other companies authors will be involved?
## TAG Operational Resilience 19th November 2025 (APAC/EMEA)
### Host
### Notetakers
Saiyam Pathak
Raw notes:
### Attendees:
Nabarun Pal
Saiyam Pathak
Severin Neumann
### Agenda to be discussed
- Green Reviews - TOC vote completed in favour (https://github.com/cncf/toc/issues/1915)
- Sustainability month(https://www.cncf.io/blog/2025/10/24/cloud-native-sustainability-month-2025-a-global-community-movement-for-greener-tech/)
- Kubecon recap
#### Welcome & Housekeeping
- Code of Conduct
- Intro new joiners
#### Initiative CheckIn
- Project Release Guidelines
- Project maintainer interviews for their pain points on project release
- TODO: List of maintiers to talk with
- TODO: Call for projects or a simple google form for feedback? Sort and select projects across graduation levels.
#### Open Discussion
## TAG Operational Resilience 1st October 2025 (AMER)
### Host
Mario
### Notetakers
Raw notes:
### Attendees:
- Niki Manoledaki (she/her)
- Mario Fahlandt (he/him)
- Raghu Shankar
- Matt Young (he/him)
- Rafa Brito (he/him)
- Alolita Sharma
- Carol Valencia
### Agenda to be discussed
#### Welcome & Housekeeping
- Code of Conduct
- Intro new joiners
#### Initiative CheckIn
- [niki] Green Reviews subproject - questions for the TAG
- [Draft Application](https://docs.google.com/document/d/1RZxx2b3MBe6e5jkMpA7Ptr7PN4ApcL6-DThgtPeTs8s/edit?tab=t.0)
- [Mario] release initiative
- https://github.com/cncf/toc/issues/1849
- [working doc (DRAFT)](https://docs.google.com/document/d/17FJD1RbJWRdGfB5DsRAHSwdx5HUO7577O9zCruwLetw/edit?usp=sharing)
- [Matt] FYI - charter follow-up PR is incoming later this week ([original PR](https://github.com/cncf/toc/pull/1772#issuecomment-3271419954))
- [Matt] YouTube for TAG Operational Resilience channel
#### Open Discussion
- Green Reviews subproject - next steps:
- Create issue for subproject in `toc` repo
- Write a description with lead expectations
- Skillsets, logistical things (meeting series, repository & comms channels)
- Reach out to Kepler folks
------------------------------------------------------
## TAG Operational Resilience 3rd September 2025 (APAC/EMEA)
### Host
- Mario Fahlandt
### Notetakers
Raw notes:
### Attendees:
- Matt Young (he/him) - TL
- Mario Fahlandt (he/him - Kubermatic)
- Rafa Brito
- Raghu Shankar (Entrepreneur)
- Carol Valencia (she/her)
- Antonio Di Turi (he/him)
- Khushboo Nigam
- Ken Finnigan (Alteryx)
- Leonard Pahlke
- Antonio Di Turi
- Alolita Sharma - TL
### Agenda to be discussed
#### Welcome & Housekeeping
- Code of Conduct
- Intro new joiners
- Announcement:
Kubecon North America Kiosk Schedule:
Tuesday 3:30 PM - 7:45 PM; Wednesday 2:00 PM - 5:00 PM; Thursday 12:30 PM - 2:00 PM
Kiosk Number: 17B
Location: Building B | Level 1 | Exhibit Hall B3-B5 | Solutions Showcase
Please stop by to say HI!
#### Initiative CheckIn
- Green review presentation - what we do and why
- previous WG TAG Env (https://github.com/cncf-tags/green-reviews-tooling)
- want to become a new Subproject in TAG OP Resilience
- Pipleline deploys Observability Stack produces Carbon Footprint of the projects
- currently 1 project is active (Falco)
- can run on different releases
- Goal is to compare different Carbon Footprints across projects and also to have a long term plan
- Bi Weekly Call every wednesday
- also have a lot of good first issues are available
- requirements on technical Skills: k8s, DevOps, Go
- Roadmap currently aiming to conclude the automation
- to be done - create a fair comparison profile for projects
- Green review - discuss application details and review
- @Atonio will create the draft for the application
- New Initiative! https://github.com/cncf/toc/issues/1849
- Provide guidance and good practices to CNCF projects on Project Releases.
- We invite our community to provide comments or feedback in the working document until Friday Sept 12th, 2025. If interested in contributing to or helping to lead this Initiative please reach out here and/or directly to @Jeremy Rickard or any of the co-chairs or Technical Leads for our TAG. Thanks!
- reach out to multiple projects on different levels (Sandbox, Incubation, Graduated)
- Guidance on what is a good high quality releases
-
- Awaiting review on Observability QLS Initiative - https://github.com/cncf/toc/issues/1770
- TOC vote is required
- it is stuck currently on a CNCF Staff Level as this would require as a Standard on different Level
- Next Steps: Apply as a Sandbox Project than move to JDF eventually
- needs to move levels for adoption
#### Open Discussion
------------------------------------------------------
## TAG Operational Resilience 20th August 2025 (APAC/EMEA)
### Host
### Notetakers
Raw notes:
### Attendees:
- Nabarun
- Saiyam
- Sunyanan
### Agenda to be discussed
#### Welcome & Housekeeping
- Code of Conduct
- Intro new joiners
#### Initiative CheckIn
- Sustainability Week Initiative for 2025/2026?
- [Nabarun] Is this for 2025 or 2026? 2025 might be tricky because of lack of time/cycles left in 2025. 2026 is a better option. We can
- [Sunyanan] Need for volunteers
- [Saiyam] In 2024, planning starting way ahead in time.
- [Nabarun] Opening a Call for help and Interest Form in public channels
- [Sunyanan] There can be colocated events (like in Japan) as well as standalone meetups/miniconfs.
- Planning to organize one in 2025 as well to ensure continuity
- There was a badge (Credly?) to recognize organizers for their efforts. It helps to keep the motivation.
- [Saiyam] This year can be voluntary. We can check with CNCF if any help from them is possible. An official initiative will take time and might not happen in time for 2025.
- A minimal promotion scheme can be designed to market the event
- Action Item: Saiyam will reach out to Audra who can help on the CNCF side. Also, reach out to Leo to see who helped last year.
- TOC Issue for 2025: https://github.com/cncf/toc/issues/1691
- Green Review Initiative
- Saiyam talked with past leads to formalize a group.
- So far discussion is leaning towards a Subproject.
-
#### Open Discussion
- Access to the meeting
- There is a flag in PCC which needs a change
- Action Item: Nabarun to change the Slack bookmark to show the public calendar instead of PCC.
- Also add the Youtube channel/playlist.
------------------------------------------------------
## TAG Operational Resilience 6th August 2025 (AMER)
### Host
Rafa Brito
### Notetakers
Carol
#### Raw notes:
- Introductions
- Matt Young overview about TAG Operational Resilience, expectations.
- Ted Young about Open Telemetry project details: Governance Committee transition, Project management improvements, Core components: data model, semantic conventions. Overview about [browser instrumentation proposal](https://github.com/open-telemetry/community/blob/main/projects/browser-phase-1.md#browser-instrumentation-phase-1-proposal).
- Presentation by Raffaele Spazzoli about White paper on Business Continuity
### Attendees:
- Matt Young
- Rafael Brito (he/him)
- Raffaele Spazzoli
- Ted Young
- Raghu Shankar
- Carol Valencia (she/her)
- Alolita Sharma
### Agenda to be discussed
- TAG OpRes Kubecon NA Session: https://sched.co/27kZv
- White paper on Business Continuity (initiative to be approved TBD), pre-initiative, based on [TAG Storage White paper](https://github.com/cncf/tag-storage/blob/master/cloud-native-disaster-recovery-whitepaper/Cloud%20Native%20Disaster%20Recovery%20v2.pdf)
- OpenTelemetry new Browser SIG (Ted Young)
- background (OTLP)
- https://opentelemetry.io/docs/specs/otel/protocol
- opentelemetry-proto/docs/design-goals.md
- But what about the clients (browser)?
- Last Week --> [Coming soon to a browser near you: OpenTelemetry | Panel discussion with the OTel Browser SIG](https://youtu.be/E3QhtmvhfL8?si=bpO96HuM0yaKYEZ_)
- [browser-instrumentation-phase-1-proposal](https://github.com/open-telemetry/community/blob/main/projects/browser-phase-1.md#browser-instrumentation-phase-1-proposal)
- LOGO!
#### Welcome & Housekeeping
- Code of Conduct
- Intro new joiners
#### Initiative CheckIn
- [\[Initiative\]: Substation Project Evolution Plan for CNCF Sandbox · Issue \#1710 · cncf/toc · GitHub](https://github.com/cncf/toc/issues/1710)
- [\[Initiative\]: Observability Query Language Standardization Specification · Issue \#1770 · cncf/toc](https://github.com/cncf/toc/issues/1770)
- [\[Initiative\]: CNCF Software Supply Chain Insights · Issue \#1709 · cncf/toc · GitHub](https://github.com/cncf/toc/issues/1709)
- [slack kickoff (missive)](https://cloud-native.slack.com/archives/C08JZ9YLAA3/p1754432340757559?thread_ts=1753311497.308269&cid=C08JZ9YLAA3) `#tag-security-and-compliance`
- https://github.com/cncf/toc/issues/1709#issuecomment-3156934755
> We discussed this at the TAG Security and Compliance meeting today, and [started a thread](https://cloud-native.slack.com/archives/C08JZ9YLAA3/p1753311497308269) for interested participants. The TAG will assist with selecting a project lead and TAG liasons for the project and setting up meeting and reporting on an ongoing basis.
We're moving forward with initial steps to form a core team, establish meetings, and move forward with this initiative.
The internal CNCF Infra work mentioned above is encouraging! That said, this Initiative will aim to provide reference architecture(s) suitable for easy Adoption by End Users and our open Communities. The CNCF Infra work will inform our path and may serve as a good case study for how other LF Foundations (peers of the CNCF) might implement a similar approach.
End Users will invariably need to blend CNCF Project security artifacts with their own internal/private artifacts. Therefore this Initiative is not taking an explicit sequencing or external dependency on the CNCF Infra work. There's so much to do, and no need to wait. See slack for more details here:
[cncf /](https://github.com/cncf) [Projects /](https://github.com/orgs/cncf/projects) [TAG Operational Resilience](https://github.com/orgs/cncf/projects/71)
[https://github.com/orgs/cncf/projects/71](https://github.com/orgs/cncf/projects/71)
#### Open Discussion
------------------------------------------------------
## TAG Operational Resilience 16th July 2025
### Host
Mario
### Notetakers
Rafa
Raw notes:
- Round of intro of all leads, plus Natan Yellin, Julia Yin
- Charter: https://github.com/cncf/toc/pull/1772 (no feedback at this time)
- HolmesGPT Demo
- Where the WG will be reside
- Steps in creating an initiative
- Henrik Rexed
### Attendees:
- Mario Fahlandt (he / him, Kubermatic)
- Rafa Brito
- Matt Young
- Natan Yellin
- Chris Larsen
- Aritra Ghosh
- Julia Yin
- Raffaele Spazzoli
- Henrik Rexed
- Amanda Wang
- Nabarun Pal
- Pavneet Singgh
- Saiyam Pathak
### Agenda to be discussed
- Introduction
- Charter update/TAG update
- PR: https://github.com/cncf/toc/pull/1772 ([human-friendly viewing link](https://github.com/mfahlandt/toc/blob/patch-1/tags/tag-operational-resilience/charter.md))
- [HolmesGPT](https://github.com/robusta-dev/holmesgpt) presentation
- Current Maintainers
- Robusta
- Microsoft
- additional Maintainers but not yet publicly listed
- utilizes [liteLLM](https://github.com/BerriAI/litellm) (MIT licence) as a connector for models
- OTEL Support: it supports Prometheus integration already - https://github.com/robusta-dev/holmesgpt#-data-sources , loki and tempo
- similarities / differences to kAgent & k8sGPT
- [Saiyam] WG green reviews onboarding discussion to this TAG
- Create a [formal request](https://github.com/cncf/toc/issues/new?template=%20subproject-application.yaml) to create a subproject so it can be discussed
- [Henrik] Reach out to https://ecologits.ai/latest/ Start providing common ALgorithm to measure LLM energy footprint
- estimating energy consumption for prompts and sends it back
- to educate companies / communities on how much energy is being consumed by a prompt
### Existing Initiatives (proposed, not (yet) resourced)
- links
- (github): [kind/initiative, "operational resilience"](https://github.com/cncf/toc/issues?q=is%3Aissue%20state%3Aopen%20%22operational%20resilience%22%20label%3Akind%2Finitiative)
- (all) : https://github.com/orgs/cncf/projects/65
- (board) : https://github.com/orgs/cncf/projects/65/views/2 (seemingly under construction)
- **[[Initiative]: Observability Query Language Standardization Specification (#1770)](https://github.com/cncf/toc/issues/1770)** - _Last Updated: 2025-07-11_
- **[[Initiative]: CNCF Software Supply Chain Insights (#1709)](https://github.com/cncf/toc/issues/1709)** - _Last Updated: 2025-07-09_
- Proposed Plan (Draft): https://github.com/cncf/toc/pull/1743
- **[[Initiative]: Develop Subproject application for CNCF Knowledge Graph - Projects and Community Activity (#1712)](https://github.com/cncf/toc/issues/1712)** - _Last Updated: 2025-07-01_
- **[[Initiative]: CNCF Institutional Knowledge Preservation - Newly emeritus interview process (#1729)](https://github.com/cncf/toc/issues/1729)** - _Last Updated: 2025-06-19_
- **[[Initiative]: CNCF Project Capabilities Badging Framework (#1711)](https://github.com/cncf/toc/issues/1711)** - _Last Updated: 2025-06-19_
- **[[Initiative]: Ecosystem-Scale Open Source Development Pattern Analysis (#1708)](https://github.com/cncf/toc/issues/1708)** - _Last Updated: 2025-06-19_
- **[[Initiative]: Cloud Native Sustainability Week (#1691)](https://github.com/cncf/toc/issues/1691)** - _Last Updated: 2025-06-19_
- **[[Initiative]: Substation Project Evolution Plan for CNCF Sandbox (#1710)](https://github.com/cncf/toc/issues/1710)** - _Last Updated: 2025-06-19_