Category: SRE

Observability is about (to) Change

At its heart, observability is about change. Without change, there's nothing to observe. A static system, frozen in time, offers no insights. Time itself is meaningless without change—it’s merely a measure of movement, of transformation. A clock ticks because its hands move; atoms vibrate. Change is the pulse of existence. So, when we talk about observability, we’re exploring how we perceive and interpret change within the systems we design.

More...

AIOps – SignOps + TaskOps

Organizations adopting AI for cloud operations face a critical evaluation of AIOps approaches. While recent advancements in AIOps frameworks standardize AI agent evaluation and enhancement, they risk a dead-end relying on conventional observability models lacking semantic depth. Service Cognition principles should form the AIOps foundation for truly autonomous cloud operations.

More...

Service Cognition – Quantitative to Qualitative

Organizations are drowning in an ocean of metrics. Every click, interaction, and transaction generates quantitative data points, creating vast repositories of numbers that promise insights but often deliver confusion. The challenge isn’t just the volume of data – it’s the fundamental question of how to transform these raw metrics into meaningful qualitative understanding.

More...

Bounded Observability

Reducing and compressing measurements is critical, which is much helped by representations extracted from the environment via hierarchical boundary determination. When this is not done automatically, what happens then is that the custom dashboard capabilities of the Observability solution need to be used to reconstruct some form of structure that mirrors the boundaries all but lost in the data fog. Naturally, this is extremely costly and inefficient for an organization.

More...

The Data Fog of Observability

The overemphasis on data instead of signals and states has created a great fog. This data fog leads to many organizations losing their way and overindulging in data exploration instead of exploiting acquired knowledge and understanding. This has come about with the community still somewhat unconcerned with a steering process such as monitoring or cybernetics.

More...

Change in Observability

Observability is effectively a process of tracking change. At the level of a measurement device, software or hardware-based, change is the difference in the value of two observations taken at distinct points in time. This change detection via differencing is sometimes called static or happened change. Observability is all about happenings.

More...

A Situational Control Tower

It is time for new direction closer aligned to goals, focused more on the dynamics of systems that humans are already highly adapted to with their social intelligence, within which situation is a crucial conceptual element of the cognitive model. Understanding and appropriately responding to different social situations is fundamental to social cognition and effective interpersonal interactions.

More...

Observability: Disruptions

Disruptions are one factor affecting the maintenance of service quality levels. A disruption is an interruption in the flow of (work) items through a network that can, for a while, make it inoperable or where the network flow performance is subpar. Depending on the severity of the disruption, a network may need to replan and restructure itself for a period afterward. There are two main categories of disruptions: disturbance and deviation.

More...

Observability: Projecting Ahead

The low-level data captured in volume by observability instruments has closed our eyes to salient change. We've built a giant wall of white noise. The human mind's perception and prediction capabilities evolved to detect significant changes to our survival. Observability has no steering mechanism to guide effective and efficient measurement, modeling, and memory processes. Companies are gorging on ever-growing mounds of observability data collected that should be of secondary concern.

More...

Observability: The OODA Loop

The OODA loop emphasizes two critical environmental factors - time constraints and information uncertainty. The time factor is addressed by executing through the loop as fast as possible. Information uncertainty is tackled by acting accurately. The model's typical presentation is popular because it closes the loop between sensing (observe and orient) and acting (decide and act).

More...