Observability

Sunil Marella
Toggle Theme

Observability & AIOps

By Sunil Marella


Topics Covered

Observability

Observability is defined as the ability of the internal states of a system to be determined by its external outputs.

Pillars of Observability

Observability relies on three main types of telemetry data: metrics, logs, and traces.

Other Telemetry Data:

OpenTelemetry

OpenTelemetry is an open-source observability framework designed to provide comprehensive insights into software systems' health, performance, and behavior. It serves as a standard for collecting, processing, and exporting telemetry data, such as traces, metrics, and logs, from distributed systems, applications, and services.

Instrumentation

This technique effectively adds instructions to the target program to collect the required information.

AIOps

AIOps is the use of AI and machine learning to help address challenges faced by IT teams. AIOps can help engineers do things like find the root cause of complex application performance problems or automatically remediate infrastructure failures.

AIOps Capabilities

These tools offer a range of features, such as intelligent event correlation, automated incident management, predictive analytics, and anomaly detection. By leveraging these AIOps tools, organizations can enhance their proactive monitoring capabilities, gain actionable insights, and streamline their monitoring processes. Each tool has its strengths and focuses on different aspects of AIOps, allowing organizations to choose the most suitable solution based on their specific monitoring needs.

ITOps

ITOps is the process of implementing, managing, delivering and supporting IT services to meet the business needs of internal and external users.

My Observability Design

Follow below tutorial to build end to end visibility for your organization to identify issues proactively.

My Observability Steps

Follow below tutorial to build end to end visibility for your organization to identify issues proactively.

Step 1 - ITOps

    Servicenow ITOM - Docs

Step 2 - Data Collection/Edge Processing

    Open Telemetry (In Progress) - Docs

    Splunk Edge Processing (In Progress) - Docs

    Cribl (In Progress) - Docs

Step 3 - Observability

    Splunk Observability - Docs

    Dynatrace - Docs

    Grafana (In Progress) - Docs

    Appdynamics (In Progress) - Docs

    Elastic (In Progress) - Docs

    DataDog (In Progress) - Docs

    NewRelic (Yet to start) - Docs

Step 4 - AIOps

    Introduction - Docs

    Moog (Yet to start) - Docs

    ServiceNow (Yet to start) - Docs

    BigPanda (Yet to start) - Docs

    Splunk ITSI - Docs

Additional Information

Splunk

Disclaimer: This is purely based on my learning, knowledge and reference from tutorial / documentation.

My Contact Information

👉 LinkedIn GitHub My Page

My Other sites
👉 My Observability My AIOps My A.I. My Architecture