As an Asserts customer, all you need to do is connect your observability data sources, such as Prometheus and CloudWatch, to Asserts and then let Asserts apply its intelligence to present root cause insights.

Asserts is now part of Grafana Labs. All new customers will be onboarded on Grafana Cloud. Please sign up for early access at https://www.asserts.ai/

Builds an Entity Graph of App and Infra components (with Grafana Dashboards)

Asserts taps into your telemetry data sources like Prometheus, CloudWatch et al., automatically builds a graph of your application and infrastructure components, and indexes the graph for search.

With our search, you can see how the components fit together in real time and view KPIs in the built-in Grafana dashboard.

Runs curated rules to detect Service Unavailability and potential causes

Asserts curates knowledge of common runtime failure patterns and potential causes so your team doesn’t have to research and maintain these complex PromQL recording and alerting rules for frameworks.

It continuously tracks resource Saturations, Amends (i.e., changes such as new container deployments, config updates, Kafka consumer group rebalancing, and HPA scale events), Anomalies in request rate, error rate, and latency, systemic Failures (e.g., Pod crash looping, Cron Job failure), and Errors (e.g., 5XX / 4XX status codes, latency threshold breaches) on your golden signals and health metrics.

We call these checks Assertions. Occurrences of these assertions are annotated on the (Knowledge) Graph, so they are easy to consume at a glance.

Our assertion catalog is constantly evolving.

With our unified search, you can combine components, relations, configurations, and associated assertions to express your intent as a simple natural-language expression.

For example, search “Pods crashing on Nodes with high cpu:load”.

Wake up when it matters

The SRE book recommends alerting on Service Level Objectives (SLOs) to track "what's broken". With Asserts, setting up your SLOs and tracking your error budget is a breeze, and finding "why it's broken" is just a click away in our RCA workbench.
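To make the error budget idea concrete, here is a minimal Python sketch of the arithmetic behind a request-based SLO. It is not Asserts code: the function name `error_budget_remaining` and its parameters are hypothetical, and it assumes a simple availability SLO measured as the share of successful requests in a window.

```python
def error_budget_remaining(slo_target: float,
                           total_requests: int,
                           failed_requests: int) -> float:
    """Return the unspent fraction of the error budget for one SLO window.

    Hypothetical helper for illustration only.
    1.0 means nothing has been spent, 0.0 means the budget is exhausted,
    and a negative value means the SLO has been breached.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        # No traffic in the window, so no budget has been consumed.
        return 1.0
    # The budget is the number of failures the SLO tolerates in this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


# A 99% SLO over 10,000 requests tolerates about 100 failures;
# 50 failures therefore leave roughly half of the budget.
remaining = error_budget_remaining(0.99, 10_000, 50)
print(f"{remaining:.0%} of the error budget remains")
```

Alerting on how quickly this value falls, rather than on individual errors, is what keeps pages tied to user-visible impact.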

Spot issues quickly with Top Insights

With our always-on checks (aka Assertions), you don’t have to wait for SLOs to be breached and alerts to fire. Top Insights presents a stack-ranked view of the Services and Nodes that need attention, based on their severity score. From there, click Open in Workbench to find the root cause.
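The stack-ranking idea can be sketched in a few lines of Python. This is an illustration only, not how Asserts computes its severity score: the `Assertion` type, the severity labels, and the weights in `SEVERITY_WEIGHTS` are all assumptions made for the example.

```python
from dataclasses import dataclass

# Hypothetical weights; the real scoring used by Asserts is not described here.
SEVERITY_WEIGHTS = {"critical": 5, "warning": 2, "info": 1}


@dataclass(frozen=True)
class Assertion:
    entity: str    # Service or Node the assertion fired on
    name: str      # e.g. "ErrorRatioBreach" (made-up name)
    severity: str  # one of the keys in SEVERITY_WEIGHTS


def top_insights(assertions, limit=10):
    """Rank entities by the summed weight of their firing assertions."""
    scores = {}
    for a in assertions:
        scores[a.entity] = scores.get(a.entity, 0) + SEVERITY_WEIGHTS[a.severity]
    # Highest score first; ties are broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:limit]


firing = [
    Assertion("api-server", "ErrorRatioBreach", "critical"),
    Assertion("api-server", "LatencyAnomaly", "warning"),
    Assertion("node-1", "CpuSaturation", "warning"),
    Assertion("db", "ConfigAmend", "info"),
]
for entity, score in top_insights(firing):
    print(entity, score)
```

Summing weights per entity is just one simple choice; any scoring that orders entities by how much attention they need would serve the same purpose.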

Troubleshoot in Workbench with all the Root cause Insights

In our assertion workbench, dig in to view all the possible causes correlated across time and dependency, with just the right metrics, logs, and traces at your fingertips.

For example, an amend (a new deployment) on api-server triggered a spike in the error rate on the endpoint /slo/incidents. Jump to Dashboard or View Logs to see contextual logs in your existing log store, such as Kibana or Graylog.
