Features
As an Asserts customer, all you need to do is connect your observability data sources, such as Prometheus and CloudWatch, to Asserts and let it apply its intelligence to present root-cause insights.
Asserts is now part of Grafana Labs. All new customers will be onboarded on Grafana Cloud. Please sign up for early access at https://www.asserts.ai/
Builds an Entity Graph of App and Infra components (with Grafana Dashboards)
Asserts taps into your telemetry data sources like Prometheus, CloudWatch et al., automatically builds a graph of your application and infrastructure components, and indexes the graph for search.
With our search, you can see how the components fit together in real time and view KPIs in the built-in Grafana dashboard.
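To make the idea of a searchable entity graph concrete, here is a minimal sketch of how one could be derived from metric labels. It is an illustration only, not Asserts' actual implementation: the label names (`service`, `pod`, `node`), the sample series, and the helper functions are all hypothetical.

```python
from collections import defaultdict

# Hypothetical label sets, as they might appear on Prometheus series.
SERIES = [
    {"service": "api-server", "pod": "api-server-0", "node": "node-a"},
    {"service": "api-server", "pod": "api-server-1", "node": "node-b"},
    {"service": "checkout", "pod": "checkout-0", "node": "node-a"},
]


def build_entity_graph(series):
    """Return adjacency sets linking Service -> Pod -> Node entities."""
    graph = defaultdict(set)
    for labels in series:
        service = ("Service", labels["service"])
        pod = ("Pod", labels["pod"])
        node = ("Node", labels["node"])
        graph[service].add(pod)  # a service runs as pods
        graph[pod].add(node)     # a pod is hosted on a node
    return dict(graph)


def pods_on_node(graph, node_name):
    """Search the graph: which pods are hosted on the given node?"""
    target = ("Node", node_name)
    return sorted(
        name
        for (kind, name), edges in graph.items()
        if kind == "Pod" and target in edges
    )
```

Calling `pods_on_node(build_entity_graph(SERIES), "node-a")` answers a simple "what runs where" question from labels alone, which is the kind of relationship a graph search exposes.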
Runs curated rules to detect Service Unavailability and potential causes
Asserts curates knowledge of common runtime failure patterns and potential causes so your team doesn’t have to research and maintain these complex PromQL recording and alerting rules for frameworks.
It continuously tracks resource Saturations, Amends (i.e., changes such as new container deployments, config updates, Kafka consumer group rebalancing, HPA scale events et al.), Anomalies in request rate, error rate & latency, systemic Failures (e.g., Pod crash looping, Cron Job failure), and Errors (e.g., 5XX / 4XX status codes, latency threshold breaches) on your golden signals and health metrics.
We call these checks Assertions. Occurrences of these assertions are annotated on the (Knowledge) Graph, so they are easy to consume at a glance.
Our assertion catalog is constantly evolving.
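As an illustration of what an Anomaly-style check on request rate might look like, the sketch below flags a sample that deviates strongly from recent history using a plain z-score. This technique is swapped in for illustration; the source does not say how Asserts detects anomalies, and the function name and the threshold of 3.0 are assumptions.

```python
from statistics import mean, stdev


def is_anomalous(history, current, threshold=3.0):
    """Flag `current` if it lies more than `threshold` standard
    deviations away from the mean of `history`."""
    if len(history) < 2:
        return False  # not enough data to estimate spread
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        # A perfectly flat baseline: any change at all is a deviation.
        return current != mu
    return abs(current - mu) / sigma > threshold
```

With a baseline such as `[100, 102, 98, 101, 99]` requests per second, a reading of 150 is flagged while a reading of 101 is not.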
Exploration with Unified Search
With our unified search, you can combine components, relations, configurations, and associated assertions to express your intent in an easy natural language expression.
e.g., search “Pods crashing on Nodes with high cpu:load”.
Wake up when it matters
The SRE book recommends alerting on Service Level Objectives (SLOs) to track "what's broken", and with Asserts, setting up your SLOs and tracking your error budget is a breeze. Finding "why it's broken" is then just a click away in our RCA workbench.
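For readers new to error budgets, the arithmetic behind them can be sketched as follows. This is a generic request-based calculation, not Asserts' own; the function name and parameters are hypothetical.

```python
def error_budget_remaining(slo_target, total_requests, failed_requests):
    """Return the unspent fraction of the error budget.

    1.0 means nothing has been spent, 0.0 means the budget is exhausted,
    and a negative value means the SLO has been breached.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests == 0:
        return 1.0  # no traffic yet, so no budget has been spent
    # Failures the SLO tolerates over this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures
```

For example, a 99.9% SLO over 1,000,000 requests tolerates about 1,000 failures, so 250 failed requests leave roughly three quarters of the budget.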
Spot issues quickly with Top Insights
With our always-on checks (aka Assertions), you don’t have to wait for SLOs to breach and alerts to fire. Top Insights presents a stack-ranked view of the Services / Nodes that need attention, based on their severity score. Then use Open in Workbench to find the root cause.
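A stack-ranked view of this kind can be pictured as a sort over per-entity scores. The sketch below is purely illustrative: the severity levels, their weights, and the summing rule are assumptions, since the source does not describe how the severity score is computed.

```python
# Hypothetical weights per assertion severity (not Asserts' real scoring).
SEVERITY_WEIGHTS = {"critical": 10, "warning": 3, "info": 1}


def top_insights(assertions, limit=5):
    """Rank entities by the summed weight of their firing assertions.

    `assertions` is an iterable of (entity_name, severity) pairs.
    """
    scores = {}
    for entity, severity in assertions:
        scores[entity] = scores.get(entity, 0) + SEVERITY_WEIGHTS[severity]
    # Highest score first; ties are broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:limit]
```

Given firing assertions such as a critical and a warning on `api-server` and two warnings on `node-a`, `api-server` ranks first because its summed weight is higher.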
Troubleshoot in Workbench with all the Root cause Insights
In our assertion workbench, dig in to view all the possible causes correlated across time and dependency, with just the right metrics, logs, and traces at your fingertips.
e.g., an amend (new deployment) on api-server triggered a spike in error rate on an endpoint /slo/incidents. Jump to Dashboard or View Logs to see contextual logs in your existing log store, such as Kibana or Graylog.