Lambda

Asserts AWS Exporter

Refer to Asserts AWS Exporter and Lambda Layer on pre-requires for observing Lambda function.

Key Performance Indicators (KPIs) and Alerts

Request, Errors, and Latency

Metric

Key Performance Indicator (KPI)

Request Counter

aws_lambda_invocations_sum

Request Rate

avg_over_time(aws_lambda_invocations_sum[5m])

Error Counter

aws_lambda_errors_sum

Error Ratio

avg_over_time(aws_lambda_errors_sum[5m])/ avg_over_time(aws_lambda_invocations_sum[5m])

Latency Quantile

aws_lambda_duration_p99

Request Throttle Count

aws_lambda_throttles_sum

Observe throttle count for Sustained Throttling

aws_lambda_throttles_sum

Resource

Metric

Key Performance Indicator (KPI)

Memory NodeJS

process_heap_bytes

Memory Python

process_resident_memory_bytes

NodeJS

process_heap_bytes / aws_lambda_memory_limit_mb

Python

process_resident_memory_bytes / aws_lambda_memory_limit_mb

CPU Total Time

process_cpu_seconds_total

CPU Time Usage

rate(process_cpu_seconds_total[5m])

Concurrent Executions

aws_lambda_concurrent_executions_avg

When concurrency is reserved at the function level

aws_lambda_concurrent_executions_avg / aws_lambda_allocated_concurrency

For account level

aws_lambda_concurrent_executions_avg / aws_lambda_account_limit{type="concurrent_executions"}

Alerts

KPI

Alert

Request Rate

RequestRateAnomaly

Error Ratio

ErrorRatioBreach and ErrorBuildup based on an availability SLO of 99.9

Latency Average

LatencyAverageBreach and LatencyAverageAnomaly

Latency P99

LatencyP99ErrorBuildup

Request Throttle Count

LambdaFunctionThrottled

Memory Utilization

Saturation with severity level of warning and critical when memory utilization exceeds 90% and 95% respectively

CPU Time Usage

ResourceRateAnomaly

Network Bytes

ResourceRateAnomaly

Lambda Concurrency

Saturation at Function and Account level

KPI Dashboard

The Lambda KPI Dashboard shows all the KPIs and also the Cold Starts for correlation with Traffic Spike and Latency degradation

Last updated