Lambda
Asserts AWS Exporter
Refer to Asserts AWS Exporter and Lambda Layer for the prerequisites for observing Lambda functions.
Key Performance Indicators (KPIs) and Alerts
Request, Errors, and Latency
Metric | Key Performance Indicator (KPI) |
Request Counter: aws_lambda_invocations_sum | Request Rate: avg_over_time(aws_lambda_invocations_sum[5m]) |
Error Counter: aws_lambda_errors_sum | Error Ratio: avg_over_time(aws_lambda_errors_sum[5m]) / avg_over_time(aws_lambda_invocations_sum[5m]) |
Latency Quantile | aws_lambda_duration_p99 |
Request Throttle Count: aws_lambda_throttles_sum | Sustained Throttling: observe the throttle count aws_lambda_throttles_sum |
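As a rough illustration of the Request Rate and Error Ratio KPIs above, the following Python sketch mimics what avg_over_time does over a single 5m window. The sample values and the helper names (avg_over_time, error_ratio) are hypothetical and exist only for this example; in practice the KPIs are evaluated by the PromQL expressions in the table.

```python
from statistics import mean

def avg_over_time(samples):
    """Average of the samples in one window, mirroring PromQL's avg_over_time."""
    return mean(samples)

def error_ratio(error_samples, invocation_samples):
    """Error Ratio KPI: windowed average of errors over windowed average of invocations."""
    invocations = avg_over_time(invocation_samples)
    if invocations == 0:
        # No traffic in the window, so the ratio is undefined.
        return None
    return avg_over_time(error_samples) / invocations

# Hypothetical samples covering one 5m window (assumed values, not real data).
invocation_samples = [100, 120, 80, 100, 100]  # aws_lambda_invocations_sum
error_samples = [1, 0, 2, 1, 1]                # aws_lambda_errors_sum

request_rate = avg_over_time(invocation_samples)
ratio = error_ratio(error_samples, invocation_samples)
print(f"request rate: {request_rate}, error ratio: {ratio:.3f}")
```

Returning None for an idle window is a choice made for this sketch, so that a function with no invocations is not reported as having a 0% error ratio.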
Resource
Metric | Key Performance Indicator (KPI) |
Memory (NodeJS): process_heap_bytes; Memory (Python): process_resident_memory_bytes | Memory Utilization (NodeJS): process_heap_bytes / aws_lambda_memory_limit_mb; Memory Utilization (Python): process_resident_memory_bytes / aws_lambda_memory_limit_mb |
CPU Total Time | CPU Time Usage |
Concurrent Executions: aws_lambda_concurrent_executions_avg | Function level (when concurrency is reserved): aws_lambda_concurrent_executions_avg / aws_lambda_allocated_concurrency; Account level: aws_lambda_concurrent_executions_avg / aws_lambda_account_limit{type="concurrent_executions"} |
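The utilization KPIs in this table are all ratios of a usage metric to a limit. The Python sketch below works through them with made-up numbers. It assumes that aws_lambda_memory_limit_mb is expressed in megabytes, as its name suggests, and converts it to bytes before dividing. That conversion, the function names, and the sample values are assumptions of this example and are not stated in the table.

```python
BYTES_PER_MB = 1024 * 1024

def memory_utilization(process_memory_bytes, memory_limit_mb):
    """Memory used by the runtime process as a fraction of the configured limit.

    Assumes the limit is in megabytes and converts it to bytes first.
    """
    return process_memory_bytes / (memory_limit_mb * BYTES_PER_MB)

def concurrency_utilization(concurrent_executions_avg, limit):
    """Concurrent executions as a fraction of a reserved or account-level limit."""
    return concurrent_executions_avg / limit

# Hypothetical values: a 128 MB function using 96 MB of memory.
mem = memory_utilization(96 * BYTES_PER_MB, 128)

# Hypothetical values: 40 concurrent executions on average.
function_level = concurrency_utilization(40, 50)    # vs. aws_lambda_allocated_concurrency
account_level = concurrency_utilization(40, 1000)   # vs. aws_lambda_account_limit

print(mem, function_level, account_level)
```

The same average concurrency can look saturated at the function level and negligible at the account level, which is why the table lists both denominators.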
Alerts
KPI | Alert |
Request Rate | RequestRateAnomaly |
Error Ratio | ErrorRatioBreach and ErrorBuildup, based on an availability SLO of 99.9% |
Latency Average | LatencyAverageBreach and LatencyAverageAnomaly |
Latency P99 | LatencyP99ErrorBuildup |
Request Throttle Count | LambdaFunctionThrottled |
Memory Utilization | Saturation, with severity warning when memory utilization exceeds 90% and critical when it exceeds 95% |
CPU Time Usage | ResourceRateAnomaly |
Network Bytes | ResourceRateAnomaly |
Lambda Concurrency | Saturation at Function and Account level |
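The memory Saturation row above maps a utilization value to a severity. A minimal Python sketch of that mapping follows, assuming that "exceeds" means strictly greater than the threshold. The function and constant names are hypothetical, and this is not the Asserts rule definition itself.

```python
from typing import Optional

WARNING_THRESHOLD = 0.90   # memory utilization above 90% -> warning
CRITICAL_THRESHOLD = 0.95  # memory utilization above 95% -> critical

def saturation_severity(utilization: float) -> Optional[str]:
    """Return the Saturation severity for a utilization fraction, or None if healthy."""
    # Check the higher threshold first so critical takes precedence over warning.
    if utilization > CRITICAL_THRESHOLD:
        return "critical"
    if utilization > WARNING_THRESHOLD:
        return "warning"
    return None

# Hypothetical utilization values for illustration.
for value in (0.50, 0.92, 0.97):
    print(value, saturation_severity(value))
```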
KPI Dashboard
The Lambda KPI Dashboard shows all of the KPIs above, along with Cold Starts, so that cold starts can be correlated with traffic spikes and latency degradation.