...
Metric | Metric available? | Exposed via Prometheus endpoint? | Comment | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Availability of the policy-pap service | Yes | No | policy-pap healthcheck API | |||||||||
Status of PDPs as registered with policy-pap | Yes | No | policy-pap consolidated healthcheck API | |||||||||
Request rate (API requests per minute) | No | No | To be implemented for all the endpoints exposed by policy-pap. Sample s3p numbers for policy-pap stress tests. | |||||||||
Failure rate (API errors per minute) | No | No | To be implemented for all the endpoints exposed by policy-pap. Number of API calls with non 200 family of status codes per minute | |||||||||
Latency | No | No | To be implemented for all the endpoints exposed by policy-pap. | |||||||||
Policy deployment statistics policyDeployFailureCount | Yes | No | Sample:
| |||||||||
SSL certificate expiry time | No | No | https is disabled for entire Policy framework |
Key metrics for Policy APEX PDP
Metric | Metric available? | Exposed via Prometheus endpoint? | Comment | |||||||
---|---|---|---|---|---|---|---|---|---|---|
Availability of policy-apex-pdp | Yes | No | Exposed by policy-pap consolidated healthcheck. | |||||||
Policy Deployment counter (per apex-pdp instance) policyDeployCount | Yes | No | Exposed by policy-pap statistics
| |||||||
Policy Execution counter (per apex-pdp instance) # of policies executed *Note: the stats currently displays APEX policy counters | No | No | ||||||||
Engine count Can be inferred from the size of the json array object "engineStats" | Yes | No | ||||||||
Engine availability details (by engineID per apex-pdp instance) engineTimestamp: timestamp at which the statistics were recorded | Yes | No | ||||||||
Count of events processed (per engine thread, per apex-pdp instance) # of incoming trigger events processed by policy-apex-pdp *Note: the stats currently displays APEX event counters processed by the engine | No | No | ||||||||
Latency | No | No | Time taken for processing an incoming network trigger event by the policy *Note: the stats currently displays execution time for processing APEX policy. | |||||||
Kafka consumer lag | No | No | Can be implemented outside of the Policy FWK. Monitor kafka consumer lag increase for kafka/dmaap-message-router topics related to apex-pdp | |||||||
SSL certificate expiry time (wherever applicable) | No | Nohttps is disabled for all of Policy FWK |
Key metrics for Policy Drools PDP
...