Jira Legacy

server	System Jira
columnIds	issuekey,summary,issuetype,created,updated,duedate,assignee,reporter,priority,status,resolution
columns	key,summary,type,created,updated,due,assignee,reporter,priority,status,resolution
serverId	4733707d-2057-3a0f-ae5e-4fd8aff50176
key	CPS-1415

Issues and Decisions

#	Issue	Notes	Decision
1	how fast should CPS (and DB) be able to process max heart beat failures	is 60K really realistic if ENM goes down we should get a notification for each node do we ?!	PoC has shown 60 seconds is reasonable
2	restart of NCMP	should/can this be handled
3

Description

Define scenarios which cause a CM Handle to go stale
Implement changes to support tracking of CM Handle Freshness/Staleness

...

dmi plugin identifies that the device is no longer contactable
dmi plugin identifies that an underlying device manager managing the device (node) is out of sync with the device itself.

Requirements

Functional

#	Interface	Requirement	Additional Information	Sign-off
1	CPS-NCMP-

I

E-

01

05	The 'trustlevel' can

be queried (

is visible) on the

methos

methods as currently the 'cm handle state'	can be new or existing (preferred) endpoint
2	CPS-NCMP-

I-01

E-05	CM Handles can be queried (filter condition) on 'trustlevel'	using a new 'trustLevel' condition (cannot use cpsPath condition)
3	CPS-NCMP-E-05	Once a CM Handle is registered

(TBD which state exactly?)

the trust -level for that CM Handle should

be

be reported to be 'COMPLETE'

3


4	CPS-NCMP-

I

E-

01

05

Once DMI (plugin) is detected to be down the trust-level for all affected CM Handles should be reported to be 'NONE'

It might not need to be persisted....

4


5	CPS-NCMP-I-01

REST or ASYNC TBD

.e	DMI plugin can report the current trustlevel of a single

(or collection?) of

cm handle

(

id

)s

ie. the DMI can tell NCMP the trustlevel is 'NONE' when a node heartbeat failure is detected and 'COMPLETE' once it is restored

5Notification on trustlevel changes ?!

Error Handling

#

Error Scenario

Expected behavior

1

NCMP restart Options:

Trustlevels should as they were before the restart? (might depend on how much time has elapsed)

(preferred) (all instances)

To be discussed, not suer if it can/should be handled

Trustlevels should be 'NONE' and need to be restored using an audit-request (not in scope)

2

Characteristics

#	Parameter	Expectation	Notes	Sign-off
1	dmi-down detection speed

60

30 seconds

(TBD)


2	maximum number of cm-handles down report by DMI in one request and/or per minute	30,000 / minute

This looks like an 'ENM down' not sure if that should be handled this way

a peak can be processed within 60 seconds
3	processing of all trustlevel time for DMI-Down and/or peak load by DMI	1 second

Out-of-Scope

This epic will only introduce trustlevels NONE and COMPLETE. PARTIAL and POOR may be added later as below.
Re-registration ie. resolve resolving trutslevel degradation is not in scope of this epic
NCMP wil not send notification on trustlevel changes for external consumers

High Level Interactions

Drawio

border	true

diagramName	Staleness Freshness Overview
simpleViewer	false
width
links	auto
tbstyle	top
lbox	true
diagramWidth	940
revision	3

...

Versions Compared

Old Version 14

New Version 15

Key

Issues and Decisions

Description

Requirements

Functional

Error Handling

Characteristics

Out-of-Scope

High Level Interactions

Page Comparison

Versions Compared

Old Version 14

New Version 15

Key

Issues and Decisions

Description

Requirements

Functional

Error Handling

Characteristics

Out-of-Scope

High Level Interactions