Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

#

Issue

Notes 

Decision

1Add or Delete leaves handle as UPDATE or ADD/DELETE?

The delta report proposed follows the Json Patch format of representing the differences between 2 JSON data. Apart from this the delta is dependent on individual data nodes of the JSON data, this is because the delta report will contain the action, xpath and the source/target data. And since the xpaths are unique to data nodes and not leaf data, every delta entity in delta report will be between 2 data nodes. Going by the general convention and referring RFC-6902:

  • AddCreate: If a data node exists only in the target location (target anchor or JSON payload) then it will be considered an ADD a 'create' action and all the leaf data under this data node and the child data nodes of the aforementioned data node will be considered as newly added data.
    • The child data nodes will be reported separately in the delta report because these data nodes can be individually identified with their unique xpaths.
  • Remove: If a data node exists only in the source location (source anchor) but is not in the target location (target anchor or JSON payload) then this will be considered as a REMOVE 'remove' operation. All the leaf data under this data node and the child data nodes of the aforementioned data node will be considered as removed.
    • The child data nodes will be reported separately in the delta report because these data nodes can be individually identified with their unique xpaths.
  • UpdateReplace: If a data node exists both, in the source and target location then the individual leaf nodes of this data node are compared to find any changes in the data. If the data is changed then it is reported as an UPDATE a 'replace' action because leaf data of an already existing data node are updated here.
    • only the leaves which are added, deleted or modified are reported in delta report. Any unmodified leaf is not reported.
    • Child data nodes are evaluated individually, if a child node is added it's an ADD a 'create' action, if a child node is removed its REMOVE 'remove' action and if a child node exists in both locations, then the leaves are evaluated for any changes which are reported as UPDATE 'replace' action.
Updated as per notes.
2How to handle multiple changes at different levels?
Since the delta report will contain the xpaths of data nodes changed, so for changes at multiple levels i.e., parent and child data nodes, the approach will be to handle each data node individually and report them in the delta report. 

3More scenarios need to be explored and documented in detail. Such as handling arrays within a json, handling child/grandchild changes.
scenarios such as data nodes at multiple levels, arrays and lists are covered.

...

Code Block
titleResponse body should contain anchors delta report (added/deleted/modified configuration) as below.
collapsetrue
[
  {
    "action": "ADDcreate",
    "xpath": "/bookstore/categories/[@code=3]",
    "target-data": {
      "code": 3,
      "name": "kidz"
    }
  },
  {
    "action": "REMOVEremove",
    "xpath": "/bookstore/categories/[@code=1]",
    "source-data": {
      "code": 1,
      "name": "Fiction"
    }
  },
  {
    "action": "UPDATEreplace",
    "xpath": "/bookstore/categories/[@code=2]",
    "source-data": {
      "name": "Funny"
    },
    "target-data": {
      "name": "Comic"
    }
  }
]

Mechanism for Delta generation between Anchors

Gliffy
size600
displayNameCPS Delta Flowchart
nameCPS Delta Flowchart
pagePin

...

3

Additional Details

Format/Conventions to be used for Delta Report

The delta report format is based on two RFCs namely RFC 6902 and RFC 9144 . A detailed comparison of the RFCs can be found here.

RFC 6902 JSON Patch

JSON Patch defines a JSON document structure for expressing a sequence of operations to apply to a JavaScript Object Notation(JSON) document; it is suitable for use with the HTTP PATCH method.

...

OperationDescriptionDelta report equivalent
addAdds the value at the target location; if the value exists in the given location, it’s replacedif the value is not present in the source json, but was found in the comparand target json, then it should be considered an a "addcreate" action.
removeRemoves the value at the target locationif a value was present at the source json, but was not found in the comparand target json, then it should be considered as "deleteremove" action.
replaceReplaces the value at the target locationif a value is present in the source json, but an updated value is present in the comparand target json, then it will be considered as "updatereplace" action.
moveRemoves the value at a specified location and adds it to the target locationN/A
copyCopies the value at a specified location to the target locationN/A
testTests that a value at the target location is equal to a specified valueN/A

...

The format of Delta report has following key take aways from the above-mentioned RFCs:

  • The "op" field from RFC 6902 is replaced with "action" field because in CPS we want to report the action that was performed on the data. And this field can have 3 predefined values: ADDcreate, REMOVE remove and UPDATEreplace.
  • The xpath is used in CPS to uniquely identify individual data nodes and is used in place of JSON path as defined in the two RFCs
  • The source-data and target-data fields are used from RFC9144 to report the data that has been added, removed or updated. This approach properly categorizes the data instead of grouping it under the one "value" field as in RFC6902.
Code Block
titleDelta Report Format for CPS
collapsetrue
[
  {
    "action": "ADDcreate",
    "xpath": "/bookstore/categories/[@code=3]",
    "target-data": {
      "code": "3,",
      "name": "kidz"
    }
  },
  {
    "action": "REMOVEremove",
    "xpath": "/bookstore/categories/[@code=1]",
    "source-data": {
      "code": "1,",
      "name": "Fiction"
    }
  },
  {
    "action": "UPDATEreplace",
    "xpath": "/bookstore/categories/[@code=2]",
    "source-data": {
      "name": "Funny"
    },
    "target-data": {
      "name": "Comic"
    }
  }
]

...

  1. Create an empty JSON Array, to store the result
    • The JSON array will contain the following: action, xpath, payload as individual JSON objects each object representing a singular operation.
  2. Fetch the data from two anchors and store in two separate Maps
  3. Iterate over the entries of first map
  4. For each entry of first map, check if the key is present in second map.
  5. If the key is not found in second map, it means that the key-value pair has been removed from the second map. Create a JSON Object with DELETE 'remove' action, the xpath to deleted node and payload. Add the Object to the JSON Array.
  6. If the key is present in the second map, compare the values associated with the keys in both maps.
  7. If the values are instance of Map, recursively call the comparision algorithm to compare all nested maps. Add all the necessary fields into a JSON object and put the object into the JSON Array
  8. If the values are not equal, it means the key-value pair was updated. Create a JSON Object with UPDATE 'replace' action, the xpath to updated nodes and payload. Add the Object to the JSON Array.
  9. Iterate over the keys of second map and find the keys not present in first map. These keys were added as new entries. Create a JSON Object with ADD 'create' action, the xpath to added node and payload. Add the Object to the JSON Array.
  10. Return the JSON Array and the updates to Kafka events

References/Future Changes