2019-03-07 AAI Developers Meeting
Attachments (including meeting recording)
Chat Server
private group aai-dev on Rocketchat server: http://onap-integration.eastus.cloudapp.azure.com:3000/group/aai-dev
Agenda Items
START RECORDING
Title | Responsible | Status | Last discussed | Notes | |
|---|---|---|---|---|---|
| 1 | CSIT / CSIT on OOM | @James Forsyth | In Progress | 21st Feb 2019 | New requirement to have our csit run as part of oom test environment in windriver Long term goal: one robot test case per endpoint that can run as part of e2e test AAI R4 Integration Sanity Test Plans See also Contributing To AAI Best Practise 4th March: https://lf-onap.atlassian.net/browse/AAI-2208 / https://gerrit.onap.org/r/#/c/79583/ |
| 2 | AAI too slow for Holmes | @Keong Lim @CT Paterson | In Progress | 1st Nov 2018 | @Guangrong Fu mentioned AAI in Baseline Measurements based on Testing Results:
The problem for caching is how to know when to update the cached data. Even though the access time may be fast for Holmes, the risk is using out-of-date data, so the correlations will be wrong anyway. Also, duplicating the AAI data outside of AAI is probably a bad architectural decision. Making AAI faster for these use cases would be better. Has there been a performance analysis of where the time is spent? Could it help to use ElasticSearch (e.g. as in sparky)? Should Holmes have a batch interface to get more AAI data in fewer calls? Or a better correlation API that results in fewer calls? 31st Oct: https://lists.onap.org/g/onap-discuss/topic/27805753 1st Nov:
Would the AAI Cacher https://lf-onap.atlassian.net/browse/AAI-1337 help to improve performance? 5th Mar: Guangrong Fu Hi, Sorry for my late response. It took me a long time to set up AAI in my own env. For Item 10, here's some information: Main APIs invoked in Holmes for different use cases: VoLTE
CCVPN
Performance We set up an AAI env on a VM (8 cores, 16GB memory, 160GB storage) following the guidance https://lf-onap.atlassian.net/wiki/display/DW/How+to+Docker+setup+on+Single+VM+HEAT+Deployment and tried to run a VNF query using "/aai/v11/cloud-infrastructure/cloud-regions/cloud-region/example-cloud-owner-val-45051/example-cloud-region-id-val-56689/tenants/tenant/example-tenant-id-val-51834/vservers/vserver/example-vserver-id-val-51834" (which is returned by "/search/nodes-query?search-node-type=vserver&filter=vserver-name:EQUALS:") for 1000 times. It took ~95ms per query. Also, we tried to query a VNF for 1000 times via "/aai/v11/network/generic-vnfs/generic-vnf/example-vnf-id-val-92494" and the average time is ~86ms. From the result, we know that even for a single request, the time cost reaches around 100ms. Let alone there will be several requests sent to AAI when an alarm is processed by Holmes. Taking CCVPN for example, for each alarm, there are up to 7 requests made. That means it'll take around 600-700 ms for Holmes to interact with AAI. In case of alarm storms, it is hard for AAI to support such intensive queries. 6th March: Guangrong Fu In my opinion, the performance of AAI queries is not only impacted by the computation inside AAI, but also impacted by the HTTP request itself. I've done another test. I tried to send requests to the health check API (which does nothing but return immediately after it receives a request ) of Holmes. The average time cost is also ~ 70ms. So it seems to be a problem with the time cost caused by setting up and releasing HTTP connections. 6th March: Keong Regarding these queries:
What depth is used on these GET calls? If the defaulting to depth=0, then perhaps some improvement can be made by using "depth=1" or "depth=2"? Fewer calls returning more data could improve overall performance. Same could be achieved by changing to Nodes query, e.g. GET /aai/v14/nodes/p-interfaces?interface-name=nodeId-{pnfName}-ltpId-{ifName} Question1: Can the Bulk API be used with GET calls? Documentation does not show any examples of GET actions. https://onap.readthedocs.io/en/casablanca/submodules/aai/aai-common.git/docs/AAI%20REST%20API%20Documentation/bulkApi.html Question2: Would it help to have the Holmes pod co-located with the AAI haproxy and AAI resources pods? Reduced network latency could improve overall performance. Guangrong: Holmes is acutally deployed by DCAE. I'm not sure whether your proposal is feasible. What's more, the performance data I got was based on the fact that Holmes and AAI were deployed on the same VM, sharing the same docker env. |
| 3 | Change PNF to use pnf-id as unique key | @Benjamin Cheung @Chesla Wechsler @David Perez Caparros @Former user (Deleted) @Matthieu Geerebaert | In Progress | 21st Feb 2019 | Potentially breaking change: https://lf-onap.atlassian.net/browse/AAI-2096 / blocked by https://lf-onap.atlassian.net/browse/ONAPARC-409 See also:
Questions:
@William LaMont will check for existing migration utility that handles this use case (changing the key from one existing attribute to another) @James Forsyth will socialize the breaking change on the PNF in the next PTL call so clients can prepare to do a search for ?pnf-name=${pnf-name} instead of /pnfs/pnf/${pnf-name}. They also need to handle doing the PUT operation differently - Added to PTL agenda PTL 2019-02-19 |
| 4 | Shared Cassandra Database | @Mahendra Raghuwanshi @Roger Maitland | In Progress | 14th Feb 2019 | Regarding https://lf-onap.atlassian.net/browse/OOM-1652 / https://lf-onap.atlassian.net/browse/OOM-1676 / https://lf-onap.atlassian.net/browse/OOM-1186 / https://lf-onap.atlassian.net/browse/OOM-1187 / https://lf-onap.atlassian.net/browse/OOM-1194 / https://lf-onap.atlassian.net/browse/OOM-1199:
A number of gerrit review issues raised: https://gerrit.onap.org/r/#/c/79425/ The "rolling upgrade" change has been combined with the "shared cassandra" change. The "shared cassandra" change has been combined with the "AAF shared cassandra" change, which means it's also combined with the "AAI shared cassandra" change. This sounds like a recipe for disaster. |
| 5 | Maven plugin for consistent code style | @Keong Lim | In Progress | 28th Feb 2019 | Raised JIRA https://lf-onap.atlassian.net/browse/AAI-2198 with commits submitted for aai-common, cacher, gizmo, and more to come. Also added to Contributing To AAI Best Practise Please test and review. |
| 6 | Return codes and messages for WS | @Former user (Deleted) | In Progress | 28th Feb 2019 | Is there a guide for the description of the error message and the error codes? How are new error states (message + code) added? @William LaMont will send @James Forsyth the output of a script that formats the error.properties file to make a wiki page and readthedocs |
| 7 | AAI ESR Questions | @THIRILOSHINI KRISHNAKUMAR | Done | 28th Feb 2019 | AAI Developer meeting AAI ESR Questions 02/28/2019 |
| 8 | OOM Artifacts | @Former user (Deleted) | Open | Some of our top level OOM deployment artifacts are not unique (i.e. don’t take namespace into account as all other deployables), is that intentional? | |
| 9 | Purpose of fields in AAI | @Keong Lim | Open | @Dénes Németh wrote in https://lf-onap.atlassian.net/browse/AAI-1104:
Questions: 1. Is AAI intended to strictly prescribe how the fields are used and what contents are in the values? Even if (1) is true, AAI is not really in any position to enforce how clients use the data, so really (2) is always true and we need to consult the original producers of the data and the ultimate consumers of the data to document their intended meanings. How do we push to have documentation on the purpose and meaning of the fields in AAI? Where does all this documentation go? Should the documentation be backed up by validation code? See also discussion about AAI in 2018-11-28 ExtAPI Meeting notes 29th Nov: Started on new wiki page AAI Schema Producer-Consumer Pairings | |
| 10 | AAI test data bootstrap | @Keong Lim | Open | Looking at AAI usage in OOF - HPA guide for integration testing by @Dileep Ranganathan, wondering whether there is a better way to bootstrap AAI test data?
Similarly, @Scott Seabolt and @J / @Joss Armstrong wrote for APPC Sample A&AI Data Setup for vLB/vDNS for APPC Consumption and Script to load vLB into AAI:
Similarly: Related https://lf-onap.atlassian.net/browse/AAI-1948 on the brittleness of the ReadTheDocs links to data files. One for VIM: How-To: Register a VIM/Cloud Instance to ONAP and https://lf-onap.atlassian.net/browse/AAI-1928 Potential issues:
| |
| 11 | AAI too slow for OOF/HAS | @Keong Lim | Open | Under OOF Homing and Allocation Service (HAS) section, @Dileep Ranganathan wrote about Project Specific enhancements:
See also https://lf-onap.atlassian.net/browse/OPTFRA-268 / https://lf-onap.atlassian.net/browse/OPTFRA-291 Similarly to the "AAI too slow for Holmes" item below, this introduction of extra caching of AAI data is a worrisome development and sad indictment of the performance of the system architecture. What can we do about this? Would the AAI Cacher https://lf-onap.atlassian.net/browse/AAI-1337 help to improve performance? | |
| 12 | MultiCloud usage of AAI for HPA telemetry/time-series data to OOF | @Keong Lim | Open | @Bin Yang and @Lianhao Lu (Deactivated) wrote in https://lf-onap.atlassian.net/browse/MULTICLOUD-274:
and
and
Some concerns:
| |
| 13 | Orchestration Scenarios for VNFs | @Keong Lim | Open | Comments on Orchestration Scenarios related to AAI: @Viswanath Kumar Skand Priya / @kspviswa said:
and @Fernando Oliveira replied:
As I understand, ESR has "esr-vnfm-list", which has an "esr-vnfm", which has "esr-system-info-list", which has "esr-system-info", which has a "relationship-list" that can contain relationships to "generic-vnf" and other AAI objects. The "generic-vnf" object also contains "self-link", "ipv4OamAddress", "ipv4OamGatewayAddress", etc, which links the AAI object back to its "source-of-truth" external-system. Is there some new data, new schema or new API that is required on top of this? @Fernando Oliveira; Apologies for my lack of knowledge, but a few comments:
@Bo Lv can comment more on the current ESR capabilities, but I believe there are only 3 kinds of systems so far: EMS, VNFM and third-party SDNC. ESR could be extended to handle VNFO as another kind of system. @Fernando Oliveira : I created JIRA stories: for various parts of the scenario. @t is this item related to your question for Support ETSI NFV-SOL 005 (Os-Ma-Nfvo ref point ) between SO & VF-C/NFVO? @Keong Lim it is related to the question | |
| 14 | range query | @Keong Lim | In Progress | 7th Feb 2019 |
Invite @Arul Nambi and @CT Paterson to next week's dev call to talk about how sparky/elastic does partial/range matches |
| 15 | Image footprint reduction as part of CIA project | @Paul-Ionut Vaduva | In Progress | 7th Feb 2019 | Reduction in size is mostly onf aai-common image as that based on ubuntu. 2/7 - Move the base image to be a part of ONAP Build, maybe aai-common repo @Venkata Harish Kajur will create a Jira for it in Dublin Release Move the aai-common Dockerfile RUN into the resources, traversal, graphadmin, cacher, schema service microservice |
| 16 | Schema-service roadmap | @James Forsyth @Former user (Deleted) | On Hold | 31th Jan 2019 | The schema-service is ready. Currently it provides file-sharing capabilities in terms of schema/edgerule files. In order for GraphGraph to take advantage of the schema parsing/processing in schema-service additional abstractions have to be implemented on top of the crude file2string functionality currently in schema-service. Open question:
@Venkata Harish Kajurwill ask @Manisha Aggarwalif the current functionality of the schema-service is the final version for Dublin and if there will be further enhancements in next releases. @Venkata Harish Kajur and @Manisha Aggarwal What is missing in schema service that is needed in graphgraph is the following:
|
| 17 | New AAF Certificates at startup | In Progress | 24th Jan 2019 | AAF will generate certificates to the be used by the containers at startup; AAI services should use the run-time generated certs instead of the ones that are in the repos or oom charts. In dublin the services will mount a volume with certificates. This is on the roadmap for Dublin as a feature.
@James Forsythwill ask Jonathan Gatham when the certificate init image is going to be available in ONAP and wether it is documented Jan 31, 2019 | |
| 18 | AAI Backup and Restore | @Keong Lim | On Hold | 10th Jan 2019 | @FREEMAN, BRIAN D asked on Re: Backup and Restore Solution: ONAP-OOM :
What is the AAI strategy for backup and restore? What is the overall ONAP strategy for backup and restore? Should it be unified with the data migration strategy as per "Hbase to Cassandra migration" on 2018-11-14 AAI Meeting Notes? @James Forsythwill raise the topic of having backups and restore functionality in ONAP - if it is feasible, on the roadmap and what others PTL think Jimmy didn't directly raise the topic but there was movement - @Keong Lim asked "if istio service mesh is a no-go, is there a replacement for secure onap communications? is backup/restore/upgradability included in s3p?" @Michael O'Brien replied that a reference tool set for backup and restore was introduced in Casablanca: Backup and Restore Solution: ONAP-OOM @Mike Elliott said he would look at Brian's question, AAI will provide support as needed. |
| 19 | aai-cassandra performance issues | @Keong Lim | On Hold | 10th Jan 2019 | @Michael O'Brien has documented performance issues in aai-cassandra:
Is there something that should be tweaked in AAI config? Or documentation on the recommended setup to run the VM? I'll come to the next AAI meet (conflicts with pomba meet) - |