OOM Meeting Notes - 2021-03-31
Agenda:
RC0/RC1: @Sylvain Desbureaux @Krzysztof Opasiak
what's left:
SDNC / CDS / A1P version bump: https://gerrit.onap.org/r/c/oom/+/118284
@Dan Timoney pushed the patch on February 23rd
with the OOM break, it was not touched for 2 weeks
after the break, we saw last week that it wasn't working
lots of changes were made over the last few days
SDNC seems to have behaved OK since yesterday
but the gate still gives random results
a new SO component / change: https://gerrit.onap.org/r/c/oom/+/120022
renaming the VFC adapter to SOL005 adapter
version bump
should be OK to merge if gate is OK
UUI: https://gerrit.onap.org/r/c/oom/+/119844: waiting for TSC vote
DCAE: https://gerrit.onap.org/r/c/oom/+/119996 is for Honolulu (bootstrap bump in order to remove Holmes). The others are for Istanbul
OOF STDOUT: https://gerrit.onap.org/r/c/oom/+/113414
DMAAP : https://gerrit.onap.org/r/c/oom/+/114278 (make ZK and Kafka be able to scale up/down)
Maintenance release: @Sylvain Desbureaux
two patches on guilin:
certificates update for MR: https://gerrit.onap.org/r/c/oom/+/119912 / https://gerrit.onap.org/r/c/oom/+/118913
CCSDK mount bugfix: https://gerrit.onap.org/r/c/oom/+/119418
Resuming merge of new patches:
will be done as soon as we have branched honolulu (the day of RC0)
Too many certificate requests with AAF: @Andreas Geißler
On an "old" Guilin deployment
the problem started on DCAE pods because of an issue when retrieving certificates from AAF
msg from AAF:
INFO [service] user=deployer@people.osaaf.org[BAth],ip=10.233.68.159,port=42750,ms=211.37689,status=403,meth=PUT,path=/cert/local,msg="Request New Certificate/ErrResp [SVC1403] Forbidden: There are too many Certificates generated for CN=dcae for dcae@dcae.onap.org"
if you have more than 200 entries, you need to remove entries in the `cred` and `x509` tables of the `authz` keyspace:
cqlsh> use authz;
cqlsh:authz> delete from cred where id = 'dcae@dcae.onap.org';
cqlsh:authz> select * from cred where id = 'dcae@dcae.onap.org';
 id | type | expires | cred | notes | ns | other | prev | tag
----+------+---------+------+-------+----+-------+------+-----
(0 rows)
cqlsh:authz> select count(*) from x509 where ca = 'local' and id = 'dcae@dcae.onap.org';
-> if there are more than 200, you need to delete them manually
cqlsh:authz> select ca, serial from x509 where ca = 'local' and id = 'dcae@dcae.onap.org';
cqlsh:authz> delete from x509 where ca = 'local' and serial = 0x00e591f558ae34e9f0;
...
short term:
cronjob?
add it as workaround in OOM release note and main release note
medium term:
get rid of AAF
finalize prototyping of Service Mesh (Istio)
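The manual x509 cleanup above, and the cronjob idea, could be sketched as a small helper that turns the output of the `select ca, serial from x509 ...` query into the matching `delete` statements. This is only a sketch under assumptions: the function names are hypothetical, the serials are assumed to be passed as the hex literals shown in the session above, and it removes every listed entry once the 200 limit is exceeded (the notes do not say which entries to keep). Running the statements against AAF's Cassandra is left to the operator.

```python
import re

# Threshold quoted in the notes: AAF refuses new certificates above this count.
CERT_LIMIT = 200

# Serials are expected as hex literals, e.g. 0x00e591f558ae34e9f0 (assumption).
_SERIAL_RE = re.compile(r"0x[0-9a-fA-F]+")


def quote(value):
    """Quote a CQL string literal, doubling embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


def build_x509_cleanup(rows, limit=CERT_LIMIT):
    """Return CQL delete statements for the given (ca, serial) rows.

    Nothing is returned when the number of rows does not exceed `limit`.
    Hypothetical helper: it deletes every listed entry, which may be more
    aggressive than needed.
    """
    if len(rows) <= limit:
        return []
    statements = []
    for ca, serial in rows:
        if not _SERIAL_RE.fullmatch(serial):
            raise ValueError(f"unexpected serial literal: {serial!r}")
        statements.append(
            f"delete from x509 where ca = {quote(ca)} and serial = {serial};"
        )
    return statements


if __name__ == "__main__":
    # Example input only: 201 copies of the serial shown in the notes.
    example_rows = [("local", "0x00e591f558ae34e9f0")] * 201
    for statement in build_x509_cleanup(example_rows)[:1]:
        print(statement)
```

The generated statements can be reviewed and then pasted into `cqlsh` after `use authz;`, which keeps a human in the loop until a cronjob is agreed on.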
Tracing tools and ONAP: @Illia Halych Valdi
today nothing is really in place
service mesh can be used
but to have full-fledged tracing, changes are needed in the components:
with this patch:
if on service mesh, it should work "out of the box"
if not, it should work by configuring the Jaeger endpoint (see the Spring Cloud Sleuth configuration: https://spring.io/projects/spring-cloud-sleuth)
Proposal for intern: adjourned to next week
Postmortem of the last SO gates:
Patchset | Gate launch | Basic vm | Basic network | Basic CNF | pnf registrate | 5gbulkpm |
26 | NOK in SO part Found 0 process(es) waiting for WorkflowMessage with SDNCCallback_CORRELATOR | OK | OK | NOK Not finished in SO part (IN_PROGRESS != COMPLETE) after 30s | OK | |
27 | Bad | | | | | |
28 | Bad | | | | | |
29 | Bad | | | | | |
30 | NOK in SO part | NOK in SO part | NOK in SO part | NOK in AAI part | OK | |
31 | OK | NOK in SDC part | OK | NOK in SDC part | OK | |
32 | OK | OK | NOK in SO part | NOK (no results) | OK | |
33 | OK | OK | OK | OK | NOK (no results) | |
34 | OK | OK | NOK in SO part | OK | OK | |
35 | NOK in SDC part | NOK in SDC part | NOK in SDC part | OK | OK | |
36 | Bad | | | | | |
37 | OK | OK | NOK in SDC part | OK | OK | |