Jump to: navigation, search

Difference between revisions of "Meetings/Neutron-DVR"

(Open Discussion)
Line 10: Line 10:
 
= Agenda =
 
= Agenda =
  
== Meeting August 3rd, 2016 ==
+
== Meeting August 10th, 2016 ==
  
 
=== Announcements (haleyb) ===
 
=== Announcements (haleyb) ===
Line 22: Line 22:
  
 
==== New Bugs this week ====
 
==== New Bugs this week ====
 +
* https://bugs.launchpad.net/neutron/+bug/1609540 (MUST FIX) - CSNAT port fails due to no fixed ips. (partial workaround proposed).
 +
** https://review.openstack.org/350783
 +
* https://bugs.launchpad.net/neutron/+bug/1597461 (MUST FIX) - Two masters after reboot of controller when HA enabled. Also seen with DVR
 
* https://bugs.launchpad.net/neutron/+bug/1602320 (SHOULD FIX) - Keepalive process kill vrrp child process with l3/dvr/ha
 
* https://bugs.launchpad.net/neutron/+bug/1602320 (SHOULD FIX) - Keepalive process kill vrrp child process with l3/dvr/ha
 
* https://bugs.launchpad.net/neutron/+bug/1606741 (SHOULD FIX) - Metadata error with dvr_snat on compute hosts.
 
* https://bugs.launchpad.net/neutron/+bug/1606741 (SHOULD FIX) - Metadata error with dvr_snat on compute hosts.
 +
** https://review.openstack.org/352686 ( Needs review)
 +
* https://bugs.launchpad.net/neutron/+bug/1595043 (SHOULD FIX) - Generic DVR portbinding useful for HA ports.
 +
** https://review.openstack.org/324302 ( Need review)
 
* https://bugs.launchpad.net/neutron/+bug/1602794 (MUST FIX) - Itemallocator class can throw a ValueError
 
* https://bugs.launchpad.net/neutron/+bug/1602794 (MUST FIX) - Itemallocator class can throw a ValueError
 
** https://review.openstack.org/341794
 
** https://review.openstack.org/341794
 
* https://bugs.launchpad.net/neutron/+bug/1602614 (MUST FIX) - DVR+L3 HA Loss during failover is higher (Need to triage)
 
* https://bugs.launchpad.net/neutron/+bug/1602614 (MUST FIX) - DVR+L3 HA Loss during failover is higher (Need to triage)
* https://bugs.launchpad.net/neutron/+bug/1597461 (MUST FIX) - Two masters after reboot of controller when HA enabled. Also seen with DVR
 
 
* https://bugs.launchpad.net/neutron/+bug/1593354 (MUST FIX) - SNAT HA failed due to missing NAT rule and sg- interface. (May be a duplicate)
 
* https://bugs.launchpad.net/neutron/+bug/1593354 (MUST FIX) - SNAT HA failed due to missing NAT rule and sg- interface. (May be a duplicate)
 
* https://bugs.launchpad.net/neutron/+bug/1596473 (MUST FIX) - Packet loss with DVR and IPv6 (Need to triage)
 
* https://bugs.launchpad.net/neutron/+bug/1596473 (MUST FIX) - Packet loss with DVR and IPv6 (Need to triage)
Line 125: Line 130:
 
* Current failure rate: https://goo.gl/L1WODG
 
* Current failure rate: https://goo.gl/L1WODG
 
** Upstream multinode job failures seen, root cause identified.
 
** Upstream multinode job failures seen, root cause identified.
** Check job failures more than 50% under investigation.
+
** Check job failures were mostly due to the ipv6 dual net test case failure, cause identified and workaround proposed.
 
** Functional test failures seen in the Check queue, root cause yet to be identified.
 
** Functional test failures seen in the Check queue, root cause yet to be identified.
 
** Single node check job failure rates are more when compared to neutron full job.
 
** Single node check job failure rates are more when compared to neutron full job.

Revision as of 14:32, 10 August 2016

The OpenStack Networking L3 DVR Sub-team holds public meetings as advertised on OpenStack IRC Meetings Calendar. If you are unable to attend, please check the most recent logs.

Meetings

Agenda

Meeting August 10th, 2016

Announcements (haleyb)

  • When adding items below I'd like to try to get feedback on whether they are a MUST FIX, SHOULD FIX, or GOOD TO HAVE

Topics for Discussion

Bugs (Swami)

All DVR bugs should be tagged and listed here: https://bugs.launchpad.net/neutron/+bugs?field.tag=l3-dvr-backlog

New Bugs this week

Categorized Bugs

Gate Test Failures

More Functional test failures seen in Gate.

RFE

Existing Functionality Broken Bugs

Scale and Performance Impact Bugs

New Features Bugs

Refactor or Cleanup Bugs

WishList Bugs

WatchList Bugs

Bugs Closed Recently

Performance/Scalability

Gate failures (haleyb)

  • Current failure rate: https://goo.gl/L1WODG
    • Upstream multinode job failures seen, root cause identified.
    • Check job failures were mostly due to the ipv6 dual net test case failure, cause identified and workaround proposed.
    • Functional test failures seen in the Check queue, root cause yet to be identified.
    • Single node check job failure rates are more when compared to neutron full job.
    • Multi-node check job failure rate have gone up from last week
  • Old bug to help in debugging the gate

Stable backports (haleyb)

Ihar created a page tracking all the potential backports from Mitaka to the stable releases. I have been going through it with the help of Swami to get stable/liberty, well, more stable. Bugs are removed from list as they merge.

We need to continue to be aggressive at proactively backporting fixes to the stable branches

This is a list of bugs with a fix that has been committed to the master branch, and are tagged with 'neutron-proactive-backport-potential+l3-dvr-backlog' (the wiki thinks the link is spam so I had to put a space in it)

Open Discussion

Wanted to talk about some related patches, so we come up with a good answer going forward These two both do similar things around ip_lib.get_devices() code:

And this is a change that also checks existence so not throw exceptions:

Need to escalate the nova patch for the live migration. Since we have the tempest test running right now, we need someone from the nova team to take a look at the nova live migration patch.

Meeting commands

/join #openstack-meeting-alt
#startmeeting neutron_dvr
#topic Announcements
#undo topic
#link https://wiki.openstack.org/wiki/Meetings/Neutron-DVR
#action haleyb will get something specific done this week
#chair Swami
...
#endmeeting