CinderCaracalPTGSummary

Introduction

The eighth virtual PTG for the 2024.1 (Caracal) cycle of Cinder was held from Tuesday, 24 October 2023 to Friday, 27 October 2023, for four hours each day (1300-1700 UTC). This page provides a summary of all the topics discussed throughout the PTG.

[Image: Cinder 2024.1 (Caracal) Virtual PTG, 25 October 2023]


This document aims to give a summary of each session. More information is available on the cinder 2024.1 Caracal PTG etherpad:


The sessions were recorded, so to get the full details of any discussion, you can watch or listen to the recording. Links to the recordings for each day are provided below that day's heading.

Tuesday 24 October

recordings

Bobcat Retrospective

We categorized the discussion into the following sub-sections:

  • What went well?
    • Active contribution across different companies
    • We had two Outreachy interns in the summer internship round who made great contributions to the api-ref sample tests
    • Releases happened on time
  • What went badly?
    • The CI failure rate impacted productivity
    • Sofia leaving the team really affected review bandwidth
  • What should we continue doing?
    • Sponsoring Outreachy interns
      • Our proposal was accepted with at least one applicant
  • What should we stop doing?
    • Lack of structure around the review request section in cinder meetings
      • Too many patches discourage reviewers from taking a look
      • Authors should add an explanation if the patch is complicated and ask questions if they have any doubts
      • Authors should try to add only the patches relevant to the current milestone instead of adding every patch possible
      • Authors should be active in reviews, since the core team prioritizes patches from contributors who review actively
      • #action: whoami-rajat to follow up on this in a cinder meeting

Gate Issues

Continuing the discussion from the 2023.2 Bobcat midcycle, we are still seeing gate issues consisting of OOMs and timeouts. Looking at a sample gate job failure, we decided on the following points:

  • Increase the system memory of the VM if possible; 8 GB is not enough for tempest tests
  • Increase swap space (make it the same size as RAM for a one-to-one mapping)
  • Change cinder to use a file lock for coordination in order to get rid of etcd (see the sketch after this list)
  • Reduce the number of processes
    • We see a pattern of multiple services running multiple instances:
      • neutron-server: 5
      • nova-conductor: 6
      • nova-scheduler: 2
      • swift: 3 for each of its services
  • Reduce concurrency to 2 for testing purposes to see how many VMs we end up running
  • #action: rosmaita to propose a patch for a few straightforward tasks, like increasing swap space
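
For reference, this is a rough sketch of what the file-lock coordination point above means in practice, using tooz (the coordination library cinder relies on); the backend URL, member id, and lock name below are illustrative only, not cinder's actual values:

from tooz import coordination

# A file:// backend URL makes tooz use file-based locks instead of a service
# like etcd; the path and member id here are placeholders.
coord = coordination.get_coordinator(
    'file:///var/lib/cinder/coordination', b'cinder-host-1')
coord.start()

lock = coord.get_lock(b'example-volume-delete')
if lock.acquire(blocking=True):
    try:
        pass  # critical section: only one worker holding the lock runs this
    finally:
        lock.release()

coord.stop()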

Backup/Restore performance

There is a bug reported against the S3 and Swift backup backends complaining that they are very slow. Launchpad: https://bugs.launchpad.net/cinder/+bug/1918119

The general discussion was around the following points:

  • If there are issues with backup/restore, report a bug; it helps the team stay aware of all potential improvements
  • Using a stream instead of chunks
  • We don't have a backup or restore progress status anywhere
  • We can work on something to get data about long-running operations and their current status.
  • #action: zaitcev to investigate what we have now and propose a spec for observability, in particular for restores (we already have percentage notifications, but no current percentage)


A few of the specs related to backup were also mentioned:

Wednesday 25 October

recordings

Cinder multi-tenancy support

Use Case: An operator wants to partition and isolate their storage array on a per tenant basis.

Manila has DHSS (driver handles share servers) support, which allows its backends to create virtual storage instances.

Currently, for cinder, operators need to create all the virtual storage instances manually and map each one to a backend in cinder.conf.

They can use a private volume type to tie a particular volume type to a project, and also to a backend via the "volume_backend_name" extra spec.
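
As an illustration of that approach, here is a minimal python-cinderclient sketch; the type name, backend name, and project ID are placeholders, and the keystone session is assumed to already exist:

from cinderclient import client

cinder = client.Client('3', session=keystone_session)  # assumed auth session

# Create a private volume type so only selected projects can use it.
vtype = cinder.volume_types.create('tenant-a-type', is_public=False)

# Tie the type to one backend defined in cinder.conf via its
# volume_backend_name extra spec.
vtype.set_keys({'volume_backend_name': 'netapp-svm-a'})

# Grant a single project access to the private type.
cinder.volume_type_access.add_project_access(vtype, 'PROJECT_ID')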

Conclusion:

  • The use case is NetApp-specific, since the virtual storage instance creation happens on the storage side and manila's workflow includes a step to create virtual instances (share servers)
  • The Manila architecture allows using neutron for network partitioning, which is completely different from how cinder exposes LUNs
  • The use case is not suitable for Cinder because:
    • Cinder doesn't interact with neutron at all
    • Even if we added the support, other drivers wouldn't be able to make use of it

NetApp LUN space allocation

The NetApp team wants a way to configure a space_allocation property for their volumes in the NetApp backend.

Space allocation enables ONTAP to reclaim space automatically when the host deletes a file.

The current proposal is to configure it via a volume type extra spec; volumes created with that type will have space allocation enabled.

Another case is that we need to be able to notify nova that the LUN is sparse.

If the LUN is sparse and the nova guest supports TRIM/DISCARD, the guest will send TRIM/DISCARD commands to the storage array to reclaim storage space.

By default, thin provisioning is enabled in the NetApp backend but space allocation is disabled.

Agreed Design

  • Add the space_allocation property to the volume type extra specs
  • Report the discard value the same as space_allocation when returning connection info (see the sketch below)
  • By default, report_discard_supported will be reported via the "discard" parameter (if the operator sets it in cinder.conf)
  • #action: NetApp team to add support for space_allocation and report discard.

NOTE: If we enable or change the configuration on the backend side, we need to remap the LUN (detach, then attach) for the space_allocation/discard value to be reflected.
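
To illustrate the design agreed above, here is a rough sketch (not the actual NetApp driver code) of how a driver can mirror the space allocation setting into the "discard" flag of the connection info it returns; the extra spec name and helper methods are assumptions:

# Hypothetical driver method; the helpers and the extra spec name are assumed.
def initialize_connection(self, volume, connector):
    conn_info = self._build_connection_info(volume, connector)  # assumed helper

    # Mirror the volume type's space allocation extra spec into the discard
    # flag so that nova enables TRIM/DISCARD for the guest.
    specs = volume.volume_type.extra_specs if volume.volume_type else {}
    if specs.get('netapp:space_allocation', '').lower() in ('<is> true', 'true'):
        conn_info['data']['discard'] = True

    return conn_info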

Operator Hour

No operator showed up in the operator hour.

Operators not showing up doesn't seem to be a cinder-specific problem.

Manila also had similar problems with operators not showing up.

  • #action: Ask the opendev team for stats about operator attendance in other projects' operator hours.

Block device driver

The block device driver allowed creating local block devices and making them available to nova instances.

It supported providing a device path and also exporting the device via iSCSI.

It had limited features and people were not using it, so it was removed from the cinder driver tree.

The use case for bringing it back is replicated databases, and it is not limited to etcd.

We can modify the LVM driver to do local attach.

Problems with using nova ephemeral

  • not configurable: we can use either ceph or lvm
  • deployed computes may not have local storage available
  • we don't have a dedicated block device for etcd; we would just allocate a part of the total ephemeral storage to etcd


Some of the constraints of using this configuration are:

  • the cinder-volume service needs to be deployed on the compute nodes hosting the instances
  • a local connector that returns a path is needed (we already have one)
  • no volume migration or data persistence (this is not a priority)
  • we will have some k8s controllers with cinder+LVM and some without it
  • running cinder-volume on every compute node and scaling it to a number like 500 might put load on the scheduler if each service reports every 60 seconds (the default)


Conclusion

  • LVM with local storage should be a good option
    • the block device driver might be a future option if the latency issue persists (currently we don't have a performance claim that the block device driver performs better than local LVM)
  • #action: eharney to work on this

Cross project with nova

Wednesday: 1600-1700 UTC

Improper attachment cleanup during failure in some operations

Some of the nova operations that interact with cinder attachments don't do proper cleanup in failure scenarios:

  • Live migration
  • Evacuate
  • Cold migration

Improper cleanup leads to inconsistencies in the deployment that need to be addressed manually, which has become harder since CVE-2023-2088, because attachment delete only works when requested by a service user with a service token.

Solution:

  • at compute init_host, we could identify the attachments and log each one that was not cleaned up
  • We have a cinder API that returns all the attachments for a given compute host (see the sketch after this list).
    • /attachments?all_tenants=1&attach_host=?
    • We will require an admin + service token to call it
  • #action: nova team can report a bug and decide who would like to work on it
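
For reference, a hedged sketch of that attachment listing call using plain HTTP; the endpoint, tokens, and host name are placeholders, and the attachments API requires at least microversion 3.27:

import requests

CINDER_ENDPOINT = 'http://controller:8776/v3/PROJECT_ID'  # placeholder endpoint

headers = {
    'X-Auth-Token': ADMIN_TOKEN,        # admin user's token (placeholder)
    'X-Service-Token': SERVICE_TOKEN,   # service user's token (placeholder)
    'OpenStack-API-Version': 'volume 3.27',
}

# Ask cinder for every attachment recorded against one compute host.
resp = requests.get(
    CINDER_ENDPOINT + '/attachments',
    params={'all_tenants': 1, 'attach_host': 'compute-01'},
    headers=headers,
)

for attachment in resp.json()['attachments']:
    print(attachment['id'], attachment['volume_id'], attachment['status'])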

Fixing the RBD retype regression introduced in Wallaby

Cinder has two migration paths (sketched below):

  • generic: cinder does the migration
  • optimized: backend driver does the migration
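
For context, a rough sketch of the driver-side convention that separates the two paths: the driver's migrate_volume() returns a tuple, and returning (False, None) tells cinder to fall back to the generic host-copy path (the helper names here are made up):

# Hypothetical driver method illustrating the optimized-path convention.
def migrate_volume(self, context, volume, host):
    """Return (True, model_update) if the backend handled the migration."""
    if not self._backend_can_reach(host):        # assumed capability check
        return (False, None)                     # fall back to the generic path

    model_update = self._migrate_on_backend(volume, host)  # assumed helper
    return (True, model_update)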

A change in Wallaby caused the operation to use the optimized path for drivers where it hadn't been used before.

This exposed a new bug in the optimized migration path.

We have two ways to fix the issue:

This will also require a certain level of testing:

Image Metadata in Cinder, Nova, and Glance

  • In glance, the image metadata is unrestricted, but cinder and nova restrict it to 255 characters in their schema validation. This is because glance doesn't have any schema validation for image properties.
  • Currently it's a text field (64k) in the nova, cinder, glance, etc. databases, causing a lot of I/O while reading it, which isn't ideal.
  • #action: (rosmaita) bring to the mailing list the question of what the correct size for this field should be (based on how it is used)

Gate Issues

The gate job shows 6 conductor instances running. Based on the configuration, the workers are set to two:

[conductor]
workers = 2

The conductor processes should not cause gate failures but might consume memory, causing other processes to starve.

Proposed Solutions:

  • Increase swap space. Recommendation: 8 GB
  • Keystone and neutron DB queries can be optimized with caching
  • The 14 qemu processes are possibly a result of higher tempest concurrency, but with the current concurrency (6) it shouldn't reach 14 instances, so we need to check whether there are any cleanup issues
  • Request higher-RAM systems (16 GB) for gate jobs


Another gate issue that requires attention is LVM calls getting stuck in nova gate jobs; grenade is one example.

  • Proposal to move LVM calls to privsep
  • Switch gate jobs to use ceph
    • we won't be testing the iSCSI code path (os-brick connector)
  • Check whether devstack configures LVM device filters
    • Melanie checked it and fixed an issue around it
  • #action: cinder team to look into gate failures faced by nova regarding LVM


Thursday 26 October

recordings

New Quota system

Force volume unmanage

  • The assumption is that we can call unmanage or delete to remove DB entries for resources that were deleted on the backend side.
    • Currently these calls go to the driver, which is not expected.
  • There was a concern that if we add a force flag to unmanage, operators might always include it with the unmanage command without first trying the normal way
  • We should do the forced unmanage in the API itself and not call the volume service at all (and also not the driver)
    • the volume delete/unmanage path in c-vol requires the driver to be initialized
    • new flag = only do this in the API and don't touch the volume service
  • We don't allow unmanaging encrypted volumes because we have no way to (re-)manage them, but we will allow unmanaging encrypted volumes with the new flag
  • #action: Eric to write a spec on this

Encryption retype

Currently, retyping a volume from an encrypted to a non-encrypted volume type doesn't work as expected.

  • Encrypted volumes lose some space to the LUKS header
  • The current solution is to make the destination 1 GB bigger to account for the header
  • First target for implementation: add size-increase options to resize/migrate to allow unencrypted-to-encrypted migration
    • We will introduce a new flag so the user knows they are opting into this, i.e. having an extra GB for the encrypted volume
    • Gorka thinks it's not "cloudy" because it exposes the implementation, whereas a cloud hides it
    • We can work on this as the initial implementation and later improve it with a "user-visible" vs "actual" size implementation
  • We also have a tempest test for this
  • #action: Repropose the spec; it should focus only on migrate
  • #action: Bring this up in the next cinder meeting to see if there are any takers; a new contributor should be able to implement this

OSC + SDK work

We have feature gaps between cinderclient and the SDK.

Cross Project with Manila

Etherpads

Improve the extend volume operation in the generic driver

Blueprint: https://blueprints.launchpad.net/manila/+spec/enhance-extend-volume-workflow

Currently, the generic driver in manila creates a cinder volume and uses it as a share. When we want to extend it, the driver detaches the volume, extends it, and attaches it again.

  • Cinder microversion 3.42 has support for extending in-use volumes.
  • The manila code has not pinned a specific API microversion, so it falls back to the 3.0 microversion (see the sketch after this list).
  • #action: Rajat can work on it if he has some cycles, but it would be great if someone else volunteers
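
A small sketch of the fix discussed above, assuming a python-cinderclient session object: requesting at least microversion 3.42 lets cinder accept the extend call for an in-use volume instead of rejecting it at the 3.0 default.

from cinderclient import client

# keystone_session, volume_id, and new_size_gb are assumed to exist.
cinder = client.Client('3.42', session=keystone_session)

# With microversion >= 3.42, cinder allows extending a volume that is
# currently attached (in-use).
cinder.volumes.extend(volume_id, new_size_gb)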

Multitenancy support in Cinder

This is a continuation of a previous PTG topic, intended to answer some remaining questions.

  • Does manila create a virtual storage instance in its workflow?
    • Yes, essentially; with DHSS=True, Manila will manage the storage virtual machines
    • DHSS=False is more similar to the way cinder works with its backend configuration and setup
  • How does the interaction with neutron work?
    • For Manila, with DHSS=True, operators can define the networking where Manila will allocate ports/IPs when creating the share servers (storage VMs)

SRBAC work in Manila

Phase 1 is complete. Phase 2 needs to be started.

OSC SDK parity work with manila client

There is complete parity with OSC. Support is still missing in the SDK, but there are no plans to add it anytime soon, since Manila already has an SDK in manilaclient, which is widely used.

Friday 27 October

recordings

Stalled reviews

SQLAlchemy 2.0

  • Need to get the changes in to declare Cinder's support for SQLAlchemy 2.0
    • This is needed since upper constraints currently pin SQLAlchemy to < 2.0 and we need to unblock that
  • There are 10 patches in review
  • #action: Review the first 5 patches as a priority

Nova Cinder cross project (Part 2)

NFS Encryption

The idea is to support encryption for the NFS driver.

  • Nova spec: https://review.opendev.org/c/openstack/nova-specs/+/883516
    • Sofia started the effort but is no longer working on Cinder
  • Nova's concern was regarding the upgrade scenario, hence the need for the spec
  • There are 2 cases we need to handle
    • BFV: we can have a trait that the scheduler can use to only schedule the instance on a compute that supports the feature
    • Attach volume to an existing instance
      • fail the attachment as we do now
  • Need a way to get the transport protocol from cinder before attaching the volume
  • Maybe we can work with the current behavior and only mark the feature complete when all computes are upgraded

Feedback regarding approach for online extend in NFS-based volume drivers

This feature adds support for extending attached NFS volumes.

Unable to take a snapshot of an NFS volume attached to a stopped instance