Possible improvements to SegQueue

I've been trying to understand the `SegQueue` implementation to reply to #675. These are some notes I took along the way:
- https://github.com/crossbeam-rs/crossbeam/blob/b11f1a83e6362589979a4c58c79895cc936b09fb/crossbeam-queue/src/seg_queue.rs#L67 Might be UB? Need to know whether or not padding bytes within the structs being uninitialized (even if zeroed) counts as UB. Also seems like you could achieve better performance by using actually uninit memory in the slot value instead of zeroing it out.
- `SegQueue::new` should pre-allocate a block to match the `Injector` implementation and remove some initialization code from `push` and `pop`.
- Internally document the memory layout and the magic of the head and tail indexes. Basically, the index skips over one value which is used as a fence to perform block allocation/deallocation. So if LAP=2, BLOCK_CAP=1, and tail=0, two competing threads will result in one of them writing its value into slot 0, bumping tail to 1, and then allocating a new block and bumping tail to 2. The other thread will see that its CAS failed and get a tail of 1 which means it spins/yields until tail is 2. Same but opposite for head.
- Extract out `offset + 1 == BLOCK_CAP` into a variable with the explanation from the previous bullet point.
- https://github.com/crossbeam-rs/crossbeam/blob/b11f1a83e6362589979a4c58c79895cc936b09fb/crossbeam-queue/src/seg_queue.rs#L206 This can double allocate a block under contention that gets immidately discarded. #746 offers a very nice opportunity to throw the extra block back into the pool.
- https://github.com/crossbeam-rs/crossbeam/blob/b11f1a83e6362589979a4c58c79895cc936b09fb/crossbeam-queue/src/seg_queue.rs#L229 https://github.com/crossbeam-rs/crossbeam/blob/b11f1a83e6362589979a4c58c79895cc936b09fb/crossbeam-queue/src/seg_queue.rs#L297 Pretty sure this should use wrapping_add.
- https://github.com/crossbeam-rs/crossbeam/blob/b11f1a83e6362589979a4c58c79895cc936b09fb/crossbeam-queue/src/seg_queue.rs#L300 I don't understand why this is necessary, will have to think on it more.
- There's a bunch of `LAP - 1` that would be clearer as `BLOCK_CAP`
- All shifting of tail should be able to be removed as no metadata is stored in the tail index. The one issue would be making sure head and tail wrap around at the same rate, so maybe tail could be shifted before addition and then unshifted again.

I'm planning on looking into this stuff so that Tokio can use `SegQueue`: https://github.com/tokio-rs/tokio/issues/2528. Given #746's lack of progress, my main concern is that any opened PR would just get ignored, so Tokio might end up copying `SegQueue` with its own fixes (which would be a bummer).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Possible improvements to SegQueue #794

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Possible improvements to SegQueue #794

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions