Implement EPx-only RP2040 USB Host #2814

rppicomidi · 2024-09-24T02:14:40Z

This commit fixes issue #2776
#2776.
The HCD now routes all USB transfers through the EPx endpoint. It no longer uses the "interrupt" endpoint hardware to handle INTERRUPT and BULK endpoints. The fix avoids the data sequence error handling bug in the RP2040 USB IP's "interrupt" endpoint hardware
and allows the host to correctly drop the IN packet with the error without locking up. That fixes
rppicomidi/usb_midi_host#14

This fix requires the CPU to handle the SOF interrupt (every ms). That might be an issue for some systems.

A benefit of this fix is that BULK transfers are
more than 2x faster. There is an opportunity to speed them up further by forcing BULK transfer to begin
immediately instead of waiting for the next SOF interrupt.

This code has been tested with MSC flash drives, HID devices, and USB Hubs. It works with full speed and low speed HID devices connected at the same time through a hub. With the usb_midi_host application host driver, 4 MIDI devices plugged to a hub can send messages to each other (see the [midi2usbhub](https://github.com/rppicomidi/midi2usbhub project).

Describe the PR
See description, above

Additional context
rppicomidi/usb_midi_host#14
raspberrypi/pico-feedback#394

This commit fixes issue hathach#2776 hathach#2776. The HCD now routes all USB transfers through the EPx endpoint. It no longer uses the "interrupt" endpoint hardware to handle INTERRUPT and BULK endpoints. The fix avoids the data sequence error handling bug in the RP2040 USB IP's "interrupt" endpoint hardware and allows the host to correctly drop the IN packet with the error without locking up. That fixes rppicomidi/usb_midi_host#14 This fix requires the CPU to handle the SOF interrupt (every ms). That might be an issue for some systems. A benefit of this fix is that BULK transfers are more than 2x faster. There is an opportunity to speed them up further by forcing BULK transfer to begin immediately instead of waiting for the next SOF interrupt. This code has been tested with MSC flash drives, HID devices, and USB Hubs. It works with full speed and low speed HID devices connected at the same time through a hub. With the usb_midi_host application host driver, 4 MIDI devices plugged to a hub can send messages to each other (see the midi2usbhub project).

rppicomidi · 2024-09-30T03:11:14Z

@hathach The hardware test seems to fail because the flash download failed. Is that a problem with the build system? Flash download worked fine on my system.

hathach · 2024-10-02T11:17:35Z

that is my bad, I was migrating HIL machine from pi5 to an VM. Merged with latest should fix this. Please give me a bit of time, I will review and test this as soon as I could

shreeve · 2024-11-10T09:52:14Z

Anything helpful in this young repo? => https://github.com/shreeve/picousb

rppicomidi · 2024-11-12T00:09:59Z

@shreeve Helpful to this code, or to picousb? My testing shows that this code works better than the TinyUSB host controller driver code for the RP2040 that is currently in TinyUSB. It works around a RP2040 hardware issue and it makes bulk transfers faster than what you get with the current HCD. Do you see an area for improvement in this pull request?

shreeve · 2024-11-13T06:58:44Z

@rppicomidi - It's hard for me to grok the code since it is very compact and there is a lot going on. I've started the other project to try to create a simpler and dynamic version of USB Host. Ha Thach has done an AMAZING job with TinyUSB though and has had a huge impact. My small little library is primarily nowhere near close to a fraction of what he has done, but it is a small effort to understand how the rp2040 USB Host mode works and hopefully it will inspire someone else. Thank you for your work on this patch!

ctag-fh-kiel

The PR works on my test system.

ctag-fh-kiel · 2025-01-23T11:00:26Z

that is my bad, I was migrating HIL machine from pi5 to an VM. Merged with latest should fix this. Please give me a bit of time, I will review and test this as soon as I could

is still PR still on the test list? would be great, thx :)

Force transfers requested from the application to start as soon as possible. No need to wait for SOF. Use as much of the frame as possible for pending transfers. If there is time left in the frame, can retry NAK'd BULK transfers.

rppicomidi · 2025-02-04T23:23:39Z

@hathach I pushed some changes to remove dead code and make this HCD even faster. Even if you don't care about the original data sequence error bug, the speed improvement should make this worth the time to merge. I do understand that the pull request is complicated, that it will take time to review, and that you are very busy, so I will remain patient if you do not have time. Thank you for this project.

With some MIDI devices, forcing the HCD scheduler to run using an unused interrupt source causes data corruption.

wbcbz7 · 2025-03-18T14:00:17Z

Hello everyone, and thank you all for the effort of speeding up USB host on the Raspberry Pi Pico! Unfortunately, after testing the proposed re-implementation of the RP2xxx HCD, I've found a couple of bugs and potentional issues that would be nice to have fixed:

This code assumes that there can be only one USB hub attached (see https://github.com/rppicomidi/tinyusb/blob/f981f5c9fdd8ea227650f145af43829d836c1534/src/portable/raspberrypi/rp2040/hcd_rp2040.c#L113), while in fact the number of hubs is defined by CFG_TUH_HUB in tusb_config.h
The epx endpoint structure is simultaenously used for storing control transfer data and an "actual" EPx endpoint storage for performing all transfers. During my tests of simultaneous USB Mass Storage and HID devices, it turns out control transfers can get sometimes corrupted by current EPx transfer control structures, and vice versa, so it makes sense to store current EPx info in the separate hw_endpoint structure, and store endpoint_control/buffer_control for ep_pool somewhere in the RAM instead of DPRAM.
The RP2040 seems to always auto-retry bulk/interrupt transfers if NAK is received, without any ability of raising the IRQ or stopping the transfer - for example, Mass Storage devices can "throttle" the transaction this way until data is ready for transmission. This not only has a negative impact on the available USB bandwidth (as the HCD cannot schedule next transaction until at least SOF interrupt is raised, so it can observe the IRQ status there), but can also effectively halt the USB bus if (badly designed) device is stuck in an NAK loop. Moreover, one cannot just drop the packet solely on SIE_STATUS.NAK_REC flag, as it only means that the current transaction was NAKed at least once. At the same time, if it was eventually ACKed, the USB controller also raises the INTR.BUFF_STATUS flag (with additional SIE_STATUS.TRANS_COMPLETE if it was the last packet of a transaction). As for your patch, the ISR unconditionally drops data for non-control transfers if SIE_STATUS.NAK_REC if set, and this results in current transaction being "frozen". RP2350 adds a feature to stop current transaction if NAK is received, optionally raising the IRQ, so there should be no such issues, but for RP2040 some sort of a workaround should be provided.
UPD: so far the correct NAK handling (which do not lead to Data Sequence errors) is to ignore NAK status for bulk transfers if BUFF_STATUS was asserted, and drop data on NAK for interrupt.

I really hope these issues will be addressed and resolved, as this patch already shows a vast improvement on bulk transactions on the RP2040 USB Host - my tests already show around 1.0-1.1 MBytes/second for reading 4K/8K sectors from the USB stick, so it would be awesome to have this patch fixed and eventually merged to the mainstream TinyUSB. Thanks in advance! :)

rppicomidi · 2025-03-21T14:42:19Z

@wbcbz7 Thank you for taking the time to review this in detail.

This code assumes that there can be only one USB hub attached

Yes, that is because multiple hub support is a relatively new feature. I will address that.

During my tests of simultaneous USB Mass Storage and HID devices, it turns out control transfers can get sometimes corrupted by current EPx transfer control structures, and vice versa

Can you post evidence of that happening? For example, a USB sniffer trace, a log, or a file diff? It will help me root cause the issues you are seeing.

The RP2040 seems to always auto-retry bulk/interrupt transfers if NAK is received, without any ability of raising the IRQ or stopping the transfer

Auto-retry is a "feature" of the RP2040 chip for Control endpoint transfers that I can do nothing about. This HCD has to assume Control endpoint transfers are rare. I agree that the RP2350 has a workaround feature for this, but this is a RP2040 HCD. If that gets merged, I (or you) can implement a RP2350 HCD.

The RP2040 will also auto-retry Bulk endpoint transfers; however, the workaround for that is supposed to be to configure Bulk endpoint transfers as Interrupt endpoint transfers; Interrrupt endpoint transfers to EPx are not supposed to auto-retry after NAK. I thought I already did that, but if not, I will fix it.

wbcbz7 · 2025-03-21T16:14:17Z

Can you post evidence of that happening? For example, a USB sniffer trace, a log, or a file diff? It will help me root cause the issues you are seeing.
I'll try to hook an logic analyzer and dump a trace at weekends, so far you can try to reproduce a bug this way:

Clone a host_cdc_msc_hid example from Pico-SDK, then add a simple routine to msc_app.c which will repeatedly read or write data from the mass storage device (i.e. from this gist), then compile it.
Plug a USB hub to the Pico USB port, then plug any USB Mass Storage device (e.g. an USB stick) to the hub, then turn the Pico on and watch for messages in UART - if all is ok you should see a R (or W if write is enabled) characters repeatedly sent for each transfer complete.
Plug any other USB device (i.e. a HID mouse) to the hub, then make sure it will send data on HID report requests - for instance you can wilggle the mouse to make it report the movement data.
Try plugging other devices in during a transfer, or disconnecting them, or rebooting the Pico with multiple devices connected to the hub.

This code seems to work fine with the mainline TunyUSB, but with your branch, all transfers, including MSC reads/writes, ocassionally stop until reboot after new device being connected, or an assert in sync_ep_buffer is triggered (example log):

TinyUSB Host CDC MSC HID Example
[0:] USBH Device Attach
[0:] USBH Device Attach
A MassStorage device is mounted
A device with address 1 is mounted
         USB DISK 2.0     rev PMAP
Disk Size: 7389 MB
Block Count = 15132672, Block Size: 512
RRRRRRR[0:] USBH Device Attach
Rassertion "!(buf_ctrl & USB_BUF_CTRL_FULL)" failed: file "D:/SRC/pico2/!test_crap/tinyusb_rppicomidi/src/portable/raspberrypi/rp2040/rp2040_usb.c", line 231, function: sync_ep_buffer
TinyUSB Host CDC MSC HID Example
[0:] USBH Device Attach
[0:] USBH Device Attach
A MassStorage device is mounted
A device with address 1 is mounted
         USB DISK 2.0     rev PMAP
Disk Size: 7389 MB
Block Count = 15132672, Block Size: 512
RRRRRRR[0:] USBH Device Attach
R

(please ignore the pico2 in source file path - this code was compiled for the RP2040 and tested on the Pi Pico)
You can change the MSC_BLOCK_COUNT to select smaller or larger transfer sizes - it also appears that multi-USB-packet bulk transfers gets uninterrupted by i.e interrupt transfers by HID requests, so if one sets a repeating large (>16-32 kB) transfers, the HID polling rate will be negatively impacted, but I suppose that's by the current HCD driver design.

I've been tinkering with your branch's code in a local repo - unfortunately I've first merged it with latest TinyUSB by manually copying the driver files so original commit history is lost. If you're interested, I can put it online :)

rppicomidi · 2025-03-21T18:07:21Z

@wbcbz7 Thank you for the additional information. I will try your experiment.

By design, the HCD first tries pending Control transfers, then pending Interrupt transfers, and only if there is any time left in the frame does it try pending Bulk transfers. If you are seeing large bulk transfers slowing down HID interrupt polling rate, then there is a problem. Control transfers that NAK for a long time could potentially slow down the polling interval because RP2040 auto-retry cannot be disabled for Control endpoints, but Bulk transfers should not.

The function that tells the RP2040 that both Interrupt and Bulk transfers are to be handled as Interrupt endpoint transfers is _hw_setup_epx_from_ep() (around line 288 in hcd_rp2040.c). This function is called before using the EPx hardware to do a transfer. Do you have evidence that the RP2040 is auto-retrying Bulk or Interrupt transfers? I use usb-sniffer-lite for USB traces and will test on my system with your example code also.

rppicomidi · 2025-03-21T22:04:47Z

@wbcbz7 I have addressed the hub issue and have done a git merge with the latest TinyUSB and pushed it up. I am trying your test code now and find no evidence of issues yet. I am using a powered 4-port hub, a USB mouse plus 2 USB flash drives. Is there a particular block size or blocks per transfer number that causes issues in your testing? Is it possible that the merged code you created introduced some other issue?

wbcbz7 · 2025-03-22T16:09:52Z

@rppicomidi, I've just linked the test application with your latest commit, and while it seems to bie slightly more stable, it still appears to occasionally hang during bulk transfers if control transfers are executed concurrently. I've tried usb-sniffer-lite to capture USB frame data but even with an external trigger it seems to not have enough buffer memory to capture the exact moment of hang, so I had to revert back to my logic analyzer. Attached there are two PulseView sessions with bulk in transfers gets intervened by control transfers, which leads to former being stuck - i've also attached USB packet text annotations exported.
It actaully seems that NAKed bulk transfers can be in fact preempted by other transfers, but for some reason it doesn't play well. There is one notable moment in the second capture - first SETUP transfer at 307ms leads to successive bluk transfer being stuck after a while (at 338 ms). Meanwhile, at 760ms, another SETUP packet is sent, and then there seems to be another bulk transfer being scheduled. For some reason, it seems to also end up in a "stuck" state, as there seems to be no further bulk transfers scheduled. Hope these dumps will be useful for you.

rp2040_tinyusb_bulk_hang_after_setup_2.zip
rp2040_tinyusb_bulk_hang_after_setup.zip

bflorac · 2025-05-23T03:28:28Z

I installed the changed files in a PICO 2W with FreeRTOS (running on 2 cores) and it does seem to prevent the run away NAKs.
I had to make sure all the USB related tasks (including the routine that calls tusb_init() were on the same core (I used core 1) or it got a panic in function hw_endpoint_lock_update() of rp2040_usb.c.

nobara256 · 2025-06-15T05:45:11Z

I was trying to implement a minimal USB HCI bridge driver for a rtl8761 based Bluetooth 5.3 Dongle and use with BTStack. I surprisingly worked well until I realized that the ACL endpoints (especially Bulk In transfers) are limited to 64KB/s, while playing with the a2dp sink demo. I scratched my head for days as it worked perfectly with libusb port on linux PC, finally got my answers in here.

This PR works great with the MSC demos but as the HCI driver required a lot of control and interrupt transfers for hci CMDs and EVENTs, it is hitting the Sequence Error after first CMD/EVENT transaction and just stopping there.

~~I guess this is very much a dead end.~~

My bad, somehow HCI code was requesting more than 16bytes from the EVENT endpoint, fix it now every thing works perfect. I don't care about ISO transfers.

I managed to get both tuh and btstack running on the same core (core0). To do this smoothly, I am spinning tuh_task in the mainloop untill bulk transfer packets (64 bytes) are fully consumed by the my HCI driver after that I am moving on to handling btstack and stdio tasks. core1 is running the aptx decoder and I2S tasks.

Also something else that gave me headache is the XIP cache mechanism. openaptx.c file has a lot of look up tables with "static const" that make them go into the flash, and this causes significant slowdown on core0 as well. Eventually the USB controller device drops some of it's buffered packets this results in a2dp packet loss on application side.
I had to manually "__not_in_flash" before every tables and functions to get it working smoothly. I still need to polish a lot of things, but without this PR this progress wasn't possible. Thanks :)

polpo · 2025-06-19T16:57:56Z

I really hope this is able to be merged. The speed boost from this made CD-ROM emulation support on my PicoGUS project possible. Thank you @rppicomidi and @wbcbz7. Right now I'm using wbcbz7's branch as a submodule but it'd be great to go back to mainline TinyUSB!

rppicomidi · 2025-06-27T03:52:19Z

@wbcbz7 Thank you for continuing to work on this. I have had a lot of things going on. Do you plan to submit a separate pull request, or would you like me to merge your changes into this one?

wbcbz7 · 2025-06-29T17:20:08Z

@rppicomidi Since my changes were mostly based on your pull request, I think either can be done. There is still some issues left, such as occasional asserts/panics if the device has been unplugged from a downstream USB hub during a bulk transfer - I could try to look for them and fix if it's possible and when time allows.

rppicomidi mentioned this pull request Sep 24, 2024

Arturia Beat Step Pro no data rppicomidi/usb_midi_host#14

Open

rppicomidi mentioned this pull request Nov 13, 2024

The work-around for the RP2040 DATA Sequence error bug is temporary shreeve/picousb#1

Open

ctag-fh-kiel approved these changes Dec 10, 2024

View reviewed changes

rppicomidi mentioned this pull request Jan 31, 2025

Communication error between two devices rppicomidi/midi2piousbhub#8

Open

rppicomidi added 3 commits February 4, 2025 09:10

Merge remote-tracking branch 'upstream/master' into epx_only_rp2040_host

dac1c2c

Remove dead code

b182562

Use as much of the USB frame as possible

c2d61bf

Force transfers requested from the application to start as soon as possible. No need to wait for SOF. Use as much of the frame as possible for pending transfers. If there is time left in the frame, can retry NAK'd BULK transfers.

Do not force transaction schedule start

f981f5c

With some MIDI devices, forcing the HCD scheduler to run using an unused interrupt source causes data corruption.

ctag-fh-kiel mentioned this pull request Feb 13, 2025

RP2040 MIDI Host #1627

Merged

rppicomidi added 4 commits March 21, 2025 12:59

Allow for more than 1 hub

2e6f1ff

Document ep_reg setup

ea8682b

Merge remote-tracking branch 'origin/master' into epx_only_rp2040_host

12f6093

Fix compiler warning

45f515f

wbcbz7 added a commit to wbcbz7/tinyusb that referenced this pull request May 8, 2025

implement EPx-only RP2040 host (copied from hathach#2814)

77c6925

Implement EPx-only RP2040 USB Host #2814

Are you sure you want to change the base?

Implement EPx-only RP2040 USB Host #2814

Conversation

rppicomidi commented Sep 24, 2024

Uh oh!

rppicomidi commented Sep 30, 2024

Uh oh!

hathach commented Oct 2, 2024

Uh oh!

shreeve commented Nov 10, 2024

Uh oh!

rppicomidi commented Nov 12, 2024

Uh oh!

shreeve commented Nov 13, 2024

Uh oh!

ctag-fh-kiel left a comment

Choose a reason for hiding this comment

Uh oh!

ctag-fh-kiel commented Jan 23, 2025

Uh oh!

rppicomidi commented Feb 4, 2025

Uh oh!

wbcbz7 commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rppicomidi commented Mar 21, 2025

Uh oh!

wbcbz7 commented Mar 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rppicomidi commented Mar 21, 2025

Uh oh!

rppicomidi commented Mar 21, 2025

Uh oh!

wbcbz7 commented Mar 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bflorac commented May 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nobara256 commented Jun 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

polpo commented Jun 19, 2025

Uh oh!

rppicomidi commented Jun 27, 2025

Uh oh!

wbcbz7 commented Jun 29, 2025

Uh oh!

Uh oh!

wbcbz7 commented Mar 18, 2025 •

edited

Loading

wbcbz7 commented Mar 21, 2025 •

edited

Loading

wbcbz7 commented Mar 22, 2025 •

edited

Loading

bflorac commented May 23, 2025 •

edited

Loading

nobara256 commented Jun 15, 2025 •

edited

Loading