uhci.txt 7.2 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165
  1. Specification and Internals for the New UHCI Driver (Whitepaper...)
  2. brought to you by
  3. Georg Acher, acher@in.tum.de (executive slave) (base guitar)
  4. Deti Fliegl, deti@fliegl.de (executive slave) (lead voice)
  5. Thomas Sailer, sailer@ife.ee.ethz.ch (chief consultant) (cheer leader)
  6. $Id: README.uhci,v 1.1 1999/12/14 14:03:02 fliegl Exp $
  7. This document and the new uhci sources can be found on
  8. http://hotswap.in.tum.de/usb
  9. 1. General issues
  10. 1.1 Why a new UHCI driver, we already have one?!?
  11. Correct, but its internal structure got more and more mixed up by the (still
  12. ongoing) efforts to get isochronous transfers (ISO) to work.
  13. Since there is an increasing need for reliable ISO-transfers (especially
  14. for USB-audio needed by TS and for a DAB-USB-Receiver build by GA and DF),
  15. this state was a bit unsatisfying in our opinion, so we've decided (based
  16. on knowledge and experiences with the old UHCI driver) to start
  17. from scratch with a new approach, much simpler but at the same time more
  18. powerful.
  19. It is inspired by the way Win98/Win2000 handles USB requests via URBs,
  20. but it's definitely 100% free of MS-code and doesn't crash while
  21. unplugging an used ISO-device like Win98 ;-)
  22. Some code for HW setup and root hub management was taken from the
  23. original UHCI driver, but heavily modified to fit into the new code.
  24. The invention of the basic concept, and major coding were completed in two
  25. days (and nights) on the 16th and 17th of October 1999, now known as the
  26. great USB-October-Revolution started by GA, DF, and TS ;-)
  27. Since the concept is in no way UHCI dependent, we hope that it will also be
  28. transferred to the OHCI-driver, so both drivers share a common API.
  29. 1.2. Advantages and disadvantages
  30. + All USB transfer types work now!
  31. + Asynchronous operation
  32. + Simple, but powerful interface (only two calls for start and cancel)
  33. + Easy migration to the new API, simplified by a compatibility API
  34. + Simple usage of ISO transfers
  35. + Automatic linking of requests
  36. + ISO transfers allow variable length for each frame and striping
  37. + No CPU dependent and non-portable atomic memory access, no asm()-inlines
  38. + Tested on x86 and Alpha
  39. - Rewriting for ISO transfers needed
  40. 1.3. Is there some compatibility to the old API?
  41. Yes, but only for control, bulk and interrupt transfers. We've implemented
  42. some wrapper calls for these transfer types. The usbcore works fine with
  43. these wrappers. For ISO there's no compatibility, because the old ISO-API
  44. and its semantics were unnecessary complicated in our opinion.
  45. 1.4. What's really working?
  46. As said above, CTRL and BULK already work fine even with the wrappers,
  47. so legacy code wouldn't notice the change.
  48. Regarding to Thomas, ISO transfers now run stable with USB audio.
  49. INT transfers (e.g. mouse driver) work fine, too.
  50. 1.5. Are there any bugs?
  51. No ;-)
  52. Hm...
  53. Well, of course this implementation needs extensive testing on all available
  54. hardware, but we believe that any fixes shouldn't harm the overall concept.
  55. 1.6. What should be done next?
  56. A large part of the request handling seems to be identical for UHCI and
  57. OHCI, so it would be a good idea to extract the common parts and have only
  58. the HW specific stuff in uhci.c. Furthermore, all other USB device drivers
  59. should need URBification, if they use isochronous or interrupt transfers.
  60. One thing missing in the current implementation (and the old UHCI driver)
  61. is fair queueing for BULK transfers. Since this would need (in principle)
  62. the alteration of already constructed TD chains (to switch from depth to
  63. breadth execution), another way has to be found. Maybe some simple
  64. heuristics work with the same effect.
  65. ---------------------------------------------------------------------------
  66. 2. Internal structure and mechanisms
  67. To get quickly familiar with the internal structures, here's a short
  68. description how the new UHCI driver works. However, the ultimate source of
  69. truth is only uhci.c!
  70. 2.1. Descriptor structure (QHs and TDs)
  71. During initialization, the following skeleton is allocated in init_skel:
  72. framespecific | common chain
  73. framelist[]
  74. [ 0 ]-----> TD --> TD -------\
  75. [ 1 ]-----> TD --> TD --------> TD ----> QH -------> QH -------> QH ---> NULL
  76. ... TD --> TD -------/
  77. [1023]-----> TD --> TD ------/
  78. ^^ ^^ ^^ ^^ ^^ ^^
  79. 1024 TDs for 7 TDs for 1 TD for Start of Start of End Chain
  80. ISO INT (2-128ms) 1ms-INT CTRL Chain BULK Chain
  81. For each CTRL or BULK transfer a new QH is allocated and the containing data
  82. transfers are appended as (vertical) TDs. After building the whole QH with its
  83. dangling TDs, the QH is inserted before the BULK Chain QH (for CTRL) or
  84. before the End Chain QH (for BULK). Since only the QH->next pointers are
  85. affected, no atomic memory operation is required. The three QHs in the
  86. common chain are never equipped with TDs!
  87. For ISO or INT, the TD for each frame is simply inserted into the appropriate
  88. ISO/INT-TD-chain for the desired frame. The 7 skeleton INT-TDs are scattered
  89. among the 1024 frames similar to the old UHCI driver.
  90. For CTRL/BULK/ISO, the last TD in the transfer has the IOC-bit set. For INT,
  91. every TD (there is only one...) has the IOC-bit set.
  92. Besides the data for the UHCI controller (2 or 4 32bit words), the descriptors
  93. are double-linked through the .vertical and .horizontal elements in the
  94. SW data of the descriptor (using the double-linked list structures and
  95. operations), but SW-linking occurs only in closed domains, i.e. for each of
  96. the 1024 ISO-chains and the 8 INT-chains there is a closed cycle. This
  97. simplifies all insertions and unlinking operations and avoids costly
  98. bus_to_virt()-calls.
  99. 2.2. URB structure and linking to QH/TDs
  100. During assembly of the QH and TDs of the requested action, these descriptors
  101. are stored in urb->urb_list, so the allocated QH/TD descriptors are bound to
  102. this URB.
  103. If the assembly was successful and the descriptors were added to the HW chain,
  104. the corresponding URB is inserted into a global URB list for this controller.
  105. This list stores all pending URBs.
  106. 2.3. Interrupt processing
  107. Since UHCI provides no means to directly detect completed transactions, the
  108. following is done in each UHCI interrupt (uhci_interrupt()):
  109. For each URB in the pending queue (process_urb()), the ACTIVE-flag of the
  110. associated TDs are processed (depending on the transfer type
  111. process_{transfer|interrupt|iso}()). If the TDs are not active anymore,
  112. they indicate the completion of the transaction and the status is calculated.
  113. Inactive QH/TDs are removed from the HW chain (since the host controller
  114. already removed the TDs from the QH, no atomic access is needed) and
  115. eventually the URB is marked as completed (OK or errors) and removed from the
  116. pending queue. Then the next linked URB is submitted. After (or immediately
  117. before) that, the completion handler is called.
  118. 2.4. Unlinking URBs
  119. First, all QH/TDs stored in the URB are unlinked from the HW chain.
  120. To ensure that the host controller really left a vertical TD chain, we
  121. wait for one frame. After that, the TDs are physically destroyed.
  122. 2.5. URB linking and the consequences
  123. Since URBs can be linked and the corresponding submit_urb is called in
  124. the UHCI-interrupt, all work associated with URB/QH/TD assembly has to be
  125. interrupt save. This forces kmalloc to use GFP_ATOMIC in the interrupt.