In-kernel memory-mapped I/O tracing

Home page and links to optional user space tools:

    http://nouveau.freedesktop.org/wiki/MmioTrace

MMIO tracing was originally developed by Intel around 2003 for their Fault
Injection Test Harness. In Dec 2006 - Jan 2007, using the code from Intel,
Jeff Muizelaar created a tool for tracing MMIO accesses with the Nouveau
project in mind. Since then many people have contributed.

Mmiotrace was built for reverse engineering any memory-mapped IO device with
the Nouveau project as the first real user. Only x86 and x86_64 architectures
are supported.

Out-of-tree mmiotrace was originally modified for mainline inclusion and the
ftrace framework by Pekka Paalanen <pq@iki.fi>.

Preparation
-----------

The mmiotrace feature is compiled in with the CONFIG_MMIOTRACE option.
Tracing is disabled by default, so it is safe to have this set to yes. SMP
systems are supported, but tracing is unreliable and may miss events if more
than one CPU is on-line. Therefore mmiotrace takes all but one CPU off-line
during run-time activation. You can re-enable CPUs by hand, but be warned:
there is no way to automatically detect whether you are losing events due to
CPUs racing.

Usage Quick Reference
---------------------

$ mount -t debugfs debugfs /sys/kernel/debug
$ echo mmiotrace > /sys/kernel/debug/tracing/current_tracer
$ cat /sys/kernel/debug/tracing/trace_pipe > mydump.txt &
Start X or whatever.
$ echo "X is up" > /sys/kernel/debug/tracing/trace_marker
$ echo nop > /sys/kernel/debug/tracing/current_tracer
Check for lost events.

Usage
-----

Make sure debugfs is mounted to /sys/kernel/debug.
If not (requires root privileges):
$ mount -t debugfs debugfs /sys/kernel/debug

Check that the driver you are about to trace is not loaded.

Activate mmiotrace (requires root privileges):
$ echo mmiotrace > /sys/kernel/debug/tracing/current_tracer

Start storing the trace:
$ cat /sys/kernel/debug/tracing/trace_pipe > mydump.txt &
The 'cat' process should stay running (sleeping) in the background.

Load the driver you want to trace and use it. Mmiotrace will only catch MMIO
accesses to areas that are ioremapped while mmiotrace is active.

During tracing you can place comments (markers) into the trace with
$ echo "X is up" > /sys/kernel/debug/tracing/trace_marker
This makes it easier to see which part of the (huge) trace corresponds to
which action. It is recommended to place descriptive markers about what you
do.
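
Placing markers by hand is easy to forget, so a small wrapper can help. The
following is only a sketch: the function names mark and marked, and the
TRACING variable, are hypothetical helpers and not part of mmiotrace. It
assumes trace_marker lives in the tracing directory used above and that the
shell has permission to write to it.

```shell
#!/bin/sh
# Hypothetical helpers, not part of mmiotrace itself.
# TRACING is an assumed variable; it defaults to the path used in this text.
TRACING=${TRACING:-/sys/kernel/debug/tracing}

# mark TEXT...: write one marker line into the trace.
mark() {
    printf '%s\n' "$*" > "$TRACING/trace_marker"
}

# marked CMD [ARGS...]: run a command bracketed by begin/end markers and
# return the command's own exit status.
marked() {
    mark "begin: $*"
    status=0
    "$@" || status=$?
    mark "end: $* (exit $status)"
    return "$status"
}
```

With these defined, an action can be bracketed in the trace with
$ marked modprobe nouveau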

Shut down mmiotrace (requires root privileges):
$ echo nop > /sys/kernel/debug/tracing/current_tracer
The 'cat' process exits. If it does not, kill it by issuing the 'fg' command
and pressing ctrl+c.

Check that mmiotrace did not lose events due to a buffer filling up. Either
run
$ grep -i lost mydump.txt
which tells you exactly how many events were lost, or use
$ dmesg
to view your kernel log and look for the "mmiotrace has lost events" warning.
If events were lost, the trace is incomplete. You should enlarge the buffers
and try again. To enlarge the buffers, first check how large the current
buffers are:
$ cat /sys/kernel/debug/tracing/buffer_size_kb
This gives you a number. Approximately double this number and write it back,
for instance:
$ echo 128000 > /sys/kernel/debug/tracing/buffer_size_kb
Then start again from the top.
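
The check-and-enlarge steps above can be sketched as two shell functions.
Both names (has_lost_events, double_buffer_size) are hypothetical, and the
optional file argument exists only so the functions can be tried on an
ordinary file. The sketch assumes the buffer size is the last number on the
first line of buffer_size_kb, which may not hold on every kernel; check the
file by hand before relying on it.

```shell
#!/bin/sh
# Hypothetical helpers, not part of mmiotrace itself.

# has_lost_events FILE: succeed if the dump mentions lost events
# (the same test as 'grep -i lost', without printing the lines).
has_lost_events() {
    grep -qi lost "$1"
}

# double_buffer_size [FILE]: read the current size in KB and write back
# twice that value. Writing to the real file requires root privileges.
double_buffer_size() {
    file=${1:-/sys/kernel/debug/tracing/buffer_size_kb}
    # Assumption: the size is the last number on the line.
    current=$(tr -cs '0-9' '\n' < "$file" | grep . | tail -n 1)
    [ -n "$current" ] || return 1
    echo $((current * 2)) > "$file"
}
```

For example:
$ has_lost_events mydump.txt && double_buffer_size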

If you are doing a trace for a driver project, e.g. Nouveau, you should also
do the following before sending your results:
$ lspci -vvv > lspci.txt
$ dmesg > dmesg.txt
$ tar zcf pciid-nick-mmiotrace.tar.gz mydump.txt lspci.txt dmesg.txt
and then send the .tar.gz file. The trace compresses considerably. Replace
"pciid" and "nick" with the PCI ID or model name of your piece of hardware
under investigation and your nickname.

How Mmiotrace Works
-------------------

Access to hardware IO-memory is gained by mapping addresses from the PCI bus
by calling one of the ioremap_*() functions. Mmiotrace is hooked into the
__ioremap() function and gets called whenever a mapping is created. Mapping is
an event that is recorded into the trace log. Note that ISA range mappings
are not caught, since the mapping always exists and is returned directly.

MMIO accesses are recorded via page faults. Just before __ioremap() returns,
the mapped pages are marked as not present. Any access to the pages causes a
fault. The page fault handler calls mmiotrace to handle the fault. Mmiotrace
marks the page present, sets the TF flag to achieve single stepping and exits
the fault handler. The instruction that faulted is executed and the debug trap
is entered. Here mmiotrace again marks the page as not present. The
instruction is decoded to get the type of operation (read/write), data width
and the value read or written. These are stored to the trace log.

Setting the page present in the page fault handler has a race condition on SMP
machines. During the single stepping other CPUs may run freely on that page
and events can be missed without notice. Re-enabling other CPUs during
tracing is discouraged.

Trace Log Format
----------------

The raw log is text and easily filtered with e.g. grep and awk. One record is
one line in the log. A record starts with a keyword, followed by
keyword-dependent arguments. Arguments are separated by a space, or continue
until the end of line. The format for version 20070824 is as follows:

Explanation       Keyword   Space-separated arguments
---------------------------------------------------------------------------

read event        R         width, timestamp, map id, physical, value, PC, PID
write event       W         width, timestamp, map id, physical, value, PC, PID
ioremap event     MAP       timestamp, map id, physical, virtual, length, PC, PID
iounmap event     UNMAP     timestamp, map id, PC, PID
marker            MARK      timestamp, text
version           VERSION   the string "20070824"
info for reader   LSPCI     one line from lspci -v
PCI address map   PCIDEV    space-separated /proc/bus/pci/devices data
unk. opcode       UNKNOWN   timestamp, map id, physical, data, PC, PID

Timestamp is in seconds with decimals. Physical is a PCI bus address, virtual
is a kernel virtual address. Width is the data width in bytes and value is the
data value. Map id is an arbitrary id number identifying the mapping that was
used in an operation. PC is the program counter and PID is process id. PC is
zero if it is not recorded. PID is always zero as tracing MMIO accesses
originating in user space memory is not yet supported.
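
Because every record starts with its keyword, a first overview of a dump
takes only a few lines of awk. The sketch below counts the records per
keyword and adds up the width column of the R and W records. The function
name summarize_trace is hypothetical; the field positions are taken from the
table above.

```shell
#!/bin/sh
# Hypothetical helper, not part of mmiotrace itself.

# summarize_trace FILE: print "KEYWORD COUNT" for every keyword in the dump,
# plus the total number of bytes read and written.
summarize_trace() {
    awk '
        NF == 0 { next }                 # skip empty lines
        { count[$1]++ }                  # field 1 is the keyword
        $1 == "R" { rbytes += $2 }       # field 2 of R/W is the width
        $1 == "W" { wbytes += $2 }
        END {
            for (k in count) printf "%s %d\n", k, count[k]
            printf "bytes_read %d\n", rbytes
            printf "bytes_written %d\n", wbytes
        }
    ' "$1" | LC_ALL=C sort
}
```

For example:
$ summarize_trace mydump.txt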

For instance, the following awk filter (GNU awk, which provides strtonum)
passes all 32-bit writes that target physical addresses in the range
[0xfb73ce40, 0xfb800000):

$ awk '$1 == "W" && $2 == 4 { adr = strtonum($5); if (adr >= 0xfb73ce40 &&
adr < 0xfb800000) print; }' mydump.txt
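
On systems without GNU awk, strtonum and hexadecimal constants are not
available. The following sketch does the same range filtering with a
hand-written hex parser. The names filter_writes and hex are hypothetical,
and the parser assumes well-formed addresses that fit into the precision of
awk numbers.

```shell
#!/bin/sh
# Hypothetical helper, not part of mmiotrace itself.

# filter_writes FILE LO HI: print 32-bit writes with LO <= physical < HI.
# LO and HI are hexadecimal, with or without a leading 0x.
filter_writes() {
    awk -v lo="$2" -v hi="$3" '
        # Convert a hex string to a number; input is assumed to be valid.
        function hex(s,    i, d, n) {
            s = tolower(s)
            sub(/^0x/, "", s)
            n = 0
            for (i = 1; i <= length(s); i++) {
                d = index("0123456789abcdef", substr(s, i, 1)) - 1
                n = n * 16 + d
            }
            return n
        }
        BEGIN { lo = hex(lo); hi = hex(hi) }
        $1 == "W" && $2 == 4 {
            adr = hex($5)                # field 5 is the physical address
            if (adr >= lo && adr < hi) print
        }
    ' "$1"
}
```

For example:
$ filter_writes mydump.txt 0xfb73ce40 0xfb800000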

Tools for Developers
--------------------

The user space tools include utilities for:
- replacing numeric addresses and values with hardware register names
- replaying MMIO logs, i.e., re-executing the recorded writes