123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593594595596597598599600601602603604605606607608609610611612613614615616617618619620621622623624625626627628629630631632633634635636637638639640641642643644645646647648649650651652653654655656657658659660661662663664665666667668669670671672673674675676677678679680681682683684685686687688689690691692693694695696697698699700701702703704705706707708709710711712713714715716717718719720721722723724725726727728729730731732733734735736737738739740741742743744745746747748749750751752753754755756757758759760761762763764765766767768769770771772773774775776777778779780781782783784785786787788789790791792793794795796797798799800801802803804805806807808809810811812813814815816817818819820821822823824825826827828829830831832833834835836837838839840841842843844845846847848849850851852853854855856857858859860861862863864865866867868869870871872873874875876877878879880881882883884885886887888889890891892893894895896897898899900901902903904905906907908909910911912913914915916917918919920921922923924925926927928929930931932933934935936937938939940941942943944945946947948949950951952953954955956957958959960961962963964965966967968969970971972973974975976977978979980981982983984985986987988989990991992993994995996997998999100010011002100310041005100610071008100910101011101210131014101510161017101810191020102110221023102410251026102710281029103010311032103310341035103610371038103910401041104210431044104510461047104810491050105110521053105410551056105710581059106010611062106310641065106610671068106910701071107210731074107510761077107810791080108110821083108410851086108710881089109010911092109310941095109610971098109911001101110211031104110511061107110811091110111111121113111411151116111711181119112011211122112311241125112611271128112911301131 |
- THE LINUX/x86 BOOT PROTOCOL
- ---------------------------
- On the x86 platform, the Linux kernel uses a rather complicated boot
- convention. This has evolved partially due to historical aspects, as
- well as the desire in the early days to have the kernel itself be a
- bootable image, the complicated PC memory model and due to changed
- expectations in the PC industry caused by the effective demise of
- real-mode DOS as a mainstream operating system.
- Currently, the following versions of the Linux/x86 boot protocol exist.
- Old kernels: zImage/Image support only. Some very early kernels
- may not even support a command line.
- Protocol 2.00: (Kernel 1.3.73) Added bzImage and initrd support, as
- well as a formalized way to communicate between the
- boot loader and the kernel. setup.S made relocatable,
- although the traditional setup area still assumed
- writable.
- Protocol 2.01: (Kernel 1.3.76) Added a heap overrun warning.
- Protocol 2.02: (Kernel 2.4.0-test3-pre3) New command line protocol.
- Lower the conventional memory ceiling. No overwrite
- of the traditional setup area, thus making booting
- safe for systems which use the EBDA from SMM or 32-bit
- BIOS entry points. zImage deprecated but still
- supported.
- Protocol 2.03: (Kernel 2.4.18-pre1) Explicitly makes the highest possible
- initrd address available to the bootloader.
- Protocol 2.04: (Kernel 2.6.14) Extend the syssize field to four bytes.
- Protocol 2.05: (Kernel 2.6.20) Make protected mode kernel relocatable.
- Introduce relocatable_kernel and kernel_alignment fields.
- Protocol 2.06: (Kernel 2.6.22) Added a field that contains the size of
- the boot command line.
- Protocol 2.07: (Kernel 2.6.24) Added paravirtualised boot protocol.
- Introduced hardware_subarch and hardware_subarch_data
- and KEEP_SEGMENTS flag in load_flags.
- Protocol 2.08: (Kernel 2.6.26) Added crc32 checksum and ELF format
- payload. Introduced payload_offset and payload_length
- fields to aid in locating the payload.
- Protocol 2.09: (Kernel 2.6.26) Added a field of 64-bit physical
- pointer to single linked list of struct setup_data.
- Protocol 2.10: (Kernel 2.6.31) Added a protocol for relaxed alignment
- beyond the kernel_alignment added, new init_size and
- pref_address fields. Added extended boot loader IDs.
- Protocol 2.11: (Kernel 3.6) Added a field for offset of EFI handover
- protocol entry point.
- Protocol 2.12: (Kernel 3.8) Added the xloadflags field and extension fields
- to struct boot_params for loading bzImage and ramdisk
- above 4G in 64bit.
- **** MEMORY LAYOUT
- The traditional memory map for the kernel loader, used for Image or
- zImage kernels, typically looks like:
- | |
- 0A0000 +------------------------+
- | Reserved for BIOS | Do not use. Reserved for BIOS EBDA.
- 09A000 +------------------------+
- | Command line |
- | Stack/heap | For use by the kernel real-mode code.
- 098000 +------------------------+
- | Kernel setup | The kernel real-mode code.
- 090200 +------------------------+
- | Kernel boot sector | The kernel legacy boot sector.
- 090000 +------------------------+
- | Protected-mode kernel | The bulk of the kernel image.
- 010000 +------------------------+
- | Boot loader | <- Boot sector entry point 0000:7C00
- 001000 +------------------------+
- | Reserved for MBR/BIOS |
- 000800 +------------------------+
- | Typically used by MBR |
- 000600 +------------------------+
- | BIOS use only |
- 000000 +------------------------+
- When using bzImage, the protected-mode kernel was relocated to
- 0x100000 ("high memory"), and the kernel real-mode block (boot sector,
- setup, and stack/heap) was made relocatable to any address between
- 0x10000 and end of low memory. Unfortunately, in protocols 2.00 and
- 2.01 the 0x90000+ memory range is still used internally by the kernel;
- the 2.02 protocol resolves that problem.
- It is desirable to keep the "memory ceiling" -- the highest point in
- low memory touched by the boot loader -- as low as possible, since
- some newer BIOSes have begun to allocate some rather large amounts of
- memory, called the Extended BIOS Data Area, near the top of low
- memory. The boot loader should use the "INT 12h" BIOS call to verify
- how much low memory is available.
- Unfortunately, if INT 12h reports that the amount of memory is too
- low, there is usually nothing the boot loader can do but to report an
- error to the user. The boot loader should therefore be designed to
- take up as little space in low memory as it reasonably can. For
- zImage or old bzImage kernels, which need data written into the
- 0x90000 segment, the boot loader should make sure not to use memory
- above the 0x9A000 point; too many BIOSes will break above that point.
- For a modern bzImage kernel with boot protocol version >= 2.02, a
- memory layout like the following is suggested:
- ~ ~
- | Protected-mode kernel |
- 100000 +------------------------+
- | I/O memory hole |
- 0A0000 +------------------------+
- | Reserved for BIOS | Leave as much as possible unused
- ~ ~
- | Command line | (Can also be below the X+10000 mark)
- X+10000 +------------------------+
- | Stack/heap | For use by the kernel real-mode code.
- X+08000 +------------------------+
- | Kernel setup | The kernel real-mode code.
- | Kernel boot sector | The kernel legacy boot sector.
- X +------------------------+
- | Boot loader | <- Boot sector entry point 0000:7C00
- 001000 +------------------------+
- | Reserved for MBR/BIOS |
- 000800 +------------------------+
- | Typically used by MBR |
- 000600 +------------------------+
- | BIOS use only |
- 000000 +------------------------+
- ... where the address X is as low as the design of the boot loader
- permits.
- **** THE REAL-MODE KERNEL HEADER
- In the following text, and anywhere in the kernel boot sequence, "a
- sector" refers to 512 bytes. It is independent of the actual sector
- size of the underlying medium.
- The first step in loading a Linux kernel should be to load the
- real-mode code (boot sector and setup code) and then examine the
- following header at offset 0x01f1. The real-mode code can total up to
- 32K, although the boot loader may choose to load only the first two
- sectors (1K) and then examine the bootup sector size.
- The header looks like:
- Offset Proto Name Meaning
- /Size
- 01F1/1 ALL(1 setup_sects The size of the setup in sectors
- 01F2/2 ALL root_flags If set, the root is mounted readonly
- 01F4/4 2.04+(2 syssize The size of the 32-bit code in 16-byte paras
- 01F8/2 ALL ram_size DO NOT USE - for bootsect.S use only
- 01FA/2 ALL vid_mode Video mode control
- 01FC/2 ALL root_dev Default root device number
- 01FE/2 ALL boot_flag 0xAA55 magic number
- 0200/2 2.00+ jump Jump instruction
- 0202/4 2.00+ header Magic signature "HdrS"
- 0206/2 2.00+ version Boot protocol version supported
- 0208/4 2.00+ realmode_swtch Boot loader hook (see below)
- 020C/2 2.00+ start_sys_seg The load-low segment (0x1000) (obsolete)
- 020E/2 2.00+ kernel_version Pointer to kernel version string
- 0210/1 2.00+ type_of_loader Boot loader identifier
- 0211/1 2.00+ loadflags Boot protocol option flags
- 0212/2 2.00+ setup_move_size Move to high memory size (used with hooks)
- 0214/4 2.00+ code32_start Boot loader hook (see below)
- 0218/4 2.00+ ramdisk_image initrd load address (set by boot loader)
- 021C/4 2.00+ ramdisk_size initrd size (set by boot loader)
- 0220/4 2.00+ bootsect_kludge DO NOT USE - for bootsect.S use only
- 0224/2 2.01+ heap_end_ptr Free memory after setup end
- 0226/1 2.02+(3 ext_loader_ver Extended boot loader version
- 0227/1 2.02+(3 ext_loader_type Extended boot loader ID
- 0228/4 2.02+ cmd_line_ptr 32-bit pointer to the kernel command line
- 022C/4 2.03+ initrd_addr_max Highest legal initrd address
- 0230/4 2.05+ kernel_alignment Physical addr alignment required for kernel
- 0234/1 2.05+ relocatable_kernel Whether kernel is relocatable or not
- 0235/1 2.10+ min_alignment Minimum alignment, as a power of two
- 0236/2 2.12+ xloadflags Boot protocol option flags
- 0238/4 2.06+ cmdline_size Maximum size of the kernel command line
- 023C/4 2.07+ hardware_subarch Hardware subarchitecture
- 0240/8 2.07+ hardware_subarch_data Subarchitecture-specific data
- 0248/4 2.08+ payload_offset Offset of kernel payload
- 024C/4 2.08+ payload_length Length of kernel payload
- 0250/8 2.09+ setup_data 64-bit physical pointer to linked list
- of struct setup_data
- 0258/8 2.10+ pref_address Preferred loading address
- 0260/4 2.10+ init_size Linear memory required during initialization
- 0264/4 2.11+ handover_offset Offset of handover entry point
- (1) For backwards compatibility, if the setup_sects field contains 0, the
- real value is 4.
- (2) For boot protocol prior to 2.04, the upper two bytes of the syssize
- field are unusable, which means the size of a bzImage kernel
- cannot be determined.
- (3) Ignored, but safe to set, for boot protocols 2.02-2.09.
- If the "HdrS" (0x53726448) magic number is not found at offset 0x202,
- the boot protocol version is "old". Loading an old kernel, the
- following parameters should be assumed:
- Image type = zImage
- initrd not supported
- Real-mode kernel must be located at 0x90000.
- Otherwise, the "version" field contains the protocol version,
- e.g. protocol version 2.01 will contain 0x0201 in this field. When
- setting fields in the header, you must make sure only to set fields
- supported by the protocol version in use.
- **** DETAILS OF HEADER FIELDS
- For each field, some are information from the kernel to the bootloader
- ("read"), some are expected to be filled out by the bootloader
- ("write"), and some are expected to be read and modified by the
- bootloader ("modify").
- All general purpose boot loaders should write the fields marked
- (obligatory). Boot loaders who want to load the kernel at a
- nonstandard address should fill in the fields marked (reloc); other
- boot loaders can ignore those fields.
- The byte order of all fields is littleendian (this is x86, after all.)
- Field name: setup_sects
- Type: read
- Offset/size: 0x1f1/1
- Protocol: ALL
- The size of the setup code in 512-byte sectors. If this field is
- 0, the real value is 4. The real-mode code consists of the boot
- sector (always one 512-byte sector) plus the setup code.
- Field name: root_flags
- Type: modify (optional)
- Offset/size: 0x1f2/2
- Protocol: ALL
- If this field is nonzero, the root defaults to readonly. The use of
- this field is deprecated; use the "ro" or "rw" options on the
- command line instead.
- Field name: syssize
- Type: read
- Offset/size: 0x1f4/4 (protocol 2.04+) 0x1f4/2 (protocol ALL)
- Protocol: 2.04+
- The size of the protected-mode code in units of 16-byte paragraphs.
- For protocol versions older than 2.04 this field is only two bytes
- wide, and therefore cannot be trusted for the size of a kernel if
- the LOAD_HIGH flag is set.
- Field name: ram_size
- Type: kernel internal
- Offset/size: 0x1f8/2
- Protocol: ALL
- This field is obsolete.
- Field name: vid_mode
- Type: modify (obligatory)
- Offset/size: 0x1fa/2
- Please see the section on SPECIAL COMMAND LINE OPTIONS.
- Field name: root_dev
- Type: modify (optional)
- Offset/size: 0x1fc/2
- Protocol: ALL
- The default root device device number. The use of this field is
- deprecated, use the "root=" option on the command line instead.
- Field name: boot_flag
- Type: read
- Offset/size: 0x1fe/2
- Protocol: ALL
- Contains 0xAA55. This is the closest thing old Linux kernels have
- to a magic number.
- Field name: jump
- Type: read
- Offset/size: 0x200/2
- Protocol: 2.00+
- Contains an x86 jump instruction, 0xEB followed by a signed offset
- relative to byte 0x202. This can be used to determine the size of
- the header.
- Field name: header
- Type: read
- Offset/size: 0x202/4
- Protocol: 2.00+
- Contains the magic number "HdrS" (0x53726448).
- Field name: version
- Type: read
- Offset/size: 0x206/2
- Protocol: 2.00+
- Contains the boot protocol version, in (major << 8)+minor format,
- e.g. 0x0204 for version 2.04, and 0x0a11 for a hypothetical version
- 10.17.
- Field name: realmode_swtch
- Type: modify (optional)
- Offset/size: 0x208/4
- Protocol: 2.00+
- Boot loader hook (see ADVANCED BOOT LOADER HOOKS below.)
- Field name: start_sys_seg
- Type: read
- Offset/size: 0x20c/2
- Protocol: 2.00+
- The load low segment (0x1000). Obsolete.
- Field name: kernel_version
- Type: read
- Offset/size: 0x20e/2
- Protocol: 2.00+
- If set to a nonzero value, contains a pointer to a NUL-terminated
- human-readable kernel version number string, less 0x200. This can
- be used to display the kernel version to the user. This value
- should be less than (0x200*setup_sects).
- For example, if this value is set to 0x1c00, the kernel version
- number string can be found at offset 0x1e00 in the kernel file.
- This is a valid value if and only if the "setup_sects" field
- contains the value 15 or higher, as:
- 0x1c00 < 15*0x200 (= 0x1e00) but
- 0x1c00 >= 14*0x200 (= 0x1c00)
- 0x1c00 >> 9 = 14, so the minimum value for setup_secs is 15.
- Field name: type_of_loader
- Type: write (obligatory)
- Offset/size: 0x210/1
- Protocol: 2.00+
- If your boot loader has an assigned id (see table below), enter
- 0xTV here, where T is an identifier for the boot loader and V is
- a version number. Otherwise, enter 0xFF here.
- For boot loader IDs above T = 0xD, write T = 0xE to this field and
- write the extended ID minus 0x10 to the ext_loader_type field.
- Similarly, the ext_loader_ver field can be used to provide more than
- four bits for the bootloader version.
- For example, for T = 0x15, V = 0x234, write:
- type_of_loader <- 0xE4
- ext_loader_type <- 0x05
- ext_loader_ver <- 0x23
- Assigned boot loader ids (hexadecimal):
- 0 LILO (0x00 reserved for pre-2.00 bootloader)
- 1 Loadlin
- 2 bootsect-loader (0x20, all other values reserved)
- 3 Syslinux
- 4 Etherboot/gPXE/iPXE
- 5 ELILO
- 7 GRUB
- 8 U-Boot
- 9 Xen
- A Gujin
- B Qemu
- C Arcturus Networks uCbootloader
- D kexec-tools
- E Extended (see ext_loader_type)
- F Special (0xFF = undefined)
- 10 Reserved
- 11 Minimal Linux Bootloader <http://sebastian-plotz.blogspot.de>
- 12 OVMF UEFI virtualization stack
- Please contact <hpa@zytor.com> if you need a bootloader ID
- value assigned.
- Field name: loadflags
- Type: modify (obligatory)
- Offset/size: 0x211/1
- Protocol: 2.00+
- This field is a bitmask.
- Bit 0 (read): LOADED_HIGH
- - If 0, the protected-mode code is loaded at 0x10000.
- - If 1, the protected-mode code is loaded at 0x100000.
- Bit 1 (kernel internal): KASLR_FLAG
- - Used internally by the compressed kernel to communicate
- KASLR status to kernel proper.
- If 1, KASLR enabled.
- If 0, KASLR disabled.
- Bit 5 (write): QUIET_FLAG
- - If 0, print early messages.
- - If 1, suppress early messages.
- This requests to the kernel (decompressor and early
- kernel) to not write early messages that require
- accessing the display hardware directly.
- Bit 6 (write): KEEP_SEGMENTS
- Protocol: 2.07+
- - If 0, reload the segment registers in the 32bit entry point.
- - If 1, do not reload the segment registers in the 32bit entry point.
- Assume that %cs %ds %ss %es are all set to flat segments with
- a base of 0 (or the equivalent for their environment).
- Bit 7 (write): CAN_USE_HEAP
- Set this bit to 1 to indicate that the value entered in the
- heap_end_ptr is valid. If this field is clear, some setup code
- functionality will be disabled.
- Field name: setup_move_size
- Type: modify (obligatory)
- Offset/size: 0x212/2
- Protocol: 2.00-2.01
- When using protocol 2.00 or 2.01, if the real mode kernel is not
- loaded at 0x90000, it gets moved there later in the loading
- sequence. Fill in this field if you want additional data (such as
- the kernel command line) moved in addition to the real-mode kernel
- itself.
- The unit is bytes starting with the beginning of the boot sector.
-
- This field is can be ignored when the protocol is 2.02 or higher, or
- if the real-mode code is loaded at 0x90000.
- Field name: code32_start
- Type: modify (optional, reloc)
- Offset/size: 0x214/4
- Protocol: 2.00+
- The address to jump to in protected mode. This defaults to the load
- address of the kernel, and can be used by the boot loader to
- determine the proper load address.
- This field can be modified for two purposes:
- 1. as a boot loader hook (see ADVANCED BOOT LOADER HOOKS below.)
- 2. if a bootloader which does not install a hook loads a
- relocatable kernel at a nonstandard address it will have to modify
- this field to point to the load address.
- Field name: ramdisk_image
- Type: write (obligatory)
- Offset/size: 0x218/4
- Protocol: 2.00+
- The 32-bit linear address of the initial ramdisk or ramfs. Leave at
- zero if there is no initial ramdisk/ramfs.
- Field name: ramdisk_size
- Type: write (obligatory)
- Offset/size: 0x21c/4
- Protocol: 2.00+
- Size of the initial ramdisk or ramfs. Leave at zero if there is no
- initial ramdisk/ramfs.
- Field name: bootsect_kludge
- Type: kernel internal
- Offset/size: 0x220/4
- Protocol: 2.00+
- This field is obsolete.
- Field name: heap_end_ptr
- Type: write (obligatory)
- Offset/size: 0x224/2
- Protocol: 2.01+
- Set this field to the offset (from the beginning of the real-mode
- code) of the end of the setup stack/heap, minus 0x0200.
- Field name: ext_loader_ver
- Type: write (optional)
- Offset/size: 0x226/1
- Protocol: 2.02+
- This field is used as an extension of the version number in the
- type_of_loader field. The total version number is considered to be
- (type_of_loader & 0x0f) + (ext_loader_ver << 4).
- The use of this field is boot loader specific. If not written, it
- is zero.
- Kernels prior to 2.6.31 did not recognize this field, but it is safe
- to write for protocol version 2.02 or higher.
- Field name: ext_loader_type
- Type: write (obligatory if (type_of_loader & 0xf0) == 0xe0)
- Offset/size: 0x227/1
- Protocol: 2.02+
- This field is used as an extension of the type number in
- type_of_loader field. If the type in type_of_loader is 0xE, then
- the actual type is (ext_loader_type + 0x10).
- This field is ignored if the type in type_of_loader is not 0xE.
- Kernels prior to 2.6.31 did not recognize this field, but it is safe
- to write for protocol version 2.02 or higher.
- Field name: cmd_line_ptr
- Type: write (obligatory)
- Offset/size: 0x228/4
- Protocol: 2.02+
- Set this field to the linear address of the kernel command line.
- The kernel command line can be located anywhere between the end of
- the setup heap and 0xA0000; it does not have to be located in the
- same 64K segment as the real-mode code itself.
- Fill in this field even if your boot loader does not support a
- command line, in which case you can point this to an empty string
- (or better yet, to the string "auto".) If this field is left at
- zero, the kernel will assume that your boot loader does not support
- the 2.02+ protocol.
- Field name: initrd_addr_max
- Type: read
- Offset/size: 0x22c/4
- Protocol: 2.03+
- The maximum address that may be occupied by the initial
- ramdisk/ramfs contents. For boot protocols 2.02 or earlier, this
- field is not present, and the maximum address is 0x37FFFFFF. (This
- address is defined as the address of the highest safe byte, so if
- your ramdisk is exactly 131072 bytes long and this field is
- 0x37FFFFFF, you can start your ramdisk at 0x37FE0000.)
- Field name: kernel_alignment
- Type: read/modify (reloc)
- Offset/size: 0x230/4
- Protocol: 2.05+ (read), 2.10+ (modify)
- Alignment unit required by the kernel (if relocatable_kernel is
- true.) A relocatable kernel that is loaded at an alignment
- incompatible with the value in this field will be realigned during
- kernel initialization.
- Starting with protocol version 2.10, this reflects the kernel
- alignment preferred for optimal performance; it is possible for the
- loader to modify this field to permit a lesser alignment. See the
- min_alignment and pref_address field below.
- Field name: relocatable_kernel
- Type: read (reloc)
- Offset/size: 0x234/1
- Protocol: 2.05+
- If this field is nonzero, the protected-mode part of the kernel can
- be loaded at any address that satisfies the kernel_alignment field.
- After loading, the boot loader must set the code32_start field to
- point to the loaded code, or to a boot loader hook.
- Field name: min_alignment
- Type: read (reloc)
- Offset/size: 0x235/1
- Protocol: 2.10+
- This field, if nonzero, indicates as a power of two the minimum
- alignment required, as opposed to preferred, by the kernel to boot.
- If a boot loader makes use of this field, it should update the
- kernel_alignment field with the alignment unit desired; typically:
- kernel_alignment = 1 << min_alignment
- There may be a considerable performance cost with an excessively
- misaligned kernel. Therefore, a loader should typically try each
- power-of-two alignment from kernel_alignment down to this alignment.
- Field name: xloadflags
- Type: read
- Offset/size: 0x236/2
- Protocol: 2.12+
- This field is a bitmask.
- Bit 0 (read): XLF_KERNEL_64
- - If 1, this kernel has the legacy 64-bit entry point at 0x200.
- Bit 1 (read): XLF_CAN_BE_LOADED_ABOVE_4G
- - If 1, kernel/boot_params/cmdline/ramdisk can be above 4G.
- Bit 2 (read): XLF_EFI_HANDOVER_32
- - If 1, the kernel supports the 32-bit EFI handoff entry point
- given at handover_offset.
- Bit 3 (read): XLF_EFI_HANDOVER_64
- - If 1, the kernel supports the 64-bit EFI handoff entry point
- given at handover_offset + 0x200.
- Bit 4 (read): XLF_EFI_KEXEC
- - If 1, the kernel supports kexec EFI boot with EFI runtime support.
- Field name: cmdline_size
- Type: read
- Offset/size: 0x238/4
- Protocol: 2.06+
- The maximum size of the command line without the terminating
- zero. This means that the command line can contain at most
- cmdline_size characters. With protocol version 2.05 and earlier, the
- maximum size was 255.
- Field name: hardware_subarch
- Type: write (optional, defaults to x86/PC)
- Offset/size: 0x23c/4
- Protocol: 2.07+
- In a paravirtualized environment the hardware low level architectural
- pieces such as interrupt handling, page table handling, and
- accessing process control registers needs to be done differently.
- This field allows the bootloader to inform the kernel we are in one
- one of those environments.
- 0x00000000 The default x86/PC environment
- 0x00000001 lguest
- 0x00000002 Xen
- 0x00000003 Moorestown MID
- 0x00000004 CE4100 TV Platform
- Field name: hardware_subarch_data
- Type: write (subarch-dependent)
- Offset/size: 0x240/8
- Protocol: 2.07+
- A pointer to data that is specific to hardware subarch
- This field is currently unused for the default x86/PC environment,
- do not modify.
- Field name: payload_offset
- Type: read
- Offset/size: 0x248/4
- Protocol: 2.08+
- If non-zero then this field contains the offset from the beginning
- of the protected-mode code to the payload.
- The payload may be compressed. The format of both the compressed and
- uncompressed data should be determined using the standard magic
- numbers. The currently supported compression formats are gzip
- (magic numbers 1F 8B or 1F 9E), bzip2 (magic number 42 5A), LZMA
- (magic number 5D 00), XZ (magic number FD 37), and LZ4 (magic number
- 02 21). The uncompressed payload is currently always ELF (magic
- number 7F 45 4C 46).
- Field name: payload_length
- Type: read
- Offset/size: 0x24c/4
- Protocol: 2.08+
- The length of the payload.
- Field name: setup_data
- Type: write (special)
- Offset/size: 0x250/8
- Protocol: 2.09+
- The 64-bit physical pointer to NULL terminated single linked list of
- struct setup_data. This is used to define a more extensible boot
- parameters passing mechanism. The definition of struct setup_data is
- as follow:
- struct setup_data {
- u64 next;
- u32 type;
- u32 len;
- u8 data[0];
- };
- Where, the next is a 64-bit physical pointer to the next node of
- linked list, the next field of the last node is 0; the type is used
- to identify the contents of data; the len is the length of data
- field; the data holds the real payload.
- This list may be modified at a number of points during the bootup
- process. Therefore, when modifying this list one should always make
- sure to consider the case where the linked list already contains
- entries.
- Field name: pref_address
- Type: read (reloc)
- Offset/size: 0x258/8
- Protocol: 2.10+
- This field, if nonzero, represents a preferred load address for the
- kernel. A relocating bootloader should attempt to load at this
- address if possible.
- A non-relocatable kernel will unconditionally move itself and to run
- at this address.
- Field name: init_size
- Type: read
- Offset/size: 0x260/4
- This field indicates the amount of linear contiguous memory starting
- at the kernel runtime start address that the kernel needs before it
- is capable of examining its memory map. This is not the same thing
- as the total amount of memory the kernel needs to boot, but it can
- be used by a relocating boot loader to help select a safe load
- address for the kernel.
- The kernel runtime start address is determined by the following algorithm:
- if (relocatable_kernel)
- runtime_start = align_up(load_address, kernel_alignment)
- else
- runtime_start = pref_address
- Field name: handover_offset
- Type: read
- Offset/size: 0x264/4
- This field is the offset from the beginning of the kernel image to
- the EFI handover protocol entry point. Boot loaders using the EFI
- handover protocol to boot the kernel should jump to this offset.
- See EFI HANDOVER PROTOCOL below for more details.
- **** THE IMAGE CHECKSUM
- From boot protocol version 2.08 onwards the CRC-32 is calculated over
- the entire file using the characteristic polynomial 0x04C11DB7 and an
- initial remainder of 0xffffffff. The checksum is appended to the
- file; therefore the CRC of the file up to the limit specified in the
- syssize field of the header is always 0.
- **** THE KERNEL COMMAND LINE
- The kernel command line has become an important way for the boot
- loader to communicate with the kernel. Some of its options are also
- relevant to the boot loader itself, see "special command line options"
- below.
- The kernel command line is a null-terminated string. The maximum
- length can be retrieved from the field cmdline_size. Before protocol
- version 2.06, the maximum was 255 characters. A string that is too
- long will be automatically truncated by the kernel.
- If the boot protocol version is 2.02 or later, the address of the
- kernel command line is given by the header field cmd_line_ptr (see
- above.) This address can be anywhere between the end of the setup
- heap and 0xA0000.
- If the protocol version is *not* 2.02 or higher, the kernel
- command line is entered using the following protocol:
- At offset 0x0020 (word), "cmd_line_magic", enter the magic
- number 0xA33F.
- At offset 0x0022 (word), "cmd_line_offset", enter the offset
- of the kernel command line (relative to the start of the
- real-mode kernel).
-
- The kernel command line *must* be within the memory region
- covered by setup_move_size, so you may need to adjust this
- field.
- **** MEMORY LAYOUT OF THE REAL-MODE CODE
- The real-mode code requires a stack/heap to be set up, as well as
- memory allocated for the kernel command line. This needs to be done
- in the real-mode accessible memory in bottom megabyte.
- It should be noted that modern machines often have a sizable Extended
- BIOS Data Area (EBDA). As a result, it is advisable to use as little
- of the low megabyte as possible.
- Unfortunately, under the following circumstances the 0x90000 memory
- segment has to be used:
- - When loading a zImage kernel ((loadflags & 0x01) == 0).
- - When loading a 2.01 or earlier boot protocol kernel.
- -> For the 2.00 and 2.01 boot protocols, the real-mode code
- can be loaded at another address, but it is internally
- relocated to 0x90000. For the "old" protocol, the
- real-mode code must be loaded at 0x90000.
- When loading at 0x90000, avoid using memory above 0x9a000.
- For boot protocol 2.02 or higher, the command line does not have to be
- located in the same 64K segment as the real-mode setup code; it is
- thus permitted to give the stack/heap the full 64K segment and locate
- the command line above it.
- The kernel command line should not be located below the real-mode
- code, nor should it be located in high memory.
- **** SAMPLE BOOT CONFIGURATION
- As a sample configuration, assume the following layout of the real
- mode segment:
- When loading below 0x90000, use the entire segment:
- 0x0000-0x7fff Real mode kernel
- 0x8000-0xdfff Stack and heap
- 0xe000-0xffff Kernel command line
- When loading at 0x90000 OR the protocol version is 2.01 or earlier:
- 0x0000-0x7fff Real mode kernel
- 0x8000-0x97ff Stack and heap
- 0x9800-0x9fff Kernel command line
- Such a boot loader should enter the following fields in the header:
- unsigned long base_ptr; /* base address for real-mode segment */
- if ( setup_sects == 0 ) {
- setup_sects = 4;
- }
- if ( protocol >= 0x0200 ) {
- type_of_loader = <type code>;
- if ( loading_initrd ) {
- ramdisk_image = <initrd_address>;
- ramdisk_size = <initrd_size>;
- }
- if ( protocol >= 0x0202 && loadflags & 0x01 )
- heap_end = 0xe000;
- else
- heap_end = 0x9800;
- if ( protocol >= 0x0201 ) {
- heap_end_ptr = heap_end - 0x200;
- loadflags |= 0x80; /* CAN_USE_HEAP */
- }
- if ( protocol >= 0x0202 ) {
- cmd_line_ptr = base_ptr + heap_end;
- strcpy(cmd_line_ptr, cmdline);
- } else {
- cmd_line_magic = 0xA33F;
- cmd_line_offset = heap_end;
- setup_move_size = heap_end + strlen(cmdline)+1;
- strcpy(base_ptr+cmd_line_offset, cmdline);
- }
- } else {
- /* Very old kernel */
- heap_end = 0x9800;
- cmd_line_magic = 0xA33F;
- cmd_line_offset = heap_end;
- /* A very old kernel MUST have its real-mode code
- loaded at 0x90000 */
- if ( base_ptr != 0x90000 ) {
- /* Copy the real-mode kernel */
- memcpy(0x90000, base_ptr, (setup_sects+1)*512);
- base_ptr = 0x90000; /* Relocated */
- }
- strcpy(0x90000+cmd_line_offset, cmdline);
- /* It is recommended to clear memory up to the 32K mark */
- memset(0x90000 + (setup_sects+1)*512, 0,
- (64-(setup_sects+1))*512);
- }
- **** LOADING THE REST OF THE KERNEL
- The 32-bit (non-real-mode) kernel starts at offset (setup_sects+1)*512
- in the kernel file (again, if setup_sects == 0 the real value is 4.)
- It should be loaded at address 0x10000 for Image/zImage kernels and
- 0x100000 for bzImage kernels.
- The kernel is a bzImage kernel if the protocol >= 2.00 and the 0x01
- bit (LOAD_HIGH) in the loadflags field is set:
- is_bzImage = (protocol >= 0x0200) && (loadflags & 0x01);
- load_address = is_bzImage ? 0x100000 : 0x10000;
- Note that Image/zImage kernels can be up to 512K in size, and thus use
- the entire 0x10000-0x90000 range of memory. This means it is pretty
- much a requirement for these kernels to load the real-mode part at
- 0x90000. bzImage kernels allow much more flexibility.
- **** SPECIAL COMMAND LINE OPTIONS
- If the command line provided by the boot loader is entered by the
- user, the user may expect the following command line options to work.
- They should normally not be deleted from the kernel command line even
- though not all of them are actually meaningful to the kernel. Boot
- loader authors who need additional command line options for the boot
- loader itself should get them registered in
- Documentation/kernel-parameters.txt to make sure they will not
- conflict with actual kernel options now or in the future.
- vga=<mode>
- <mode> here is either an integer (in C notation, either
- decimal, octal, or hexadecimal) or one of the strings
- "normal" (meaning 0xFFFF), "ext" (meaning 0xFFFE) or "ask"
- (meaning 0xFFFD). This value should be entered into the
- vid_mode field, as it is used by the kernel before the command
- line is parsed.
- mem=<size>
- <size> is an integer in C notation optionally followed by
- (case insensitive) K, M, G, T, P or E (meaning << 10, << 20,
- << 30, << 40, << 50 or << 60). This specifies the end of
- memory to the kernel. This affects the possible placement of
- an initrd, since an initrd should be placed near end of
- memory. Note that this is an option to *both* the kernel and
- the bootloader!
- initrd=<file>
- An initrd should be loaded. The meaning of <file> is
- obviously bootloader-dependent, and some boot loaders
- (e.g. LILO) do not have such a command.
- In addition, some boot loaders add the following options to the
- user-specified command line:
- BOOT_IMAGE=<file>
- The boot image which was loaded. Again, the meaning of <file>
- is obviously bootloader-dependent.
- auto
- The kernel was booted without explicit user intervention.
- If these options are added by the boot loader, it is highly
- recommended that they are located *first*, before the user-specified
- or configuration-specified command line. Otherwise, "init=/bin/sh"
- gets confused by the "auto" option.
- **** RUNNING THE KERNEL
- The kernel is started by jumping to the kernel entry point, which is
- located at *segment* offset 0x20 from the start of the real mode
- kernel. This means that if you loaded your real-mode kernel code at
- 0x90000, the kernel entry point is 9020:0000.
- At entry, ds = es = ss should point to the start of the real-mode
- kernel code (0x9000 if the code is loaded at 0x90000), sp should be
- set up properly, normally pointing to the top of the heap, and
- interrupts should be disabled. Furthermore, to guard against bugs in
- the kernel, it is recommended that the boot loader sets fs = gs = ds =
- es = ss.
- In our example from above, we would do:
- /* Note: in the case of the "old" kernel protocol, base_ptr must
- be == 0x90000 at this point; see the previous sample code */
- seg = base_ptr >> 4;
- cli(); /* Enter with interrupts disabled! */
- /* Set up the real-mode kernel stack */
- _SS = seg;
- _SP = heap_end;
- _DS = _ES = _FS = _GS = seg;
- jmp_far(seg+0x20, 0); /* Run the kernel */
- If your boot sector accesses a floppy drive, it is recommended to
- switch off the floppy motor before running the kernel, since the
- kernel boot leaves interrupts off and thus the motor will not be
- switched off, especially if the loaded kernel has the floppy driver as
- a demand-loaded module!
- **** ADVANCED BOOT LOADER HOOKS
- If the boot loader runs in a particularly hostile environment (such as
- LOADLIN, which runs under DOS) it may be impossible to follow the
- standard memory location requirements. Such a boot loader may use the
- following hooks that, if set, are invoked by the kernel at the
- appropriate time. The use of these hooks should probably be
- considered an absolutely last resort!
- IMPORTANT: All the hooks are required to preserve %esp, %ebp, %esi and
- %edi across invocation.
- realmode_swtch:
- A 16-bit real mode far subroutine invoked immediately before
- entering protected mode. The default routine disables NMI, so
- your routine should probably do so, too.
- code32_start:
- A 32-bit flat-mode routine *jumped* to immediately after the
- transition to protected mode, but before the kernel is
- uncompressed. No segments, except CS, are guaranteed to be
- set up (current kernels do, but older ones do not); you should
- set them up to BOOT_DS (0x18) yourself.
- After completing your hook, you should jump to the address
- that was in this field before your boot loader overwrote it
- (relocated, if appropriate.)
- **** 32-bit BOOT PROTOCOL
- For machine with some new BIOS other than legacy BIOS, such as EFI,
- LinuxBIOS, etc, and kexec, the 16-bit real mode setup code in kernel
- based on legacy BIOS can not be used, so a 32-bit boot protocol needs
- to be defined.
- In 32-bit boot protocol, the first step in loading a Linux kernel
- should be to setup the boot parameters (struct boot_params,
- traditionally known as "zero page"). The memory for struct boot_params
- should be allocated and initialized to all zero. Then the setup header
- from offset 0x01f1 of kernel image on should be loaded into struct
- boot_params and examined. The end of setup header can be calculated as
- follow:
- 0x0202 + byte value at offset 0x0201
- In addition to read/modify/write the setup header of the struct
- boot_params as that of 16-bit boot protocol, the boot loader should
- also fill the additional fields of the struct boot_params as that
- described in zero-page.txt.
- After setting up the struct boot_params, the boot loader can load the
- 32/64-bit kernel in the same way as that of 16-bit boot protocol.
- In 32-bit boot protocol, the kernel is started by jumping to the
- 32-bit kernel entry point, which is the start address of loaded
- 32/64-bit kernel.
- At entry, the CPU must be in 32-bit protected mode with paging
- disabled; a GDT must be loaded with the descriptors for selectors
- __BOOT_CS(0x10) and __BOOT_DS(0x18); both descriptors must be 4G flat
- segment; __BOOT_CS must have execute/read permission, and __BOOT_DS
- must have read/write permission; CS must be __BOOT_CS and DS, ES, SS
- must be __BOOT_DS; interrupt must be disabled; %esi must hold the base
- address of the struct boot_params; %ebp, %edi and %ebx must be zero.
- **** 64-bit BOOT PROTOCOL
- For machine with 64bit cpus and 64bit kernel, we could use 64bit bootloader
- and we need a 64-bit boot protocol.
- In 64-bit boot protocol, the first step in loading a Linux kernel
- should be to setup the boot parameters (struct boot_params,
- traditionally known as "zero page"). The memory for struct boot_params
- could be allocated anywhere (even above 4G) and initialized to all zero.
- Then, the setup header at offset 0x01f1 of kernel image on should be
- loaded into struct boot_params and examined. The end of setup header
- can be calculated as follows:
- 0x0202 + byte value at offset 0x0201
- In addition to read/modify/write the setup header of the struct
- boot_params as that of 16-bit boot protocol, the boot loader should
- also fill the additional fields of the struct boot_params as described
- in zero-page.txt.
- After setting up the struct boot_params, the boot loader can load
- 64-bit kernel in the same way as that of 16-bit boot protocol, but
- kernel could be loaded above 4G.
- In 64-bit boot protocol, the kernel is started by jumping to the
- 64-bit kernel entry point, which is the start address of loaded
- 64-bit kernel plus 0x200.
- At entry, the CPU must be in 64-bit mode with paging enabled.
- The range with setup_header.init_size from start address of loaded
- kernel and zero page and command line buffer get ident mapping;
- a GDT must be loaded with the descriptors for selectors
- __BOOT_CS(0x10) and __BOOT_DS(0x18); both descriptors must be 4G flat
- segment; __BOOT_CS must have execute/read permission, and __BOOT_DS
- must have read/write permission; CS must be __BOOT_CS and DS, ES, SS
- must be __BOOT_DS; interrupt must be disabled; %rsi must hold the base
- address of the struct boot_params.
- **** EFI HANDOVER PROTOCOL
- This protocol allows boot loaders to defer initialisation to the EFI
- boot stub. The boot loader is required to load the kernel/initrd(s)
- from the boot media and jump to the EFI handover protocol entry point
- which is hdr->handover_offset bytes from the beginning of
- startup_{32,64}.
- The function prototype for the handover entry point looks like this,
- efi_main(void *handle, efi_system_table_t *table, struct boot_params *bp)
- 'handle' is the EFI image handle passed to the boot loader by the EFI
- firmware, 'table' is the EFI system table - these are the first two
- arguments of the "handoff state" as described in section 2.3 of the
- UEFI specification. 'bp' is the boot loader-allocated boot params.
- The boot loader *must* fill out the following fields in bp,
- o hdr.code32_start
- o hdr.cmd_line_ptr
- o hdr.ramdisk_image (if applicable)
- o hdr.ramdisk_size (if applicable)
- All other fields should be zero.
|