18aa05055SPaul BeesleyFirmware Design 28aa05055SPaul Beesley=============== 340d553cfSPaul Beesley 440d553cfSPaul BeesleyTrusted Firmware-A (TF-A) implements a subset of the Trusted Board Boot 534760951SPaul BeesleyRequirements (TBBR) Platform Design Document (PDD) for Arm reference 634760951SPaul Beesleyplatforms. 734760951SPaul Beesley 834760951SPaul BeesleyThe TBB sequence starts when the platform is powered on and runs up 940d553cfSPaul Beesleyto the stage where it hands-off control to firmware running in the normal 1040d553cfSPaul Beesleyworld in DRAM. This is the cold boot path. 1140d553cfSPaul Beesley 1234760951SPaul BeesleyTF-A also implements the `Power State Coordination Interface PDD`_ as a 1340d553cfSPaul Beesleyruntime service. PSCI is the interface from normal world software to firmware 1440d553cfSPaul Beesleyimplementing power management use-cases (for example, secondary CPU boot, 1540d553cfSPaul Beesleyhotplug and idle). Normal world software can access TF-A runtime services via 1640d553cfSPaul Beesleythe Arm SMC (Secure Monitor Call) instruction. The SMC instruction must be 1734760951SPaul Beesleyused as mandated by the SMC Calling Convention (`SMCCC`_). 1840d553cfSPaul Beesley 1940d553cfSPaul BeesleyTF-A implements a framework for configuring and managing interrupts generated 2040d553cfSPaul Beesleyin either security state. The details of the interrupt management framework 2134760951SPaul Beesleyand its design can be found in :ref:`Interrupt Management Framework`. 2240d553cfSPaul Beesley 2340d553cfSPaul BeesleyTF-A also implements a library for setting up and managing the translation 2434760951SPaul Beesleytables. The details of this library can be found in 2534760951SPaul Beesley:ref:`Translation (XLAT) Tables Library`. 2640d553cfSPaul Beesley 2740d553cfSPaul BeesleyTF-A can be built to support either AArch64 or AArch32 execution state. 287446c266SZelalem Aweke 2924566a3fSHarrison Mutai.. note:: 307446c266SZelalem Aweke The descriptions in this chapter are for the Arm TrustZone architecture. 3124566a3fSHarrison Mutai For changes to the firmware design for the `Arm Confidential Compute 3224566a3fSHarrison Mutai Architecture (Arm CCA)`_ please refer to the chapter :ref:`Realm Management 3324566a3fSHarrison Mutai Extension (RME)`. 347446c266SZelalem Aweke 3540d553cfSPaul BeesleyCold boot 3640d553cfSPaul Beesley--------- 3740d553cfSPaul Beesley 3840d553cfSPaul BeesleyThe cold boot path starts when the platform is physically turned on. If 3940d553cfSPaul Beesley``COLD_BOOT_SINGLE_CPU=0``, one of the CPUs released from reset is chosen as the 4040d553cfSPaul Beesleyprimary CPU, and the remaining CPUs are considered secondary CPUs. The primary 4140d553cfSPaul BeesleyCPU is chosen through platform-specific means. The cold boot path is mainly 4240d553cfSPaul Beesleyexecuted by the primary CPU, other than essential CPU initialization executed by 4340d553cfSPaul Beesleyall CPUs. The secondary CPUs are kept in a safe platform-specific state until 4440d553cfSPaul Beesleythe primary CPU has performed enough initialization to boot them. 4540d553cfSPaul Beesley 4634760951SPaul BeesleyRefer to the :ref:`CPU Reset` for more information on the effect of the 4740d553cfSPaul Beesley``COLD_BOOT_SINGLE_CPU`` platform build option. 4840d553cfSPaul Beesley 4940d553cfSPaul BeesleyThe cold boot path in this implementation of TF-A depends on the execution 5040d553cfSPaul Beesleystate. For AArch64, it is divided into five steps (in order of execution): 5140d553cfSPaul Beesley 5240d553cfSPaul Beesley- Boot Loader stage 1 (BL1) *AP Trusted ROM* 5340d553cfSPaul Beesley- Boot Loader stage 2 (BL2) *Trusted Boot Firmware* 5440d553cfSPaul Beesley- Boot Loader stage 3-1 (BL31) *EL3 Runtime Software* 5540d553cfSPaul Beesley- Boot Loader stage 3-2 (BL32) *Secure-EL1 Payload* (optional) 5640d553cfSPaul Beesley- Boot Loader stage 3-3 (BL33) *Non-trusted Firmware* 5740d553cfSPaul Beesley 5840d553cfSPaul BeesleyFor AArch32, it is divided into four steps (in order of execution): 5940d553cfSPaul Beesley 6040d553cfSPaul Beesley- Boot Loader stage 1 (BL1) *AP Trusted ROM* 6140d553cfSPaul Beesley- Boot Loader stage 2 (BL2) *Trusted Boot Firmware* 6240d553cfSPaul Beesley- Boot Loader stage 3-2 (BL32) *EL3 Runtime Software* 6340d553cfSPaul Beesley- Boot Loader stage 3-3 (BL33) *Non-trusted Firmware* 6440d553cfSPaul Beesley 6540d553cfSPaul BeesleyArm development platforms (Fixed Virtual Platforms (FVPs) and Juno) implement a 6640d553cfSPaul Beesleycombination of the following types of memory regions. Each bootloader stage uses 6740d553cfSPaul Beesleyone or more of these memory regions. 6840d553cfSPaul Beesley 6940d553cfSPaul Beesley- Regions accessible from both non-secure and secure states. For example, 7040d553cfSPaul Beesley non-trusted SRAM, ROM and DRAM. 7140d553cfSPaul Beesley- Regions accessible from only the secure state. For example, trusted SRAM and 7240d553cfSPaul Beesley ROM. The FVPs also implement the trusted DRAM which is statically 7340d553cfSPaul Beesley configured. Additionally, the Base FVPs and Juno development platform 7440d553cfSPaul Beesley configure the TrustZone Controller (TZC) to create a region in the DRAM 7540d553cfSPaul Beesley which is accessible only from the secure state. 7640d553cfSPaul Beesley 7740d553cfSPaul BeesleyThe sections below provide the following details: 7840d553cfSPaul Beesley 7940d553cfSPaul Beesley- dynamic configuration of Boot Loader stages 8040d553cfSPaul Beesley- initialization and execution of the first three stages during cold boot 8140d553cfSPaul Beesley- specification of the EL3 Runtime Software (BL31 for AArch64 and BL32 for 8240d553cfSPaul Beesley AArch32) entrypoint requirements for use by alternative Trusted Boot 8340d553cfSPaul Beesley Firmware in place of the provided BL1 and BL2 8440d553cfSPaul Beesley 8540d553cfSPaul BeesleyDynamic Configuration during cold boot 8640d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 8740d553cfSPaul Beesley 8840d553cfSPaul BeesleyEach of the Boot Loader stages may be dynamically configured if required by the 8940d553cfSPaul Beesleyplatform. The Boot Loader stage may optionally specify a firmware 9040d553cfSPaul Beesleyconfiguration file and/or hardware configuration file as listed below: 9140d553cfSPaul Beesley 92089fc624SManish V Badarkhe- FW_CONFIG - The firmware configuration file. Holds properties shared across 93089fc624SManish V Badarkhe all BLx images. 94089fc624SManish V Badarkhe An example is the "dtb-registry" node, which contains the information about 95089fc624SManish V Badarkhe the other device tree configurations (load-address, size, image_id). 9640d553cfSPaul Beesley- HW_CONFIG - The hardware configuration file. Can be shared by all Boot Loader 9740d553cfSPaul Beesley stages and also by the Normal World Rich OS. 9840d553cfSPaul Beesley- TB_FW_CONFIG - Trusted Boot Firmware configuration file. Shared between BL1 9940d553cfSPaul Beesley and BL2. 10040d553cfSPaul Beesley- SOC_FW_CONFIG - SoC Firmware configuration file. Used by BL31. 10140d553cfSPaul Beesley- TOS_FW_CONFIG - Trusted OS Firmware configuration file. Used by Trusted OS 10240d553cfSPaul Beesley (BL32). 10340d553cfSPaul Beesley- NT_FW_CONFIG - Non Trusted Firmware configuration file. Used by Non-trusted 10440d553cfSPaul Beesley firmware (BL33). 10540d553cfSPaul Beesley 10640d553cfSPaul BeesleyThe Arm development platforms use the Flattened Device Tree format for the 10740d553cfSPaul Beesleydynamic configuration files. 10840d553cfSPaul Beesley 10940d553cfSPaul BeesleyEach Boot Loader stage can pass up to 4 arguments via registers to the next 11040d553cfSPaul Beesleystage. BL2 passes the list of the next images to execute to the *EL3 Runtime 11140d553cfSPaul BeesleySoftware* (BL31 for AArch64 and BL32 for AArch32) via `arg0`. All the other 11240d553cfSPaul Beesleyarguments are platform defined. The Arm development platforms use the following 11340d553cfSPaul Beesleyconvention: 11440d553cfSPaul Beesley 11540d553cfSPaul Beesley- BL1 passes the address of a meminfo_t structure to BL2 via ``arg1``. This 11640d553cfSPaul Beesley structure contains the memory layout available to BL2. 11740d553cfSPaul Beesley- When dynamic configuration files are present, the firmware configuration for 11840d553cfSPaul Beesley the next Boot Loader stage is populated in the first available argument and 11940d553cfSPaul Beesley the generic hardware configuration is passed the next available argument. 12040d553cfSPaul Beesley For example, 12140d553cfSPaul Beesley 122089fc624SManish V Badarkhe - FW_CONFIG is loaded by BL1, then its address is passed in ``arg0`` to BL2. 123089fc624SManish V Badarkhe - TB_FW_CONFIG address is retrieved by BL2 from FW_CONFIG device tree. 12440d553cfSPaul Beesley - If HW_CONFIG is loaded by BL1, then its address is passed in ``arg2`` to 12540d553cfSPaul Beesley BL2. Note, ``arg1`` is already used for meminfo_t. 12640d553cfSPaul Beesley - If SOC_FW_CONFIG is loaded by BL2, then its address is passed in ``arg1`` 12740d553cfSPaul Beesley to BL31. Note, ``arg0`` is used to pass the list of executable images. 12840d553cfSPaul Beesley - Similarly, if HW_CONFIG is loaded by BL1 or BL2, then its address is 12940d553cfSPaul Beesley passed in ``arg2`` to BL31. 13040d553cfSPaul Beesley - For other BL3x images, if the firmware configuration file is loaded by 13140d553cfSPaul Beesley BL2, then its address is passed in ``arg0`` and if HW_CONFIG is loaded 13240d553cfSPaul Beesley then its address is passed in ``arg1``. 133b4a87836SManish V Badarkhe - In case of the Arm FVP platform, FW_CONFIG address passed in ``arg1`` to 134b4a87836SManish V Badarkhe BL31/SP_MIN, and the SOC_FW_CONFIG and HW_CONFIG details are retrieved 135b4a87836SManish V Badarkhe from FW_CONFIG device tree. 13640d553cfSPaul Beesley 13740d553cfSPaul BeesleyBL1 13840d553cfSPaul Beesley~~~ 13940d553cfSPaul Beesley 14040d553cfSPaul BeesleyThis stage begins execution from the platform's reset vector at EL3. The reset 14140d553cfSPaul Beesleyaddress is platform dependent but it is usually located in a Trusted ROM area. 14240d553cfSPaul BeesleyThe BL1 data section is copied to trusted SRAM at runtime. 14340d553cfSPaul Beesley 14440d553cfSPaul BeesleyOn the Arm development platforms, BL1 code starts execution from the reset 14540d553cfSPaul Beesleyvector defined by the constant ``BL1_RO_BASE``. The BL1 data section is copied 14640d553cfSPaul Beesleyto the top of trusted SRAM as defined by the constant ``BL1_RW_BASE``. 14740d553cfSPaul Beesley 14840d553cfSPaul BeesleyThe functionality implemented by this stage is as follows. 14940d553cfSPaul Beesley 15040d553cfSPaul BeesleyDetermination of boot path 15140d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^ 15240d553cfSPaul Beesley 15340d553cfSPaul BeesleyWhenever a CPU is released from reset, BL1 needs to distinguish between a warm 15440d553cfSPaul Beesleyboot and a cold boot. This is done using platform-specific mechanisms (see the 15534760951SPaul Beesley``plat_get_my_entrypoint()`` function in the :ref:`Porting Guide`). In the case 15634760951SPaul Beesleyof a warm boot, a CPU is expected to continue execution from a separate 15740d553cfSPaul Beesleyentrypoint. In the case of a cold boot, the secondary CPUs are placed in a safe 15840d553cfSPaul Beesleyplatform-specific state (see the ``plat_secondary_cold_boot_setup()`` function in 15934760951SPaul Beesleythe :ref:`Porting Guide`) while the primary CPU executes the remaining cold boot 16034760951SPaul Beesleypath as described in the following sections. 16140d553cfSPaul Beesley 16240d553cfSPaul BeesleyThis step only applies when ``PROGRAMMABLE_RESET_ADDRESS=0``. Refer to the 16334760951SPaul Beesley:ref:`CPU Reset` for more information on the effect of the 16440d553cfSPaul Beesley``PROGRAMMABLE_RESET_ADDRESS`` platform build option. 16540d553cfSPaul Beesley 16640d553cfSPaul BeesleyArchitectural initialization 16740d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 16840d553cfSPaul Beesley 16940d553cfSPaul BeesleyBL1 performs minimal architectural initialization as follows. 17040d553cfSPaul Beesley 17140d553cfSPaul Beesley- Exception vectors 17240d553cfSPaul Beesley 17340d553cfSPaul Beesley BL1 sets up simple exception vectors for both synchronous and asynchronous 17440d553cfSPaul Beesley exceptions. The default behavior upon receiving an exception is to populate 17540d553cfSPaul Beesley a status code in the general purpose register ``X0/R0`` and call the 17634760951SPaul Beesley ``plat_report_exception()`` function (see the :ref:`Porting Guide`). The 17734760951SPaul Beesley status code is one of: 17840d553cfSPaul Beesley 17940d553cfSPaul Beesley For AArch64: 18040d553cfSPaul Beesley 18140d553cfSPaul Beesley :: 18240d553cfSPaul Beesley 18340d553cfSPaul Beesley 0x0 : Synchronous exception from Current EL with SP_EL0 18440d553cfSPaul Beesley 0x1 : IRQ exception from Current EL with SP_EL0 18540d553cfSPaul Beesley 0x2 : FIQ exception from Current EL with SP_EL0 18640d553cfSPaul Beesley 0x3 : System Error exception from Current EL with SP_EL0 18740d553cfSPaul Beesley 0x4 : Synchronous exception from Current EL with SP_ELx 18840d553cfSPaul Beesley 0x5 : IRQ exception from Current EL with SP_ELx 18940d553cfSPaul Beesley 0x6 : FIQ exception from Current EL with SP_ELx 19040d553cfSPaul Beesley 0x7 : System Error exception from Current EL with SP_ELx 19140d553cfSPaul Beesley 0x8 : Synchronous exception from Lower EL using aarch64 19240d553cfSPaul Beesley 0x9 : IRQ exception from Lower EL using aarch64 19340d553cfSPaul Beesley 0xa : FIQ exception from Lower EL using aarch64 19440d553cfSPaul Beesley 0xb : System Error exception from Lower EL using aarch64 19540d553cfSPaul Beesley 0xc : Synchronous exception from Lower EL using aarch32 19640d553cfSPaul Beesley 0xd : IRQ exception from Lower EL using aarch32 19740d553cfSPaul Beesley 0xe : FIQ exception from Lower EL using aarch32 19840d553cfSPaul Beesley 0xf : System Error exception from Lower EL using aarch32 19940d553cfSPaul Beesley 20040d553cfSPaul Beesley For AArch32: 20140d553cfSPaul Beesley 20240d553cfSPaul Beesley :: 20340d553cfSPaul Beesley 20440d553cfSPaul Beesley 0x10 : User mode 20540d553cfSPaul Beesley 0x11 : FIQ mode 20640d553cfSPaul Beesley 0x12 : IRQ mode 20740d553cfSPaul Beesley 0x13 : SVC mode 20840d553cfSPaul Beesley 0x16 : Monitor mode 20940d553cfSPaul Beesley 0x17 : Abort mode 21040d553cfSPaul Beesley 0x1a : Hypervisor mode 21140d553cfSPaul Beesley 0x1b : Undefined mode 21240d553cfSPaul Beesley 0x1f : System mode 21340d553cfSPaul Beesley 21440d553cfSPaul Beesley The ``plat_report_exception()`` implementation on the Arm FVP port programs 21540d553cfSPaul Beesley the Versatile Express System LED register in the following format to 21640d553cfSPaul Beesley indicate the occurrence of an unexpected exception: 21740d553cfSPaul Beesley 21840d553cfSPaul Beesley :: 21940d553cfSPaul Beesley 22040d553cfSPaul Beesley SYS_LED[0] - Security state (Secure=0/Non-Secure=1) 22140d553cfSPaul Beesley SYS_LED[2:1] - Exception Level (EL3=0x3, EL2=0x2, EL1=0x1, EL0=0x0) 22240d553cfSPaul Beesley For AArch32 it is always 0x0 22340d553cfSPaul Beesley SYS_LED[7:3] - Exception Class (Sync/Async & origin). This is the value 22440d553cfSPaul Beesley of the status code 22540d553cfSPaul Beesley 22640d553cfSPaul Beesley A write to the LED register reflects in the System LEDs (S6LED0..7) in the 22740d553cfSPaul Beesley CLCD window of the FVP. 22840d553cfSPaul Beesley 22940d553cfSPaul Beesley BL1 does not expect to receive any exceptions other than the SMC exception. 23040d553cfSPaul Beesley For the latter, BL1 installs a simple stub. The stub expects to receive a 23140d553cfSPaul Beesley limited set of SMC types (determined by their function IDs in the general 23240d553cfSPaul Beesley purpose register ``X0/R0``): 23340d553cfSPaul Beesley 23440d553cfSPaul Beesley - ``BL1_SMC_RUN_IMAGE``: This SMC is raised by BL2 to make BL1 pass control 23540d553cfSPaul Beesley to EL3 Runtime Software. 23634760951SPaul Beesley - All SMCs listed in section "BL1 SMC Interface" in the :ref:`Firmware Update (FWU)` 23740d553cfSPaul Beesley Design Guide are supported for AArch64 only. These SMCs are currently 23840d553cfSPaul Beesley not supported when BL1 is built for AArch32. 23940d553cfSPaul Beesley 24040d553cfSPaul Beesley Any other SMC leads to an assertion failure. 24140d553cfSPaul Beesley 24240d553cfSPaul Beesley- CPU initialization 24340d553cfSPaul Beesley 24440d553cfSPaul Beesley BL1 calls the ``reset_handler()`` function which in turn calls the CPU 24540d553cfSPaul Beesley specific reset handler function (see the section: "CPU specific operations 24640d553cfSPaul Beesley framework"). 24740d553cfSPaul Beesley 24840d553cfSPaul BeesleyPlatform initialization 24940d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^ 25040d553cfSPaul Beesley 25140d553cfSPaul BeesleyOn Arm platforms, BL1 performs the following platform initializations: 25240d553cfSPaul Beesley 25340d553cfSPaul Beesley- Enable the Trusted Watchdog. 25440d553cfSPaul Beesley- Initialize the console. 25540d553cfSPaul Beesley- Configure the Interconnect to enable hardware coherency. 25640d553cfSPaul Beesley- Enable the MMU and map the memory it needs to access. 25740d553cfSPaul Beesley- Configure any required platform storage to load the next bootloader image 25840d553cfSPaul Beesley (BL2). 25940d553cfSPaul Beesley- If the BL1 dynamic configuration file, ``TB_FW_CONFIG``, is available, then 26040d553cfSPaul Beesley load it to the platform defined address and make it available to BL2 via 26140d553cfSPaul Beesley ``arg0``. 26240d553cfSPaul Beesley- Configure the system timer and program the `CNTFRQ_EL0` for use by NS-BL1U 26340d553cfSPaul Beesley and NS-BL2U firmware update images. 26440d553cfSPaul Beesley 26540d553cfSPaul BeesleyFirmware Update detection and execution 26640d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 26740d553cfSPaul Beesley 26840d553cfSPaul BeesleyAfter performing platform setup, BL1 common code calls 26934760951SPaul Beesley``bl1_plat_get_next_image_id()`` to determine if :ref:`Firmware Update (FWU)` is 27034760951SPaul Beesleyrequired or to proceed with the normal boot process. If the platform code 27134760951SPaul Beesleyreturns ``BL2_IMAGE_ID`` then the normal boot sequence is executed as described 27234760951SPaul Beesleyin the next section, else BL1 assumes that :ref:`Firmware Update (FWU)` is 27334760951SPaul Beesleyrequired and execution passes to the first image in the 27434760951SPaul Beesley:ref:`Firmware Update (FWU)` process. In either case, BL1 retrieves a descriptor 27534760951SPaul Beesleyof the next image by calling ``bl1_plat_get_image_desc()``. The image descriptor 27634760951SPaul Beesleycontains an ``entry_point_info_t`` structure, which BL1 uses to initialize the 27734760951SPaul Beesleyexecution state of the next image. 27840d553cfSPaul Beesley 27940d553cfSPaul BeesleyBL2 image load and execution 28040d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 28140d553cfSPaul Beesley 28240d553cfSPaul BeesleyIn the normal boot flow, BL1 execution continues as follows: 28340d553cfSPaul Beesley 28440d553cfSPaul Beesley#. BL1 prints the following string from the primary CPU to indicate successful 28540d553cfSPaul Beesley execution of the BL1 stage: 28640d553cfSPaul Beesley 28740d553cfSPaul Beesley :: 28840d553cfSPaul Beesley 28940d553cfSPaul Beesley "Booting Trusted Firmware" 29040d553cfSPaul Beesley 29140d553cfSPaul Beesley#. BL1 loads a BL2 raw binary image from platform storage, at a 29240d553cfSPaul Beesley platform-specific base address. Prior to the load, BL1 invokes 29340d553cfSPaul Beesley ``bl1_plat_handle_pre_image_load()`` which allows the platform to update or 29440d553cfSPaul Beesley use the image information. If the BL2 image file is not present or if 29540d553cfSPaul Beesley there is not enough free trusted SRAM the following error message is 29640d553cfSPaul Beesley printed: 29740d553cfSPaul Beesley 29840d553cfSPaul Beesley :: 29940d553cfSPaul Beesley 30040d553cfSPaul Beesley "Failed to load BL2 firmware." 30140d553cfSPaul Beesley 30240d553cfSPaul Beesley#. BL1 invokes ``bl1_plat_handle_post_image_load()`` which again is intended 30340d553cfSPaul Beesley for platforms to take further action after image load. This function must 30440d553cfSPaul Beesley populate the necessary arguments for BL2, which may also include the memory 30540d553cfSPaul Beesley layout. Further description of the memory layout can be found later 30640d553cfSPaul Beesley in this document. 30740d553cfSPaul Beesley 30840d553cfSPaul Beesley#. BL1 passes control to the BL2 image at Secure EL1 (for AArch64) or at 30940d553cfSPaul Beesley Secure SVC mode (for AArch32), starting from its load address. 31040d553cfSPaul Beesley 31140d553cfSPaul BeesleyBL2 31240d553cfSPaul Beesley~~~ 31340d553cfSPaul Beesley 31440d553cfSPaul BeesleyBL1 loads and passes control to BL2 at Secure-EL1 (for AArch64) or at Secure 31540d553cfSPaul BeesleySVC mode (for AArch32) . BL2 is linked against and loaded at a platform-specific 31640d553cfSPaul Beesleybase address (more information can be found later in this document). 31740d553cfSPaul BeesleyThe functionality implemented by BL2 is as follows. 31840d553cfSPaul Beesley 31940d553cfSPaul BeesleyArchitectural initialization 32040d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 32140d553cfSPaul Beesley 32240d553cfSPaul BeesleyFor AArch64, BL2 performs the minimal architectural initialization required 32340d553cfSPaul Beesleyfor subsequent stages of TF-A and normal world software. EL1 and EL0 are given 324093ba62eSPeng Fanaccess to Floating Point and Advanced SIMD registers by setting the 32540d553cfSPaul Beesley``CPACR.FPEN`` bits. 32640d553cfSPaul Beesley 32740d553cfSPaul BeesleyFor AArch32, the minimal architectural initialization required for subsequent 32840d553cfSPaul Beesleystages of TF-A and normal world software is taken care of in BL1 as both BL1 32940d553cfSPaul Beesleyand BL2 execute at PL1. 33040d553cfSPaul Beesley 33140d553cfSPaul BeesleyPlatform initialization 33240d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^ 33340d553cfSPaul Beesley 33440d553cfSPaul BeesleyOn Arm platforms, BL2 performs the following platform initializations: 33540d553cfSPaul Beesley 33640d553cfSPaul Beesley- Initialize the console. 33740d553cfSPaul Beesley- Configure any required platform storage to allow loading further bootloader 33840d553cfSPaul Beesley images. 33940d553cfSPaul Beesley- Enable the MMU and map the memory it needs to access. 34040d553cfSPaul Beesley- Perform platform security setup to allow access to controlled components. 34140d553cfSPaul Beesley- Reserve some memory for passing information to the next bootloader image 34240d553cfSPaul Beesley EL3 Runtime Software and populate it. 34340d553cfSPaul Beesley- Define the extents of memory available for loading each subsequent 34440d553cfSPaul Beesley bootloader image. 34540d553cfSPaul Beesley- If BL1 has passed TB_FW_CONFIG dynamic configuration file in ``arg0``, 34640d553cfSPaul Beesley then parse it. 34740d553cfSPaul Beesley 34840d553cfSPaul BeesleyImage loading in BL2 34940d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^ 35040d553cfSPaul Beesley 35140d553cfSPaul BeesleyBL2 generic code loads the images based on the list of loadable images 35240d553cfSPaul Beesleyprovided by the platform. BL2 passes the list of executable images 35340d553cfSPaul Beesleyprovided by the platform to the next handover BL image. 35440d553cfSPaul Beesley 35540d553cfSPaul BeesleyThe list of loadable images provided by the platform may also contain 35640d553cfSPaul Beesleydynamic configuration files. The files are loaded and can be parsed as 35740d553cfSPaul Beesleyneeded in the ``bl2_plat_handle_post_image_load()`` function. These 35840d553cfSPaul Beesleyconfiguration files can be passed to next Boot Loader stages as arguments 35940d553cfSPaul Beesleyby updating the corresponding entrypoint information in this function. 36040d553cfSPaul Beesley 36140d553cfSPaul BeesleySCP_BL2 (System Control Processor Firmware) image load 36240d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 36340d553cfSPaul Beesley 36440d553cfSPaul BeesleySome systems have a separate System Control Processor (SCP) for power, clock, 36540d553cfSPaul Beesleyreset and system control. BL2 loads the optional SCP_BL2 image from platform 36640d553cfSPaul Beesleystorage into a platform-specific region of secure memory. The subsequent 36740d553cfSPaul Beesleyhandling of SCP_BL2 is platform specific. For example, on the Juno Arm 36840d553cfSPaul Beesleydevelopment platform port the image is transferred into SCP's internal memory 36940d553cfSPaul Beesleyusing the Boot Over MHU (BOM) protocol after being loaded in the trusted SRAM 37040d553cfSPaul Beesleymemory. The SCP executes SCP_BL2 and signals to the Application Processor (AP) 37140d553cfSPaul Beesleyfor BL2 execution to continue. 37240d553cfSPaul Beesley 37340d553cfSPaul BeesleyEL3 Runtime Software image load 37440d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 37540d553cfSPaul Beesley 37640d553cfSPaul BeesleyBL2 loads the EL3 Runtime Software image from platform storage into a platform- 37740d553cfSPaul Beesleyspecific address in trusted SRAM. If there is not enough memory to load the 37840d553cfSPaul Beesleyimage or image is missing it leads to an assertion failure. 37940d553cfSPaul Beesley 38040d553cfSPaul BeesleyAArch64 BL32 (Secure-EL1 Payload) image load 38140d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 38240d553cfSPaul Beesley 38340d553cfSPaul BeesleyBL2 loads the optional BL32 image from platform storage into a platform- 38440d553cfSPaul Beesleyspecific region of secure memory. The image executes in the secure world. BL2 38540d553cfSPaul Beesleyrelies on BL31 to pass control to the BL32 image, if present. Hence, BL2 38640d553cfSPaul Beesleypopulates a platform-specific area of memory with the entrypoint/load-address 38740d553cfSPaul Beesleyof the BL32 image. The value of the Saved Processor Status Register (``SPSR``) 38840d553cfSPaul Beesleyfor entry into BL32 is not determined by BL2, it is initialized by the 38940d553cfSPaul BeesleySecure-EL1 Payload Dispatcher (see later) within BL31, which is responsible for 39040d553cfSPaul Beesleymanaging interaction with BL32. This information is passed to BL31. 39140d553cfSPaul Beesley 39240d553cfSPaul BeesleyBL33 (Non-trusted Firmware) image load 39340d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 39440d553cfSPaul Beesley 39540d553cfSPaul BeesleyBL2 loads the BL33 image (e.g. UEFI or other test or boot software) from 39640d553cfSPaul Beesleyplatform storage into non-secure memory as defined by the platform. 39740d553cfSPaul Beesley 39840d553cfSPaul BeesleyBL2 relies on EL3 Runtime Software to pass control to BL33 once secure state 39940d553cfSPaul Beesleyinitialization is complete. Hence, BL2 populates a platform-specific area of 40040d553cfSPaul Beesleymemory with the entrypoint and Saved Program Status Register (``SPSR``) of the 40140d553cfSPaul Beesleynormal world software image. The entrypoint is the load address of the BL33 40240d553cfSPaul Beesleyimage. The ``SPSR`` is determined as specified in Section 5.13 of the 40334760951SPaul Beesley`Power State Coordination Interface PDD`_. This information is passed to the 40434760951SPaul BeesleyEL3 Runtime Software. 40540d553cfSPaul Beesley 40640d553cfSPaul BeesleyAArch64 BL31 (EL3 Runtime Software) execution 40740d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 40840d553cfSPaul Beesley 40940d553cfSPaul BeesleyBL2 execution continues as follows: 41040d553cfSPaul Beesley 41140d553cfSPaul Beesley#. BL2 passes control back to BL1 by raising an SMC, providing BL1 with the 41240d553cfSPaul Beesley BL31 entrypoint. The exception is handled by the SMC exception handler 41340d553cfSPaul Beesley installed by BL1. 41440d553cfSPaul Beesley 41540d553cfSPaul Beesley#. BL1 turns off the MMU and flushes the caches. It clears the 41640d553cfSPaul Beesley ``SCTLR_EL3.M/I/C`` bits, flushes the data cache to the point of coherency 41740d553cfSPaul Beesley and invalidates the TLBs. 41840d553cfSPaul Beesley 41940d553cfSPaul Beesley#. BL1 passes control to BL31 at the specified entrypoint at EL3. 42040d553cfSPaul Beesley 42140d553cfSPaul BeesleyRunning BL2 at EL3 execution level 42240d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 42340d553cfSPaul Beesley 42440d553cfSPaul BeesleySome platforms have a non-TF-A Boot ROM that expects the next boot stage 42540d553cfSPaul Beesleyto execute at EL3. On these platforms, TF-A BL1 is a waste of memory 42640d553cfSPaul Beesleyas its only purpose is to ensure TF-A BL2 is entered at S-EL1. To avoid 42740d553cfSPaul Beesleythis waste, a special mode enables BL2 to execute at EL3, which allows 42840d553cfSPaul Beesleya non-TF-A Boot ROM to load and jump directly to BL2. This mode is selected 42942d4d3baSArvind Ram Prakashwhen the build flag RESET_TO_BL2 is enabled. 43042d4d3baSArvind Ram PrakashThe main differences in this mode are: 43140d553cfSPaul Beesley 43240d553cfSPaul Beesley#. BL2 includes the reset code and the mailbox mechanism to differentiate 43340d553cfSPaul Beesley cold boot and warm boot. It runs at EL3 doing the arch 43440d553cfSPaul Beesley initialization required for EL3. 43540d553cfSPaul Beesley 43640d553cfSPaul Beesley#. BL2 does not receive the meminfo information from BL1 anymore. This 43740d553cfSPaul Beesley information can be passed by the Boot ROM or be internal to the 43840d553cfSPaul Beesley BL2 image. 43940d553cfSPaul Beesley 44040d553cfSPaul Beesley#. Since BL2 executes at EL3, BL2 jumps directly to the next image, 44140d553cfSPaul Beesley instead of invoking the RUN_IMAGE SMC call. 44240d553cfSPaul Beesley 44340d553cfSPaul Beesley 44440d553cfSPaul BeesleyWe assume 3 different types of BootROM support on the platform: 44540d553cfSPaul Beesley 44640d553cfSPaul Beesley#. The Boot ROM always jumps to the same address, for both cold 44740d553cfSPaul Beesley and warm boot. In this case, we will need to keep a resident part 44840d553cfSPaul Beesley of BL2 whose memory cannot be reclaimed by any other image. The 44940d553cfSPaul Beesley linker script defines the symbols __TEXT_RESIDENT_START__ and 45040d553cfSPaul Beesley __TEXT_RESIDENT_END__ that allows the platform to configure 45140d553cfSPaul Beesley correctly the memory map. 45240d553cfSPaul Beesley#. The platform has some mechanism to indicate the jump address to the 45340d553cfSPaul Beesley Boot ROM. Platform code can then program the jump address with 45440d553cfSPaul Beesley psci_warmboot_entrypoint during cold boot. 45540d553cfSPaul Beesley#. The platform has some mechanism to program the reset address using 45640d553cfSPaul Beesley the PROGRAMMABLE_RESET_ADDRESS feature. Platform code can then 45740d553cfSPaul Beesley program the reset address with psci_warmboot_entrypoint during 45840d553cfSPaul Beesley cold boot, bypassing the boot ROM for warm boot. 45940d553cfSPaul Beesley 46040d553cfSPaul BeesleyIn the last 2 cases, no part of BL2 needs to remain resident at 46140d553cfSPaul Beesleyruntime. In the first 2 cases, we expect the Boot ROM to be able to 46240d553cfSPaul Beesleydifferentiate between warm and cold boot, to avoid loading BL2 again 46340d553cfSPaul Beesleyduring warm boot. 46440d553cfSPaul Beesley 46540d553cfSPaul BeesleyThis functionality can be tested with FVP loading the image directly 46640d553cfSPaul Beesleyin memory and changing the address where the system jumps at reset. 46740d553cfSPaul BeesleyFor example: 46840d553cfSPaul Beesley 46940d553cfSPaul Beesley -C cluster0.cpu0.RVBAR=0x4022000 47040d553cfSPaul Beesley --data cluster0.cpu0=bl2.bin@0x4022000 47140d553cfSPaul Beesley 47240d553cfSPaul BeesleyWith this configuration, FVP is like a platform of the first case, 47340d553cfSPaul Beesleywhere the Boot ROM jumps always to the same address. For simplification, 47440d553cfSPaul BeesleyBL32 is loaded in DRAM in this case, to avoid other images reclaiming 47540d553cfSPaul BeesleyBL2 memory. 47640d553cfSPaul Beesley 47740d553cfSPaul Beesley 47840d553cfSPaul BeesleyAArch64 BL31 47940d553cfSPaul Beesley~~~~~~~~~~~~ 48040d553cfSPaul Beesley 48140d553cfSPaul BeesleyThe image for this stage is loaded by BL2 and BL1 passes control to BL31 at 48240d553cfSPaul BeesleyEL3. BL31 executes solely in trusted SRAM. BL31 is linked against and 48340d553cfSPaul Beesleyloaded at a platform-specific base address (more information can be found later 48440d553cfSPaul Beesleyin this document). The functionality implemented by BL31 is as follows. 48540d553cfSPaul Beesley 48640d553cfSPaul BeesleyArchitectural initialization 48740d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 48840d553cfSPaul Beesley 48940d553cfSPaul BeesleyCurrently, BL31 performs a similar architectural initialization to BL1 as 49040d553cfSPaul Beesleyfar as system register settings are concerned. Since BL1 code resides in ROM, 49140d553cfSPaul Beesleyarchitectural initialization in BL31 allows override of any previous 49240d553cfSPaul Beesleyinitialization done by BL1. 49340d553cfSPaul Beesley 49440d553cfSPaul BeesleyBL31 initializes the per-CPU data framework, which provides a cache of 49540d553cfSPaul Beesleyfrequently accessed per-CPU data optimised for fast, concurrent manipulation 49640d553cfSPaul Beesleyon different CPUs. This buffer includes pointers to per-CPU contexts, crash 49740d553cfSPaul Beesleybuffer, CPU reset and power down operations, PSCI data, platform data and so on. 49840d553cfSPaul Beesley 49940d553cfSPaul BeesleyIt then replaces the exception vectors populated by BL1 with its own. BL31 50040d553cfSPaul Beesleyexception vectors implement more elaborate support for handling SMCs since this 50140d553cfSPaul Beesleyis the only mechanism to access the runtime services implemented by BL31 (PSCI 50240d553cfSPaul Beesleyfor example). BL31 checks each SMC for validity as specified by the 50371ac931fSSandrine Bailleux`SMC Calling Convention`_ before passing control to the required SMC 50440d553cfSPaul Beesleyhandler routine. 50540d553cfSPaul Beesley 50640d553cfSPaul BeesleyBL31 programs the ``CNTFRQ_EL0`` register with the clock frequency of the system 50740d553cfSPaul Beesleycounter, which is provided by the platform. 50840d553cfSPaul Beesley 50940d553cfSPaul BeesleyPlatform initialization 51040d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^ 51140d553cfSPaul Beesley 51240d553cfSPaul BeesleyBL31 performs detailed platform initialization, which enables normal world 51340d553cfSPaul Beesleysoftware to function correctly. 51440d553cfSPaul Beesley 51540d553cfSPaul BeesleyOn Arm platforms, this consists of the following: 51640d553cfSPaul Beesley 51740d553cfSPaul Beesley- Initialize the console. 51840d553cfSPaul Beesley- Configure the Interconnect to enable hardware coherency. 51940d553cfSPaul Beesley- Enable the MMU and map the memory it needs to access. 52040d553cfSPaul Beesley- Initialize the generic interrupt controller. 52140d553cfSPaul Beesley- Initialize the power controller device. 52240d553cfSPaul Beesley- Detect the system topology. 52340d553cfSPaul Beesley 52440d553cfSPaul BeesleyRuntime services initialization 52540d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 52640d553cfSPaul Beesley 52740d553cfSPaul BeesleyBL31 is responsible for initializing the runtime services. One of them is PSCI. 52840d553cfSPaul Beesley 52940d553cfSPaul BeesleyAs part of the PSCI initializations, BL31 detects the system topology. It also 53040d553cfSPaul Beesleyinitializes the data structures that implement the state machine used to track 53140d553cfSPaul Beesleythe state of power domain nodes. The state can be one of ``OFF``, ``RUN`` or 53240d553cfSPaul Beesley``RETENTION``. All secondary CPUs are initially in the ``OFF`` state. The cluster 53340d553cfSPaul Beesleythat the primary CPU belongs to is ``ON``; any other cluster is ``OFF``. It also 53440d553cfSPaul Beesleyinitializes the locks that protect them. BL31 accesses the state of a CPU or 53540d553cfSPaul Beesleycluster immediately after reset and before the data cache is enabled in the 53640d553cfSPaul Beesleywarm boot path. It is not currently possible to use 'exclusive' based spinlocks, 53740d553cfSPaul Beesleytherefore BL31 uses locks based on Lamport's Bakery algorithm instead. 53840d553cfSPaul Beesley 53940d553cfSPaul BeesleyThe runtime service framework and its initialization is described in more 54040d553cfSPaul Beesleydetail in the "EL3 runtime services framework" section below. 54140d553cfSPaul Beesley 54240d553cfSPaul BeesleyDetails about the status of the PSCI implementation are provided in the 54340d553cfSPaul Beesley"Power State Coordination Interface" section below. 54440d553cfSPaul Beesley 54540d553cfSPaul BeesleyAArch64 BL32 (Secure-EL1 Payload) image initialization 54640d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 54740d553cfSPaul Beesley 54840d553cfSPaul BeesleyIf a BL32 image is present then there must be a matching Secure-EL1 Payload 54940d553cfSPaul BeesleyDispatcher (SPD) service (see later for details). During initialization 55040d553cfSPaul Beesleythat service must register a function to carry out initialization of BL32 55140d553cfSPaul Beesleyonce the runtime services are fully initialized. BL31 invokes such a 55240d553cfSPaul Beesleyregistered function to initialize BL32 before running BL33. This initialization 55340d553cfSPaul Beesleyis not necessary for AArch32 SPs. 55440d553cfSPaul Beesley 55540d553cfSPaul BeesleyDetails on BL32 initialization and the SPD's role are described in the 55643f35ef5SPaul Beesley:ref:`firmware_design_sel1_spd` section below. 55740d553cfSPaul Beesley 55840d553cfSPaul BeesleyBL33 (Non-trusted Firmware) execution 55940d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 56040d553cfSPaul Beesley 56140d553cfSPaul BeesleyEL3 Runtime Software initializes the EL2 or EL1 processor context for normal- 56240d553cfSPaul Beesleyworld cold boot, ensuring that no secure state information finds its way into 56340d553cfSPaul Beesleythe non-secure execution state. EL3 Runtime Software uses the entrypoint 56440d553cfSPaul Beesleyinformation provided by BL2 to jump to the Non-trusted firmware image (BL33) 56540d553cfSPaul Beesleyat the highest available Exception Level (EL2 if available, otherwise EL1). 56640d553cfSPaul Beesley 56740d553cfSPaul BeesleyUsing alternative Trusted Boot Firmware in place of BL1 & BL2 (AArch64 only) 56840d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 56940d553cfSPaul Beesley 57040d553cfSPaul BeesleySome platforms have existing implementations of Trusted Boot Firmware that 57140d553cfSPaul Beesleywould like to use TF-A BL31 for the EL3 Runtime Software. To enable this 57240d553cfSPaul Beesleyfirmware architecture it is important to provide a fully documented and stable 57340d553cfSPaul Beesleyinterface between the Trusted Boot Firmware and BL31. 57440d553cfSPaul Beesley 57540d553cfSPaul BeesleyFuture changes to the BL31 interface will be done in a backwards compatible 57640d553cfSPaul Beesleyway, and this enables these firmware components to be independently enhanced/ 57740d553cfSPaul Beesleyupdated to develop and exploit new functionality. 57840d553cfSPaul Beesley 57940d553cfSPaul BeesleyRequired CPU state when calling ``bl31_entrypoint()`` during cold boot 58040d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 58140d553cfSPaul Beesley 58240d553cfSPaul BeesleyThis function must only be called by the primary CPU. 58340d553cfSPaul Beesley 58440d553cfSPaul BeesleyOn entry to this function the calling primary CPU must be executing in AArch64 58540d553cfSPaul BeesleyEL3, little-endian data access, and all interrupt sources masked: 58640d553cfSPaul Beesley 58740d553cfSPaul Beesley:: 58840d553cfSPaul Beesley 58940d553cfSPaul Beesley PSTATE.EL = 3 59040d553cfSPaul Beesley PSTATE.RW = 1 59140d553cfSPaul Beesley PSTATE.DAIF = 0xf 59240d553cfSPaul Beesley SCTLR_EL3.EE = 0 59340d553cfSPaul Beesley 59440d553cfSPaul BeesleyX0 and X1 can be used to pass information from the Trusted Boot Firmware to the 59540d553cfSPaul Beesleyplatform code in BL31: 59640d553cfSPaul Beesley 59740d553cfSPaul Beesley:: 59840d553cfSPaul Beesley 59940d553cfSPaul Beesley X0 : Reserved for common TF-A information 60040d553cfSPaul Beesley X1 : Platform specific information 60140d553cfSPaul Beesley 60240d553cfSPaul BeesleyBL31 zero-init sections (e.g. ``.bss``) should not contain valid data on entry, 60340d553cfSPaul Beesleythese will be zero filled prior to invoking platform setup code. 60440d553cfSPaul Beesley 60540d553cfSPaul BeesleyUse of the X0 and X1 parameters 60640d553cfSPaul Beesley''''''''''''''''''''''''''''''' 60740d553cfSPaul Beesley 60840d553cfSPaul BeesleyThe parameters are platform specific and passed from ``bl31_entrypoint()`` to 60940d553cfSPaul Beesley``bl31_early_platform_setup()``. The value of these parameters is never directly 61040d553cfSPaul Beesleyused by the common BL31 code. 61140d553cfSPaul Beesley 61240d553cfSPaul BeesleyThe convention is that ``X0`` conveys information regarding the BL31, BL32 and 61340d553cfSPaul BeesleyBL33 images from the Trusted Boot firmware and ``X1`` can be used for other 61440d553cfSPaul Beesleyplatform specific purpose. This convention allows platforms which use TF-A's 61540d553cfSPaul BeesleyBL1 and BL2 images to transfer additional platform specific information from 61640d553cfSPaul BeesleySecure Boot without conflicting with future evolution of TF-A using ``X0`` to 61740d553cfSPaul Beesleypass a ``bl31_params`` structure. 61840d553cfSPaul Beesley 61940d553cfSPaul BeesleyBL31 common and SPD initialization code depends on image and entrypoint 62040d553cfSPaul Beesleyinformation about BL33 and BL32, which is provided via BL31 platform APIs. 62140d553cfSPaul BeesleyThis information is required until the start of execution of BL33. This 62240d553cfSPaul Beesleyinformation can be provided in a platform defined manner, e.g. compiled into 62340d553cfSPaul Beesleythe platform code in BL31, or provided in a platform defined memory location 62440d553cfSPaul Beesleyby the Trusted Boot firmware, or passed from the Trusted Boot Firmware via the 62540d553cfSPaul BeesleyCold boot Initialization parameters. This data may need to be cleaned out of 62640d553cfSPaul Beesleythe CPU caches if it is provided by an earlier boot stage and then accessed by 62740d553cfSPaul BeesleyBL31 platform code before the caches are enabled. 62840d553cfSPaul Beesley 62940d553cfSPaul BeesleyTF-A's BL2 implementation passes a ``bl31_params`` structure in 63040d553cfSPaul Beesley``X0`` and the Arm development platforms interpret this in the BL31 platform 63140d553cfSPaul Beesleycode. 63240d553cfSPaul Beesley 63340d553cfSPaul BeesleyMMU, Data caches & Coherency 63440d553cfSPaul Beesley'''''''''''''''''''''''''''' 63540d553cfSPaul Beesley 63640d553cfSPaul BeesleyBL31 does not depend on the enabled state of the MMU, data caches or 63740d553cfSPaul Beesleyinterconnect coherency on entry to ``bl31_entrypoint()``. If these are disabled 63840d553cfSPaul Beesleyon entry, these should be enabled during ``bl31_plat_arch_setup()``. 63940d553cfSPaul Beesley 64040d553cfSPaul BeesleyData structures used in the BL31 cold boot interface 64140d553cfSPaul Beesley'''''''''''''''''''''''''''''''''''''''''''''''''''' 64240d553cfSPaul Beesley 64340d553cfSPaul BeesleyThese structures are designed to support compatibility and independent 64440d553cfSPaul Beesleyevolution of the structures and the firmware images. For example, a version of 64540d553cfSPaul BeesleyBL31 that can interpret the BL3x image information from different versions of 64640d553cfSPaul BeesleyBL2, a platform that uses an extended entry_point_info structure to convey 64740d553cfSPaul Beesleyadditional register information to BL31, or a ELF image loader that can convey 64840d553cfSPaul Beesleymore details about the firmware images. 64940d553cfSPaul Beesley 65040d553cfSPaul BeesleyTo support these scenarios the structures are versioned and sized, which enables 65140d553cfSPaul BeesleyBL31 to detect which information is present and respond appropriately. The 65240d553cfSPaul Beesley``param_header`` is defined to capture this information: 65340d553cfSPaul Beesley 65440d553cfSPaul Beesley.. code:: c 65540d553cfSPaul Beesley 65640d553cfSPaul Beesley typedef struct param_header { 65740d553cfSPaul Beesley uint8_t type; /* type of the structure */ 65840d553cfSPaul Beesley uint8_t version; /* version of this structure */ 65940d553cfSPaul Beesley uint16_t size; /* size of this structure in bytes */ 66040d553cfSPaul Beesley uint32_t attr; /* attributes: unused bits SBZ */ 66140d553cfSPaul Beesley } param_header_t; 66240d553cfSPaul Beesley 66340d553cfSPaul BeesleyThe structures using this format are ``entry_point_info``, ``image_info`` and 66440d553cfSPaul Beesley``bl31_params``. The code that allocates and populates these structures must set 66540d553cfSPaul Beesleythe header fields appropriately, and the ``SET_PARAM_HEAD()`` a macro is defined 66640d553cfSPaul Beesleyto simplify this action. 66740d553cfSPaul Beesley 66840d553cfSPaul BeesleyRequired CPU state for BL31 Warm boot initialization 66940d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 67040d553cfSPaul Beesley 67140d553cfSPaul BeesleyWhen requesting a CPU power-on, or suspending a running CPU, TF-A provides 67240d553cfSPaul Beesleythe platform power management code with a Warm boot initialization 67340d553cfSPaul Beesleyentry-point, to be invoked by the CPU immediately after the reset handler. 67440d553cfSPaul BeesleyOn entry to the Warm boot initialization function the calling CPU must be in 67540d553cfSPaul BeesleyAArch64 EL3, little-endian data access and all interrupt sources masked: 67640d553cfSPaul Beesley 67740d553cfSPaul Beesley:: 67840d553cfSPaul Beesley 67940d553cfSPaul Beesley PSTATE.EL = 3 68040d553cfSPaul Beesley PSTATE.RW = 1 68140d553cfSPaul Beesley PSTATE.DAIF = 0xf 68240d553cfSPaul Beesley SCTLR_EL3.EE = 0 68340d553cfSPaul Beesley 68440d553cfSPaul BeesleyThe PSCI implementation will initialize the processor state and ensure that the 68540d553cfSPaul Beesleyplatform power management code is then invoked as required to initialize all 68640d553cfSPaul Beesleynecessary system, cluster and CPU resources. 68740d553cfSPaul Beesley 68840d553cfSPaul BeesleyAArch32 EL3 Runtime Software entrypoint interface 68940d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 69040d553cfSPaul Beesley 69140d553cfSPaul BeesleyTo enable this firmware architecture it is important to provide a fully 69240d553cfSPaul Beesleydocumented and stable interface between the Trusted Boot Firmware and the 69340d553cfSPaul BeesleyAArch32 EL3 Runtime Software. 69440d553cfSPaul Beesley 69540d553cfSPaul BeesleyFuture changes to the entrypoint interface will be done in a backwards 69640d553cfSPaul Beesleycompatible way, and this enables these firmware components to be independently 69740d553cfSPaul Beesleyenhanced/updated to develop and exploit new functionality. 69840d553cfSPaul Beesley 69940d553cfSPaul BeesleyRequired CPU state when entering during cold boot 70040d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 70140d553cfSPaul Beesley 70240d553cfSPaul BeesleyThis function must only be called by the primary CPU. 70340d553cfSPaul Beesley 70440d553cfSPaul BeesleyOn entry to this function the calling primary CPU must be executing in AArch32 70540d553cfSPaul BeesleyEL3, little-endian data access, and all interrupt sources masked: 70640d553cfSPaul Beesley 70740d553cfSPaul Beesley:: 70840d553cfSPaul Beesley 70940d553cfSPaul Beesley PSTATE.AIF = 0x7 71040d553cfSPaul Beesley SCTLR.EE = 0 71140d553cfSPaul Beesley 71240d553cfSPaul BeesleyR0 and R1 are used to pass information from the Trusted Boot Firmware to the 71340d553cfSPaul Beesleyplatform code in AArch32 EL3 Runtime Software: 71440d553cfSPaul Beesley 71540d553cfSPaul Beesley:: 71640d553cfSPaul Beesley 71740d553cfSPaul Beesley R0 : Reserved for common TF-A information 71840d553cfSPaul Beesley R1 : Platform specific information 71940d553cfSPaul Beesley 72040d553cfSPaul BeesleyUse of the R0 and R1 parameters 72140d553cfSPaul Beesley''''''''''''''''''''''''''''''' 72240d553cfSPaul Beesley 72340d553cfSPaul BeesleyThe parameters are platform specific and the convention is that ``R0`` conveys 72440d553cfSPaul Beesleyinformation regarding the BL3x images from the Trusted Boot firmware and ``R1`` 72540d553cfSPaul Beesleycan be used for other platform specific purpose. This convention allows 72640d553cfSPaul Beesleyplatforms which use TF-A's BL1 and BL2 images to transfer additional platform 72740d553cfSPaul Beesleyspecific information from Secure Boot without conflicting with future 72840d553cfSPaul Beesleyevolution of TF-A using ``R0`` to pass a ``bl_params`` structure. 72940d553cfSPaul Beesley 73040d553cfSPaul BeesleyThe AArch32 EL3 Runtime Software is responsible for entry into BL33. This 73140d553cfSPaul Beesleyinformation can be obtained in a platform defined manner, e.g. compiled into 73240d553cfSPaul Beesleythe AArch32 EL3 Runtime Software, or provided in a platform defined memory 73340d553cfSPaul Beesleylocation by the Trusted Boot firmware, or passed from the Trusted Boot Firmware 73440d553cfSPaul Beesleyvia the Cold boot Initialization parameters. This data may need to be cleaned 73540d553cfSPaul Beesleyout of the CPU caches if it is provided by an earlier boot stage and then 73640d553cfSPaul Beesleyaccessed by AArch32 EL3 Runtime Software before the caches are enabled. 73740d553cfSPaul Beesley 73840d553cfSPaul BeesleyWhen using AArch32 EL3 Runtime Software, the Arm development platforms pass a 73940d553cfSPaul Beesley``bl_params`` structure in ``R0`` from BL2 to be interpreted by AArch32 EL3 Runtime 74040d553cfSPaul BeesleySoftware platform code. 74140d553cfSPaul Beesley 74240d553cfSPaul BeesleyMMU, Data caches & Coherency 74340d553cfSPaul Beesley'''''''''''''''''''''''''''' 74440d553cfSPaul Beesley 74540d553cfSPaul BeesleyAArch32 EL3 Runtime Software must not depend on the enabled state of the MMU, 74640d553cfSPaul Beesleydata caches or interconnect coherency in its entrypoint. They must be explicitly 74740d553cfSPaul Beesleyenabled if required. 74840d553cfSPaul Beesley 74940d553cfSPaul BeesleyData structures used in cold boot interface 75040d553cfSPaul Beesley''''''''''''''''''''''''''''''''''''''''''' 75140d553cfSPaul Beesley 75240d553cfSPaul BeesleyThe AArch32 EL3 Runtime Software cold boot interface uses ``bl_params`` instead 75340d553cfSPaul Beesleyof ``bl31_params``. The ``bl_params`` structure is based on the convention 75440d553cfSPaul Beesleydescribed in AArch64 BL31 cold boot interface section. 75540d553cfSPaul Beesley 75640d553cfSPaul BeesleyRequired CPU state for warm boot initialization 75740d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 75840d553cfSPaul Beesley 75940d553cfSPaul BeesleyWhen requesting a CPU power-on, or suspending a running CPU, AArch32 EL3 76040d553cfSPaul BeesleyRuntime Software must ensure execution of a warm boot initialization entrypoint. 76140d553cfSPaul BeesleyIf TF-A BL1 is used and the PROGRAMMABLE_RESET_ADDRESS build flag is false, 76240d553cfSPaul Beesleythen AArch32 EL3 Runtime Software must ensure that BL1 branches to the warm 76340d553cfSPaul Beesleyboot entrypoint by arranging for the BL1 platform function, 76440d553cfSPaul Beesleyplat_get_my_entrypoint(), to return a non-zero value. 76540d553cfSPaul Beesley 76640d553cfSPaul BeesleyIn this case, the warm boot entrypoint must be in AArch32 EL3, little-endian 76740d553cfSPaul Beesleydata access and all interrupt sources masked: 76840d553cfSPaul Beesley 76940d553cfSPaul Beesley:: 77040d553cfSPaul Beesley 77140d553cfSPaul Beesley PSTATE.AIF = 0x7 77240d553cfSPaul Beesley SCTLR.EE = 0 77340d553cfSPaul Beesley 77440d553cfSPaul BeesleyThe warm boot entrypoint may be implemented by using TF-A 77540d553cfSPaul Beesley``psci_warmboot_entrypoint()`` function. In that case, the platform must fulfil 77634760951SPaul Beesleythe pre-requisites mentioned in the 77734760951SPaul Beesley:ref:`PSCI Library Integration guide for Armv8-A AArch32 systems`. 77840d553cfSPaul Beesley 77940d553cfSPaul BeesleyEL3 runtime services framework 78040d553cfSPaul Beesley------------------------------ 78140d553cfSPaul Beesley 78240d553cfSPaul BeesleySoftware executing in the non-secure state and in the secure state at exception 78340d553cfSPaul Beesleylevels lower than EL3 will request runtime services using the Secure Monitor 78440d553cfSPaul BeesleyCall (SMC) instruction. These requests will follow the convention described in 78540d553cfSPaul Beesleythe SMC Calling Convention PDD (`SMCCC`_). The `SMCCC`_ assigns function 78640d553cfSPaul Beesleyidentifiers to each SMC request and describes how arguments are passed and 78740d553cfSPaul Beesleyreturned. 78840d553cfSPaul Beesley 78940d553cfSPaul BeesleyThe EL3 runtime services framework enables the development of services by 79040d553cfSPaul Beesleydifferent providers that can be easily integrated into final product firmware. 79140d553cfSPaul BeesleyThe following sections describe the framework which facilitates the 79240d553cfSPaul Beesleyregistration, initialization and use of runtime services in EL3 Runtime 79340d553cfSPaul BeesleySoftware (BL31). 79440d553cfSPaul Beesley 79540d553cfSPaul BeesleyThe design of the runtime services depends heavily on the concepts and 79640d553cfSPaul Beesleydefinitions described in the `SMCCC`_, in particular SMC Function IDs, Owning 79740d553cfSPaul BeesleyEntity Numbers (OEN), Fast and Yielding calls, and the SMC32 and SMC64 calling 79840d553cfSPaul Beesleyconventions. Please refer to that document for more detailed explanation of 79940d553cfSPaul Beesleythese terms. 80040d553cfSPaul Beesley 80140d553cfSPaul BeesleyThe following runtime services are expected to be implemented first. They have 80240d553cfSPaul Beesleynot all been instantiated in the current implementation. 80340d553cfSPaul Beesley 80440d553cfSPaul Beesley#. Standard service calls 80540d553cfSPaul Beesley 80640d553cfSPaul Beesley This service is for management of the entire system. The Power State 80740d553cfSPaul Beesley Coordination Interface (`PSCI`_) is the first set of standard service calls 80840d553cfSPaul Beesley defined by Arm (see PSCI section later). 80940d553cfSPaul Beesley 81040d553cfSPaul Beesley#. Secure-EL1 Payload Dispatcher service 81140d553cfSPaul Beesley 81240d553cfSPaul Beesley If a system runs a Trusted OS or other Secure-EL1 Payload (SP) then 81340d553cfSPaul Beesley it also requires a *Secure Monitor* at EL3 to switch the EL1 processor 81440d553cfSPaul Beesley context between the normal world (EL1/EL2) and trusted world (Secure-EL1). 81540d553cfSPaul Beesley The Secure Monitor will make these world switches in response to SMCs. The 81640d553cfSPaul Beesley `SMCCC`_ provides for such SMCs with the Trusted OS Call and Trusted 81740d553cfSPaul Beesley Application Call OEN ranges. 81840d553cfSPaul Beesley 81940d553cfSPaul Beesley The interface between the EL3 Runtime Software and the Secure-EL1 Payload is 82040d553cfSPaul Beesley not defined by the `SMCCC`_ or any other standard. As a result, each 82140d553cfSPaul Beesley Secure-EL1 Payload requires a specific Secure Monitor that runs as a runtime 82240d553cfSPaul Beesley service - within TF-A this service is referred to as the Secure-EL1 Payload 82340d553cfSPaul Beesley Dispatcher (SPD). 82440d553cfSPaul Beesley 82540d553cfSPaul Beesley TF-A provides a Test Secure-EL1 Payload (TSP) and its associated Dispatcher 82640d553cfSPaul Beesley (TSPD). Details of SPD design and TSP/TSPD operation are described in the 82743f35ef5SPaul Beesley :ref:`firmware_design_sel1_spd` section below. 82840d553cfSPaul Beesley 82940d553cfSPaul Beesley#. CPU implementation service 83040d553cfSPaul Beesley 83140d553cfSPaul Beesley This service will provide an interface to CPU implementation specific 83240d553cfSPaul Beesley services for a given platform e.g. access to processor errata workarounds. 83340d553cfSPaul Beesley This service is currently unimplemented. 83440d553cfSPaul Beesley 83540d553cfSPaul BeesleyAdditional services for Arm Architecture, SiP and OEM calls can be implemented. 83640d553cfSPaul BeesleyEach implemented service handles a range of SMC function identifiers as 83740d553cfSPaul Beesleydescribed in the `SMCCC`_. 83840d553cfSPaul Beesley 83940d553cfSPaul BeesleyRegistration 84040d553cfSPaul Beesley~~~~~~~~~~~~ 84140d553cfSPaul Beesley 84240d553cfSPaul BeesleyA runtime service is registered using the ``DECLARE_RT_SVC()`` macro, specifying 84340d553cfSPaul Beesleythe name of the service, the range of OENs covered, the type of service and 84440d553cfSPaul Beesleyinitialization and call handler functions. This macro instantiates a ``const struct rt_svc_desc`` for the service with these details (see ``runtime_svc.h``). 845da04341eSChris KayThis structure is allocated in a special ELF section ``.rt_svc_descs``, enabling 84640d553cfSPaul Beesleythe framework to find all service descriptors included into BL31. 84740d553cfSPaul Beesley 84840d553cfSPaul BeesleyThe specific service for a SMC Function is selected based on the OEN and call 84940d553cfSPaul Beesleytype of the Function ID, and the framework uses that information in the service 85040d553cfSPaul Beesleydescriptor to identify the handler for the SMC Call. 85140d553cfSPaul Beesley 85240d553cfSPaul BeesleyThe service descriptors do not include information to identify the precise set 85340d553cfSPaul Beesleyof SMC function identifiers supported by this service implementation, the 85440d553cfSPaul Beesleysecurity state from which such calls are valid nor the capability to support 85540d553cfSPaul Beesley64-bit and/or 32-bit callers (using SMC32 or SMC64). Responding appropriately 85640d553cfSPaul Beesleyto these aspects of a SMC call is the responsibility of the service 85740d553cfSPaul Beesleyimplementation, the framework is focused on integration of services from 85840d553cfSPaul Beesleydifferent providers and minimizing the time taken by the framework before the 85940d553cfSPaul Beesleyservice handler is invoked. 86040d553cfSPaul Beesley 86140d553cfSPaul BeesleyDetails of the parameters, requirements and behavior of the initialization and 86240d553cfSPaul Beesleycall handling functions are provided in the following sections. 86340d553cfSPaul Beesley 86440d553cfSPaul BeesleyInitialization 86540d553cfSPaul Beesley~~~~~~~~~~~~~~ 86640d553cfSPaul Beesley 86740d553cfSPaul Beesley``runtime_svc_init()`` in ``runtime_svc.c`` initializes the runtime services 86840d553cfSPaul Beesleyframework running on the primary CPU during cold boot as part of the BL31 86940d553cfSPaul Beesleyinitialization. This happens prior to initializing a Trusted OS and running 87040d553cfSPaul BeesleyNormal world boot firmware that might in turn use these services. 87140d553cfSPaul BeesleyInitialization involves validating each of the declared runtime service 87240d553cfSPaul Beesleydescriptors, calling the service initialization function and populating the 87340d553cfSPaul Beesleyindex used for runtime lookup of the service. 87440d553cfSPaul Beesley 87540d553cfSPaul BeesleyThe BL31 linker script collects all of the declared service descriptors into a 87640d553cfSPaul Beesleysingle array and defines symbols that allow the framework to locate and traverse 87740d553cfSPaul Beesleythe array, and determine its size. 87840d553cfSPaul Beesley 87940d553cfSPaul BeesleyThe framework does basic validation of each descriptor to halt firmware 88040d553cfSPaul Beesleyinitialization if service declaration errors are detected. The framework does 88140d553cfSPaul Beesleynot check descriptors for the following error conditions, and may behave in an 88240d553cfSPaul Beesleyunpredictable manner under such scenarios: 88340d553cfSPaul Beesley 88440d553cfSPaul Beesley#. Overlapping OEN ranges 88540d553cfSPaul Beesley#. Multiple descriptors for the same range of OENs and ``call_type`` 88640d553cfSPaul Beesley#. Incorrect range of owning entity numbers for a given ``call_type`` 88740d553cfSPaul Beesley 88840d553cfSPaul BeesleyOnce validated, the service ``init()`` callback is invoked. This function carries 88940d553cfSPaul Beesleyout any essential EL3 initialization before servicing requests. The ``init()`` 89040d553cfSPaul Beesleyfunction is only invoked on the primary CPU during cold boot. If the service 89140d553cfSPaul Beesleyuses per-CPU data this must either be initialized for all CPUs during this call, 89240d553cfSPaul Beesleyor be done lazily when a CPU first issues an SMC call to that service. If 89340d553cfSPaul Beesley``init()`` returns anything other than ``0``, this is treated as an initialization 89440d553cfSPaul Beesleyerror and the service is ignored: this does not cause the firmware to halt. 89540d553cfSPaul Beesley 89640d553cfSPaul BeesleyThe OEN and call type fields present in the SMC Function ID cover a total of 89740d553cfSPaul Beesley128 distinct services, but in practice a single descriptor can cover a range of 89840d553cfSPaul BeesleyOENs, e.g. SMCs to call a Trusted OS function. To optimize the lookup of a 89940d553cfSPaul Beesleyservice handler, the framework uses an array of 128 indices that map every 90040d553cfSPaul Beesleydistinct OEN/call-type combination either to one of the declared services or to 90140d553cfSPaul Beesleyindicate the service is not handled. This ``rt_svc_descs_indices[]`` array is 90240d553cfSPaul Beesleypopulated for all of the OENs covered by a service after the service ``init()`` 90340d553cfSPaul Beesleyfunction has reported success. So a service that fails to initialize will never 90440d553cfSPaul Beesleyhave it's ``handle()`` function invoked. 90540d553cfSPaul Beesley 90640d553cfSPaul BeesleyThe following figure shows how the ``rt_svc_descs_indices[]`` index maps the SMC 90740d553cfSPaul BeesleyFunction ID call type and OEN onto a specific service handler in the 90840d553cfSPaul Beesley``rt_svc_descs[]`` array. 90940d553cfSPaul Beesley 91040d553cfSPaul Beesley|Image 1| 91140d553cfSPaul Beesley 9126844c347SMadhukar Pappireddy.. _handling-an-smc: 9136844c347SMadhukar Pappireddy 91440d553cfSPaul BeesleyHandling an SMC 91540d553cfSPaul Beesley~~~~~~~~~~~~~~~ 91640d553cfSPaul Beesley 91740d553cfSPaul BeesleyWhen the EL3 runtime services framework receives a Secure Monitor Call, the SMC 91840d553cfSPaul BeesleyFunction ID is passed in W0 from the lower exception level (as per the 91940d553cfSPaul Beesley`SMCCC`_). If the calling register width is AArch32, it is invalid to invoke an 92040d553cfSPaul BeesleySMC Function which indicates the SMC64 calling convention: such calls are 92140d553cfSPaul Beesleyignored and return the Unknown SMC Function Identifier result code ``0xFFFFFFFF`` 92240d553cfSPaul Beesleyin R0/X0. 92340d553cfSPaul Beesley 92440d553cfSPaul BeesleyBit[31] (fast/yielding call) and bits[29:24] (owning entity number) of the SMC 92540d553cfSPaul BeesleyFunction ID are combined to index into the ``rt_svc_descs_indices[]`` array. The 92640d553cfSPaul Beesleyresulting value might indicate a service that has no handler, in this case the 92740d553cfSPaul Beesleyframework will also report an Unknown SMC Function ID. Otherwise, the value is 92840d553cfSPaul Beesleyused as a further index into the ``rt_svc_descs[]`` array to locate the required 92940d553cfSPaul Beesleyservice and handler. 93040d553cfSPaul Beesley 93140d553cfSPaul BeesleyThe service's ``handle()`` callback is provided with five of the SMC parameters 93240d553cfSPaul Beesleydirectly, the others are saved into memory for retrieval (if needed) by the 93340d553cfSPaul Beesleyhandler. The handler is also provided with an opaque ``handle`` for use with the 93440d553cfSPaul Beesleysupporting library for parameter retrieval, setting return values and context 9350fe7b9f2SOlivier Deprezmanipulation. The ``flags`` parameter indicates the security state of the caller 9360fe7b9f2SOlivier Deprezand the state of the SVE hint bit per the SMCCCv1.3. The framework finally sets 9370fe7b9f2SOlivier Deprezup the execution stack for the handler, and invokes the services ``handle()`` 9380fe7b9f2SOlivier Deprezfunction. 93940d553cfSPaul Beesley 940e34cc0ceSMadhukar PappireddyOn return from the handler the result registers are populated in X0-X7 as needed 941e34cc0ceSMadhukar Pappireddybefore restoring the stack and CPU state and returning from the original SMC. 94240d553cfSPaul Beesley 94340d553cfSPaul BeesleyException Handling Framework 94440d553cfSPaul Beesley---------------------------- 94540d553cfSPaul Beesley 946526f2bddSjohpow01Please refer to the :ref:`Exception Handling Framework` document. 94740d553cfSPaul Beesley 94840d553cfSPaul BeesleyPower State Coordination Interface 94940d553cfSPaul Beesley---------------------------------- 95040d553cfSPaul Beesley 95140d553cfSPaul BeesleyTODO: Provide design walkthrough of PSCI implementation. 95240d553cfSPaul Beesley 95340d553cfSPaul BeesleyThe PSCI v1.1 specification categorizes APIs as optional and mandatory. All the 95440d553cfSPaul Beesleymandatory APIs in PSCI v1.1, PSCI v1.0 and in PSCI v0.2 draft specification 95540d553cfSPaul Beesley`Power State Coordination Interface PDD`_ are implemented. The table lists 95640d553cfSPaul Beesleythe PSCI v1.1 APIs and their support in generic code. 95740d553cfSPaul Beesley 95840d553cfSPaul BeesleyAn API implementation might have a dependency on platform code e.g. CPU_SUSPEND 95940d553cfSPaul Beesleyrequires the platform to export a part of the implementation. Hence the level 96040d553cfSPaul Beesleyof support of the mandatory APIs depends upon the support exported by the 96140d553cfSPaul Beesleyplatform port as well. The Juno and FVP (all variants) platforms export all the 96240d553cfSPaul Beesleyrequired support. 96340d553cfSPaul Beesley 96440d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 96540d553cfSPaul Beesley| PSCI v1.1 API | Supported | Comments | 96640d553cfSPaul Beesley+=============================+=============+===============================+ 96740d553cfSPaul Beesley| ``PSCI_VERSION`` | Yes | The version returned is 1.1 | 96840d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 96940d553cfSPaul Beesley| ``CPU_SUSPEND`` | Yes\* | | 97040d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 97140d553cfSPaul Beesley| ``CPU_OFF`` | Yes\* | | 97240d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 97340d553cfSPaul Beesley| ``CPU_ON`` | Yes\* | | 97440d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 97540d553cfSPaul Beesley| ``AFFINITY_INFO`` | Yes | | 97640d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 97740d553cfSPaul Beesley| ``MIGRATE`` | Yes\*\* | | 97840d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 97940d553cfSPaul Beesley| ``MIGRATE_INFO_TYPE`` | Yes\*\* | | 98040d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 98140d553cfSPaul Beesley| ``MIGRATE_INFO_CPU`` | Yes\*\* | | 98240d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 98340d553cfSPaul Beesley| ``SYSTEM_OFF`` | Yes\* | | 98440d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 98540d553cfSPaul Beesley| ``SYSTEM_RESET`` | Yes\* | | 98640d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 98740d553cfSPaul Beesley| ``PSCI_FEATURES`` | Yes | | 98840d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 98940d553cfSPaul Beesley| ``CPU_FREEZE`` | No | | 99040d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 99140d553cfSPaul Beesley| ``CPU_DEFAULT_SUSPEND`` | No | | 99240d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 99340d553cfSPaul Beesley| ``NODE_HW_STATE`` | Yes\* | | 99440d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 99540d553cfSPaul Beesley| ``SYSTEM_SUSPEND`` | Yes\* | | 99640d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 99740d553cfSPaul Beesley| ``PSCI_SET_SUSPEND_MODE`` | No | | 99840d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 99940d553cfSPaul Beesley| ``PSCI_STAT_RESIDENCY`` | Yes\* | | 100040d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 100140d553cfSPaul Beesley| ``PSCI_STAT_COUNT`` | Yes\* | | 100240d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 100340d553cfSPaul Beesley| ``SYSTEM_RESET2`` | Yes\* | | 100440d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 100540d553cfSPaul Beesley| ``MEM_PROTECT`` | Yes\* | | 100640d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 100740d553cfSPaul Beesley| ``MEM_PROTECT_CHECK_RANGE`` | Yes\* | | 100840d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+ 100940d553cfSPaul Beesley 101040d553cfSPaul Beesley\*Note : These PSCI APIs require platform power management hooks to be 101140d553cfSPaul Beesleyregistered with the generic PSCI code to be supported. 101240d553cfSPaul Beesley 101340d553cfSPaul Beesley\*\*Note : These PSCI APIs require appropriate Secure Payload Dispatcher 101440d553cfSPaul Beesleyhooks to be registered with the generic PSCI code to be supported. 101540d553cfSPaul Beesley 101640d553cfSPaul BeesleyThe PSCI implementation in TF-A is a library which can be integrated with 101740d553cfSPaul BeesleyAArch64 or AArch32 EL3 Runtime Software for Armv8-A systems. A guide to 101840d553cfSPaul Beesleyintegrating PSCI library with AArch32 EL3 Runtime Software can be found 101934760951SPaul Beesleyat :ref:`PSCI Library Integration guide for Armv8-A AArch32 systems`. 102034760951SPaul Beesley 102134760951SPaul Beesley.. _firmware_design_sel1_spd: 102240d553cfSPaul Beesley 102340d553cfSPaul BeesleySecure-EL1 Payloads and Dispatchers 102440d553cfSPaul Beesley----------------------------------- 102540d553cfSPaul Beesley 102640d553cfSPaul BeesleyOn a production system that includes a Trusted OS running in Secure-EL1/EL0, 102740d553cfSPaul Beesleythe Trusted OS is coupled with a companion runtime service in the BL31 102840d553cfSPaul Beesleyfirmware. This service is responsible for the initialisation of the Trusted 102940d553cfSPaul BeesleyOS and all communications with it. The Trusted OS is the BL32 stage of the 103040d553cfSPaul Beesleyboot flow in TF-A. The firmware will attempt to locate, load and execute a 103140d553cfSPaul BeesleyBL32 image. 103240d553cfSPaul Beesley 103340d553cfSPaul BeesleyTF-A uses a more general term for the BL32 software that runs at Secure-EL1 - 103440d553cfSPaul Beesleythe *Secure-EL1 Payload* - as it is not always a Trusted OS. 103540d553cfSPaul Beesley 103640d553cfSPaul BeesleyTF-A provides a Test Secure-EL1 Payload (TSP) and a Test Secure-EL1 Payload 103740d553cfSPaul BeesleyDispatcher (TSPD) service as an example of how a Trusted OS is supported on a 103840d553cfSPaul Beesleyproduction system using the Runtime Services Framework. On such a system, the 103940d553cfSPaul BeesleyTest BL32 image and service are replaced by the Trusted OS and its dispatcher 104040d553cfSPaul Beesleyservice. The TF-A build system expects that the dispatcher will define the 104140d553cfSPaul Beesleybuild flag ``NEED_BL32`` to enable it to include the BL32 in the build either 104240d553cfSPaul Beesleyas a binary or to compile from source depending on whether the ``BL32`` build 104340d553cfSPaul Beesleyoption is specified or not. 104440d553cfSPaul Beesley 104540d553cfSPaul BeesleyThe TSP runs in Secure-EL1. It is designed to demonstrate synchronous 104640d553cfSPaul Beesleycommunication with the normal-world software running in EL1/EL2. Communication 104740d553cfSPaul Beesleyis initiated by the normal-world software 104840d553cfSPaul Beesley 104940d553cfSPaul Beesley- either directly through a Fast SMC (as defined in the `SMCCC`_) 105040d553cfSPaul Beesley 105140d553cfSPaul Beesley- or indirectly through a `PSCI`_ SMC. The `PSCI`_ implementation in turn 105240d553cfSPaul Beesley informs the TSPD about the requested power management operation. This allows 105340d553cfSPaul Beesley the TSP to prepare for or respond to the power state change 105440d553cfSPaul Beesley 105540d553cfSPaul BeesleyThe TSPD service is responsible for. 105640d553cfSPaul Beesley 105740d553cfSPaul Beesley- Initializing the TSP 105840d553cfSPaul Beesley 105940d553cfSPaul Beesley- Routing requests and responses between the secure and the non-secure 106040d553cfSPaul Beesley states during the two types of communications just described 106140d553cfSPaul Beesley 106240d553cfSPaul BeesleyInitializing a BL32 Image 106340d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~ 106440d553cfSPaul Beesley 106540d553cfSPaul BeesleyThe Secure-EL1 Payload Dispatcher (SPD) service is responsible for initializing 106640d553cfSPaul Beesleythe BL32 image. It needs access to the information passed by BL2 to BL31 to do 106740d553cfSPaul Beesleyso. This is provided by: 106840d553cfSPaul Beesley 106940d553cfSPaul Beesley.. code:: c 107040d553cfSPaul Beesley 107140d553cfSPaul Beesley entry_point_info_t *bl31_plat_get_next_image_ep_info(uint32_t); 107240d553cfSPaul Beesley 107340d553cfSPaul Beesleywhich returns a reference to the ``entry_point_info`` structure corresponding to 107440d553cfSPaul Beesleythe image which will be run in the specified security state. The SPD uses this 107540d553cfSPaul BeesleyAPI to get entry point information for the SECURE image, BL32. 107640d553cfSPaul Beesley 107740d553cfSPaul BeesleyIn the absence of a BL32 image, BL31 passes control to the normal world 107840d553cfSPaul Beesleybootloader image (BL33). When the BL32 image is present, it is typical 107940d553cfSPaul Beesleythat the SPD wants control to be passed to BL32 first and then later to BL33. 108040d553cfSPaul Beesley 108140d553cfSPaul BeesleyTo do this the SPD has to register a BL32 initialization function during 108240d553cfSPaul Beesleyinitialization of the SPD service. The BL32 initialization function has this 108340d553cfSPaul Beesleyprototype: 108440d553cfSPaul Beesley 108540d553cfSPaul Beesley.. code:: c 108640d553cfSPaul Beesley 108740d553cfSPaul Beesley int32_t init(void); 108840d553cfSPaul Beesley 108940d553cfSPaul Beesleyand is registered using the ``bl31_register_bl32_init()`` function. 109040d553cfSPaul Beesley 109140d553cfSPaul BeesleyTF-A supports two approaches for the SPD to pass control to BL32 before 109240d553cfSPaul Beesleyreturning through EL3 and running the non-trusted firmware (BL33): 109340d553cfSPaul Beesley 109440d553cfSPaul Beesley#. In the BL32 setup function, use ``bl31_set_next_image_type()`` to 109540d553cfSPaul Beesley request that the exit from ``bl31_main()`` is to the BL32 entrypoint in 109640d553cfSPaul Beesley Secure-EL1. BL31 will exit to BL32 using the asynchronous method by 109740d553cfSPaul Beesley calling ``bl31_prepare_next_image_entry()`` and ``el3_exit()``. 109840d553cfSPaul Beesley 109940d553cfSPaul Beesley When the BL32 has completed initialization at Secure-EL1, it returns to 110040d553cfSPaul Beesley BL31 by issuing an SMC, using a Function ID allocated to the SPD. On 110140d553cfSPaul Beesley receipt of this SMC, the SPD service handler should switch the CPU context 110240d553cfSPaul Beesley from trusted to normal world and use the ``bl31_set_next_image_type()`` and 110340d553cfSPaul Beesley ``bl31_prepare_next_image_entry()`` functions to set up the initial return to 110440d553cfSPaul Beesley the normal world firmware BL33. On return from the handler the framework 110540d553cfSPaul Beesley will exit to EL2 and run BL33. 110640d553cfSPaul Beesley 110740d553cfSPaul Beesley#. The BL32 setup function registers an initialization function using 110840d553cfSPaul Beesley ``bl31_register_bl32_init()`` which provides a SPD-defined mechanism to 110940d553cfSPaul Beesley invoke a 'world-switch synchronous call' to Secure-EL1 to run the BL32 111040d553cfSPaul Beesley entrypoint. 1111e1c5026aSPaul Beesley 1112e1c5026aSPaul Beesley .. note:: 1113e1c5026aSPaul Beesley The Test SPD service included with TF-A provides one implementation 111440d553cfSPaul Beesley of such a mechanism. 111540d553cfSPaul Beesley 111640d553cfSPaul Beesley On completion BL32 returns control to BL31 via a SMC, and on receipt the 111740d553cfSPaul Beesley SPD service handler invokes the synchronous call return mechanism to return 111840d553cfSPaul Beesley to the BL32 initialization function. On return from this function, 111940d553cfSPaul Beesley ``bl31_main()`` will set up the return to the normal world firmware BL33 and 112040d553cfSPaul Beesley continue the boot process in the normal world. 112140d553cfSPaul Beesley 112240d553cfSPaul BeesleyCrash Reporting in BL31 112340d553cfSPaul Beesley----------------------- 112440d553cfSPaul Beesley 112540d553cfSPaul BeesleyBL31 implements a scheme for reporting the processor state when an unhandled 112640d553cfSPaul Beesleyexception is encountered. The reporting mechanism attempts to preserve all the 112740d553cfSPaul Beesleyregister contents and report it via a dedicated UART (PL011 console). BL31 112840d553cfSPaul Beesleyreports the general purpose, EL3, Secure EL1 and some EL2 state registers. 112940d553cfSPaul Beesley 113040d553cfSPaul BeesleyA dedicated per-CPU crash stack is maintained by BL31 and this is retrieved via 113140d553cfSPaul Beesleythe per-CPU pointer cache. The implementation attempts to minimise the memory 113240d553cfSPaul Beesleyrequired for this feature. The file ``crash_reporting.S`` contains the 113340d553cfSPaul Beesleyimplementation for crash reporting. 113440d553cfSPaul Beesley 113540d553cfSPaul BeesleyThe sample crash output is shown below. 113640d553cfSPaul Beesley 113740d553cfSPaul Beesley:: 113840d553cfSPaul Beesley 1139b4292bc6SAlexei Fedorov x0 = 0x000000002a4a0000 1140b4292bc6SAlexei Fedorov x1 = 0x0000000000000001 1141b4292bc6SAlexei Fedorov x2 = 0x0000000000000002 1142b4292bc6SAlexei Fedorov x3 = 0x0000000000000003 1143b4292bc6SAlexei Fedorov x4 = 0x0000000000000004 1144b4292bc6SAlexei Fedorov x5 = 0x0000000000000005 1145b4292bc6SAlexei Fedorov x6 = 0x0000000000000006 1146b4292bc6SAlexei Fedorov x7 = 0x0000000000000007 1147b4292bc6SAlexei Fedorov x8 = 0x0000000000000008 1148b4292bc6SAlexei Fedorov x9 = 0x0000000000000009 1149b4292bc6SAlexei Fedorov x10 = 0x0000000000000010 1150b4292bc6SAlexei Fedorov x11 = 0x0000000000000011 1151b4292bc6SAlexei Fedorov x12 = 0x0000000000000012 1152b4292bc6SAlexei Fedorov x13 = 0x0000000000000013 1153b4292bc6SAlexei Fedorov x14 = 0x0000000000000014 1154b4292bc6SAlexei Fedorov x15 = 0x0000000000000015 1155b4292bc6SAlexei Fedorov x16 = 0x0000000000000016 1156b4292bc6SAlexei Fedorov x17 = 0x0000000000000017 1157b4292bc6SAlexei Fedorov x18 = 0x0000000000000018 1158b4292bc6SAlexei Fedorov x19 = 0x0000000000000019 1159b4292bc6SAlexei Fedorov x20 = 0x0000000000000020 1160b4292bc6SAlexei Fedorov x21 = 0x0000000000000021 1161b4292bc6SAlexei Fedorov x22 = 0x0000000000000022 1162b4292bc6SAlexei Fedorov x23 = 0x0000000000000023 1163b4292bc6SAlexei Fedorov x24 = 0x0000000000000024 1164b4292bc6SAlexei Fedorov x25 = 0x0000000000000025 1165b4292bc6SAlexei Fedorov x26 = 0x0000000000000026 1166b4292bc6SAlexei Fedorov x27 = 0x0000000000000027 1167b4292bc6SAlexei Fedorov x28 = 0x0000000000000028 1168b4292bc6SAlexei Fedorov x29 = 0x0000000000000029 1169b4292bc6SAlexei Fedorov x30 = 0x0000000088000b78 1170b4292bc6SAlexei Fedorov scr_el3 = 0x000000000003073d 1171b4292bc6SAlexei Fedorov sctlr_el3 = 0x00000000b0cd183f 1172b4292bc6SAlexei Fedorov cptr_el3 = 0x0000000000000000 1173b4292bc6SAlexei Fedorov tcr_el3 = 0x000000008080351c 1174b4292bc6SAlexei Fedorov daif = 0x00000000000002c0 1175b4292bc6SAlexei Fedorov mair_el3 = 0x00000000004404ff 1176b4292bc6SAlexei Fedorov spsr_el3 = 0x0000000060000349 1177b4292bc6SAlexei Fedorov elr_el3 = 0x0000000088000114 1178b4292bc6SAlexei Fedorov ttbr0_el3 = 0x0000000004018201 1179b4292bc6SAlexei Fedorov esr_el3 = 0x00000000be000000 1180b4292bc6SAlexei Fedorov far_el3 = 0x0000000000000000 1181b4292bc6SAlexei Fedorov spsr_el1 = 0x0000000000000000 1182b4292bc6SAlexei Fedorov elr_el1 = 0x0000000000000000 1183b4292bc6SAlexei Fedorov spsr_abt = 0x0000000000000000 1184b4292bc6SAlexei Fedorov spsr_und = 0x0000000000000000 1185b4292bc6SAlexei Fedorov spsr_irq = 0x0000000000000000 1186b4292bc6SAlexei Fedorov spsr_fiq = 0x0000000000000000 1187b4292bc6SAlexei Fedorov sctlr_el1 = 0x0000000030d00800 1188b4292bc6SAlexei Fedorov actlr_el1 = 0x0000000000000000 1189b4292bc6SAlexei Fedorov cpacr_el1 = 0x0000000000000000 1190b4292bc6SAlexei Fedorov csselr_el1 = 0x0000000000000000 1191b4292bc6SAlexei Fedorov sp_el1 = 0x0000000000000000 1192b4292bc6SAlexei Fedorov esr_el1 = 0x0000000000000000 1193b4292bc6SAlexei Fedorov ttbr0_el1 = 0x0000000000000000 1194b4292bc6SAlexei Fedorov ttbr1_el1 = 0x0000000000000000 1195b4292bc6SAlexei Fedorov mair_el1 = 0x0000000000000000 1196b4292bc6SAlexei Fedorov amair_el1 = 0x0000000000000000 1197b4292bc6SAlexei Fedorov tcr_el1 = 0x0000000000000000 1198b4292bc6SAlexei Fedorov tpidr_el1 = 0x0000000000000000 1199b4292bc6SAlexei Fedorov tpidr_el0 = 0x0000000000000000 1200b4292bc6SAlexei Fedorov tpidrro_el0 = 0x0000000000000000 1201b4292bc6SAlexei Fedorov par_el1 = 0x0000000000000000 1202b4292bc6SAlexei Fedorov mpidr_el1 = 0x0000000080000000 1203b4292bc6SAlexei Fedorov afsr0_el1 = 0x0000000000000000 1204b4292bc6SAlexei Fedorov afsr1_el1 = 0x0000000000000000 1205b4292bc6SAlexei Fedorov contextidr_el1 = 0x0000000000000000 1206b4292bc6SAlexei Fedorov vbar_el1 = 0x0000000000000000 1207b4292bc6SAlexei Fedorov cntp_ctl_el0 = 0x0000000000000000 1208b4292bc6SAlexei Fedorov cntp_cval_el0 = 0x0000000000000000 1209b4292bc6SAlexei Fedorov cntv_ctl_el0 = 0x0000000000000000 1210b4292bc6SAlexei Fedorov cntv_cval_el0 = 0x0000000000000000 1211b4292bc6SAlexei Fedorov cntkctl_el1 = 0x0000000000000000 1212b4292bc6SAlexei Fedorov sp_el0 = 0x0000000004014940 1213b4292bc6SAlexei Fedorov isr_el1 = 0x0000000000000000 1214b4292bc6SAlexei Fedorov dacr32_el2 = 0x0000000000000000 1215b4292bc6SAlexei Fedorov ifsr32_el2 = 0x0000000000000000 1216b4292bc6SAlexei Fedorov icc_hppir0_el1 = 0x00000000000003ff 1217b4292bc6SAlexei Fedorov icc_hppir1_el1 = 0x00000000000003ff 1218b4292bc6SAlexei Fedorov icc_ctlr_el3 = 0x0000000000080400 1219b4292bc6SAlexei Fedorov gicd_ispendr regs (Offsets 0x200-0x278) 1220b4292bc6SAlexei Fedorov Offset Value 1221b4292bc6SAlexei Fedorov 0x200: 0x0000000000000000 1222b4292bc6SAlexei Fedorov 0x208: 0x0000000000000000 1223b4292bc6SAlexei Fedorov 0x210: 0x0000000000000000 1224b4292bc6SAlexei Fedorov 0x218: 0x0000000000000000 1225b4292bc6SAlexei Fedorov 0x220: 0x0000000000000000 1226b4292bc6SAlexei Fedorov 0x228: 0x0000000000000000 1227b4292bc6SAlexei Fedorov 0x230: 0x0000000000000000 1228b4292bc6SAlexei Fedorov 0x238: 0x0000000000000000 1229b4292bc6SAlexei Fedorov 0x240: 0x0000000000000000 1230b4292bc6SAlexei Fedorov 0x248: 0x0000000000000000 1231b4292bc6SAlexei Fedorov 0x250: 0x0000000000000000 1232b4292bc6SAlexei Fedorov 0x258: 0x0000000000000000 1233b4292bc6SAlexei Fedorov 0x260: 0x0000000000000000 1234b4292bc6SAlexei Fedorov 0x268: 0x0000000000000000 1235b4292bc6SAlexei Fedorov 0x270: 0x0000000000000000 1236b4292bc6SAlexei Fedorov 0x278: 0x0000000000000000 123740d553cfSPaul Beesley 123840d553cfSPaul BeesleyGuidelines for Reset Handlers 123940d553cfSPaul Beesley----------------------------- 124040d553cfSPaul Beesley 124140d553cfSPaul BeesleyTF-A implements a framework that allows CPU and platform ports to perform 124240d553cfSPaul Beesleyactions very early after a CPU is released from reset in both the cold and warm 124340d553cfSPaul Beesleyboot paths. This is done by calling the ``reset_handler()`` function in both 124440d553cfSPaul Beesleythe BL1 and BL31 images. It in turn calls the platform and CPU specific reset 124540d553cfSPaul Beesleyhandling functions. 124640d553cfSPaul Beesley 124740d553cfSPaul BeesleyDetails for implementing a CPU specific reset handler can be found in 1248*6a0e8e80SBoyan Karatotev:ref:`firmware_design_cpu_specific_reset_handling`. Details for implementing a 1249*6a0e8e80SBoyan Karatotevplatform specific reset handler can be found in the :ref:`Porting Guide` (see 1250*6a0e8e80SBoyan Karatotevthe``plat_reset_handler()`` function). 125140d553cfSPaul Beesley 125240d553cfSPaul BeesleyWhen adding functionality to a reset handler, keep in mind that if a different 125340d553cfSPaul Beesleyreset handling behavior is required between the first and the subsequent 125440d553cfSPaul Beesleyinvocations of the reset handling code, this should be detected at runtime. 125540d553cfSPaul BeesleyIn other words, the reset handler should be able to detect whether an action has 125640d553cfSPaul Beesleyalready been performed and act as appropriate. Possible courses of actions are, 125740d553cfSPaul Beesleye.g. skip the action the second time, or undo/redo it. 125840d553cfSPaul Beesley 12596844c347SMadhukar Pappireddy.. _configuring-secure-interrupts: 12606844c347SMadhukar Pappireddy 126140d553cfSPaul BeesleyConfiguring secure interrupts 126240d553cfSPaul Beesley----------------------------- 126340d553cfSPaul Beesley 126440d553cfSPaul BeesleyThe GIC driver is responsible for performing initial configuration of secure 126540d553cfSPaul Beesleyinterrupts on the platform. To this end, the platform is expected to provide the 126640d553cfSPaul BeesleyGIC driver (either GICv2 or GICv3, as selected by the platform) with the 126740d553cfSPaul Beesleyinterrupt configuration during the driver initialisation. 126840d553cfSPaul Beesley 126940d553cfSPaul BeesleySecure interrupt configuration are specified in an array of secure interrupt 127040d553cfSPaul Beesleyproperties. In this scheme, in both GICv2 and GICv3 driver data structures, the 127140d553cfSPaul Beesley``interrupt_props`` member points to an array of interrupt properties. Each 127240d553cfSPaul Beesleyelement of the array specifies the interrupt number and its attributes 127340d553cfSPaul Beesley(priority, group, configuration). Each element of the array shall be populated 127440d553cfSPaul Beesleyby the macro ``INTR_PROP_DESC()``. The macro takes the following arguments: 127540d553cfSPaul Beesley 1276d5eee8f3SMing Huang- 13-bit interrupt number, 127740d553cfSPaul Beesley 127840d553cfSPaul Beesley- 8-bit interrupt priority, 127940d553cfSPaul Beesley 128040d553cfSPaul Beesley- Interrupt type (one of ``INTR_TYPE_EL3``, ``INTR_TYPE_S_EL1``, 128140d553cfSPaul Beesley ``INTR_TYPE_NS``), 128240d553cfSPaul Beesley 128340d553cfSPaul Beesley- Interrupt configuration (either ``GIC_INTR_CFG_LEVEL`` or 128440d553cfSPaul Beesley ``GIC_INTR_CFG_EDGE``). 128540d553cfSPaul Beesley 128634760951SPaul Beesley.. _firmware_design_cpu_ops_fwk: 128734760951SPaul Beesley 128840d553cfSPaul BeesleyCPU specific operations framework 128940d553cfSPaul Beesley--------------------------------- 129040d553cfSPaul Beesley 129140d553cfSPaul BeesleyCertain aspects of the Armv8-A architecture are implementation defined, 129240d553cfSPaul Beesleythat is, certain behaviours are not architecturally defined, but must be 129340d553cfSPaul Beesleydefined and documented by individual processor implementations. TF-A 129440d553cfSPaul Beesleyimplements a framework which categorises the common implementation defined 129540d553cfSPaul Beesleybehaviours and allows a processor to export its implementation of that 129640d553cfSPaul Beesleybehaviour. The categories are: 129740d553cfSPaul Beesley 129840d553cfSPaul Beesley#. Processor specific reset sequence. 129940d553cfSPaul Beesley 130040d553cfSPaul Beesley#. Processor specific power down sequences. 130140d553cfSPaul Beesley 130240d553cfSPaul Beesley#. Processor specific register dumping as a part of crash reporting. 130340d553cfSPaul Beesley 130440d553cfSPaul Beesley#. Errata status reporting. 130540d553cfSPaul Beesley 130640d553cfSPaul BeesleyEach of the above categories fulfils a different requirement. 130740d553cfSPaul Beesley 130840d553cfSPaul Beesley#. allows any processor specific initialization before the caches and MMU 130940d553cfSPaul Beesley are turned on, like implementation of errata workarounds, entry into 131040d553cfSPaul Beesley the intra-cluster coherency domain etc. 131140d553cfSPaul Beesley 131240d553cfSPaul Beesley#. allows each processor to implement the power down sequence mandated in 131340d553cfSPaul Beesley its Technical Reference Manual (TRM). 131440d553cfSPaul Beesley 131540d553cfSPaul Beesley#. allows a processor to provide additional information to the developer 131640d553cfSPaul Beesley in the event of a crash, for example Cortex-A53 has registers which 131740d553cfSPaul Beesley can expose the data cache contents. 131840d553cfSPaul Beesley 131940d553cfSPaul Beesley#. allows a processor to define a function that inspects and reports the status 132040d553cfSPaul Beesley of all errata workarounds on that processor. 132140d553cfSPaul Beesley 132240d553cfSPaul BeesleyPlease note that only 2. is mandated by the TRM. 132340d553cfSPaul Beesley 132440d553cfSPaul BeesleyThe CPU specific operations framework scales to accommodate a large number of 132540d553cfSPaul Beesleydifferent CPUs during power down and reset handling. The platform can specify 132640d553cfSPaul Beesleyany CPU optimization it wants to enable for each CPU. It can also specify 132740d553cfSPaul Beesleythe CPU errata workarounds to be applied for each CPU type during reset 132840d553cfSPaul Beesleyhandling by defining CPU errata compile time macros. Details on these macros 132934760951SPaul Beesleycan be found in the :ref:`Arm CPU Specific Build Macros` document. 133040d553cfSPaul Beesley 133140d553cfSPaul BeesleyThe CPU specific operations framework depends on the ``cpu_ops`` structure which 133240d553cfSPaul Beesleyneeds to be exported for each type of CPU in the platform. It is defined in 133340d553cfSPaul Beesley``include/lib/cpus/aarch64/cpu_macros.S`` and has the following fields : ``midr``, 133440d553cfSPaul Beesley``reset_func()``, ``cpu_pwr_down_ops`` (array of power down functions) and 133540d553cfSPaul Beesley``cpu_reg_dump()``. 133640d553cfSPaul Beesley 133740d553cfSPaul BeesleyThe CPU specific files in ``lib/cpus`` export a ``cpu_ops`` data structure with 133840d553cfSPaul Beesleysuitable handlers for that CPU. For example, ``lib/cpus/aarch64/cortex_a53.S`` 133940d553cfSPaul Beesleyexports the ``cpu_ops`` for Cortex-A53 CPU. According to the platform 134040d553cfSPaul Beesleyconfiguration, these CPU specific files must be included in the build by 134140d553cfSPaul Beesleythe platform makefile. The generic CPU specific operations framework code exists 134240d553cfSPaul Beesleyin ``lib/cpus/aarch64/cpu_helpers.S``. 134340d553cfSPaul Beesley 1344*6a0e8e80SBoyan KaratotevCPU PCS 1345*6a0e8e80SBoyan Karatotev~~~~~~~ 1346*6a0e8e80SBoyan Karatotev 1347*6a0e8e80SBoyan KaratotevAll assembly functions in CPU files are asked to follow a modified version of 1348*6a0e8e80SBoyan Karatotevthe Procedure Call Standard (PCS) in their internals. This is done to ensure 1349*6a0e8e80SBoyan Karatotevcalling these functions from outside the file doesn't unexpectedly corrupt 1350*6a0e8e80SBoyan Karatotevregisters in the very early environment and to help the internals to be easier 1351*6a0e8e80SBoyan Karatotevto understand. Please see the :ref:`firmware_design_cpu_errata_implementation` 1352*6a0e8e80SBoyan Karatotevfor any function specific restrictions. 1353*6a0e8e80SBoyan Karatotev 1354*6a0e8e80SBoyan Karatotev+--------------+---------------------------------+ 1355*6a0e8e80SBoyan Karatotev| register | use | 1356*6a0e8e80SBoyan Karatotev+==============+=================================+ 1357*6a0e8e80SBoyan Karatotev| x0 - x15 | scratch | 1358*6a0e8e80SBoyan Karatotev+--------------+---------------------------------+ 1359*6a0e8e80SBoyan Karatotev| x16, x17 | do not use (used by the linker) | 1360*6a0e8e80SBoyan Karatotev+--------------+---------------------------------+ 1361*6a0e8e80SBoyan Karatotev| x18 | do not use (platform register) | 1362*6a0e8e80SBoyan Karatotev+--------------+---------------------------------+ 1363*6a0e8e80SBoyan Karatotev| x19 - x28 | callee saved | 1364*6a0e8e80SBoyan Karatotev+--------------+---------------------------------+ 1365*6a0e8e80SBoyan Karatotev| x29, x30 | FP, LR | 1366*6a0e8e80SBoyan Karatotev+--------------+---------------------------------+ 1367*6a0e8e80SBoyan Karatotev 1368*6a0e8e80SBoyan Karatotev.. _firmware_design_cpu_specific_reset_handling: 1369*6a0e8e80SBoyan Karatotev 137040d553cfSPaul BeesleyCPU specific Reset Handling 137140d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137240d553cfSPaul Beesley 137340d553cfSPaul BeesleyAfter a reset, the state of the CPU when it calls generic reset handler is: 1374*6a0e8e80SBoyan KaratotevMMU turned off, both instruction and data caches turned off, not part 1375*6a0e8e80SBoyan Karatotevof any coherency domain and no stack. 137640d553cfSPaul Beesley 137740d553cfSPaul BeesleyThe BL entrypoint code first invokes the ``plat_reset_handler()`` to allow 137840d553cfSPaul Beesleythe platform to perform any system initialization required and any system 137940d553cfSPaul Beesleyerrata workarounds that needs to be applied. The ``get_cpu_ops_ptr()`` reads 138040d553cfSPaul Beesleythe current CPU midr, finds the matching ``cpu_ops`` entry in the ``cpu_ops`` 138140d553cfSPaul Beesleyarray and returns it. Note that only the part number and implementer fields 138240d553cfSPaul Beesleyin midr are used to find the matching ``cpu_ops`` entry. The ``reset_func()`` in 138340d553cfSPaul Beesleythe returned ``cpu_ops`` is then invoked which executes the required reset 138440d553cfSPaul Beesleyhandling for that CPU and also any errata workarounds enabled by the platform. 138540d553cfSPaul Beesley 1386*6a0e8e80SBoyan KaratotevIt should be defined using the ``cpu_reset_func_{start,end}`` macros and its 1387*6a0e8e80SBoyan Karatotevbody may only clobber x0 to x14 with x14 being the cpu_rev parameter. 138840d553cfSPaul Beesley 138940d553cfSPaul BeesleyCPU specific power down sequence 139040d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139140d553cfSPaul Beesley 139240d553cfSPaul BeesleyDuring the BL31 initialization sequence, the pointer to the matching ``cpu_ops`` 139340d553cfSPaul Beesleyentry is stored in per-CPU data by ``init_cpu_ops()`` so that it can be quickly 139440d553cfSPaul Beesleyretrieved during power down sequences. 139540d553cfSPaul Beesley 139640d553cfSPaul BeesleyVarious CPU drivers register handlers to perform power down at certain power 139740d553cfSPaul Beesleylevels for that specific CPU. The PSCI service, upon receiving a power down 139840d553cfSPaul Beesleyrequest, determines the highest power level at which to execute power down 139940d553cfSPaul Beesleysequence for a particular CPU. It uses the ``prepare_cpu_pwr_dwn()`` function to 140040d553cfSPaul Beesleypick the right power down handler for the requested level. The function 140140d553cfSPaul Beesleyretrieves ``cpu_ops`` pointer member of per-CPU data, and from that, further 140240d553cfSPaul Beesleyretrieves ``cpu_pwr_down_ops`` array, and indexes into the required level. If the 140340d553cfSPaul Beesleyrequested power level is higher than what a CPU driver supports, the handler 140440d553cfSPaul Beesleyregistered for highest level is invoked. 140540d553cfSPaul Beesley 140640d553cfSPaul BeesleyAt runtime the platform hooks for power down are invoked by the PSCI service to 140740d553cfSPaul Beesleyperform platform specific operations during a power down sequence, for example 140840d553cfSPaul Beesleyturning off CCI coherency during a cluster power down. 140940d553cfSPaul Beesley 141040d553cfSPaul BeesleyCPU specific register reporting during crash 141140d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 141240d553cfSPaul Beesley 141340d553cfSPaul BeesleyIf the crash reporting is enabled in BL31, when a crash occurs, the crash 141440d553cfSPaul Beesleyreporting framework calls ``do_cpu_reg_dump`` which retrieves the matching 141540d553cfSPaul Beesley``cpu_ops`` using ``get_cpu_ops_ptr()`` function. The ``cpu_reg_dump()`` in 141640d553cfSPaul Beesley``cpu_ops`` is invoked, which then returns the CPU specific register values to 141740d553cfSPaul Beesleybe reported and a pointer to the ASCII list of register names in a format 141840d553cfSPaul Beesleyexpected by the crash reporting framework. 141940d553cfSPaul Beesley 1420*6a0e8e80SBoyan Karatotev.. _firmware_design_cpu_errata_implementation: 142134760951SPaul Beesley 1422*6a0e8e80SBoyan KaratotevCPU errata implementation 1423*6a0e8e80SBoyan Karatotev~~~~~~~~~~~~~~~~~~~~~~~~~ 142440d553cfSPaul Beesley 142540d553cfSPaul BeesleyErrata workarounds for CPUs supported in TF-A are applied during both cold and 142640d553cfSPaul Beesleywarm boots, shortly after reset. Individual Errata workarounds are enabled as 142740d553cfSPaul Beesleybuild options. Some errata workarounds have potential run-time implications; 142840d553cfSPaul Beesleytherefore some are enabled by default, others not. Platform ports shall 142940d553cfSPaul Beesleyoverride build options to enable or disable errata as appropriate. The CPU 143040d553cfSPaul Beesleydrivers take care of applying errata workarounds that are enabled and applicable 1431*6a0e8e80SBoyan Karatotevto a given CPU. 143240d553cfSPaul Beesley 1433*6a0e8e80SBoyan KaratotevEach erratum has a build flag in ``lib/cpus/cpu-ops.mk`` of the form: 1434*6a0e8e80SBoyan Karatotev``ERRATA_<cpu_num>_<erratum_id>``. It also has a short description in 1435*6a0e8e80SBoyan Karatotev:ref:`arm_cpu_macros_errata_workarounds` on when it should apply. 143640d553cfSPaul Beesley 1437*6a0e8e80SBoyan KaratotevErrata framework 1438*6a0e8e80SBoyan Karatotev^^^^^^^^^^^^^^^^ 143940d553cfSPaul Beesley 1440*6a0e8e80SBoyan KaratotevThe errata framework is a convention and a small library to allow errata to be 1441*6a0e8e80SBoyan Karatotevautomatically discovered. It enables compliant errata to be automatically 1442*6a0e8e80SBoyan Karatotevapplied and reported at runtime (either by status reporting or the errata ABI). 144340d553cfSPaul Beesley 1444*6a0e8e80SBoyan KaratotevTo write a compliant mitigation for erratum number ``erratum_id`` on a cpu that 1445*6a0e8e80SBoyan Karatotevdeclared itself (with ``declare_cpu_ops``) as ``cpu_name`` one needs 3 things: 144640d553cfSPaul Beesley 1447*6a0e8e80SBoyan Karatotev#. A CPU revision checker function: ``check_erratum_<cpu_name>_<erratum_id>`` 144840d553cfSPaul Beesley 1449*6a0e8e80SBoyan Karatotev It should check whether this erratum applies on this revision of this CPU. 1450*6a0e8e80SBoyan Karatotev It will be called with the CPU revision as its first parameter (x0) and 1451*6a0e8e80SBoyan Karatotev should return one of ``ERRATA_APPLIES`` or ``ERRATA_NOT_APPLIES``. 145240d553cfSPaul Beesley 1453*6a0e8e80SBoyan Karatotev It may only clobber x0 to x4. The rest should be treated as callee-saved. 1454*6a0e8e80SBoyan Karatotev 1455*6a0e8e80SBoyan Karatotev#. A workaround function: ``erratum_<cpu_name>_<erratum_id>_wa`` 1456*6a0e8e80SBoyan Karatotev 1457*6a0e8e80SBoyan Karatotev It should obtain the cpu revision (with ``cpu_get_rev_var``), call its 1458*6a0e8e80SBoyan Karatotev revision checker, and perform the mitigation, should the erratum apply. 1459*6a0e8e80SBoyan Karatotev 1460*6a0e8e80SBoyan Karatotev It may only clobber x0 to x8. The rest should be treated as callee-saved. 1461*6a0e8e80SBoyan Karatotev 1462*6a0e8e80SBoyan Karatotev#. Register itself to the framework 1463*6a0e8e80SBoyan Karatotev 1464*6a0e8e80SBoyan Karatotev Do this with 1465*6a0e8e80SBoyan Karatotev ``add_erratum_entry <cpu_name>, ERRATUM(<erratum_id>), <errata_flag>`` 1466*6a0e8e80SBoyan Karatotev where the ``errata_flag`` is the enable flag in ``cpu-ops.mk`` described 1467*6a0e8e80SBoyan Karatotev above. 1468*6a0e8e80SBoyan Karatotev 1469*6a0e8e80SBoyan KaratotevSee the next section on how to do this easily. 1470*6a0e8e80SBoyan Karatotev 1471*6a0e8e80SBoyan Karatotev.. note:: 1472*6a0e8e80SBoyan Karatotev 1473*6a0e8e80SBoyan Karatotev CVEs have the format ``CVE_<year>_<number>``. To fit them in the framework, the 1474*6a0e8e80SBoyan Karatotev ``erratum_id`` for the checker and the workaround functions become the 1475*6a0e8e80SBoyan Karatotev ``number`` part of its name and the ``ERRATUM(<number>)`` part of the 1476*6a0e8e80SBoyan Karatotev registration should instead be ``CVE(<year>, <number>)``. In the extremely 1477*6a0e8e80SBoyan Karatotev unlikely scenario where a CVE and an erratum numbers clash, the CVE number 1478*6a0e8e80SBoyan Karatotev should be prefixed with a zero. 1479*6a0e8e80SBoyan Karatotev 1480*6a0e8e80SBoyan Karatotev Also, their build flag should be ``WORKAROUND_CVE_<year>_<number>``. 1481*6a0e8e80SBoyan Karatotev 1482*6a0e8e80SBoyan Karatotev.. note:: 1483*6a0e8e80SBoyan Karatotev 1484*6a0e8e80SBoyan Karatotev AArch32 uses the legacy convention. The checker function has the format 1485*6a0e8e80SBoyan Karatotev ``check_errata_<erratum_id>`` and the workaround has the format 1486*6a0e8e80SBoyan Karatotev ``errata_<cpu_number>_<erratum_id>_wa`` where ``cpu_number`` is the shortform 1487*6a0e8e80SBoyan Karatotev letter and number name of the CPU. 1488*6a0e8e80SBoyan Karatotev 1489*6a0e8e80SBoyan Karatotev For CVEs the ``erratum_id`` also becomes ``cve_<year>_<number>``. 1490*6a0e8e80SBoyan Karatotev 1491*6a0e8e80SBoyan KaratotevErrata framework helpers 1492*6a0e8e80SBoyan Karatotev^^^^^^^^^^^^^^^^^^^^^^^^ 1493*6a0e8e80SBoyan Karatotev 1494*6a0e8e80SBoyan KaratotevWriting these errata involves lots of boilerplate and repetitive code. On 1495*6a0e8e80SBoyan KaratotevAArch64 there are helpers to omit most of this. They are located in 1496*6a0e8e80SBoyan Karatotev``include/lib/cpus/aarch64/cpu_macros.S`` and the preferred way to implement 1497*6a0e8e80SBoyan Karatoteverrata. Please see their comments on how to use them. 1498*6a0e8e80SBoyan Karatotev 1499*6a0e8e80SBoyan KaratotevThe most common type of erratum workaround, one that just sets a "chicken" bit 1500*6a0e8e80SBoyan Karatotevin some arbitrary register, would have an implementation for the Cortex-A77, 1501*6a0e8e80SBoyan Karatoteverratum #1925769 like:: 1502*6a0e8e80SBoyan Karatotev 1503*6a0e8e80SBoyan Karatotev workaround_reset_start cortex_a77, ERRATUM(1925769), ERRATA_A77_1925769 1504*6a0e8e80SBoyan Karatotev sysreg_bit_set CORTEX_A77_CPUECTLR_EL1, CORTEX_A77_CPUECTLR_EL1_BIT_8 1505*6a0e8e80SBoyan Karatotev workaround_reset_end cortex_a77, ERRATUM(1925769) 1506*6a0e8e80SBoyan Karatotev 1507*6a0e8e80SBoyan Karatotev check_erratum_ls cortex_a77, ERRATUM(1925769), CPU_REV(1, 1) 1508*6a0e8e80SBoyan Karatotev 1509*6a0e8e80SBoyan KaratotevStatus reporting 1510*6a0e8e80SBoyan Karatotev^^^^^^^^^^^^^^^^ 151140d553cfSPaul Beesley 151240d553cfSPaul BeesleyIn a debug build of TF-A, on a CPU that comes out of reset, both BL1 and the 1513*6a0e8e80SBoyan Karatotevruntime firmware (BL31 in AArch64, and BL32 in AArch32) will invoke a generic 1514*6a0e8e80SBoyan Karatoteverrata status reporting function. It will read the ``errata_entries`` list of 1515*6a0e8e80SBoyan Karatotevthat cpu and will report whether each known erratum was applied and, if not, 1516*6a0e8e80SBoyan Karatotevwhether it should have been. 151740d553cfSPaul Beesley 151840d553cfSPaul BeesleyReporting the status of errata workaround is for informational purpose only; it 151940d553cfSPaul Beesleyhas no functional significance. 152040d553cfSPaul Beesley 152140d553cfSPaul BeesleyMemory layout of BL images 152240d553cfSPaul Beesley-------------------------- 152340d553cfSPaul Beesley 152440d553cfSPaul BeesleyEach bootloader image can be divided in 2 parts: 152540d553cfSPaul Beesley 152640d553cfSPaul Beesley- the static contents of the image. These are data actually stored in the 152740d553cfSPaul Beesley binary on the disk. In the ELF terminology, they are called ``PROGBITS`` 152840d553cfSPaul Beesley sections; 152940d553cfSPaul Beesley 153040d553cfSPaul Beesley- the run-time contents of the image. These are data that don't occupy any 153140d553cfSPaul Beesley space in the binary on the disk. The ELF binary just contains some 153240d553cfSPaul Beesley metadata indicating where these data will be stored at run-time and the 153340d553cfSPaul Beesley corresponding sections need to be allocated and initialized at run-time. 153440d553cfSPaul Beesley In the ELF terminology, they are called ``NOBITS`` sections. 153540d553cfSPaul Beesley 153640d553cfSPaul BeesleyAll PROGBITS sections are grouped together at the beginning of the image, 153740d553cfSPaul Beesleyfollowed by all NOBITS sections. This is true for all TF-A images and it is 153840d553cfSPaul Beesleygoverned by the linker scripts. This ensures that the raw binary images are 153940d553cfSPaul Beesleyas small as possible. If a NOBITS section was inserted in between PROGBITS 154040d553cfSPaul Beesleysections then the resulting binary file would contain zero bytes in place of 154140d553cfSPaul Beesleythis NOBITS section, making the image unnecessarily bigger. Smaller images 154240d553cfSPaul Beesleyallow faster loading from the FIP to the main memory. 154340d553cfSPaul Beesley 1544f8578e64SSamuel HollandFor BL31, a platform can specify an alternate location for NOBITS sections 1545f8578e64SSamuel Holland(other than immediately following PROGBITS sections) by setting 1546f8578e64SSamuel Holland``SEPARATE_NOBITS_REGION`` to 1 and defining ``BL31_NOBITS_BASE`` and 1547f8578e64SSamuel Holland``BL31_NOBITS_LIMIT``. 1548f8578e64SSamuel Holland 154940d553cfSPaul BeesleyLinker scripts and symbols 155040d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~ 155140d553cfSPaul Beesley 155240d553cfSPaul BeesleyEach bootloader stage image layout is described by its own linker script. The 155340d553cfSPaul Beesleylinker scripts export some symbols into the program symbol table. Their values 155440d553cfSPaul Beesleycorrespond to particular addresses. TF-A code can refer to these symbols to 155540d553cfSPaul Beesleyfigure out the image memory layout. 155640d553cfSPaul Beesley 155740d553cfSPaul BeesleyLinker symbols follow the following naming convention in TF-A. 155840d553cfSPaul Beesley 155940d553cfSPaul Beesley- ``__<SECTION>_START__`` 156040d553cfSPaul Beesley 156140d553cfSPaul Beesley Start address of a given section named ``<SECTION>``. 156240d553cfSPaul Beesley 156340d553cfSPaul Beesley- ``__<SECTION>_END__`` 156440d553cfSPaul Beesley 156540d553cfSPaul Beesley End address of a given section named ``<SECTION>``. If there is an alignment 156640d553cfSPaul Beesley constraint on the section's end address then ``__<SECTION>_END__`` corresponds 156740d553cfSPaul Beesley to the end address of the section's actual contents, rounded up to the right 156840d553cfSPaul Beesley boundary. Refer to the value of ``__<SECTION>_UNALIGNED_END__`` to know the 156940d553cfSPaul Beesley actual end address of the section's contents. 157040d553cfSPaul Beesley 157140d553cfSPaul Beesley- ``__<SECTION>_UNALIGNED_END__`` 157240d553cfSPaul Beesley 157340d553cfSPaul Beesley End address of a given section named ``<SECTION>`` without any padding or 157440d553cfSPaul Beesley rounding up due to some alignment constraint. 157540d553cfSPaul Beesley 157640d553cfSPaul Beesley- ``__<SECTION>_SIZE__`` 157740d553cfSPaul Beesley 157840d553cfSPaul Beesley Size (in bytes) of a given section named ``<SECTION>``. If there is an 157940d553cfSPaul Beesley alignment constraint on the section's end address then ``__<SECTION>_SIZE__`` 158040d553cfSPaul Beesley corresponds to the size of the section's actual contents, rounded up to the 158140d553cfSPaul Beesley right boundary. In other words, ``__<SECTION>_SIZE__ = __<SECTION>_END__ - _<SECTION>_START__``. Refer to the value of ``__<SECTION>_UNALIGNED_SIZE__`` 158240d553cfSPaul Beesley to know the actual size of the section's contents. 158340d553cfSPaul Beesley 158440d553cfSPaul Beesley- ``__<SECTION>_UNALIGNED_SIZE__`` 158540d553cfSPaul Beesley 158640d553cfSPaul Beesley Size (in bytes) of a given section named ``<SECTION>`` without any padding or 158740d553cfSPaul Beesley rounding up due to some alignment constraint. In other words, 158840d553cfSPaul Beesley ``__<SECTION>_UNALIGNED_SIZE__ = __<SECTION>_UNALIGNED_END__ - __<SECTION>_START__``. 158940d553cfSPaul Beesley 159040d553cfSPaul BeesleySome of the linker symbols are mandatory as TF-A code relies on them to be 159140d553cfSPaul Beesleydefined. They are listed in the following subsections. Some of them must be 159240d553cfSPaul Beesleyprovided for each bootloader stage and some are specific to a given bootloader 159340d553cfSPaul Beesleystage. 159440d553cfSPaul Beesley 159540d553cfSPaul BeesleyThe linker scripts define some extra, optional symbols. They are not actually 159640d553cfSPaul Beesleyused by any code but they help in understanding the bootloader images' memory 159740d553cfSPaul Beesleylayout as they are easy to spot in the link map files. 159840d553cfSPaul Beesley 159940d553cfSPaul BeesleyCommon linker symbols 160040d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^ 160140d553cfSPaul Beesley 160240d553cfSPaul BeesleyAll BL images share the following requirements: 160340d553cfSPaul Beesley 160440d553cfSPaul Beesley- The BSS section must be zero-initialised before executing any C code. 160540d553cfSPaul Beesley- The coherent memory section (if enabled) must be zero-initialised as well. 160640d553cfSPaul Beesley- The MMU setup code needs to know the extents of the coherent and read-only 160740d553cfSPaul Beesley memory regions to set the right memory attributes. When 160840d553cfSPaul Beesley ``SEPARATE_CODE_AND_RODATA=1``, it needs to know more specifically how the 160940d553cfSPaul Beesley read-only memory region is divided between code and data. 161040d553cfSPaul Beesley 161140d553cfSPaul BeesleyThe following linker symbols are defined for this purpose: 161240d553cfSPaul Beesley 161340d553cfSPaul Beesley- ``__BSS_START__`` 161440d553cfSPaul Beesley- ``__BSS_SIZE__`` 161540d553cfSPaul Beesley- ``__COHERENT_RAM_START__`` Must be aligned on a page-size boundary. 161640d553cfSPaul Beesley- ``__COHERENT_RAM_END__`` Must be aligned on a page-size boundary. 161740d553cfSPaul Beesley- ``__COHERENT_RAM_UNALIGNED_SIZE__`` 161840d553cfSPaul Beesley- ``__RO_START__`` 161940d553cfSPaul Beesley- ``__RO_END__`` 162040d553cfSPaul Beesley- ``__TEXT_START__`` 162140d553cfSPaul Beesley- ``__TEXT_END__`` 162240d553cfSPaul Beesley- ``__RODATA_START__`` 162340d553cfSPaul Beesley- ``__RODATA_END__`` 162440d553cfSPaul Beesley 162540d553cfSPaul BeesleyBL1's linker symbols 162640d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^ 162740d553cfSPaul Beesley 162840d553cfSPaul BeesleyBL1 being the ROM image, it has additional requirements. BL1 resides in ROM and 162940d553cfSPaul Beesleyit is entirely executed in place but it needs some read-write memory for its 163040d553cfSPaul Beesleymutable data. Its ``.data`` section (i.e. its allocated read-write data) must be 163140d553cfSPaul Beesleyrelocated from ROM to RAM before executing any C code. 163240d553cfSPaul Beesley 163340d553cfSPaul BeesleyThe following additional linker symbols are defined for BL1: 163440d553cfSPaul Beesley 163540d553cfSPaul Beesley- ``__BL1_ROM_END__`` End address of BL1's ROM contents, covering its code 163640d553cfSPaul Beesley and ``.data`` section in ROM. 163740d553cfSPaul Beesley- ``__DATA_ROM_START__`` Start address of the ``.data`` section in ROM. Must be 163840d553cfSPaul Beesley aligned on a 16-byte boundary. 163940d553cfSPaul Beesley- ``__DATA_RAM_START__`` Address in RAM where the ``.data`` section should be 164040d553cfSPaul Beesley copied over. Must be aligned on a 16-byte boundary. 164140d553cfSPaul Beesley- ``__DATA_SIZE__`` Size of the ``.data`` section (in ROM or RAM). 164240d553cfSPaul Beesley- ``__BL1_RAM_START__`` Start address of BL1 read-write data. 164340d553cfSPaul Beesley- ``__BL1_RAM_END__`` End address of BL1 read-write data. 164440d553cfSPaul Beesley 164540d553cfSPaul BeesleyHow to choose the right base addresses for each bootloader stage image 164640d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164740d553cfSPaul Beesley 164840d553cfSPaul BeesleyThere is currently no support for dynamic image loading in TF-A. This means 164940d553cfSPaul Beesleythat all bootloader images need to be linked against their ultimate runtime 165040d553cfSPaul Beesleylocations and the base addresses of each image must be chosen carefully such 165140d553cfSPaul Beesleythat images don't overlap each other in an undesired way. As the code grows, 165240d553cfSPaul Beesleythe base addresses might need adjustments to cope with the new memory layout. 165340d553cfSPaul Beesley 165440d553cfSPaul BeesleyThe memory layout is completely specific to the platform and so there is no 165540d553cfSPaul Beesleygeneral recipe for choosing the right base addresses for each bootloader image. 165640d553cfSPaul BeesleyHowever, there are tools to aid in understanding the memory layout. These are 165740d553cfSPaul Beesleythe link map files: ``build/<platform>/<build-type>/bl<x>/bl<x>.map``, with ``<x>`` 165840d553cfSPaul Beesleybeing the stage bootloader. They provide a detailed view of the memory usage of 165940d553cfSPaul Beesleyeach image. Among other useful information, they provide the end address of 166040d553cfSPaul Beesleyeach image. 166140d553cfSPaul Beesley 166240d553cfSPaul Beesley- ``bl1.map`` link map file provides ``__BL1_RAM_END__`` address. 166340d553cfSPaul Beesley- ``bl2.map`` link map file provides ``__BL2_END__`` address. 166440d553cfSPaul Beesley- ``bl31.map`` link map file provides ``__BL31_END__`` address. 166540d553cfSPaul Beesley- ``bl32.map`` link map file provides ``__BL32_END__`` address. 166640d553cfSPaul Beesley 166740d553cfSPaul BeesleyFor each bootloader image, the platform code must provide its start address 166840d553cfSPaul Beesleyas well as a limit address that it must not overstep. The latter is used in the 166940d553cfSPaul Beesleylinker scripts to check that the image doesn't grow past that address. If that 167040d553cfSPaul Beesleyhappens, the linker will issue a message similar to the following: 167140d553cfSPaul Beesley 167240d553cfSPaul Beesley:: 167340d553cfSPaul Beesley 167440d553cfSPaul Beesley aarch64-none-elf-ld: BLx has exceeded its limit. 167540d553cfSPaul Beesley 167640d553cfSPaul BeesleyAdditionally, if the platform memory layout implies some image overlaying like 167740d553cfSPaul Beesleyon FVP, BL31 and TSP need to know the limit address that their PROGBITS 167840d553cfSPaul Beesleysections must not overstep. The platform code must provide those. 167940d553cfSPaul Beesley 168040d553cfSPaul BeesleyTF-A does not provide any mechanism to verify at boot time that the memory 168140d553cfSPaul Beesleyto load a new image is free to prevent overwriting a previously loaded image. 168240d553cfSPaul BeesleyThe platform must specify the memory available in the system for all the 168340d553cfSPaul Beesleyrelevant BL images to be loaded. 168440d553cfSPaul Beesley 168540d553cfSPaul BeesleyFor example, in the case of BL1 loading BL2, ``bl1_plat_sec_mem_layout()`` will 168640d553cfSPaul Beesleyreturn the region defined by the platform where BL1 intends to load BL2. The 168740d553cfSPaul Beesley``load_image()`` function performs bounds check for the image size based on the 168840d553cfSPaul Beesleybase and maximum image size provided by the platforms. Platforms must take 168940d553cfSPaul Beesleythis behaviour into account when defining the base/size for each of the images. 169040d553cfSPaul Beesley 169140d553cfSPaul BeesleyMemory layout on Arm development platforms 169240d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 169340d553cfSPaul Beesley 169440d553cfSPaul BeesleyThe following list describes the memory layout on the Arm development platforms: 169540d553cfSPaul Beesley 169640d553cfSPaul Beesley- A 4KB page of shared memory is used for communication between Trusted 169740d553cfSPaul Beesley Firmware and the platform's power controller. This is located at the base of 169840d553cfSPaul Beesley Trusted SRAM. The amount of Trusted SRAM available to load the bootloader 169940d553cfSPaul Beesley images is reduced by the size of the shared memory. 170040d553cfSPaul Beesley 170140d553cfSPaul Beesley The shared memory is used to store the CPUs' entrypoint mailbox. On Juno, 170240d553cfSPaul Beesley this is also used for the MHU payload when passing messages to and from the 170340d553cfSPaul Beesley SCP. 170440d553cfSPaul Beesley 170540d553cfSPaul Beesley- Another 4 KB page is reserved for passing memory layout between BL1 and BL2 170640d553cfSPaul Beesley and also the dynamic firmware configurations. 170740d553cfSPaul Beesley 170840d553cfSPaul Beesley- On FVP, BL1 is originally sitting in the Trusted ROM at address ``0x0``. On 170940d553cfSPaul Beesley Juno, BL1 resides in flash memory at address ``0x0BEC0000``. BL1 read-write 171040d553cfSPaul Beesley data are relocated to the top of Trusted SRAM at runtime. 171140d553cfSPaul Beesley 171240d553cfSPaul Beesley- BL2 is loaded below BL1 RW 171340d553cfSPaul Beesley 171440d553cfSPaul Beesley- EL3 Runtime Software, BL31 for AArch64 and BL32 for AArch32 (e.g. SP_MIN), 171540d553cfSPaul Beesley is loaded at the top of the Trusted SRAM, such that its NOBITS sections will 171640d553cfSPaul Beesley overwrite BL1 R/W data and BL2. This implies that BL1 global variables 171740d553cfSPaul Beesley remain valid only until execution reaches the EL3 Runtime Software entry 171840d553cfSPaul Beesley point during a cold boot. 171940d553cfSPaul Beesley 172040d553cfSPaul Beesley- On Juno, SCP_BL2 is loaded temporarily into the EL3 Runtime Software memory 1721be653a69SPaul Beesley region and transferred to the SCP before being overwritten by EL3 Runtime 172240d553cfSPaul Beesley Software. 172340d553cfSPaul Beesley 172440d553cfSPaul Beesley- BL32 (for AArch64) can be loaded in one of the following locations: 172540d553cfSPaul Beesley 172640d553cfSPaul Beesley - Trusted SRAM 172740d553cfSPaul Beesley - Trusted DRAM (FVP only) 172840d553cfSPaul Beesley - Secure region of DRAM (top 16MB of DRAM configured by the TrustZone 172940d553cfSPaul Beesley controller) 173040d553cfSPaul Beesley 173140d553cfSPaul Beesley When BL32 (for AArch64) is loaded into Trusted SRAM, it is loaded below 173240d553cfSPaul Beesley BL31. 173340d553cfSPaul Beesley 173440d553cfSPaul BeesleyThe location of the BL32 image will result in different memory maps. This is 173540d553cfSPaul Beesleyillustrated for both FVP and Juno in the following diagrams, using the TSP as 173640d553cfSPaul Beesleyan example. 173740d553cfSPaul Beesley 1738e1c5026aSPaul Beesley.. note:: 1739e1c5026aSPaul Beesley Loading the BL32 image in TZC secured DRAM doesn't change the memory 174040d553cfSPaul Beesley layout of the other images in Trusted SRAM. 174140d553cfSPaul Beesley 174240d553cfSPaul BeesleyCONFIG section in memory layouts shown below contains: 174340d553cfSPaul Beesley 174440d553cfSPaul Beesley:: 174540d553cfSPaul Beesley 174640d553cfSPaul Beesley +--------------------+ 174740d553cfSPaul Beesley |bl2_mem_params_descs| 174840d553cfSPaul Beesley |--------------------| 174940d553cfSPaul Beesley | fw_configs | 175040d553cfSPaul Beesley +--------------------+ 175140d553cfSPaul Beesley 175240d553cfSPaul Beesley``bl2_mem_params_descs`` contains parameters passed from BL2 to next the 175340d553cfSPaul BeesleyBL image during boot. 175440d553cfSPaul Beesley 1755089fc624SManish V Badarkhe``fw_configs`` includes soc_fw_config, tos_fw_config, tb_fw_config and fw_config. 175640d553cfSPaul Beesley 175740d553cfSPaul Beesley**FVP with TSP in Trusted SRAM with firmware configs :** 175840d553cfSPaul Beesley(These diagrams only cover the AArch64 case) 175940d553cfSPaul Beesley 176040d553cfSPaul Beesley:: 176140d553cfSPaul Beesley 176240d553cfSPaul Beesley DRAM 176340d553cfSPaul Beesley 0xffffffff +----------+ 1764a52c5251SManish V Badarkhe | EL3 TZC | 1765a52c5251SManish V Badarkhe 0xffe00000 |----------| (secure) 1766a52c5251SManish V Badarkhe | AP TZC | 1767a52c5251SManish V Badarkhe 0xff000000 +----------+ 176840d553cfSPaul Beesley : : 1769b4a87836SManish V Badarkhe 0x82100000 |----------| 177040d553cfSPaul Beesley |HW_CONFIG | 1771b4a87836SManish V Badarkhe 0x82000000 |----------| (non-secure) 177240d553cfSPaul Beesley | | 177340d553cfSPaul Beesley 0x80000000 +----------+ 177440d553cfSPaul Beesley 1775b4a87836SManish V Badarkhe Trusted DRAM 1776b4a87836SManish V Badarkhe 0x08000000 +----------+ 1777b4a87836SManish V Badarkhe |HW_CONFIG | 1778b4a87836SManish V Badarkhe 0x07f00000 |----------| 1779b4a87836SManish V Badarkhe : : 1780b4a87836SManish V Badarkhe | | 1781b4a87836SManish V Badarkhe 0x06000000 +----------+ 1782b4a87836SManish V Badarkhe 178340d553cfSPaul Beesley Trusted SRAM 178440d553cfSPaul Beesley 0x04040000 +----------+ loaded by BL2 +----------------+ 178540d553cfSPaul Beesley | BL1 (rw) | <<<<<<<<<<<<< | | 178640d553cfSPaul Beesley |----------| <<<<<<<<<<<<< | BL31 NOBITS | 178740d553cfSPaul Beesley | BL2 | <<<<<<<<<<<<< | | 178840d553cfSPaul Beesley |----------| <<<<<<<<<<<<< |----------------| 178940d553cfSPaul Beesley | | <<<<<<<<<<<<< | BL31 PROGBITS | 179040d553cfSPaul Beesley | | <<<<<<<<<<<<< |----------------| 179140d553cfSPaul Beesley | | <<<<<<<<<<<<< | BL32 | 1792089fc624SManish V Badarkhe 0x04003000 +----------+ +----------------+ 179340d553cfSPaul Beesley | CONFIG | 179440d553cfSPaul Beesley 0x04001000 +----------+ 179540d553cfSPaul Beesley | Shared | 179640d553cfSPaul Beesley 0x04000000 +----------+ 179740d553cfSPaul Beesley 179840d553cfSPaul Beesley Trusted ROM 179940d553cfSPaul Beesley 0x04000000 +----------+ 180040d553cfSPaul Beesley | BL1 (ro) | 180140d553cfSPaul Beesley 0x00000000 +----------+ 180240d553cfSPaul Beesley 180340d553cfSPaul Beesley**FVP with TSP in Trusted DRAM with firmware configs (default option):** 180440d553cfSPaul Beesley 180540d553cfSPaul Beesley:: 180640d553cfSPaul Beesley 180740d553cfSPaul Beesley DRAM 180840d553cfSPaul Beesley 0xffffffff +--------------+ 1809a52c5251SManish V Badarkhe | EL3 TZC | 1810a52c5251SManish V Badarkhe 0xffe00000 |--------------| (secure) 1811a52c5251SManish V Badarkhe | AP TZC | 1812a52c5251SManish V Badarkhe 0xff000000 +--------------+ 181340d553cfSPaul Beesley : : 1814b4a87836SManish V Badarkhe 0x82100000 |--------------| 181540d553cfSPaul Beesley | HW_CONFIG | 1816b4a87836SManish V Badarkhe 0x82000000 |--------------| (non-secure) 181740d553cfSPaul Beesley | | 181840d553cfSPaul Beesley 0x80000000 +--------------+ 181940d553cfSPaul Beesley 182040d553cfSPaul Beesley Trusted DRAM 182140d553cfSPaul Beesley 0x08000000 +--------------+ 1822b4a87836SManish V Badarkhe | HW_CONFIG | 1823b4a87836SManish V Badarkhe 0x07f00000 |--------------| 1824b4a87836SManish V Badarkhe : : 182540d553cfSPaul Beesley | BL32 | 182640d553cfSPaul Beesley 0x06000000 +--------------+ 182740d553cfSPaul Beesley 182840d553cfSPaul Beesley Trusted SRAM 182940d553cfSPaul Beesley 0x04040000 +--------------+ loaded by BL2 +----------------+ 183040d553cfSPaul Beesley | BL1 (rw) | <<<<<<<<<<<<< | | 183140d553cfSPaul Beesley |--------------| <<<<<<<<<<<<< | BL31 NOBITS | 183240d553cfSPaul Beesley | BL2 | <<<<<<<<<<<<< | | 183340d553cfSPaul Beesley |--------------| <<<<<<<<<<<<< |----------------| 183440d553cfSPaul Beesley | | <<<<<<<<<<<<< | BL31 PROGBITS | 183540d553cfSPaul Beesley | | +----------------+ 1836089fc624SManish V Badarkhe 0x04003000 +--------------+ 183740d553cfSPaul Beesley | CONFIG | 183840d553cfSPaul Beesley 0x04001000 +--------------+ 183940d553cfSPaul Beesley | Shared | 184040d553cfSPaul Beesley 0x04000000 +--------------+ 184140d553cfSPaul Beesley 184240d553cfSPaul Beesley Trusted ROM 184340d553cfSPaul Beesley 0x04000000 +--------------+ 184440d553cfSPaul Beesley | BL1 (ro) | 184540d553cfSPaul Beesley 0x00000000 +--------------+ 184640d553cfSPaul Beesley 184740d553cfSPaul Beesley**FVP with TSP in TZC-Secured DRAM with firmware configs :** 184840d553cfSPaul Beesley 184940d553cfSPaul Beesley:: 185040d553cfSPaul Beesley 185140d553cfSPaul Beesley DRAM 185240d553cfSPaul Beesley 0xffffffff +----------+ 1853a52c5251SManish V Badarkhe | EL3 TZC | 1854a52c5251SManish V Badarkhe 0xffe00000 |----------| (secure) 1855a52c5251SManish V Badarkhe | AP TZC | 1856a52c5251SManish V Badarkhe | (BL32) | 185740d553cfSPaul Beesley 0xff000000 +----------+ 185840d553cfSPaul Beesley | | 1859b4a87836SManish V Badarkhe 0x82100000 |----------| 186040d553cfSPaul Beesley |HW_CONFIG | 1861b4a87836SManish V Badarkhe 0x82000000 |----------| (non-secure) 186240d553cfSPaul Beesley | | 186340d553cfSPaul Beesley 0x80000000 +----------+ 186440d553cfSPaul Beesley 1865b4a87836SManish V Badarkhe Trusted DRAM 1866b4a87836SManish V Badarkhe 0x08000000 +----------+ 1867b4a87836SManish V Badarkhe |HW_CONFIG | 1868b4a87836SManish V Badarkhe 0x7f000000 |----------| 1869b4a87836SManish V Badarkhe : : 1870b4a87836SManish V Badarkhe | | 1871b4a87836SManish V Badarkhe 0x06000000 +----------+ 1872b4a87836SManish V Badarkhe 187340d553cfSPaul Beesley Trusted SRAM 187440d553cfSPaul Beesley 0x04040000 +----------+ loaded by BL2 +----------------+ 187540d553cfSPaul Beesley | BL1 (rw) | <<<<<<<<<<<<< | | 187640d553cfSPaul Beesley |----------| <<<<<<<<<<<<< | BL31 NOBITS | 187740d553cfSPaul Beesley | BL2 | <<<<<<<<<<<<< | | 187840d553cfSPaul Beesley |----------| <<<<<<<<<<<<< |----------------| 187940d553cfSPaul Beesley | | <<<<<<<<<<<<< | BL31 PROGBITS | 188040d553cfSPaul Beesley | | +----------------+ 1881089fc624SManish V Badarkhe 0x04003000 +----------+ 188240d553cfSPaul Beesley | CONFIG | 188340d553cfSPaul Beesley 0x04001000 +----------+ 188440d553cfSPaul Beesley | Shared | 188540d553cfSPaul Beesley 0x04000000 +----------+ 188640d553cfSPaul Beesley 188740d553cfSPaul Beesley Trusted ROM 188840d553cfSPaul Beesley 0x04000000 +----------+ 188940d553cfSPaul Beesley | BL1 (ro) | 189040d553cfSPaul Beesley 0x00000000 +----------+ 189140d553cfSPaul Beesley 189240d553cfSPaul Beesley**Juno with BL32 in Trusted SRAM :** 189340d553cfSPaul Beesley 189440d553cfSPaul Beesley:: 189540d553cfSPaul Beesley 1896a52c5251SManish V Badarkhe DRAM 1897a52c5251SManish V Badarkhe 0xFFFFFFFF +----------+ 1898a52c5251SManish V Badarkhe | SCP TZC | 1899a52c5251SManish V Badarkhe 0xFFE00000 |----------| 1900a52c5251SManish V Badarkhe | EL3 TZC | 1901a52c5251SManish V Badarkhe 0xFFC00000 |----------| (secure) 1902a52c5251SManish V Badarkhe | AP TZC | 1903a52c5251SManish V Badarkhe 0xFF000000 +----------+ 1904a52c5251SManish V Badarkhe | | 1905a52c5251SManish V Badarkhe : : (non-secure) 1906a52c5251SManish V Badarkhe | | 1907a52c5251SManish V Badarkhe 0x80000000 +----------+ 1908a52c5251SManish V Badarkhe 1909a52c5251SManish V Badarkhe 191040d553cfSPaul Beesley Flash0 191140d553cfSPaul Beesley 0x0C000000 +----------+ 191240d553cfSPaul Beesley : : 191340d553cfSPaul Beesley 0x0BED0000 |----------| 191440d553cfSPaul Beesley | BL1 (ro) | 191540d553cfSPaul Beesley 0x0BEC0000 |----------| 191640d553cfSPaul Beesley : : 191740d553cfSPaul Beesley 0x08000000 +----------+ BL31 is loaded 191840d553cfSPaul Beesley after SCP_BL2 has 191940d553cfSPaul Beesley Trusted SRAM been sent to SCP 192040d553cfSPaul Beesley 0x04040000 +----------+ loaded by BL2 +----------------+ 192140d553cfSPaul Beesley | BL1 (rw) | <<<<<<<<<<<<< | | 192240d553cfSPaul Beesley |----------| <<<<<<<<<<<<< | BL31 NOBITS | 192340d553cfSPaul Beesley | BL2 | <<<<<<<<<<<<< | | 192440d553cfSPaul Beesley |----------| <<<<<<<<<<<<< |----------------| 192540d553cfSPaul Beesley | SCP_BL2 | <<<<<<<<<<<<< | BL31 PROGBITS | 1926ddc93cbaSChris Kay | | <<<<<<<<<<<<< |----------------| 192740d553cfSPaul Beesley | | <<<<<<<<<<<<< | BL32 | 192840d553cfSPaul Beesley | | +----------------+ 192940d553cfSPaul Beesley | | 193040d553cfSPaul Beesley 0x04001000 +----------+ 193140d553cfSPaul Beesley | MHU | 193240d553cfSPaul Beesley 0x04000000 +----------+ 193340d553cfSPaul Beesley 193440d553cfSPaul Beesley**Juno with BL32 in TZC-secured DRAM :** 193540d553cfSPaul Beesley 193640d553cfSPaul Beesley:: 193740d553cfSPaul Beesley 193840d553cfSPaul Beesley DRAM 1939a52c5251SManish V Badarkhe 0xFFFFFFFF +----------+ 1940a52c5251SManish V Badarkhe | SCP TZC | 1941a52c5251SManish V Badarkhe 0xFFE00000 |----------| 1942a52c5251SManish V Badarkhe | EL3 TZC | 1943a52c5251SManish V Badarkhe 0xFFC00000 |----------| (secure) 1944a52c5251SManish V Badarkhe | AP TZC | 1945a52c5251SManish V Badarkhe | (BL32) | 1946a52c5251SManish V Badarkhe 0xFF000000 +----------+ 194740d553cfSPaul Beesley | | 194840d553cfSPaul Beesley : : (non-secure) 194940d553cfSPaul Beesley | | 195040d553cfSPaul Beesley 0x80000000 +----------+ 195140d553cfSPaul Beesley 195240d553cfSPaul Beesley Flash0 195340d553cfSPaul Beesley 0x0C000000 +----------+ 195440d553cfSPaul Beesley : : 195540d553cfSPaul Beesley 0x0BED0000 |----------| 195640d553cfSPaul Beesley | BL1 (ro) | 195740d553cfSPaul Beesley 0x0BEC0000 |----------| 195840d553cfSPaul Beesley : : 195940d553cfSPaul Beesley 0x08000000 +----------+ BL31 is loaded 196040d553cfSPaul Beesley after SCP_BL2 has 196140d553cfSPaul Beesley Trusted SRAM been sent to SCP 196240d553cfSPaul Beesley 0x04040000 +----------+ loaded by BL2 +----------------+ 196340d553cfSPaul Beesley | BL1 (rw) | <<<<<<<<<<<<< | | 196440d553cfSPaul Beesley |----------| <<<<<<<<<<<<< | BL31 NOBITS | 196540d553cfSPaul Beesley | BL2 | <<<<<<<<<<<<< | | 196640d553cfSPaul Beesley |----------| <<<<<<<<<<<<< |----------------| 196740d553cfSPaul Beesley | SCP_BL2 | <<<<<<<<<<<<< | BL31 PROGBITS | 1968ddc93cbaSChris Kay | | +----------------+ 196940d553cfSPaul Beesley 0x04001000 +----------+ 197040d553cfSPaul Beesley | MHU | 197140d553cfSPaul Beesley 0x04000000 +----------+ 197240d553cfSPaul Beesley 197343f35ef5SPaul Beesley.. _firmware_design_fip: 197440d553cfSPaul Beesley 197540d553cfSPaul BeesleyFirmware Image Package (FIP) 197640d553cfSPaul Beesley---------------------------- 197740d553cfSPaul Beesley 197840d553cfSPaul BeesleyUsing a Firmware Image Package (FIP) allows for packing bootloader images (and 197940d553cfSPaul Beesleypotentially other payloads) into a single archive that can be loaded by TF-A 198040d553cfSPaul Beesleyfrom non-volatile platform storage. A driver to load images from a FIP has 198140d553cfSPaul Beesleybeen added to the storage layer and allows a package to be read from supported 198240d553cfSPaul Beesleyplatform storage. A tool to create Firmware Image Packages is also provided 198340d553cfSPaul Beesleyand described below. 198440d553cfSPaul Beesley 198540d553cfSPaul BeesleyFirmware Image Package layout 198640d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198740d553cfSPaul Beesley 198840d553cfSPaul BeesleyThe FIP layout consists of a table of contents (ToC) followed by payload data. 198940d553cfSPaul BeesleyThe ToC itself has a header followed by one or more table entries. The ToC is 199040d553cfSPaul Beesleyterminated by an end marker entry, and since the size of the ToC is 0 bytes, 199140d553cfSPaul Beesleythe offset equals the total size of the FIP file. All ToC entries describe some 199240d553cfSPaul Beesleypayload data that has been appended to the end of the binary package. With the 199340d553cfSPaul Beesleyinformation provided in the ToC entry the corresponding payload data can be 199440d553cfSPaul Beesleyretrieved. 199540d553cfSPaul Beesley 199640d553cfSPaul Beesley:: 199740d553cfSPaul Beesley 199840d553cfSPaul Beesley ------------------ 199940d553cfSPaul Beesley | ToC Header | 200040d553cfSPaul Beesley |----------------| 200140d553cfSPaul Beesley | ToC Entry 0 | 200240d553cfSPaul Beesley |----------------| 200340d553cfSPaul Beesley | ToC Entry 1 | 200440d553cfSPaul Beesley |----------------| 200540d553cfSPaul Beesley | ToC End Marker | 200640d553cfSPaul Beesley |----------------| 200740d553cfSPaul Beesley | | 200840d553cfSPaul Beesley | Data 0 | 200940d553cfSPaul Beesley | | 201040d553cfSPaul Beesley |----------------| 201140d553cfSPaul Beesley | | 201240d553cfSPaul Beesley | Data 1 | 201340d553cfSPaul Beesley | | 201440d553cfSPaul Beesley ------------------ 201540d553cfSPaul Beesley 201640d553cfSPaul BeesleyThe ToC header and entry formats are described in the header file 201740d553cfSPaul Beesley``include/tools_share/firmware_image_package.h``. This file is used by both the 201840d553cfSPaul Beesleytool and TF-A. 201940d553cfSPaul Beesley 202040d553cfSPaul BeesleyThe ToC header has the following fields: 202140d553cfSPaul Beesley 202240d553cfSPaul Beesley:: 202340d553cfSPaul Beesley 202440d553cfSPaul Beesley `name`: The name of the ToC. This is currently used to validate the header. 202540d553cfSPaul Beesley `serial_number`: A non-zero number provided by the creation tool 202640d553cfSPaul Beesley `flags`: Flags associated with this data. 202740d553cfSPaul Beesley Bits 0-31: Reserved 202840d553cfSPaul Beesley Bits 32-47: Platform defined 202940d553cfSPaul Beesley Bits 48-63: Reserved 203040d553cfSPaul Beesley 203140d553cfSPaul BeesleyA ToC entry has the following fields: 203240d553cfSPaul Beesley 203340d553cfSPaul Beesley:: 203440d553cfSPaul Beesley 203540d553cfSPaul Beesley `uuid`: All files are referred to by a pre-defined Universally Unique 203640d553cfSPaul Beesley IDentifier [UUID] . The UUIDs are defined in 203740d553cfSPaul Beesley `include/tools_share/firmware_image_package.h`. The platform translates 203840d553cfSPaul Beesley the requested image name into the corresponding UUID when accessing the 203940d553cfSPaul Beesley package. 204040d553cfSPaul Beesley `offset_address`: The offset address at which the corresponding payload data 204140d553cfSPaul Beesley can be found. The offset is calculated from the ToC base address. 204240d553cfSPaul Beesley `size`: The size of the corresponding payload data in bytes. 204340d553cfSPaul Beesley `flags`: Flags associated with this entry. None are yet defined. 204440d553cfSPaul Beesley 204540d553cfSPaul BeesleyFirmware Image Package creation tool 204640d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 204740d553cfSPaul Beesley 204840d553cfSPaul BeesleyThe FIP creation tool can be used to pack specified images into a binary 204940d553cfSPaul Beesleypackage that can be loaded by TF-A from platform storage. The tool currently 205040d553cfSPaul Beesleyonly supports packing bootloader images. Additional image definitions can be 205140d553cfSPaul Beesleyadded to the tool as required. 205240d553cfSPaul Beesley 205340d553cfSPaul BeesleyThe tool can be found in ``tools/fiptool``. 205440d553cfSPaul Beesley 205540d553cfSPaul BeesleyLoading from a Firmware Image Package (FIP) 205640d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205740d553cfSPaul Beesley 205840d553cfSPaul BeesleyThe Firmware Image Package (FIP) driver can load images from a binary package on 205940d553cfSPaul Beesleynon-volatile platform storage. For the Arm development platforms, this is 206040d553cfSPaul Beesleycurrently NOR FLASH. 206140d553cfSPaul Beesley 206240d553cfSPaul BeesleyBootloader images are loaded according to the platform policy as specified by 206340d553cfSPaul Beesleythe function ``plat_get_image_source()``. For the Arm development platforms, this 206440d553cfSPaul Beesleymeans the platform will attempt to load images from a Firmware Image Package 206540d553cfSPaul Beesleylocated at the start of NOR FLASH0. 206640d553cfSPaul Beesley 206740d553cfSPaul BeesleyThe Arm development platforms' policy is to only allow loading of a known set of 206840d553cfSPaul Beesleyimages. The platform policy can be modified to allow additional images. 206940d553cfSPaul Beesley 207040d553cfSPaul BeesleyUse of coherent memory in TF-A 207140d553cfSPaul Beesley------------------------------ 207240d553cfSPaul Beesley 207340d553cfSPaul BeesleyThere might be loss of coherency when physical memory with mismatched 207440d553cfSPaul Beesleyshareability, cacheability and memory attributes is accessed by multiple CPUs 207540d553cfSPaul Beesley(refer to section B2.9 of `Arm ARM`_ for more details). This possibility occurs 207640d553cfSPaul Beesleyin TF-A during power up/down sequences when coherency, MMU and caches are 207740d553cfSPaul Beesleyturned on/off incrementally. 207840d553cfSPaul Beesley 207940d553cfSPaul BeesleyTF-A defines coherent memory as a region of memory with Device nGnRE attributes 208040d553cfSPaul Beesleyin the translation tables. The translation granule size in TF-A is 4KB. This 208140d553cfSPaul Beesleyis the smallest possible size of the coherent memory region. 208240d553cfSPaul Beesley 208340d553cfSPaul BeesleyBy default, all data structures which are susceptible to accesses with 208440d553cfSPaul Beesleymismatched attributes from various CPUs are allocated in a coherent memory 208534760951SPaul Beesleyregion (refer to section 2.1 of :ref:`Porting Guide`). The coherent memory 208634760951SPaul Beesleyregion accesses are Outer Shareable, non-cacheable and they can be accessed with 208734760951SPaul Beesleythe Device nGnRE attributes when the MMU is turned on. Hence, at the expense of 208834760951SPaul Beesleyat least an extra page of memory, TF-A is able to work around coherency issues 208934760951SPaul Beesleydue to mismatched memory attributes. 209040d553cfSPaul Beesley 209140d553cfSPaul BeesleyThe alternative to the above approach is to allocate the susceptible data 209240d553cfSPaul Beesleystructures in Normal WriteBack WriteAllocate Inner shareable memory. This 209340d553cfSPaul Beesleyapproach requires the data structures to be designed so that it is possible to 209440d553cfSPaul Beesleywork around the issue of mismatched memory attributes by performing software 209540d553cfSPaul Beesleycache maintenance on them. 209640d553cfSPaul Beesley 209740d553cfSPaul BeesleyDisabling the use of coherent memory in TF-A 209840d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 209940d553cfSPaul Beesley 210040d553cfSPaul BeesleyIt might be desirable to avoid the cost of allocating coherent memory on 210140d553cfSPaul Beesleyplatforms which are memory constrained. TF-A enables inclusion of coherent 210240d553cfSPaul Beesleymemory in firmware images through the build flag ``USE_COHERENT_MEM``. 210340d553cfSPaul BeesleyThis flag is enabled by default. It can be disabled to choose the second 210440d553cfSPaul Beesleyapproach described above. 210540d553cfSPaul Beesley 210640d553cfSPaul BeesleyThe below sections analyze the data structures allocated in the coherent memory 210740d553cfSPaul Beesleyregion and the changes required to allocate them in normal memory. 210840d553cfSPaul Beesley 210940d553cfSPaul BeesleyCoherent memory usage in PSCI implementation 211040d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211140d553cfSPaul Beesley 211240d553cfSPaul BeesleyThe ``psci_non_cpu_pd_nodes`` data structure stores the platform's power domain 211340d553cfSPaul Beesleytree information for state management of power domains. By default, this data 211440d553cfSPaul Beesleystructure is allocated in the coherent memory region in TF-A because it can be 211540d553cfSPaul Beesleyaccessed by multiple CPUs, either with caches enabled or disabled. 211640d553cfSPaul Beesley 211740d553cfSPaul Beesley.. code:: c 211840d553cfSPaul Beesley 211940d553cfSPaul Beesley typedef struct non_cpu_pwr_domain_node { 212040d553cfSPaul Beesley /* 212140d553cfSPaul Beesley * Index of the first CPU power domain node level 0 which has this node 212240d553cfSPaul Beesley * as its parent. 212340d553cfSPaul Beesley */ 212440d553cfSPaul Beesley unsigned int cpu_start_idx; 212540d553cfSPaul Beesley 212640d553cfSPaul Beesley /* 212740d553cfSPaul Beesley * Number of CPU power domains which are siblings of the domain indexed 212840d553cfSPaul Beesley * by 'cpu_start_idx' i.e. all the domains in the range 'cpu_start_idx 212940d553cfSPaul Beesley * -> cpu_start_idx + ncpus' have this node as their parent. 213040d553cfSPaul Beesley */ 213140d553cfSPaul Beesley unsigned int ncpus; 213240d553cfSPaul Beesley 213340d553cfSPaul Beesley /* 213440d553cfSPaul Beesley * Index of the parent power domain node. 213540d553cfSPaul Beesley */ 213640d553cfSPaul Beesley unsigned int parent_node; 213740d553cfSPaul Beesley 213840d553cfSPaul Beesley plat_local_state_t local_state; 213940d553cfSPaul Beesley 214040d553cfSPaul Beesley unsigned char level; 214140d553cfSPaul Beesley 214240d553cfSPaul Beesley /* For indexing the psci_lock array*/ 214340d553cfSPaul Beesley unsigned char lock_index; 214440d553cfSPaul Beesley } non_cpu_pd_node_t; 214540d553cfSPaul Beesley 214640d553cfSPaul BeesleyIn order to move this data structure to normal memory, the use of each of its 214740d553cfSPaul Beesleyfields must be analyzed. Fields like ``cpu_start_idx``, ``ncpus``, ``parent_node`` 214840d553cfSPaul Beesley``level`` and ``lock_index`` are only written once during cold boot. Hence removing 214940d553cfSPaul Beesleythem from coherent memory involves only doing a clean and invalidate of the 215040d553cfSPaul Beesleycache lines after these fields are written. 215140d553cfSPaul Beesley 215240d553cfSPaul BeesleyThe field ``local_state`` can be concurrently accessed by multiple CPUs in 215340d553cfSPaul Beesleydifferent cache states. A Lamport's Bakery lock ``psci_locks`` is used to ensure 215440d553cfSPaul Beesleymutual exclusion to this field and a clean and invalidate is needed after it 215540d553cfSPaul Beesleyis written. 215640d553cfSPaul Beesley 215740d553cfSPaul BeesleyBakery lock data 215840d553cfSPaul Beesley~~~~~~~~~~~~~~~~ 215940d553cfSPaul Beesley 216040d553cfSPaul BeesleyThe bakery lock data structure ``bakery_lock_t`` is allocated in coherent memory 216140d553cfSPaul Beesleyand is accessed by multiple CPUs with mismatched attributes. ``bakery_lock_t`` is 216240d553cfSPaul Beesleydefined as follows: 216340d553cfSPaul Beesley 216440d553cfSPaul Beesley.. code:: c 216540d553cfSPaul Beesley 216640d553cfSPaul Beesley typedef struct bakery_lock { 216740d553cfSPaul Beesley /* 216840d553cfSPaul Beesley * The lock_data is a bit-field of 2 members: 216940d553cfSPaul Beesley * Bit[0] : choosing. This field is set when the CPU is 217040d553cfSPaul Beesley * choosing its bakery number. 217140d553cfSPaul Beesley * Bits[1 - 15] : number. This is the bakery number allocated. 217240d553cfSPaul Beesley */ 217340d553cfSPaul Beesley volatile uint16_t lock_data[BAKERY_LOCK_MAX_CPUS]; 217440d553cfSPaul Beesley } bakery_lock_t; 217540d553cfSPaul Beesley 217640d553cfSPaul BeesleyIt is a characteristic of Lamport's Bakery algorithm that the volatile per-CPU 217740d553cfSPaul Beesleyfields can be read by all CPUs but only written to by the owning CPU. 217840d553cfSPaul Beesley 217940d553cfSPaul BeesleyDepending upon the data cache line size, the per-CPU fields of the 218040d553cfSPaul Beesley``bakery_lock_t`` structure for multiple CPUs may exist on a single cache line. 218140d553cfSPaul BeesleyThese per-CPU fields can be read and written during lock contention by multiple 218240d553cfSPaul BeesleyCPUs with mismatched memory attributes. Since these fields are a part of the 218340d553cfSPaul Beesleylock implementation, they do not have access to any other locking primitive to 218440d553cfSPaul Beesleysafeguard against the resulting coherency issues. As a result, simple software 218540d553cfSPaul Beesleycache maintenance is not enough to allocate them in coherent memory. Consider 218640d553cfSPaul Beesleythe following example. 218740d553cfSPaul Beesley 218840d553cfSPaul BeesleyCPU0 updates its per-CPU field with data cache enabled. This write updates a 218940d553cfSPaul Beesleylocal cache line which contains a copy of the fields for other CPUs as well. Now 219040d553cfSPaul BeesleyCPU1 updates its per-CPU field of the ``bakery_lock_t`` structure with data cache 219140d553cfSPaul Beesleydisabled. CPU1 then issues a DCIVAC operation to invalidate any stale copies of 219240d553cfSPaul Beesleyits field in any other cache line in the system. This operation will invalidate 219340d553cfSPaul Beesleythe update made by CPU0 as well. 219440d553cfSPaul Beesley 219540d553cfSPaul BeesleyTo use bakery locks when ``USE_COHERENT_MEM`` is disabled, the lock data structure 219640d553cfSPaul Beesleyhas been redesigned. The changes utilise the characteristic of Lamport's Bakery 219740d553cfSPaul Beesleyalgorithm mentioned earlier. The bakery_lock structure only allocates the memory 219840d553cfSPaul Beesleyfor a single CPU. The macro ``DEFINE_BAKERY_LOCK`` allocates all the bakery locks 2199da04341eSChris Kayneeded for a CPU into a section ``.bakery_lock``. The linker allocates the memory 220040d553cfSPaul Beesleyfor other cores by using the total size allocated for the bakery_lock section 220140d553cfSPaul Beesleyand multiplying it with (PLATFORM_CORE_COUNT - 1). This enables software to 220240d553cfSPaul Beesleyperform software cache maintenance on the lock data structure without running 220340d553cfSPaul Beesleyinto coherency issues associated with mismatched attributes. 220440d553cfSPaul Beesley 220540d553cfSPaul BeesleyThe bakery lock data structure ``bakery_info_t`` is defined for use when 220640d553cfSPaul Beesley``USE_COHERENT_MEM`` is disabled as follows: 220740d553cfSPaul Beesley 220840d553cfSPaul Beesley.. code:: c 220940d553cfSPaul Beesley 221040d553cfSPaul Beesley typedef struct bakery_info { 221140d553cfSPaul Beesley /* 221240d553cfSPaul Beesley * The lock_data is a bit-field of 2 members: 221340d553cfSPaul Beesley * Bit[0] : choosing. This field is set when the CPU is 221440d553cfSPaul Beesley * choosing its bakery number. 221540d553cfSPaul Beesley * Bits[1 - 15] : number. This is the bakery number allocated. 221640d553cfSPaul Beesley */ 221740d553cfSPaul Beesley volatile uint16_t lock_data; 221840d553cfSPaul Beesley } bakery_info_t; 221940d553cfSPaul Beesley 222040d553cfSPaul BeesleyThe ``bakery_info_t`` represents a single per-CPU field of one lock and 222140d553cfSPaul Beesleythe combination of corresponding ``bakery_info_t`` structures for all CPUs in the 222240d553cfSPaul Beesleysystem represents the complete bakery lock. The view in memory for a system 222340d553cfSPaul Beesleywith n bakery locks are: 222440d553cfSPaul Beesley 222540d553cfSPaul Beesley:: 222640d553cfSPaul Beesley 2227da04341eSChris Kay .bakery_lock section start 222840d553cfSPaul Beesley |----------------| 222940d553cfSPaul Beesley | `bakery_info_t`| <-- Lock_0 per-CPU field 223040d553cfSPaul Beesley | Lock_0 | for CPU0 223140d553cfSPaul Beesley |----------------| 223240d553cfSPaul Beesley | `bakery_info_t`| <-- Lock_1 per-CPU field 223340d553cfSPaul Beesley | Lock_1 | for CPU0 223440d553cfSPaul Beesley |----------------| 223540d553cfSPaul Beesley | .... | 223640d553cfSPaul Beesley |----------------| 223740d553cfSPaul Beesley | `bakery_info_t`| <-- Lock_N per-CPU field 223840d553cfSPaul Beesley | Lock_N | for CPU0 223940d553cfSPaul Beesley ------------------ 224040d553cfSPaul Beesley | XXXXX | 224140d553cfSPaul Beesley | Padding to | 224240d553cfSPaul Beesley | next Cache WB | <--- Calculate PERCPU_BAKERY_LOCK_SIZE, allocate 224340d553cfSPaul Beesley | Granule | continuous memory for remaining CPUs. 224440d553cfSPaul Beesley ------------------ 224540d553cfSPaul Beesley | `bakery_info_t`| <-- Lock_0 per-CPU field 224640d553cfSPaul Beesley | Lock_0 | for CPU1 224740d553cfSPaul Beesley |----------------| 224840d553cfSPaul Beesley | `bakery_info_t`| <-- Lock_1 per-CPU field 224940d553cfSPaul Beesley | Lock_1 | for CPU1 225040d553cfSPaul Beesley |----------------| 225140d553cfSPaul Beesley | .... | 225240d553cfSPaul Beesley |----------------| 225340d553cfSPaul Beesley | `bakery_info_t`| <-- Lock_N per-CPU field 225440d553cfSPaul Beesley | Lock_N | for CPU1 225540d553cfSPaul Beesley ------------------ 225640d553cfSPaul Beesley | XXXXX | 225740d553cfSPaul Beesley | Padding to | 225840d553cfSPaul Beesley | next Cache WB | 225940d553cfSPaul Beesley | Granule | 226040d553cfSPaul Beesley ------------------ 226140d553cfSPaul Beesley 226240d553cfSPaul BeesleyConsider a system of 2 CPUs with 'N' bakery locks as shown above. For an 226340d553cfSPaul Beesleyoperation on Lock_N, the corresponding ``bakery_info_t`` in both CPU0 and CPU1 2264da04341eSChris Kay``.bakery_lock`` section need to be fetched and appropriate cache operations need 226540d553cfSPaul Beesleyto be performed for each access. 226640d553cfSPaul Beesley 226740d553cfSPaul BeesleyOn Arm Platforms, bakery locks are used in psci (``psci_locks``) and power controller 226840d553cfSPaul Beesleydriver (``arm_lock``). 226940d553cfSPaul Beesley 227040d553cfSPaul BeesleyNon Functional Impact of removing coherent memory 227140d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227240d553cfSPaul Beesley 227340d553cfSPaul BeesleyRemoval of the coherent memory region leads to the additional software overhead 227440d553cfSPaul Beesleyof performing cache maintenance for the affected data structures. However, since 227540d553cfSPaul Beesleythe memory where the data structures are allocated is cacheable, the overhead is 227640d553cfSPaul Beesleymostly mitigated by an increase in performance. 227740d553cfSPaul Beesley 227840d553cfSPaul BeesleyThere is however a performance impact for bakery locks, due to: 227940d553cfSPaul Beesley 228040d553cfSPaul Beesley- Additional cache maintenance operations, and 228140d553cfSPaul Beesley- Multiple cache line reads for each lock operation, since the bakery locks 228240d553cfSPaul Beesley for each CPU are distributed across different cache lines. 228340d553cfSPaul Beesley 228440d553cfSPaul BeesleyThe implementation has been optimized to minimize this additional overhead. 228540d553cfSPaul BeesleyMeasurements indicate that when bakery locks are allocated in Normal memory, the 228640d553cfSPaul Beesleyminimum latency of acquiring a lock is on an average 3-4 micro seconds whereas 228740d553cfSPaul Beesleyin Device memory the same is 2 micro seconds. The measurements were done on the 228840d553cfSPaul BeesleyJuno Arm development platform. 228940d553cfSPaul Beesley 229040d553cfSPaul BeesleyAs mentioned earlier, almost a page of memory can be saved by disabling 229140d553cfSPaul Beesley``USE_COHERENT_MEM``. Each platform needs to consider these trade-offs to decide 229240d553cfSPaul Beesleywhether coherent memory should be used. If a platform disables 229340d553cfSPaul Beesley``USE_COHERENT_MEM`` and needs to use bakery locks in the porting layer, it can 229440d553cfSPaul Beesleyoptionally define macro ``PLAT_PERCPU_BAKERY_LOCK_SIZE`` (see the 229534760951SPaul Beesley:ref:`Porting Guide`). Refer to the reference platform code for examples. 229640d553cfSPaul Beesley 229740d553cfSPaul BeesleyIsolating code and read-only data on separate memory pages 229840d553cfSPaul Beesley---------------------------------------------------------- 229940d553cfSPaul Beesley 230040d553cfSPaul BeesleyIn the Armv8-A VMSA, translation table entries include fields that define the 230140d553cfSPaul Beesleyproperties of the target memory region, such as its access permissions. The 230240d553cfSPaul Beesleysmallest unit of memory that can be addressed by a translation table entry is 230340d553cfSPaul Beesleya memory page. Therefore, if software needs to set different permissions on two 230440d553cfSPaul Beesleymemory regions then it needs to map them using different memory pages. 230540d553cfSPaul Beesley 230640d553cfSPaul BeesleyThe default memory layout for each BL image is as follows: 230740d553cfSPaul Beesley 230840d553cfSPaul Beesley:: 230940d553cfSPaul Beesley 231040d553cfSPaul Beesley | ... | 231140d553cfSPaul Beesley +-------------------+ 231240d553cfSPaul Beesley | Read-write data | 231340d553cfSPaul Beesley +-------------------+ Page boundary 231440d553cfSPaul Beesley | <Padding> | 231540d553cfSPaul Beesley +-------------------+ 231640d553cfSPaul Beesley | Exception vectors | 231740d553cfSPaul Beesley +-------------------+ 2 KB boundary 231840d553cfSPaul Beesley | <Padding> | 231940d553cfSPaul Beesley +-------------------+ 232040d553cfSPaul Beesley | Read-only data | 232140d553cfSPaul Beesley +-------------------+ 232240d553cfSPaul Beesley | Code | 232340d553cfSPaul Beesley +-------------------+ BLx_BASE 232440d553cfSPaul Beesley 2325e1c5026aSPaul Beesley.. note:: 2326e1c5026aSPaul Beesley The 2KB alignment for the exception vectors is an architectural 232740d553cfSPaul Beesley requirement. 232840d553cfSPaul Beesley 232940d553cfSPaul BeesleyThe read-write data start on a new memory page so that they can be mapped with 233040d553cfSPaul Beesleyread-write permissions, whereas the code and read-only data below are configured 233140d553cfSPaul Beesleyas read-only. 233240d553cfSPaul Beesley 233340d553cfSPaul BeesleyHowever, the read-only data are not aligned on a page boundary. They are 233440d553cfSPaul Beesleycontiguous to the code. Therefore, the end of the code section and the beginning 233540d553cfSPaul Beesleyof the read-only data one might share a memory page. This forces both to be 233640d553cfSPaul Beesleymapped with the same memory attributes. As the code needs to be executable, this 233740d553cfSPaul Beesleymeans that the read-only data stored on the same memory page as the code are 233840d553cfSPaul Beesleyexecutable as well. This could potentially be exploited as part of a security 233940d553cfSPaul Beesleyattack. 234040d553cfSPaul Beesley 234140d553cfSPaul BeesleyTF provides the build flag ``SEPARATE_CODE_AND_RODATA`` to isolate the code and 234240d553cfSPaul Beesleyread-only data on separate memory pages. This in turn allows independent control 234340d553cfSPaul Beesleyof the access permissions for the code and read-only data. In this case, 234440d553cfSPaul Beesleyplatform code gets a finer-grained view of the image layout and can 234540d553cfSPaul Beesleyappropriately map the code region as executable and the read-only data as 234640d553cfSPaul Beesleyexecute-never. 234740d553cfSPaul Beesley 234840d553cfSPaul BeesleyThis has an impact on memory footprint, as padding bytes need to be introduced 234940d553cfSPaul Beesleybetween the code and read-only data to ensure the segregation of the two. To 235040d553cfSPaul Beesleylimit the memory cost, this flag also changes the memory layout such that the 235140d553cfSPaul Beesleycode and exception vectors are now contiguous, like so: 235240d553cfSPaul Beesley 235340d553cfSPaul Beesley:: 235440d553cfSPaul Beesley 235540d553cfSPaul Beesley | ... | 235640d553cfSPaul Beesley +-------------------+ 235740d553cfSPaul Beesley | Read-write data | 235840d553cfSPaul Beesley +-------------------+ Page boundary 235940d553cfSPaul Beesley | <Padding> | 236040d553cfSPaul Beesley +-------------------+ 236140d553cfSPaul Beesley | Read-only data | 236240d553cfSPaul Beesley +-------------------+ Page boundary 236340d553cfSPaul Beesley | <Padding> | 236440d553cfSPaul Beesley +-------------------+ 236540d553cfSPaul Beesley | Exception vectors | 236640d553cfSPaul Beesley +-------------------+ 2 KB boundary 236740d553cfSPaul Beesley | <Padding> | 236840d553cfSPaul Beesley +-------------------+ 236940d553cfSPaul Beesley | Code | 237040d553cfSPaul Beesley +-------------------+ BLx_BASE 237140d553cfSPaul Beesley 237240d553cfSPaul BeesleyWith this more condensed memory layout, the separation of read-only data will 237340d553cfSPaul Beesleyadd zero or one page to the memory footprint of each BL image. Each platform 237440d553cfSPaul Beesleyshould consider the trade-off between memory footprint and security. 237540d553cfSPaul Beesley 237640d553cfSPaul BeesleyThis build flag is disabled by default, minimising memory footprint. On Arm 237740d553cfSPaul Beesleyplatforms, it is enabled. 237840d553cfSPaul Beesley 237940d553cfSPaul BeesleyPublish and Subscribe Framework 238040d553cfSPaul Beesley------------------------------- 238140d553cfSPaul Beesley 238240d553cfSPaul BeesleyThe Publish and Subscribe Framework allows EL3 components to define and publish 238340d553cfSPaul Beesleyevents, to which other EL3 components can subscribe. 238440d553cfSPaul Beesley 238540d553cfSPaul BeesleyThe following macros are provided by the framework: 238640d553cfSPaul Beesley 238740d553cfSPaul Beesley- ``REGISTER_PUBSUB_EVENT(event)``: Defines an event, and takes one argument, 238840d553cfSPaul Beesley the event name, which must be a valid C identifier. All calls to 238940d553cfSPaul Beesley ``REGISTER_PUBSUB_EVENT`` macro must be placed in the file 239040d553cfSPaul Beesley ``pubsub_events.h``. 239140d553cfSPaul Beesley 239240d553cfSPaul Beesley- ``PUBLISH_EVENT_ARG(event, arg)``: Publishes a defined event, by iterating 239340d553cfSPaul Beesley subscribed handlers and calling them in turn. The handlers will be passed the 239440d553cfSPaul Beesley parameter ``arg``. The expected use-case is to broadcast an event. 239540d553cfSPaul Beesley 239640d553cfSPaul Beesley- ``PUBLISH_EVENT(event)``: Like ``PUBLISH_EVENT_ARG``, except that the value 239740d553cfSPaul Beesley ``NULL`` is passed to subscribed handlers. 239840d553cfSPaul Beesley 239940d553cfSPaul Beesley- ``SUBSCRIBE_TO_EVENT(event, handler)``: Registers the ``handler`` to 240040d553cfSPaul Beesley subscribe to ``event``. The handler will be executed whenever the ``event`` 240140d553cfSPaul Beesley is published. 240240d553cfSPaul Beesley 240340d553cfSPaul Beesley- ``for_each_subscriber(event, subscriber)``: Iterates through all handlers 240440d553cfSPaul Beesley subscribed for ``event``. ``subscriber`` must be a local variable of type 240540d553cfSPaul Beesley ``pubsub_cb_t *``, and will point to each subscribed handler in turn during 240640d553cfSPaul Beesley iteration. This macro can be used for those patterns that none of the 240740d553cfSPaul Beesley ``PUBLISH_EVENT_*()`` macros cover. 240840d553cfSPaul Beesley 240940d553cfSPaul BeesleyPublishing an event that wasn't defined using ``REGISTER_PUBSUB_EVENT`` will 241040d553cfSPaul Beesleyresult in build error. Subscribing to an undefined event however won't. 241140d553cfSPaul Beesley 241240d553cfSPaul BeesleySubscribed handlers must be of type ``pubsub_cb_t``, with following function 241340d553cfSPaul Beesleysignature: 241440d553cfSPaul Beesley 241529c02529SPaul Beesley.. code:: c 241640d553cfSPaul Beesley 241740d553cfSPaul Beesley typedef void* (*pubsub_cb_t)(const void *arg); 241840d553cfSPaul Beesley 241940d553cfSPaul BeesleyThere may be arbitrary number of handlers registered to the same event. The 242040d553cfSPaul Beesleyorder in which subscribed handlers are notified when that event is published is 242140d553cfSPaul Beesleynot defined. Subscribed handlers may be executed in any order; handlers should 242240d553cfSPaul Beesleynot assume any relative ordering amongst them. 242340d553cfSPaul Beesley 242440d553cfSPaul BeesleyPublishing an event on a PE will result in subscribed handlers executing on that 242540d553cfSPaul BeesleyPE only; it won't cause handlers to execute on a different PE. 242640d553cfSPaul Beesley 242740d553cfSPaul BeesleyNote that publishing an event on a PE blocks until all the subscribed handlers 242840d553cfSPaul Beesleyfinish executing on the PE. 242940d553cfSPaul Beesley 243040d553cfSPaul BeesleyTF-A generic code publishes and subscribes to some events within. Platform 243140d553cfSPaul Beesleyports are discouraged from subscribing to them. These events may be withdrawn, 243240d553cfSPaul Beesleyrenamed, or have their semantics altered in the future. Platforms may however 243340d553cfSPaul Beesleyregister, publish, and subscribe to platform-specific events. 243440d553cfSPaul Beesley 243540d553cfSPaul BeesleyPublish and Subscribe Example 243640d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243740d553cfSPaul Beesley 243840d553cfSPaul BeesleyA publisher that wants to publish event ``foo`` would: 243940d553cfSPaul Beesley 244040d553cfSPaul Beesley- Define the event ``foo`` in the ``pubsub_events.h``. 244140d553cfSPaul Beesley 244229c02529SPaul Beesley .. code:: c 244340d553cfSPaul Beesley 244440d553cfSPaul Beesley REGISTER_PUBSUB_EVENT(foo); 244540d553cfSPaul Beesley 244640d553cfSPaul Beesley- Depending on the nature of event, use one of ``PUBLISH_EVENT_*()`` macros to 244740d553cfSPaul Beesley publish the event at the appropriate path and time of execution. 244840d553cfSPaul Beesley 244940d553cfSPaul BeesleyA subscriber that wants to subscribe to event ``foo`` published above would 245040d553cfSPaul Beesleyimplement: 245140d553cfSPaul Beesley 245240d553cfSPaul Beesley.. code:: c 245340d553cfSPaul Beesley 245440d553cfSPaul Beesley void *foo_handler(const void *arg) 245540d553cfSPaul Beesley { 245640d553cfSPaul Beesley void *result; 245740d553cfSPaul Beesley 245840d553cfSPaul Beesley /* Do handling ... */ 245940d553cfSPaul Beesley 246040d553cfSPaul Beesley return result; 246140d553cfSPaul Beesley } 246240d553cfSPaul Beesley 246340d553cfSPaul Beesley SUBSCRIBE_TO_EVENT(foo, foo_handler); 246440d553cfSPaul Beesley 246540d553cfSPaul Beesley 246640d553cfSPaul BeesleyReclaiming the BL31 initialization code 246740d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 246840d553cfSPaul Beesley 246940d553cfSPaul BeesleyA significant amount of the code used for the initialization of BL31 is never 247040d553cfSPaul Beesleyneeded again after boot time. In order to reduce the runtime memory 247140d553cfSPaul Beesleyfootprint, the memory used for this code can be reclaimed after initialization 247240d553cfSPaul Beesleyhas finished and be used for runtime data. 247340d553cfSPaul Beesley 247440d553cfSPaul BeesleyThe build option ``RECLAIM_INIT_CODE`` can be set to mark this boot time code 247540d553cfSPaul Beesleywith a ``.text.init.*`` attribute which can be filtered and placed suitably 247640d553cfSPaul Beesleywithin the BL image for later reclamation by the platform. The platform can 247740d553cfSPaul Beesleyspecify the filter and the memory region for this init section in BL31 via the 247840d553cfSPaul Beesleyplat.ld.S linker script. For example, on the FVP, this section is placed 247940d553cfSPaul Beesleyoverlapping the secondary CPU stacks so that after the cold boot is done, this 248040d553cfSPaul Beesleymemory can be reclaimed for the stacks. The init memory section is initially 248140d553cfSPaul Beesleymapped with ``RO``, ``EXECUTE`` attributes. After BL31 initialization has 248240d553cfSPaul Beesleycompleted, the FVP changes the attributes of this section to ``RW``, 248340d553cfSPaul Beesley``EXECUTE_NEVER`` allowing it to be used for runtime data. The memory attributes 248440d553cfSPaul Beesleyare changed within the ``bl31_plat_runtime_setup`` platform hook. The init 248540d553cfSPaul Beesleysection section can be reclaimed for any data which is accessed after cold 248640d553cfSPaul Beesleyboot initialization and it is upto the platform to make the decision. 248740d553cfSPaul Beesley 248834760951SPaul Beesley.. _firmware_design_pmf: 248934760951SPaul Beesley 249040d553cfSPaul BeesleyPerformance Measurement Framework 249140d553cfSPaul Beesley--------------------------------- 249240d553cfSPaul Beesley 249340d553cfSPaul BeesleyThe Performance Measurement Framework (PMF) facilitates collection of 249440d553cfSPaul Beesleytimestamps by registered services and provides interfaces to retrieve them 249540d553cfSPaul Beesleyfrom within TF-A. A platform can choose to expose appropriate SMCs to 249640d553cfSPaul Beesleyretrieve these collected timestamps. 249740d553cfSPaul Beesley 249840d553cfSPaul BeesleyBy default, the global physical counter is used for the timestamp 249940d553cfSPaul Beesleyvalue and is read via ``CNTPCT_EL0``. The framework allows to retrieve 250040d553cfSPaul Beesleytimestamps captured by other CPUs. 250140d553cfSPaul Beesley 250240d553cfSPaul BeesleyTimestamp identifier format 250340d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250440d553cfSPaul Beesley 250540d553cfSPaul BeesleyA PMF timestamp is uniquely identified across the system via the 250640d553cfSPaul Beesleytimestamp ID or ``tid``. The ``tid`` is composed as follows: 250740d553cfSPaul Beesley 250840d553cfSPaul Beesley:: 250940d553cfSPaul Beesley 251040d553cfSPaul Beesley Bits 0-7: The local timestamp identifier. 251140d553cfSPaul Beesley Bits 8-9: Reserved. 251240d553cfSPaul Beesley Bits 10-15: The service identifier. 251340d553cfSPaul Beesley Bits 16-31: Reserved. 251440d553cfSPaul Beesley 251540d553cfSPaul Beesley#. The service identifier. Each PMF service is identified by a 251640d553cfSPaul Beesley service name and a service identifier. Both the service name and 251740d553cfSPaul Beesley identifier are unique within the system as a whole. 251840d553cfSPaul Beesley 251940d553cfSPaul Beesley#. The local timestamp identifier. This identifier is unique within a given 252040d553cfSPaul Beesley service. 252140d553cfSPaul Beesley 252240d553cfSPaul BeesleyRegistering a PMF service 252340d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~ 252440d553cfSPaul Beesley 252540d553cfSPaul BeesleyTo register a PMF service, the ``PMF_REGISTER_SERVICE()`` macro from ``pmf.h`` 252640d553cfSPaul Beesleyis used. The arguments required are the service name, the service ID, 252740d553cfSPaul Beesleythe total number of local timestamps to be captured and a set of flags. 252840d553cfSPaul Beesley 252940d553cfSPaul BeesleyThe ``flags`` field can be specified as a bitwise-OR of the following values: 253040d553cfSPaul Beesley 253140d553cfSPaul Beesley:: 253240d553cfSPaul Beesley 253340d553cfSPaul Beesley PMF_STORE_ENABLE: The timestamp is stored in memory for later retrieval. 253440d553cfSPaul Beesley PMF_DUMP_ENABLE: The timestamp is dumped on the serial console. 253540d553cfSPaul Beesley 253640d553cfSPaul BeesleyThe ``PMF_REGISTER_SERVICE()`` reserves memory to store captured 253740d553cfSPaul Beesleytimestamps in a PMF specific linker section at build time. 253840d553cfSPaul BeesleyAdditionally, it defines necessary functions to capture and 253940d553cfSPaul Beesleyretrieve a particular timestamp for the given service at runtime. 254040d553cfSPaul Beesley 254140d553cfSPaul BeesleyThe macro ``PMF_REGISTER_SERVICE()`` only enables capturing PMF timestamps 254240d553cfSPaul Beesleyfrom within TF-A. In order to retrieve timestamps from outside of TF-A, the 254340d553cfSPaul Beesley``PMF_REGISTER_SERVICE_SMC()`` macro must be used instead. This macro 254440d553cfSPaul Beesleyaccepts the same set of arguments as the ``PMF_REGISTER_SERVICE()`` 254540d553cfSPaul Beesleymacro but additionally supports retrieving timestamps using SMCs. 254640d553cfSPaul Beesley 254740d553cfSPaul BeesleyCapturing a timestamp 254840d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~ 254940d553cfSPaul Beesley 255040d553cfSPaul BeesleyPMF timestamps are stored in a per-service timestamp region. On a 255140d553cfSPaul Beesleysystem with multiple CPUs, each timestamp is captured and stored 255240d553cfSPaul Beesleyin a per-CPU cache line aligned memory region. 255340d553cfSPaul Beesley 255440d553cfSPaul BeesleyHaving registered the service, the ``PMF_CAPTURE_TIMESTAMP()`` macro can be 255540d553cfSPaul Beesleyused to capture a timestamp at the location where it is used. The macro 255640d553cfSPaul Beesleytakes the service name, a local timestamp identifier and a flag as arguments. 255740d553cfSPaul Beesley 255840d553cfSPaul BeesleyThe ``flags`` field argument can be zero, or ``PMF_CACHE_MAINT`` which 255940d553cfSPaul Beesleyinstructs PMF to do cache maintenance following the capture. Cache 256040d553cfSPaul Beesleymaintenance is required if any of the service's timestamps are captured 256140d553cfSPaul Beesleywith data cache disabled. 256240d553cfSPaul Beesley 256340d553cfSPaul BeesleyTo capture a timestamp in assembly code, the caller should use 256440d553cfSPaul Beesley``pmf_calc_timestamp_addr`` macro (defined in ``pmf_asm_macros.S``) to 256540d553cfSPaul Beesleycalculate the address of where the timestamp would be stored. The 256640d553cfSPaul Beesleycaller should then read ``CNTPCT_EL0`` register to obtain the timestamp 256740d553cfSPaul Beesleyand store it at the determined address for later retrieval. 256840d553cfSPaul Beesley 256940d553cfSPaul BeesleyRetrieving a timestamp 257040d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~ 257140d553cfSPaul Beesley 257240d553cfSPaul BeesleyFrom within TF-A, timestamps for individual CPUs can be retrieved using either 257340d553cfSPaul Beesley``PMF_GET_TIMESTAMP_BY_MPIDR()`` or ``PMF_GET_TIMESTAMP_BY_INDEX()`` macros. 257440d553cfSPaul BeesleyThese macros accept the CPU's MPIDR value, or its ordinal position 257540d553cfSPaul Beesleyrespectively. 257640d553cfSPaul Beesley 257740d553cfSPaul BeesleyFrom outside TF-A, timestamps for individual CPUs can be retrieved by calling 257840d553cfSPaul Beesleyinto ``pmf_smc_handler()``. 257940d553cfSPaul Beesley 258029c02529SPaul Beesley:: 258140d553cfSPaul Beesley 258240d553cfSPaul Beesley Interface : pmf_smc_handler() 258340d553cfSPaul Beesley Argument : unsigned int smc_fid, u_register_t x1, 258440d553cfSPaul Beesley u_register_t x2, u_register_t x3, 258540d553cfSPaul Beesley u_register_t x4, void *cookie, 258640d553cfSPaul Beesley void *handle, u_register_t flags 258740d553cfSPaul Beesley Return : uintptr_t 258840d553cfSPaul Beesley 258940d553cfSPaul Beesley smc_fid: Holds the SMC identifier which is either `PMF_SMC_GET_TIMESTAMP_32` 259040d553cfSPaul Beesley when the caller of the SMC is running in AArch32 mode 259140d553cfSPaul Beesley or `PMF_SMC_GET_TIMESTAMP_64` when the caller is running in AArch64 mode. 259240d553cfSPaul Beesley x1: Timestamp identifier. 259340d553cfSPaul Beesley x2: The `mpidr` of the CPU for which the timestamp has to be retrieved. 259440d553cfSPaul Beesley This can be the `mpidr` of a different core to the one initiating 259540d553cfSPaul Beesley the SMC. In that case, service specific cache maintenance may be 259640d553cfSPaul Beesley required to ensure the updated copy of the timestamp is returned. 259740d553cfSPaul Beesley x3: A flags value that is either 0 or `PMF_CACHE_MAINT`. If 259840d553cfSPaul Beesley `PMF_CACHE_MAINT` is passed, then the PMF code will perform a 259940d553cfSPaul Beesley cache invalidate before reading the timestamp. This ensures 260040d553cfSPaul Beesley an updated copy is returned. 260140d553cfSPaul Beesley 260240d553cfSPaul BeesleyThe remaining arguments, ``x4``, ``cookie``, ``handle`` and ``flags`` are unused 260340d553cfSPaul Beesleyin this implementation. 260440d553cfSPaul Beesley 260540d553cfSPaul BeesleyPMF code structure 260640d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~ 260740d553cfSPaul Beesley 260840d553cfSPaul Beesley#. ``pmf_main.c`` consists of core functions that implement service registration, 260940d553cfSPaul Beesley initialization, storing, dumping and retrieving timestamps. 261040d553cfSPaul Beesley 261140d553cfSPaul Beesley#. ``pmf_smc.c`` contains the SMC handling for registered PMF services. 261240d553cfSPaul Beesley 261340d553cfSPaul Beesley#. ``pmf.h`` contains the public interface to Performance Measurement Framework. 261440d553cfSPaul Beesley 261540d553cfSPaul Beesley#. ``pmf_asm_macros.S`` consists of macros to facilitate capturing timestamps in 261640d553cfSPaul Beesley assembly code. 261740d553cfSPaul Beesley 261840d553cfSPaul Beesley#. ``pmf_helpers.h`` is an internal header used by ``pmf.h``. 261940d553cfSPaul Beesley 262040d553cfSPaul BeesleyArmv8-A Architecture Extensions 262140d553cfSPaul Beesley------------------------------- 262240d553cfSPaul Beesley 262340d553cfSPaul BeesleyTF-A makes use of Armv8-A Architecture Extensions where applicable. This 262440d553cfSPaul Beesleysection lists the usage of Architecture Extensions, and build flags 262540d553cfSPaul Beesleycontrolling them. 262640d553cfSPaul Beesley 2627be6484cbSManish PandeyBuild options 2628be6484cbSManish Pandey~~~~~~~~~~~~~ 262940d553cfSPaul Beesley 2630be6484cbSManish Pandey``ARM_ARCH_MAJOR`` and ``ARM_ARCH_MINOR`` 2631be6484cbSManish Pandey 2632be6484cbSManish PandeyThese build options serve dual purpose 2633be6484cbSManish Pandey 2634be6484cbSManish Pandey- Determine the architecture extension support in TF-A build: All the mandatory 2635be6484cbSManish Pandey architectural features up to ``ARM_ARCH_MAJOR.ARM_ARCH_MINOR`` are included 2636be6484cbSManish Pandey and unconditionally enabled by TF-A build system. 2637be6484cbSManish Pandey 2638be6484cbSManish Pandey- Passed to compiler via "-march" option to generate binary target : Tell the 2639be6484cbSManish Pandey compiler to emit instructions upto ``ARM_ARCH_MAJOR.ARM_ARCH_MINOR`` 2640be6484cbSManish Pandey 2641be6484cbSManish PandeyThe build system requires that the platform provides a valid numeric value based on 2642be6484cbSManish PandeyCPU architecture extension, otherwise it defaults to base Armv8.0-A architecture. 2643be6484cbSManish PandeySubsequent Arm Architecture versions also support extensions which were introduced 2644be6484cbSManish Pandeyin previous versions. 2645be6484cbSManish Pandey 2646be6484cbSManish Pandey**TO-DO** : Its planned to decouple the two functionalities and introduce a new macro 2647be6484cbSManish Pandeyfor compiler usage. The requirement for this decoupling arises becasue TF-A code 2648be6484cbSManish Pandeyalways provides support for the latest and greatest architecture features but this 2649be6484cbSManish Pandeyis not the case for the target compiler. 265040d553cfSPaul Beesley 265143f35ef5SPaul Beesley.. seealso:: :ref:`Build Options` 265240d553cfSPaul Beesley 265340d553cfSPaul BeesleyFor details on the Architecture Extension and available features, please refer 265440d553cfSPaul Beesleyto the respective Architecture Extension Supplement. 265540d553cfSPaul Beesley 265640d553cfSPaul BeesleyArmv8.1-A 265740d553cfSPaul Beesley~~~~~~~~~ 265840d553cfSPaul Beesley 265940d553cfSPaul BeesleyThis Architecture Extension is targeted when ``ARM_ARCH_MAJOR`` >= 8, or when 266040d553cfSPaul Beesley``ARM_ARCH_MAJOR`` == 8 and ``ARM_ARCH_MINOR`` >= 1. 266140d553cfSPaul Beesley 2662c97cba4eSSoby Mathew- By default, a load-/store-exclusive instruction pair is used to implement 2663c97cba4eSSoby Mathew spinlocks. The ``USE_SPINLOCK_CAS`` build option when set to 1 selects the 2664c97cba4eSSoby Mathew spinlock implementation using the ARMv8.1-LSE Compare and Swap instruction. 2665c97cba4eSSoby Mathew Notice this instruction is only available in AArch64 execution state, so 2666c97cba4eSSoby Mathew the option is only available to AArch64 builds. 266740d553cfSPaul Beesley 266840d553cfSPaul BeesleyArmv8.2-A 266940d553cfSPaul Beesley~~~~~~~~~ 267040d553cfSPaul Beesley 267140d553cfSPaul Beesley- The presence of ARMv8.2-TTCNP is detected at runtime. When it is present, the 267240d553cfSPaul Beesley Common not Private (TTBRn_ELx.CnP) bit is enabled to indicate that multiple 267340d553cfSPaul Beesley Processing Elements in the same Inner Shareable domain use the same 267440d553cfSPaul Beesley translation table entries for a given stage of translation for a particular 267540d553cfSPaul Beesley translation regime. 267640d553cfSPaul Beesley 267740d553cfSPaul BeesleyArmv8.3-A 267840d553cfSPaul Beesley~~~~~~~~~ 267940d553cfSPaul Beesley 268040d553cfSPaul Beesley- Pointer authentication features of Armv8.3-A are unconditionally enabled in 268140d553cfSPaul Beesley the Non-secure world so that lower ELs are allowed to use them without 268240d553cfSPaul Beesley causing a trap to EL3. 268340d553cfSPaul Beesley 268440d553cfSPaul Beesley In order to enable the Secure world to use it, ``CTX_INCLUDE_PAUTH_REGS`` 268540d553cfSPaul Beesley must be set to 1. This will add all pointer authentication system registers 268640d553cfSPaul Beesley to the context that is saved when doing a world switch. 268740d553cfSPaul Beesley 268840d553cfSPaul Beesley The TF-A itself has support for pointer authentication at runtime 26899fc59639SAlexei Fedorov that can be enabled by setting ``BRANCH_PROTECTION`` option to non-zero and 269040d553cfSPaul Beesley ``CTX_INCLUDE_PAUTH_REGS`` to 1. This enables pointer authentication in BL1, 269140d553cfSPaul Beesley BL2, BL31, and the TSP if it is used. 269240d553cfSPaul Beesley 269340d553cfSPaul Beesley Note that Pointer Authentication is enabled for Non-secure world irrespective 269440d553cfSPaul Beesley of the value of these build flags if the CPU supports it. 269540d553cfSPaul Beesley 269640d553cfSPaul Beesley If ``ARM_ARCH_MAJOR == 8`` and ``ARM_ARCH_MINOR >= 3`` the code footprint of 269740d553cfSPaul Beesley enabling PAuth is lower because the compiler will use the optimized 269840d553cfSPaul Beesley PAuth instructions rather than the backwards-compatible ones. 269940d553cfSPaul Beesley 27009fc59639SAlexei FedorovArmv8.5-A 27019fc59639SAlexei Fedorov~~~~~~~~~ 27029fc59639SAlexei Fedorov 27039fc59639SAlexei Fedorov- Branch Target Identification feature is selected by ``BRANCH_PROTECTION`` 2704700e7685SManish Pandey option set to 1. This option defaults to 0. 270588d493fbSJustin Chadwell 270688d493fbSJustin Chadwell- Memory Tagging Extension feature is unconditionally enabled for both worlds 270788d493fbSJustin Chadwell (at EL0 and S-EL0) if it is only supported at EL0. If instead it is 270888d493fbSJustin Chadwell implemented at all ELs, it is unconditionally enabled for only the normal 270988d493fbSJustin Chadwell world. To enable it for the secure world as well, the build option 271088d493fbSJustin Chadwell ``CTX_INCLUDE_MTE_REGS`` is required. If the hardware does not implement 271188d493fbSJustin Chadwell MTE support at all, it is always disabled, no matter what build options 271288d493fbSJustin Chadwell are used. 27139fc59639SAlexei Fedorov 271440d553cfSPaul BeesleyArmv7-A 271540d553cfSPaul Beesley~~~~~~~ 271640d553cfSPaul Beesley 271740d553cfSPaul BeesleyThis Architecture Extension is targeted when ``ARM_ARCH_MAJOR`` == 7. 271840d553cfSPaul Beesley 271940d553cfSPaul BeesleyThere are several Armv7-A extensions available. Obviously the TrustZone 272040d553cfSPaul Beesleyextension is mandatory to support the TF-A bootloader and runtime services. 272140d553cfSPaul Beesley 272240d553cfSPaul BeesleyPlatform implementing an Armv7-A system can to define from its target 272340d553cfSPaul BeesleyCortex-A architecture through ``ARM_CORTEX_A<X> = yes`` in their 272440d553cfSPaul Beesley``platform.mk`` script. For example ``ARM_CORTEX_A15=yes`` for a 272540d553cfSPaul BeesleyCortex-A15 target. 272640d553cfSPaul Beesley 272740d553cfSPaul BeesleyPlatform can also set ``ARM_WITH_NEON=yes`` to enable neon support. 2728be653a69SPaul BeesleyNote that using neon at runtime has constraints on non secure world context. 272940d553cfSPaul BeesleyTF-A does not yet provide VFP context management. 273040d553cfSPaul Beesley 273140d553cfSPaul BeesleyDirective ``ARM_CORTEX_A<x>`` and ``ARM_WITH_NEON`` are used to set 273240d553cfSPaul Beesleythe toolchain target architecture directive. 273340d553cfSPaul Beesley 273440d553cfSPaul BeesleyPlatform may choose to not define straight the toolchain target architecture 273540d553cfSPaul Beesleydirective by defining ``MARCH32_DIRECTIVE``. 273640d553cfSPaul BeesleyI.e: 273740d553cfSPaul Beesley 273829c02529SPaul Beesley.. code:: make 273940d553cfSPaul Beesley 274040d553cfSPaul Beesley MARCH32_DIRECTIVE := -mach=armv7-a 274140d553cfSPaul Beesley 274240d553cfSPaul BeesleyCode Structure 274340d553cfSPaul Beesley-------------- 274440d553cfSPaul Beesley 274540d553cfSPaul BeesleyTF-A code is logically divided between the three boot loader stages mentioned 274640d553cfSPaul Beesleyin the previous sections. The code is also divided into the following 274740d553cfSPaul Beesleycategories (present as directories in the source code): 274840d553cfSPaul Beesley 274940d553cfSPaul Beesley- **Platform specific.** Choice of architecture specific code depends upon 275040d553cfSPaul Beesley the platform. 275140d553cfSPaul Beesley- **Common code.** This is platform and architecture agnostic code. 275240d553cfSPaul Beesley- **Library code.** This code comprises of functionality commonly used by all 275340d553cfSPaul Beesley other code. The PSCI implementation and other EL3 runtime frameworks reside 275440d553cfSPaul Beesley as Library components. 275540d553cfSPaul Beesley- **Stage specific.** Code specific to a boot stage. 275640d553cfSPaul Beesley- **Drivers.** 275740d553cfSPaul Beesley- **Services.** EL3 runtime services (eg: SPD). Specific SPD services 275840d553cfSPaul Beesley reside in the ``services/spd`` directory (e.g. ``services/spd/tspd``). 275940d553cfSPaul Beesley 276040d553cfSPaul BeesleyEach boot loader stage uses code from one or more of the above mentioned 276140d553cfSPaul Beesleycategories. Based upon the above, the code layout looks like this: 276240d553cfSPaul Beesley 276340d553cfSPaul Beesley:: 276440d553cfSPaul Beesley 276540d553cfSPaul Beesley Directory Used by BL1? Used by BL2? Used by BL31? 276640d553cfSPaul Beesley bl1 Yes No No 276740d553cfSPaul Beesley bl2 No Yes No 276840d553cfSPaul Beesley bl31 No No Yes 276940d553cfSPaul Beesley plat Yes Yes Yes 277040d553cfSPaul Beesley drivers Yes No Yes 277140d553cfSPaul Beesley common Yes Yes Yes 277240d553cfSPaul Beesley lib Yes Yes Yes 277340d553cfSPaul Beesley services No No Yes 277440d553cfSPaul Beesley 277540d553cfSPaul BeesleyThe build system provides a non configurable build option IMAGE_BLx for each 277640d553cfSPaul Beesleyboot loader stage (where x = BL stage). e.g. for BL1 , IMAGE_BL1 will be 277740d553cfSPaul Beesleydefined by the build system. This enables TF-A to compile certain code only 277840d553cfSPaul Beesleyfor specific boot loader stages 277940d553cfSPaul Beesley 278040d553cfSPaul BeesleyAll assembler files have the ``.S`` extension. The linker source files for each 278140d553cfSPaul Beesleyboot stage have the extension ``.ld.S``. These are processed by GCC to create the 278240d553cfSPaul Beesleylinker scripts which have the extension ``.ld``. 278340d553cfSPaul Beesley 278440d553cfSPaul BeesleyFDTs provide a description of the hardware platform and are used by the Linux 278540d553cfSPaul Beesleykernel at boot time. These can be found in the ``fdts`` directory. 278640d553cfSPaul Beesley 278734760951SPaul Beesley.. rubric:: References 278840d553cfSPaul Beesley 278934760951SPaul Beesley- `Trusted Board Boot Requirements CLIENT (TBBR-CLIENT) Armv8-A (ARM DEN0006D)`_ 279034760951SPaul Beesley 279134760951SPaul Beesley- `Power State Coordination Interface PDD`_ 279234760951SPaul Beesley 279371ac931fSSandrine Bailleux- `SMC Calling Convention`_ 279434760951SPaul Beesley 279534760951SPaul Beesley- :ref:`Interrupt Management Framework` 279640d553cfSPaul Beesley 279740d553cfSPaul Beesley-------------- 279840d553cfSPaul Beesley 279942d4d3baSArvind Ram Prakash*Copyright (c) 2013-2023, Arm Limited and Contributors. All rights reserved.* 280040d553cfSPaul Beesley 280134760951SPaul Beesley.. _Power State Coordination Interface PDD: http://infocenter.arm.com/help/topic/com.arm.doc.den0022d/Power_State_Coordination_Interface_PDD_v1_1_DEN0022D.pdf 28023ba55a3cSlaurenw-arm.. _SMCCC: https://developer.arm.com/docs/den0028/latest 280340d553cfSPaul Beesley.. _PSCI: http://infocenter.arm.com/help/topic/com.arm.doc.den0022d/Power_State_Coordination_Interface_PDD_v1_1_DEN0022D.pdf 280440d553cfSPaul Beesley.. _Power State Coordination Interface PDD: http://infocenter.arm.com/help/topic/com.arm.doc.den0022d/Power_State_Coordination_Interface_PDD_v1_1_DEN0022D.pdf 280562c9be71SPetre-Ionut Tudor.. _Arm ARM: https://developer.arm.com/docs/ddi0487/latest 28063ba55a3cSlaurenw-arm.. _SMC Calling Convention: https://developer.arm.com/docs/den0028/latest 280740d553cfSPaul Beesley.. _Trusted Board Boot Requirements CLIENT (TBBR-CLIENT) Armv8-A (ARM DEN0006D): https://developer.arm.com/docs/den0006/latest/trusted-board-boot-requirements-client-tbbr-client-armv8-a 28087446c266SZelalem Aweke.. _Arm Confidential Compute Architecture (Arm CCA): https://www.arm.com/why-arm/architecture/security-features/arm-confidential-compute-architecture 280940d553cfSPaul Beesley 2810a2c320a8SPaul Beesley.. |Image 1| image:: ../resources/diagrams/rt-svc-descs-layout.png 2811