xref: /rk3399_ARM-atf/docs/design/firmware-design.rst (revision b4292bc65eafcd36ad72d4301c12461184579bb6)
18aa05055SPaul BeesleyFirmware Design
28aa05055SPaul Beesley===============
340d553cfSPaul Beesley
440d553cfSPaul BeesleyTrusted Firmware-A (TF-A) implements a subset of the Trusted Board Boot
534760951SPaul BeesleyRequirements (TBBR) Platform Design Document (PDD) for Arm reference
634760951SPaul Beesleyplatforms.
734760951SPaul Beesley
834760951SPaul BeesleyThe TBB sequence starts when the platform is powered on and runs up
940d553cfSPaul Beesleyto the stage where it hands-off control to firmware running in the normal
1040d553cfSPaul Beesleyworld in DRAM. This is the cold boot path.
1140d553cfSPaul Beesley
1234760951SPaul BeesleyTF-A also implements the `Power State Coordination Interface PDD`_ as a
1340d553cfSPaul Beesleyruntime service. PSCI is the interface from normal world software to firmware
1440d553cfSPaul Beesleyimplementing power management use-cases (for example, secondary CPU boot,
1540d553cfSPaul Beesleyhotplug and idle). Normal world software can access TF-A runtime services via
1640d553cfSPaul Beesleythe Arm SMC (Secure Monitor Call) instruction. The SMC instruction must be
1734760951SPaul Beesleyused as mandated by the SMC Calling Convention (`SMCCC`_).
1840d553cfSPaul Beesley
1940d553cfSPaul BeesleyTF-A implements a framework for configuring and managing interrupts generated
2040d553cfSPaul Beesleyin either security state. The details of the interrupt management framework
2134760951SPaul Beesleyand its design can be found in :ref:`Interrupt Management Framework`.
2240d553cfSPaul Beesley
2340d553cfSPaul BeesleyTF-A also implements a library for setting up and managing the translation
2434760951SPaul Beesleytables. The details of this library can be found in
2534760951SPaul Beesley:ref:`Translation (XLAT) Tables Library`.
2640d553cfSPaul Beesley
2740d553cfSPaul BeesleyTF-A can be built to support either AArch64 or AArch32 execution state.
2840d553cfSPaul Beesley
2940d553cfSPaul BeesleyCold boot
3040d553cfSPaul Beesley---------
3140d553cfSPaul Beesley
3240d553cfSPaul BeesleyThe cold boot path starts when the platform is physically turned on. If
3340d553cfSPaul Beesley``COLD_BOOT_SINGLE_CPU=0``, one of the CPUs released from reset is chosen as the
3440d553cfSPaul Beesleyprimary CPU, and the remaining CPUs are considered secondary CPUs. The primary
3540d553cfSPaul BeesleyCPU is chosen through platform-specific means. The cold boot path is mainly
3640d553cfSPaul Beesleyexecuted by the primary CPU, other than essential CPU initialization executed by
3740d553cfSPaul Beesleyall CPUs. The secondary CPUs are kept in a safe platform-specific state until
3840d553cfSPaul Beesleythe primary CPU has performed enough initialization to boot them.
3940d553cfSPaul Beesley
4034760951SPaul BeesleyRefer to the :ref:`CPU Reset` for more information on the effect of the
4140d553cfSPaul Beesley``COLD_BOOT_SINGLE_CPU`` platform build option.
4240d553cfSPaul Beesley
4340d553cfSPaul BeesleyThe cold boot path in this implementation of TF-A depends on the execution
4440d553cfSPaul Beesleystate. For AArch64, it is divided into five steps (in order of execution):
4540d553cfSPaul Beesley
4640d553cfSPaul Beesley-  Boot Loader stage 1 (BL1) *AP Trusted ROM*
4740d553cfSPaul Beesley-  Boot Loader stage 2 (BL2) *Trusted Boot Firmware*
4840d553cfSPaul Beesley-  Boot Loader stage 3-1 (BL31) *EL3 Runtime Software*
4940d553cfSPaul Beesley-  Boot Loader stage 3-2 (BL32) *Secure-EL1 Payload* (optional)
5040d553cfSPaul Beesley-  Boot Loader stage 3-3 (BL33) *Non-trusted Firmware*
5140d553cfSPaul Beesley
5240d553cfSPaul BeesleyFor AArch32, it is divided into four steps (in order of execution):
5340d553cfSPaul Beesley
5440d553cfSPaul Beesley-  Boot Loader stage 1 (BL1) *AP Trusted ROM*
5540d553cfSPaul Beesley-  Boot Loader stage 2 (BL2) *Trusted Boot Firmware*
5640d553cfSPaul Beesley-  Boot Loader stage 3-2 (BL32) *EL3 Runtime Software*
5740d553cfSPaul Beesley-  Boot Loader stage 3-3 (BL33) *Non-trusted Firmware*
5840d553cfSPaul Beesley
5940d553cfSPaul BeesleyArm development platforms (Fixed Virtual Platforms (FVPs) and Juno) implement a
6040d553cfSPaul Beesleycombination of the following types of memory regions. Each bootloader stage uses
6140d553cfSPaul Beesleyone or more of these memory regions.
6240d553cfSPaul Beesley
6340d553cfSPaul Beesley-  Regions accessible from both non-secure and secure states. For example,
6440d553cfSPaul Beesley   non-trusted SRAM, ROM and DRAM.
6540d553cfSPaul Beesley-  Regions accessible from only the secure state. For example, trusted SRAM and
6640d553cfSPaul Beesley   ROM. The FVPs also implement the trusted DRAM which is statically
6740d553cfSPaul Beesley   configured. Additionally, the Base FVPs and Juno development platform
6840d553cfSPaul Beesley   configure the TrustZone Controller (TZC) to create a region in the DRAM
6940d553cfSPaul Beesley   which is accessible only from the secure state.
7040d553cfSPaul Beesley
7140d553cfSPaul BeesleyThe sections below provide the following details:
7240d553cfSPaul Beesley
7340d553cfSPaul Beesley-  dynamic configuration of Boot Loader stages
7440d553cfSPaul Beesley-  initialization and execution of the first three stages during cold boot
7540d553cfSPaul Beesley-  specification of the EL3 Runtime Software (BL31 for AArch64 and BL32 for
7640d553cfSPaul Beesley   AArch32) entrypoint requirements for use by alternative Trusted Boot
7740d553cfSPaul Beesley   Firmware in place of the provided BL1 and BL2
7840d553cfSPaul Beesley
7940d553cfSPaul BeesleyDynamic Configuration during cold boot
8040d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
8140d553cfSPaul Beesley
8240d553cfSPaul BeesleyEach of the Boot Loader stages may be dynamically configured if required by the
8340d553cfSPaul Beesleyplatform. The Boot Loader stage may optionally specify a firmware
8440d553cfSPaul Beesleyconfiguration file and/or hardware configuration file as listed below:
8540d553cfSPaul Beesley
8640d553cfSPaul Beesley-  HW_CONFIG - The hardware configuration file. Can be shared by all Boot Loader
8740d553cfSPaul Beesley   stages and also by the Normal World Rich OS.
8840d553cfSPaul Beesley-  TB_FW_CONFIG - Trusted Boot Firmware configuration file. Shared between BL1
8940d553cfSPaul Beesley   and BL2.
9040d553cfSPaul Beesley-  SOC_FW_CONFIG - SoC Firmware configuration file. Used by BL31.
9140d553cfSPaul Beesley-  TOS_FW_CONFIG - Trusted OS Firmware configuration file. Used by Trusted OS
9240d553cfSPaul Beesley   (BL32).
9340d553cfSPaul Beesley-  NT_FW_CONFIG - Non Trusted Firmware configuration file. Used by Non-trusted
9440d553cfSPaul Beesley   firmware (BL33).
9540d553cfSPaul Beesley
9640d553cfSPaul BeesleyThe Arm development platforms use the Flattened Device Tree format for the
9740d553cfSPaul Beesleydynamic configuration files.
9840d553cfSPaul Beesley
9940d553cfSPaul BeesleyEach Boot Loader stage can pass up to 4 arguments via registers to the next
10040d553cfSPaul Beesleystage.  BL2 passes the list of the next images to execute to the *EL3 Runtime
10140d553cfSPaul BeesleySoftware* (BL31 for AArch64 and BL32 for AArch32) via `arg0`. All the other
10240d553cfSPaul Beesleyarguments are platform defined. The Arm development platforms use the following
10340d553cfSPaul Beesleyconvention:
10440d553cfSPaul Beesley
10540d553cfSPaul Beesley-  BL1 passes the address of a meminfo_t structure to BL2 via ``arg1``. This
10640d553cfSPaul Beesley   structure contains the memory layout available to BL2.
10740d553cfSPaul Beesley-  When dynamic configuration files are present, the firmware configuration for
10840d553cfSPaul Beesley   the next Boot Loader stage is populated in the first available argument and
10940d553cfSPaul Beesley   the generic hardware configuration is passed the next available argument.
11040d553cfSPaul Beesley   For example,
11140d553cfSPaul Beesley
11240d553cfSPaul Beesley   -  If TB_FW_CONFIG is loaded by BL1, then its address is passed in ``arg0``
11340d553cfSPaul Beesley      to BL2.
11440d553cfSPaul Beesley   -  If HW_CONFIG is loaded by BL1, then its address is passed in ``arg2`` to
11540d553cfSPaul Beesley      BL2. Note, ``arg1`` is already used for meminfo_t.
11640d553cfSPaul Beesley   -  If SOC_FW_CONFIG is loaded by BL2, then its address is passed in ``arg1``
11740d553cfSPaul Beesley      to BL31. Note, ``arg0`` is used to pass the list of executable images.
11840d553cfSPaul Beesley   -  Similarly, if HW_CONFIG is loaded by BL1 or BL2, then its address is
11940d553cfSPaul Beesley      passed in ``arg2`` to BL31.
12040d553cfSPaul Beesley   -  For other BL3x images, if the firmware configuration file is loaded by
12140d553cfSPaul Beesley      BL2, then its address is passed in ``arg0`` and if HW_CONFIG is loaded
12240d553cfSPaul Beesley      then its address is passed in ``arg1``.
12340d553cfSPaul Beesley
12440d553cfSPaul BeesleyBL1
12540d553cfSPaul Beesley~~~
12640d553cfSPaul Beesley
12740d553cfSPaul BeesleyThis stage begins execution from the platform's reset vector at EL3. The reset
12840d553cfSPaul Beesleyaddress is platform dependent but it is usually located in a Trusted ROM area.
12940d553cfSPaul BeesleyThe BL1 data section is copied to trusted SRAM at runtime.
13040d553cfSPaul Beesley
13140d553cfSPaul BeesleyOn the Arm development platforms, BL1 code starts execution from the reset
13240d553cfSPaul Beesleyvector defined by the constant ``BL1_RO_BASE``. The BL1 data section is copied
13340d553cfSPaul Beesleyto the top of trusted SRAM as defined by the constant ``BL1_RW_BASE``.
13440d553cfSPaul Beesley
13540d553cfSPaul BeesleyThe functionality implemented by this stage is as follows.
13640d553cfSPaul Beesley
13740d553cfSPaul BeesleyDetermination of boot path
13840d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^
13940d553cfSPaul Beesley
14040d553cfSPaul BeesleyWhenever a CPU is released from reset, BL1 needs to distinguish between a warm
14140d553cfSPaul Beesleyboot and a cold boot. This is done using platform-specific mechanisms (see the
14234760951SPaul Beesley``plat_get_my_entrypoint()`` function in the :ref:`Porting Guide`). In the case
14334760951SPaul Beesleyof a warm boot, a CPU is expected to continue execution from a separate
14440d553cfSPaul Beesleyentrypoint. In the case of a cold boot, the secondary CPUs are placed in a safe
14540d553cfSPaul Beesleyplatform-specific state (see the ``plat_secondary_cold_boot_setup()`` function in
14634760951SPaul Beesleythe :ref:`Porting Guide`) while the primary CPU executes the remaining cold boot
14734760951SPaul Beesleypath as described in the following sections.
14840d553cfSPaul Beesley
14940d553cfSPaul BeesleyThis step only applies when ``PROGRAMMABLE_RESET_ADDRESS=0``. Refer to the
15034760951SPaul Beesley:ref:`CPU Reset` for more information on the effect of the
15140d553cfSPaul Beesley``PROGRAMMABLE_RESET_ADDRESS`` platform build option.
15240d553cfSPaul Beesley
15340d553cfSPaul BeesleyArchitectural initialization
15440d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^
15540d553cfSPaul Beesley
15640d553cfSPaul BeesleyBL1 performs minimal architectural initialization as follows.
15740d553cfSPaul Beesley
15840d553cfSPaul Beesley-  Exception vectors
15940d553cfSPaul Beesley
16040d553cfSPaul Beesley   BL1 sets up simple exception vectors for both synchronous and asynchronous
16140d553cfSPaul Beesley   exceptions. The default behavior upon receiving an exception is to populate
16240d553cfSPaul Beesley   a status code in the general purpose register ``X0/R0`` and call the
16334760951SPaul Beesley   ``plat_report_exception()`` function (see the :ref:`Porting Guide`). The
16434760951SPaul Beesley   status code is one of:
16540d553cfSPaul Beesley
16640d553cfSPaul Beesley   For AArch64:
16740d553cfSPaul Beesley
16840d553cfSPaul Beesley   ::
16940d553cfSPaul Beesley
17040d553cfSPaul Beesley       0x0 : Synchronous exception from Current EL with SP_EL0
17140d553cfSPaul Beesley       0x1 : IRQ exception from Current EL with SP_EL0
17240d553cfSPaul Beesley       0x2 : FIQ exception from Current EL with SP_EL0
17340d553cfSPaul Beesley       0x3 : System Error exception from Current EL with SP_EL0
17440d553cfSPaul Beesley       0x4 : Synchronous exception from Current EL with SP_ELx
17540d553cfSPaul Beesley       0x5 : IRQ exception from Current EL with SP_ELx
17640d553cfSPaul Beesley       0x6 : FIQ exception from Current EL with SP_ELx
17740d553cfSPaul Beesley       0x7 : System Error exception from Current EL with SP_ELx
17840d553cfSPaul Beesley       0x8 : Synchronous exception from Lower EL using aarch64
17940d553cfSPaul Beesley       0x9 : IRQ exception from Lower EL using aarch64
18040d553cfSPaul Beesley       0xa : FIQ exception from Lower EL using aarch64
18140d553cfSPaul Beesley       0xb : System Error exception from Lower EL using aarch64
18240d553cfSPaul Beesley       0xc : Synchronous exception from Lower EL using aarch32
18340d553cfSPaul Beesley       0xd : IRQ exception from Lower EL using aarch32
18440d553cfSPaul Beesley       0xe : FIQ exception from Lower EL using aarch32
18540d553cfSPaul Beesley       0xf : System Error exception from Lower EL using aarch32
18640d553cfSPaul Beesley
18740d553cfSPaul Beesley   For AArch32:
18840d553cfSPaul Beesley
18940d553cfSPaul Beesley   ::
19040d553cfSPaul Beesley
19140d553cfSPaul Beesley       0x10 : User mode
19240d553cfSPaul Beesley       0x11 : FIQ mode
19340d553cfSPaul Beesley       0x12 : IRQ mode
19440d553cfSPaul Beesley       0x13 : SVC mode
19540d553cfSPaul Beesley       0x16 : Monitor mode
19640d553cfSPaul Beesley       0x17 : Abort mode
19740d553cfSPaul Beesley       0x1a : Hypervisor mode
19840d553cfSPaul Beesley       0x1b : Undefined mode
19940d553cfSPaul Beesley       0x1f : System mode
20040d553cfSPaul Beesley
20140d553cfSPaul Beesley   The ``plat_report_exception()`` implementation on the Arm FVP port programs
20240d553cfSPaul Beesley   the Versatile Express System LED register in the following format to
20340d553cfSPaul Beesley   indicate the occurrence of an unexpected exception:
20440d553cfSPaul Beesley
20540d553cfSPaul Beesley   ::
20640d553cfSPaul Beesley
20740d553cfSPaul Beesley       SYS_LED[0]   - Security state (Secure=0/Non-Secure=1)
20840d553cfSPaul Beesley       SYS_LED[2:1] - Exception Level (EL3=0x3, EL2=0x2, EL1=0x1, EL0=0x0)
20940d553cfSPaul Beesley                      For AArch32 it is always 0x0
21040d553cfSPaul Beesley       SYS_LED[7:3] - Exception Class (Sync/Async & origin). This is the value
21140d553cfSPaul Beesley                      of the status code
21240d553cfSPaul Beesley
21340d553cfSPaul Beesley   A write to the LED register reflects in the System LEDs (S6LED0..7) in the
21440d553cfSPaul Beesley   CLCD window of the FVP.
21540d553cfSPaul Beesley
21640d553cfSPaul Beesley   BL1 does not expect to receive any exceptions other than the SMC exception.
21740d553cfSPaul Beesley   For the latter, BL1 installs a simple stub. The stub expects to receive a
21840d553cfSPaul Beesley   limited set of SMC types (determined by their function IDs in the general
21940d553cfSPaul Beesley   purpose register ``X0/R0``):
22040d553cfSPaul Beesley
22140d553cfSPaul Beesley   -  ``BL1_SMC_RUN_IMAGE``: This SMC is raised by BL2 to make BL1 pass control
22240d553cfSPaul Beesley      to EL3 Runtime Software.
22334760951SPaul Beesley   -  All SMCs listed in section "BL1 SMC Interface" in the :ref:`Firmware Update (FWU)`
22440d553cfSPaul Beesley      Design Guide are supported for AArch64 only. These SMCs are currently
22540d553cfSPaul Beesley      not supported when BL1 is built for AArch32.
22640d553cfSPaul Beesley
22740d553cfSPaul Beesley   Any other SMC leads to an assertion failure.
22840d553cfSPaul Beesley
22940d553cfSPaul Beesley-  CPU initialization
23040d553cfSPaul Beesley
23140d553cfSPaul Beesley   BL1 calls the ``reset_handler()`` function which in turn calls the CPU
23240d553cfSPaul Beesley   specific reset handler function (see the section: "CPU specific operations
23340d553cfSPaul Beesley   framework").
23440d553cfSPaul Beesley
23540d553cfSPaul Beesley-  Control register setup (for AArch64)
23640d553cfSPaul Beesley
23740d553cfSPaul Beesley   -  ``SCTLR_EL3``. Instruction cache is enabled by setting the ``SCTLR_EL3.I``
23840d553cfSPaul Beesley      bit. Alignment and stack alignment checking is enabled by setting the
23940d553cfSPaul Beesley      ``SCTLR_EL3.A`` and ``SCTLR_EL3.SA`` bits. Exception endianness is set to
24040d553cfSPaul Beesley      little-endian by clearing the ``SCTLR_EL3.EE`` bit.
24140d553cfSPaul Beesley
24240d553cfSPaul Beesley   -  ``SCR_EL3``. The register width of the next lower exception level is set
24340d553cfSPaul Beesley      to AArch64 by setting the ``SCR.RW`` bit. The ``SCR.EA`` bit is set to trap
24440d553cfSPaul Beesley      both External Aborts and SError Interrupts in EL3. The ``SCR.SIF`` bit is
24540d553cfSPaul Beesley      also set to disable instruction fetches from Non-secure memory when in
24640d553cfSPaul Beesley      secure state.
24740d553cfSPaul Beesley
24840d553cfSPaul Beesley   -  ``CPTR_EL3``. Accesses to the ``CPACR_EL1`` register from EL1 or EL2, or the
24940d553cfSPaul Beesley      ``CPTR_EL2`` register from EL2 are configured to not trap to EL3 by
25040d553cfSPaul Beesley      clearing the ``CPTR_EL3.TCPAC`` bit. Access to the trace functionality is
25140d553cfSPaul Beesley      configured not to trap to EL3 by clearing the ``CPTR_EL3.TTA`` bit.
25240d553cfSPaul Beesley      Instructions that access the registers associated with Floating Point
25340d553cfSPaul Beesley      and Advanced SIMD execution are configured to not trap to EL3 by
25440d553cfSPaul Beesley      clearing the ``CPTR_EL3.TFP`` bit.
25540d553cfSPaul Beesley
25640d553cfSPaul Beesley   -  ``DAIF``. The SError interrupt is enabled by clearing the SError interrupt
25740d553cfSPaul Beesley      mask bit.
25840d553cfSPaul Beesley
25940d553cfSPaul Beesley   -  ``MDCR_EL3``. The trap controls, ``MDCR_EL3.TDOSA``, ``MDCR_EL3.TDA`` and
26040d553cfSPaul Beesley      ``MDCR_EL3.TPM``, are set so that accesses to the registers they control
26140d553cfSPaul Beesley      do not trap to EL3. AArch64 Secure self-hosted debug is disabled by
26240d553cfSPaul Beesley      setting the ``MDCR_EL3.SDD`` bit. Also ``MDCR_EL3.SPD32`` is set to
26340d553cfSPaul Beesley      disable AArch32 Secure self-hosted privileged debug from S-EL1.
26440d553cfSPaul Beesley
26540d553cfSPaul Beesley-  Control register setup (for AArch32)
26640d553cfSPaul Beesley
26740d553cfSPaul Beesley   -  ``SCTLR``. Instruction cache is enabled by setting the ``SCTLR.I`` bit.
26840d553cfSPaul Beesley      Alignment checking is enabled by setting the ``SCTLR.A`` bit.
26940d553cfSPaul Beesley      Exception endianness is set to little-endian by clearing the
27040d553cfSPaul Beesley      ``SCTLR.EE`` bit.
27140d553cfSPaul Beesley
27240d553cfSPaul Beesley   -  ``SCR``. The ``SCR.SIF`` bit is set to disable instruction fetches from
27340d553cfSPaul Beesley      Non-secure memory when in secure state.
27440d553cfSPaul Beesley
27540d553cfSPaul Beesley   -  ``CPACR``. Allow execution of Advanced SIMD instructions at PL0 and PL1,
27640d553cfSPaul Beesley      by clearing the ``CPACR.ASEDIS`` bit. Access to the trace functionality
27740d553cfSPaul Beesley      is configured not to trap to undefined mode by clearing the
27840d553cfSPaul Beesley      ``CPACR.TRCDIS`` bit.
27940d553cfSPaul Beesley
28040d553cfSPaul Beesley   -  ``NSACR``. Enable non-secure access to Advanced SIMD functionality and
28140d553cfSPaul Beesley      system register access to implemented trace registers.
28240d553cfSPaul Beesley
28340d553cfSPaul Beesley   -  ``FPEXC``. Enable access to the Advanced SIMD and floating-point
28440d553cfSPaul Beesley      functionality from all Exception levels.
28540d553cfSPaul Beesley
28640d553cfSPaul Beesley   -  ``CPSR.A``. The Asynchronous data abort interrupt is enabled by clearing
28740d553cfSPaul Beesley      the Asynchronous data abort interrupt mask bit.
28840d553cfSPaul Beesley
28940d553cfSPaul Beesley   -  ``SDCR``. The ``SDCR.SPD`` field is set to disable AArch32 Secure
29040d553cfSPaul Beesley      self-hosted privileged debug.
29140d553cfSPaul Beesley
29240d553cfSPaul BeesleyPlatform initialization
29340d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^
29440d553cfSPaul Beesley
29540d553cfSPaul BeesleyOn Arm platforms, BL1 performs the following platform initializations:
29640d553cfSPaul Beesley
29740d553cfSPaul Beesley-  Enable the Trusted Watchdog.
29840d553cfSPaul Beesley-  Initialize the console.
29940d553cfSPaul Beesley-  Configure the Interconnect to enable hardware coherency.
30040d553cfSPaul Beesley-  Enable the MMU and map the memory it needs to access.
30140d553cfSPaul Beesley-  Configure any required platform storage to load the next bootloader image
30240d553cfSPaul Beesley   (BL2).
30340d553cfSPaul Beesley-  If the BL1 dynamic configuration file, ``TB_FW_CONFIG``, is available, then
30440d553cfSPaul Beesley   load it to the platform defined address and make it available to BL2 via
30540d553cfSPaul Beesley   ``arg0``.
30640d553cfSPaul Beesley-  Configure the system timer and program the `CNTFRQ_EL0` for use by NS-BL1U
30740d553cfSPaul Beesley   and NS-BL2U firmware update images.
30840d553cfSPaul Beesley
30940d553cfSPaul BeesleyFirmware Update detection and execution
31040d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
31140d553cfSPaul Beesley
31240d553cfSPaul BeesleyAfter performing platform setup, BL1 common code calls
31334760951SPaul Beesley``bl1_plat_get_next_image_id()`` to determine if :ref:`Firmware Update (FWU)` is
31434760951SPaul Beesleyrequired or to proceed with the normal boot process. If the platform code
31534760951SPaul Beesleyreturns ``BL2_IMAGE_ID`` then the normal boot sequence is executed as described
31634760951SPaul Beesleyin the next section, else BL1 assumes that :ref:`Firmware Update (FWU)` is
31734760951SPaul Beesleyrequired and execution passes to the first image in the
31834760951SPaul Beesley:ref:`Firmware Update (FWU)` process. In either case, BL1 retrieves a descriptor
31934760951SPaul Beesleyof the next image by calling ``bl1_plat_get_image_desc()``. The image descriptor
32034760951SPaul Beesleycontains an ``entry_point_info_t`` structure, which BL1 uses to initialize the
32134760951SPaul Beesleyexecution state of the next image.
32240d553cfSPaul Beesley
32340d553cfSPaul BeesleyBL2 image load and execution
32440d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^
32540d553cfSPaul Beesley
32640d553cfSPaul BeesleyIn the normal boot flow, BL1 execution continues as follows:
32740d553cfSPaul Beesley
32840d553cfSPaul Beesley#. BL1 prints the following string from the primary CPU to indicate successful
32940d553cfSPaul Beesley   execution of the BL1 stage:
33040d553cfSPaul Beesley
33140d553cfSPaul Beesley   ::
33240d553cfSPaul Beesley
33340d553cfSPaul Beesley       "Booting Trusted Firmware"
33440d553cfSPaul Beesley
33540d553cfSPaul Beesley#. BL1 loads a BL2 raw binary image from platform storage, at a
33640d553cfSPaul Beesley   platform-specific base address. Prior to the load, BL1 invokes
33740d553cfSPaul Beesley   ``bl1_plat_handle_pre_image_load()`` which allows the platform to update or
33840d553cfSPaul Beesley   use the image information. If the BL2 image file is not present or if
33940d553cfSPaul Beesley   there is not enough free trusted SRAM the following error message is
34040d553cfSPaul Beesley   printed:
34140d553cfSPaul Beesley
34240d553cfSPaul Beesley   ::
34340d553cfSPaul Beesley
34440d553cfSPaul Beesley       "Failed to load BL2 firmware."
34540d553cfSPaul Beesley
34640d553cfSPaul Beesley#. BL1 invokes ``bl1_plat_handle_post_image_load()`` which again is intended
34740d553cfSPaul Beesley   for platforms to take further action after image load. This function must
34840d553cfSPaul Beesley   populate the necessary arguments for BL2, which may also include the memory
34940d553cfSPaul Beesley   layout. Further description of the memory layout can be found later
35040d553cfSPaul Beesley   in this document.
35140d553cfSPaul Beesley
35240d553cfSPaul Beesley#. BL1 passes control to the BL2 image at Secure EL1 (for AArch64) or at
35340d553cfSPaul Beesley   Secure SVC mode (for AArch32), starting from its load address.
35440d553cfSPaul Beesley
35540d553cfSPaul BeesleyBL2
35640d553cfSPaul Beesley~~~
35740d553cfSPaul Beesley
35840d553cfSPaul BeesleyBL1 loads and passes control to BL2 at Secure-EL1 (for AArch64) or at Secure
35940d553cfSPaul BeesleySVC mode (for AArch32) . BL2 is linked against and loaded at a platform-specific
36040d553cfSPaul Beesleybase address (more information can be found later in this document).
36140d553cfSPaul BeesleyThe functionality implemented by BL2 is as follows.
36240d553cfSPaul Beesley
36340d553cfSPaul BeesleyArchitectural initialization
36440d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^
36540d553cfSPaul Beesley
36640d553cfSPaul BeesleyFor AArch64, BL2 performs the minimal architectural initialization required
36740d553cfSPaul Beesleyfor subsequent stages of TF-A and normal world software. EL1 and EL0 are given
36840d553cfSPaul Beesleyaccess to Floating Point and Advanced SIMD registers by clearing the
36940d553cfSPaul Beesley``CPACR.FPEN`` bits.
37040d553cfSPaul Beesley
37140d553cfSPaul BeesleyFor AArch32, the minimal architectural initialization required for subsequent
37240d553cfSPaul Beesleystages of TF-A and normal world software is taken care of in BL1 as both BL1
37340d553cfSPaul Beesleyand BL2 execute at PL1.
37440d553cfSPaul Beesley
37540d553cfSPaul BeesleyPlatform initialization
37640d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^
37740d553cfSPaul Beesley
37840d553cfSPaul BeesleyOn Arm platforms, BL2 performs the following platform initializations:
37940d553cfSPaul Beesley
38040d553cfSPaul Beesley-  Initialize the console.
38140d553cfSPaul Beesley-  Configure any required platform storage to allow loading further bootloader
38240d553cfSPaul Beesley   images.
38340d553cfSPaul Beesley-  Enable the MMU and map the memory it needs to access.
38440d553cfSPaul Beesley-  Perform platform security setup to allow access to controlled components.
38540d553cfSPaul Beesley-  Reserve some memory for passing information to the next bootloader image
38640d553cfSPaul Beesley   EL3 Runtime Software and populate it.
38740d553cfSPaul Beesley-  Define the extents of memory available for loading each subsequent
38840d553cfSPaul Beesley   bootloader image.
38940d553cfSPaul Beesley-  If BL1 has passed TB_FW_CONFIG dynamic configuration file in ``arg0``,
39040d553cfSPaul Beesley   then parse it.
39140d553cfSPaul Beesley
39240d553cfSPaul BeesleyImage loading in BL2
39340d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^
39440d553cfSPaul Beesley
39540d553cfSPaul BeesleyBL2 generic code loads the images based on the list of loadable images
39640d553cfSPaul Beesleyprovided by the platform. BL2 passes the list of executable images
39740d553cfSPaul Beesleyprovided by the platform to the next handover BL image.
39840d553cfSPaul Beesley
39940d553cfSPaul BeesleyThe list of loadable images provided by the platform may also contain
40040d553cfSPaul Beesleydynamic configuration files. The files are loaded and can be parsed as
40140d553cfSPaul Beesleyneeded in the ``bl2_plat_handle_post_image_load()`` function. These
40240d553cfSPaul Beesleyconfiguration files can be passed to next Boot Loader stages as arguments
40340d553cfSPaul Beesleyby updating the corresponding entrypoint information in this function.
40440d553cfSPaul Beesley
40540d553cfSPaul BeesleySCP_BL2 (System Control Processor Firmware) image load
40640d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
40740d553cfSPaul Beesley
40840d553cfSPaul BeesleySome systems have a separate System Control Processor (SCP) for power, clock,
40940d553cfSPaul Beesleyreset and system control. BL2 loads the optional SCP_BL2 image from platform
41040d553cfSPaul Beesleystorage into a platform-specific region of secure memory. The subsequent
41140d553cfSPaul Beesleyhandling of SCP_BL2 is platform specific. For example, on the Juno Arm
41240d553cfSPaul Beesleydevelopment platform port the image is transferred into SCP's internal memory
41340d553cfSPaul Beesleyusing the Boot Over MHU (BOM) protocol after being loaded in the trusted SRAM
41440d553cfSPaul Beesleymemory. The SCP executes SCP_BL2 and signals to the Application Processor (AP)
41540d553cfSPaul Beesleyfor BL2 execution to continue.
41640d553cfSPaul Beesley
41740d553cfSPaul BeesleyEL3 Runtime Software image load
41840d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
41940d553cfSPaul Beesley
42040d553cfSPaul BeesleyBL2 loads the EL3 Runtime Software image from platform storage into a platform-
42140d553cfSPaul Beesleyspecific address in trusted SRAM. If there is not enough memory to load the
42240d553cfSPaul Beesleyimage or image is missing it leads to an assertion failure.
42340d553cfSPaul Beesley
42440d553cfSPaul BeesleyAArch64 BL32 (Secure-EL1 Payload) image load
42540d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
42640d553cfSPaul Beesley
42740d553cfSPaul BeesleyBL2 loads the optional BL32 image from platform storage into a platform-
42840d553cfSPaul Beesleyspecific region of secure memory. The image executes in the secure world. BL2
42940d553cfSPaul Beesleyrelies on BL31 to pass control to the BL32 image, if present. Hence, BL2
43040d553cfSPaul Beesleypopulates a platform-specific area of memory with the entrypoint/load-address
43140d553cfSPaul Beesleyof the BL32 image. The value of the Saved Processor Status Register (``SPSR``)
43240d553cfSPaul Beesleyfor entry into BL32 is not determined by BL2, it is initialized by the
43340d553cfSPaul BeesleySecure-EL1 Payload Dispatcher (see later) within BL31, which is responsible for
43440d553cfSPaul Beesleymanaging interaction with BL32. This information is passed to BL31.
43540d553cfSPaul Beesley
43640d553cfSPaul BeesleyBL33 (Non-trusted Firmware) image load
43740d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
43840d553cfSPaul Beesley
43940d553cfSPaul BeesleyBL2 loads the BL33 image (e.g. UEFI or other test or boot software) from
44040d553cfSPaul Beesleyplatform storage into non-secure memory as defined by the platform.
44140d553cfSPaul Beesley
44240d553cfSPaul BeesleyBL2 relies on EL3 Runtime Software to pass control to BL33 once secure state
44340d553cfSPaul Beesleyinitialization is complete. Hence, BL2 populates a platform-specific area of
44440d553cfSPaul Beesleymemory with the entrypoint and Saved Program Status Register (``SPSR``) of the
44540d553cfSPaul Beesleynormal world software image. The entrypoint is the load address of the BL33
44640d553cfSPaul Beesleyimage. The ``SPSR`` is determined as specified in Section 5.13 of the
44734760951SPaul Beesley`Power State Coordination Interface PDD`_. This information is passed to the
44834760951SPaul BeesleyEL3 Runtime Software.
44940d553cfSPaul Beesley
45040d553cfSPaul BeesleyAArch64 BL31 (EL3 Runtime Software) execution
45140d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
45240d553cfSPaul Beesley
45340d553cfSPaul BeesleyBL2 execution continues as follows:
45440d553cfSPaul Beesley
45540d553cfSPaul Beesley#. BL2 passes control back to BL1 by raising an SMC, providing BL1 with the
45640d553cfSPaul Beesley   BL31 entrypoint. The exception is handled by the SMC exception handler
45740d553cfSPaul Beesley   installed by BL1.
45840d553cfSPaul Beesley
45940d553cfSPaul Beesley#. BL1 turns off the MMU and flushes the caches. It clears the
46040d553cfSPaul Beesley   ``SCTLR_EL3.M/I/C`` bits, flushes the data cache to the point of coherency
46140d553cfSPaul Beesley   and invalidates the TLBs.
46240d553cfSPaul Beesley
46340d553cfSPaul Beesley#. BL1 passes control to BL31 at the specified entrypoint at EL3.
46440d553cfSPaul Beesley
46540d553cfSPaul BeesleyRunning BL2 at EL3 execution level
46640d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
46740d553cfSPaul Beesley
46840d553cfSPaul BeesleySome platforms have a non-TF-A Boot ROM that expects the next boot stage
46940d553cfSPaul Beesleyto execute at EL3. On these platforms, TF-A BL1 is a waste of memory
47040d553cfSPaul Beesleyas its only purpose is to ensure TF-A BL2 is entered at S-EL1. To avoid
47140d553cfSPaul Beesleythis waste, a special mode enables BL2 to execute at EL3, which allows
47240d553cfSPaul Beesleya non-TF-A Boot ROM to load and jump directly to BL2. This mode is selected
47340d553cfSPaul Beesleywhen the build flag BL2_AT_EL3 is enabled. The main differences in this
47440d553cfSPaul Beesleymode are:
47540d553cfSPaul Beesley
47640d553cfSPaul Beesley#. BL2 includes the reset code and the mailbox mechanism to differentiate
47740d553cfSPaul Beesley   cold boot and warm boot. It runs at EL3 doing the arch
47840d553cfSPaul Beesley   initialization required for EL3.
47940d553cfSPaul Beesley
48040d553cfSPaul Beesley#. BL2 does not receive the meminfo information from BL1 anymore. This
48140d553cfSPaul Beesley   information can be passed by the Boot ROM or be internal to the
48240d553cfSPaul Beesley   BL2 image.
48340d553cfSPaul Beesley
48440d553cfSPaul Beesley#. Since BL2 executes at EL3, BL2 jumps directly to the next image,
48540d553cfSPaul Beesley   instead of invoking the RUN_IMAGE SMC call.
48640d553cfSPaul Beesley
48740d553cfSPaul Beesley
48840d553cfSPaul BeesleyWe assume 3 different types of BootROM support on the platform:
48940d553cfSPaul Beesley
49040d553cfSPaul Beesley#. The Boot ROM always jumps to the same address, for both cold
49140d553cfSPaul Beesley   and warm boot. In this case, we will need to keep a resident part
49240d553cfSPaul Beesley   of BL2 whose memory cannot be reclaimed by any other image. The
49340d553cfSPaul Beesley   linker script defines the symbols __TEXT_RESIDENT_START__ and
49440d553cfSPaul Beesley   __TEXT_RESIDENT_END__ that allows the platform to configure
49540d553cfSPaul Beesley   correctly the memory map.
49640d553cfSPaul Beesley#. The platform has some mechanism to indicate the jump address to the
49740d553cfSPaul Beesley   Boot ROM. Platform code can then program the jump address with
49840d553cfSPaul Beesley   psci_warmboot_entrypoint during cold boot.
49940d553cfSPaul Beesley#. The platform has some mechanism to program the reset address using
50040d553cfSPaul Beesley   the PROGRAMMABLE_RESET_ADDRESS feature. Platform code can then
50140d553cfSPaul Beesley   program the reset address with psci_warmboot_entrypoint during
50240d553cfSPaul Beesley   cold boot, bypassing the boot ROM for warm boot.
50340d553cfSPaul Beesley
50440d553cfSPaul BeesleyIn the last 2 cases, no part of BL2 needs to remain resident at
50540d553cfSPaul Beesleyruntime. In the first 2 cases, we expect the Boot ROM to be able to
50640d553cfSPaul Beesleydifferentiate between warm and cold boot, to avoid loading BL2 again
50740d553cfSPaul Beesleyduring warm boot.
50840d553cfSPaul Beesley
50940d553cfSPaul BeesleyThis functionality can be tested with FVP loading the image directly
51040d553cfSPaul Beesleyin memory and changing the address where the system jumps at reset.
51140d553cfSPaul BeesleyFor example:
51240d553cfSPaul Beesley
51340d553cfSPaul Beesley	-C cluster0.cpu0.RVBAR=0x4022000
51440d553cfSPaul Beesley	--data cluster0.cpu0=bl2.bin@0x4022000
51540d553cfSPaul Beesley
51640d553cfSPaul BeesleyWith this configuration, FVP is like a platform of the first case,
51740d553cfSPaul Beesleywhere the Boot ROM jumps always to the same address. For simplification,
51840d553cfSPaul BeesleyBL32 is loaded in DRAM in this case, to avoid other images reclaiming
51940d553cfSPaul BeesleyBL2 memory.
52040d553cfSPaul Beesley
52140d553cfSPaul Beesley
52240d553cfSPaul BeesleyAArch64 BL31
52340d553cfSPaul Beesley~~~~~~~~~~~~
52440d553cfSPaul Beesley
52540d553cfSPaul BeesleyThe image for this stage is loaded by BL2 and BL1 passes control to BL31 at
52640d553cfSPaul BeesleyEL3. BL31 executes solely in trusted SRAM. BL31 is linked against and
52740d553cfSPaul Beesleyloaded at a platform-specific base address (more information can be found later
52840d553cfSPaul Beesleyin this document). The functionality implemented by BL31 is as follows.
52940d553cfSPaul Beesley
53040d553cfSPaul BeesleyArchitectural initialization
53140d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^
53240d553cfSPaul Beesley
53340d553cfSPaul BeesleyCurrently, BL31 performs a similar architectural initialization to BL1 as
53440d553cfSPaul Beesleyfar as system register settings are concerned. Since BL1 code resides in ROM,
53540d553cfSPaul Beesleyarchitectural initialization in BL31 allows override of any previous
53640d553cfSPaul Beesleyinitialization done by BL1.
53740d553cfSPaul Beesley
53840d553cfSPaul BeesleyBL31 initializes the per-CPU data framework, which provides a cache of
53940d553cfSPaul Beesleyfrequently accessed per-CPU data optimised for fast, concurrent manipulation
54040d553cfSPaul Beesleyon different CPUs. This buffer includes pointers to per-CPU contexts, crash
54140d553cfSPaul Beesleybuffer, CPU reset and power down operations, PSCI data, platform data and so on.
54240d553cfSPaul Beesley
54340d553cfSPaul BeesleyIt then replaces the exception vectors populated by BL1 with its own. BL31
54440d553cfSPaul Beesleyexception vectors implement more elaborate support for handling SMCs since this
54540d553cfSPaul Beesleyis the only mechanism to access the runtime services implemented by BL31 (PSCI
54640d553cfSPaul Beesleyfor example). BL31 checks each SMC for validity as specified by the
54734760951SPaul Beesley`SMC Calling Convention PDD`_ before passing control to the required SMC
54840d553cfSPaul Beesleyhandler routine.
54940d553cfSPaul Beesley
55040d553cfSPaul BeesleyBL31 programs the ``CNTFRQ_EL0`` register with the clock frequency of the system
55140d553cfSPaul Beesleycounter, which is provided by the platform.
55240d553cfSPaul Beesley
55340d553cfSPaul BeesleyPlatform initialization
55440d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^
55540d553cfSPaul Beesley
55640d553cfSPaul BeesleyBL31 performs detailed platform initialization, which enables normal world
55740d553cfSPaul Beesleysoftware to function correctly.
55840d553cfSPaul Beesley
55940d553cfSPaul BeesleyOn Arm platforms, this consists of the following:
56040d553cfSPaul Beesley
56140d553cfSPaul Beesley-  Initialize the console.
56240d553cfSPaul Beesley-  Configure the Interconnect to enable hardware coherency.
56340d553cfSPaul Beesley-  Enable the MMU and map the memory it needs to access.
56440d553cfSPaul Beesley-  Initialize the generic interrupt controller.
56540d553cfSPaul Beesley-  Initialize the power controller device.
56640d553cfSPaul Beesley-  Detect the system topology.
56740d553cfSPaul Beesley
56840d553cfSPaul BeesleyRuntime services initialization
56940d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
57040d553cfSPaul Beesley
57140d553cfSPaul BeesleyBL31 is responsible for initializing the runtime services. One of them is PSCI.
57240d553cfSPaul Beesley
57340d553cfSPaul BeesleyAs part of the PSCI initializations, BL31 detects the system topology. It also
57440d553cfSPaul Beesleyinitializes the data structures that implement the state machine used to track
57540d553cfSPaul Beesleythe state of power domain nodes. The state can be one of ``OFF``, ``RUN`` or
57640d553cfSPaul Beesley``RETENTION``. All secondary CPUs are initially in the ``OFF`` state. The cluster
57740d553cfSPaul Beesleythat the primary CPU belongs to is ``ON``; any other cluster is ``OFF``. It also
57840d553cfSPaul Beesleyinitializes the locks that protect them. BL31 accesses the state of a CPU or
57940d553cfSPaul Beesleycluster immediately after reset and before the data cache is enabled in the
58040d553cfSPaul Beesleywarm boot path. It is not currently possible to use 'exclusive' based spinlocks,
58140d553cfSPaul Beesleytherefore BL31 uses locks based on Lamport's Bakery algorithm instead.
58240d553cfSPaul Beesley
58340d553cfSPaul BeesleyThe runtime service framework and its initialization is described in more
58440d553cfSPaul Beesleydetail in the "EL3 runtime services framework" section below.
58540d553cfSPaul Beesley
58640d553cfSPaul BeesleyDetails about the status of the PSCI implementation are provided in the
58740d553cfSPaul Beesley"Power State Coordination Interface" section below.
58840d553cfSPaul Beesley
58940d553cfSPaul BeesleyAArch64 BL32 (Secure-EL1 Payload) image initialization
59040d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
59140d553cfSPaul Beesley
59240d553cfSPaul BeesleyIf a BL32 image is present then there must be a matching Secure-EL1 Payload
59340d553cfSPaul BeesleyDispatcher (SPD) service (see later for details). During initialization
59440d553cfSPaul Beesleythat service must register a function to carry out initialization of BL32
59540d553cfSPaul Beesleyonce the runtime services are fully initialized. BL31 invokes such a
59640d553cfSPaul Beesleyregistered function to initialize BL32 before running BL33. This initialization
59740d553cfSPaul Beesleyis not necessary for AArch32 SPs.
59840d553cfSPaul Beesley
59940d553cfSPaul BeesleyDetails on BL32 initialization and the SPD's role are described in the
60043f35ef5SPaul Beesley:ref:`firmware_design_sel1_spd` section below.
60140d553cfSPaul Beesley
60240d553cfSPaul BeesleyBL33 (Non-trusted Firmware) execution
60340d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
60440d553cfSPaul Beesley
60540d553cfSPaul BeesleyEL3 Runtime Software initializes the EL2 or EL1 processor context for normal-
60640d553cfSPaul Beesleyworld cold boot, ensuring that no secure state information finds its way into
60740d553cfSPaul Beesleythe non-secure execution state. EL3 Runtime Software uses the entrypoint
60840d553cfSPaul Beesleyinformation provided by BL2 to jump to the Non-trusted firmware image (BL33)
60940d553cfSPaul Beesleyat the highest available Exception Level (EL2 if available, otherwise EL1).
61040d553cfSPaul Beesley
61140d553cfSPaul BeesleyUsing alternative Trusted Boot Firmware in place of BL1 & BL2 (AArch64 only)
61240d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
61340d553cfSPaul Beesley
61440d553cfSPaul BeesleySome platforms have existing implementations of Trusted Boot Firmware that
61540d553cfSPaul Beesleywould like to use TF-A BL31 for the EL3 Runtime Software. To enable this
61640d553cfSPaul Beesleyfirmware architecture it is important to provide a fully documented and stable
61740d553cfSPaul Beesleyinterface between the Trusted Boot Firmware and BL31.
61840d553cfSPaul Beesley
61940d553cfSPaul BeesleyFuture changes to the BL31 interface will be done in a backwards compatible
62040d553cfSPaul Beesleyway, and this enables these firmware components to be independently enhanced/
62140d553cfSPaul Beesleyupdated to develop and exploit new functionality.
62240d553cfSPaul Beesley
62340d553cfSPaul BeesleyRequired CPU state when calling ``bl31_entrypoint()`` during cold boot
62440d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
62540d553cfSPaul Beesley
62640d553cfSPaul BeesleyThis function must only be called by the primary CPU.
62740d553cfSPaul Beesley
62840d553cfSPaul BeesleyOn entry to this function the calling primary CPU must be executing in AArch64
62940d553cfSPaul BeesleyEL3, little-endian data access, and all interrupt sources masked:
63040d553cfSPaul Beesley
63140d553cfSPaul Beesley::
63240d553cfSPaul Beesley
63340d553cfSPaul Beesley    PSTATE.EL = 3
63440d553cfSPaul Beesley    PSTATE.RW = 1
63540d553cfSPaul Beesley    PSTATE.DAIF = 0xf
63640d553cfSPaul Beesley    SCTLR_EL3.EE = 0
63740d553cfSPaul Beesley
63840d553cfSPaul BeesleyX0 and X1 can be used to pass information from the Trusted Boot Firmware to the
63940d553cfSPaul Beesleyplatform code in BL31:
64040d553cfSPaul Beesley
64140d553cfSPaul Beesley::
64240d553cfSPaul Beesley
64340d553cfSPaul Beesley    X0 : Reserved for common TF-A information
64440d553cfSPaul Beesley    X1 : Platform specific information
64540d553cfSPaul Beesley
64640d553cfSPaul BeesleyBL31 zero-init sections (e.g. ``.bss``) should not contain valid data on entry,
64740d553cfSPaul Beesleythese will be zero filled prior to invoking platform setup code.
64840d553cfSPaul Beesley
64940d553cfSPaul BeesleyUse of the X0 and X1 parameters
65040d553cfSPaul Beesley'''''''''''''''''''''''''''''''
65140d553cfSPaul Beesley
65240d553cfSPaul BeesleyThe parameters are platform specific and passed from ``bl31_entrypoint()`` to
65340d553cfSPaul Beesley``bl31_early_platform_setup()``. The value of these parameters is never directly
65440d553cfSPaul Beesleyused by the common BL31 code.
65540d553cfSPaul Beesley
65640d553cfSPaul BeesleyThe convention is that ``X0`` conveys information regarding the BL31, BL32 and
65740d553cfSPaul BeesleyBL33 images from the Trusted Boot firmware and ``X1`` can be used for other
65840d553cfSPaul Beesleyplatform specific purpose. This convention allows platforms which use TF-A's
65940d553cfSPaul BeesleyBL1 and BL2 images to transfer additional platform specific information from
66040d553cfSPaul BeesleySecure Boot without conflicting with future evolution of TF-A using ``X0`` to
66140d553cfSPaul Beesleypass a ``bl31_params`` structure.
66240d553cfSPaul Beesley
66340d553cfSPaul BeesleyBL31 common and SPD initialization code depends on image and entrypoint
66440d553cfSPaul Beesleyinformation about BL33 and BL32, which is provided via BL31 platform APIs.
66540d553cfSPaul BeesleyThis information is required until the start of execution of BL33. This
66640d553cfSPaul Beesleyinformation can be provided in a platform defined manner, e.g. compiled into
66740d553cfSPaul Beesleythe platform code in BL31, or provided in a platform defined memory location
66840d553cfSPaul Beesleyby the Trusted Boot firmware, or passed from the Trusted Boot Firmware via the
66940d553cfSPaul BeesleyCold boot Initialization parameters. This data may need to be cleaned out of
67040d553cfSPaul Beesleythe CPU caches if it is provided by an earlier boot stage and then accessed by
67140d553cfSPaul BeesleyBL31 platform code before the caches are enabled.
67240d553cfSPaul Beesley
67340d553cfSPaul BeesleyTF-A's BL2 implementation passes a ``bl31_params`` structure in
67440d553cfSPaul Beesley``X0`` and the Arm development platforms interpret this in the BL31 platform
67540d553cfSPaul Beesleycode.
67640d553cfSPaul Beesley
67740d553cfSPaul BeesleyMMU, Data caches & Coherency
67840d553cfSPaul Beesley''''''''''''''''''''''''''''
67940d553cfSPaul Beesley
68040d553cfSPaul BeesleyBL31 does not depend on the enabled state of the MMU, data caches or
68140d553cfSPaul Beesleyinterconnect coherency on entry to ``bl31_entrypoint()``. If these are disabled
68240d553cfSPaul Beesleyon entry, these should be enabled during ``bl31_plat_arch_setup()``.
68340d553cfSPaul Beesley
68440d553cfSPaul BeesleyData structures used in the BL31 cold boot interface
68540d553cfSPaul Beesley''''''''''''''''''''''''''''''''''''''''''''''''''''
68640d553cfSPaul Beesley
68740d553cfSPaul BeesleyThese structures are designed to support compatibility and independent
68840d553cfSPaul Beesleyevolution of the structures and the firmware images. For example, a version of
68940d553cfSPaul BeesleyBL31 that can interpret the BL3x image information from different versions of
69040d553cfSPaul BeesleyBL2, a platform that uses an extended entry_point_info structure to convey
69140d553cfSPaul Beesleyadditional register information to BL31, or a ELF image loader that can convey
69240d553cfSPaul Beesleymore details about the firmware images.
69340d553cfSPaul Beesley
69440d553cfSPaul BeesleyTo support these scenarios the structures are versioned and sized, which enables
69540d553cfSPaul BeesleyBL31 to detect which information is present and respond appropriately. The
69640d553cfSPaul Beesley``param_header`` is defined to capture this information:
69740d553cfSPaul Beesley
69840d553cfSPaul Beesley.. code:: c
69940d553cfSPaul Beesley
70040d553cfSPaul Beesley    typedef struct param_header {
70140d553cfSPaul Beesley        uint8_t type;       /* type of the structure */
70240d553cfSPaul Beesley        uint8_t version;    /* version of this structure */
70340d553cfSPaul Beesley        uint16_t size;      /* size of this structure in bytes */
70440d553cfSPaul Beesley        uint32_t attr;      /* attributes: unused bits SBZ */
70540d553cfSPaul Beesley    } param_header_t;
70640d553cfSPaul Beesley
70740d553cfSPaul BeesleyThe structures using this format are ``entry_point_info``, ``image_info`` and
70840d553cfSPaul Beesley``bl31_params``. The code that allocates and populates these structures must set
70940d553cfSPaul Beesleythe header fields appropriately, and the ``SET_PARAM_HEAD()`` a macro is defined
71040d553cfSPaul Beesleyto simplify this action.
71140d553cfSPaul Beesley
71240d553cfSPaul BeesleyRequired CPU state for BL31 Warm boot initialization
71340d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
71440d553cfSPaul Beesley
71540d553cfSPaul BeesleyWhen requesting a CPU power-on, or suspending a running CPU, TF-A provides
71640d553cfSPaul Beesleythe platform power management code with a Warm boot initialization
71740d553cfSPaul Beesleyentry-point, to be invoked by the CPU immediately after the reset handler.
71840d553cfSPaul BeesleyOn entry to the Warm boot initialization function the calling CPU must be in
71940d553cfSPaul BeesleyAArch64 EL3, little-endian data access and all interrupt sources masked:
72040d553cfSPaul Beesley
72140d553cfSPaul Beesley::
72240d553cfSPaul Beesley
72340d553cfSPaul Beesley    PSTATE.EL = 3
72440d553cfSPaul Beesley    PSTATE.RW = 1
72540d553cfSPaul Beesley    PSTATE.DAIF = 0xf
72640d553cfSPaul Beesley    SCTLR_EL3.EE = 0
72740d553cfSPaul Beesley
72840d553cfSPaul BeesleyThe PSCI implementation will initialize the processor state and ensure that the
72940d553cfSPaul Beesleyplatform power management code is then invoked as required to initialize all
73040d553cfSPaul Beesleynecessary system, cluster and CPU resources.
73140d553cfSPaul Beesley
73240d553cfSPaul BeesleyAArch32 EL3 Runtime Software entrypoint interface
73340d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
73440d553cfSPaul Beesley
73540d553cfSPaul BeesleyTo enable this firmware architecture it is important to provide a fully
73640d553cfSPaul Beesleydocumented and stable interface between the Trusted Boot Firmware and the
73740d553cfSPaul BeesleyAArch32 EL3 Runtime Software.
73840d553cfSPaul Beesley
73940d553cfSPaul BeesleyFuture changes to the entrypoint interface will be done in a backwards
74040d553cfSPaul Beesleycompatible way, and this enables these firmware components to be independently
74140d553cfSPaul Beesleyenhanced/updated to develop and exploit new functionality.
74240d553cfSPaul Beesley
74340d553cfSPaul BeesleyRequired CPU state when entering during cold boot
74440d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
74540d553cfSPaul Beesley
74640d553cfSPaul BeesleyThis function must only be called by the primary CPU.
74740d553cfSPaul Beesley
74840d553cfSPaul BeesleyOn entry to this function the calling primary CPU must be executing in AArch32
74940d553cfSPaul BeesleyEL3, little-endian data access, and all interrupt sources masked:
75040d553cfSPaul Beesley
75140d553cfSPaul Beesley::
75240d553cfSPaul Beesley
75340d553cfSPaul Beesley    PSTATE.AIF = 0x7
75440d553cfSPaul Beesley    SCTLR.EE = 0
75540d553cfSPaul Beesley
75640d553cfSPaul BeesleyR0 and R1 are used to pass information from the Trusted Boot Firmware to the
75740d553cfSPaul Beesleyplatform code in AArch32 EL3 Runtime Software:
75840d553cfSPaul Beesley
75940d553cfSPaul Beesley::
76040d553cfSPaul Beesley
76140d553cfSPaul Beesley    R0 : Reserved for common TF-A information
76240d553cfSPaul Beesley    R1 : Platform specific information
76340d553cfSPaul Beesley
76440d553cfSPaul BeesleyUse of the R0 and R1 parameters
76540d553cfSPaul Beesley'''''''''''''''''''''''''''''''
76640d553cfSPaul Beesley
76740d553cfSPaul BeesleyThe parameters are platform specific and the convention is that ``R0`` conveys
76840d553cfSPaul Beesleyinformation regarding the BL3x images from the Trusted Boot firmware and ``R1``
76940d553cfSPaul Beesleycan be used for other platform specific purpose. This convention allows
77040d553cfSPaul Beesleyplatforms which use TF-A's BL1 and BL2 images to transfer additional platform
77140d553cfSPaul Beesleyspecific information from Secure Boot without conflicting with future
77240d553cfSPaul Beesleyevolution of TF-A using ``R0`` to pass a ``bl_params`` structure.
77340d553cfSPaul Beesley
77440d553cfSPaul BeesleyThe AArch32 EL3 Runtime Software is responsible for entry into BL33. This
77540d553cfSPaul Beesleyinformation can be obtained in a platform defined manner, e.g. compiled into
77640d553cfSPaul Beesleythe AArch32 EL3 Runtime Software, or provided in a platform defined memory
77740d553cfSPaul Beesleylocation by the Trusted Boot firmware, or passed from the Trusted Boot Firmware
77840d553cfSPaul Beesleyvia the Cold boot Initialization parameters. This data may need to be cleaned
77940d553cfSPaul Beesleyout of the CPU caches if it is provided by an earlier boot stage and then
78040d553cfSPaul Beesleyaccessed by AArch32 EL3 Runtime Software before the caches are enabled.
78140d553cfSPaul Beesley
78240d553cfSPaul BeesleyWhen using AArch32 EL3 Runtime Software, the Arm development platforms pass a
78340d553cfSPaul Beesley``bl_params`` structure in ``R0`` from BL2 to be interpreted by AArch32 EL3 Runtime
78440d553cfSPaul BeesleySoftware platform code.
78540d553cfSPaul Beesley
78640d553cfSPaul BeesleyMMU, Data caches & Coherency
78740d553cfSPaul Beesley''''''''''''''''''''''''''''
78840d553cfSPaul Beesley
78940d553cfSPaul BeesleyAArch32 EL3 Runtime Software must not depend on the enabled state of the MMU,
79040d553cfSPaul Beesleydata caches or interconnect coherency in its entrypoint. They must be explicitly
79140d553cfSPaul Beesleyenabled if required.
79240d553cfSPaul Beesley
79340d553cfSPaul BeesleyData structures used in cold boot interface
79440d553cfSPaul Beesley'''''''''''''''''''''''''''''''''''''''''''
79540d553cfSPaul Beesley
79640d553cfSPaul BeesleyThe AArch32 EL3 Runtime Software cold boot interface uses ``bl_params`` instead
79740d553cfSPaul Beesleyof ``bl31_params``. The ``bl_params`` structure is based on the convention
79840d553cfSPaul Beesleydescribed in AArch64 BL31 cold boot interface section.
79940d553cfSPaul Beesley
80040d553cfSPaul BeesleyRequired CPU state for warm boot initialization
80140d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
80240d553cfSPaul Beesley
80340d553cfSPaul BeesleyWhen requesting a CPU power-on, or suspending a running CPU, AArch32 EL3
80440d553cfSPaul BeesleyRuntime Software must ensure execution of a warm boot initialization entrypoint.
80540d553cfSPaul BeesleyIf TF-A BL1 is used and the PROGRAMMABLE_RESET_ADDRESS build flag is false,
80640d553cfSPaul Beesleythen AArch32 EL3 Runtime Software must ensure that BL1 branches to the warm
80740d553cfSPaul Beesleyboot entrypoint by arranging for the BL1 platform function,
80840d553cfSPaul Beesleyplat_get_my_entrypoint(), to return a non-zero value.
80940d553cfSPaul Beesley
81040d553cfSPaul BeesleyIn this case, the warm boot entrypoint must be in AArch32 EL3, little-endian
81140d553cfSPaul Beesleydata access and all interrupt sources masked:
81240d553cfSPaul Beesley
81340d553cfSPaul Beesley::
81440d553cfSPaul Beesley
81540d553cfSPaul Beesley    PSTATE.AIF = 0x7
81640d553cfSPaul Beesley    SCTLR.EE = 0
81740d553cfSPaul Beesley
81840d553cfSPaul BeesleyThe warm boot entrypoint may be implemented by using TF-A
81940d553cfSPaul Beesley``psci_warmboot_entrypoint()`` function. In that case, the platform must fulfil
82034760951SPaul Beesleythe pre-requisites mentioned in the
82134760951SPaul Beesley:ref:`PSCI Library Integration guide for Armv8-A AArch32 systems`.
82240d553cfSPaul Beesley
82340d553cfSPaul BeesleyEL3 runtime services framework
82440d553cfSPaul Beesley------------------------------
82540d553cfSPaul Beesley
82640d553cfSPaul BeesleySoftware executing in the non-secure state and in the secure state at exception
82740d553cfSPaul Beesleylevels lower than EL3 will request runtime services using the Secure Monitor
82840d553cfSPaul BeesleyCall (SMC) instruction. These requests will follow the convention described in
82940d553cfSPaul Beesleythe SMC Calling Convention PDD (`SMCCC`_). The `SMCCC`_ assigns function
83040d553cfSPaul Beesleyidentifiers to each SMC request and describes how arguments are passed and
83140d553cfSPaul Beesleyreturned.
83240d553cfSPaul Beesley
83340d553cfSPaul BeesleyThe EL3 runtime services framework enables the development of services by
83440d553cfSPaul Beesleydifferent providers that can be easily integrated into final product firmware.
83540d553cfSPaul BeesleyThe following sections describe the framework which facilitates the
83640d553cfSPaul Beesleyregistration, initialization and use of runtime services in EL3 Runtime
83740d553cfSPaul BeesleySoftware (BL31).
83840d553cfSPaul Beesley
83940d553cfSPaul BeesleyThe design of the runtime services depends heavily on the concepts and
84040d553cfSPaul Beesleydefinitions described in the `SMCCC`_, in particular SMC Function IDs, Owning
84140d553cfSPaul BeesleyEntity Numbers (OEN), Fast and Yielding calls, and the SMC32 and SMC64 calling
84240d553cfSPaul Beesleyconventions. Please refer to that document for more detailed explanation of
84340d553cfSPaul Beesleythese terms.
84440d553cfSPaul Beesley
84540d553cfSPaul BeesleyThe following runtime services are expected to be implemented first. They have
84640d553cfSPaul Beesleynot all been instantiated in the current implementation.
84740d553cfSPaul Beesley
84840d553cfSPaul Beesley#. Standard service calls
84940d553cfSPaul Beesley
85040d553cfSPaul Beesley   This service is for management of the entire system. The Power State
85140d553cfSPaul Beesley   Coordination Interface (`PSCI`_) is the first set of standard service calls
85240d553cfSPaul Beesley   defined by Arm (see PSCI section later).
85340d553cfSPaul Beesley
85440d553cfSPaul Beesley#. Secure-EL1 Payload Dispatcher service
85540d553cfSPaul Beesley
85640d553cfSPaul Beesley   If a system runs a Trusted OS or other Secure-EL1 Payload (SP) then
85740d553cfSPaul Beesley   it also requires a *Secure Monitor* at EL3 to switch the EL1 processor
85840d553cfSPaul Beesley   context between the normal world (EL1/EL2) and trusted world (Secure-EL1).
85940d553cfSPaul Beesley   The Secure Monitor will make these world switches in response to SMCs. The
86040d553cfSPaul Beesley   `SMCCC`_ provides for such SMCs with the Trusted OS Call and Trusted
86140d553cfSPaul Beesley   Application Call OEN ranges.
86240d553cfSPaul Beesley
86340d553cfSPaul Beesley   The interface between the EL3 Runtime Software and the Secure-EL1 Payload is
86440d553cfSPaul Beesley   not defined by the `SMCCC`_ or any other standard. As a result, each
86540d553cfSPaul Beesley   Secure-EL1 Payload requires a specific Secure Monitor that runs as a runtime
86640d553cfSPaul Beesley   service - within TF-A this service is referred to as the Secure-EL1 Payload
86740d553cfSPaul Beesley   Dispatcher (SPD).
86840d553cfSPaul Beesley
86940d553cfSPaul Beesley   TF-A provides a Test Secure-EL1 Payload (TSP) and its associated Dispatcher
87040d553cfSPaul Beesley   (TSPD). Details of SPD design and TSP/TSPD operation are described in the
87143f35ef5SPaul Beesley   :ref:`firmware_design_sel1_spd` section below.
87240d553cfSPaul Beesley
87340d553cfSPaul Beesley#. CPU implementation service
87440d553cfSPaul Beesley
87540d553cfSPaul Beesley   This service will provide an interface to CPU implementation specific
87640d553cfSPaul Beesley   services for a given platform e.g. access to processor errata workarounds.
87740d553cfSPaul Beesley   This service is currently unimplemented.
87840d553cfSPaul Beesley
87940d553cfSPaul BeesleyAdditional services for Arm Architecture, SiP and OEM calls can be implemented.
88040d553cfSPaul BeesleyEach implemented service handles a range of SMC function identifiers as
88140d553cfSPaul Beesleydescribed in the `SMCCC`_.
88240d553cfSPaul Beesley
88340d553cfSPaul BeesleyRegistration
88440d553cfSPaul Beesley~~~~~~~~~~~~
88540d553cfSPaul Beesley
88640d553cfSPaul BeesleyA runtime service is registered using the ``DECLARE_RT_SVC()`` macro, specifying
88740d553cfSPaul Beesleythe name of the service, the range of OENs covered, the type of service and
88840d553cfSPaul Beesleyinitialization and call handler functions. This macro instantiates a ``const struct rt_svc_desc`` for the service with these details (see ``runtime_svc.h``).
88940d553cfSPaul BeesleyThis structure is allocated in a special ELF section ``rt_svc_descs``, enabling
89040d553cfSPaul Beesleythe framework to find all service descriptors included into BL31.
89140d553cfSPaul Beesley
89240d553cfSPaul BeesleyThe specific service for a SMC Function is selected based on the OEN and call
89340d553cfSPaul Beesleytype of the Function ID, and the framework uses that information in the service
89440d553cfSPaul Beesleydescriptor to identify the handler for the SMC Call.
89540d553cfSPaul Beesley
89640d553cfSPaul BeesleyThe service descriptors do not include information to identify the precise set
89740d553cfSPaul Beesleyof SMC function identifiers supported by this service implementation, the
89840d553cfSPaul Beesleysecurity state from which such calls are valid nor the capability to support
89940d553cfSPaul Beesley64-bit and/or 32-bit callers (using SMC32 or SMC64). Responding appropriately
90040d553cfSPaul Beesleyto these aspects of a SMC call is the responsibility of the service
90140d553cfSPaul Beesleyimplementation, the framework is focused on integration of services from
90240d553cfSPaul Beesleydifferent providers and minimizing the time taken by the framework before the
90340d553cfSPaul Beesleyservice handler is invoked.
90440d553cfSPaul Beesley
90540d553cfSPaul BeesleyDetails of the parameters, requirements and behavior of the initialization and
90640d553cfSPaul Beesleycall handling functions are provided in the following sections.
90740d553cfSPaul Beesley
90840d553cfSPaul BeesleyInitialization
90940d553cfSPaul Beesley~~~~~~~~~~~~~~
91040d553cfSPaul Beesley
91140d553cfSPaul Beesley``runtime_svc_init()`` in ``runtime_svc.c`` initializes the runtime services
91240d553cfSPaul Beesleyframework running on the primary CPU during cold boot as part of the BL31
91340d553cfSPaul Beesleyinitialization. This happens prior to initializing a Trusted OS and running
91440d553cfSPaul BeesleyNormal world boot firmware that might in turn use these services.
91540d553cfSPaul BeesleyInitialization involves validating each of the declared runtime service
91640d553cfSPaul Beesleydescriptors, calling the service initialization function and populating the
91740d553cfSPaul Beesleyindex used for runtime lookup of the service.
91840d553cfSPaul Beesley
91940d553cfSPaul BeesleyThe BL31 linker script collects all of the declared service descriptors into a
92040d553cfSPaul Beesleysingle array and defines symbols that allow the framework to locate and traverse
92140d553cfSPaul Beesleythe array, and determine its size.
92240d553cfSPaul Beesley
92340d553cfSPaul BeesleyThe framework does basic validation of each descriptor to halt firmware
92440d553cfSPaul Beesleyinitialization if service declaration errors are detected. The framework does
92540d553cfSPaul Beesleynot check descriptors for the following error conditions, and may behave in an
92640d553cfSPaul Beesleyunpredictable manner under such scenarios:
92740d553cfSPaul Beesley
92840d553cfSPaul Beesley#. Overlapping OEN ranges
92940d553cfSPaul Beesley#. Multiple descriptors for the same range of OENs and ``call_type``
93040d553cfSPaul Beesley#. Incorrect range of owning entity numbers for a given ``call_type``
93140d553cfSPaul Beesley
93240d553cfSPaul BeesleyOnce validated, the service ``init()`` callback is invoked. This function carries
93340d553cfSPaul Beesleyout any essential EL3 initialization before servicing requests. The ``init()``
93440d553cfSPaul Beesleyfunction is only invoked on the primary CPU during cold boot. If the service
93540d553cfSPaul Beesleyuses per-CPU data this must either be initialized for all CPUs during this call,
93640d553cfSPaul Beesleyor be done lazily when a CPU first issues an SMC call to that service. If
93740d553cfSPaul Beesley``init()`` returns anything other than ``0``, this is treated as an initialization
93840d553cfSPaul Beesleyerror and the service is ignored: this does not cause the firmware to halt.
93940d553cfSPaul Beesley
94040d553cfSPaul BeesleyThe OEN and call type fields present in the SMC Function ID cover a total of
94140d553cfSPaul Beesley128 distinct services, but in practice a single descriptor can cover a range of
94240d553cfSPaul BeesleyOENs, e.g. SMCs to call a Trusted OS function. To optimize the lookup of a
94340d553cfSPaul Beesleyservice handler, the framework uses an array of 128 indices that map every
94440d553cfSPaul Beesleydistinct OEN/call-type combination either to one of the declared services or to
94540d553cfSPaul Beesleyindicate the service is not handled. This ``rt_svc_descs_indices[]`` array is
94640d553cfSPaul Beesleypopulated for all of the OENs covered by a service after the service ``init()``
94740d553cfSPaul Beesleyfunction has reported success. So a service that fails to initialize will never
94840d553cfSPaul Beesleyhave it's ``handle()`` function invoked.
94940d553cfSPaul Beesley
95040d553cfSPaul BeesleyThe following figure shows how the ``rt_svc_descs_indices[]`` index maps the SMC
95140d553cfSPaul BeesleyFunction ID call type and OEN onto a specific service handler in the
95240d553cfSPaul Beesley``rt_svc_descs[]`` array.
95340d553cfSPaul Beesley
95440d553cfSPaul Beesley|Image 1|
95540d553cfSPaul Beesley
95640d553cfSPaul BeesleyHandling an SMC
95740d553cfSPaul Beesley~~~~~~~~~~~~~~~
95840d553cfSPaul Beesley
95940d553cfSPaul BeesleyWhen the EL3 runtime services framework receives a Secure Monitor Call, the SMC
96040d553cfSPaul BeesleyFunction ID is passed in W0 from the lower exception level (as per the
96140d553cfSPaul Beesley`SMCCC`_). If the calling register width is AArch32, it is invalid to invoke an
96240d553cfSPaul BeesleySMC Function which indicates the SMC64 calling convention: such calls are
96340d553cfSPaul Beesleyignored and return the Unknown SMC Function Identifier result code ``0xFFFFFFFF``
96440d553cfSPaul Beesleyin R0/X0.
96540d553cfSPaul Beesley
96640d553cfSPaul BeesleyBit[31] (fast/yielding call) and bits[29:24] (owning entity number) of the SMC
96740d553cfSPaul BeesleyFunction ID are combined to index into the ``rt_svc_descs_indices[]`` array. The
96840d553cfSPaul Beesleyresulting value might indicate a service that has no handler, in this case the
96940d553cfSPaul Beesleyframework will also report an Unknown SMC Function ID. Otherwise, the value is
97040d553cfSPaul Beesleyused as a further index into the ``rt_svc_descs[]`` array to locate the required
97140d553cfSPaul Beesleyservice and handler.
97240d553cfSPaul Beesley
97340d553cfSPaul BeesleyThe service's ``handle()`` callback is provided with five of the SMC parameters
97440d553cfSPaul Beesleydirectly, the others are saved into memory for retrieval (if needed) by the
97540d553cfSPaul Beesleyhandler. The handler is also provided with an opaque ``handle`` for use with the
97640d553cfSPaul Beesleysupporting library for parameter retrieval, setting return values and context
97740d553cfSPaul Beesleymanipulation; and with ``flags`` indicating the security state of the caller. The
97840d553cfSPaul Beesleyframework finally sets up the execution stack for the handler, and invokes the
97940d553cfSPaul Beesleyservices ``handle()`` function.
98040d553cfSPaul Beesley
981e34cc0ceSMadhukar PappireddyOn return from the handler the result registers are populated in X0-X7 as needed
982e34cc0ceSMadhukar Pappireddybefore restoring the stack and CPU state and returning from the original SMC.
98340d553cfSPaul Beesley
98440d553cfSPaul BeesleyException Handling Framework
98540d553cfSPaul Beesley----------------------------
98640d553cfSPaul Beesley
98740d553cfSPaul BeesleyPlease refer to the `Exception Handling Framework`_ document.
98840d553cfSPaul Beesley
98940d553cfSPaul BeesleyPower State Coordination Interface
99040d553cfSPaul Beesley----------------------------------
99140d553cfSPaul Beesley
99240d553cfSPaul BeesleyTODO: Provide design walkthrough of PSCI implementation.
99340d553cfSPaul Beesley
99440d553cfSPaul BeesleyThe PSCI v1.1 specification categorizes APIs as optional and mandatory. All the
99540d553cfSPaul Beesleymandatory APIs in PSCI v1.1, PSCI v1.0 and in PSCI v0.2 draft specification
99640d553cfSPaul Beesley`Power State Coordination Interface PDD`_ are implemented. The table lists
99740d553cfSPaul Beesleythe PSCI v1.1 APIs and their support in generic code.
99840d553cfSPaul Beesley
99940d553cfSPaul BeesleyAn API implementation might have a dependency on platform code e.g. CPU_SUSPEND
100040d553cfSPaul Beesleyrequires the platform to export a part of the implementation. Hence the level
100140d553cfSPaul Beesleyof support of the mandatory APIs depends upon the support exported by the
100240d553cfSPaul Beesleyplatform port as well. The Juno and FVP (all variants) platforms export all the
100340d553cfSPaul Beesleyrequired support.
100440d553cfSPaul Beesley
100540d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
100640d553cfSPaul Beesley| PSCI v1.1 API               | Supported   | Comments                      |
100740d553cfSPaul Beesley+=============================+=============+===============================+
100840d553cfSPaul Beesley| ``PSCI_VERSION``            | Yes         | The version returned is 1.1   |
100940d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
101040d553cfSPaul Beesley| ``CPU_SUSPEND``             | Yes\*       |                               |
101140d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
101240d553cfSPaul Beesley| ``CPU_OFF``                 | Yes\*       |                               |
101340d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
101440d553cfSPaul Beesley| ``CPU_ON``                  | Yes\*       |                               |
101540d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
101640d553cfSPaul Beesley| ``AFFINITY_INFO``           | Yes         |                               |
101740d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
101840d553cfSPaul Beesley| ``MIGRATE``                 | Yes\*\*     |                               |
101940d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
102040d553cfSPaul Beesley| ``MIGRATE_INFO_TYPE``       | Yes\*\*     |                               |
102140d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
102240d553cfSPaul Beesley| ``MIGRATE_INFO_CPU``        | Yes\*\*     |                               |
102340d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
102440d553cfSPaul Beesley| ``SYSTEM_OFF``              | Yes\*       |                               |
102540d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
102640d553cfSPaul Beesley| ``SYSTEM_RESET``            | Yes\*       |                               |
102740d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
102840d553cfSPaul Beesley| ``PSCI_FEATURES``           | Yes         |                               |
102940d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
103040d553cfSPaul Beesley| ``CPU_FREEZE``              | No          |                               |
103140d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
103240d553cfSPaul Beesley| ``CPU_DEFAULT_SUSPEND``     | No          |                               |
103340d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
103440d553cfSPaul Beesley| ``NODE_HW_STATE``           | Yes\*       |                               |
103540d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
103640d553cfSPaul Beesley| ``SYSTEM_SUSPEND``          | Yes\*       |                               |
103740d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
103840d553cfSPaul Beesley| ``PSCI_SET_SUSPEND_MODE``   | No          |                               |
103940d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
104040d553cfSPaul Beesley| ``PSCI_STAT_RESIDENCY``     | Yes\*       |                               |
104140d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
104240d553cfSPaul Beesley| ``PSCI_STAT_COUNT``         | Yes\*       |                               |
104340d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
104440d553cfSPaul Beesley| ``SYSTEM_RESET2``           | Yes\*       |                               |
104540d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
104640d553cfSPaul Beesley| ``MEM_PROTECT``             | Yes\*       |                               |
104740d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
104840d553cfSPaul Beesley| ``MEM_PROTECT_CHECK_RANGE`` | Yes\*       |                               |
104940d553cfSPaul Beesley+-----------------------------+-------------+-------------------------------+
105040d553cfSPaul Beesley
105140d553cfSPaul Beesley\*Note : These PSCI APIs require platform power management hooks to be
105240d553cfSPaul Beesleyregistered with the generic PSCI code to be supported.
105340d553cfSPaul Beesley
105440d553cfSPaul Beesley\*\*Note : These PSCI APIs require appropriate Secure Payload Dispatcher
105540d553cfSPaul Beesleyhooks to be registered with the generic PSCI code to be supported.
105640d553cfSPaul Beesley
105740d553cfSPaul BeesleyThe PSCI implementation in TF-A is a library which can be integrated with
105840d553cfSPaul BeesleyAArch64 or AArch32 EL3 Runtime Software for Armv8-A systems. A guide to
105940d553cfSPaul Beesleyintegrating PSCI library with AArch32 EL3 Runtime Software can be found
106034760951SPaul Beesleyat :ref:`PSCI Library Integration guide for Armv8-A AArch32 systems`.
106134760951SPaul Beesley
106234760951SPaul Beesley.. _firmware_design_sel1_spd:
106340d553cfSPaul Beesley
106440d553cfSPaul BeesleySecure-EL1 Payloads and Dispatchers
106540d553cfSPaul Beesley-----------------------------------
106640d553cfSPaul Beesley
106740d553cfSPaul BeesleyOn a production system that includes a Trusted OS running in Secure-EL1/EL0,
106840d553cfSPaul Beesleythe Trusted OS is coupled with a companion runtime service in the BL31
106940d553cfSPaul Beesleyfirmware. This service is responsible for the initialisation of the Trusted
107040d553cfSPaul BeesleyOS and all communications with it. The Trusted OS is the BL32 stage of the
107140d553cfSPaul Beesleyboot flow in TF-A. The firmware will attempt to locate, load and execute a
107240d553cfSPaul BeesleyBL32 image.
107340d553cfSPaul Beesley
107440d553cfSPaul BeesleyTF-A uses a more general term for the BL32 software that runs at Secure-EL1 -
107540d553cfSPaul Beesleythe *Secure-EL1 Payload* - as it is not always a Trusted OS.
107640d553cfSPaul Beesley
107740d553cfSPaul BeesleyTF-A provides a Test Secure-EL1 Payload (TSP) and a Test Secure-EL1 Payload
107840d553cfSPaul BeesleyDispatcher (TSPD) service as an example of how a Trusted OS is supported on a
107940d553cfSPaul Beesleyproduction system using the Runtime Services Framework. On such a system, the
108040d553cfSPaul BeesleyTest BL32 image and service are replaced by the Trusted OS and its dispatcher
108140d553cfSPaul Beesleyservice. The TF-A build system expects that the dispatcher will define the
108240d553cfSPaul Beesleybuild flag ``NEED_BL32`` to enable it to include the BL32 in the build either
108340d553cfSPaul Beesleyas a binary or to compile from source depending on whether the ``BL32`` build
108440d553cfSPaul Beesleyoption is specified or not.
108540d553cfSPaul Beesley
108640d553cfSPaul BeesleyThe TSP runs in Secure-EL1. It is designed to demonstrate synchronous
108740d553cfSPaul Beesleycommunication with the normal-world software running in EL1/EL2. Communication
108840d553cfSPaul Beesleyis initiated by the normal-world software
108940d553cfSPaul Beesley
109040d553cfSPaul Beesley-  either directly through a Fast SMC (as defined in the `SMCCC`_)
109140d553cfSPaul Beesley
109240d553cfSPaul Beesley-  or indirectly through a `PSCI`_ SMC. The `PSCI`_ implementation in turn
109340d553cfSPaul Beesley   informs the TSPD about the requested power management operation. This allows
109440d553cfSPaul Beesley   the TSP to prepare for or respond to the power state change
109540d553cfSPaul Beesley
109640d553cfSPaul BeesleyThe TSPD service is responsible for.
109740d553cfSPaul Beesley
109840d553cfSPaul Beesley-  Initializing the TSP
109940d553cfSPaul Beesley
110040d553cfSPaul Beesley-  Routing requests and responses between the secure and the non-secure
110140d553cfSPaul Beesley   states during the two types of communications just described
110240d553cfSPaul Beesley
110340d553cfSPaul BeesleyInitializing a BL32 Image
110440d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~
110540d553cfSPaul Beesley
110640d553cfSPaul BeesleyThe Secure-EL1 Payload Dispatcher (SPD) service is responsible for initializing
110740d553cfSPaul Beesleythe BL32 image. It needs access to the information passed by BL2 to BL31 to do
110840d553cfSPaul Beesleyso. This is provided by:
110940d553cfSPaul Beesley
111040d553cfSPaul Beesley.. code:: c
111140d553cfSPaul Beesley
111240d553cfSPaul Beesley    entry_point_info_t *bl31_plat_get_next_image_ep_info(uint32_t);
111340d553cfSPaul Beesley
111440d553cfSPaul Beesleywhich returns a reference to the ``entry_point_info`` structure corresponding to
111540d553cfSPaul Beesleythe image which will be run in the specified security state. The SPD uses this
111640d553cfSPaul BeesleyAPI to get entry point information for the SECURE image, BL32.
111740d553cfSPaul Beesley
111840d553cfSPaul BeesleyIn the absence of a BL32 image, BL31 passes control to the normal world
111940d553cfSPaul Beesleybootloader image (BL33). When the BL32 image is present, it is typical
112040d553cfSPaul Beesleythat the SPD wants control to be passed to BL32 first and then later to BL33.
112140d553cfSPaul Beesley
112240d553cfSPaul BeesleyTo do this the SPD has to register a BL32 initialization function during
112340d553cfSPaul Beesleyinitialization of the SPD service. The BL32 initialization function has this
112440d553cfSPaul Beesleyprototype:
112540d553cfSPaul Beesley
112640d553cfSPaul Beesley.. code:: c
112740d553cfSPaul Beesley
112840d553cfSPaul Beesley    int32_t init(void);
112940d553cfSPaul Beesley
113040d553cfSPaul Beesleyand is registered using the ``bl31_register_bl32_init()`` function.
113140d553cfSPaul Beesley
113240d553cfSPaul BeesleyTF-A supports two approaches for the SPD to pass control to BL32 before
113340d553cfSPaul Beesleyreturning through EL3 and running the non-trusted firmware (BL33):
113440d553cfSPaul Beesley
113540d553cfSPaul Beesley#. In the BL32 setup function, use ``bl31_set_next_image_type()`` to
113640d553cfSPaul Beesley   request that the exit from ``bl31_main()`` is to the BL32 entrypoint in
113740d553cfSPaul Beesley   Secure-EL1. BL31 will exit to BL32 using the asynchronous method by
113840d553cfSPaul Beesley   calling ``bl31_prepare_next_image_entry()`` and ``el3_exit()``.
113940d553cfSPaul Beesley
114040d553cfSPaul Beesley   When the BL32 has completed initialization at Secure-EL1, it returns to
114140d553cfSPaul Beesley   BL31 by issuing an SMC, using a Function ID allocated to the SPD. On
114240d553cfSPaul Beesley   receipt of this SMC, the SPD service handler should switch the CPU context
114340d553cfSPaul Beesley   from trusted to normal world and use the ``bl31_set_next_image_type()`` and
114440d553cfSPaul Beesley   ``bl31_prepare_next_image_entry()`` functions to set up the initial return to
114540d553cfSPaul Beesley   the normal world firmware BL33. On return from the handler the framework
114640d553cfSPaul Beesley   will exit to EL2 and run BL33.
114740d553cfSPaul Beesley
114840d553cfSPaul Beesley#. The BL32 setup function registers an initialization function using
114940d553cfSPaul Beesley   ``bl31_register_bl32_init()`` which provides a SPD-defined mechanism to
115040d553cfSPaul Beesley   invoke a 'world-switch synchronous call' to Secure-EL1 to run the BL32
115140d553cfSPaul Beesley   entrypoint.
1152e1c5026aSPaul Beesley
1153e1c5026aSPaul Beesley   .. note::
1154e1c5026aSPaul Beesley      The Test SPD service included with TF-A provides one implementation
115540d553cfSPaul Beesley      of such a mechanism.
115640d553cfSPaul Beesley
115740d553cfSPaul Beesley   On completion BL32 returns control to BL31 via a SMC, and on receipt the
115840d553cfSPaul Beesley   SPD service handler invokes the synchronous call return mechanism to return
115940d553cfSPaul Beesley   to the BL32 initialization function. On return from this function,
116040d553cfSPaul Beesley   ``bl31_main()`` will set up the return to the normal world firmware BL33 and
116140d553cfSPaul Beesley   continue the boot process in the normal world.
116240d553cfSPaul Beesley
116340d553cfSPaul BeesleyCrash Reporting in BL31
116440d553cfSPaul Beesley-----------------------
116540d553cfSPaul Beesley
116640d553cfSPaul BeesleyBL31 implements a scheme for reporting the processor state when an unhandled
116740d553cfSPaul Beesleyexception is encountered. The reporting mechanism attempts to preserve all the
116840d553cfSPaul Beesleyregister contents and report it via a dedicated UART (PL011 console). BL31
116940d553cfSPaul Beesleyreports the general purpose, EL3, Secure EL1 and some EL2 state registers.
117040d553cfSPaul Beesley
117140d553cfSPaul BeesleyA dedicated per-CPU crash stack is maintained by BL31 and this is retrieved via
117240d553cfSPaul Beesleythe per-CPU pointer cache. The implementation attempts to minimise the memory
117340d553cfSPaul Beesleyrequired for this feature. The file ``crash_reporting.S`` contains the
117440d553cfSPaul Beesleyimplementation for crash reporting.
117540d553cfSPaul Beesley
117640d553cfSPaul BeesleyThe sample crash output is shown below.
117740d553cfSPaul Beesley
117840d553cfSPaul Beesley::
117940d553cfSPaul Beesley
1180*b4292bc6SAlexei Fedorov    x0             = 0x000000002a4a0000
1181*b4292bc6SAlexei Fedorov    x1             = 0x0000000000000001
1182*b4292bc6SAlexei Fedorov    x2             = 0x0000000000000002
1183*b4292bc6SAlexei Fedorov    x3             = 0x0000000000000003
1184*b4292bc6SAlexei Fedorov    x4             = 0x0000000000000004
1185*b4292bc6SAlexei Fedorov    x5             = 0x0000000000000005
1186*b4292bc6SAlexei Fedorov    x6             = 0x0000000000000006
1187*b4292bc6SAlexei Fedorov    x7             = 0x0000000000000007
1188*b4292bc6SAlexei Fedorov    x8             = 0x0000000000000008
1189*b4292bc6SAlexei Fedorov    x9             = 0x0000000000000009
1190*b4292bc6SAlexei Fedorov    x10            = 0x0000000000000010
1191*b4292bc6SAlexei Fedorov    x11            = 0x0000000000000011
1192*b4292bc6SAlexei Fedorov    x12            = 0x0000000000000012
1193*b4292bc6SAlexei Fedorov    x13            = 0x0000000000000013
1194*b4292bc6SAlexei Fedorov    x14            = 0x0000000000000014
1195*b4292bc6SAlexei Fedorov    x15            = 0x0000000000000015
1196*b4292bc6SAlexei Fedorov    x16            = 0x0000000000000016
1197*b4292bc6SAlexei Fedorov    x17            = 0x0000000000000017
1198*b4292bc6SAlexei Fedorov    x18            = 0x0000000000000018
1199*b4292bc6SAlexei Fedorov    x19            = 0x0000000000000019
1200*b4292bc6SAlexei Fedorov    x20            = 0x0000000000000020
1201*b4292bc6SAlexei Fedorov    x21            = 0x0000000000000021
1202*b4292bc6SAlexei Fedorov    x22            = 0x0000000000000022
1203*b4292bc6SAlexei Fedorov    x23            = 0x0000000000000023
1204*b4292bc6SAlexei Fedorov    x24            = 0x0000000000000024
1205*b4292bc6SAlexei Fedorov    x25            = 0x0000000000000025
1206*b4292bc6SAlexei Fedorov    x26            = 0x0000000000000026
1207*b4292bc6SAlexei Fedorov    x27            = 0x0000000000000027
1208*b4292bc6SAlexei Fedorov    x28            = 0x0000000000000028
1209*b4292bc6SAlexei Fedorov    x29            = 0x0000000000000029
1210*b4292bc6SAlexei Fedorov    x30            = 0x0000000088000b78
1211*b4292bc6SAlexei Fedorov    scr_el3        = 0x000000000003073d
1212*b4292bc6SAlexei Fedorov    sctlr_el3      = 0x00000000b0cd183f
1213*b4292bc6SAlexei Fedorov    cptr_el3       = 0x0000000000000000
1214*b4292bc6SAlexei Fedorov    tcr_el3        = 0x000000008080351c
1215*b4292bc6SAlexei Fedorov    daif           = 0x00000000000002c0
1216*b4292bc6SAlexei Fedorov    mair_el3       = 0x00000000004404ff
1217*b4292bc6SAlexei Fedorov    spsr_el3       = 0x0000000060000349
1218*b4292bc6SAlexei Fedorov    elr_el3        = 0x0000000088000114
1219*b4292bc6SAlexei Fedorov    ttbr0_el3      = 0x0000000004018201
1220*b4292bc6SAlexei Fedorov    esr_el3        = 0x00000000be000000
1221*b4292bc6SAlexei Fedorov    far_el3        = 0x0000000000000000
1222*b4292bc6SAlexei Fedorov    spsr_el1       = 0x0000000000000000
1223*b4292bc6SAlexei Fedorov    elr_el1        = 0x0000000000000000
1224*b4292bc6SAlexei Fedorov    spsr_abt       = 0x0000000000000000
1225*b4292bc6SAlexei Fedorov    spsr_und       = 0x0000000000000000
1226*b4292bc6SAlexei Fedorov    spsr_irq       = 0x0000000000000000
1227*b4292bc6SAlexei Fedorov    spsr_fiq       = 0x0000000000000000
1228*b4292bc6SAlexei Fedorov    sctlr_el1      = 0x0000000030d00800
1229*b4292bc6SAlexei Fedorov    actlr_el1      = 0x0000000000000000
1230*b4292bc6SAlexei Fedorov    cpacr_el1      = 0x0000000000000000
1231*b4292bc6SAlexei Fedorov    csselr_el1     = 0x0000000000000000
1232*b4292bc6SAlexei Fedorov    sp_el1         = 0x0000000000000000
1233*b4292bc6SAlexei Fedorov    esr_el1        = 0x0000000000000000
1234*b4292bc6SAlexei Fedorov    ttbr0_el1      = 0x0000000000000000
1235*b4292bc6SAlexei Fedorov    ttbr1_el1      = 0x0000000000000000
1236*b4292bc6SAlexei Fedorov    mair_el1       = 0x0000000000000000
1237*b4292bc6SAlexei Fedorov    amair_el1      = 0x0000000000000000
1238*b4292bc6SAlexei Fedorov    tcr_el1        = 0x0000000000000000
1239*b4292bc6SAlexei Fedorov    tpidr_el1      = 0x0000000000000000
1240*b4292bc6SAlexei Fedorov    tpidr_el0      = 0x0000000000000000
1241*b4292bc6SAlexei Fedorov    tpidrro_el0    = 0x0000000000000000
1242*b4292bc6SAlexei Fedorov    par_el1        = 0x0000000000000000
1243*b4292bc6SAlexei Fedorov    mpidr_el1      = 0x0000000080000000
1244*b4292bc6SAlexei Fedorov    afsr0_el1      = 0x0000000000000000
1245*b4292bc6SAlexei Fedorov    afsr1_el1      = 0x0000000000000000
1246*b4292bc6SAlexei Fedorov    contextidr_el1 = 0x0000000000000000
1247*b4292bc6SAlexei Fedorov    vbar_el1       = 0x0000000000000000
1248*b4292bc6SAlexei Fedorov    cntp_ctl_el0   = 0x0000000000000000
1249*b4292bc6SAlexei Fedorov    cntp_cval_el0  = 0x0000000000000000
1250*b4292bc6SAlexei Fedorov    cntv_ctl_el0   = 0x0000000000000000
1251*b4292bc6SAlexei Fedorov    cntv_cval_el0  = 0x0000000000000000
1252*b4292bc6SAlexei Fedorov    cntkctl_el1    = 0x0000000000000000
1253*b4292bc6SAlexei Fedorov    sp_el0         = 0x0000000004014940
1254*b4292bc6SAlexei Fedorov    isr_el1        = 0x0000000000000000
1255*b4292bc6SAlexei Fedorov    dacr32_el2     = 0x0000000000000000
1256*b4292bc6SAlexei Fedorov    ifsr32_el2     = 0x0000000000000000
1257*b4292bc6SAlexei Fedorov    icc_hppir0_el1 = 0x00000000000003ff
1258*b4292bc6SAlexei Fedorov    icc_hppir1_el1 = 0x00000000000003ff
1259*b4292bc6SAlexei Fedorov    icc_ctlr_el3   = 0x0000000000080400
1260*b4292bc6SAlexei Fedorov    gicd_ispendr regs (Offsets 0x200-0x278)
1261*b4292bc6SAlexei Fedorov    Offset		    Value
1262*b4292bc6SAlexei Fedorov    0x200:	     0x0000000000000000
1263*b4292bc6SAlexei Fedorov    0x208:	     0x0000000000000000
1264*b4292bc6SAlexei Fedorov    0x210:	     0x0000000000000000
1265*b4292bc6SAlexei Fedorov    0x218:	     0x0000000000000000
1266*b4292bc6SAlexei Fedorov    0x220:	     0x0000000000000000
1267*b4292bc6SAlexei Fedorov    0x228:	     0x0000000000000000
1268*b4292bc6SAlexei Fedorov    0x230:	     0x0000000000000000
1269*b4292bc6SAlexei Fedorov    0x238:	     0x0000000000000000
1270*b4292bc6SAlexei Fedorov    0x240:	     0x0000000000000000
1271*b4292bc6SAlexei Fedorov    0x248:	     0x0000000000000000
1272*b4292bc6SAlexei Fedorov    0x250:	     0x0000000000000000
1273*b4292bc6SAlexei Fedorov    0x258:	     0x0000000000000000
1274*b4292bc6SAlexei Fedorov    0x260:	     0x0000000000000000
1275*b4292bc6SAlexei Fedorov    0x268:	     0x0000000000000000
1276*b4292bc6SAlexei Fedorov    0x270:	     0x0000000000000000
1277*b4292bc6SAlexei Fedorov    0x278:	     0x0000000000000000
127840d553cfSPaul Beesley
127940d553cfSPaul BeesleyGuidelines for Reset Handlers
128040d553cfSPaul Beesley-----------------------------
128140d553cfSPaul Beesley
128240d553cfSPaul BeesleyTF-A implements a framework that allows CPU and platform ports to perform
128340d553cfSPaul Beesleyactions very early after a CPU is released from reset in both the cold and warm
128440d553cfSPaul Beesleyboot paths. This is done by calling the ``reset_handler()`` function in both
128540d553cfSPaul Beesleythe BL1 and BL31 images. It in turn calls the platform and CPU specific reset
128640d553cfSPaul Beesleyhandling functions.
128740d553cfSPaul Beesley
128840d553cfSPaul BeesleyDetails for implementing a CPU specific reset handler can be found in
128940d553cfSPaul BeesleySection 8. Details for implementing a platform specific reset handler can be
129034760951SPaul Beesleyfound in the :ref:`Porting Guide` (see the ``plat_reset_handler()`` function).
129140d553cfSPaul Beesley
129240d553cfSPaul BeesleyWhen adding functionality to a reset handler, keep in mind that if a different
129340d553cfSPaul Beesleyreset handling behavior is required between the first and the subsequent
129440d553cfSPaul Beesleyinvocations of the reset handling code, this should be detected at runtime.
129540d553cfSPaul BeesleyIn other words, the reset handler should be able to detect whether an action has
129640d553cfSPaul Beesleyalready been performed and act as appropriate. Possible courses of actions are,
129740d553cfSPaul Beesleye.g. skip the action the second time, or undo/redo it.
129840d553cfSPaul Beesley
129940d553cfSPaul BeesleyConfiguring secure interrupts
130040d553cfSPaul Beesley-----------------------------
130140d553cfSPaul Beesley
130240d553cfSPaul BeesleyThe GIC driver is responsible for performing initial configuration of secure
130340d553cfSPaul Beesleyinterrupts on the platform. To this end, the platform is expected to provide the
130440d553cfSPaul BeesleyGIC driver (either GICv2 or GICv3, as selected by the platform) with the
130540d553cfSPaul Beesleyinterrupt configuration during the driver initialisation.
130640d553cfSPaul Beesley
130740d553cfSPaul BeesleySecure interrupt configuration are specified in an array of secure interrupt
130840d553cfSPaul Beesleyproperties. In this scheme, in both GICv2 and GICv3 driver data structures, the
130940d553cfSPaul Beesley``interrupt_props`` member points to an array of interrupt properties. Each
131040d553cfSPaul Beesleyelement of the array specifies the interrupt number and its attributes
131140d553cfSPaul Beesley(priority, group, configuration). Each element of the array shall be populated
131240d553cfSPaul Beesleyby the macro ``INTR_PROP_DESC()``. The macro takes the following arguments:
131340d553cfSPaul Beesley
131440d553cfSPaul Beesley- 10-bit interrupt number,
131540d553cfSPaul Beesley
131640d553cfSPaul Beesley- 8-bit interrupt priority,
131740d553cfSPaul Beesley
131840d553cfSPaul Beesley- Interrupt type (one of ``INTR_TYPE_EL3``, ``INTR_TYPE_S_EL1``,
131940d553cfSPaul Beesley  ``INTR_TYPE_NS``),
132040d553cfSPaul Beesley
132140d553cfSPaul Beesley- Interrupt configuration (either ``GIC_INTR_CFG_LEVEL`` or
132240d553cfSPaul Beesley  ``GIC_INTR_CFG_EDGE``).
132340d553cfSPaul Beesley
132434760951SPaul Beesley.. _firmware_design_cpu_ops_fwk:
132534760951SPaul Beesley
132640d553cfSPaul BeesleyCPU specific operations framework
132740d553cfSPaul Beesley---------------------------------
132840d553cfSPaul Beesley
132940d553cfSPaul BeesleyCertain aspects of the Armv8-A architecture are implementation defined,
133040d553cfSPaul Beesleythat is, certain behaviours are not architecturally defined, but must be
133140d553cfSPaul Beesleydefined and documented by individual processor implementations. TF-A
133240d553cfSPaul Beesleyimplements a framework which categorises the common implementation defined
133340d553cfSPaul Beesleybehaviours and allows a processor to export its implementation of that
133440d553cfSPaul Beesleybehaviour. The categories are:
133540d553cfSPaul Beesley
133640d553cfSPaul Beesley#. Processor specific reset sequence.
133740d553cfSPaul Beesley
133840d553cfSPaul Beesley#. Processor specific power down sequences.
133940d553cfSPaul Beesley
134040d553cfSPaul Beesley#. Processor specific register dumping as a part of crash reporting.
134140d553cfSPaul Beesley
134240d553cfSPaul Beesley#. Errata status reporting.
134340d553cfSPaul Beesley
134440d553cfSPaul BeesleyEach of the above categories fulfils a different requirement.
134540d553cfSPaul Beesley
134640d553cfSPaul Beesley#. allows any processor specific initialization before the caches and MMU
134740d553cfSPaul Beesley   are turned on, like implementation of errata workarounds, entry into
134840d553cfSPaul Beesley   the intra-cluster coherency domain etc.
134940d553cfSPaul Beesley
135040d553cfSPaul Beesley#. allows each processor to implement the power down sequence mandated in
135140d553cfSPaul Beesley   its Technical Reference Manual (TRM).
135240d553cfSPaul Beesley
135340d553cfSPaul Beesley#. allows a processor to provide additional information to the developer
135440d553cfSPaul Beesley   in the event of a crash, for example Cortex-A53 has registers which
135540d553cfSPaul Beesley   can expose the data cache contents.
135640d553cfSPaul Beesley
135740d553cfSPaul Beesley#. allows a processor to define a function that inspects and reports the status
135840d553cfSPaul Beesley   of all errata workarounds on that processor.
135940d553cfSPaul Beesley
136040d553cfSPaul BeesleyPlease note that only 2. is mandated by the TRM.
136140d553cfSPaul Beesley
136240d553cfSPaul BeesleyThe CPU specific operations framework scales to accommodate a large number of
136340d553cfSPaul Beesleydifferent CPUs during power down and reset handling. The platform can specify
136440d553cfSPaul Beesleyany CPU optimization it wants to enable for each CPU. It can also specify
136540d553cfSPaul Beesleythe CPU errata workarounds to be applied for each CPU type during reset
136640d553cfSPaul Beesleyhandling by defining CPU errata compile time macros. Details on these macros
136734760951SPaul Beesleycan be found in the :ref:`Arm CPU Specific Build Macros` document.
136840d553cfSPaul Beesley
136940d553cfSPaul BeesleyThe CPU specific operations framework depends on the ``cpu_ops`` structure which
137040d553cfSPaul Beesleyneeds to be exported for each type of CPU in the platform. It is defined in
137140d553cfSPaul Beesley``include/lib/cpus/aarch64/cpu_macros.S`` and has the following fields : ``midr``,
137240d553cfSPaul Beesley``reset_func()``, ``cpu_pwr_down_ops`` (array of power down functions) and
137340d553cfSPaul Beesley``cpu_reg_dump()``.
137440d553cfSPaul Beesley
137540d553cfSPaul BeesleyThe CPU specific files in ``lib/cpus`` export a ``cpu_ops`` data structure with
137640d553cfSPaul Beesleysuitable handlers for that CPU. For example, ``lib/cpus/aarch64/cortex_a53.S``
137740d553cfSPaul Beesleyexports the ``cpu_ops`` for Cortex-A53 CPU. According to the platform
137840d553cfSPaul Beesleyconfiguration, these CPU specific files must be included in the build by
137940d553cfSPaul Beesleythe platform makefile. The generic CPU specific operations framework code exists
138040d553cfSPaul Beesleyin ``lib/cpus/aarch64/cpu_helpers.S``.
138140d553cfSPaul Beesley
138240d553cfSPaul BeesleyCPU specific Reset Handling
138340d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~
138440d553cfSPaul Beesley
138540d553cfSPaul BeesleyAfter a reset, the state of the CPU when it calls generic reset handler is:
138640d553cfSPaul BeesleyMMU turned off, both instruction and data caches turned off and not part
138740d553cfSPaul Beesleyof any coherency domain.
138840d553cfSPaul Beesley
138940d553cfSPaul BeesleyThe BL entrypoint code first invokes the ``plat_reset_handler()`` to allow
139040d553cfSPaul Beesleythe platform to perform any system initialization required and any system
139140d553cfSPaul Beesleyerrata workarounds that needs to be applied. The ``get_cpu_ops_ptr()`` reads
139240d553cfSPaul Beesleythe current CPU midr, finds the matching ``cpu_ops`` entry in the ``cpu_ops``
139340d553cfSPaul Beesleyarray and returns it. Note that only the part number and implementer fields
139440d553cfSPaul Beesleyin midr are used to find the matching ``cpu_ops`` entry. The ``reset_func()`` in
139540d553cfSPaul Beesleythe returned ``cpu_ops`` is then invoked which executes the required reset
139640d553cfSPaul Beesleyhandling for that CPU and also any errata workarounds enabled by the platform.
139740d553cfSPaul BeesleyThis function must preserve the values of general purpose registers x20 to x29.
139840d553cfSPaul Beesley
139940d553cfSPaul BeesleyRefer to Section "Guidelines for Reset Handlers" for general guidelines
140040d553cfSPaul Beesleyregarding placement of code in a reset handler.
140140d553cfSPaul Beesley
140240d553cfSPaul BeesleyCPU specific power down sequence
140340d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
140440d553cfSPaul Beesley
140540d553cfSPaul BeesleyDuring the BL31 initialization sequence, the pointer to the matching ``cpu_ops``
140640d553cfSPaul Beesleyentry is stored in per-CPU data by ``init_cpu_ops()`` so that it can be quickly
140740d553cfSPaul Beesleyretrieved during power down sequences.
140840d553cfSPaul Beesley
140940d553cfSPaul BeesleyVarious CPU drivers register handlers to perform power down at certain power
141040d553cfSPaul Beesleylevels for that specific CPU. The PSCI service, upon receiving a power down
141140d553cfSPaul Beesleyrequest, determines the highest power level at which to execute power down
141240d553cfSPaul Beesleysequence for a particular CPU. It uses the ``prepare_cpu_pwr_dwn()`` function to
141340d553cfSPaul Beesleypick the right power down handler for the requested level. The function
141440d553cfSPaul Beesleyretrieves ``cpu_ops`` pointer member of per-CPU data, and from that, further
141540d553cfSPaul Beesleyretrieves ``cpu_pwr_down_ops`` array, and indexes into the required level. If the
141640d553cfSPaul Beesleyrequested power level is higher than what a CPU driver supports, the handler
141740d553cfSPaul Beesleyregistered for highest level is invoked.
141840d553cfSPaul Beesley
141940d553cfSPaul BeesleyAt runtime the platform hooks for power down are invoked by the PSCI service to
142040d553cfSPaul Beesleyperform platform specific operations during a power down sequence, for example
142140d553cfSPaul Beesleyturning off CCI coherency during a cluster power down.
142240d553cfSPaul Beesley
142340d553cfSPaul BeesleyCPU specific register reporting during crash
142440d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
142540d553cfSPaul Beesley
142640d553cfSPaul BeesleyIf the crash reporting is enabled in BL31, when a crash occurs, the crash
142740d553cfSPaul Beesleyreporting framework calls ``do_cpu_reg_dump`` which retrieves the matching
142840d553cfSPaul Beesley``cpu_ops`` using ``get_cpu_ops_ptr()`` function. The ``cpu_reg_dump()`` in
142940d553cfSPaul Beesley``cpu_ops`` is invoked, which then returns the CPU specific register values to
143040d553cfSPaul Beesleybe reported and a pointer to the ASCII list of register names in a format
143140d553cfSPaul Beesleyexpected by the crash reporting framework.
143240d553cfSPaul Beesley
143334760951SPaul Beesley.. _firmware_design_cpu_errata_reporting:
143434760951SPaul Beesley
143540d553cfSPaul BeesleyCPU errata status reporting
143640d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~
143740d553cfSPaul Beesley
143840d553cfSPaul BeesleyErrata workarounds for CPUs supported in TF-A are applied during both cold and
143940d553cfSPaul Beesleywarm boots, shortly after reset. Individual Errata workarounds are enabled as
144040d553cfSPaul Beesleybuild options. Some errata workarounds have potential run-time implications;
144140d553cfSPaul Beesleytherefore some are enabled by default, others not. Platform ports shall
144240d553cfSPaul Beesleyoverride build options to enable or disable errata as appropriate. The CPU
144340d553cfSPaul Beesleydrivers take care of applying errata workarounds that are enabled and applicable
144434760951SPaul Beesleyto a given CPU. Refer to :ref:`arm_cpu_macros_errata_workarounds` for more
144534760951SPaul Beesleyinformation.
144640d553cfSPaul Beesley
144740d553cfSPaul BeesleyFunctions in CPU drivers that apply errata workaround must follow the
144840d553cfSPaul Beesleyconventions listed below.
144940d553cfSPaul Beesley
145040d553cfSPaul BeesleyThe errata workaround must be authored as two separate functions:
145140d553cfSPaul Beesley
145240d553cfSPaul Beesley-  One that checks for errata. This function must determine whether that errata
145340d553cfSPaul Beesley   applies to the current CPU. Typically this involves matching the current
145440d553cfSPaul Beesley   CPUs revision and variant against a value that's known to be affected by the
145540d553cfSPaul Beesley   errata. If the function determines that the errata applies to this CPU, it
145640d553cfSPaul Beesley   must return ``ERRATA_APPLIES``; otherwise, it must return
145740d553cfSPaul Beesley   ``ERRATA_NOT_APPLIES``. The utility functions ``cpu_get_rev_var`` and
145840d553cfSPaul Beesley   ``cpu_rev_var_ls`` functions may come in handy for this purpose.
145940d553cfSPaul Beesley
146040d553cfSPaul BeesleyFor an errata identified as ``E``, the check function must be named
146140d553cfSPaul Beesley``check_errata_E``.
146240d553cfSPaul Beesley
146340d553cfSPaul BeesleyThis function will be invoked at different times, both from assembly and from
146440d553cfSPaul BeesleyC run time. Therefore it must follow AAPCS, and must not use stack.
146540d553cfSPaul Beesley
146640d553cfSPaul Beesley-  Another one that applies the errata workaround. This function would call the
146740d553cfSPaul Beesley   check function described above, and applies errata workaround if required.
146840d553cfSPaul Beesley
146940d553cfSPaul BeesleyCPU drivers that apply errata workaround can optionally implement an assembly
147040d553cfSPaul Beesleyfunction that report the status of errata workarounds pertaining to that CPU.
147140d553cfSPaul BeesleyFor a driver that registers the CPU, for example, ``cpux`` via ``declare_cpu_ops``
147240d553cfSPaul Beesleymacro, the errata reporting function, if it exists, must be named
147340d553cfSPaul Beesley``cpux_errata_report``. This function will always be called with MMU enabled; it
147440d553cfSPaul Beesleymust follow AAPCS and may use stack.
147540d553cfSPaul Beesley
147640d553cfSPaul BeesleyIn a debug build of TF-A, on a CPU that comes out of reset, both BL1 and the
147740d553cfSPaul Beesleyruntime firmware (BL31 in AArch64, and BL32 in AArch32) will invoke errata
147840d553cfSPaul Beesleystatus reporting function, if one exists, for that type of CPU.
147940d553cfSPaul Beesley
148040d553cfSPaul BeesleyTo report the status of each errata workaround, the function shall use the
148140d553cfSPaul Beesleyassembler macro ``report_errata``, passing it:
148240d553cfSPaul Beesley
148340d553cfSPaul Beesley-  The build option that enables the errata;
148440d553cfSPaul Beesley
148540d553cfSPaul Beesley-  The name of the CPU: this must be the same identifier that CPU driver
148640d553cfSPaul Beesley   registered itself with, using ``declare_cpu_ops``;
148740d553cfSPaul Beesley
148840d553cfSPaul Beesley-  And the errata identifier: the identifier must match what's used in the
148940d553cfSPaul Beesley   errata's check function described above.
149040d553cfSPaul Beesley
149140d553cfSPaul BeesleyThe errata status reporting function will be called once per CPU type/errata
149240d553cfSPaul Beesleycombination during the software's active life time.
149340d553cfSPaul Beesley
149440d553cfSPaul BeesleyIt's expected that whenever an errata workaround is submitted to TF-A, the
149540d553cfSPaul Beesleyerrata reporting function is appropriately extended to report its status as
149640d553cfSPaul Beesleywell.
149740d553cfSPaul Beesley
149840d553cfSPaul BeesleyReporting the status of errata workaround is for informational purpose only; it
149940d553cfSPaul Beesleyhas no functional significance.
150040d553cfSPaul Beesley
150140d553cfSPaul BeesleyMemory layout of BL images
150240d553cfSPaul Beesley--------------------------
150340d553cfSPaul Beesley
150440d553cfSPaul BeesleyEach bootloader image can be divided in 2 parts:
150540d553cfSPaul Beesley
150640d553cfSPaul Beesley-  the static contents of the image. These are data actually stored in the
150740d553cfSPaul Beesley   binary on the disk. In the ELF terminology, they are called ``PROGBITS``
150840d553cfSPaul Beesley   sections;
150940d553cfSPaul Beesley
151040d553cfSPaul Beesley-  the run-time contents of the image. These are data that don't occupy any
151140d553cfSPaul Beesley   space in the binary on the disk. The ELF binary just contains some
151240d553cfSPaul Beesley   metadata indicating where these data will be stored at run-time and the
151340d553cfSPaul Beesley   corresponding sections need to be allocated and initialized at run-time.
151440d553cfSPaul Beesley   In the ELF terminology, they are called ``NOBITS`` sections.
151540d553cfSPaul Beesley
151640d553cfSPaul BeesleyAll PROGBITS sections are grouped together at the beginning of the image,
151740d553cfSPaul Beesleyfollowed by all NOBITS sections. This is true for all TF-A images and it is
151840d553cfSPaul Beesleygoverned by the linker scripts. This ensures that the raw binary images are
151940d553cfSPaul Beesleyas small as possible. If a NOBITS section was inserted in between PROGBITS
152040d553cfSPaul Beesleysections then the resulting binary file would contain zero bytes in place of
152140d553cfSPaul Beesleythis NOBITS section, making the image unnecessarily bigger. Smaller images
152240d553cfSPaul Beesleyallow faster loading from the FIP to the main memory.
152340d553cfSPaul Beesley
1524f8578e64SSamuel HollandFor BL31, a platform can specify an alternate location for NOBITS sections
1525f8578e64SSamuel Holland(other than immediately following PROGBITS sections) by setting
1526f8578e64SSamuel Holland``SEPARATE_NOBITS_REGION`` to 1 and defining ``BL31_NOBITS_BASE`` and
1527f8578e64SSamuel Holland``BL31_NOBITS_LIMIT``.
1528f8578e64SSamuel Holland
152940d553cfSPaul BeesleyLinker scripts and symbols
153040d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~
153140d553cfSPaul Beesley
153240d553cfSPaul BeesleyEach bootloader stage image layout is described by its own linker script. The
153340d553cfSPaul Beesleylinker scripts export some symbols into the program symbol table. Their values
153440d553cfSPaul Beesleycorrespond to particular addresses. TF-A code can refer to these symbols to
153540d553cfSPaul Beesleyfigure out the image memory layout.
153640d553cfSPaul Beesley
153740d553cfSPaul BeesleyLinker symbols follow the following naming convention in TF-A.
153840d553cfSPaul Beesley
153940d553cfSPaul Beesley-  ``__<SECTION>_START__``
154040d553cfSPaul Beesley
154140d553cfSPaul Beesley   Start address of a given section named ``<SECTION>``.
154240d553cfSPaul Beesley
154340d553cfSPaul Beesley-  ``__<SECTION>_END__``
154440d553cfSPaul Beesley
154540d553cfSPaul Beesley   End address of a given section named ``<SECTION>``. If there is an alignment
154640d553cfSPaul Beesley   constraint on the section's end address then ``__<SECTION>_END__`` corresponds
154740d553cfSPaul Beesley   to the end address of the section's actual contents, rounded up to the right
154840d553cfSPaul Beesley   boundary. Refer to the value of ``__<SECTION>_UNALIGNED_END__`` to know the
154940d553cfSPaul Beesley   actual end address of the section's contents.
155040d553cfSPaul Beesley
155140d553cfSPaul Beesley-  ``__<SECTION>_UNALIGNED_END__``
155240d553cfSPaul Beesley
155340d553cfSPaul Beesley   End address of a given section named ``<SECTION>`` without any padding or
155440d553cfSPaul Beesley   rounding up due to some alignment constraint.
155540d553cfSPaul Beesley
155640d553cfSPaul Beesley-  ``__<SECTION>_SIZE__``
155740d553cfSPaul Beesley
155840d553cfSPaul Beesley   Size (in bytes) of a given section named ``<SECTION>``. If there is an
155940d553cfSPaul Beesley   alignment constraint on the section's end address then ``__<SECTION>_SIZE__``
156040d553cfSPaul Beesley   corresponds to the size of the section's actual contents, rounded up to the
156140d553cfSPaul Beesley   right boundary. In other words, ``__<SECTION>_SIZE__ = __<SECTION>_END__ - _<SECTION>_START__``. Refer to the value of ``__<SECTION>_UNALIGNED_SIZE__``
156240d553cfSPaul Beesley   to know the actual size of the section's contents.
156340d553cfSPaul Beesley
156440d553cfSPaul Beesley-  ``__<SECTION>_UNALIGNED_SIZE__``
156540d553cfSPaul Beesley
156640d553cfSPaul Beesley   Size (in bytes) of a given section named ``<SECTION>`` without any padding or
156740d553cfSPaul Beesley   rounding up due to some alignment constraint. In other words,
156840d553cfSPaul Beesley   ``__<SECTION>_UNALIGNED_SIZE__ = __<SECTION>_UNALIGNED_END__ - __<SECTION>_START__``.
156940d553cfSPaul Beesley
157040d553cfSPaul BeesleySome of the linker symbols are mandatory as TF-A code relies on them to be
157140d553cfSPaul Beesleydefined. They are listed in the following subsections. Some of them must be
157240d553cfSPaul Beesleyprovided for each bootloader stage and some are specific to a given bootloader
157340d553cfSPaul Beesleystage.
157440d553cfSPaul Beesley
157540d553cfSPaul BeesleyThe linker scripts define some extra, optional symbols. They are not actually
157640d553cfSPaul Beesleyused by any code but they help in understanding the bootloader images' memory
157740d553cfSPaul Beesleylayout as they are easy to spot in the link map files.
157840d553cfSPaul Beesley
157940d553cfSPaul BeesleyCommon linker symbols
158040d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^
158140d553cfSPaul Beesley
158240d553cfSPaul BeesleyAll BL images share the following requirements:
158340d553cfSPaul Beesley
158440d553cfSPaul Beesley-  The BSS section must be zero-initialised before executing any C code.
158540d553cfSPaul Beesley-  The coherent memory section (if enabled) must be zero-initialised as well.
158640d553cfSPaul Beesley-  The MMU setup code needs to know the extents of the coherent and read-only
158740d553cfSPaul Beesley   memory regions to set the right memory attributes. When
158840d553cfSPaul Beesley   ``SEPARATE_CODE_AND_RODATA=1``, it needs to know more specifically how the
158940d553cfSPaul Beesley   read-only memory region is divided between code and data.
159040d553cfSPaul Beesley
159140d553cfSPaul BeesleyThe following linker symbols are defined for this purpose:
159240d553cfSPaul Beesley
159340d553cfSPaul Beesley-  ``__BSS_START__``
159440d553cfSPaul Beesley-  ``__BSS_SIZE__``
159540d553cfSPaul Beesley-  ``__COHERENT_RAM_START__`` Must be aligned on a page-size boundary.
159640d553cfSPaul Beesley-  ``__COHERENT_RAM_END__`` Must be aligned on a page-size boundary.
159740d553cfSPaul Beesley-  ``__COHERENT_RAM_UNALIGNED_SIZE__``
159840d553cfSPaul Beesley-  ``__RO_START__``
159940d553cfSPaul Beesley-  ``__RO_END__``
160040d553cfSPaul Beesley-  ``__TEXT_START__``
160140d553cfSPaul Beesley-  ``__TEXT_END__``
160240d553cfSPaul Beesley-  ``__RODATA_START__``
160340d553cfSPaul Beesley-  ``__RODATA_END__``
160440d553cfSPaul Beesley
160540d553cfSPaul BeesleyBL1's linker symbols
160640d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^
160740d553cfSPaul Beesley
160840d553cfSPaul BeesleyBL1 being the ROM image, it has additional requirements. BL1 resides in ROM and
160940d553cfSPaul Beesleyit is entirely executed in place but it needs some read-write memory for its
161040d553cfSPaul Beesleymutable data. Its ``.data`` section (i.e. its allocated read-write data) must be
161140d553cfSPaul Beesleyrelocated from ROM to RAM before executing any C code.
161240d553cfSPaul Beesley
161340d553cfSPaul BeesleyThe following additional linker symbols are defined for BL1:
161440d553cfSPaul Beesley
161540d553cfSPaul Beesley-  ``__BL1_ROM_END__`` End address of BL1's ROM contents, covering its code
161640d553cfSPaul Beesley   and ``.data`` section in ROM.
161740d553cfSPaul Beesley-  ``__DATA_ROM_START__`` Start address of the ``.data`` section in ROM. Must be
161840d553cfSPaul Beesley   aligned on a 16-byte boundary.
161940d553cfSPaul Beesley-  ``__DATA_RAM_START__`` Address in RAM where the ``.data`` section should be
162040d553cfSPaul Beesley   copied over. Must be aligned on a 16-byte boundary.
162140d553cfSPaul Beesley-  ``__DATA_SIZE__`` Size of the ``.data`` section (in ROM or RAM).
162240d553cfSPaul Beesley-  ``__BL1_RAM_START__`` Start address of BL1 read-write data.
162340d553cfSPaul Beesley-  ``__BL1_RAM_END__`` End address of BL1 read-write data.
162440d553cfSPaul Beesley
162540d553cfSPaul BeesleyHow to choose the right base addresses for each bootloader stage image
162640d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
162740d553cfSPaul Beesley
162840d553cfSPaul BeesleyThere is currently no support for dynamic image loading in TF-A. This means
162940d553cfSPaul Beesleythat all bootloader images need to be linked against their ultimate runtime
163040d553cfSPaul Beesleylocations and the base addresses of each image must be chosen carefully such
163140d553cfSPaul Beesleythat images don't overlap each other in an undesired way. As the code grows,
163240d553cfSPaul Beesleythe base addresses might need adjustments to cope with the new memory layout.
163340d553cfSPaul Beesley
163440d553cfSPaul BeesleyThe memory layout is completely specific to the platform and so there is no
163540d553cfSPaul Beesleygeneral recipe for choosing the right base addresses for each bootloader image.
163640d553cfSPaul BeesleyHowever, there are tools to aid in understanding the memory layout. These are
163740d553cfSPaul Beesleythe link map files: ``build/<platform>/<build-type>/bl<x>/bl<x>.map``, with ``<x>``
163840d553cfSPaul Beesleybeing the stage bootloader. They provide a detailed view of the memory usage of
163940d553cfSPaul Beesleyeach image. Among other useful information, they provide the end address of
164040d553cfSPaul Beesleyeach image.
164140d553cfSPaul Beesley
164240d553cfSPaul Beesley-  ``bl1.map`` link map file provides ``__BL1_RAM_END__`` address.
164340d553cfSPaul Beesley-  ``bl2.map`` link map file provides ``__BL2_END__`` address.
164440d553cfSPaul Beesley-  ``bl31.map`` link map file provides ``__BL31_END__`` address.
164540d553cfSPaul Beesley-  ``bl32.map`` link map file provides ``__BL32_END__`` address.
164640d553cfSPaul Beesley
164740d553cfSPaul BeesleyFor each bootloader image, the platform code must provide its start address
164840d553cfSPaul Beesleyas well as a limit address that it must not overstep. The latter is used in the
164940d553cfSPaul Beesleylinker scripts to check that the image doesn't grow past that address. If that
165040d553cfSPaul Beesleyhappens, the linker will issue a message similar to the following:
165140d553cfSPaul Beesley
165240d553cfSPaul Beesley::
165340d553cfSPaul Beesley
165440d553cfSPaul Beesley    aarch64-none-elf-ld: BLx has exceeded its limit.
165540d553cfSPaul Beesley
165640d553cfSPaul BeesleyAdditionally, if the platform memory layout implies some image overlaying like
165740d553cfSPaul Beesleyon FVP, BL31 and TSP need to know the limit address that their PROGBITS
165840d553cfSPaul Beesleysections must not overstep. The platform code must provide those.
165940d553cfSPaul Beesley
166040d553cfSPaul BeesleyTF-A does not provide any mechanism to verify at boot time that the memory
166140d553cfSPaul Beesleyto load a new image is free to prevent overwriting a previously loaded image.
166240d553cfSPaul BeesleyThe platform must specify the memory available in the system for all the
166340d553cfSPaul Beesleyrelevant BL images to be loaded.
166440d553cfSPaul Beesley
166540d553cfSPaul BeesleyFor example, in the case of BL1 loading BL2, ``bl1_plat_sec_mem_layout()`` will
166640d553cfSPaul Beesleyreturn the region defined by the platform where BL1 intends to load BL2. The
166740d553cfSPaul Beesley``load_image()`` function performs bounds check for the image size based on the
166840d553cfSPaul Beesleybase and maximum image size provided by the platforms. Platforms must take
166940d553cfSPaul Beesleythis behaviour into account when defining the base/size for each of the images.
167040d553cfSPaul Beesley
167140d553cfSPaul BeesleyMemory layout on Arm development platforms
167240d553cfSPaul Beesley^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
167340d553cfSPaul Beesley
167440d553cfSPaul BeesleyThe following list describes the memory layout on the Arm development platforms:
167540d553cfSPaul Beesley
167640d553cfSPaul Beesley-  A 4KB page of shared memory is used for communication between Trusted
167740d553cfSPaul Beesley   Firmware and the platform's power controller. This is located at the base of
167840d553cfSPaul Beesley   Trusted SRAM. The amount of Trusted SRAM available to load the bootloader
167940d553cfSPaul Beesley   images is reduced by the size of the shared memory.
168040d553cfSPaul Beesley
168140d553cfSPaul Beesley   The shared memory is used to store the CPUs' entrypoint mailbox. On Juno,
168240d553cfSPaul Beesley   this is also used for the MHU payload when passing messages to and from the
168340d553cfSPaul Beesley   SCP.
168440d553cfSPaul Beesley
168540d553cfSPaul Beesley-  Another 4 KB page is reserved for passing memory layout between BL1 and BL2
168640d553cfSPaul Beesley   and also the dynamic firmware configurations.
168740d553cfSPaul Beesley
168840d553cfSPaul Beesley-  On FVP, BL1 is originally sitting in the Trusted ROM at address ``0x0``. On
168940d553cfSPaul Beesley   Juno, BL1 resides in flash memory at address ``0x0BEC0000``. BL1 read-write
169040d553cfSPaul Beesley   data are relocated to the top of Trusted SRAM at runtime.
169140d553cfSPaul Beesley
169240d553cfSPaul Beesley-  BL2 is loaded below BL1 RW
169340d553cfSPaul Beesley
169440d553cfSPaul Beesley-  EL3 Runtime Software, BL31 for AArch64 and BL32 for AArch32 (e.g. SP_MIN),
169540d553cfSPaul Beesley   is loaded at the top of the Trusted SRAM, such that its NOBITS sections will
169640d553cfSPaul Beesley   overwrite BL1 R/W data and BL2. This implies that BL1 global variables
169740d553cfSPaul Beesley   remain valid only until execution reaches the EL3 Runtime Software entry
169840d553cfSPaul Beesley   point during a cold boot.
169940d553cfSPaul Beesley
170040d553cfSPaul Beesley-  On Juno, SCP_BL2 is loaded temporarily into the EL3 Runtime Software memory
1701be653a69SPaul Beesley   region and transferred to the SCP before being overwritten by EL3 Runtime
170240d553cfSPaul Beesley   Software.
170340d553cfSPaul Beesley
170440d553cfSPaul Beesley-  BL32 (for AArch64) can be loaded in one of the following locations:
170540d553cfSPaul Beesley
170640d553cfSPaul Beesley   -  Trusted SRAM
170740d553cfSPaul Beesley   -  Trusted DRAM (FVP only)
170840d553cfSPaul Beesley   -  Secure region of DRAM (top 16MB of DRAM configured by the TrustZone
170940d553cfSPaul Beesley      controller)
171040d553cfSPaul Beesley
171140d553cfSPaul Beesley   When BL32 (for AArch64) is loaded into Trusted SRAM, it is loaded below
171240d553cfSPaul Beesley   BL31.
171340d553cfSPaul Beesley
171440d553cfSPaul BeesleyThe location of the BL32 image will result in different memory maps. This is
171540d553cfSPaul Beesleyillustrated for both FVP and Juno in the following diagrams, using the TSP as
171640d553cfSPaul Beesleyan example.
171740d553cfSPaul Beesley
1718e1c5026aSPaul Beesley.. note::
1719e1c5026aSPaul Beesley   Loading the BL32 image in TZC secured DRAM doesn't change the memory
172040d553cfSPaul Beesley   layout of the other images in Trusted SRAM.
172140d553cfSPaul Beesley
172240d553cfSPaul BeesleyCONFIG section in memory layouts shown below contains:
172340d553cfSPaul Beesley
172440d553cfSPaul Beesley::
172540d553cfSPaul Beesley
172640d553cfSPaul Beesley    +--------------------+
172740d553cfSPaul Beesley    |bl2_mem_params_descs|
172840d553cfSPaul Beesley    |--------------------|
172940d553cfSPaul Beesley    |     fw_configs     |
173040d553cfSPaul Beesley    +--------------------+
173140d553cfSPaul Beesley
173240d553cfSPaul Beesley``bl2_mem_params_descs`` contains parameters passed from BL2 to next the
173340d553cfSPaul BeesleyBL image during boot.
173440d553cfSPaul Beesley
173540d553cfSPaul Beesley``fw_configs`` includes soc_fw_config, tos_fw_config and tb_fw_config.
173640d553cfSPaul Beesley
173740d553cfSPaul Beesley**FVP with TSP in Trusted SRAM with firmware configs :**
173840d553cfSPaul Beesley(These diagrams only cover the AArch64 case)
173940d553cfSPaul Beesley
174040d553cfSPaul Beesley::
174140d553cfSPaul Beesley
174240d553cfSPaul Beesley                   DRAM
174340d553cfSPaul Beesley    0xffffffff +----------+
174440d553cfSPaul Beesley               :          :
174540d553cfSPaul Beesley               |----------|
174640d553cfSPaul Beesley               |HW_CONFIG |
174740d553cfSPaul Beesley    0x83000000 |----------|  (non-secure)
174840d553cfSPaul Beesley               |          |
174940d553cfSPaul Beesley    0x80000000 +----------+
175040d553cfSPaul Beesley
175140d553cfSPaul Beesley               Trusted SRAM
175240d553cfSPaul Beesley    0x04040000 +----------+  loaded by BL2  +----------------+
175340d553cfSPaul Beesley               | BL1 (rw) |  <<<<<<<<<<<<<  |                |
175440d553cfSPaul Beesley               |----------|  <<<<<<<<<<<<<  |  BL31 NOBITS   |
175540d553cfSPaul Beesley               |   BL2    |  <<<<<<<<<<<<<  |                |
175640d553cfSPaul Beesley               |----------|  <<<<<<<<<<<<<  |----------------|
175740d553cfSPaul Beesley               |          |  <<<<<<<<<<<<<  | BL31 PROGBITS  |
175840d553cfSPaul Beesley               |          |  <<<<<<<<<<<<<  |----------------|
175940d553cfSPaul Beesley               |          |  <<<<<<<<<<<<<  |     BL32       |
176040d553cfSPaul Beesley    0x04002000 +----------+                 +----------------+
176140d553cfSPaul Beesley               |  CONFIG  |
176240d553cfSPaul Beesley    0x04001000 +----------+
176340d553cfSPaul Beesley               |  Shared  |
176440d553cfSPaul Beesley    0x04000000 +----------+
176540d553cfSPaul Beesley
176640d553cfSPaul Beesley               Trusted ROM
176740d553cfSPaul Beesley    0x04000000 +----------+
176840d553cfSPaul Beesley               | BL1 (ro) |
176940d553cfSPaul Beesley    0x00000000 +----------+
177040d553cfSPaul Beesley
177140d553cfSPaul Beesley**FVP with TSP in Trusted DRAM with firmware configs (default option):**
177240d553cfSPaul Beesley
177340d553cfSPaul Beesley::
177440d553cfSPaul Beesley
177540d553cfSPaul Beesley                     DRAM
177640d553cfSPaul Beesley    0xffffffff +--------------+
177740d553cfSPaul Beesley               :              :
177840d553cfSPaul Beesley               |--------------|
177940d553cfSPaul Beesley               |  HW_CONFIG   |
178040d553cfSPaul Beesley    0x83000000 |--------------|  (non-secure)
178140d553cfSPaul Beesley               |              |
178240d553cfSPaul Beesley    0x80000000 +--------------+
178340d553cfSPaul Beesley
178440d553cfSPaul Beesley                Trusted DRAM
178540d553cfSPaul Beesley    0x08000000 +--------------+
178640d553cfSPaul Beesley               |     BL32     |
178740d553cfSPaul Beesley    0x06000000 +--------------+
178840d553cfSPaul Beesley
178940d553cfSPaul Beesley                 Trusted SRAM
179040d553cfSPaul Beesley    0x04040000 +--------------+  loaded by BL2  +----------------+
179140d553cfSPaul Beesley               |   BL1 (rw)   |  <<<<<<<<<<<<<  |                |
179240d553cfSPaul Beesley               |--------------|  <<<<<<<<<<<<<  |  BL31 NOBITS   |
179340d553cfSPaul Beesley               |     BL2      |  <<<<<<<<<<<<<  |                |
179440d553cfSPaul Beesley               |--------------|  <<<<<<<<<<<<<  |----------------|
179540d553cfSPaul Beesley               |              |  <<<<<<<<<<<<<  | BL31 PROGBITS  |
179640d553cfSPaul Beesley               |              |                 +----------------+
179740d553cfSPaul Beesley               +--------------+
179840d553cfSPaul Beesley               |    CONFIG    |
179940d553cfSPaul Beesley    0x04001000 +--------------+
180040d553cfSPaul Beesley               |    Shared    |
180140d553cfSPaul Beesley    0x04000000 +--------------+
180240d553cfSPaul Beesley
180340d553cfSPaul Beesley                 Trusted ROM
180440d553cfSPaul Beesley    0x04000000 +--------------+
180540d553cfSPaul Beesley               |   BL1 (ro)   |
180640d553cfSPaul Beesley    0x00000000 +--------------+
180740d553cfSPaul Beesley
180840d553cfSPaul Beesley**FVP with TSP in TZC-Secured DRAM with firmware configs :**
180940d553cfSPaul Beesley
181040d553cfSPaul Beesley::
181140d553cfSPaul Beesley
181240d553cfSPaul Beesley                   DRAM
181340d553cfSPaul Beesley    0xffffffff +----------+
181440d553cfSPaul Beesley               |  BL32    |  (secure)
181540d553cfSPaul Beesley    0xff000000 +----------+
181640d553cfSPaul Beesley               |          |
181740d553cfSPaul Beesley               |----------|
181840d553cfSPaul Beesley               |HW_CONFIG |
181940d553cfSPaul Beesley    0x83000000 |----------|  (non-secure)
182040d553cfSPaul Beesley               |          |
182140d553cfSPaul Beesley    0x80000000 +----------+
182240d553cfSPaul Beesley
182340d553cfSPaul Beesley               Trusted SRAM
182440d553cfSPaul Beesley    0x04040000 +----------+  loaded by BL2  +----------------+
182540d553cfSPaul Beesley               | BL1 (rw) |  <<<<<<<<<<<<<  |                |
182640d553cfSPaul Beesley               |----------|  <<<<<<<<<<<<<  |  BL31 NOBITS   |
182740d553cfSPaul Beesley               |   BL2    |  <<<<<<<<<<<<<  |                |
182840d553cfSPaul Beesley               |----------|  <<<<<<<<<<<<<  |----------------|
182940d553cfSPaul Beesley               |          |  <<<<<<<<<<<<<  | BL31 PROGBITS  |
183040d553cfSPaul Beesley               |          |                 +----------------+
183140d553cfSPaul Beesley    0x04002000 +----------+
183240d553cfSPaul Beesley               |  CONFIG  |
183340d553cfSPaul Beesley    0x04001000 +----------+
183440d553cfSPaul Beesley               |  Shared  |
183540d553cfSPaul Beesley    0x04000000 +----------+
183640d553cfSPaul Beesley
183740d553cfSPaul Beesley               Trusted ROM
183840d553cfSPaul Beesley    0x04000000 +----------+
183940d553cfSPaul Beesley               | BL1 (ro) |
184040d553cfSPaul Beesley    0x00000000 +----------+
184140d553cfSPaul Beesley
184240d553cfSPaul Beesley**Juno with BL32 in Trusted SRAM :**
184340d553cfSPaul Beesley
184440d553cfSPaul Beesley::
184540d553cfSPaul Beesley
184640d553cfSPaul Beesley                  Flash0
184740d553cfSPaul Beesley    0x0C000000 +----------+
184840d553cfSPaul Beesley               :          :
184940d553cfSPaul Beesley    0x0BED0000 |----------|
185040d553cfSPaul Beesley               | BL1 (ro) |
185140d553cfSPaul Beesley    0x0BEC0000 |----------|
185240d553cfSPaul Beesley               :          :
185340d553cfSPaul Beesley    0x08000000 +----------+                  BL31 is loaded
185440d553cfSPaul Beesley                                             after SCP_BL2 has
185540d553cfSPaul Beesley               Trusted SRAM                  been sent to SCP
185640d553cfSPaul Beesley    0x04040000 +----------+  loaded by BL2  +----------------+
185740d553cfSPaul Beesley               | BL1 (rw) |  <<<<<<<<<<<<<  |                |
185840d553cfSPaul Beesley               |----------|  <<<<<<<<<<<<<  |  BL31 NOBITS   |
185940d553cfSPaul Beesley               |   BL2    |  <<<<<<<<<<<<<  |                |
186040d553cfSPaul Beesley               |----------|  <<<<<<<<<<<<<  |----------------|
186140d553cfSPaul Beesley               | SCP_BL2  |  <<<<<<<<<<<<<  | BL31 PROGBITS  |
186240d553cfSPaul Beesley               |----------|  <<<<<<<<<<<<<  |----------------|
186340d553cfSPaul Beesley               |          |  <<<<<<<<<<<<<  |     BL32       |
186440d553cfSPaul Beesley               |          |                 +----------------+
186540d553cfSPaul Beesley               |          |
186640d553cfSPaul Beesley    0x04001000 +----------+
186740d553cfSPaul Beesley               |   MHU    |
186840d553cfSPaul Beesley    0x04000000 +----------+
186940d553cfSPaul Beesley
187040d553cfSPaul Beesley**Juno with BL32 in TZC-secured DRAM :**
187140d553cfSPaul Beesley
187240d553cfSPaul Beesley::
187340d553cfSPaul Beesley
187440d553cfSPaul Beesley                   DRAM
187540d553cfSPaul Beesley    0xFFE00000 +----------+
187640d553cfSPaul Beesley               |  BL32    |  (secure)
187740d553cfSPaul Beesley    0xFF000000 |----------|
187840d553cfSPaul Beesley               |          |
187940d553cfSPaul Beesley               :          :  (non-secure)
188040d553cfSPaul Beesley               |          |
188140d553cfSPaul Beesley    0x80000000 +----------+
188240d553cfSPaul Beesley
188340d553cfSPaul Beesley                  Flash0
188440d553cfSPaul Beesley    0x0C000000 +----------+
188540d553cfSPaul Beesley               :          :
188640d553cfSPaul Beesley    0x0BED0000 |----------|
188740d553cfSPaul Beesley               | BL1 (ro) |
188840d553cfSPaul Beesley    0x0BEC0000 |----------|
188940d553cfSPaul Beesley               :          :
189040d553cfSPaul Beesley    0x08000000 +----------+                  BL31 is loaded
189140d553cfSPaul Beesley                                             after SCP_BL2 has
189240d553cfSPaul Beesley               Trusted SRAM                  been sent to SCP
189340d553cfSPaul Beesley    0x04040000 +----------+  loaded by BL2  +----------------+
189440d553cfSPaul Beesley               | BL1 (rw) |  <<<<<<<<<<<<<  |                |
189540d553cfSPaul Beesley               |----------|  <<<<<<<<<<<<<  |  BL31 NOBITS   |
189640d553cfSPaul Beesley               |   BL2    |  <<<<<<<<<<<<<  |                |
189740d553cfSPaul Beesley               |----------|  <<<<<<<<<<<<<  |----------------|
189840d553cfSPaul Beesley               | SCP_BL2  |  <<<<<<<<<<<<<  | BL31 PROGBITS  |
189940d553cfSPaul Beesley               |----------|                 +----------------+
190040d553cfSPaul Beesley    0x04001000 +----------+
190140d553cfSPaul Beesley               |   MHU    |
190240d553cfSPaul Beesley    0x04000000 +----------+
190340d553cfSPaul Beesley
190443f35ef5SPaul Beesley.. _firmware_design_fip:
190540d553cfSPaul Beesley
190640d553cfSPaul BeesleyFirmware Image Package (FIP)
190740d553cfSPaul Beesley----------------------------
190840d553cfSPaul Beesley
190940d553cfSPaul BeesleyUsing a Firmware Image Package (FIP) allows for packing bootloader images (and
191040d553cfSPaul Beesleypotentially other payloads) into a single archive that can be loaded by TF-A
191140d553cfSPaul Beesleyfrom non-volatile platform storage. A driver to load images from a FIP has
191240d553cfSPaul Beesleybeen added to the storage layer and allows a package to be read from supported
191340d553cfSPaul Beesleyplatform storage. A tool to create Firmware Image Packages is also provided
191440d553cfSPaul Beesleyand described below.
191540d553cfSPaul Beesley
191640d553cfSPaul BeesleyFirmware Image Package layout
191740d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
191840d553cfSPaul Beesley
191940d553cfSPaul BeesleyThe FIP layout consists of a table of contents (ToC) followed by payload data.
192040d553cfSPaul BeesleyThe ToC itself has a header followed by one or more table entries. The ToC is
192140d553cfSPaul Beesleyterminated by an end marker entry, and since the size of the ToC is 0 bytes,
192240d553cfSPaul Beesleythe offset equals the total size of the FIP file. All ToC entries describe some
192340d553cfSPaul Beesleypayload data that has been appended to the end of the binary package. With the
192440d553cfSPaul Beesleyinformation provided in the ToC entry the corresponding payload data can be
192540d553cfSPaul Beesleyretrieved.
192640d553cfSPaul Beesley
192740d553cfSPaul Beesley::
192840d553cfSPaul Beesley
192940d553cfSPaul Beesley    ------------------
193040d553cfSPaul Beesley    | ToC Header     |
193140d553cfSPaul Beesley    |----------------|
193240d553cfSPaul Beesley    | ToC Entry 0    |
193340d553cfSPaul Beesley    |----------------|
193440d553cfSPaul Beesley    | ToC Entry 1    |
193540d553cfSPaul Beesley    |----------------|
193640d553cfSPaul Beesley    | ToC End Marker |
193740d553cfSPaul Beesley    |----------------|
193840d553cfSPaul Beesley    |                |
193940d553cfSPaul Beesley    |     Data 0     |
194040d553cfSPaul Beesley    |                |
194140d553cfSPaul Beesley    |----------------|
194240d553cfSPaul Beesley    |                |
194340d553cfSPaul Beesley    |     Data 1     |
194440d553cfSPaul Beesley    |                |
194540d553cfSPaul Beesley    ------------------
194640d553cfSPaul Beesley
194740d553cfSPaul BeesleyThe ToC header and entry formats are described in the header file
194840d553cfSPaul Beesley``include/tools_share/firmware_image_package.h``. This file is used by both the
194940d553cfSPaul Beesleytool and TF-A.
195040d553cfSPaul Beesley
195140d553cfSPaul BeesleyThe ToC header has the following fields:
195240d553cfSPaul Beesley
195340d553cfSPaul Beesley::
195440d553cfSPaul Beesley
195540d553cfSPaul Beesley    `name`: The name of the ToC. This is currently used to validate the header.
195640d553cfSPaul Beesley    `serial_number`: A non-zero number provided by the creation tool
195740d553cfSPaul Beesley    `flags`: Flags associated with this data.
195840d553cfSPaul Beesley        Bits 0-31: Reserved
195940d553cfSPaul Beesley        Bits 32-47: Platform defined
196040d553cfSPaul Beesley        Bits 48-63: Reserved
196140d553cfSPaul Beesley
196240d553cfSPaul BeesleyA ToC entry has the following fields:
196340d553cfSPaul Beesley
196440d553cfSPaul Beesley::
196540d553cfSPaul Beesley
196640d553cfSPaul Beesley    `uuid`: All files are referred to by a pre-defined Universally Unique
196740d553cfSPaul Beesley        IDentifier [UUID] . The UUIDs are defined in
196840d553cfSPaul Beesley        `include/tools_share/firmware_image_package.h`. The platform translates
196940d553cfSPaul Beesley        the requested image name into the corresponding UUID when accessing the
197040d553cfSPaul Beesley        package.
197140d553cfSPaul Beesley    `offset_address`: The offset address at which the corresponding payload data
197240d553cfSPaul Beesley        can be found. The offset is calculated from the ToC base address.
197340d553cfSPaul Beesley    `size`: The size of the corresponding payload data in bytes.
197440d553cfSPaul Beesley    `flags`: Flags associated with this entry. None are yet defined.
197540d553cfSPaul Beesley
197640d553cfSPaul BeesleyFirmware Image Package creation tool
197740d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
197840d553cfSPaul Beesley
197940d553cfSPaul BeesleyThe FIP creation tool can be used to pack specified images into a binary
198040d553cfSPaul Beesleypackage that can be loaded by TF-A from platform storage. The tool currently
198140d553cfSPaul Beesleyonly supports packing bootloader images. Additional image definitions can be
198240d553cfSPaul Beesleyadded to the tool as required.
198340d553cfSPaul Beesley
198440d553cfSPaul BeesleyThe tool can be found in ``tools/fiptool``.
198540d553cfSPaul Beesley
198640d553cfSPaul BeesleyLoading from a Firmware Image Package (FIP)
198740d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
198840d553cfSPaul Beesley
198940d553cfSPaul BeesleyThe Firmware Image Package (FIP) driver can load images from a binary package on
199040d553cfSPaul Beesleynon-volatile platform storage. For the Arm development platforms, this is
199140d553cfSPaul Beesleycurrently NOR FLASH.
199240d553cfSPaul Beesley
199340d553cfSPaul BeesleyBootloader images are loaded according to the platform policy as specified by
199440d553cfSPaul Beesleythe function ``plat_get_image_source()``. For the Arm development platforms, this
199540d553cfSPaul Beesleymeans the platform will attempt to load images from a Firmware Image Package
199640d553cfSPaul Beesleylocated at the start of NOR FLASH0.
199740d553cfSPaul Beesley
199840d553cfSPaul BeesleyThe Arm development platforms' policy is to only allow loading of a known set of
199940d553cfSPaul Beesleyimages. The platform policy can be modified to allow additional images.
200040d553cfSPaul Beesley
200140d553cfSPaul BeesleyUse of coherent memory in TF-A
200240d553cfSPaul Beesley------------------------------
200340d553cfSPaul Beesley
200440d553cfSPaul BeesleyThere might be loss of coherency when physical memory with mismatched
200540d553cfSPaul Beesleyshareability, cacheability and memory attributes is accessed by multiple CPUs
200640d553cfSPaul Beesley(refer to section B2.9 of `Arm ARM`_ for more details). This possibility occurs
200740d553cfSPaul Beesleyin TF-A during power up/down sequences when coherency, MMU and caches are
200840d553cfSPaul Beesleyturned on/off incrementally.
200940d553cfSPaul Beesley
201040d553cfSPaul BeesleyTF-A defines coherent memory as a region of memory with Device nGnRE attributes
201140d553cfSPaul Beesleyin the translation tables. The translation granule size in TF-A is 4KB. This
201240d553cfSPaul Beesleyis the smallest possible size of the coherent memory region.
201340d553cfSPaul Beesley
201440d553cfSPaul BeesleyBy default, all data structures which are susceptible to accesses with
201540d553cfSPaul Beesleymismatched attributes from various CPUs are allocated in a coherent memory
201634760951SPaul Beesleyregion (refer to section 2.1 of :ref:`Porting Guide`). The coherent memory
201734760951SPaul Beesleyregion accesses are Outer Shareable, non-cacheable and they can be accessed with
201834760951SPaul Beesleythe Device nGnRE attributes when the MMU is turned on. Hence, at the expense of
201934760951SPaul Beesleyat least an extra page of memory, TF-A is able to work around coherency issues
202034760951SPaul Beesleydue to mismatched memory attributes.
202140d553cfSPaul Beesley
202240d553cfSPaul BeesleyThe alternative to the above approach is to allocate the susceptible data
202340d553cfSPaul Beesleystructures in Normal WriteBack WriteAllocate Inner shareable memory. This
202440d553cfSPaul Beesleyapproach requires the data structures to be designed so that it is possible to
202540d553cfSPaul Beesleywork around the issue of mismatched memory attributes by performing software
202640d553cfSPaul Beesleycache maintenance on them.
202740d553cfSPaul Beesley
202840d553cfSPaul BeesleyDisabling the use of coherent memory in TF-A
202940d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
203040d553cfSPaul Beesley
203140d553cfSPaul BeesleyIt might be desirable to avoid the cost of allocating coherent memory on
203240d553cfSPaul Beesleyplatforms which are memory constrained. TF-A enables inclusion of coherent
203340d553cfSPaul Beesleymemory in firmware images through the build flag ``USE_COHERENT_MEM``.
203440d553cfSPaul BeesleyThis flag is enabled by default. It can be disabled to choose the second
203540d553cfSPaul Beesleyapproach described above.
203640d553cfSPaul Beesley
203740d553cfSPaul BeesleyThe below sections analyze the data structures allocated in the coherent memory
203840d553cfSPaul Beesleyregion and the changes required to allocate them in normal memory.
203940d553cfSPaul Beesley
204040d553cfSPaul BeesleyCoherent memory usage in PSCI implementation
204140d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
204240d553cfSPaul Beesley
204340d553cfSPaul BeesleyThe ``psci_non_cpu_pd_nodes`` data structure stores the platform's power domain
204440d553cfSPaul Beesleytree information for state management of power domains. By default, this data
204540d553cfSPaul Beesleystructure is allocated in the coherent memory region in TF-A because it can be
204640d553cfSPaul Beesleyaccessed by multiple CPUs, either with caches enabled or disabled.
204740d553cfSPaul Beesley
204840d553cfSPaul Beesley.. code:: c
204940d553cfSPaul Beesley
205040d553cfSPaul Beesley    typedef struct non_cpu_pwr_domain_node {
205140d553cfSPaul Beesley        /*
205240d553cfSPaul Beesley         * Index of the first CPU power domain node level 0 which has this node
205340d553cfSPaul Beesley         * as its parent.
205440d553cfSPaul Beesley         */
205540d553cfSPaul Beesley        unsigned int cpu_start_idx;
205640d553cfSPaul Beesley
205740d553cfSPaul Beesley        /*
205840d553cfSPaul Beesley         * Number of CPU power domains which are siblings of the domain indexed
205940d553cfSPaul Beesley         * by 'cpu_start_idx' i.e. all the domains in the range 'cpu_start_idx
206040d553cfSPaul Beesley         * -> cpu_start_idx + ncpus' have this node as their parent.
206140d553cfSPaul Beesley         */
206240d553cfSPaul Beesley        unsigned int ncpus;
206340d553cfSPaul Beesley
206440d553cfSPaul Beesley        /*
206540d553cfSPaul Beesley         * Index of the parent power domain node.
206640d553cfSPaul Beesley         */
206740d553cfSPaul Beesley        unsigned int parent_node;
206840d553cfSPaul Beesley
206940d553cfSPaul Beesley        plat_local_state_t local_state;
207040d553cfSPaul Beesley
207140d553cfSPaul Beesley        unsigned char level;
207240d553cfSPaul Beesley
207340d553cfSPaul Beesley        /* For indexing the psci_lock array*/
207440d553cfSPaul Beesley        unsigned char lock_index;
207540d553cfSPaul Beesley    } non_cpu_pd_node_t;
207640d553cfSPaul Beesley
207740d553cfSPaul BeesleyIn order to move this data structure to normal memory, the use of each of its
207840d553cfSPaul Beesleyfields must be analyzed. Fields like ``cpu_start_idx``, ``ncpus``, ``parent_node``
207940d553cfSPaul Beesley``level`` and ``lock_index`` are only written once during cold boot. Hence removing
208040d553cfSPaul Beesleythem from coherent memory involves only doing a clean and invalidate of the
208140d553cfSPaul Beesleycache lines after these fields are written.
208240d553cfSPaul Beesley
208340d553cfSPaul BeesleyThe field ``local_state`` can be concurrently accessed by multiple CPUs in
208440d553cfSPaul Beesleydifferent cache states. A Lamport's Bakery lock ``psci_locks`` is used to ensure
208540d553cfSPaul Beesleymutual exclusion to this field and a clean and invalidate is needed after it
208640d553cfSPaul Beesleyis written.
208740d553cfSPaul Beesley
208840d553cfSPaul BeesleyBakery lock data
208940d553cfSPaul Beesley~~~~~~~~~~~~~~~~
209040d553cfSPaul Beesley
209140d553cfSPaul BeesleyThe bakery lock data structure ``bakery_lock_t`` is allocated in coherent memory
209240d553cfSPaul Beesleyand is accessed by multiple CPUs with mismatched attributes. ``bakery_lock_t`` is
209340d553cfSPaul Beesleydefined as follows:
209440d553cfSPaul Beesley
209540d553cfSPaul Beesley.. code:: c
209640d553cfSPaul Beesley
209740d553cfSPaul Beesley    typedef struct bakery_lock {
209840d553cfSPaul Beesley        /*
209940d553cfSPaul Beesley         * The lock_data is a bit-field of 2 members:
210040d553cfSPaul Beesley         * Bit[0]       : choosing. This field is set when the CPU is
210140d553cfSPaul Beesley         *                choosing its bakery number.
210240d553cfSPaul Beesley         * Bits[1 - 15] : number. This is the bakery number allocated.
210340d553cfSPaul Beesley         */
210440d553cfSPaul Beesley        volatile uint16_t lock_data[BAKERY_LOCK_MAX_CPUS];
210540d553cfSPaul Beesley    } bakery_lock_t;
210640d553cfSPaul Beesley
210740d553cfSPaul BeesleyIt is a characteristic of Lamport's Bakery algorithm that the volatile per-CPU
210840d553cfSPaul Beesleyfields can be read by all CPUs but only written to by the owning CPU.
210940d553cfSPaul Beesley
211040d553cfSPaul BeesleyDepending upon the data cache line size, the per-CPU fields of the
211140d553cfSPaul Beesley``bakery_lock_t`` structure for multiple CPUs may exist on a single cache line.
211240d553cfSPaul BeesleyThese per-CPU fields can be read and written during lock contention by multiple
211340d553cfSPaul BeesleyCPUs with mismatched memory attributes. Since these fields are a part of the
211440d553cfSPaul Beesleylock implementation, they do not have access to any other locking primitive to
211540d553cfSPaul Beesleysafeguard against the resulting coherency issues. As a result, simple software
211640d553cfSPaul Beesleycache maintenance is not enough to allocate them in coherent memory. Consider
211740d553cfSPaul Beesleythe following example.
211840d553cfSPaul Beesley
211940d553cfSPaul BeesleyCPU0 updates its per-CPU field with data cache enabled. This write updates a
212040d553cfSPaul Beesleylocal cache line which contains a copy of the fields for other CPUs as well. Now
212140d553cfSPaul BeesleyCPU1 updates its per-CPU field of the ``bakery_lock_t`` structure with data cache
212240d553cfSPaul Beesleydisabled. CPU1 then issues a DCIVAC operation to invalidate any stale copies of
212340d553cfSPaul Beesleyits field in any other cache line in the system. This operation will invalidate
212440d553cfSPaul Beesleythe update made by CPU0 as well.
212540d553cfSPaul Beesley
212640d553cfSPaul BeesleyTo use bakery locks when ``USE_COHERENT_MEM`` is disabled, the lock data structure
212740d553cfSPaul Beesleyhas been redesigned. The changes utilise the characteristic of Lamport's Bakery
212840d553cfSPaul Beesleyalgorithm mentioned earlier. The bakery_lock structure only allocates the memory
212940d553cfSPaul Beesleyfor a single CPU. The macro ``DEFINE_BAKERY_LOCK`` allocates all the bakery locks
213040d553cfSPaul Beesleyneeded for a CPU into a section ``bakery_lock``. The linker allocates the memory
213140d553cfSPaul Beesleyfor other cores by using the total size allocated for the bakery_lock section
213240d553cfSPaul Beesleyand multiplying it with (PLATFORM_CORE_COUNT - 1). This enables software to
213340d553cfSPaul Beesleyperform software cache maintenance on the lock data structure without running
213440d553cfSPaul Beesleyinto coherency issues associated with mismatched attributes.
213540d553cfSPaul Beesley
213640d553cfSPaul BeesleyThe bakery lock data structure ``bakery_info_t`` is defined for use when
213740d553cfSPaul Beesley``USE_COHERENT_MEM`` is disabled as follows:
213840d553cfSPaul Beesley
213940d553cfSPaul Beesley.. code:: c
214040d553cfSPaul Beesley
214140d553cfSPaul Beesley    typedef struct bakery_info {
214240d553cfSPaul Beesley        /*
214340d553cfSPaul Beesley         * The lock_data is a bit-field of 2 members:
214440d553cfSPaul Beesley         * Bit[0]       : choosing. This field is set when the CPU is
214540d553cfSPaul Beesley         *                choosing its bakery number.
214640d553cfSPaul Beesley         * Bits[1 - 15] : number. This is the bakery number allocated.
214740d553cfSPaul Beesley         */
214840d553cfSPaul Beesley         volatile uint16_t lock_data;
214940d553cfSPaul Beesley    } bakery_info_t;
215040d553cfSPaul Beesley
215140d553cfSPaul BeesleyThe ``bakery_info_t`` represents a single per-CPU field of one lock and
215240d553cfSPaul Beesleythe combination of corresponding ``bakery_info_t`` structures for all CPUs in the
215340d553cfSPaul Beesleysystem represents the complete bakery lock. The view in memory for a system
215440d553cfSPaul Beesleywith n bakery locks are:
215540d553cfSPaul Beesley
215640d553cfSPaul Beesley::
215740d553cfSPaul Beesley
215840d553cfSPaul Beesley    bakery_lock section start
215940d553cfSPaul Beesley    |----------------|
216040d553cfSPaul Beesley    | `bakery_info_t`| <-- Lock_0 per-CPU field
216140d553cfSPaul Beesley    |    Lock_0      |     for CPU0
216240d553cfSPaul Beesley    |----------------|
216340d553cfSPaul Beesley    | `bakery_info_t`| <-- Lock_1 per-CPU field
216440d553cfSPaul Beesley    |    Lock_1      |     for CPU0
216540d553cfSPaul Beesley    |----------------|
216640d553cfSPaul Beesley    | ....           |
216740d553cfSPaul Beesley    |----------------|
216840d553cfSPaul Beesley    | `bakery_info_t`| <-- Lock_N per-CPU field
216940d553cfSPaul Beesley    |    Lock_N      |     for CPU0
217040d553cfSPaul Beesley    ------------------
217140d553cfSPaul Beesley    |    XXXXX       |
217240d553cfSPaul Beesley    | Padding to     |
217340d553cfSPaul Beesley    | next Cache WB  | <--- Calculate PERCPU_BAKERY_LOCK_SIZE, allocate
217440d553cfSPaul Beesley    |  Granule       |       continuous memory for remaining CPUs.
217540d553cfSPaul Beesley    ------------------
217640d553cfSPaul Beesley    | `bakery_info_t`| <-- Lock_0 per-CPU field
217740d553cfSPaul Beesley    |    Lock_0      |     for CPU1
217840d553cfSPaul Beesley    |----------------|
217940d553cfSPaul Beesley    | `bakery_info_t`| <-- Lock_1 per-CPU field
218040d553cfSPaul Beesley    |    Lock_1      |     for CPU1
218140d553cfSPaul Beesley    |----------------|
218240d553cfSPaul Beesley    | ....           |
218340d553cfSPaul Beesley    |----------------|
218440d553cfSPaul Beesley    | `bakery_info_t`| <-- Lock_N per-CPU field
218540d553cfSPaul Beesley    |    Lock_N      |     for CPU1
218640d553cfSPaul Beesley    ------------------
218740d553cfSPaul Beesley    |    XXXXX       |
218840d553cfSPaul Beesley    | Padding to     |
218940d553cfSPaul Beesley    | next Cache WB  |
219040d553cfSPaul Beesley    |  Granule       |
219140d553cfSPaul Beesley    ------------------
219240d553cfSPaul Beesley
219340d553cfSPaul BeesleyConsider a system of 2 CPUs with 'N' bakery locks as shown above. For an
219440d553cfSPaul Beesleyoperation on Lock_N, the corresponding ``bakery_info_t`` in both CPU0 and CPU1
219540d553cfSPaul Beesley``bakery_lock`` section need to be fetched and appropriate cache operations need
219640d553cfSPaul Beesleyto be performed for each access.
219740d553cfSPaul Beesley
219840d553cfSPaul BeesleyOn Arm Platforms, bakery locks are used in psci (``psci_locks``) and power controller
219940d553cfSPaul Beesleydriver (``arm_lock``).
220040d553cfSPaul Beesley
220140d553cfSPaul BeesleyNon Functional Impact of removing coherent memory
220240d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
220340d553cfSPaul Beesley
220440d553cfSPaul BeesleyRemoval of the coherent memory region leads to the additional software overhead
220540d553cfSPaul Beesleyof performing cache maintenance for the affected data structures. However, since
220640d553cfSPaul Beesleythe memory where the data structures are allocated is cacheable, the overhead is
220740d553cfSPaul Beesleymostly mitigated by an increase in performance.
220840d553cfSPaul Beesley
220940d553cfSPaul BeesleyThere is however a performance impact for bakery locks, due to:
221040d553cfSPaul Beesley
221140d553cfSPaul Beesley-  Additional cache maintenance operations, and
221240d553cfSPaul Beesley-  Multiple cache line reads for each lock operation, since the bakery locks
221340d553cfSPaul Beesley   for each CPU are distributed across different cache lines.
221440d553cfSPaul Beesley
221540d553cfSPaul BeesleyThe implementation has been optimized to minimize this additional overhead.
221640d553cfSPaul BeesleyMeasurements indicate that when bakery locks are allocated in Normal memory, the
221740d553cfSPaul Beesleyminimum latency of acquiring a lock is on an average 3-4 micro seconds whereas
221840d553cfSPaul Beesleyin Device memory the same is 2 micro seconds. The measurements were done on the
221940d553cfSPaul BeesleyJuno Arm development platform.
222040d553cfSPaul Beesley
222140d553cfSPaul BeesleyAs mentioned earlier, almost a page of memory can be saved by disabling
222240d553cfSPaul Beesley``USE_COHERENT_MEM``. Each platform needs to consider these trade-offs to decide
222340d553cfSPaul Beesleywhether coherent memory should be used. If a platform disables
222440d553cfSPaul Beesley``USE_COHERENT_MEM`` and needs to use bakery locks in the porting layer, it can
222540d553cfSPaul Beesleyoptionally define macro ``PLAT_PERCPU_BAKERY_LOCK_SIZE`` (see the
222634760951SPaul Beesley:ref:`Porting Guide`). Refer to the reference platform code for examples.
222740d553cfSPaul Beesley
222840d553cfSPaul BeesleyIsolating code and read-only data on separate memory pages
222940d553cfSPaul Beesley----------------------------------------------------------
223040d553cfSPaul Beesley
223140d553cfSPaul BeesleyIn the Armv8-A VMSA, translation table entries include fields that define the
223240d553cfSPaul Beesleyproperties of the target memory region, such as its access permissions. The
223340d553cfSPaul Beesleysmallest unit of memory that can be addressed by a translation table entry is
223440d553cfSPaul Beesleya memory page. Therefore, if software needs to set different permissions on two
223540d553cfSPaul Beesleymemory regions then it needs to map them using different memory pages.
223640d553cfSPaul Beesley
223740d553cfSPaul BeesleyThe default memory layout for each BL image is as follows:
223840d553cfSPaul Beesley
223940d553cfSPaul Beesley::
224040d553cfSPaul Beesley
224140d553cfSPaul Beesley       |        ...        |
224240d553cfSPaul Beesley       +-------------------+
224340d553cfSPaul Beesley       |  Read-write data  |
224440d553cfSPaul Beesley       +-------------------+ Page boundary
224540d553cfSPaul Beesley       |     <Padding>     |
224640d553cfSPaul Beesley       +-------------------+
224740d553cfSPaul Beesley       | Exception vectors |
224840d553cfSPaul Beesley       +-------------------+ 2 KB boundary
224940d553cfSPaul Beesley       |     <Padding>     |
225040d553cfSPaul Beesley       +-------------------+
225140d553cfSPaul Beesley       |  Read-only data   |
225240d553cfSPaul Beesley       +-------------------+
225340d553cfSPaul Beesley       |       Code        |
225440d553cfSPaul Beesley       +-------------------+ BLx_BASE
225540d553cfSPaul Beesley
2256e1c5026aSPaul Beesley.. note::
2257e1c5026aSPaul Beesley   The 2KB alignment for the exception vectors is an architectural
225840d553cfSPaul Beesley   requirement.
225940d553cfSPaul Beesley
226040d553cfSPaul BeesleyThe read-write data start on a new memory page so that they can be mapped with
226140d553cfSPaul Beesleyread-write permissions, whereas the code and read-only data below are configured
226240d553cfSPaul Beesleyas read-only.
226340d553cfSPaul Beesley
226440d553cfSPaul BeesleyHowever, the read-only data are not aligned on a page boundary. They are
226540d553cfSPaul Beesleycontiguous to the code. Therefore, the end of the code section and the beginning
226640d553cfSPaul Beesleyof the read-only data one might share a memory page. This forces both to be
226740d553cfSPaul Beesleymapped with the same memory attributes. As the code needs to be executable, this
226840d553cfSPaul Beesleymeans that the read-only data stored on the same memory page as the code are
226940d553cfSPaul Beesleyexecutable as well. This could potentially be exploited as part of a security
227040d553cfSPaul Beesleyattack.
227140d553cfSPaul Beesley
227240d553cfSPaul BeesleyTF provides the build flag ``SEPARATE_CODE_AND_RODATA`` to isolate the code and
227340d553cfSPaul Beesleyread-only data on separate memory pages. This in turn allows independent control
227440d553cfSPaul Beesleyof the access permissions for the code and read-only data. In this case,
227540d553cfSPaul Beesleyplatform code gets a finer-grained view of the image layout and can
227640d553cfSPaul Beesleyappropriately map the code region as executable and the read-only data as
227740d553cfSPaul Beesleyexecute-never.
227840d553cfSPaul Beesley
227940d553cfSPaul BeesleyThis has an impact on memory footprint, as padding bytes need to be introduced
228040d553cfSPaul Beesleybetween the code and read-only data to ensure the segregation of the two. To
228140d553cfSPaul Beesleylimit the memory cost, this flag also changes the memory layout such that the
228240d553cfSPaul Beesleycode and exception vectors are now contiguous, like so:
228340d553cfSPaul Beesley
228440d553cfSPaul Beesley::
228540d553cfSPaul Beesley
228640d553cfSPaul Beesley       |        ...        |
228740d553cfSPaul Beesley       +-------------------+
228840d553cfSPaul Beesley       |  Read-write data  |
228940d553cfSPaul Beesley       +-------------------+ Page boundary
229040d553cfSPaul Beesley       |     <Padding>     |
229140d553cfSPaul Beesley       +-------------------+
229240d553cfSPaul Beesley       |  Read-only data   |
229340d553cfSPaul Beesley       +-------------------+ Page boundary
229440d553cfSPaul Beesley       |     <Padding>     |
229540d553cfSPaul Beesley       +-------------------+
229640d553cfSPaul Beesley       | Exception vectors |
229740d553cfSPaul Beesley       +-------------------+ 2 KB boundary
229840d553cfSPaul Beesley       |     <Padding>     |
229940d553cfSPaul Beesley       +-------------------+
230040d553cfSPaul Beesley       |       Code        |
230140d553cfSPaul Beesley       +-------------------+ BLx_BASE
230240d553cfSPaul Beesley
230340d553cfSPaul BeesleyWith this more condensed memory layout, the separation of read-only data will
230440d553cfSPaul Beesleyadd zero or one page to the memory footprint of each BL image. Each platform
230540d553cfSPaul Beesleyshould consider the trade-off between memory footprint and security.
230640d553cfSPaul Beesley
230740d553cfSPaul BeesleyThis build flag is disabled by default, minimising memory footprint. On Arm
230840d553cfSPaul Beesleyplatforms, it is enabled.
230940d553cfSPaul Beesley
231040d553cfSPaul BeesleyPublish and Subscribe Framework
231140d553cfSPaul Beesley-------------------------------
231240d553cfSPaul Beesley
231340d553cfSPaul BeesleyThe Publish and Subscribe Framework allows EL3 components to define and publish
231440d553cfSPaul Beesleyevents, to which other EL3 components can subscribe.
231540d553cfSPaul Beesley
231640d553cfSPaul BeesleyThe following macros are provided by the framework:
231740d553cfSPaul Beesley
231840d553cfSPaul Beesley-  ``REGISTER_PUBSUB_EVENT(event)``: Defines an event, and takes one argument,
231940d553cfSPaul Beesley   the event name, which must be a valid C identifier. All calls to
232040d553cfSPaul Beesley   ``REGISTER_PUBSUB_EVENT`` macro must be placed in the file
232140d553cfSPaul Beesley   ``pubsub_events.h``.
232240d553cfSPaul Beesley
232340d553cfSPaul Beesley-  ``PUBLISH_EVENT_ARG(event, arg)``: Publishes a defined event, by iterating
232440d553cfSPaul Beesley   subscribed handlers and calling them in turn. The handlers will be passed the
232540d553cfSPaul Beesley   parameter ``arg``. The expected use-case is to broadcast an event.
232640d553cfSPaul Beesley
232740d553cfSPaul Beesley-  ``PUBLISH_EVENT(event)``: Like ``PUBLISH_EVENT_ARG``, except that the value
232840d553cfSPaul Beesley   ``NULL`` is passed to subscribed handlers.
232940d553cfSPaul Beesley
233040d553cfSPaul Beesley-  ``SUBSCRIBE_TO_EVENT(event, handler)``: Registers the ``handler`` to
233140d553cfSPaul Beesley   subscribe to ``event``. The handler will be executed whenever the ``event``
233240d553cfSPaul Beesley   is published.
233340d553cfSPaul Beesley
233440d553cfSPaul Beesley-  ``for_each_subscriber(event, subscriber)``: Iterates through all handlers
233540d553cfSPaul Beesley   subscribed for ``event``. ``subscriber`` must be a local variable of type
233640d553cfSPaul Beesley   ``pubsub_cb_t *``, and will point to each subscribed handler in turn during
233740d553cfSPaul Beesley   iteration. This macro can be used for those patterns that none of the
233840d553cfSPaul Beesley   ``PUBLISH_EVENT_*()`` macros cover.
233940d553cfSPaul Beesley
234040d553cfSPaul BeesleyPublishing an event that wasn't defined using ``REGISTER_PUBSUB_EVENT`` will
234140d553cfSPaul Beesleyresult in build error. Subscribing to an undefined event however won't.
234240d553cfSPaul Beesley
234340d553cfSPaul BeesleySubscribed handlers must be of type ``pubsub_cb_t``, with following function
234440d553cfSPaul Beesleysignature:
234540d553cfSPaul Beesley
234629c02529SPaul Beesley.. code:: c
234740d553cfSPaul Beesley
234840d553cfSPaul Beesley   typedef void* (*pubsub_cb_t)(const void *arg);
234940d553cfSPaul Beesley
235040d553cfSPaul BeesleyThere may be arbitrary number of handlers registered to the same event. The
235140d553cfSPaul Beesleyorder in which subscribed handlers are notified when that event is published is
235240d553cfSPaul Beesleynot defined. Subscribed handlers may be executed in any order; handlers should
235340d553cfSPaul Beesleynot assume any relative ordering amongst them.
235440d553cfSPaul Beesley
235540d553cfSPaul BeesleyPublishing an event on a PE will result in subscribed handlers executing on that
235640d553cfSPaul BeesleyPE only; it won't cause handlers to execute on a different PE.
235740d553cfSPaul Beesley
235840d553cfSPaul BeesleyNote that publishing an event on a PE blocks until all the subscribed handlers
235940d553cfSPaul Beesleyfinish executing on the PE.
236040d553cfSPaul Beesley
236140d553cfSPaul BeesleyTF-A generic code publishes and subscribes to some events within. Platform
236240d553cfSPaul Beesleyports are discouraged from subscribing to them. These events may be withdrawn,
236340d553cfSPaul Beesleyrenamed, or have their semantics altered in the future. Platforms may however
236440d553cfSPaul Beesleyregister, publish, and subscribe to platform-specific events.
236540d553cfSPaul Beesley
236640d553cfSPaul BeesleyPublish and Subscribe Example
236740d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
236840d553cfSPaul Beesley
236940d553cfSPaul BeesleyA publisher that wants to publish event ``foo`` would:
237040d553cfSPaul Beesley
237140d553cfSPaul Beesley-  Define the event ``foo`` in the ``pubsub_events.h``.
237240d553cfSPaul Beesley
237329c02529SPaul Beesley   .. code:: c
237440d553cfSPaul Beesley
237540d553cfSPaul Beesley      REGISTER_PUBSUB_EVENT(foo);
237640d553cfSPaul Beesley
237740d553cfSPaul Beesley-  Depending on the nature of event, use one of ``PUBLISH_EVENT_*()`` macros to
237840d553cfSPaul Beesley   publish the event at the appropriate path and time of execution.
237940d553cfSPaul Beesley
238040d553cfSPaul BeesleyA subscriber that wants to subscribe to event ``foo`` published above would
238140d553cfSPaul Beesleyimplement:
238240d553cfSPaul Beesley
238340d553cfSPaul Beesley.. code:: c
238440d553cfSPaul Beesley
238540d553cfSPaul Beesley    void *foo_handler(const void *arg)
238640d553cfSPaul Beesley    {
238740d553cfSPaul Beesley         void *result;
238840d553cfSPaul Beesley
238940d553cfSPaul Beesley         /* Do handling ... */
239040d553cfSPaul Beesley
239140d553cfSPaul Beesley         return result;
239240d553cfSPaul Beesley    }
239340d553cfSPaul Beesley
239440d553cfSPaul Beesley    SUBSCRIBE_TO_EVENT(foo, foo_handler);
239540d553cfSPaul Beesley
239640d553cfSPaul Beesley
239740d553cfSPaul BeesleyReclaiming the BL31 initialization code
239840d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
239940d553cfSPaul Beesley
240040d553cfSPaul BeesleyA significant amount of the code used for the initialization of BL31 is never
240140d553cfSPaul Beesleyneeded again after boot time. In order to reduce the runtime memory
240240d553cfSPaul Beesleyfootprint, the memory used for this code can be reclaimed after initialization
240340d553cfSPaul Beesleyhas finished and be used for runtime data.
240440d553cfSPaul Beesley
240540d553cfSPaul BeesleyThe build option ``RECLAIM_INIT_CODE`` can be set to mark this boot time code
240640d553cfSPaul Beesleywith a ``.text.init.*`` attribute which can be filtered and placed suitably
240740d553cfSPaul Beesleywithin the BL image for later reclamation by the platform. The platform can
240840d553cfSPaul Beesleyspecify the filter and the memory region for this init section in BL31 via the
240940d553cfSPaul Beesleyplat.ld.S linker script. For example, on the FVP, this section is placed
241040d553cfSPaul Beesleyoverlapping the secondary CPU stacks so that after the cold boot is done, this
241140d553cfSPaul Beesleymemory can be reclaimed for the stacks. The init memory section is initially
241240d553cfSPaul Beesleymapped with ``RO``, ``EXECUTE`` attributes. After BL31 initialization has
241340d553cfSPaul Beesleycompleted, the FVP changes the attributes of this section to ``RW``,
241440d553cfSPaul Beesley``EXECUTE_NEVER`` allowing it to be used for runtime data. The memory attributes
241540d553cfSPaul Beesleyare changed within the ``bl31_plat_runtime_setup`` platform hook. The init
241640d553cfSPaul Beesleysection section can be reclaimed for any data which is accessed after cold
241740d553cfSPaul Beesleyboot initialization and it is upto the platform to make the decision.
241840d553cfSPaul Beesley
241934760951SPaul Beesley.. _firmware_design_pmf:
242034760951SPaul Beesley
242140d553cfSPaul BeesleyPerformance Measurement Framework
242240d553cfSPaul Beesley---------------------------------
242340d553cfSPaul Beesley
242440d553cfSPaul BeesleyThe Performance Measurement Framework (PMF) facilitates collection of
242540d553cfSPaul Beesleytimestamps by registered services and provides interfaces to retrieve them
242640d553cfSPaul Beesleyfrom within TF-A. A platform can choose to expose appropriate SMCs to
242740d553cfSPaul Beesleyretrieve these collected timestamps.
242840d553cfSPaul Beesley
242940d553cfSPaul BeesleyBy default, the global physical counter is used for the timestamp
243040d553cfSPaul Beesleyvalue and is read via ``CNTPCT_EL0``. The framework allows to retrieve
243140d553cfSPaul Beesleytimestamps captured by other CPUs.
243240d553cfSPaul Beesley
243340d553cfSPaul BeesleyTimestamp identifier format
243440d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~~~
243540d553cfSPaul Beesley
243640d553cfSPaul BeesleyA PMF timestamp is uniquely identified across the system via the
243740d553cfSPaul Beesleytimestamp ID or ``tid``. The ``tid`` is composed as follows:
243840d553cfSPaul Beesley
243940d553cfSPaul Beesley::
244040d553cfSPaul Beesley
244140d553cfSPaul Beesley    Bits 0-7: The local timestamp identifier.
244240d553cfSPaul Beesley    Bits 8-9: Reserved.
244340d553cfSPaul Beesley    Bits 10-15: The service identifier.
244440d553cfSPaul Beesley    Bits 16-31: Reserved.
244540d553cfSPaul Beesley
244640d553cfSPaul Beesley#. The service identifier. Each PMF service is identified by a
244740d553cfSPaul Beesley   service name and a service identifier. Both the service name and
244840d553cfSPaul Beesley   identifier are unique within the system as a whole.
244940d553cfSPaul Beesley
245040d553cfSPaul Beesley#. The local timestamp identifier. This identifier is unique within a given
245140d553cfSPaul Beesley   service.
245240d553cfSPaul Beesley
245340d553cfSPaul BeesleyRegistering a PMF service
245440d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~~~~
245540d553cfSPaul Beesley
245640d553cfSPaul BeesleyTo register a PMF service, the ``PMF_REGISTER_SERVICE()`` macro from ``pmf.h``
245740d553cfSPaul Beesleyis used. The arguments required are the service name, the service ID,
245840d553cfSPaul Beesleythe total number of local timestamps to be captured and a set of flags.
245940d553cfSPaul Beesley
246040d553cfSPaul BeesleyThe ``flags`` field can be specified as a bitwise-OR of the following values:
246140d553cfSPaul Beesley
246240d553cfSPaul Beesley::
246340d553cfSPaul Beesley
246440d553cfSPaul Beesley    PMF_STORE_ENABLE: The timestamp is stored in memory for later retrieval.
246540d553cfSPaul Beesley    PMF_DUMP_ENABLE: The timestamp is dumped on the serial console.
246640d553cfSPaul Beesley
246740d553cfSPaul BeesleyThe ``PMF_REGISTER_SERVICE()`` reserves memory to store captured
246840d553cfSPaul Beesleytimestamps in a PMF specific linker section at build time.
246940d553cfSPaul BeesleyAdditionally, it defines necessary functions to capture and
247040d553cfSPaul Beesleyretrieve a particular timestamp for the given service at runtime.
247140d553cfSPaul Beesley
247240d553cfSPaul BeesleyThe macro ``PMF_REGISTER_SERVICE()`` only enables capturing PMF timestamps
247340d553cfSPaul Beesleyfrom within TF-A. In order to retrieve timestamps from outside of TF-A, the
247440d553cfSPaul Beesley``PMF_REGISTER_SERVICE_SMC()`` macro must be used instead. This macro
247540d553cfSPaul Beesleyaccepts the same set of arguments as the ``PMF_REGISTER_SERVICE()``
247640d553cfSPaul Beesleymacro but additionally supports retrieving timestamps using SMCs.
247740d553cfSPaul Beesley
247840d553cfSPaul BeesleyCapturing a timestamp
247940d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~
248040d553cfSPaul Beesley
248140d553cfSPaul BeesleyPMF timestamps are stored in a per-service timestamp region. On a
248240d553cfSPaul Beesleysystem with multiple CPUs, each timestamp is captured and stored
248340d553cfSPaul Beesleyin a per-CPU cache line aligned memory region.
248440d553cfSPaul Beesley
248540d553cfSPaul BeesleyHaving registered the service, the ``PMF_CAPTURE_TIMESTAMP()`` macro can be
248640d553cfSPaul Beesleyused to capture a timestamp at the location where it is used. The macro
248740d553cfSPaul Beesleytakes the service name, a local timestamp identifier and a flag as arguments.
248840d553cfSPaul Beesley
248940d553cfSPaul BeesleyThe ``flags`` field argument can be zero, or ``PMF_CACHE_MAINT`` which
249040d553cfSPaul Beesleyinstructs PMF to do cache maintenance following the capture. Cache
249140d553cfSPaul Beesleymaintenance is required if any of the service's timestamps are captured
249240d553cfSPaul Beesleywith data cache disabled.
249340d553cfSPaul Beesley
249440d553cfSPaul BeesleyTo capture a timestamp in assembly code, the caller should use
249540d553cfSPaul Beesley``pmf_calc_timestamp_addr`` macro (defined in ``pmf_asm_macros.S``) to
249640d553cfSPaul Beesleycalculate the address of where the timestamp would be stored. The
249740d553cfSPaul Beesleycaller should then read ``CNTPCT_EL0`` register to obtain the timestamp
249840d553cfSPaul Beesleyand store it at the determined address for later retrieval.
249940d553cfSPaul Beesley
250040d553cfSPaul BeesleyRetrieving a timestamp
250140d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~~~~~
250240d553cfSPaul Beesley
250340d553cfSPaul BeesleyFrom within TF-A, timestamps for individual CPUs can be retrieved using either
250440d553cfSPaul Beesley``PMF_GET_TIMESTAMP_BY_MPIDR()`` or ``PMF_GET_TIMESTAMP_BY_INDEX()`` macros.
250540d553cfSPaul BeesleyThese macros accept the CPU's MPIDR value, or its ordinal position
250640d553cfSPaul Beesleyrespectively.
250740d553cfSPaul Beesley
250840d553cfSPaul BeesleyFrom outside TF-A, timestamps for individual CPUs can be retrieved by calling
250940d553cfSPaul Beesleyinto ``pmf_smc_handler()``.
251040d553cfSPaul Beesley
251129c02529SPaul Beesley::
251240d553cfSPaul Beesley
251340d553cfSPaul Beesley    Interface : pmf_smc_handler()
251440d553cfSPaul Beesley    Argument  : unsigned int smc_fid, u_register_t x1,
251540d553cfSPaul Beesley                u_register_t x2, u_register_t x3,
251640d553cfSPaul Beesley                u_register_t x4, void *cookie,
251740d553cfSPaul Beesley                void *handle, u_register_t flags
251840d553cfSPaul Beesley    Return    : uintptr_t
251940d553cfSPaul Beesley
252040d553cfSPaul Beesley    smc_fid: Holds the SMC identifier which is either `PMF_SMC_GET_TIMESTAMP_32`
252140d553cfSPaul Beesley        when the caller of the SMC is running in AArch32 mode
252240d553cfSPaul Beesley        or `PMF_SMC_GET_TIMESTAMP_64` when the caller is running in AArch64 mode.
252340d553cfSPaul Beesley    x1: Timestamp identifier.
252440d553cfSPaul Beesley    x2: The `mpidr` of the CPU for which the timestamp has to be retrieved.
252540d553cfSPaul Beesley        This can be the `mpidr` of a different core to the one initiating
252640d553cfSPaul Beesley        the SMC.  In that case, service specific cache maintenance may be
252740d553cfSPaul Beesley        required to ensure the updated copy of the timestamp is returned.
252840d553cfSPaul Beesley    x3: A flags value that is either 0 or `PMF_CACHE_MAINT`.  If
252940d553cfSPaul Beesley        `PMF_CACHE_MAINT` is passed, then the PMF code will perform a
253040d553cfSPaul Beesley        cache invalidate before reading the timestamp.  This ensures
253140d553cfSPaul Beesley        an updated copy is returned.
253240d553cfSPaul Beesley
253340d553cfSPaul BeesleyThe remaining arguments, ``x4``, ``cookie``, ``handle`` and ``flags`` are unused
253440d553cfSPaul Beesleyin this implementation.
253540d553cfSPaul Beesley
253640d553cfSPaul BeesleyPMF code structure
253740d553cfSPaul Beesley~~~~~~~~~~~~~~~~~~
253840d553cfSPaul Beesley
253940d553cfSPaul Beesley#. ``pmf_main.c`` consists of core functions that implement service registration,
254040d553cfSPaul Beesley   initialization, storing, dumping and retrieving timestamps.
254140d553cfSPaul Beesley
254240d553cfSPaul Beesley#. ``pmf_smc.c`` contains the SMC handling for registered PMF services.
254340d553cfSPaul Beesley
254440d553cfSPaul Beesley#. ``pmf.h`` contains the public interface to Performance Measurement Framework.
254540d553cfSPaul Beesley
254640d553cfSPaul Beesley#. ``pmf_asm_macros.S`` consists of macros to facilitate capturing timestamps in
254740d553cfSPaul Beesley   assembly code.
254840d553cfSPaul Beesley
254940d553cfSPaul Beesley#. ``pmf_helpers.h`` is an internal header used by ``pmf.h``.
255040d553cfSPaul Beesley
255140d553cfSPaul BeesleyArmv8-A Architecture Extensions
255240d553cfSPaul Beesley-------------------------------
255340d553cfSPaul Beesley
255440d553cfSPaul BeesleyTF-A makes use of Armv8-A Architecture Extensions where applicable. This
255540d553cfSPaul Beesleysection lists the usage of Architecture Extensions, and build flags
255640d553cfSPaul Beesleycontrolling them.
255740d553cfSPaul Beesley
255840d553cfSPaul BeesleyIn general, and unless individually mentioned, the build options
255940d553cfSPaul Beesley``ARM_ARCH_MAJOR`` and ``ARM_ARCH_MINOR`` select the Architecture Extension to
256040d553cfSPaul Beesleytarget when building TF-A. Subsequent Arm Architecture Extensions are backward
256140d553cfSPaul Beesleycompatible with previous versions.
256240d553cfSPaul Beesley
256340d553cfSPaul BeesleyThe build system only requires that ``ARM_ARCH_MAJOR`` and ``ARM_ARCH_MINOR`` have a
256440d553cfSPaul Beesleyvalid numeric value. These build options only control whether or not
256540d553cfSPaul BeesleyArchitecture Extension-specific code is included in the build. Otherwise, TF-A
256640d553cfSPaul Beesleytargets the base Armv8.0-A architecture; i.e. as if ``ARM_ARCH_MAJOR`` == 8
256740d553cfSPaul Beesleyand ``ARM_ARCH_MINOR`` == 0, which are also their respective default values.
256840d553cfSPaul Beesley
256943f35ef5SPaul Beesley.. seealso:: :ref:`Build Options`
257040d553cfSPaul Beesley
257140d553cfSPaul BeesleyFor details on the Architecture Extension and available features, please refer
257240d553cfSPaul Beesleyto the respective Architecture Extension Supplement.
257340d553cfSPaul Beesley
257440d553cfSPaul BeesleyArmv8.1-A
257540d553cfSPaul Beesley~~~~~~~~~
257640d553cfSPaul Beesley
257740d553cfSPaul BeesleyThis Architecture Extension is targeted when ``ARM_ARCH_MAJOR`` >= 8, or when
257840d553cfSPaul Beesley``ARM_ARCH_MAJOR`` == 8 and ``ARM_ARCH_MINOR`` >= 1.
257940d553cfSPaul Beesley
2580c97cba4eSSoby Mathew-  By default, a load-/store-exclusive instruction pair is used to implement
2581c97cba4eSSoby Mathew   spinlocks. The ``USE_SPINLOCK_CAS`` build option when set to 1 selects the
2582c97cba4eSSoby Mathew   spinlock implementation using the ARMv8.1-LSE Compare and Swap instruction.
2583c97cba4eSSoby Mathew   Notice this instruction is only available in AArch64 execution state, so
2584c97cba4eSSoby Mathew   the option is only available to AArch64 builds.
258540d553cfSPaul Beesley
258640d553cfSPaul BeesleyArmv8.2-A
258740d553cfSPaul Beesley~~~~~~~~~
258840d553cfSPaul Beesley
258940d553cfSPaul Beesley-  The presence of ARMv8.2-TTCNP is detected at runtime. When it is present, the
259040d553cfSPaul Beesley   Common not Private (TTBRn_ELx.CnP) bit is enabled to indicate that multiple
259140d553cfSPaul Beesley   Processing Elements in the same Inner Shareable domain use the same
259240d553cfSPaul Beesley   translation table entries for a given stage of translation for a particular
259340d553cfSPaul Beesley   translation regime.
259440d553cfSPaul Beesley
259540d553cfSPaul BeesleyArmv8.3-A
259640d553cfSPaul Beesley~~~~~~~~~
259740d553cfSPaul Beesley
259840d553cfSPaul Beesley-  Pointer authentication features of Armv8.3-A are unconditionally enabled in
259940d553cfSPaul Beesley   the Non-secure world so that lower ELs are allowed to use them without
260040d553cfSPaul Beesley   causing a trap to EL3.
260140d553cfSPaul Beesley
260240d553cfSPaul Beesley   In order to enable the Secure world to use it, ``CTX_INCLUDE_PAUTH_REGS``
260340d553cfSPaul Beesley   must be set to 1. This will add all pointer authentication system registers
260440d553cfSPaul Beesley   to the context that is saved when doing a world switch.
260540d553cfSPaul Beesley
260640d553cfSPaul Beesley   The TF-A itself has support for pointer authentication at runtime
26079fc59639SAlexei Fedorov   that can be enabled by setting ``BRANCH_PROTECTION`` option to non-zero and
260840d553cfSPaul Beesley   ``CTX_INCLUDE_PAUTH_REGS`` to 1. This enables pointer authentication in BL1,
260940d553cfSPaul Beesley   BL2, BL31, and the TSP if it is used.
261040d553cfSPaul Beesley
261140d553cfSPaul Beesley   These options are experimental features.
261240d553cfSPaul Beesley
261340d553cfSPaul Beesley   Note that Pointer Authentication is enabled for Non-secure world irrespective
261440d553cfSPaul Beesley   of the value of these build flags if the CPU supports it.
261540d553cfSPaul Beesley
261640d553cfSPaul Beesley   If ``ARM_ARCH_MAJOR == 8`` and ``ARM_ARCH_MINOR >= 3`` the code footprint of
261740d553cfSPaul Beesley   enabling PAuth is lower because the compiler will use the optimized
261840d553cfSPaul Beesley   PAuth instructions rather than the backwards-compatible ones.
261940d553cfSPaul Beesley
26209fc59639SAlexei FedorovArmv8.5-A
26219fc59639SAlexei Fedorov~~~~~~~~~
26229fc59639SAlexei Fedorov
26239fc59639SAlexei Fedorov-  Branch Target Identification feature is selected by ``BRANCH_PROTECTION``
262488d493fbSJustin Chadwell   option set to 1. This option defaults to 0 and this is an experimental
262588d493fbSJustin Chadwell   feature.
262688d493fbSJustin Chadwell
262788d493fbSJustin Chadwell-  Memory Tagging Extension feature is unconditionally enabled for both worlds
262888d493fbSJustin Chadwell   (at EL0 and S-EL0) if it is only supported at EL0. If instead it is
262988d493fbSJustin Chadwell   implemented at all ELs, it is unconditionally enabled for only the normal
263088d493fbSJustin Chadwell   world. To enable it for the secure world as well, the build option
263188d493fbSJustin Chadwell   ``CTX_INCLUDE_MTE_REGS`` is required. If the hardware does not implement
263288d493fbSJustin Chadwell   MTE support at all, it is always disabled, no matter what build options
263388d493fbSJustin Chadwell   are used.
26349fc59639SAlexei Fedorov
263540d553cfSPaul BeesleyArmv7-A
263640d553cfSPaul Beesley~~~~~~~
263740d553cfSPaul Beesley
263840d553cfSPaul BeesleyThis Architecture Extension is targeted when ``ARM_ARCH_MAJOR`` == 7.
263940d553cfSPaul Beesley
264040d553cfSPaul BeesleyThere are several Armv7-A extensions available. Obviously the TrustZone
264140d553cfSPaul Beesleyextension is mandatory to support the TF-A bootloader and runtime services.
264240d553cfSPaul Beesley
264340d553cfSPaul BeesleyPlatform implementing an Armv7-A system can to define from its target
264440d553cfSPaul BeesleyCortex-A architecture through ``ARM_CORTEX_A<X> = yes`` in their
264540d553cfSPaul Beesley``platform.mk`` script. For example ``ARM_CORTEX_A15=yes`` for a
264640d553cfSPaul BeesleyCortex-A15 target.
264740d553cfSPaul Beesley
264840d553cfSPaul BeesleyPlatform can also set ``ARM_WITH_NEON=yes`` to enable neon support.
2649be653a69SPaul BeesleyNote that using neon at runtime has constraints on non secure world context.
265040d553cfSPaul BeesleyTF-A does not yet provide VFP context management.
265140d553cfSPaul Beesley
265240d553cfSPaul BeesleyDirective ``ARM_CORTEX_A<x>`` and ``ARM_WITH_NEON`` are used to set
265340d553cfSPaul Beesleythe toolchain  target architecture directive.
265440d553cfSPaul Beesley
265540d553cfSPaul BeesleyPlatform may choose to not define straight the toolchain target architecture
265640d553cfSPaul Beesleydirective by defining ``MARCH32_DIRECTIVE``.
265740d553cfSPaul BeesleyI.e:
265840d553cfSPaul Beesley
265929c02529SPaul Beesley.. code:: make
266040d553cfSPaul Beesley
266140d553cfSPaul Beesley   MARCH32_DIRECTIVE := -mach=armv7-a
266240d553cfSPaul Beesley
266340d553cfSPaul BeesleyCode Structure
266440d553cfSPaul Beesley--------------
266540d553cfSPaul Beesley
266640d553cfSPaul BeesleyTF-A code is logically divided between the three boot loader stages mentioned
266740d553cfSPaul Beesleyin the previous sections. The code is also divided into the following
266840d553cfSPaul Beesleycategories (present as directories in the source code):
266940d553cfSPaul Beesley
267040d553cfSPaul Beesley-  **Platform specific.** Choice of architecture specific code depends upon
267140d553cfSPaul Beesley   the platform.
267240d553cfSPaul Beesley-  **Common code.** This is platform and architecture agnostic code.
267340d553cfSPaul Beesley-  **Library code.** This code comprises of functionality commonly used by all
267440d553cfSPaul Beesley   other code. The PSCI implementation and other EL3 runtime frameworks reside
267540d553cfSPaul Beesley   as Library components.
267640d553cfSPaul Beesley-  **Stage specific.** Code specific to a boot stage.
267740d553cfSPaul Beesley-  **Drivers.**
267840d553cfSPaul Beesley-  **Services.** EL3 runtime services (eg: SPD). Specific SPD services
267940d553cfSPaul Beesley   reside in the ``services/spd`` directory (e.g. ``services/spd/tspd``).
268040d553cfSPaul Beesley
268140d553cfSPaul BeesleyEach boot loader stage uses code from one or more of the above mentioned
268240d553cfSPaul Beesleycategories. Based upon the above, the code layout looks like this:
268340d553cfSPaul Beesley
268440d553cfSPaul Beesley::
268540d553cfSPaul Beesley
268640d553cfSPaul Beesley    Directory    Used by BL1?    Used by BL2?    Used by BL31?
268740d553cfSPaul Beesley    bl1          Yes             No              No
268840d553cfSPaul Beesley    bl2          No              Yes             No
268940d553cfSPaul Beesley    bl31         No              No              Yes
269040d553cfSPaul Beesley    plat         Yes             Yes             Yes
269140d553cfSPaul Beesley    drivers      Yes             No              Yes
269240d553cfSPaul Beesley    common       Yes             Yes             Yes
269340d553cfSPaul Beesley    lib          Yes             Yes             Yes
269440d553cfSPaul Beesley    services     No              No              Yes
269540d553cfSPaul Beesley
269640d553cfSPaul BeesleyThe build system provides a non configurable build option IMAGE_BLx for each
269740d553cfSPaul Beesleyboot loader stage (where x = BL stage). e.g. for BL1 , IMAGE_BL1 will be
269840d553cfSPaul Beesleydefined by the build system. This enables TF-A to compile certain code only
269940d553cfSPaul Beesleyfor specific boot loader stages
270040d553cfSPaul Beesley
270140d553cfSPaul BeesleyAll assembler files have the ``.S`` extension. The linker source files for each
270240d553cfSPaul Beesleyboot stage have the extension ``.ld.S``. These are processed by GCC to create the
270340d553cfSPaul Beesleylinker scripts which have the extension ``.ld``.
270440d553cfSPaul Beesley
270540d553cfSPaul BeesleyFDTs provide a description of the hardware platform and are used by the Linux
270640d553cfSPaul Beesleykernel at boot time. These can be found in the ``fdts`` directory.
270740d553cfSPaul Beesley
270834760951SPaul Beesley.. rubric:: References
270940d553cfSPaul Beesley
271034760951SPaul Beesley-  `Trusted Board Boot Requirements CLIENT (TBBR-CLIENT) Armv8-A (ARM DEN0006D)`_
271134760951SPaul Beesley
271234760951SPaul Beesley-  `Power State Coordination Interface PDD`_
271334760951SPaul Beesley
271434760951SPaul Beesley-  `SMC Calling Convention PDD`_
271534760951SPaul Beesley
271634760951SPaul Beesley-  :ref:`Interrupt Management Framework`
271740d553cfSPaul Beesley
271840d553cfSPaul Beesley--------------
271940d553cfSPaul Beesley
272062c9be71SPetre-Ionut Tudor*Copyright (c) 2013-2020, Arm Limited and Contributors. All rights reserved.*
272140d553cfSPaul Beesley
272234760951SPaul Beesley.. _Power State Coordination Interface PDD: http://infocenter.arm.com/help/topic/com.arm.doc.den0022d/Power_State_Coordination_Interface_PDD_v1_1_DEN0022D.pdf
272340d553cfSPaul Beesley.. _SMCCC: http://infocenter.arm.com/help/topic/com.arm.doc.den0028b/ARM_DEN0028B_SMC_Calling_Convention.pdf
272440d553cfSPaul Beesley.. _PSCI: http://infocenter.arm.com/help/topic/com.arm.doc.den0022d/Power_State_Coordination_Interface_PDD_v1_1_DEN0022D.pdf
272540d553cfSPaul Beesley.. _Power State Coordination Interface PDD: http://infocenter.arm.com/help/topic/com.arm.doc.den0022d/Power_State_Coordination_Interface_PDD_v1_1_DEN0022D.pdf
272662c9be71SPetre-Ionut Tudor.. _Arm ARM: https://developer.arm.com/docs/ddi0487/latest
272740d553cfSPaul Beesley.. _SMC Calling Convention PDD: http://infocenter.arm.com/help/topic/com.arm.doc.den0028b/ARM_DEN0028B_SMC_Calling_Convention.pdf
272840d553cfSPaul Beesley.. _Trusted Board Boot Requirements CLIENT (TBBR-CLIENT) Armv8-A (ARM DEN0006D): https://developer.arm.com/docs/den0006/latest/trusted-board-boot-requirements-client-tbbr-client-armv8-a
272940d553cfSPaul Beesley
2730a2c320a8SPaul Beesley.. |Image 1| image:: ../resources/diagrams/rt-svc-descs-layout.png
2731