1Translation (XLAT) Tables Library 2================================= 3 4This document describes the design of the translation tables library (version 2) 5used by Trusted Firmware-A (TF-A). This library provides APIs to create page 6tables based on a description of the memory layout, as well as setting up system 7registers related to the Memory Management Unit (MMU) and performing the 8required Translation Lookaside Buffer (TLB) maintenance operations. 9 10More specifically, some use cases that this library aims to support are: 11 12#. Statically allocate translation tables and populate them (at run-time) based 13 on a description of the memory layout. The memory layout is typically 14 provided by the platform port as a list of memory regions; 15 16#. Support for generating translation tables pertaining to a different 17 translation regime than the exception level the library code is executing at; 18 19#. Support for dynamic mapping and unmapping of regions, even while the MMU is 20 on. This can be used to temporarily map some memory regions and unmap them 21 later on when no longer needed; 22 23#. Support for non-identity virtual to physical mappings to compress the virtual 24 address space; 25 26#. Support for changing memory attributes of memory regions at run-time. 27 28 29About version 1 and version 2 30----------------------------- 31 32This document focuses on version 2 of the library, whose sources are available 33in the ``lib/xlat_tables_v2`` directory. Version 1 of the library can still be 34found in ``lib/xlat_tables`` directory but it is less flexible and doesn't 35support dynamic mapping. Although potential bug fixes will be applied to both 36versions, future features enhancements will focus on version 2 and might not be 37back-ported to version 1. Therefore, it is recommended to use version 2, 38especially for new platform ports. 39 40However, please note that version 2 is still in active development and is not 41considered stable yet. Hence, compatibility breaks might be introduced. 42 43From this point onwards, this document will implicitly refer to version 2 of the 44library. 45 46 47Design concepts and interfaces 48------------------------------ 49 50This section presents some of the key concepts and data structures used in the 51translation tables library. 52 53`mmap` regions 54~~~~~~~~~~~~~~ 55 56An ``mmap_region`` is an abstract, concise way to represent a memory region to 57map. It is one of the key interfaces to the library. It is identified by: 58 59- its physical base address; 60- its virtual base address; 61- its size; 62- its attributes; 63- its mapping granularity (optional). 64 65See the ``struct mmap_region`` type in ``xlat_tables_v2.h``. 66 67The user usually provides a list of such mmap regions to map and lets the 68library transpose that in a set of translation tables. As a result, the library 69might create new translation tables, update or split existing ones. 70 71The region attributes specify the type of memory (for example device or cached 72normal memory) as well as the memory access permissions (read-only or 73read-write, executable or not, secure or non-secure, and so on). In the case of 74the EL1&0 translation regime, the attributes also specify whether the region is 75a User region (EL0) or Privileged region (EL1). See the ``MT_xxx`` definitions 76in ``xlat_tables_v2.h``. Note that for the EL1&0 translation regime the Execute 77Never attribute is set simultaneously for both EL1 and EL0. 78 79The granularity controls the translation table level to go down to when mapping 80the region. For example, assuming the MMU has been configured to use a 4KB 81granule size, the library might map a 2MB memory region using either of the two 82following options: 83 84- using a single level-2 translation table entry; 85- using a level-2 intermediate entry to a level-3 translation table (which 86 contains 512 entries, each mapping 4KB). 87 88The first solution potentially requires less translation tables, hence 89potentially less memory. However, if part of this 2MB region is later remapped 90with different memory attributes, the library might need to split the existing 91page tables to refine the mappings. If a single level-2 entry has been used 92here, a level-3 table will need to be allocated on the fly and the level-2 93modified to point to this new level-3 table. This has a performance cost at 94run-time. 95 96If the user knows upfront that such a remapping operation is likely to happen 97then they might enforce a 4KB mapping granularity for this 2MB region from the 98beginning; remapping some of these 4KB pages on the fly then becomes a 99lightweight operation. 100 101The region's granularity is an optional field; if it is not specified the 102library will choose the mapping granularity for this region as it sees fit (more 103details can be found in `The memory mapping algorithm`_ section below). 104 105Translation Context 106~~~~~~~~~~~~~~~~~~~ 107 108The library can create or modify translation tables pertaining to a different 109translation regime than the exception level the library code is executing at. 110For example, the library might be used by EL3 software (for instance BL31) to 111create translation tables pertaining to the S-EL1&0 translation regime. 112 113This flexibility comes from the use of *translation contexts*. A *translation 114context* constitutes the superset of information used by the library to track 115the status of a set of translation tables for a given translation regime. 116 117The library internally allocates a default translation context, which pertains 118to the translation regime of the current exception level. Additional contexts 119may be explicitly allocated and initialized using the 120``REGISTER_XLAT_CONTEXT()`` macro. Separate APIs are provided to act either on 121the default translation context or on an alternative one. 122 123To register a translation context, the user must provide the library with the 124following information: 125 126* A name. 127 128 The resulting translation context variable will be called after this name, to 129 which ``_xlat_ctx`` is appended. For example, if the macro name parameter is 130 ``foo``, the context variable name will be ``foo_xlat_ctx``. 131 132* The maximum number of `mmap` regions to map. 133 134 Should account for both static and dynamic regions, if applicable. 135 136* The number of sub-translation tables to allocate. 137 138 Number of translation tables to statically allocate for this context, 139 excluding the initial lookup level translation table, which is always 140 allocated. For example, if the initial lookup level is 1, this parameter would 141 specify the number of level-2 and level-3 translation tables to pre-allocate 142 for this context. 143 144* The size of the virtual address space. 145 146 Size in bytes of the virtual address space to map using this context. This 147 will incidentally determine the number of entries in the initial lookup level 148 translation table : the library will allocate as many entries as is required 149 to map the entire virtual address space. 150 151* The size of the physical address space. 152 153 Size in bytes of the physical address space to map using this context. 154 155The default translation context is internally initialized using information 156coming (for the most part) from platform-specific defines: 157 158- name: hard-coded to ``tf`` ; hence the name of the default context variable is 159 ``tf_xlat_ctx``; 160- number of `mmap` regions: ``MAX_MMAP_REGIONS``; 161- number of sub-translation tables: ``MAX_XLAT_TABLES``; 162- size of the virtual address space: ``PLAT_VIRT_ADDR_SPACE_SIZE``; 163- size of the physical address space: ``PLAT_PHY_ADDR_SPACE_SIZE``. 164 165Please refer to the :ref:`Porting Guide` for more details about these macros. 166 167 168Static and dynamic memory regions 169~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 171The library optionally supports dynamic memory mapping. This feature may be 172enabled using the ``PLAT_XLAT_TABLES_DYNAMIC`` platform build flag. 173 174When dynamic memory mapping is enabled, the library categorises mmap regions as 175*static* or *dynamic*. 176 177- *Static regions* are fixed for the lifetime of the system. They can only be 178 added early on, before the translation tables are created and populated. They 179 cannot be removed afterwards. 180 181- *Dynamic regions* can be added or removed any time. 182 183When the dynamic memory mapping feature is disabled, only static regions exist. 184 185The dynamic memory mapping feature may be used to map and unmap transient memory 186areas. This is useful when the user needs to access some memory for a fixed 187period of time, after which the memory may be discarded and reclaimed. For 188example, a memory region that is only required at boot time while the system is 189initializing, or to temporarily share a memory buffer between the normal world 190and trusted world. Note that it is up to the caller to ensure that these regions 191are not accessed concurrently while the regions are being added or removed. 192 193Although this feature provides some level of dynamic memory allocation, this 194does not allow dynamically allocating an arbitrary amount of memory at an 195arbitrary memory location. The user is still required to declare at compile-time 196the limits of these allocations ; the library will deny any mapping request that 197does not fit within this pre-allocated pool of memory. 198 199 200Library APIs 201------------ 202 203The external APIs exposed by this library are declared and documented in the 204``xlat_tables_v2.h`` header file. This should be the reference point for 205getting information about the usage of the different APIs this library 206provides. This section just provides some extra details and clarifications. 207 208Although the ``mmap_region`` structure is a publicly visible type, it is not 209recommended to populate these structures by hand. Instead, wherever APIs expect 210function arguments of type ``mmap_region_t``, these should be constructed using 211the ``MAP_REGION*()`` family of helper macros. This is to limit the risk of 212compatibility breaks, should the ``mmap_region`` structure type evolve in the 213future. 214 215The ``MAP_REGION()`` and ``MAP_REGION_FLAT()`` macros do not allow specifying a 216mapping granularity, which leaves the library implementation free to choose 217it. However, in cases where a specific granularity is required, the 218``MAP_REGION2()`` macro might be used instead. 219 220As explained earlier in this document, when the dynamic mapping feature is 221disabled, there is no notion of dynamic regions. Conceptually, there are only 222static regions. For this reason (and to retain backward compatibility with the 223version 1 of the library), the APIs that map static regions do not embed the 224word *static* in their functions names (for example ``mmap_add_region()``), in 225contrast with the dynamic regions APIs (for example 226``mmap_add_dynamic_region()``). 227 228Although the definition of static and dynamic regions is not based on the state 229of the MMU, the two are still related in some way. Static regions can only be 230added before ``init_xlat_tables()`` is called and ``init_xlat_tables()`` must be 231called while the MMU is still off. As a result, static regions cannot be added 232once the MMU has been enabled. Dynamic regions can be added with the MMU on or 233off. In practice, the usual call flow would look like this: 234 235#. The MMU is initially off. 236 237#. Add some static regions, add some dynamic regions. 238 239#. Initialize translation tables based on the list of mmap regions (using one of 240 the ``init_xlat_tables*()`` APIs). 241 242#. At this point, it is no longer possible to add static regions. Dynamic 243 regions can still be added or removed. 244 245#. Enable the MMU. 246 247#. Dynamic regions can continue to be added or removed. 248 249Because static regions are added early on at boot time and are all in the 250control of the platform initialization code, the ``mmap_add*()`` family of APIs 251are not expected to fail. They do not return any error code. 252 253Nonetheless, these APIs will check upfront whether the region can be 254successfully added before updating the translation context structure. If the 255library detects that there is insufficient memory to meet the request, or that 256the new region will overlap another one in an invalid way, or if any other 257unexpected error is encountered, they will print an error message on the UART. 258Additionally, when asserts are enabled (typically in debug builds), an assertion 259will be triggered. Otherwise, the function call will just return straight away, 260without adding the offending memory region. 261 262 263Library limitations 264------------------- 265 266Dynamic regions are not allowed to overlap each other. Static regions are 267allowed to overlap as long as one of them is fully contained inside the other 268one. This is allowed for backwards compatibility with the previous behaviour in 269the version 1 of the library. 270 271 272Implementation details 273---------------------- 274 275Code structure 276~~~~~~~~~~~~~~ 277 278The library is divided into 4 modules: 279 280- **Core module** 281 282 Provides the main functionality of the library, such as the initialization of 283 translation tables contexts and mapping/unmapping memory regions. This module 284 provides functions such as ``mmap_add_region_ctx`` that let the caller specify 285 the translation tables context affected by them. 286 287 See ``xlat_tables_core.c``. 288 289- **Active context module** 290 291 Instantiates the context that is used by the current BL image and provides 292 helpers to manipulate it, abstracting it from the rest of the code. 293 This module provides functions such as ``mmap_add_region``, that directly 294 affect the BL image using them. 295 296 See ``xlat_tables_context.c``. 297 298- **Utilities module** 299 300 Provides additional functionality like debug print of the current state of the 301 translation tables and helpers to query memory attributes and to modify them. 302 303 See ``xlat_tables_utils.c``. 304 305- **Architectural module** 306 307 Provides functions that are dependent on the current execution state 308 (AArch32/AArch64), such as the functions used for TLB invalidation, setup the 309 MMU, or calculate the Physical Address Space size. They do not need a 310 translation context to work on. 311 312 See ``aarch32/xlat_tables_arch.c`` and ``aarch64/xlat_tables_arch.c``. 313 314From mmap regions to translation tables 315~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 317A translation context contains a list of ``mmap_region_t``, which holds the 318information of all the regions that are mapped at any given time. Whenever there 319is a request to map (resp. unmap) a memory region, it is added to (resp. removed 320from) the ``mmap_region_t`` list. 321 322The mmap regions list is a conceptual way to represent the memory layout. At 323some point, the library has to convert this information into actual translation 324tables to program into the MMU. 325 326Before the ``init_xlat_tables()`` API is called, the library only acts on the 327mmap regions list. Adding a static or dynamic region at this point through one 328of the ``mmap_add*()`` APIs does not affect the translation tables in any way, 329they only get registered in the internal mmap region list. It is only when the 330user calls the ``init_xlat_tables()`` that the translation tables are populated 331in memory based on the list of mmap regions registered so far. This is an 332optimization that allows creation of the initial set of translation tables in 333one go, rather than having to edit them every time while the MMU is disabled. 334 335After the ``init_xlat_tables()`` API has been called, only dynamic regions can 336be added. Changes to the translation tables (as well as the mmap regions list) 337will take effect immediately. 338 339The memory mapping algorithm 340~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 341 342The mapping function is implemented as a recursive algorithm. It is however 343bound by the level of depth of the translation tables (the Armv8-A architecture 344allows up to 4 lookup levels). 345 346By default [#granularity]_, the algorithm will attempt to minimize the 347number of translation tables created to satisfy the user's request. It will 348favour mapping a region using the biggest possible blocks, only creating a 349sub-table if it is strictly necessary. This is to reduce the memory footprint of 350the firmware. 351 352The most common reason for needing a sub-table is when a specific mapping 353requires a finer granularity. Misaligned regions also require a finer 354granularity than what the user may had originally expected, using a lot more 355memory than expected. The reason is that all levels of translation are 356restricted to address translations of the same granularity as the size of the 357blocks of that level. For example, for a 4 KiB page size, a level 2 block entry 358can only translate up to a granularity of 2 MiB. If the Physical Address is not 359aligned to 2 MiB then additional level 3 tables are also needed. 360 361Note that not every translation level allows any type of descriptor. Depending 362on the page size, levels 0 and 1 of translation may only allow table 363descriptors. If a block entry could be able to describe a translation, but that 364level does not allow block descriptors, a table descriptor will have to be used 365instead, as well as additional tables at the next level. 366 367|Alignment Example| 368 369The mmap regions are sorted in a way that simplifies the code that maps 370them. Even though this ordering is only strictly needed for overlapping static 371regions, it must also be applied for dynamic regions to maintain a consistent 372order of all regions at all times. As each new region is mapped, existing 373entries in the translation tables are checked to ensure consistency. Please 374refer to the comments in the source code of the core module for more details 375about the sorting algorithm in use. 376 377TLB maintenance operations 378~~~~~~~~~~~~~~~~~~~~~~~~~~ 379 380The library takes care of performing TLB maintenance operations when required. 381For example, when the user requests removing a dynamic region, the library 382invalidates all TLB entries associated to that region to ensure that these 383changes are visible to subsequent execution, including speculative execution, 384that uses the changed translation table entries. 385 386A counter-example is the initialization of translation tables. In this case, 387explicit TLB maintenance is not required. The Armv8-A architecture guarantees 388that all TLBs are disabled from reset and their contents have no effect on 389address translation at reset [#tlb-reset-ref]_. Therefore, the TLBs invalidation 390is deferred to the ``enable_mmu*()`` family of functions, just before the MMU is 391turned on. 392 393TLB invalidation is not required when adding dynamic regions either. Dynamic 394regions are not allowed to overlap existing memory region. Therefore, if the 395dynamic mapping request is deemed legitimate, it automatically concerns memory 396that was not mapped in this translation regime and the library will have 397initialized its corresponding translation table entry to an invalid 398descriptor. Given that the TLBs are not architecturally permitted to hold any 399invalid translation table entry [#tlb-no-invalid-entry]_, this means that this 400mapping cannot be cached in the TLBs. 401 402.. rubric:: Footnotes 403 404.. [#granularity] That is, when mmap regions do not enforce their mapping 405 granularity. 406 407.. [#tlb-reset-ref] See section D4.9 ``Translation Lookaside Buffers (TLBs)``, 408 subsection ``TLB behavior at reset`` in Armv8-A, rev C.a. 409 410.. [#tlb-no-invalid-entry] See section D4.10.1 ``General TLB maintenance 411 requirements`` in Armv8-A, rev C.a. 412 413-------------- 414 415*Copyright (c) 2017-2019, Arm Limited and Contributors. All rights reserved.* 416 417.. |Alignment Example| image:: ../resources/diagrams/xlat_align.png 418