diff options
Diffstat (limited to 'cpu-supplement/port.rst')
-rw-r--r-- | cpu-supplement/port.rst | 422 |
1 files changed, 422 insertions, 0 deletions
diff --git a/cpu-supplement/port.rst b/cpu-supplement/port.rst new file mode 100644 index 0000000..9ccd2f4 --- /dev/null +++ b/cpu-supplement/port.rst @@ -0,0 +1,422 @@ +.. comment SPDX-License-Identifier: CC-BY-SA-4.0 + +.. COMMENT: COPYRIGHT (c) 1988-2002. +.. COMMENT: On-Line Applications Research Corporation (OAR). +.. COMMENT: All rights reserved. + +Port Specific Information +######################### + +This chaper provides a general description of the type of architecture specific +information which is in each of the architecture specific chapters that follow. +The outline of this chapter is identical to that of the architecture specific +chapters. + +In each of the architecture specific chapters, this introductory section will +provide an overview of the architecture: + +**Architecture Documents** + +In each of the architecture specific chapters, this section will provide +pointers on where to obtain documentation. + +CPU Model Dependent Features +============================ + +Microprocessors are generally classified into families with a variety of CPU +models or implementations within that family. Within a processor family, there +is a high level of binary compatibility. This family may be based on either an +architectural specification or on maintaining compatibility with a popular +processor. Recent microprocessor families such as the SPARC or PowerPC are +based on an architectural specification which is independent or any particular +CPU model or implementation. Older families such as the Motorola 68000 and the +Intel x86 evolved as the manufacturer strived to produce higher performance +processor models which maintained binary compatibility with older models. + +RTEMS takes advantage of the similarity of the various models within a CPU +family. Although the models do vary in significant ways, the high level of +compatibility makes it possible to share the bulk of the CPU dependent +executive code across the entire family. Each processor family supported by +RTEMS has a list of features which vary between CPU models within a family. +For example, the most common model dependent feature regardless of CPU family +is the presence or absence of a floating point unit or coprocessor. When +defining the list of features present on a particular CPU model, one simply +notes that floating point hardware is or is not present and defines a single +constant appropriately. Conditional compilation is utilized to include the +appropriate source code for this CPU model's feature set. It is important to +note that this means that RTEMS is thus compiled using the appropriate feature +set and compilation flags optimal for this CPU model used. The alternative +would be to generate a binary which would execute on all family members using +only the features which were always present. + +The set of CPU model feature macros are defined in the +:file:`cpukit/score/cpu/CPU/rtems/score/cpu.h` based upon the GNU tools +multilib variant that is appropriate for the particular CPU model defined on +the compilation command line. + +In each of the architecture specific chapters, this section presents the set of +features which vary across various implementations of the architecture that may +be of importance to RTEMS application developers. + +The subsections will vary amongst the target architecture chapters as the +specific features may vary. However, each port will include a few common +features such as the CPU Model Name and presence of a hardware Floating Point +Unit. The common features are described here. + +CPU Model Name +-------------- + +The macro ``CPU_MODEL_NAME`` is a string which designates the name of this CPU +model. For example, for the MC68020 processor model from the m68k +architecture, this macro is set to the string "mc68020". + +Floating Point Unit +------------------- + +In most architectures, the presence of a floating point unit is an option. It +does not matter whether the hardware floating point support is incorporated +on-chip or is an external coprocessor as long as it appears an FPU per the ISA. +However, if a hardware FPU is not present, it is possible that the floating +point emulation library for this CPU is not reentrant and thus context switched +by RTEMS. + +RTEMS provides two feature macros to indicate the FPU configuration: + +- CPU_HARDWARE_FP + is set to TRUE to indicate that a hardware FPU is present. + +- CPU_SOFTWARE_FP + is set to TRUE to indicate that a hardware FPU is not present and that the FP + software emulation will be context switched. + +Multilibs +========= + +Newlib and GCC provide several target libraries like the :file:`libc.a`, +:file:`libm.a` and :file:`libgcc.a`. These libraries are artifacts of the GCC +build process. Newlib is built together with GCC. To provide optimal support +for various chip derivatives and instruction set revisions multiple variants of +these libraries are available for each architecture. For example one set may +use software floating point support and another set may use hardware floating +point instructions. These sets of libraries are called *multilibs*. Each +library set corresponds to an application binary interface (ABI) and +instruction set. + +A multilib variant can be usually detected via built-in compiler defines at +compile-time. This mechanism is used by RTEMS to select for example the +context switch support for a particular BSP. The built-in compiler defines +corresponding to multilibs are the only architecture specific defines allowed +in the ``cpukit`` area of the RTEMS sources. + +Invoking the GCC with the ``-print-multi-lib`` option lists the available +multilibs. Each line of the output describes one multilib variant. The +default variant is denoted by ``.`` which is selected when no or contradicting +GCC machine options are selected. The multilib selection for a target is +specified by target makefile fragments (see file :file:`t-rtems` in the GCC +sources and section *The Target Makefile Fragment* +(https://gcc.gnu.org/onlinedocs/gccint/Target-Fragment.html#Target-Fragment) +in the *GCC Internals Manual* (https://gcc.gnu.org/onlinedocs/gccint/). + +Calling Conventions +=================== + +Each high-level language compiler generates subroutine entry and exit code +based upon a set of rules known as the compiler's calling convention. These +rules address the following issues: + +- register preservation and usage + +- parameter passing + +- call and return mechanism + +A compiler's calling convention is of importance when interfacing to +subroutines written in another language either assembly or high-level. Even +when the high-level language and target processor are the same, different +compilers may use different calling conventions. As a result, calling +conventions are both processor and compiler dependent. + +Calling Mechanism +----------------- + +In each of the architecture specific chapters, this subsection will describe +the instruction(s) used to perform a *normal* subroutine invocation. All RTEMS +directives are invoked as *normal* C language functions so it is important to +the user application to understand the call and return mechanism. + +Register Usage +-------------- + +In each of the architecture specific chapters, this subsection will detail the +set of registers which are *NOT* preserved across subroutine invocations. The +registers which are not preserved are assumed to be available for use as +scratch registers. Therefore, the contents of these registers should not be +assumed upon return from any RTEMS directive. + +In some architectures, there may be a set of registers made available +automatically as a side-effect of the subroutine invocation mechanism. + +Parameter Passing +----------------- + +In each of the architecture specific chapters, this subsection will describe +the mechanism by which the parameters or arguments are passed by the caller to +a subroutine. In some architectures, all parameters are passed on the stack +while in others some are passed in registers. + +User-Provided Routines +---------------------- + +All user-provided routines invoked by RTEMS, such as user extensions, device +drivers, and MPCI routines, must also adhere to these calling conventions. + +Memory Model +============ + +A processor may support any combination of memory models ranging from pure +physical addressing to complex demand paged virtual memory systems. RTEMS +supports a flat memory model which ranges contiguously over the processor's +allowable address space. RTEMS does not support segmentation or virtual memory +of any kind. The appropriate memory model for RTEMS provided by the targeted +processor and related characteristics of that model are described in this +chapter. + +Flat Memory Model +----------------- + +Most RTEMS target processors can be initialized to support a flat address +space. Although the size of addresses varies between architectures, on most +RTEMS targets, an address is 32-bits wide which defines addresses ranging from +0x00000000 to 0xFFFFFFFF (4 gigabytes). Each address is represented by a +32-bit value and is byte addressable. The address may be used to reference a +single byte, word (2-bytes), or long word (4 bytes). Memory accesses within +this address space may be performed in little or big endian fashion. + +On smaller CPU architectures supported by RTEMS, the address space may only be +20 or 24 bits wide. + +If the CPU model has support for virtual memory or segmentation, it is the +responsibility of the Board Support Package (BSP) to initialize the MMU +hardware to perform address translations which correspond to flat memory model. + +In each of the architecture specific chapters, this subsection will describe +any architecture characteristics that differ from this general description. + +Interrupt Processing +==================== + +Different types of processors respond to the occurrence of an interrupt in its +own unique fashion. In addition, each processor type provides a control +mechanism to allow for the proper handling of an interrupt. The processor +dependent response to the interrupt modifies the current execution state and +results in a change in the execution stream. Most processors require that an +interrupt handler utilize some special control mechanisms to return to the +normal processing stream. Although RTEMS hides many of the processor dependent +details of interrupt processing, it is important to understand how the RTEMS +interrupt manager is mapped onto the processor's unique architecture. + +RTEMS supports a dedicated interrupt stack for all architectures. On +architectures with hardware support for a dedicated interrupt stack, it will be +initialized such that when an interrupt occurs, the processor automatically +switches to this dedicated stack. On architectures without hardware support +for a dedicated interrupt stack which is separate from those of the tasks, +RTEMS will support switching to a dedicated stack for interrupt processing. + +Without a dedicated interrupt stack, every task in the system MUST have enough +stack space to accommodate the worst case stack usage of that particular task +and the interrupt service routines COMBINED. By supporting a dedicated +interrupt stack, RTEMS significantly lowers the stack requirements for each +task. + +A nested interrupt is processed similarly with the exception that since the CPU +is already executing on the interrupt stack, there is no need to switch to the +interrupt stack. + +In some configurations, RTEMS allocates the interrupt stack from the Workspace +Area. The amount of memory allocated for the interrupt stack is user +configured and based upon the ``confdefs.h`` parameter +``CONFIGURE_INTERRUPT_STACK_SIZE``. This parameter is described in detail in +the Configuring a System chapter of the User's Guide. On configurations in +which RTEMS allocates the interrupt stack, during the initialization process, +RTEMS will also install its interrupt stack. In other configurations, the +interrupt stack is allocated and installed by the Board Support Package (BSP). + +In each of the architecture specific chapters, this section discesses the +interrupt response and control mechanisms of the architecture as they pertain +to RTEMS. + +Vectoring of an Interrupt Handler +--------------------------------- + +In each of the architecture specific chapters, this subsection will describe +the architecture specific details of the interrupt vectoring process. In +particular, it should include a description of the Interrupt Stack Frame (ISF). + +Interrupt Levels +---------------- + +In each of the architecture specific chapters, this subsection will describe +how the interrupt levels available on this particular architecture are mapped +onto the 255 reserved in the task mode. The interrupt level value of zero (0) +should always mean that interrupts are enabled. + +Any use of an interrupt level that is is not undefined on a particular +architecture may result in behavior that is unpredictable. + +Disabling of Interrupts by RTEMS +-------------------------------- + +During the execution of directive calls, critical sections of code may be +executed. When these sections are encountered, RTEMS disables all external +interrupts before the execution of this section and restores them to the +previous level upon completion of the section. RTEMS has been optimized to +ensure that interrupts are disabled for the shortest number of instructions +possible. Since the precise number of instructions and their execution time +varies based upon target CPU family, CPU model, board memory speed, compiler +version, and optimization level, it is not practical to provide the precise +number for all possible RTEMS configurations. + +Historically, the measurements were made by hand analyzing and counting the +execution time of instruction sequences during interrupt disable critical +sections. For reference purposes, on a 16 Mhz Motorola MC68020, the maximum +interrupt disable period was typically approximately ten (10) to thirteen (13) +microseconds. This architecture was memory bound and had a slow bit scan +instruction. In contrast, during the same period a 14 Mhz SPARC would have a +worst case disable time of approximately two (2) to three (3) microseconds +because it had a single cycle bit scan instruction and used fewer cycles for +memory accesses. + +If you are interested in knowing the worst case execution time for a particular +version of RTEMS, please contact OAR Corporation and we will be happy to +product the results as a consulting service. + +Non-maskable interrupts (NMI) cannot be disabled, and ISRs which execute at +this level MUST NEVER issue RTEMS system calls. If a directive is invoked, +unpredictable results may occur due to the inability of RTEMS to protect its +critical sections. However, ISRs that make no system calls may safely execute +as non-maskable interrupts. + +Default Fatal Error Processing +============================== + +Upon detection of a fatal error by either the application or RTEMS during +initialization the ``rtems_fatal_error_occurred`` directive supplied by the +Fatal Error Manager is invoked. The Fatal Error Manager will invoke the +user-supplied fatal error handlers. If no user-supplied handlers are +configured or all of them return without taking action to shutdown the +processor or reset, a default fatal error handler is invoked. + +Most of the action performed as part of processing the fatal error are +described in detail in the Fatal Error Manager chapter in the User's Guide. +However, the if no user provided extension or BSP specific fatal error handler +takes action, the final default action is to invoke a CPU architecture specific +function. Typically this function disables interrupts and halts the processor. + +In each of the architecture specific chapters, this describes the precise +operations of the default CPU specific fatal error handler. + +Symmetric Multiprocessing +========================= + +This section contains information about the Symmetric Multiprocessing (SMP) +status of a particular architecture. + +Thread-Local Storage +==================== + +In order to support thread-local storage (TLS) the CPU port must implement the +facilities mandated by the application binary interface (ABI) of the CPU +architecture. The CPU port must initialize the TLS area in the +``_CPU_Context_Initialize()`` function. There are support functions available +via ``#include <rtems/score/tls.h>`` which implement Variants I and II +according to Ulrich Drepper, *ELF Handling For Thread-Local Storage*. + +``_TLS_TCB_at_area_begin_initialize()`` + Uses Variant I, TLS offsets emitted by linker takes the TCB into account. + For a reference implementation see :file:`cpukit/score/cpu/arm/cpu.c`. + +``_TLS_TCB_before_TLS_block_initialize()`` + Uses Variant I, TLS offsets emitted by linker neglects the TCB. For a + reference implementation see + :file:`c/src/lib/libcpu/powerpc/new-exceptions/cpu.c`. + +``_TLS_TCB_after_TLS_block_initialize()`` + Uses Variant II. For a reference implementation see + :file:`cpukit/score/cpu/sparc/cpu.c`. + +The board support package (BSP) must provide the following sections and symbols +in its linker command file: + +.. code-block:: c + + .tdata : { + _TLS_Data_begin = .; + *(.tdata .tdata.* .gnu.linkonce.td.*) + _TLS_Data_end = .; + } + .tbss : { + _TLS_BSS_begin = .; + *(.tbss .tbss.* .gnu.linkonce.tb.*) *(.tcommon) + _TLS_BSS_end = .; + } + _TLS_Data_size = _TLS_Data_end - _TLS_Data_begin; + _TLS_Data_begin = _TLS_Data_size != 0 ? _TLS_Data_begin : _TLS_BSS_begin; + _TLS_Data_end = _TLS_Data_size != 0 ? _TLS_Data_end : _TLS_BSS_begin; + _TLS_BSS_size = _TLS_BSS_end - _TLS_BSS_begin; + _TLS_Size = _TLS_BSS_end - _TLS_Data_begin; + _TLS_Alignment = MAX (ALIGNOF (.tdata), ALIGNOF (.tbss)); + +CPU counter +=========== + +The CPU support must implement the CPU counter interface. A CPU counter is +some free-running counter. It ticks usually with a frequency close to the CPU +or system bus clock. On some architectures the actual implementation is board +support package dependent. The CPU counter is used for profiling of low-level +functions. It is also used to implement two busy wait functions +``rtems_counter_delay_ticks()`` and ``rtems_counter_delay_nanoseconds()`` which +may be used in device drivers. It may be also used as an entropy source for +random number generators. + +The CPU counter interface uses a CPU port specific unsigned integer type +``CPU_Counter_ticks`` to represent CPU counter values. The CPU port must +provide the following two functions + +- ``_CPU_Counter_read()`` to read the current CPU counter value, and + +- ``_CPU_Counter_difference()`` to get the difference between two CPU + counter values. + +Interrupt Profiling +=================== + +The RTEMS profiling needs support by the CPU port for the interrupt entry and +exit times. In case profiling is enabled via the RTEMS build configuration +option ``--enable-profiling`` (in this case the pre-processor symbol +``RTEMS_PROFILING`` is defined) the CPU port may provide data for the interrupt +entry and exit times of the outer-most interrupt. The CPU port can feed +interrupt entry and exit times with the +``_Profiling_Outer_most_interrupt_entry_and_exit()`` function (``#include +<rtems/score/profiling.h>``). For an example please have a look at +:file:`cpukit/score/cpu/arm/arm_exc_interrupt.S`. + +Board Support Packages +====================== + +An RTEMS Board Support Package (BSP) must be designed to support a particular +processor model and target board combination. + +In each of the architecture specific chapters, this section will present a +discussion of architecture specific BSP issues. For more information on +developing a BSP, refer to BSP and Device Driver Development Guide and the +chapter titled Board Support Packages in the RTEMS Applications User's Guide. + +System Reset +------------ + +An RTEMS based application is initiated or re-initiated when the processor is +reset or transfer is passed to it from a boot monitor or ROM monitor. + +In each of the architecture specific chapters, this subsection describes the +actions that the BSP must tak assuming the application gets control when the +microprocessor is reset. |