summaryrefslogtreecommitdiffstats
path: root/user
diff options
context:
space:
mode:
authorChris Johns <chrisj@rtems.org>2019-02-15 16:03:59 +1100
committerChris Johns <chrisj@rtems.org>2019-02-20 09:05:35 +1100
commit8a73acb71305c58c6ea58aa0da84d4adb4c125cc (patch)
treebf67f62b560c58adbeeba1235bf649c6f9fa8469 /user
parent789b0caad7a2b715398927491b9e70bda718c987 (diff)
downloadrtems-docs-8a73acb71305c58c6ea58aa0da84d4adb4c125cc.tar.bz2
user: Add a Dynamic Loader section.
Diffstat (limited to 'user')
-rw-r--r--user/exe/executables.rst2
-rw-r--r--user/exe/execution.rst2
-rw-r--r--user/exe/index.rst5
-rw-r--r--user/exe/loader.rst866
-rw-r--r--user/installation/developer.rst3
-rw-r--r--user/tools/exeinfo.rst2
-rw-r--r--user/tools/linker.rst2
-rw-r--r--user/tools/symbols.rst2
8 files changed, 881 insertions, 3 deletions
diff --git a/user/exe/executables.rst b/user/exe/executables.rst
index be57fed..772c28e 100644
--- a/user/exe/executables.rst
+++ b/user/exe/executables.rst
@@ -77,6 +77,8 @@ built with ``make``, ``autotools``, ``cmake``, ``waf`` and more. User should
select a build system that meets their project, system, corporate or personal
needs.
+.. _MachineFlagsandABI:
+
Machine Flags and ABI
---------------------
.. index:: Machine flags
diff --git a/user/exe/execution.rst b/user/exe/execution.rst
index e05f788..281cb5d 100644
--- a/user/exe/execution.rst
+++ b/user/exe/execution.rst
@@ -2,6 +2,8 @@
.. Copyright (C) 2018 Chris Johns <chrisj@rtems.org>
+.. _TargetExecution:
+
Target Execution
================
.. index:: Target Execution
diff --git a/user/exe/index.rst b/user/exe/index.rst
index 136a410..02dab19 100644
--- a/user/exe/index.rst
+++ b/user/exe/index.rst
@@ -11,10 +11,11 @@ Executables
This section discusses what an RTEMS executable is and what happens when you
execute it in a target. The section discusses how an application executable is
-created, what happens when an executable is loaded and run as well as
-debugging an execiutable.
+created, what happens when an executable is loaded and run, debugging an
+execiutable, and creating and dynamically loading code.
.. include:: executables.rst
.. include:: execution.rst
.. include:: initialization.rst
.. include:: debugging.rst
+.. include:: loader.rst
diff --git a/user/exe/loader.rst b/user/exe/loader.rst
new file mode 100644
index 0000000..77d7cda
--- /dev/null
+++ b/user/exe/loader.rst
@@ -0,0 +1,866 @@
+.. SPDX-License-Identifier: CC-BY-SA-4.0
+
+.. Copyright (C) 2019 Chris Johns <chrisj@rtems.org>
+
+.. index:: Dynamic Loader
+
+Dynamic Loader
+==============
+.. index:: Dynamic Loader
+.. index:: Run-time Loader
+.. index:: RTL
+.. index:: Libdl
+
+RTEMS supports dynamically loading of executable code and data in the form of
+object files into a running system where the run-time loaded code can be
+executed and data accessed
+
+This section describes RTEMS loader, preparing and loading executable code into
+a running system, the supported architectures and any limitation that may exist
+with an architecture.
+
+The RTEMS operating system contains a link editor that runs on the target. The
+link editor supports loading Extendable Linker Format (ELF) relocatable
+executable object files locating the code and data in the target's address space
+as it is loaded. An executable object file's external references to function
+identifiers and data object identifiers are resolved and any external symbols
+can be made available in the global symbol table. The executing performance of
+dynamically loaded code is similar to the same code statically linked into an
+executable. This is a core requirement of the RTEMS link editor.
+
+.. _fig-dl-libdl:
+
+.. figure:: ../../images/user/libdl.png
+ :width: 95%
+ :alt: Run Time Loader (libdl)
+ :figclass: align-center
+
+ Run Time Loader (libdl)
+
+The RTEMS operating system's dynamic loader is not the same as the dynamic
+shared library support Unix or Windows have. Those operating systems use dynamic
+loading to share code between processes and this is an important feature in
+their design. RTEMS is a single address space operating system and that means
+there is no ability to share code at run-time. As a result code is loaded in a
+similar manner to static linking removing the need for any overheads sharing
+code may have.
+
+To load an executable object file it must be resident on a target and accessible
+by RTEMS's file system. The executable object file can be a single file or a
+collection in a library stored using the Unix standard archive format. The RTEMS
+loader supports the extended GNU format for long file names in archives.
+
+The RTEMS developers do not see dynamically loading of code as a real-time
+activity. A system should not respond to real-time external events by loading
+code. The loading of code should happen before a system is considered available
+and the activity the system is experiencing is low and stable.
+
+.. index: base image
+
+The statically linked executable that is loaded and run after reset is called
+the *base image*. The *base image* contains your base application that is used
+to dynamically load code, a global symbol table, the parts of the RTEMS
+operating system code used in the base image as well as functions and data from
+the tool suite libraries and packages you are using. Only the software
+referenced is used to create the base image. The parts of the libraries not
+referenced are not part of the executable or present in the global symbol table.
+
+Application software can locate a symbol by name and call the address or
+reference the data at that address. A function identifier located by a symbol
+does not have it's signatures checked, it is the responsibility of the caller to
+make sure the function is called with the correct arguments. It is the same for
+data objects, there is no type checking. Symbol versioning is not supported and
+supporting it does not make sense within the RTEMS operating system. An RTEMS
+target system is closed to normal users and software needs to be built from the
+same tool set and header files used to the build the base image.
+
+An executable object file's text or code has to be built for the target's
+architecture it is loaded on and it must be built with the same ABI flags the
+base image is built with. See :ref:`MachineFlagsandABI`.
+
+System Design
+-------------
+
+The use of dynamic loading in a project is a system design decision. Some
+systems will have strict requirements where loading code into a live system is
+not allowed while other projects will benefit from the system level flexibility
+dynamically loading code provides.
+
+Code loaded at run time needs to be resident or accessible to the target via
+RTEMS's file system. Targets that have suitable media or a network interface to
+NFS servers to hold the executable object and library files are best suited.
+
+Dynamically loading code uses more memory than statically linking the same code
+into the base image. The link editor maintains symbol tables where each symbol
+is a string, an address, and some additional data. The executable object files
+resident in memory each have data to manage them, the memory they use, and any
+dependencies they might have. The link editor is designed to minimize the memory
+overheads however only statically linked executables have no memory overhead.
+
+The link editor relocates the code and data into RAM fixing it to the load
+address as it is loaded. A target needs to have suitably configured memory
+available for the executable object file to reside in. The memory must be able
+to support read, write and executable type access. Fine control of the memory
+and it's modes can be supported using a customer allocator. Examples are systems
+that have a custom memory map, specialized memory for the execution of code or a
+requirement for read-only executable sections.
+
+The load address of an executable object file is determined by the load order
+and the allocator used. The default allocator for the link editor is the system
+heap which means the location a specific executable object file is loaded at
+depends on the memory allocated before it is loaded and when in the load order
+it is loaded. A statically linked executable's address map is fixed and this is
+considered important in some systems. A dynamically loaded system can be loaded
+in a repeatable manner if the load order is the same and the initialization
+sequence of the system is controlled. A custom allocator may also help.
+
+Management of dynamically loadable object files and libraries adds to the
+configuration management of the hosts in a project. The loadable files need to
+be released and tracked in a suitable configuration management process just like
+the base image is. Executable object files and libraries are specific to a
+version of RTEMS and cannot be mixed or moved and this needs to be carefully
+managed. Currently there are no checks an executable object file matches the
+version of the base image it is being loaded on. These extra configuration
+controlled items add to the overheads of a project and need to be considered.
+
+Dynamically loadable systems have a number of features that benefit some systems
+and products. Systems can be built on a base of trusted or *golden* modules. A
+number of projects using a common base of hardware can make use of proven
+modules reducing the testing and qualification overhead for each new release. A
+tested base image with libraries for common and available boards provides a
+simple and fast way for new users to trial and use RTEMS.
+
+A project can use dynamic loading during development, shipping statically linked
+executables in production. Hardware used by a development team can have more
+memory, extra media for disk drives, or a network interface.
+
+Loader Interface
+----------------
+.. index:: Loader Interface
+.. index:: Loading object files
+.. index:: dlfcn.h
+
+Run-time executable object file loading and management is via the standard's
+based calls provided by the header file ``<dlfcn.h>``. The details of the calls
+follow.
+
+.. _dlopen:
+.. index:: dlopen
+``void* dlopen(const char* path, int mode);``
+ The ``dlopen()`` function makes the symbols (function identifiers and data
+ object identifiers) in the executable object file specified by `file`
+ available to the calling program.
+
+ The executable object files eligible for this operation are in the ELF
+ format.
+
+ The link loader may load embedded dependencies in executable object files. In
+ such cases, a ``dlopen()`` operation may load those dependencies in addition
+ to the executable object file specified by `file`.
+
+ A successful ``dlopen()`` returns a `handle` which the caller may use on
+ subsequent calls to ``dlsym()``, ``dlinfo()`` and ``dlclose()``.
+
+ The value of the `handle` should not be interpreted in any way by the caller.
+
+ Subsequent calls to ``dlopen()`` for the same executable object file increases
+ the references to it.
+
+ The `file` argument is used to construct a pathname to the executable object
+ file or archive library of executable object files. If the `file` argument
+ contains a colon (``:``) the name of the executable object file in the library
+ follows and this file name may optionally end with ``@`` followed by a number
+ which is the absolute offset in the library file where the executable object
+ file starts. If an executable object file is not detected at the offset the
+ archive library's file table is searched.
+
+ If `file` is a null pointer, ``dlopen()`` returns a global symbol table
+ handle. This `handle` provides access to the global symbols from an ordered
+ set of executable object files consisting of the original base image file, the
+ set of executable object files loaded using ``dlopen()`` operations with the
+ ``RTLD_GLOBAL`` flag, and any dependencies loaded. As the latter sets of
+ executable object files can change during execution, the set of symbols made
+ available by this `handle` can also change dynamically.
+
+ Only a single copy of an executable object file is brought into the address
+ space, even if ``dlopen()`` is invoked multiple times in reference to the
+ executable object file, and even if different pathnames are used to reference
+ the executable object file.
+
+ Unresolved external symbols do not cause an error to be returned allowing the
+ loading of jointly dependent executable object files.
+
+ If ``dlopen()`` fails, it returns a null pointer, and sets an error condition
+ which may be interrogated with ``dlerror()``.
+
+ The `mode` parameter describes how ``dlopen()`` operates upon `file` with
+ respect to the processing of relocations and the scope of visibility of the
+ symbols provided within `file`. When an executable object file is brought into
+ the address space, it may contain references to symbols whose addresses are
+ not known until the executable object file is loaded.
+
+ If a loaded executable object file and any dependent executable object files
+ loaded with it contain any initialiser functions, they are called in the order
+ loaded before ``dlopen()`` returns.
+
+ The modes ``RTLD_LAZY`` and ``RTLD_NOW`` do not effect the type of relocation
+ performed, it is same for both modes. All relocations of an executable object
+ file and any dependent executable object files loaded with it are completed
+ before the ``dlopen()`` call returns. The execution performance of the code
+ loaded can be considered deterministic once ``dlopen()`` has returned.
+
+ Any executable object file loaded by ``dlopen()`` can reference global symbols
+ in the base image, any executable object files loaded included in the same
+ ``dlopen()`` invocation, and any executable object files that were loaded in
+ any ``dlopen()`` invocation and which specified the ``RTLD_GLOBAL`` flag. To
+ determine the scope of visibility for the symbols loaded with a ``dlopen()``
+ invocation, the `mode` parameter should be a bitwise-inclusive ``OR`` with one
+ of the following values:
+
+ ``RTLD_GLOBAL``
+ The executable object file's symbols are made available for relocation
+ processing of any other executable object file. In addition, symbol lookup
+ using ``dlopen(NULL,mode)`` and an associated ``dlsym()`` allows
+ executable object files loaded with this mode to be searched.
+
+ ``RTLD_LOCAL``
+ The executable object file's symbols shall not be made available for
+ relocation processing of any other executable object files.
+
+ If neither ``RTLD_GLOBAL`` nor ``RTLD_LOCAL`` is specified, the default
+ behavior is unspecified.
+
+ If ``RTLD_GLOBAL`` has been specified, the executable object file maintains
+ it's ``RTLD_GLOBAL`` status regardless of any previous or future specification
+ of ``RTLD_LOCAL``, as long as the executable object file remains in the
+ address space.
+
+ Symbols introduced through calls to ``dlopen()`` may be used in relocation
+ activities. Symbols that duplicate symbols already defined by the base image
+ or previous ``dlopen()`` calls are treated as an error and the object file is
+ not loaded. Symbols introduced through loading dependent executable object
+ files are ignored or not loaded depending on the method used to build the
+ executable object files.
+
+ The symbols introduced by ``dlopen()`` operations and available through
+ ``dlsym()`` are at a minimum those which are exported as identifiers of global
+ scope by the executable object file. Typically, such identifiers shall be
+ those that were specified in (for example) C source code as having ``extern``
+ linkage.
+
+.. _dlclose:
+.. index:: dlclose
+``int dlclose(void* handle);``
+ Releases a reference to the executable object file referenced by `handle`. If
+ the reference count drops to ``0``, the executable object file's global symbol
+ table is made unavailable. When all references to the global symbols the
+ executable object file provided have been removed the object file is removed
+ from the address space.
+
+ If the executable object being removed has any termination routines in it they
+ are called.
+
+.. _dlsym:
+.. index:: dlsym
+``void* dlsym(void* handle, const char* symbol);``
+ The ``dlsym()`` function obtains the address of a symbol (a function identifier
+ or a data object identifier) defined in the symbol table identified by the
+ handle argument. The handle argument is a symbol table handle returned from a
+ call to ``dlopen()`` (and which has not since been released by a call to
+ ``dlclose()``), and name is the symbol's name as a character string. The return
+ value from ``dlsym()``, cast to a pointer to the type of the named symbol, can
+ be used to call (in the case of a function) or access the contents of (in the
+ case of a data object) the named symbol.
+
+ The ``dlsym()`` function searches for the named symbol in the symbol table
+ referenced by handle and returns the address of the code or data location
+ specified by the null-terminated character string symbol. Which libraries and
+ objects are searched depends on the `handle` parameter.
+
+ Upon successful completion, if name names a function identifier, ``dlsym()``
+ returns the address of the function converted from type pointer to function to
+ type pointer to ``void``; otherwise, ``dlsym()`` shall return the address of
+ the data object associated with the data object identifier named by name
+ converted from a pointer to the type of the data object to a pointer to
+ ``void``. If `handle` does not refer to a valid symbol table handle or if the
+ symbol named by name cannot be found in the symbol table associated with
+ `handle`, ``dlsym()`` shall return a null pointer.
+
+.. _dlinfo:
+.. index:: dlinfo
+``int dlinfo(void* handle, int request, void* args);``
+
+ The ``dlinfo()`` function provides information about dynamically loaded object.
+ The action taken by ``dlinfo()`` and exact meaning and type of the argument
+ `args` depend on value of the `request` argument provided by the caller.
+
+ ``RTLD_DI_UNRESOLVED``
+ Return ``1`` in an indexer value pointed to by `args` if the symbol table
+ handle has unresolved relocation records to symbols. If the `handle` is the
+ global symbol table handle or ``RTLD_SELF`` return ``1`` if any unresolved
+ relocation records to symbols are present in any loaded executable object
+ files..
+
+.. _dlerror:
+.. index:: dlerror
+``const char *dlerror(void);``
+ The ``dlerror()`` function returns a null-terminated character string (with no
+ trailing ``<newline>``) that describes the last error that occurred during
+ dynamic linking processing. If no dynamic linking errors have occurred since
+ the last invocation of ``dlerror()``, ``dlerror()`` returns ``NULL``. Thus,
+ invoking ``dlerror()`` a second time, immediately following a prior
+ invocation, results in ``NULL`` being returned.
+
+This example opens an object file, checks for any unresolved symbols the object
+file may have, locates a global symbol in the object file, calls it then closes
+the object file:
+
+.. code-block:: c
+
+ #include <stdbool.h>
+ #include <stdio.h>
+ #include <dlfcn.h>
+
+ typedef int (*call_sig)(void);
+
+ bool load_object (void)
+ {
+ void* handle;
+ call_sig call;
+ int unresolved;
+
+ handle = dlopen ("/code.o", RTLD_NOW | RTLD_GLOBAL);
+ if (handle == NULL)
+ {
+ printf ("dlopen failed: %s\n", dlerror ());
+ return false;
+ }
+
+ if (dlinfo (handle, RTLD_DI_UNRESOLVED, &unresolved) < 0)
+ {
+ printf ("dlinfo failed: %s\n", dlerror ());
+ dlclose (handle);
+ return false;
+ }
+
+ if (unresolved != 0)
+ {
+ printf ("object.o has unresolved external symbols\n");
+ dlclose (handle);
+ return false;
+ }
+
+ call = dlsym (handle, "foo");
+ if (call == NULL)
+ {
+ printf("dlsym failed: symbol 'foo' not found\n");
+ dlclose (handle);
+ return false;
+ }
+
+ printf ("'foo()' returns: %i\n", call ());
+
+ if (dlclose (handle) < 0)
+ {
+ printf("dlclose failed: %s\n", dlerror());
+ return false;
+ }
+
+ return true;
+ }
+
+Symbols
+-------
+.. index:: symbol
+.. index:: global symbol
+.. index:: function identifier
+.. index:: data object identifier
+
+The RTEMS link editor manages the symbols for the base image and all resident
+executable object files. A symbol is an identifier string and a pointer value to
+a function identifier or a data object identifier. The symbols held in the
+symbol tables are used in the relocation of executable object files or they can
+be accessed by application code using the :ref:`dlsym() <dlsym>` call.
+
+.. index:: orphaned object file
+
+An executable object file's symbols are removed from the global symbol table
+when it is closed or orphaned. An executale object file cannot be unloaded if a
+symbol it provides is referenced by another object and that object is still
+resident. An executable object file that has no references to any of its symbols
+and was not explicitly loaded using the :ref:`dlopen() <dlopen>` call is
+orphaned and automatically removed from the address space.
+
+Base Image Symbols
+^^^^^^^^^^^^^^^^^^
+.. index:: base image symbols
+
+The base image symbol table provides access to the function and data objects
+statically linked into the base image. Loaded executable object files can be
+directly linked to the code and data resident in the base image.
+
+A statically linked RTEMS executable does not contain a symbol table, it has to
+be generated and either embedded into the executable or loaded as a specially
+created executable object file.
+
+The base image symbol table is dependent on the contents of the base image and
+this is not known until it has been linked. This means the base image symbol
+table needs to be constructed after the base image executable has been linked
+and the list of global symbols is known.
+
+The RTEMS Tools command :program:`rtems-syms` (see :ref:`RTEMSSymbols`) extracts
+the global and weak symbols from an RTEMS static executable file, creates a C
+file and compiles it creating a relocatable executable object file. This file
+can be linked with the static executable's object files and libraries to create
+a static executables with an embedded symbol table or the executable file can be
+loaded dynamically at run-time. The following needs to be observed:
+
+#. The option ``-e`` or ``--embedded`` to :program:`rtems-syms` creates an
+ executable object file to be embedded in the base image and not providing
+ either of these options creates a symbols executable object file that is
+ loaded at run-time. The same executable object file cannot be used to
+ embedded or load.
+
+#. The target C compiler and machine options need to be provided to make sure
+ the correct ABI for the target is used. See :ref:`MachineFlagsandABI`.
+
+.. index:: embedded symbol table
+.. _EmbbeddedSymbolTable:
+
+Embedded Symbols
+^^^^^^^^^^^^^^^^
+
+An embedded symbol table is *embedded* within the base image executable file and
+loaded when the static executable is loaded into memory by the bootloader. The
+symbol table is automatically added to the link editor's global symbol table
+when the first executable object file is loaded.
+
+The process to embed the symbol table requires linking the base image twice.
+The first link is to create a static executable that collects together the
+symbols to make the symbol table. The RTEMS Tools command `rtems-syms` extracts
+the global and weak symbols from the static executable ELF file, creates a C
+file and compiles it to create an executable object file. The base image is
+linked a second time and this time the symbol table executable object file is
+added to the list of object files.
+
+Embedding the symbol table means the chances of the symbol table and base image
+not matching is low, however it also means the symbol table is always present in
+the kernel image when dynamic loading may be optional. A project's build system
+is made more complex as it needs to have extra steps to link a second time.
+
+This example shows creating an embedded symbol table object file and linking it
+into the base image.
+
+.. code-block:: shell
+
+ $ sparc-rtems5-gcc -mcpu=cypress foo.o -lrtemsbsp -lrtemscpu -o foo.pre
+ $ rtems-syms -e -C sparc-rtems5-gcc -c "-mcpu=cypress" -o foo-sym.o foo.pre
+ $ sparc-rtems5-gcc -mcpu=cypress foo.o foo-sym.o -lrtemsbsp -lrtemscpu -o foo.exe
+
+The link command line steps in this example are not complete.
+
+.. _LoadableSymbolTable:
+
+Loadable Symbols
+^^^^^^^^^^^^^^^^
+
+A run-time loaded symbol table is the default for the command
+:program:`rtems-syms`. The symbol table executable object file is packaged with
+the other files to be dynamically loaded at run-time and placed on the target's
+file system. It needs to be loaded before any other executable object file are
+loaded or unresolved symbols can occur that will not be resolved.
+
+A run-time loaded symbol table does not consume any target resources until it is
+loaded. This is useful in a system that optionally needs to dynamically load
+code, for example as a development environment. The symbol table executable
+needs to exactly match the base image loading it or the behavior is
+unpredictable. No checks are made.
+
+The example shows creating and loading a symbol table executable object
+file. First create the symbol table's executable object file:
+
+.. code-block:: shell
+
+ $ sparc-rtems5-gcc -mcpu=cypress foo.o -lrtemsbsp -lrtemscpu -o foo.exe
+ $ rtems-syms -C sparc-rtems5-gcc -c "-mcpu=cypress" -o foo-sym.o foo.exe
+
+The link command line steps in this example are not complete.
+
+Load the symbol table:
+
+.. code-block:: c
+
+ #include <stdbool.h>
+ #include <stdio.h>
+ #include <dlfcn.h>
+
+ bool load (void)
+ {
+ void* handle = dlopen ("/foo-sym.o", RTLD_NOW | RTLD_GLOBAL);
+ if (handle == NULL)
+ {
+ printf ("failed to load the symbol table: %s\n", dlerror ());
+ return false;
+ }
+ return true;
+ }
+
+Unresolved Symbols
+------------------
+
+The RTEMS link editor does not return an error when an executable object file is
+loaded with unresolved symbols. This allows dependent object files to be
+loaded. For example an executable object file :file:`foo.o` contains the
+function ``foo()`` and that function calls ``bar()`` and an executable object
+file :file:`bar.o` contains a function ``bar()`` that calls the function
+``foo()``. Either of these executable object files can be loaded first as long
+both are loaded before any symbols are accessed.
+
+The link editor defers the resolution of unresolved symbols until the symbol is
+available in the global symbol table. Executing code or accessing data in a
+loaded executable object file with unresolved external symbols results in
+unpredictable behavior.
+
+All unresolved symbols are checked after an executable object file has been
+loaded. If a symbol is found and resolved any relocations that reference the
+symbol are fixed. If valid library files have been configured the symbol table's
+of each library are searched and if the symbol is found the dependent executable
+object file is loaded. This process repeats until no more symbols can be
+resolved.
+
+The ``dlinfo()`` call can be used to see if a loaded executable object file has
+any unresolved symbols:
+
+.. code-block:: c
+
+ #include <stdbool.h>
+ #include <stdio.h>
+ #include <dlfcn.h>
+
+ bool has_unresolved(void* handle)
+ {
+ int unresolved;
+ if (dlinfo (handle, RTLD_DI_UNRESOLVED, &unresolved) < 0)
+ {
+ printf ("dlinfo failed: %s\n", dlerror ());
+ return false;
+ }
+ return unresolved != 0;
+ }
+
+The handle ``RTLD_SELF`` checks for any unresolved symbols in all resident
+object files:
+
+.. code-block:: c
+
+ if (has_unresolved(RTLD_SELF))
+ printf("system has unsolved symbols\n");
+
+Libraries
+---------
+
+The RTEMS link editor supports loading executable object files from
+libraries. Executable object files can be explicitly loaded from a library using
+a specific path to :ref:`dlopen() <dlopen>` and treated the same as loading a
+stand alone executable object file. Libraries can be searched and an executable
+object file containing the search symbol can be loaded automatically as a
+dependent executable object file. A dependent executable object file loaded from
+a library with no symbol references to it's symbols is orphaned and
+automatically unloaded and removed from the address space.
+
+.. _fig-dl-libs:
+
+.. figure:: ../../images/user/libdl-load.png
+ :width: 65%
+ :alt: Loading Executable Object Files
+ :figclass: align-center
+
+ Loading Executable Object Files
+
+A library is an archive format file created using the RTEMS architecture
+prefixed :program:`ar` command. The RTEMS tool suite provides the :program:`ar`
+program and system libraries such as :file:`libc.a` and :file:`libm.a` for each
+architecture and ABI. Libraries used by the RTEMS link editor for searching must
+contain a symbol table created by the :program:`ranlib` program from the RTEMS
+tool suite.
+
+Searching a library's symbol table and loading an executable object file
+containing the symbol is called *dependent loading*. Dependent loading provides
+a simple way to manage the dependencies when loading an executable object
+file. If code in an executable object file references functions or data objects
+that are part of a library and the symbols are not part of the base image those
+symbols will not resolve unless the library is on the target and available for
+searching and loading. Dependent loading from libraries on the target provides a
+simple and understandable way to manage the dependency issue between the base
+image, loaded code and the system libraries.
+
+The RTEMS link editor checks for the configuration file :file:`/etc/libdl.conf`
+on each call to :ref:`dlopen() <dlopen>`. If the file has changed since the last
+check it is loaded again and the contents processed. The file format is:
+
+#. Comments start with the ``#`` character.
+#. A line is a wildcard path of libraries to search for. The wildcard search
+ uses the ``fnmatch()`` call. The ``fnmatch()`` function matches patterns
+ according to the rules used by a shell.
+
+Files that match the search pattern are verified as a library and if a symbol
+table is found it is loaded and the symbols it contains made search-able.
+
+A call to :ref:`dlopen() <dlopen>` checks the global symbols table and any
+references to relocation symbols not found are *unresolved* and added to the
+unresolved symbol table. Once the executable object file is loaded the link
+editor attempts to resolve any unresolved symbols. The unresolved symbol
+resolver checks each unresolved symbol against the global symbol table and if
+not found the available library symbol tables are searched. If a symbol is found
+in a library the dependent executable object file is loaded. The process repeats
+until all unresolved symbols have been resolved and the remaining unresolved
+symbols are not in the global symbol table or any libraries. The loading of a
+library executable object file will resolve at least one symbol and it may add
+more unresolved symbols requiring further searching of the libraries.
+
+.. index:: strip library
+
+A library of executable object files built by the RTEMS Tool suite can contain
+debug information and this should be stripped before loading on to the
+target. The tool suite's command :program:`strip` can strip all the object files
+in a library with a single command.
+
+.. code-block:: shell
+
+ $ sparc-rtems5-strip libc.a
+
+Large Memory
+------------
+
+The RTEMS link editor supports large memory relocations. Some architectures have
+instructions where the relative branch or jump offset from the instruction to
+the target address is limited. These instructions provide improved performance
+because less code generated compared to larger instructions which contain full
+address space references. The compact code helps lower cache pressure as well
+and providing improved performance for localalized functions and loops. The
+compiler defaults to generating the smaller instruction and as the final address
+map not known when generating the code, linkers need to provide glue code to
+extend the small address range to the enitre address space. This is called a
+trampoline. A trampoline is transparent to the execution of the code.
+
+The link editor parses an executable object file's relocation records to
+determine the number of trampolines needed. Added to this value are all
+unresolved symbols present in an executable object file after it is
+loaded. There is a slot allocated even if the symbol ends up being within range
+as there is no way to determine a symbol's address until it is loaded and the
+range calculated.
+
+The trampoline table is allocated a separate block of memory to the executable
+object file's text, data and constants memory. The trampoline parsing requires
+the executable object file's instructions (text) be in memory as the
+instructions are inspected by the architecture specific relocation support to
+determine an instruction's range. As a result the allocation for the trampoline
+table has to occur after the text memory has been allocated. Most instructions
+have relative offsets and the trampoline table is allocated at one end limiting
+the size of an object to half the maximum range.
+
+Trampolines support is available for the ARM and PowerPC architectures. The
+SPARC and Intel x86 architectures do not need trampolines and MIPS needs support
+added.
+
+Allocator
+---------
+
+The RTEMS link editor supports custom allocators. A custom allocator lets you
+manage the memory used by the RTEMS link editor as it runs. Allocators could
+provide:
+
+#. Support for the various types of memory that can be allocated allowing
+ specialised target support for specific use cases.
+#. Locking of read-only memory. The link editor unlocks read-only memory when it
+ needs to write to it.
+#. Separation of memory holding code and data from the heap.
+
+The allocator can be hooked using the ``rtems_rtl_alloc_hook`` call before any
+calls to :ref:`dlopen() <dlopen>` are made. The hook call returns the current
+allocate allowing the allocators to be chained.
+
+The default allocator uses the heap.
+
+.. _rtems_rtl_alloc_tags:
+.. index:: rtems_rtl_alloc_tags
+
+The allocator tags specify the type of memory the allocator is handling. The tag
+used to allocate memory at an address must be used when making allocator
+calls. The ``rtems_rtl_alloc_tags`` are:
+
+ .. index:: RTEMS_RTL_ALLOC_OBJECT
+ ``RTEMS_RTL_ALLOC_OBJECT``
+ Allocate a generic object. The link editor uses this memory for data
+ structures it uses to manage the linking process and resident executable
+ object files.
+
+ .. index:: RTEMS_RTL_ALLOC_SYMBOL
+ ``RTEMS_RTL_ALLOC_SYMBOL``
+ Allocate memory to hold symbol data.
+
+ .. index:: RTEMS_RTL_ALLOC_EXTERNAL
+ ``RTEMS_RTL_ALLOC_EXTERNAL``
+ Allocate memory for unresolved external symbols.
+
+ .. index:: RTEMS_RTL_ALLOC_READ
+ ``RTEMS_RTL_ALLOC_READ``
+ Allocate memory for read-only data such as constants and exception tables.
+
+ .. index:: RTEMS_RTL_ALLOC_READ_WRITE
+ ``RTEMS_RTL_ALLOC_READ_WRITE``
+ Allocate memory for read-write data such as a initialised, uninitialized and
+ common variables.
+
+ .. index:: RTEMS_RTL_ALLOC_READ_EXEC
+ ``RTEMS_RTL_ALLOC_READ_EXEC``
+ Allocate memory for code to be executed in. The address space is configure for
+ read and execute.
+
+.. _rtems_rtl_alloc_cmd:
+.. index:: rtems_rtl_alloc_cmd
+
+The commands are used to control the action the allocator performs. The
+``rtems_rtl_alloc_cmd`` are:
+
+ .. index:: RTEMS_RTL_ALLOC_NEW
+ ``RTEMS_RTL_ALLOC_NEW``
+ Allocate memory of the ``tag`` type. Returns ``NULL`` if the allocation fails.
+
+ .. index:: RTEMS_RTL_ALLOC_DEL
+ ``RTEMS_RTL_ALLOC_DEL``
+ Delete a previous allocation freeing the memory. The ``tag`` has to match
+ address of the memory being deleted.
+
+ .. index:: RTEMS_RTL_ALLOC_WR_ENABLE
+ ``RTEMS_RTL_ALLOC_WR_ENABLE``
+ Enable writes to a region of memory previously allocated. The ``tag`` has to
+ match the address of the memory being write enabled. The link editor may call
+ issue this command for memory that is already write enabled.
+
+ .. index:: RTEMS_RTL_ALLOC_WR_DISABLE
+ ``RTEMS_RTL_ALLOC_WR_DISABLE``
+ Disable writes to a region of memory previously allocated. The ``tag`` has to
+ match address of the memory being write disabled. The link editor may call
+ issue this command for memory that is writable and not to be write
+ disabled. The allocator need to manage this case.
+
+.. _rtems_rtl_allocator:
+.. index:: rtems_rtl_allocator
+
+The allocator handler is a single call to handle all allocator requests. The
+handler called on evey allocation action made by the link editor. The type of
+the function you need is:
+
+.. code-block:: c
+
+ typedef void (*rtems_rtl_allocator)(rtems_rtl_alloc_cmd cmd,
+ rtems_rtl_alloc_tag tag,
+ void** address,
+ size_t size);
+
+
+The arguments are:
+
+``cmd``
+ The command to action. See :ref:`_rtems_rtl_alloc_cmd`.
+
+``tag``
+ The type of memory the command is for. The ``tag`` must match the address for
+ commands other than ``RTEMS_RTL_ALLOC_OBJECT``.
+
+``address``
+ Pointer to the address. This is set of the ``RTEMS_RTL_ALLOC_OBJECT`` command
+ and read for the other commands. The ``tag`` must match the address for
+ commands that read the address from the pointer.
+
+``size``
+ The size of the memory to allocate. This is only valid for the
+ ``RTEMS_RTL_ALLOC_OBJECT`` command.
+
+The call to hook the allocator is:
+
+.. code-block:: c
+
+ rtems_rtl_allocator rtems_rtl_alloc_hook (rtems_rtl_allocator handler);
+
+The current allocator is returned. You can provide a full allocator or you can
+filter commands.
+
+Languages
+---------
+
+C is supported.
+
+C++ is supported. Initializer functions are called when an object is loaded and
+finalizer functions are called before it is unloaded and removed. Static
+constructions are initializer functions and static destructors are finalizer
+functions.
+
+C++ exceptions are handled across modules. The compiler generated exception
+tables present in an executable object file are registered with the architecture
+specific mechanism when loaded and deregistered when unloaded. An exception
+thrown in loaded code can be caught in the base image or another loaded
+module. If you are using C++ and exceptions it is recommended some exception
+code is added to the base image to place the architecture specific support in
+the base image.
+
+Thread Local Storage
+--------------------
+
+Thread local storage (TLS) is currenly not supported by the RTEMS link
+editor. The RTEMS executive needs to have a special allocator added to manage
+dynamically allocating TLS variables in a thread.
+
+If you need TLS support in dynamically loaded code please consider the RTEMS
+support options.
+
+Architectures
+-------------
+
+The following architectures are supported:
+
+ - ARM
+ - Blackfin
+ - H8300
+ - Intel x86 (i386)
+ - LM32
+ - M68K
+ - MIPS
+ - Moxie
+ - PowerPC
+ - SPARC
+ - V850
+
+ARM
+^^^
+
+The ARM relocation backend supports veneers which is trampolines.
+
+The veneer implementation is a single instruction and a 32bit target address
+making the overhead 8 bytes for each veneer. The performance overhead is a
+single instruction.
+
+PowerPC
+^^^^^^^
+
+The PowerPC relocation backend support trampolines and small data.
+
+The trampoline is four instructions and uses register 12 which the PowerPC ABI
+reserves for scratch use. The implementation loads the counter register and
+branches to the address it contains. The trampoline size is 16 bytes. The
+performance overhead is four instructions.
+
+The PowerPC relocation backend also supports small data. The sections of an
+executable object file are parsed and small data are tagged as needing
+architecture specific allocations. These sections are not allocated as part of
+the standard section allocation. Small data sections are allocated in the global
+small data region of memory. The size of this region is defined in the BSP's
+linker command file by setting ``bsp_section_small_data_area_size`` variable:
+
+.. code-block:: c
+
+ bsp_section_small_data_area_size = 65536;
+
+The maximum size of the small data region is 65536 bytes. It is recommended code
+built for loading uses the same settings for small base as the base image.
diff --git a/user/installation/developer.rst b/user/installation/developer.rst
index 70d676e..90912cc 100644
--- a/user/installation/developer.rst
+++ b/user/installation/developer.rst
@@ -90,6 +90,8 @@ requires:
If you are unsure how to specify the build set for the architecture you wish to
build, just ask the tool:
+.. code-block:: shell
+
$ ../source-builder/sb-set-builder --list-bsets <1>
RTEMS Source Builder - Set Builder, v4.11.0
Examining: config
@@ -750,4 +752,3 @@ Install the kernel to our prefix:
make[1]: Leaving directory '/c/opt/rtems/kernel/pc686'
/c/opt/rtems/kernel/pc686
$
-
diff --git a/user/tools/exeinfo.rst b/user/tools/exeinfo.rst
index 5e5948c..1913986 100644
--- a/user/tools/exeinfo.rst
+++ b/user/tools/exeinfo.rst
@@ -2,6 +2,8 @@
.. Copyright (C) 2017 Chris Johns <chrisj@rtems.org>
+.. _RTEMSExecutableInfomation:
+
RTEMS Executable Infomation
===========================
diff --git a/user/tools/linker.rst b/user/tools/linker.rst
index 73c892b..474f3c9 100644
--- a/user/tools/linker.rst
+++ b/user/tools/linker.rst
@@ -2,6 +2,8 @@
.. Copyright (C) 2017 Chris Johns <chrisj@rtems.org>
+.. _RTEMSLinker:
+
RTEMS Linker
============
diff --git a/user/tools/symbols.rst b/user/tools/symbols.rst
index 869baf2..16e04ee 100644
--- a/user/tools/symbols.rst
+++ b/user/tools/symbols.rst
@@ -2,6 +2,8 @@
.. Copyright (C) 2017 Chris Johns <chrisj@rtems.org>
+.. _RTEMSSymbols:
+
RTEMS Symbols
=============