From a9afb247ced5127f9930e06f4b59b62324ac7f7f Mon Sep 17 00:00:00 2001
From: David Oberhollenzer <goliath@infraroot.at>
Date: Fri, 10 Jan 2020 18:06:37 +0100
Subject: Initial commit

Signed-off-by: David Oberhollenzer <goliath@infraroot.at>
---
 README.md     |  22 ++
 crosscc.md    | 688 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
 download.sh   |  30 +++
 elfstartup.md |  84 +++++++
 4 files changed, 824 insertions(+)
 create mode 100644 README.md
 create mode 100644 crosscc.md
 create mode 100755 download.sh
 create mode 100644 elfstartup.md

diff --git a/README.md b/README.md
new file mode 100644
index 0000000..820fd17
--- /dev/null
+++ b/README.md
@@ -0,0 +1,22 @@
+# Building a small Raspberry Pi System from Scratch
+
+What you are looking at right now is a collection of instructions on how to
+bootstrap a tiny system for a Raspberry Pi 3 board.
+
+We will bootstrap the system by building our own cross compiler toolchain
+and then using it to cross compile everything we need for a working Linux
+based OS.
+
+In contrast to similar guides, I try to explain why we are doing the things
+the way we are doing them, instead of just throwing a bunch of copy-paste
+command lines around (I'm looking at you, LFS).
+
+This guide is divided into the following parts:
+
+* [Building a cross compiler toolchain](crosscc.md).
+* Cross compiling a statically linked BusyBox and the kernel. The kernel has
+  a built-in initrd containing our BusyBox. We will make it boot on the
+  Raspberry Pi and explore some parts of the Linux boot process.
+* Building a more sophisticated userland. The userland will be packed into a
+  SquashFS image. The BusyBox based initrd is modified to mount it and switch
+  into it.
diff --git a/crosscc.md b/crosscc.md
new file mode 100644
index 0000000..f8bb254
--- /dev/null
+++ b/crosscc.md
@@ -0,0 +1,688 @@
+# Building a Cross Compiler Toolchain
+
+As it turns out, building a cross compiler toolchain with recent GCC and
+binutils is a lot easier nowadays than it used to be.
+
+I'm building the toolchain on an AMD64 (aka x86_64) system. The steps have
+been tried on [Fedora](https://getfedora.org/) as well as on
+[OpenSUSE](https://www.opensuse.org/).
+
+The toolchain we are building generates 32 bit ARM code intended to run on
+a Raspberry Pi 3. [Musl](https://www.musl-libc.org/) is used as the C standard
+library implementation. It should be possible to adapt the instructions provided
+here to other systems with some minor adjustments (i.e. read the manuals
+and do some thinking, don't just go ahead brainlessly).
+
+## Directory Setup
+
+First of all, you should create an empty directory somewhere where you want
+to build the cross toolchain and later the entire system.
+
+For convenience, we will store the absolute path to this directory inside a
+shell variable called **BUILDROOT** and create a few directories to organize
+our stuff in:
+
+    BUILDROOT=$(pwd)
+
+    mkdir -p "build" "src" "download" "toolchain/bin" "sysroot"
+
+I stored the downloaded packages in the **download** directory and extracted
+them to a directory called **src**.
+
+We will later build packages outside the source tree (GCC even requires that
+nowadays), inside a sub directory of **build**.
+
+Our final toolchain will end up in a directory called **toolchain**.
+
+We store the toolchain location inside another shell variable that I called
+**TCDIR** and prepend the executable path of our toolchain to the **PATH**
+variable:
+
+    TCDIR="$BUILDROOT/toolchain"
+    export PATH="$TCDIR/bin:$PATH"
+
+
+The **sysroot** directory will hold the cross compiled binaries for our target
+system, as well as headers and libraries used for cross compiling stuff. It is
+basically the `/` directory of the system we are going to build. For
+convenience, we will also store its absolute path in a shell variable:
+
+	SYSROOT="$BUILDROOT/sysroot"
+
+## Prerequisites
+
+The following source packages are required for building the toolchain. The
+links below point to the exact versions that I used.
+
+* [Linux](https://github.com/raspberrypi/linux/archive/raspberrypi-kernel_1.20190925-1.tar.gz).
+  Linux is a very popular OS kernel that happens to run on our target system.
+  We need it to build the C standard library for our toolchain.
+* [Musl](https://www.musl-libc.org/releases/musl-1.1.24.tar.gz). A tiny
+  C standard library implementation.
+* [Binutils](https://ftp.gnu.org/gnu/binutils/binutils-2.33.1.tar.xz). This
+  contains the GNU assembler, linker and various tools for working with
+  executable files.
+* [GCC](https://ftp.gnu.org/gnu/gcc/gcc-9.2.0/gcc-9.2.0.tar.xz), the GNU
+  compiler collection. Contains compilers for C and other languages.
+
+For compiling the packages you will need:
+
+* gcc
+* g++
+* make
+* flex
+* bison
+* gperf
+* makeinfo
+* ncurses (with headers)
+* awk
+* automake
+* help2man
+* curl
+* pkg-config
+* libtool
+
+
+In case you wonder: you need the C++ compiler to build GCC. The GCC code base
+mainly uses C99, but with some additional C++ features. makeinfo is used by
+the GNU packages to generate info pages from texinfo. ncurses is mainly
+needed by the kernel build system for `menuconfig`.
+
+The list should be fairly complete, but I can't guarantee that I didn't miss
+something. Normally I work on systems with tons of development tools and
+libraries already installed, so if something is missing, please install it
+and maybe let me know.
+
+## Downloading and unpacking everything
+
+Simply download the packages listed above into `download` and unpack them
+into `src`.
+
+For convenience, I provided a small shell script called `download.sh` that,
+when run inside `$BUILDROOT`, does this and also verifies the `sha256sum`
+of the packages, which will further make sure that you are using the **exact**
+same version as I am.
+
+Right now, you should have a directory tree that looks something like this:
+
+* build/
+* toolchain/
+   * bin/
+* src/
+   * binutils-2.33.1/
+   * gcc-9.2.0/
+   * musl-1.1.24/
+   * linux-raspberrypi-kernel_1.20190925-1/
+* download/
+   * binutils-2.33.1.tar.xz
+   * gcc-9.2.0.tar.xz
+   * musl-1.1.24.tar.gz
+   * raspberrypi-kernel_1.20190925-1.tar.gz
+* sysroot/
+
+For building GCC, we will need to download some additional support libraries.
+Namely gmp, mpfr, mpc and isl that have to be unpacked inside the GCC source
+tree. Luckily, GCC nowadays provides a shell script that will do that for us:
+
+	cd "$BUILDROOT/src/gcc-9.2.0"
+	./contrib/download_prerequisites
+	cd "$BUILDROOT"
+
+
+## Overview
+
+From now on, the rest of the process itself consists of the following steps:
+
+1. Installing the kernel headers to the sysroot directory.
+2. Compiling cross binutils.
+3. Compiling a minimal GCC cross compiler with minimal `libgcc`.
+4. Cross compiling the C standard library (in our case Musl).
+5. Compiling a full version of the GCC cross compiler with complete `libgcc`.
+
+The main reason for compiling GCC twice is the inter-dependency between the
+compiler and the standard library.
+
+First of all, the GCC build system needs to know *what* kind of C standard
+library we are using and *where* to find it. For dynamically linked programs,
+it also needs to know what loader we are going to use, which is typically
+also provided by the C standard library. For more details, you can read this
+high level overview of [how dynamically linked ELF programs are run](elfstartup.md).
+
+Second, there is [libgcc](https://gcc.gnu.org/onlinedocs/gccint/Libgcc.html).
+`libgcc` contains low level platform specific helpers (like exception handling,
+soft float code, etc.) and is automatically linked to programs built with GCC.
+Libgcc source code comes with GCC and is compiled by the GCC build system
+specifically for our cross compiler & libc combination.
+
+However, some functions in `libgcc` need functions from the C standard
+library. Conversely, some libc implementations directly use utility functions
+from `libgcc`, such as stack unwinding helpers (provided by `libgcc_s`).
+
+After building a GCC cross compiler, we need to cross compile `libgcc`, so we
+can *then* cross compile other stuff that needs `libgcc` **like the libc**. But
+we need an already cross compiled libc in the first place for
+compiling `libgcc`.
+
+The solution is to build a minimalist GCC that targets an internal stub libc
+and provides a minimal `libgcc` that has lots of features disabled and uses
+the stubs instead of linking against libc.
+
+We can then cross compile the libc and let the compiler link it against the
+minimal `libgcc`.
+
+With that, we can then compile the full GCC, pointing it at the C standard
+library for the target system and build a fully featured `libgcc` along with
+it. We can simply install it *over* the existing GCC and `libgcc` in the
+toolchain directory (dynamic linking to the rescue).
+
+### Autotools and the canonical target tuple
+
+Most of the software we are going to build is using autotools based build
+systems. There are a few things we should know when working with autotools
+based packages.
+
+GNU autotools makes cross compilation easy. This was especially important in
+the early days of the GNU project when there were dozens of incompatible
+Unices on widely varying hardware platforms and the GNU packages were supposed
+to build and run on all of them.
+
+Nowadays autotools has *decades* of practical use behind it and is, in my
+experience, a lot more mature than more modern build systems. Having a semi
+standard way of cross compiling stuff with standardized configuration knobs
+is very helpful.
+
+In contrast to many modern build systems, you don't need Autotools to run an
+Autotools based build system. The final build system it generates for the
+release tarballs just uses shell and `make`.
+
+### The configure script
+
+Pretty much every novice Ubuntu user has probably already seen this on Stack
+Overflow (and copy-pasted it) at least once:
+
+    ./configure
+    make
+    make install
+
+
+The `configure` shell script generates the actual `Makefile` from a
+template (`Makefile.in`) that is then used for building the package.
+
+The `configure` script itself and the `Makefile.in` are completely independent
+from autotools and were generated by `autoconf` and `automake`.
+
+If we don't want to clobber the source tree, we can also build a package
+*outside the source tree* like this:
+
+    ../path/to/source/configure
+    make
+
+When the package developers create a release tarball with `make distcheck`,
+the Autotools actually test whether the package can be built out-of-tree
+from a source directory that has all write permissions disabled.
+
+The `configure` script contains *a lot* of system checks and default flags that
+we can use for telling the build system how to compile the code.
+
+The main ones we need to know about for cross compiling are the following
+three options:
+
+* The **--build** option specifies what system we are *building* the
+  package on.
+* The **--host** option specifies what system the binaries will run on.
+* The **--target** option is specific to packages that contain compilers
+  and specifies what system to generate output for.
+
+Those options take as an argument a dash separated tuple that describes
+a system and is made up the following way:
+
+	<architecture>-<kernel>-<userspace>
+
+Our 32 bit ARM system, running a Linux kernel with a Musl based user space,
+is described like this:
+
+	arm-linux-musleabihf
+
+The user space component itself consists of two parts: `musl` indicating
+the libc we use and `eabihf` indicating the ARM ABI that GCC should target.
+
+For the build system that I used, the tuple looks like this:
+
+	x86_64-linux-gnu
+
+The final component `gnu` actually tells GCC that the system is using `glibc`.
+
+Both the user space component and the kernel component have exact meaning.
+Trying to insert clever free-form text for branding can mess up your build.
+If you really must, you can insert it between the architecture and the kernel,
+sort of like this:
+
+	x86_64-redhat-linux-gnu
+
+### The Makefile
+
+The generated Makefile also has a few tune-able knobs that we can use.
+
+Most importantly, we will use the `DESTDIR` variable that can be used to
+set a target directory where `make install` will install the programs.
+
+For instance, if `make install` would install a program to `/usr/bin/foo`,
+running `make DESTDIR=/tmp/test install` will instead install the program
+to `/tmp/test/usr/bin/foo`.
+
+The configure script has a similar option called **--prefix**. However, this
+works in a different way and actually controls the path prefix. The path in
+the **--prefix** will possibly be embedded in the program during compilation,
+while the `DESTDIR` variable does not affect compilation and changes the
+location of the root directory when installing.
+
+For instance if running the following:
+
+	./configure --prefix=/usr
+	make
+	make DESTDIR=/tmp/test install
+
+the program `foo` will be installed to `/tmp/test/usr/bin/foo` but the program
+and the build system "think" it has been installed to `/usr/bin/foo`.
+
+
+## Getting started
+
+At first, we set a few handy shell variables that will store the configuration
+of our toolchain:
+
+    TARGET="arm-linux-musleabihf"
+    HOST="x86_64-linux-gnu"
+    LINUX_ARCH="arm"
+    MUSL_CPU="arm"
+    GCC_CPU="armv6"
+
+The **TARGET** variable holds the *target triplet* of our system as described
+above.
+
+We also need the triplet for the local machine that we are going to build
+things on. For simplicity, I also set this manually.
+
+The **MUSL_CPU**, **GCC_CPU** and **LINUX_ARCH** variables hold the target
+CPU architecture. **LINUX_ARCH** is used for the kernel build system,
+**GCC_CPU** for the GCC build system and **MUSL_CPU** for Musl.
+
+If you want to dynamically determine the **HOST** tuple, I suggest using
+[config.guess](https://git.savannah.gnu.org/gitweb/?p=config.git;a=tree):
+
+	$ HOST=$(./config.guess)
+	$ echo "$HOST"
+	x86_64-pc-linux-gnu
+
+There are reasons for why this script exists and why it is that long. Even
+on Linux distributions, there is no consistent way to pull a machine triplet
+out of a shell one-liner.
+
+Some guides out there suggest using the Bash variable **MACHTYPE**:
+
+    $ echo "$MACHTYPE"
+    x86_64-redhat-linux-gnu
+
+The above is what I got on Fedora, however on Arch Linux I got this:
+
+    $ echo "$MACHTYPE"
+    x86_64
+
+Some other guides suggest using **OSTYPE**:
+
+    $ HOST=$(uname -m)-$OSTYPE
+    $ echo $HOST
+    x86_64-linux-gnu
+
+This works on Fedora and Arch Linux, but fails on OpenSUSE:
+
+	$ echo $OSTYPE
+	linux
+
+If you want to save yourself a lot of headaches, refrain from using such
+adhockery and simply use `config.guess`. I only listed this here to warn you,
+because I have seen some guides and tutorials out there using this nonsense.
+
+
+### Installing the kernel headers
+
+We create a build directory called **$BUILDROOT/build/linux**. Building the
+kernel outside its source tree works a bit differently compared to autotools
+based stuff.
+
+To keep things clean, we use a shell variable **srcdir** to remember where
+we kept the kernel source. A pattern that we will repeat later:
+
+    export KBUILD_OUTPUT="$BUILDROOT/build/linux"
+    mkdir -p "$KBUILD_OUTPUT"
+
+    srcdir="$BUILDROOT/src/linux-raspberrypi-kernel_1.20190925-1"
+
+    cd "$srcdir"
+    make O="$KBUILD_OUTPUT" ARCH="$LINUX_ARCH" headers_check
+    make O="$KBUILD_OUTPUT" ARCH="$LINUX_ARCH" INSTALL_HDR_PATH="$SYSROOT" headers_install
+    cd "$BUILDROOT"
+
+
+According to the Makefile in the Linux source, you can either specify an
+environment variable called **KBUILD_OUTPUT**, or set a Makefile variable
+called **O**, where the latter overrides the environment variable. The snippet
+above shows both ways.
+
+The *headers_check* target runs a few trivial sanity checks on the headers
+we are going to install. It checks if a header includes something nonexistent,
+if the declarations inside the headers are sane and if kernel internals are
+leaked into user space. For stock kernel tar-balls, this shouldn't be
+necessary, but could come in handy when working with kernel git trees,
+potentially with local modifications.
+
+Lastly (before switching back to the root directory), we actually install the
+kernel headers into the sysroot directory where the libc later expects them
+to be.
+
+The `sysroot` directory should now contain an `include` directory with a number
+of sub directories that contain kernel headers.
+
+Since I've seen the question in a few forums: it doesn't matter if the kernel
+version exactly matches the one running on your target system. The kernel
+system call ABI is stable, so you can use headers from an older kernel. Only
+if you use headers from a much newer kernel, the libc might end up exposing
+or using features that the kernel on your target does not yet support.
+
+If you have some embedded board with a heavily modified vendor kernel (such as
+in our case) and little to no upstream support, the situation is a bit more
+difficult and you may prefer to use the exact kernel.
+
+Even then, if you have some board where the vendor tree breaks the
+ABI **take the board and burn it** (preferably outside; don't inhale
+the fumes).
+
+### Compiling cross binutils
+
+We will compile binutils outside the source tree, inside the directory
+**build/binutils**. So first, we create the build directory and switch into
+it:
+
+    mkdir -p "$BUILDROOT/build/binutils"
+    cd "$BUILDROOT/build/binutils"
+
+    srcdir="$BUILDROOT/src/binutils-2.33.1"
+
+From the binutils build directory we run the configure script:
+
+    $srcdir/configure --prefix="$TCDIR" --target="$TARGET" \
+                      --with-sysroot="$SYSROOT" \
+                      --disable-nls --disable-multilib
+
+We use the **--prefix** option to actually let the toolchain know that it is
+being installed in our toolchain directory, that we are going to run it from
+there and that it should locate helper programs in there.
+
+We also set the **--target** option to tell the build system what target the
+assembler, linker and other tools should generate **output** for. We don't
+explicitly set the **--host** or **--build** because we are compiling binutils
+to run on the local machine.
+
+We would only set the **--host** option to cross compile binutils itself with
+an existing toolchain to run on a different system than ours.
+
+The **--with-sysroot** option tells the build system that the root directory
+of the system we are going to build is in `$SYSROOT` and it should look inside
+that to find libraries.
+
+We disable the feature **nls** (native language support, i.e. cringe worthy
+translations of error messages to your native language, such as Deutsch
+or 中文), mainly because we don't need it and not doing something typically
+saves time.
+
+Regarding the multilib option: Some architectures support executing code for
+other, related architectures (e.g. x86_64 CPUs can run 32 bit x86 code). On
+GNU/Linux distributions that support that, you typically have different
+versions of the same libraries (e.g. in *lib/* and *lib32/* directories) with
+programs for different architectures being linked to the appropriate
+libraries. We are only interested in a single architecture and don't need
+that, so we set **--disable-multilib**.
+
+
+Now we can compile and install binutils:
+
+    make configure-host
+    make
+    make install
+    cd "$BUILDROOT"
+
+The first make target, *configure-host* is binutils specific and just tells it
+to check out the system it is *being built on*, i.e. your local machine and
+make sure it has all the tools it needs for compiling. If it reports a problem,
+**go fix it before continuing**.
+
+We then go on to build the binutils. You may want to speed up compilation by
+running a parallel build with **make -j NUMBER-OF-PROCESSES**.
+
+Lastly, we run *make install* to install the binutils in the configured
+toolchain directory and go back to our root directory.
+
+The `toolchain/bin` directory should now already contain a bunch of executables
+such as the assembler, linker and other tools that are prefixed with the target
+triplet.
+
+There is also a new directory called `toolchain/arm-linux-musleabihf` which
+contains a secondary system root with programs that aren't prefixed and linker
+scripts.
+
+### First pass GCC
+
+Similar to above, we create a directory for building the compiler, change
+into it and store the source location in a variable:
+
+    mkdir -p "$BUILDROOT/build/gcc-1"
+    cd "$BUILDROOT/build/gcc-1"
+
+    srcdir="$BUILDROOT/src/gcc-9.2.0"
+
+Notice how the build directory is called *gcc-1*. For the second pass, we
+will later create a different build directory. Not only does this out of tree
+build allow us to cleanly start afresh (because the source is left untouched),
+but current versions of GCC will *flat out refuse* to build inside the
+source tree.
+
+    $srcdir/configure --prefix="$TCDIR" --target="$TARGET" --build="$HOST" \
+                      --host="$HOST" --with-sysroot="$SYSROOT" \
+                      --disable-nls --disable-shared --without-headers \
+                      --disable-multilib --disable-decimal-float \
+                      --disable-libgomp --disable-libmudflap \
+                      --disable-libssp --disable-libatomic \
+                      --disable-libquadmath --disable-threads \
+                      --enable-languages=c --with-newlib \
+                      --with-arch="$GCC_CPU" --with-float=hard \
+                      --with-fpu=neon-vfpv3
+
+The **--prefix**, **--target** and **--with-sysroot** work just like above for
+binutils.
+
+This time we explicitly specify **--build** (i.e. the system that we are going
+to compile GCC on) and **--host** (i.e. the system that the GCC will run on).
+In our case those are the same. I set those explicitly for GCC, because the GCC
+build system is notoriously fragile. Yes, *I have seen* older versions of GCC
+throw a fit or assume complete nonsense if you don't explicitly specify those
+and at this point I'm no longer willing to trust it.
+
+The option **--with-arch** gives the build system slightly more specific
+information about the target processor architecture. The two options after that
+are specific to our target and tell the build system that GCC should use the
+hardware floating point unit and can emit NEON instructions for vectorization.
+
+We also disable a bunch of stuff we don't need. I already explained *nls*
+and *multilib* above. We also disable a bunch of optimization stuff and helper
+libraries. Among other things, we also disable support for dynamic linking and
+threads as we don't have the libc yet.
+
+The option **--without-headers** tells the build system that we don't have the
+headers for the libc *yet* and it should use minimal stubs instead where it
+needs them. The **--with-newlib** option is *more of a hack*. It tells the
+build system that we are going to use [newlib](http://www.sourceware.org/newlib/)
+as C standard library. This isn't actually true, but forces it to disable some
+[libgcc features that depend on the libc](https://gcc.gnu.org/ml/gcc-help/2009-07/msg00368.html).
+
+The option **--enable-languages** accepts a comma separated list of languages
+that we want to build compilers for. For now, we only need a C compiler for
+compiling the libc.
+
+If you are interested: [Here is a detailed list of all GCC configure options.](https://gcc.gnu.org/install/configure.html)
+
+Now, let's build the compiler and `libgcc`:
+
+    make all-gcc all-target-libgcc
+    make install-gcc install-target-libgcc
+
+    cd "$BUILDROOT"
+
+We explicitly specify the make targets for *GCC* and *cross-compiled libgcc*
+for our target. We are not interested in anything else.
+
+For the first make, you **really** want to specify a *-j NUM-PROCESSES* option
+here. Even the first pass GCC we are building here will take a while to compile
+on an ordinary desktop machine.
+
+
+### C standard library
+
+We create our build directory and change into it:
+
+    mkdir -p "$BUILDROOT/build/musl"
+    cd "$BUILDROOT/build/musl"
+
+    srcdir="$BUILDROOT/src/musl-1.1.24"
+
+Musl is quite easy to build but requires some special handling, because it
+doesn't use autotools. The configure script is actually a hand written shell
+script that tries to emulate some of the typical autotools handling:
+
+    CC="${TARGET}-gcc" $srcdir/configure --prefix=/ --target="$TARGET"
+
+We override the shell variable **CC** to point to the cross compiler that we
+just built. Remember, we added **$TCDIR/bin** to our **PATH**.
+
+We do the same thing for actually compiling musl and we explicitly set the
+**DESTDIR** variable for installing:
+
+    CC="${TARGET}-gcc" make
+    make DESTDIR="$SYSROOT" install
+
+    cd "$BUILDROOT"
+
+The important part here, that later also applies to autotools based stuff, is
+that we don't set **--prefix** to the sysroot directory. We set the prefix so
+that the build system "thinks" it compiles the library to be installed in `/`,
+but then we install the compiled binaries and headers to the sysroot directory.
+
+The `sysroot/include` directory should now contain a bunch of standard headers.
+Likewise, the `sysroot/lib` directory should now contain a `libc.so`, a bunch
+of dummy libraries, and the startup object code provided by Musl.
+
+The `sysroot/lib/ld-musl-armhf.so.1` is the loader for dynamically linked
+programs and in the case of Musl, just a symlink to `libc.so`.
+
+### Second pass GCC
+
+We are reusing the same source code from the first stage, but in a different
+build directory:
+
+    mkdir -p "$BUILDROOT/build/gcc-2"
+    cd "$BUILDROOT/build/gcc-2"
+
+    srcdir="$BUILDROOT/src/gcc-9.2.0"
+
+Most of the configure options should be familiar already:
+
+    $srcdir/configure --prefix="$TCDIR" --target="$TARGET" --build="$HOST" \
+                      --host="$HOST" --with-sysroot="$SYSROOT" \
+                      --disable-nls --enable-languages=c,c++ \
+                      --enable-c99 --enable-long-long \
+                      --disable-libmudflap --disable-multilib \
+                      --disable-libsanitizer --with-arch="$GCC_CPU" \
+                      --with-native-system-header-dir="/include" \
+                      --with-float=hard --with-fpu=neon-vfpv3
+
+For the second pass, we also build a C++ compiler. The options **--enable-c99**
+and **--enable-long-long** are actually C++ specific. When our final compiler
+runs in C++98 mode, we allow it to expose C99 functions from the libc through
+a GNU extension. We also allow it to support the *long long* data type
+standardized in C99.
+
+You may wonder why we didn't have to build a **libstdc++** between the
+first and second pass, like the libc. The source code for the *libstdc++*
+comes with the **g++** compiler and is built automatically like `libgcc`.
+On the one hand, it is really just a library that adds C++ stuff
+*on top of libc*, mostly header only code that is compiled with the actual
+C++ programs. On the other hand, C++ does not have a standard ABI and it is
+all compiler and OS specific. So compiler vendors will typically ship their
+own `libstdc++` implementation with the compiler.
+
+We **--disable-libsanitizer** because it simply won't build for musl. I tried
+fixing it, but it simply assumes too much about the nonstandard internals
+of the libc. A quick Google search reveals that it has **lots** of similar
+issues with all kinds of libc & kernel combinations, so even if I fix it on
+my system, you may run into other problems on your system or with different
+versions of packages. It even has different problems with different versions
+of glibc. Projects like buildroot simply disable it when using musl. It "only"
+provides the runtime libraries for the compiler's `-fsanitize` options.
+
+The option **--with-native-system-header-dir** is of special interest for our
+cross compiler. Since we pointed the root directory to **$SYSROOT**, the
+compiler will look for headers in **$SYSROOT/usr/include**, but we didn't
+install them to */usr/include*, we installed them to
+**$SYSROOT/include**, so we have to tell the build system that it should
+look in */include* (relative to the root directory) instead.
+
+All that's left now is building and installing the compiler:
+
+    make
+    make install
+
+    cd "$BUILDROOT"
+
+This time, we are going to build and install *everything*. You *really* want to
+do a parallel build here. On my AMD Ryzen based desktop PC, building with
+`make -j 16` takes about 3 minutes. On my Intel i5 laptop it takes circa 15
+minutes. If you are using a laptop, you might want to open a window (assuming
+it is cold outside, i.e. won't help if you are in Taiwan).
+
+### Testing the Toolchain
+
+We quickly write our average hello world program into a file called **test.c**:
+
+    #include <stdio.h>
+
+    int main(void)
+    {
+        puts("Hello, world");
+        return 0;
+    }
+
+We can now use our cross compiler to compile this C file:
+
+    $ ${TARGET}-gcc test.c
+
+Running the program `file` on the resulting `a.out` will tell us that it has
+been properly compiled and linked for our target machine:
+
+    $ file a.out
+    a.out: ELF 32-bit LSB executable, ARM, EABI5 version 1 (SYSV), dynamically linked, interpreter /lib/ld-musl-armhf.so.1, not stripped
+
+Of course, you won't be able to run the program on your build system. You also
+won't be able to run it on Raspbian or similar, because it has been linked
+against our cross compiled Musl.
+
+Statically linking it should solve the problem:
+
+    $ ${TARGET}-gcc -static test.c
+    $ file a.out
+    a.out: ELF 32-bit LSB executable, ARM, EABI5 version 1 (SYSV), statically linked, with debug_info, not stripped
+    $ readelf -d a.out
+
+    There is no dynamic section in this file.
+
+This binary now does not require any libraries, any interpreters and does
+system calls directly. It should now run on your favourite Raspberry Pi
+distribution as-is.
diff --git a/download.sh b/download.sh
new file mode 100755
index 0000000..eb253a2
--- /dev/null
+++ b/download.sh
@@ -0,0 +1,30 @@
+#!/bin/sh
+
+set -e
+
+KERNEL="raspberrypi-kernel_1.20190925-1.tar.gz"
+MUSL="musl-1.1.24.tar.gz"
+BINUTILS="binutils-2.33.1.tar.xz"
+GCC="gcc-9.2.0.tar.xz"
+
+mkdir -p "download" "src"
+
+curl -L "https://github.com/raspberrypi/linux/archive/$KERNEL" > \
+     "download/$KERNEL"
+curl -L "https://www.musl-libc.org/releases/$MUSL" > "download/$MUSL"
+curl -L "https://ftp.gnu.org/gnu/binutils/$BINUTILS" > "download/$BINUTILS"
+curl -L "https://ftp.gnu.org/gnu/gcc/gcc-9.2.0/$GCC" > "download/$GCC"
+
+cat > download.sha256 <<_EOF
+ab66fc2d1c3ec0359b8e08843c9f33b63e8707efdff5e4cc5c200eae24722cbf  download/binutils-2.33.1.tar.xz
+ea6ef08f121239da5695f76c9b33637a118dcf63e24164422231917fa61fb206  download/gcc-9.2.0.tar.xz
+1370c9a812b2cf2a7d92802510cca0058cc37e66a7bedd70051f0a34015022a3  download/musl-1.1.24.tar.gz
+295651137abfaf3f1817d49051815a5eb0cc197d0100003d10e46f5eb0f45173  download/raspberrypi-kernel_1.20190925-1.tar.gz
+_EOF
+
+sha256sum -c download.sha256
+
+tar -xf "download/$KERNEL" -C "src"
+tar -xf "download/$MUSL" -C "src"
+tar -xf "download/$BINUTILS" -C "src"
+tar -xf "download/$GCC" -C "src"
diff --git a/elfstartup.md b/elfstartup.md
new file mode 100644
index 0000000..37ee0db
--- /dev/null
+++ b/elfstartup.md
@@ -0,0 +1,84 @@
+# Running dynamically linked programs on Linux
+
+This section provides a high level overview of the startup process of a
+dynamically linked program on Linux.
+
+When using the `exec` system call to run a program, the kernel maps it into
+memory and tries to determine what kind of executable it is by looking at
+the magic number. Based on the type of executable, some data structures are
+parsed and the program is run. For a statically linked ELF program, this means
+fiddling the entry point address out of the header and jumping to it (with
+a kernel to user space transition of course).
+
+The kernel also supports exec-ing programs that require an interpreter to be
+run. This mechanism is also used for implementing dynamically linked programs.
+
+Similar to how scripts have an interpreter field (`#!/bin/sh`
+or `#!/usr/bin/perl`), ELF files can also have an interpreter field (the
+`PT_INTERP` program header). For dynamically linked ELF executables, the
+linker sets the interpreter field to the loader (`ld-linux.so` or similar).
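+
+As a rough illustration, looking up the interpreter of a file can be sketched
+in Python. This is a minimal sketch and not what the kernel actually does: the
+function name `read_interpreter` is made up for this example, it only looks at
+the `#!` line of a script or at the `PT_INTERP` program header of an ELF
+file, and it does not validate any header fields.
+
+```python
+import struct
+
+PT_INTERP = 3  # program header type that names the interpreter
+
+
+def read_interpreter(data):
+    """Return the interpreter named by a script or ELF image, or None.
+
+    Hypothetical helper for illustration; `data` is the raw file content.
+    """
+    if data.startswith(b"#!"):
+        # Script: the interpreter is the first word after "#!".
+        words = data[2:].split(b"\n", 1)[0].split()
+        return words[0].decode() if words else None
+    if not data.startswith(b"\x7fELF"):
+        return None
+
+    is64 = data[4] == 2                  # EI_CLASS: 1 = 32 bit, 2 = 64 bit
+    end = "<" if data[5] == 1 else ">"   # EI_DATA: 1 = little endian
+
+    # Location and layout of the program header table.
+    if is64:
+        phoff, = struct.unpack_from(end + "Q", data, 0x20)
+        phentsize, phnum = struct.unpack_from(end + "HH", data, 0x36)
+    else:
+        phoff, = struct.unpack_from(end + "I", data, 0x1C)
+        phentsize, phnum = struct.unpack_from(end + "HH", data, 0x2A)
+
+    for i in range(phnum):
+        base = phoff + i * phentsize
+        p_type, = struct.unpack_from(end + "I", data, base)
+        if p_type != PT_INTERP:
+            continue
+        # p_offset and p_filesz sit at different offsets in 32/64 bit files.
+        if is64:
+            offset, = struct.unpack_from(end + "Q", data, base + 8)
+            filesz, = struct.unpack_from(end + "Q", data, base + 32)
+        else:
+            offset, = struct.unpack_from(end + "I", data, base + 4)
+            filesz, = struct.unpack_from(end + "I", data, base + 16)
+        # The segment holds the path as a NUL terminated string.
+        return data[offset:offset + filesz].rstrip(b"\0").decode()
+
+    return None  # no PT_INTERP entry, e.g. a statically linked program
+```
+
+Calling `read_interpreter(open("/bin/sh", "rb").read())` on a typical
+dynamically linked system should return the path of the loader, while a
+statically linked binary yields `None`.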
+
+The `ld-linux.so` loader, which is typically provided by the `libc`
+implementation (e.g. Musl, glibc, ...), then maps the actual executable into
+memory with `mmap(2)`, parses the dynamic section and mmaps the used libraries
+(possibly recursively, since libraries may need other libraries), performs
+relocations where necessary and then jumps to the entry point address.
+
+The kernel itself actually has no concept of libraries. Thanks to this
+mechanism, it doesn't have to.
+
+The whole process of using an interpreter is actually done recursively. An
+interpreter can in turn also have an interpreter. For instance, if you exec
+a shell script that starts with `#!/bin/sh`, the kernel detects it to be a
+script (because it starts with `#!`), extracts the interpreter and then
+runs `/bin/sh <script-path>` instead. The kernel then detects that `/bin/sh`
+is an ELF binary (because it starts with `\x7fELF`) and extracts the
+interpreter field, which is set to `/lib/ld-linux.so`. So now the kernel
+tries to run `/lib/ld-linux.so /bin/sh <script-path>`. The `ld-linux.so` has
+no interpreter field set, so the kernel maps it into memory, extracts the
+entry point address and runs it.
+
+If `/bin/sh` were statically linked, the last step would be missing and the
+kernel would start executing right there. It should also be noted that Linux
+has a hard limit on interpreter recursion depth, a small constant that is
+large enough to support this standard case (script, interpreter, loader).
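+
+The rewriting of the command line described above can be modelled with a few
+lines of Python. This is a toy model only: `expand_exec`, the lookup table and
+the default `max_depth` of 3 (one level each for script, interpreter and
+loader) are assumptions made for this example, not kernel interfaces or the
+limit of any particular kernel version.
+
+```python
+def expand_exec(argv, interpreter_of, max_depth=3):
+    """Prepend interpreters to argv until a file without one is reached.
+
+    `interpreter_of` is a hypothetical table mapping a path to the
+    interpreter it names; paths that are missing from it have none.
+    Returns every intermediate command line, starting with the original.
+    """
+    argv = list(argv)
+    steps = [argv]
+    for _ in range(max_depth):
+        interp = interpreter_of.get(argv[0])
+        if interp is None:
+            return steps              # nothing left to resolve, run argv[0]
+        argv = [interp] + argv        # run the interpreter on the old argv
+        steps.append(argv)
+    raise RecursionError("interpreter chain too deep")
+
+
+table = {
+    "/tmp/hello.sh": "/bin/sh",       # from the "#!" line
+    "/bin/sh": "/lib/ld-linux.so",    # from the ELF interpreter field
+}
+for step in expand_exec(["/tmp/hello.sh"], table):
+    print(" ".join(step))
+```
+
+With a statically linked `/bin/sh`, the second table entry would be missing
+and the expansion would stop one step earlier.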
+
+The entry point of the ELF file that the loader jumps to is of course NOT
+the `main` function of the C program. It points to setup code provided by
+the libc implementation that does some initialization first, such as stack
+setup, getting the argument vector, initializing malloc or whatever other
+internals and then calls the `main` function. When `main` returns, the
+startup code calls the `exit` system call with the return value from `main`.
+
+The startup code is provided by the libc, typically in the form of an object
+file in `/lib`, e.g. `/lib/crt0.o`. The C compiler links executable programs
+against this object file and expects it to have a symbol called `_start`. The
+entry point address of the ELF file is set to the location of `_start` and the
+interpreter is set to the path of the loader.
+
+Finally, the `main` function of `/bin/sh` is run. Somewhere inside, it opens
+the file it has been provided on the command line and starts interpreting your
+shell script.
+
+## Take Away Message
+
+In summary, the compiler needs to know the following things about the libc:
+ - The path to the loader for dynamically linked programs.
+ - The path to the startup object code it needs to link against.
+ - The path of the libc itself to link against.
+
+If you try to run a program and you get possibly the most useless error
+message, `No such file or directory`, it could have the following reasons:
+ - The kernel couldn't find the program you are trying to run.
+ - The kernel couldn't find the interpreter set by the program.
+ - The kernel couldn't find the interpreter of the interpreter.
+ - The loader couldn't find a library used by either your program, the
+   interpreter of your program, or another library that it loaded.
+
+So if you see that error message, don't panic. Try to figure out the root
+cause by walking through this checklist. You can use the `ldd` program (that
+is provided by the libc) to display libraries that the loader would try to
+load. But **NEVER** use `ldd` on untrusted programs. Typical implementations
+of `ldd` try to execute the interpreter with special options to collect
+dependencies. An attacker could set the interpreter field to something other
+than `ld-linux.so` and gain code execution.
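+
+The first three items of the checklist amount to following the chain of
+interpreters until a file is missing. A minimal sketch in Python, assuming two
+hypothetical callbacks: `exists(path)` tells whether a file is present and
+`interpreter_of(path)` returns the interpreter a file names (or `None`).
+Missing libraries are not covered here.
+
+```python
+def first_missing(path, exists, interpreter_of):
+    """Return the first path in the interpreter chain that is missing.
+
+    Returns None if the program and all of its interpreters exist.
+    """
+    seen = set()
+    while path is not None and path not in seen:
+        if not exists(path):
+            return path
+        seen.add(path)                 # guard against interpreter loops
+        path = interpreter_of(path)
+    return None
+
+
+# Toy file system: the program exists, but the loader it names does not.
+files = {"/bin/app": "/lib/ld-missing.so"}
+print(first_missing("/bin/app", files.__contains__, files.get))
+```
+
+On a real system, `exists` could be `os.path.exists` and `interpreter_of`
+a function that reads the `#!` line or the ELF interpreter field.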
-- 
cgit v1.2.3