purelyfunctional.org - Calling External Functions from JIT-compiled LLVM Modules using llvm-hs

Posted on April 2, 2018

llvm-hs provides bindings to LLVM’s ORC JIT APIs. These APIs let you JIT-compile LLVM modules and then call functions in those modules from your Haskell code. However, sometimes you want to use external libraries from within your LLVM module, either because you want to make use of an existing library or because certain parts are easier to implement in another language (e.g. C) than in LLVM IR. Sam Griffin recently raised the question of how you can call functions in external libraries from a JIT-compiled module. While I had a rough idea of how to do this, I had never actually tried it myself. In this post, I present my findings on how you can accomplish this for both static and dynamic libraries.

We start with a very simple C file lib.c that defines a function called external_function which returns twice its argument. This is the function that we will attempt to call from our LLVM module.

#include <stdint.h>

int32_t external_function(int32_t x) {
    return 2 * x;
}

We can now compile this to an object file using gcc -fPIC -c -o lib.o lib.c. (-fPIC is only necessary when we want to produce a dynamic library, but to keep things simple we will use the same object file for building both the static and the dynamic library in this post.)

The static library can now be created using ar rcs libexternalstatic.a lib.o. The dynamic library can be built using gcc -shared -o libexternaldynamic.so lib.o.

The LLVM module module.ll that we will be using in this post declares external_function and defines a function f which takes no argument and returns the result of applying external_function to 21.

; ModuleID = 'basic'
source_filename = "<string>"

declare i32 @external_function(i32)

define i32 @f() {
entry:
  %0 = call i32 @external_function(i32 21)
  ret i32 %0
}

Now that we have defined the module, we are ready to write the Haskell code to JIT the module and then finally call the f function. For this post, we will declare the module using LLVM’s textual IR and load it using llvm-hs’s withModuleFromLLVMAssembly but building the module using llvm-hs-pure’s AST works as well.

There are two points that you need to pay attention to if your JIT-compiled module references external functions (for both static and dynamic libraries):

Your resolver needs some way to find the symbol. We are going to use getSymbolAddressInProcess for this, a function provided by llvm-hs that searches for loaded symbols in the current process.
getSymbolAddressInProcess will only find symbols in libraries that have been loaded before. This is accomplished by calling loadLibraryPermanently before you JIT the module. You can either pass the name of a dynamic library to loadLibraryPermanently or you can pass Nothing (equivalent to dlopen(NULL)) which will load the symbols in the current process including the symbols in shared libraries that the executable is linked against.

This leaves us with the following resolver:

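resolver :: IRCompileLayer l -> SymbolResolver
resolver compileLayer =
  SymbolResolver
    -- First resolver: look for the symbol in the compile layer itself.
    (\s -> findSymbol compileLayer s True)
    -- Second resolver: look for the symbol among those loaded into the
    -- current process (this is how external_function is found).
    (\s ->
       fmap
         (\a -> JITSymbol a (JITSymbolFlags False True))
         (getSymbolAddressInProcess s))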

The implementation of main might look slightly complicated at first glance, so let’s break it down:

We first call the aforementioned loadLibraryPermanently function to make sure that later calls to getSymbolAddressInProcess will find external_function.
Then follows a bit of boilerplate to initialize the LLVM context, load the module and create the ORC linking and compile layers.
We can now add the module to the ORC compile layer using withModule which is a bracket-style wrapper around addModule and removeModule.
Next, we mangle the symbol of the function that we want to call (f in this case) and search for the symbol in the compile layer.
Pattern matching on the resulting JITSymbol gives us back a WordPtr representing the address of f. We use wordPtrToPtr and castPtrToFunPtr to convert the WordPtr to a FunPtr.
Finally, we use a dynamic foreign import to convert the FunPtr to a Haskell function and call the resulting function.

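main :: IO ()
main = do
  -- Load the symbols of the current process so that
  -- getSymbolAddressInProcess can find external_function later.
  loadLibraryPermanently Nothing
  -- Boilerplate: context, module, target machine and the ORC layers.
  withContext $ \ctx ->
    withModuleFromLLVMAssembly ctx (File "module.ll") $ \mod' ->
      withHostTargetMachine $ \tm ->
        withObjectLinkingLayer $ \objectLayer ->
          withIRCompileLayer objectLayer tm $ \compileLayer -> do
            -- Add the module to the compile layer for the duration of the callback.
            withModule
              compileLayer
              mod'
              (resolver compileLayer) $
              \_ -> do
                -- Mangle the name of f and look it up in the compile layer.
                mainSymbol <- mangleSymbol compileLayer "f"
                (JITSymbol mainFn _) <- findSymbol compileLayer mainSymbol True
                -- Convert the address to a FunPtr and call it.
                result <- mkFun (castPtrToFunPtr (wordPtrToPtr mainFn))
                print result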
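
The mkFun function used above is the dynamic foreign import mentioned in the last step. The following is a minimal, self-contained sketch of that step in isolation, assuming that f has the C signature int32_t f(void). Since there is no JIT-compiled code in this sketch, it obtains a function pointer from a "wrapper" import instead; mkCallback and the constant 42 are made up for illustration and are not part of the code from this post.

```haskell
{-# LANGUAGE ForeignFunctionInterface #-}
module Main (main) where

import Data.Int (Int32)
import Foreign.Ptr
  ( FunPtr, WordPtr, castFunPtrToPtr, castPtrToFunPtr
  , freeHaskellFunPtr, ptrToWordPtr, wordPtrToPtr )

-- Dynamic import: turns a pointer to a C function of type
-- int32_t (*)(void) into a Haskell action that calls it.
foreign import ccall "dynamic"
  mkFun :: FunPtr (IO Int32) -> IO Int32

-- Wrapper import (illustration only): gives us a function pointer
-- to call, standing in for the address of the JIT-compiled f.
foreign import ccall "wrapper"
  mkCallback :: IO Int32 -> IO (FunPtr (IO Int32))

main :: IO ()
main = do
  callback <- mkCallback (return 42)
  -- Pretend we only know the raw address, as with JITSymbol.
  let addr = ptrToWordPtr (castFunPtrToPtr callback) :: WordPtr
  -- The same conversion chain as in the main function above.
  result <- mkFun (castPtrToFunPtr (wordPtrToPtr addr))
  print result -- prints 42
  freeHaskellFunPtr callback
```

The type given to mkFun has to match the signature of the function behind the pointer, because the compiler has no way to check it for you.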

If you want to use the dynamic library, then all that’s left to do is to add extra-libraries: externaldynamic to the executable section in our cabal file. Depending on where you placed the shared library, you will also have to set extra-lib-dirs to the directory containing the library so that it is found at link time and the LD_LIBRARY_PATH environment variable to make sure it is found when you run the executable.
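
As a rough sketch, the executable section for the dynamic library might look like the following. The executable name, the main-is and build-depends entries and the /path/to/lib directory are placeholders that you will have to adapt to your project.

```cabal
-- Hypothetical executable section; only the last two fields
-- are specific to the dynamic library.
executable jit-external
  main-is:          Main.hs
  build-depends:    base, llvm-hs
  -- Link against libexternaldynamic.so.
  extra-libraries:  externaldynamic
  -- Placeholder: directory containing libexternaldynamic.so.
  extra-lib-dirs:   /path/to/lib
```

With this setup, you would then run the executable with LD_LIBRARY_PATH pointing at the same directory.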

If you want to use the static library, things are a bit more involved. Just adding externalstatic to extra-libraries will not work, since the linker omits unused symbols when linking against static libraries. Since the linker does not know about the reference to external_function in our JIT-compiled module, this symbol will not end up in the binary. To fix this, you need to add -Wl,--whole-archive,-lexternalstatic,--no-whole-archive to the ld-options field in your cabal file. This forces all symbols in the externalstatic library to be included in the final executable even if they are not referenced.

We also need to ensure that the symbols end up in the dynamic symbol table, since that is what getSymbolAddressInProcess looks at. The corresponding flag in GNU ld is called --export-dynamic, but we use GHC’s -rdynamic option here (by adding it to ld-options), which uses --export-dynamic under the hood if you’re using GNU ld (but should also support other linkers). As with shared libraries, you might also need to set extra-lib-dirs to make sure that the library is found at link time. Since we are linking the library statically, there is no need to mess with LD_LIBRARY_PATH.

If you followed the steps thus far, you might have noticed that this still does not quite work: you no longer get symbol resolution errors, but you get a segfault instead. Luckily, this can be fixed by changing the relocation model of the target machine to PIC instead of relying on the default set by withHostTargetMachine, which seems to be Static on X86. (I think this has the effect of preventing LLVM from emitting call instructions to immediates, but I am not entirely sure why this is necessary. If you know more about this, I would love to hear from you!) The custom version of withHostTargetMachine that sets the relocation model looks as follows:

withHostTargetMachine :: (TargetMachine -> IO a) -> IO a
withHostTargetMachine f = do
  initializeAllTargets
  triple <- getProcessTargetTriple
  cpu <- getHostCPUName
  features <- getHostCPUFeatures
  (target, _) <- lookupTarget Nothing triple
  withTargetOptions $ \options ->
    withTargetMachine target triple cpu features options Reloc.PIC CodeModel.Default CodeGenOpt.Default f
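
For reference, the linker settings for the static library described above could be collected in an executable section along these lines. This is only a sketch: the executable name, the main-is and build-depends entries and the /path/to/lib directory are placeholders, and the flags assume GNU ld.

```cabal
-- Hypothetical executable section for the static library.
executable jit-external
  main-is:          Main.hs
  build-depends:    base, llvm-hs
  -- Placeholder: directory containing libexternalstatic.a.
  extra-lib-dirs:   /path/to/lib
  -- -rdynamic puts the symbols into the dynamic symbol table;
  -- the --whole-archive group keeps unreferenced symbols of the
  -- static library in the executable.
  ld-options:       -rdynamic -Wl,--whole-archive,-lexternalstatic,--no-whole-archive
```

Note that externalstatic does not appear in extra-libraries here, since the library is already passed to the linker through ld-options.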

Conclusion

While calling functions in external libraries from a JIT-compiled module is not particularly complicated, finding all the correct linker flags can be a bit tricky, especially if you are not too familiar with linkers (which certainly applies to me :)). Hopefully, this post can serve as a reference and spare others from having to go through the same trial-and-error process that I went through. You can find the full code mentioned in this blog post on GitHub. Note that I only tested this on Linux (specifically 64-bit Arch Linux), so the linker flags might be slightly different on other systems.