Case Study: Reverse Engineering a Proprietary Android Native Library’s Logic with Frida JNI

Introduction: Unveiling Native Android Logic with Frida

Android applications often leverage native libraries (written in C/C++ and compiled into .so files) for performance-critical tasks, platform-specific interactions, or, frequently, for intellectual property protection and obfuscation. Reverse engineering these proprietary native libraries can be a significant challenge, as they lack the high-level constructs found in Java/Kotlin bytecode. This case study delves into how Frida, a dynamic instrumentation toolkit, can be effectively used to hook Java Native Interface (JNI) functions, inspect arguments, and even modify return values, thereby uncovering the hidden logic within these libraries without extensive static analysis.

Our focus will be on understanding the workflow from identifying a target native function to writing a Frida script that interacts with its JNI arguments and return values. This technique is invaluable for security researchers, developers debugging complex native integrations, and those simply curious about how certain Android functionalities operate under the hood.

Prerequisites and Setup

Before we begin, ensure you have the following:

A rooted Android device or emulator (necessary for running Frida server).
adb (Android Debug Bridge) installed and configured on your host machine.
Frida installed on your host machine (pip install frida-tools).
Frida server binary appropriate for your Android device’s architecture (e.g., frida-server-16.1.4-android-arm64).
A static analysis tool like Ghidra or IDA Pro (optional but highly recommended for initial function identification).

Setting up Frida Server on Android

1. Download the correct Frida server binary from Frida’s GitHub releases for your device’s architecture (e.g., arm64, x86_64).

2. Push the binary to the device, make it executable, and run it:

adb push frida-server /data/local/tmp/
adb shell "chmod 755 /data/local/tmp/frida-server"
adb shell "/data/local/tmp/frida-server &"

3. Verify Frida server is running by listing processes from your host machine:

frida-ps -U

You should see a list of processes from your connected Android device.

Identifying the Target Native Library and Functions

Our hypothetical target is an Android application com.example.proprietaryapp that uses a native library libcryptolib.so to perform some sensitive data processing or license verification. The first step is to locate this library and identify its exported JNI functions.

Locating the Library

The library will typically reside in /data/app/com.example.proprietaryapp-.../lib/arm64 (or similar architecture path) or bundled within the APK. You can use adb shell pm path com.example.proprietaryapp to find the APK path, then pull and decompile the APK to extract the .so files.

Finding JNI Exported Functions

JNI functions exported from a native library follow a specific naming convention: Java_PackageName_ClassName_MethodName. You can use static analysis tools or simple command-line utilities to find these.

Using nm on Linux/macOS or within WSL:

nm -D /path/to/libcryptolib.so | grep Java_

Alternatively, open libcryptolib.so in Ghidra or IDA Pro. Navigate to the Exports window and look for function names matching the JNI pattern. Let’s assume we find a function named Java_com_example_proprietaryapp_NativeProcessor_processData.

The Challenge: Obscured Logic and JNI Types

Inside Java_com_example_proprietaryapp_NativeProcessor_processData, the application likely handles sensitive data or cryptographic operations. Our goal is to intercept the data passed into this function, observe its return value, and potentially manipulate them. This requires understanding how JNI types (jstring, jbyteArray, jint, etc.) are handled in Frida.

A typical JNI function signature looks like this:

JNIEXPORT jbyteArray JNICALL Java_com_example_proprietaryapp_NativeProcessor_processData(
    JNIEnv* env, jobject thiz, jstring inputData, jbyteArray keyBuffer, jint operationMode)
{ /* ... */ }

Here:

JNIEnv* env: A pointer to the JNI environment, allowing interaction with the JVM.
jobject thiz: The object on which the native method is called (for non-static methods).
jstring inputData: A JNI string object.
jbyteArray keyBuffer: A JNI byte array object.
jint operationMode: A JNI integer (equivalent to C int).

Frida JNI Hooking: Intercepting Native Calls

Frida allows us to intercept calls to native functions and inspect their arguments. The key is to correctly interpret the JNI types using the JNIEnv object.

The Frida Script Structure

We’ll use Interceptor.attach to hook the native function. Inside the onEnter callback, we’ll access the arguments, and in onLeave, we’ll inspect or modify the return value.

Java.perform(function() {
    // Find the target module (native library)
    var targetModule = Module.findExportByName(null, "JNI_OnLoad").parent;
    // If JNI_OnLoad isn't exported or found, use Module.load('libcryptolib.so')
    // and then find its base address and exports.
    // A more robust way: Module.ensureInitialized('libcryptolib.so');
    // Then use Module.findBaseAddress('libcryptolib.so');
    
    // For simplicity, let's assume we know the module name.
    var libcrypto = Process.findModuleByName("libcryptolib.so");
    if (libcrypto) {
        console.log("Found libcryptolib.so at base address: " + libcrypto.base);

        // Get a pointer to the target native function
        var processData_ptr = libcrypto.findExportByName("Java_com_example_proprietaryapp_NativeProcessor_processData");

        if (processData_ptr) {
            console.log("Hooking Java_com_example_proprietaryapp_NativeProcessor_processData at " + processData_ptr);

            Interceptor.attach(processData_ptr, {
                onEnter: function(args) {
                    // args[0] is JNIEnv*
                    // args[1] is jobject thiz
                    // args[2] is jstring inputData
                    // args[3] is jbyteArray keyBuffer
                    // args[4] is jint operationMode

                    this.env = args[0]; // Store JNIEnv for later use

                    // --- Reading jstring inputData (args[2]) ---
                    // JNIEnv->GetStringUTFChars takes jstring and returns const char*
                    var jstringInputData = args[2];
                    var GetStringUTFChars = this.env.getFunction('GetStringUTFChars');
                    var inputDataJsString = GetStringUTFChars(jstringInputData, NULL).readCString();
                    console.log("[*] inputData: " + inputDataJsString);

                    // --- Reading jbyteArray keyBuffer (args[3]) ---
                    // JNIEnv->GetArrayLength takes jarray and returns jsize
                    var jbyteArrayKeyBuffer = args[3];
                    var GetArrayLength = this.env.getFunction('GetArrayLength');
                    var arrayLength = GetArrayLength(jbyteArrayKeyBuffer);
                    console.log("[*] keyBuffer length: " + arrayLength);

                    // JNIEnv->GetByteArrayElements takes jbyteArray and returns jbyte*
                    // It also takes a jboolean* isCopy for optional copy behavior
                    var GetByteArrayElements = this.env.getFunction('GetByteArrayElements');
                    var keyBufferPtr = GetByteArrayElements(jbyteArrayKeyBuffer, NULL);

                    // Read the bytes into a JavaScript ArrayBuffer
                    var keyBufferJsArray = Memory.readByteArray(keyBufferPtr, arrayLength);
                    console.log("[*] keyBuffer (hex): " + hexdump(keyBufferJsArray, {offset: 0, length: arrayLength, header: false, ansi: false}));

                    // JNIEnv->ReleaseByteArrayElements MUST be called to release resources
                    var ReleaseByteArrayElements = this.env.getFunction('ReleaseByteArrayElements');
                    ReleaseByteArrayElements(jbyteArrayKeyBuffer, keyBufferPtr, 0); // 0 means commit changes and free

                    // --- Reading jint operationMode (args[4]) ---
                    var operationMode = args[4].toInt32();
                    console.log("[*] operationMode: " + operationMode);

                    // Optional: Modify arguments (e.g., if you want to change inputData)
                    // To change jstring, you'd create a new jstring using NewStringUTF and replace args[2]
                    // var NewStringUTF = this.env.getFunction('NewStringUTF');
                    // var newJString = NewStringUTF('modified_input_data');
                    // args[2] = newJString;
                },
                onLeave: function(retval) {
                    // --- Inspecting/Modifying return value (jbyteArray) ---
                    console.log("[*] Original Return Value (jbyteArray address): " + retval);
                    
                    if (retval.isNull()) {
                        console.log("[!] Native function returned null.");
                        return;
                    }

                    var GetArrayLength = this.env.getFunction('GetArrayLength');
                    var retArrayLength = GetArrayLength(retval);
                    console.log("[*] Return value array length: " + retArrayLength);

                    var GetByteArrayElements = this.env.getFunction('GetByteArrayElements');
                    var retBufferPtr = GetByteArrayElements(retval, NULL);
                    var retBufferJsArray = Memory.readByteArray(retBufferPtr, retArrayLength);
                    console.log("[*] Return value (hex): " + hexdump(retBufferJsArray, {offset: 0, length: retArrayLength, header: false, ansi: false}));

                    var ReleaseByteArrayElements = this.env.getFunction('ReleaseByteArrayElements');
                    ReleaseByteArrayElements(retval, retBufferPtr, 0);

                    // Optional: Modify the return value
                    // To return a different byte array, you'd create a new jbyteArray using NewByteArray
                    // and then SetByteArrayRegion to fill it, then replace retval.
                    // var NewByteArray = this.env.getFunction('NewByteArray');
                    // var SetByteArrayRegion = this.env.getFunction('SetByteArrayRegion');
                    // var newArray = NewByteArray(16);
                    // var dummyBytes = [0xde, 0xad, 0xbe, 0xef, 0x00, 0x00, 0x00, 0x00, 0xca, 0xfe, 0xba, 0xbe, 0x00, 0x00, 0x00, 0x00];
                    // SetByteArrayRegion(newArray, 0, 16, Memory.alloc(16).writeByteArray(dummyBytes));
                    // retval.replace(newArray);
                    // console.log("[+] Return value replaced with dummy data!");
                }
            });
        } else {
            console.log("[-] Could not find Java_com_example_proprietaryapp_NativeProcessor_processData.");
        }
    } else {
        console.log("[-] Could not find libcryptolib.so.");
    }
});

Running the Frida Script

Save the above script as hook_native_processor.js. Then, inject it into the target application:

frida -U -l hook_native_processor.js com.example.proprietaryapp

As the application runs and calls NativeProcessor.processData, you will see the intercepted arguments and return values printed to your console, revealing the previously obscured logic.

Understanding JNIEnv and its Functions

The JNIEnv* env pointer is crucial. It provides access to a table of functions that allow native code to interact with the Java Virtual Machine. In Frida, we can access these functions directly:

env.getFunction('MethodName'): Retrieves a pointer to a JNIEnv function (e.g., GetStringUTFChars, GetArrayLength, NewStringUTF, NewByteArray, CallStaticObjectMethod).
These functions take JNI types as arguments and return appropriate values. You need to consult the JNI Specification for exact signatures.

For example, to convert a jstring to a JavaScript string:

var GetStringUTFChars = this.env.getFunction('GetStringUTFChars');
var javaString = args[2];
var cStringPtr = GetStringUTFChars(javaString, NULL);
var jsString = cStringPtr.readCString();
// Remember to release the C string buffer if GetStringUTFChars made a copy
// GetStringUTFChars(javaString, true) would indicate a copy was made
// A safer approach is to use ReleaseStringUTFChars if a copy might have been made.
// var ReleaseStringUTFChars = this.env.getFunction('ReleaseStringUTFChars');
// ReleaseStringUTFChars(javaString, cStringPtr);

For byte arrays, the pattern involves `GetArrayLength` and `GetByteArrayElements`/`ReleaseByteArrayElements`.

Conclusion

Frida provides an incredibly powerful and flexible platform for dynamic instrumentation, making it an indispensable tool for Android native library reverse engineering. By understanding JNI function signatures and how to interact with the JNIEnv pointer within a Frida script, you can overcome many challenges posed by proprietary native code. This approach enables deep inspection of runtime behavior, argument values, and return data, which is crucial for security analysis, debugging, and understanding complex application logic that is otherwise hidden.

While static analysis tools like Ghidra and IDA Pro are excellent for initial reconnaissance and identifying potential functions, Frida truly shines when you need to observe and interact with the code’s behavior at runtime. Combining both approaches offers the most comprehensive strategy for reverse engineering Android applications.

Android Mobile Specs & Compare Directory

Are you researching mobile hardware properties, processor SoCs, GPU chipsets, or RAM configurations? Access our complete specs catalog to compare up to 5 devices side-by-side!

Compare Devices Specs →