Targeted Memory Dumping: Pinpointing Specific Data Structures in Android Apps with Frida

Introduction: Beyond Generic Memory Dumps

In the realm of Android application penetration testing and reverse engineering, memory dumping is a powerful technique to extract sensitive information. While a full memory dump can yield vast amounts of data, it often presents an overwhelming challenge in terms of analysis, requiring significant effort to sift through gigabytes of raw bytes. This is where targeted memory dumping with Frida shines. Instead of indiscriminately dumping an entire process’s memory, we can use Frida to precisely locate, intercept, and extract specific data structures, object instances, or memory regions that are known or suspected to hold sensitive information.

This expert-level guide will walk you through the methodologies and Frida scripts to perform highly granular memory dumping, focusing on particular data structures within an Android application. We’ll explore techniques for both Java and native memory manipulation, providing actionable examples.

The Power of Precision: Why Targeted Dumping Matters

Targeted memory dumping offers several key advantages:

Efficiency: Reduces the amount of data to analyze, saving time and computational resources.
Precision: Directly extracts the relevant data, bypassing obfuscation or encryption mechanisms that might be present in storage.
Bypassing Anti-Tampering: Often allows extraction of data that is only briefly in memory, avoiding persistent storage analysis.
Dynamic Analysis: Captures data during runtime, reflecting the application’s current state and active secrets.

Prerequisites

Before diving in, ensure you have the following setup:

A rooted Android device or an emulator (e.g., AVD, Genymotion).
Frida installed on your host machine and Frida server running on the Android device.
Basic familiarity with Android application reverse engineering (e.g., using Jadx, Ghidra, or APKTool).
Basic understanding of JavaScript for writing Frida scripts.

Method 1: Scanning for Known Patterns in Memory (e.g., Strings)

Sometimes, a target value like an API key, a specific identifier, or a unique string might be residing directly in memory. Frida’s Memory.scanSync() API allows us to search for specific byte patterns within the process’s memory regions.

Scenario: Locating a Hardcoded API Key

Let’s assume we’ve identified through static analysis (e.g., decompiling with Jadx) that an application uses a string like "pk_live_some_very_secret_api_key", but it’s generated or placed in memory dynamically. We want to find its exact memory location and value during runtime.

Java.perform(function () { const pattern = "70 6B 5F 6C 69 76 65 5F 73 6F 6D 65 5F 76 65 72 79 5F 73 65 63 72 65 74 5F 61 70 69 5F 6B 65 79"; // Hex representation of "pk_live_some_very_secret_api_key" const pageSize = Process.pageSize; // Get system page size console.log("Searching for pattern: " + pattern); Process.enumerateRanges('r--').forEach(function (range) { try { const matches = Memory.scanSync(range.base, range.size, pattern); matches.forEach(function (match) { console.log("Found pattern at: " + match.address); const data = Memory.readCString(match.address); console.log("Extracted string: " + data); }); } catch (e) { // Handle potential access violations or other errors console.error("Error scanning range " + range.base + ": " + e.message); } }); console.log("Scan complete.");});

Explanation:

We convert the target string into its hexadecimal byte pattern.
Process.enumerateRanges('r--') iterates through all readable memory regions.
Memory.scanSync(address, size, pattern) attempts to find occurrences of our pattern within each region.
Upon finding a match, we print the address and use Memory.readCString() to extract the string from that address.

Method 2: Extracting Data from Live Java Object Instances

Often, sensitive data is stored within instances of specific Java classes. Frida’s Java API allows us to hook methods, access fields, and even instantiate objects to extract their internal state.

Scenario: Dumping a `byte[]` field from a Custom Secret Holder Class

Imagine an application has a class named com.example.app.SecretHolder which holds an encrypted key in a private byte[] field named encryptedKeyData.

Java.perform(function () { const SecretHolder = Java.use('com.example.app.SecretHolder'); // Hook the constructor to get instances when they are created SecretHolder.$init.implementation = function () { console.log('SecretHolder constructor called!'); this.$init(); // Call original constructor // Now 'this' refers to the new instance, we can access its fields const encryptedKeyData = this.encryptedKeyData.value; // Access the byte[] field if (encryptedKeyData) { console.log('Dumping encryptedKeyData (length: ' + encryptedKeyData.length + '):'); // Convert byte[] to Hex string for easy viewing let hexString = ''; for (let i = 0; i < encryptedKeyData.length; i++) { hexString += ('0' + (encryptedKeyData[i] & 0xFF).toString(16)).slice(-2); } console.log(hexString); } else { console.log('encryptedKeyData field is null or empty.'); } }; // Alternatively, if instances already exist and are static/globally accessible: /* Java.choose('com.example.app.SecretHolder', { onMatch: function(instance) { console.log('Found an existing SecretHolder instance: ' + instance); const encryptedKeyData = instance.encryptedKeyData.value; if (encryptedKeyData) { let hexString = ''; for (let i = 0; i < encryptedKeyData.length; i++) { hexString += ('0' + (encryptedKeyData[i] & 0xFF).toString(16)).slice(-2); } console.log('Dumping from existing instance: ' + hexString); } }, onComplete: function() { console.log('Java.choose complete.'); } }); */});

Explanation:

Java.use('com.example.app.SecretHolder') gets a JavaScript wrapper for the Java class.
SecretHolder.$init.implementation allows us to intercept the constructor. When a SecretHolder object is created, our custom logic runs.
Inside the constructor hook, this refers to the newly created instance. We can then directly access its public or private fields (Frida can often bypass Java visibility restrictions, though sometimes `Java.cast` or other techniques are needed for complex objects).
We extract the encryptedKeyData byte array and convert it to a hexadecimal string for display.
The commented-out Java.choose demonstrates how to find *existing* instances of a class in the heap, useful if you miss the constructor or need to interact with objects already alive.

Method 3: Intercepting Native Memory Operations

Many performance-critical or security-sensitive operations in Android apps are implemented in native code (C/C++), often through JNI. Frida’s Interceptor.attach() is crucial for hooking native functions and inspecting their arguments or return values, which frequently include pointers to memory buffers.

Scenario: Dumping Data Passed to a Native Crypto Function

Consider a native library (e.g., libnativecrypto.so) that has a function encryptData(const void* inputBuffer, size_t inputSize, void* outputBuffer, size_t outputSize). We want to dump the inputBuffer before encryption occurs.

Interceptor.attach(Module.findExportByName('libnativecrypto.so', 'encryptData'), { onEnter: function (args) { // args[0] is inputBuffer, args[1] is inputSize console.log('Hooked encryptData!'); this.inputBuffer = args[0]; this.inputSize = args[1].toInt32(); // Cast NativePointer to int32 }, onLeave: function (retval) { // We can dump the input buffer here or in onEnter if (this.inputBuffer && this.inputSize > 0) { console.log('Dumping inputBuffer before encryption (size: ' + this.inputSize + ' bytes):'); const dumpedBytes = Memory.readByteArray(this.inputBuffer, this.inputSize); // Convert ArrayBuffer to Hex String for display function arrayBufferToHexString(buffer) { const byteArray = new Uint8Array(buffer); let hexString = ''; for (let i = 0; i < byteArray.byteLength; i++) { hexString += ('0' + byteArray[i].toString(16)).slice(-2); } return hexString; } console.log(arrayBufferToHexString(dumpedBytes)); } else { console.log('Input buffer not available or size is zero.'); } }});

Explanation:

Module.findExportByName('libnativecrypto.so', 'encryptData') locates the address of the exported native function.
Interceptor.attach() hooks this function.
onEnter is called when the function is entered. We store the relevant arguments (inputBuffer and inputSize) in this context, making them accessible in onLeave.
onLeave is called when the function returns. Here, we use Memory.readByteArray(address, size) to dump the raw bytes from the inputBuffer. The result is an ArrayBuffer, which we then convert to a hexadecimal string for logging.

Analyzing Dumped Data

Once you have dumped the raw bytes or hex strings, you’ll need tools to analyze them. Depending on the data type, this could involve:

Hex Editors: For visual inspection of raw bytes (e.g., HxD, 010 Editor).
CyberChef: For various encoding/decoding, encryption, and data manipulation tasks.
Custom Scripts: Python scripts are ideal for parsing structured data, decrypting, or reassembling fragmented information.

Conclusion

Targeted memory dumping with Frida is an indispensable skill for advanced Android penetration testers and reverse engineers. By moving beyond full memory dumps and focusing on specific data structures and memory regions, you can significantly streamline your analysis, efficiently extract sensitive information, and gain deeper insights into an application’s runtime behavior. Mastering these techniques empowers you to pinpoint critical data, whether it’s an API key in a string, an encrypted blob in a Java object, or sensitive plaintext in a native buffer, ultimately enhancing your ability to identify and exploit vulnerabilities.

Android Mobile Specs & Compare Directory

Are you researching mobile hardware properties, processor SoCs, GPU chipsets, or RAM configurations? Access our complete specs catalog to compare up to 5 devices side-by-side!

Compare Devices Specs →

Introduction: Beyond Generic Memory Dumps

The Power of Precision: Why Targeted Dumping Matters

Prerequisites

Method 1: Scanning for Known Patterns in Memory (e.g., Strings)

Scenario: Locating a Hardcoded API Key

Method 2: Extracting Data from Live Java Object Instances

Scenario: Dumping a `byte[]` field from a Custom Secret Holder Class

Method 3: Intercepting Native Memory Operations

Scenario: Dumping Data Passed to a Native Crypto Function

Analyzing Dumped Data

Conclusion

Android Mobile Specs & Compare Directory

Related Technical Guides

Reverse Engineering Android Apps for Anti-Frida/Anti-Tampering Mechanisms and Their Bypass

Bypass Android SSL Pinning with Frida: A Comprehensive Step-by-Step How-To

Frida Biometric Bypass: A Step-by-Step Guide to Circumventing Android Fingerprint & Face Locks