Architecture

Dr.Semu’s architecture consists of four main components that work together to provide isolated malware analysis. All components operate in user-mode without requiring kernel drivers or modifications.

System components

LauncherCLI

Command-line interface and orchestration layer

virtual_FS_REG

Filesystem and registry virtualization

DrSemu Client

DynamoRIO-based instrumentation client

Detection Engine

Lua and Python rule execution

Architecture overview

┌─────────────────────────────────────────────────────────────┐
│                       LauncherCLI                            │
│  • Process orchestration                                     │
│  • Report management                                         │
│  • Command-line interface                                    │
└────────────┬────────────────────────────┬───────────────────┘
             │                            │
             ▼                            ▼
┌────────────────────────┐   ┌───────────────────────────────┐
│   virtual_FS_REG.exe   │   │      DynamoRIO Client         │
│  • ProjFS provider     │   │  • System call interception   │
│  • Registry cloning    │   │  • Behavior logging           │
│  • Path redirection    │   │  • JSON output                │
└────────────────────────┘   └───────────────────────────────┘
             │                            │
             └────────────┬───────────────┘
                          ▼
                ┌──────────────────┐
                │  Target Process  │
                │  (Isolated)      │
                └──────────────────┘
                          │
                          ▼
                ┌──────────────────┐
                │  JSON Reports    │
                └──────────────────┘
                          │
                          ▼
                ┌──────────────────┐
                │ Detection Engine │
                │ (Lua/Python)     │
                └──────────────────┘

Component lifecycle

1. Initialization phase

LauncherCLI starts

The launcher parses command-line arguments and initializes the environment (LauncherCLI.cpp:59-180).

Virtual environment setup

Creates VM-specific temp directory (e.g., dr_semu_0) and launches virtual_FS_REG.exe with elevated privileges (LauncherCLI.cpp:196-227).

Fake Explorer launch

Starts a fake explorer.exe process to serve as the parent for malware that checks process hierarchy (LauncherCLI.cpp:267-325).

2. Execution phase

Target launch

Target executable is launched under DynamoRIO with the Dr.Semu client DLL (LauncherCLI.cpp:584-668).

System call interception

DynamoRIO intercepts system calls before they enter the kernel (DrSemu.cpp:414-741).

Behavior logging

Each intercepted call is logged to a JSON structure with parameters and results (DrSemu/utils.hpp).

3. Analysis phase

Process termination

Target process completes or is terminated after time limit (DrSemu.cpp:34-40).

Report generation

JSON reports are written for each monitored process (DrSemu.cpp:369-392).

Rule execution

Detection engine runs Lua and Python rules against the collected data (run_detections/run_detections.cpp:16-69).

Communication mechanisms

Mailslots

Dr.Semu uses Windows mailslots for inter-process communication:

// LauncherCLI.cpp:186-191
const auto main_mailslot_name = launchercli::get_true_random_string(20);
dr_semu::shared::slot main_mailslot(main_mailslot_name);

Mailslots facilitate:

Process tracking (add and remove commands)
Status reporting from instrumented processes
Synchronization between components

Named pipes

Named pipes connect the launcher with the virtual filesystem provider:

// LauncherCLI.cpp:203-209
const auto pipe_name = launchercli::get_true_random_string(15);
dr_semu::shared::pipe virtual_fs_pipe(pipe_name);

Used for:

Virtual environment initialization
Configuration handshake
Cleanup coordination

Events

Named events signal process termination:

// LauncherCLI.cpp:284-291
const auto explorer_event_name = launchercli::get_true_random_string(15);
const auto explorer_kill_event = CreateEvent(nullptr, FALSE, FALSE, 
                                              explorer_event_name.c_str());

Data flow

Input

Target executable - Malware sample or suspicious file
Command-line arguments - Optional parameters passed to target
Detection rules - Lua/Python scripts in dr_rules/ directory

Processing

System call interception - Capture all filesystem, registry, process, and network operations
Path translation - Redirect operations to virtual environment
Behavior logging - Record parameters, return values, and timestamps

Output

Starter report (starter.json) - Process hierarchy and metadata
Dynamic reports ({pid}.json) - Per-process behavior logs
Static reports ({sha256}.json) - PE file analysis
Detection verdict - Classification result from rules

Isolation boundaries

All isolation is implemented in user-mode without kernel components.

Filesystem boundary

Mechanism: Windows Projected File System (ProjFS)
Implementation: virtual_FS_REG/virtualizationInstance.cpp
Scope: All file operations are redirected to %TEMP%\dr_semu_{vm_index}\

Registry boundary

Mechanism: Hive cloning and path rewriting
Implementation: virtual_FS_REG/virtual_reg.cpp
Scope: All registry keys are prefixed with dr_semu_{vm_index}!

Process boundary

Mechanism: Process tracking and handle filtering
Implementation: DrSemu/process_handlers.hpp
Scope: Child processes are automatically instrumented; external processes are hidden

Network boundary

Mechanism: Optional internet blocking
Implementation: DrSemu/networking_handlers.hpp
Scope: Configurable via disable_internet flag (DrSemu.cpp:148)

Multi-sample support

Dr.Semu supports concurrent analysis through VM isolation:

// LauncherCLI.cpp:476-487
std::vector<std::thread> threads{};
size_t vm_index = 1;
for (const auto& current_application : target_directory_files) {
    threads.emplace_back(vm_thread_function, current_application, vm_index);
    vm_index++;
}

Each VM gets:

Unique virtual filesystem root
Separate registry hives
Independent process tracking
Isolated report directory

Performance considerations

Resource usage

CPU: 2-10x overhead from DynamoRIO instrumentation
Memory: Additional RAM for virtual filesystem cache and DynamoRIO code cache
Disk: Temporary files in %TEMP% for each VM instance
I/O: Registry cloning adds ~5-10 seconds startup time

Optimization strategies

Parallel analysis: Run multiple samples concurrently (limited by CPU cores)
Cache reuse: Registry hives can be cached with use_cache parameter
Time limits: Set appropriate timeouts to prevent indefinite execution
Selective monitoring: Only configured system calls are intercepted

Isolation mechanisms

Deep dive into ProjFS and registry virtualization

Monitoring

System call interception with DynamoRIO

Detection rules

Writing and running detection rules

Building from source

Build and customize Dr.Semu

Get Started

Core Concepts

Usage

Detection Rules

Advanced

Troubleshooting

System components

LauncherCLI

virtual_FS_REG

DrSemu Client

Detection Engine

Architecture overview

Component lifecycle

1. Initialization phase

2. Execution phase

3. Analysis phase

Communication mechanisms

Mailslots

Named pipes

Events

Data flow

Input

Processing

Output

Isolation boundaries

Filesystem boundary

Registry boundary

Process boundary

Network boundary

Multi-sample support

Performance considerations

Resource usage

Optimization strategies

See also

Isolation mechanisms

Monitoring

Detection rules

Building from source

Build docs developers (and LLMs) love

Get Started

Core Concepts

Usage

Detection Rules

Advanced

Troubleshooting

​System components

LauncherCLI

virtual_FS_REG

DrSemu Client

Detection Engine

​Architecture overview

​Component lifecycle

​1. Initialization phase

​2. Execution phase

​3. Analysis phase

​Communication mechanisms

​Mailslots

​Named pipes

​Events

​Data flow

​Input

​Processing

​Output

​Isolation boundaries

​Filesystem boundary

​Registry boundary

​Process boundary

​Network boundary

​Multi-sample support

​Performance considerations

​Resource usage

​Optimization strategies

​See also

Isolation mechanisms

Monitoring

Detection rules

Building from source

Build docs developers (and LLMs) love

System components

Architecture overview

Component lifecycle

1. Initialization phase

2. Execution phase

3. Analysis phase

Communication mechanisms

Mailslots

Named pipes

Events

Data flow

Input

Processing

Output

Isolation boundaries

Filesystem boundary

Registry boundary

Process boundary

Network boundary

Multi-sample support

Performance considerations

Resource usage

Optimization strategies

See also