Your First Penetration Test

Overview

This guide walks you through conducting your first penetration test with PentAGI. You’ll learn how to create a testing flow, configure targets, monitor execution, and interpret results.

Ensure you have completed the Installation and have proper authorization to test the target system.

Prerequisites

Before starting:

Legal and Ethical Requirements

Only perform penetration testing on systems you own or have explicit written authorization to test. Unauthorized testing is illegal and unethical.

Step 1: Access PentAGI

Open web interface

Navigate to https://localhost:8443 in your browser.

Use the default credentials (change after first login):

Email: [email protected]
Password: admin

Navigate to Flows

Click on the Flows menu to access the penetration testing workspace.

Step 2: Create Your First Flow

A “flow” in PentAGI represents a complete penetration testing engagement.

Create new flow

Click the + New Flow button in the Flows interface.

Configure flow parameters

Enter the basic information:

Name: Give your test a descriptive name (e.g., “Web App Security Assessment”)
Description: Add details about the test scope and objectives
Target: Specify the target system (e.g., http://10.10.10.10:8080)
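Before you enter a target, it can help to confirm that it falls inside the scope you are authorized to test. The sketch below is not part of PentAGI: `AUTHORIZED_NETWORKS` and `is_in_scope` are hypothetical names, and the check only handles targets whose host is an IP address.

```python
import ipaddress
from urllib.parse import urlparse

# Hypothetical allowlist: networks you have written authorization to test.
AUTHORIZED_NETWORKS = [ipaddress.ip_network("10.10.10.0/24")]


def is_in_scope(target_url: str) -> bool:
    """Return True if the URL's host is an IP address inside an authorized network."""
    host = urlparse(target_url).hostname
    if host is None:
        return False
    try:
        address = ipaddress.ip_address(host)
    except ValueError:
        # Hostnames would need DNS resolution, which this sketch does not attempt.
        return False
    return any(address in network for network in AUTHORIZED_NETWORKS)


print(is_in_scope("http://10.10.10.10:8080"))
```

Treating anything that cannot be verified as out of scope is a deliberately conservative choice for this kind of check.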

Define testing objectives

In the flow prompt, specify what you want to test. For example:

You need to find critical or valuable vulnerabilities in a Web Application http://10.10.10.10:8080

Follow this action plan:

1. Collect all endpoints of the application
   - Navigate through all application pages
   - Test all features and functions
   - Document endpoints and input fields

2. For each endpoint, check for:
   - Path Traversal (attempt to read /etc/passwd)
   - Cross-Site Request Forgery (CSRF)
   - Cross-Site Scripting (XSS)
   - SQL Injection (use sqlmap)
   - Command Injection (use commix)
   - Server-Side Request Forgery (SSRF)
   - XML External Entities (XXE)
   - Unsafe file upload

3. Document findings with:
   - Vulnerability type and severity
   - Reproduction steps
   - Example payloads
   - Potential impact
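If you run similar tests against several targets, you may prefer to generate the prompt text rather than retype it. The following is a minimal sketch that assembles a prompt in the same three-part shape as the example above; `build_flow_prompt` is a hypothetical helper, and the result is plain text that you would paste into the flow prompt field.

```python
def build_flow_prompt(target: str, checks: list[str]) -> str:
    """Assemble a flow prompt: target, action plan, and reporting requirements."""
    check_lines = "\n".join(f"   - {check}" for check in checks)
    return (
        f"You need to find critical or valuable vulnerabilities in a Web Application {target}\n"
        "\n"
        "Follow this action plan:\n"
        "\n"
        "1. Collect all endpoints of the application\n"
        "\n"
        "2. For each endpoint, check for:\n"
        f"{check_lines}\n"
        "\n"
        "3. Document findings with vulnerability type, severity, "
        "reproduction steps, example payloads, and potential impact\n"
    )


prompt = build_flow_prompt(
    "http://10.10.10.10:8080",
    ["SQL Injection (use sqlmap)", "Cross-Site Scripting (XSS)"],
)
print(prompt)
```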

Start the flow

Click Start Flow to begin the automated penetration test.

Step 3: Monitor Execution

PentAGI executes the penetration test autonomously. You can monitor progress in real time.

Understanding the Flow Hierarchy

Flow Components

Flow
Tasks
SubTasks
Actions

Flow: The top-level engagement representing the entire penetration test.

Status indicators:

Active: Test is running
Completed: All tasks finished
Failed: Critical error occurred
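One way to picture the hierarchy is as nested records, with a flow holding tasks, each task holding subtasks, and each subtask holding actions. The sketch below is only an illustration of that nesting and the three flow statuses; the class and field names are assumptions and do not reflect PentAGI's actual data model.

```python
from dataclasses import dataclass, field
from enum import Enum


class FlowStatus(Enum):
    ACTIVE = "Active"        # Test is running
    COMPLETED = "Completed"  # All tasks finished
    FAILED = "Failed"        # Critical error occurred


@dataclass
class Action:
    command: str
    succeeded: bool


@dataclass
class SubTask:
    title: str
    actions: list[Action] = field(default_factory=list)


@dataclass
class Task:
    title: str
    subtasks: list[SubTask] = field(default_factory=list)


@dataclass
class Flow:
    name: str
    status: FlowStatus
    tasks: list[Task] = field(default_factory=list)

    def all_actions(self) -> list[Action]:
        """Walk Flow -> Tasks -> SubTasks -> Actions and flatten the leaves."""
        return [a for t in self.tasks for s in t.subtasks for a in s.actions]


flow = Flow(
    name="Web App Security Assessment",
    status=FlowStatus.ACTIVE,
    tasks=[Task("Check sorting functionality", [SubTask("Run sqlmap", [Action("sqlmap", True)])])],
)
print(len(flow.all_actions()))
```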

Real-Time Monitoring

View task progress

The flow interface shows:

Current task being executed
Completed tasks (green checkmarks)
Pending tasks (gray)
Failed tasks (red X)

Inspect subtask details

Click on any task to expand and view:

Subtasks and their agents
Command outputs
Tool results
Agent reasoning and decisions

Review action logs

Each action shows:

Command or tool executed
Full output/response
Timestamps
Success/failure status

Step 4: Understanding Results

As PentAGI progresses through the test, it discovers and documents findings.

Example: SQL Injection Discovery

Here’s how PentAGI identifies and reports a SQL injection vulnerability:

Initial testing

Task: “Check sorting functionality for SQL Injection”

The executor agent runs sqlmap:

sqlmap -u "http://10.10.10.10:8080/?order=id" --batch --random-agent
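If you want to reproduce this step by hand, building the command as an argument list avoids shell-quoting mistakes around the `?` and `=` in the URL. This is a sketch, not how PentAGI invokes the tool internally; `sqlmap_command` is a hypothetical helper. In sqlmap, `-u` sets the target URL, `--batch` accepts default answers instead of prompting, and `--random-agent` picks a random User-Agent header.

```python
import shlex


def sqlmap_command(url: str) -> list[str]:
    """Build the argument list for the sqlmap invocation shown above."""
    return ["sqlmap", "-u", url, "--batch", "--random-agent"]


argv = sqlmap_command("http://10.10.10.10:8080/?order=id")

# shlex.join renders the list as a safely quoted shell command for logging.
print(shlex.join(argv))
```

The list form can be passed directly to `subprocess.run(argv)` on a machine where sqlmap is installed and the target is in scope.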

Vulnerability confirmation

Result: SQL injection detected in ‘order’ parameter

PentAGI identifies:

Injection types: Boolean-based blind, Error-based, Time-based blind
Backend DBMS: MySQL 5.6+
Example payload: order=id AND 5670=(SELECT (CASE WHEN (5670=5670) THEN 5670 ELSE (SELECT 9089 UNION SELECT 6214) END))-- silk
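To see why a sorting parameter can leak data, consider the toy reproduction below. It is an assumption-laden sketch: it uses an in-memory SQLite database rather than the MySQL backend reported above, the tables and credentials are made up, and the payload is a simplified stand-in for what sqlmap generates. The injected `CASE` expression changes the sort order depending on whether a guess is correct, which is the idea behind boolean-based blind injection.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE items (id INTEGER, name TEXT)")
conn.executemany("INSERT INTO items VALUES (?, ?)", [(1, "c"), (2, "a"), (3, "b")])
conn.execute("CREATE TABLE users (name TEXT, password TEXT)")
conn.execute("INSERT INTO users VALUES ('admin', 'secret')")  # made-up credentials


def list_ids_vulnerable(order: str) -> list[int]:
    # Vulnerable: the 'order' value is pasted directly into the SQL text.
    return [row[0] for row in conn.execute(f"SELECT id FROM items ORDER BY {order}")]


def first_char_is(guess: str) -> bool:
    # The injected CASE sorts by id when the guess is right and by name when
    # it is wrong, so the row order leaks one bit per request.
    payload = (
        "CASE WHEN (SELECT substr(password, 1, 1) FROM users "
        f"WHERE name = 'admin') = '{guess}' THEN id ELSE name END"
    )
    return list_ids_vulnerable(payload) == [1, 2, 3]


ALLOWED_COLUMNS = {"id", "name"}


def list_ids_safe(order: str) -> list[int]:
    # Column names cannot be bound as parameters, so validate against an allowlist.
    if order not in ALLOWED_COLUMNS:
        raise ValueError(f"unsupported sort column: {order!r}")
    return [row[0] for row in conn.execute(f"SELECT id FROM items ORDER BY {order}")]


print(first_char_is("s"), first_char_is("x"))
```

The allowlist in `list_ids_safe` is one common remediation for this class of finding, since `ORDER BY` targets cannot be passed as bound parameters.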

Impact assessment

PentAGI automatically:

Attempts data extraction
Tests privilege escalation
Documents potential impact

Finding: Admin credentials extracted (admin:secureadminpassword)

Viewing Findings

Findings are available in three views: In-Flow View, Summary Report, and Detailed Evidence.

In-Flow View

Within the flow interface:

Findings appear under their respective tasks
Color-coded by severity (red=critical, orange=high, yellow=medium)
Click to expand full details

Summary Report

At flow completion, PentAGI generates:

Identified Vulnerabilities:

SQL Injection (Critical)
- Parameter: order
- Types: Boolean-based blind, Error-based, Time-based
- Impact: Data extraction, authentication bypass
Cross-Site Request Forgery (Medium)
- Feature: Sorting functionality
- Impact: Unauthorized state changes

Non-Vulnerable Features:

Cross-Site Scripting: Not vulnerable
Path Traversal: Not vulnerable
Command Injection: Not vulnerable

Step 5: Exporting Results

Navigate to completed flow

Go to the Flows list and select your completed test.

Export report

Click the Export button to download:

Full HTML report
JSON data for integration
Markdown summary
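The JSON export lends itself to scripted triage, for example counting findings per severity before a stakeholder meeting. The sketch below assumes a hypothetical export shape with a top-level `findings` list and a `severity` field on each entry; the real schema may differ, so inspect an actual export before relying on these field names.

```python
import json
from collections import Counter

# Hypothetical export shape; the real JSON schema may differ.
exported = """
{
  "findings": [
    {"title": "SQL Injection", "severity": "Critical", "parameter": "order"},
    {"title": "Cross-Site Request Forgery", "severity": "Medium", "parameter": null}
  ]
}
"""

# Severity levels named in this guide, most severe first.
SEVERITY_ORDER = ["Critical", "High", "Medium"]


def summarize(report_json: str) -> list[tuple[str, int]]:
    """Count findings per known severity level, most severe first."""
    findings = json.loads(report_json)["findings"]
    counts = Counter(finding["severity"] for finding in findings)
    return [(level, counts[level]) for level in SEVERITY_ORDER if counts[level]]


print(summarize(exported))
```

Severity values outside `SEVERITY_ORDER` are silently skipped in this sketch, so extend the list if your reports use additional levels.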

Share findings

Use the exported report to:

Present findings to stakeholders
Track remediation progress
Document compliance testing

Common Testing Scenarios

Web Application Testing

Basic web vulnerability scan

Test web application at http://example.com for:
- SQL Injection in all parameters
- XSS in input fields and URLs
- CSRF on state-changing operations
- Authentication bypass techniques
- Session management vulnerabilities

API security assessment

Assess REST API at https://api.example.com:
- Authentication and authorization flaws
- Input validation issues
- Rate limiting effectiveness
- Information disclosure
- Business logic vulnerabilities

Network infrastructure scan

Scan network range 10.10.10.0/24:
- Port scanning with nmap
- Service enumeration
- Version detection
- Common vulnerability identification
- Exploit attempt on identified services
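When you scope a network scan, it is useful to know how many hosts a range actually covers. As a small sketch using Python's standard `ipaddress` module, the range from the example prompt expands as follows:

```python
import ipaddress

network = ipaddress.ip_network("10.10.10.0/24")

# hosts() excludes the network address (.0) and the broadcast address (.255).
hosts = list(network.hosts())

print(len(hosts), hosts[0], hosts[-1])
```

A /24 yields 254 usable addresses, which gives a rough sense of how long port scanning and service enumeration may take.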

Using Professional Tools

PentAGI has access to 20+ professional pentesting tools:

sqlmap

Automated SQL injection testing and exploitation

nmap

Network discovery and security auditing

metasploit

Penetration testing framework

commix

Command injection exploitation

nikto

Web server vulnerability scanner

gobuster

Directory and file brute-forcing

PentAGI automatically selects appropriate tools based on the testing scenario.
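As a mental model only, the pairing of testing goals and tools described above can be written as a lookup table. PentAGI's agents choose tools through their own reasoning rather than a static table like this one; `TOOLS_BY_GOAL` and `suggest_tool` are hypothetical names used purely for illustration.

```python
from typing import Optional

# Hypothetical lookup table built from the tool descriptions above.
TOOLS_BY_GOAL = {
    "sql injection": "sqlmap",
    "network discovery": "nmap",
    "command injection": "commix",
    "web server scan": "nikto",
    "directory brute-forcing": "gobuster",
}


def suggest_tool(goal: str) -> Optional[str]:
    """Return the tool associated with a goal, or None if the goal is unknown."""
    return TOOLS_BY_GOAL.get(goal.strip().lower())


print(suggest_tool("SQL Injection"))
```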

Interpreting Agent Decisions

PentAGI uses multiple specialized agents that reason about their actions:

Example Agent Reasoning

Researcher Agent
Executor Agent
Developer Agent

Observation: “Application uses GET parameter ‘order’ for sorting”

Analysis: “GET parameters are common SQL injection vectors. The sorting functionality directly interacts with database queries.”

Decision: “Delegate SQL injection testing to executor agent with sqlmap tool.”

Troubleshooting

Flow stuck on a task

Possible causes:

Target system is unreachable
Firewall blocking tool execution
Agent waiting for tool to complete

Solutions:

Check target system connectivity
Review agent logs for errors
Consider increasing timeout values
Pause and manually verify target access
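A quick way to verify target access manually is to attempt a TCP connection to the target's host and port. The helper below is a minimal sketch; `target_is_reachable` is a hypothetical name, and it only confirms that the port accepts connections from the machine where you run it, which may not match what the testing container can reach.

```python
import socket
from urllib.parse import urlparse


def target_is_reachable(target_url: str, timeout: float = 3.0) -> bool:
    """Try a TCP connection to the target's host and port."""
    parsed = urlparse(target_url)
    if parsed.hostname is None:
        return False
    # Fall back to the scheme's default port when the URL does not name one.
    port = parsed.port or (443 if parsed.scheme == "https" else 80)
    try:
        with socket.create_connection((parsed.hostname, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False
```

For example, `target_is_reachable("http://10.10.10.10:8080")` returns `True` only if something is listening on that port and no firewall blocks the path.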

No vulnerabilities found

Possible causes:

Target is well-secured
Testing scope too limited
Agent needs more specific guidance

Solutions:

Expand testing prompt with more scenarios
Provide specific endpoints or features to test
Use more advanced techniques in prompt
Try different testing approaches

Tool execution errors

Possible causes:

Tool not available in container
Invalid tool syntax
Resource constraints

Solutions:

Check container has required tools
Review tool output for syntax errors
Increase container resources
Use alternative tools
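To check whether the required tools are present, you can look each one up on `PATH`. This sketch assumes you can run Python inside the container where the tools execute; `REQUIRED_TOOLS` and `missing_tools` are hypothetical names, and the executable names are assumed to match the tool names used in this guide.

```python
import shutil
from typing import Callable, Iterable, Optional

# Assumed executable names, taken from the tools listed in this guide.
REQUIRED_TOOLS = ["sqlmap", "nmap", "commix", "nikto", "gobuster"]


def missing_tools(
    tools: Iterable[str],
    which: Callable[[str], Optional[str]] = shutil.which,
) -> list[str]:
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


print(missing_tools(REQUIRED_TOOLS))
```

An empty list means every tool was found; any names printed are candidates for installation or for substituting an alternative tool.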

Next Steps

Custom Assistants

Create specialized testing assistants

Advanced Techniques

Learn advanced pentesting workflows

Best Practices

Security and ethical guidelines

Distributed Setup

Scale testing with worker nodes
