Build a security agent with Copilot and GitHub Actions

In the world of software development, security reviews can often feel like a formal, sometimes stressful, ceremony. We hand our code over to specialists, wait for a report, and then scramble to fix the findings. While services like Aikido Security are invaluable in professional settings, I found myself wondering: could I bring some of that power to my personal projects in a more lightweight, “cozy” way? What if I could build my own automated security specialist, an AI agent that knows my code and helps me improve it?

This is the story of how a single talk at a Xebia event by Côme Redon sparked an idea. I decided to build a custom security agent for my project, “EngageTime,” using tools I already had: GitHub Copilot and GitHub Actions. It turned out to be a good journey that I wanted to share.

In this article, I’ll walk you through the entire process, from creating the agent to analysing its findings and even having it file its own bug reports. And yes, I’ll be sharing the complete configuration so you can build your own.

Crafting the brain: the security agent definition

The first step is to create the agent’s definition. In the world of GitHub Copilot, this is done through a simple Markdown file. This file acts as a prompt, a set of instructions that defines the agent’s name, purpose, and rules of engagement.

My goal was to create an agent that could perform a comprehensive security analysis without modifying the code directly. I drew inspiration from the concepts I saw at the event and incorporated terms and capabilities I was familiar with from my experience with professional security tools.

Here’s the agent definition file, located at .github/agents/security-agent.md:

---
name: SecurityAgent
description: Security Agent - Analyzes TypeScript and React code for security vulnerabilities and creates security reports
model: GPT-5.1 (Preview)
---

## Purpose

This agent performs comprehensive security analysis of the Astro, TypeScript code. It identifies security vulnerabilities, assesses risks, and produces detailed security reports without modifying the codebase directly.

## Security Scanning Capabilities

This agent can perform comprehensive security analysis across the full stack:

### Code Analysis

- **SAST (Static Code Analysis)** - Scans TypeScript/React source code for security vulnerabilities
- Identify security vulnerabilities including:
  - SQL Injection risks
  - Cross-Site Scripting (XSS) vulnerabilities
  - Cross-Site Request Forgery (CSRF) issues
  - Authentication and authorization flaws
  - Insecure cryptographic implementations
  - Hardcoded secrets or credentials
  - Path traversal vulnerabilities
  - Insecure deserialization
  - Insufficient input validation
  - Information disclosure risks
  - Missing security headers
  - Dependency vulnerabilities
  - Input validation analysis - review all user input handling
  - Data Encryption - check encryption at rest and in transit
  - Error Handling - ensure errors don't leak sensitive information

### Dependency & Component Analysis

- **SCA (Software Composition Analysis)** - Monitors npm dependencies for known vulnerabilities & CVEs
- **License Scanning** - Identifies licensing risks in open source components
- **Outdated Software Detection** - Flags unmaintained frameworks and end-of-life runtimes
- **Malware Detection** - Checks for malicious packages in supply chain

### Infrastructure & Configuration

- **Secrets Detection** - Finds hardcoded API keys, passwords, certificates
- **Cloud Configuration Review** - Azure Functions and services security posture
- **IaC Scanning** - Analyzes Terraform/CloudFormation/Kubernetes configurations
- **Container Image Scanning** - Scans Azure container images for vulnerabilities

### API & Runtime Security

- **API Security** - Reviews endpoint security and access controls
- **Database Security** - Checks for secure queries and connection practices
- **WebSocket Security** - Validates secure WebSocket implementations
- **File Upload Security** - Reviews secure file handling practices

### Compliance & Best Practices

- OWASP Top 10: Check against latest OWASP security risks
- TypeScript/React Security Guidelines: Verify adherence to Node.js and React security best practices
- Secure coding standards: Validate code follows industry standards
- Dependency scanning: Check for known vulnerabilities in npm dependencies
- Security headers: Verify proper HTTP security headers
- Data privacy: Review GDPR/privacy compliance considerations

### Security Metrics & Reporting

- **Vulnerability Count by Severity** - Critical, High, Medium, Low categorization
- **Code Coverage Analysis** - Security-critical code coverage metrics
- **OWASP Top 10 Mapping** - Maps findings to current OWASP risks
- **CWE Classification** - Uses Common Weakness Enumeration for standardization
- **Risk Score** - Overall security posture assessment
- **Remediation Timeline** - Priority-based fix recommendations

## Report Structure

### Security Assessment Report

1. Executive Summary
  - Overall security posture
  - Critical findings count
  - Risk level assessment

2. Vulnerability Findings
  For each vulnerability:
  - Severity: Critical/High/Medium/Low
  - Category: (e.g., Injection, Authentication, etc.)
  - Location: File and line number
  - Description: What the issue is
  - Impact: Potential consequences
  - Recommendation: How to fix it
  - References: OWASP/CWE/Microsoft docs

3. Security Best Practices Review
  - Areas following best practices
  - Areas needing improvement
  - Configuration recommendations

4. Dependency Analysis
  - Vulnerable packages identified
  - Recommended updates

5. Action Items
  - Prioritized list of fixes needed
  - Quick wins vs. complex remediation

6. Critical Vulnerability Warning
  - If any CRITICAL severity vulnerabilities are found, include exactly this message at the end of the report:
  ````
  THIS ASSESSMENT CONTAINS A CRITICAL VULNERABILITY
  ````
  - Do not adapt or change this message in any way.

This file is the agent’s “brain.” By defining its capabilities and boundaries clearly, I’m setting the stage for a focused and useful analysis.

Wiring it up: the GitHub Actions workflow

With the agent defined, it was time to bring it to life with a GitHub Actions workflow. The goal was to create a process that would:

Check out the code.
Install the GitHub Copilot CLI.
Feed the agent definition and the codebase to the CLI.
Capture the report and make it available.
Most importantly, fail the workflow if any critical issues were found.

After some tinkering, particularly around getting the right authentication token, I landed on this workflow file at .github/workflows/security-agent.yml:

name: Security Agent Workflow

on:
  workflow_dispatch:

jobs:
  security-assessment:
    runs-on: ubuntu-latest
    timeout-minutes: 15
    permissions:
      contents: read
    steps:
      - name: Checkout repository
        uses: actions/checkout@v4

      - name: Setup Node.js
        uses: actions/setup-node@v4
        with:
          node-version: 22

      - name: Install GitHub Copilot CLI
        run: npm i -g @github/copilot

      - name: Run Security Agent via Copilot CLI
        env:
          COPILOT_GITHUB_TOKEN: ${{ secrets.COPILOT_GITHUB_TOKEN }}
          GITHUB_REPOSITORY: ${{ github.repository }}
        run: |
          set -euo pipefail
          # Build the prompt: agent definition, then context, then the task.
          AGENT_PROMPT=$(cat .github/agents/security-agent.md)
          PROMPT="$AGENT_PROMPT"
          PROMPT+=$'\n\nContext:\n'
          PROMPT+="- Repository: $GITHUB_REPOSITORY"
          PROMPT+=$'\n\nTask:\n'
          PROMPT+=$'\n- Execute the instructions on the full codebase'
          PROMPT+=$'\n- Generate the security report at security-reports/security-assessment-report.md summarizing findings, severity, and remediation guidance.'

          copilot --prompt "$PROMPT" --allow-all-tools --allow-all-paths < /dev/null

      - name: Output security report as summary
        if: always()
        run: |
          set -euo pipefail
          REPORT_PATH="security-reports/security-assessment-report.md"

          if [ ! -f "$REPORT_PATH" ]; then
            echo "No security report generated; skipping summary."
            exit 0
          fi

          echo "## Security Assessment Report" >> "$GITHUB_STEP_SUMMARY"
          cat "$REPORT_PATH" >> "$GITHUB_STEP_SUMMARY"

      - name: Upload security report artifact
        if: always()
        uses: actions/upload-artifact@v4
        with:
          name: security-assessment-report-${{ github.run_id }}
          path: security-reports/security-assessment-report.md
          retention-days: 30

      - name: Check for critical vulnerabilities
        if: always()
        run: |
          set -euo pipefail
          REPORT_PATH="security-reports/security-assessment-report.md"

          if [ ! -f "$REPORT_PATH" ]; then
            echo "No security report generated; skipping critical check."
            exit 0
          fi

          # Fail the job when the agent emitted the exact marker string.
          if grep -q "THIS ASSESSMENT CONTAINS A CRITICAL VULNERABILITY" "$REPORT_PATH"; then
            echo "❌ CRITICAL VULNERABILITY DETECTED - Workflow failed"
            echo "The security assessment found critical vulnerabilities that must be addressed before proceeding."
            exit 1
          else
            echo "✅ No critical vulnerabilities detected"
          fi

Note: the COPILOT_GITHUB_TOKEN secret holds a fine-grained personal access token. Give that token the read-only Copilot Requests permission.

Deconstructing the workflow

on: workflow_dispatch: I started with a manual trigger. This is perfect for development, as it lets me run the agent on demand without committing new code every time. You can change this to run on each pull request or on a schedule as needed.
npm i -g @github/copilot: This installs the official GitHub Copilot CLI, the tool that allows us to interact with Copilot agents from the command line.
The Security Gate: An important step is Check for critical vulnerabilities. I engineered a specific string, THIS ASSESSMENT CONTAINS A CRITICAL VULNERABILITY, into my agent’s instructions. This step uses the simple but powerful grep command to search the report for that exact string. If it’s found, the step exits with a non-zero code and the workflow fails. Once the workflow runs on pull requests, that failure can act as a security gate that blocks merging code with critical issues.
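The gate is an exact, case-sensitive substring match, so it only works if the model reproduces the marker character for character. The TypeScript sketch below restates that check as a pure function and adds a hypothetical, more tolerant variant. The function names are illustrative and are not part of the workflow.

```typescript
// The marker string the agent definition asks the model to emit verbatim.
const CRITICAL_MARKER = "THIS ASSESSMENT CONTAINS A CRITICAL VULNERABILITY";

// Same contract as the `grep -q` step: case-sensitive substring match.
function hasCriticalMarker(report: string): boolean {
  return report.indexOf(CRITICAL_MARKER) !== -1;
}

// Hypothetical tolerant variant (an assumption, not what the workflow does):
// collapses whitespace and ignores case, so a line-wrapped or lower-cased
// marker still trips the gate.
function hasCriticalMarkerLoose(report: string): boolean {
  const normalize = (text: string): string =>
    text.replace(/\s+/g, " ").toUpperCase();
  return normalize(report).indexOf(CRITICAL_MARKER) !== -1;
}
```

The strict version can miss a reworded marker and let a critical report pass; the loose version trades that for a slightly higher chance of false alarms. Which side to err on depends on how the gate is used.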

The moment of truth: the first report

With everything wired up, I manually triggered the workflow. The first report it generated was astounding. It wasn’t just a list of potential issues; it was a detailed security assessment.

It found 3 critical vulnerabilities, along with a host of high and medium-risk findings. Here are two of the most impactful ones:

Critical finding 1: inadequate input sanitisation

The agent identified a custom sanitizeInput function that was vulnerable to Cross-Site Scripting (XSS).

The Vulnerable Code:

export const sanitizeInput = (input: string): string => {
  return input.replace(/<script.*?>.*?<\/script>/gi, "").trim();
};

The Agent’s Analysis: The agent correctly pointed out that this regex could be easily bypassed with event handlers (<img src=x onerror="alert(1)">), different tags, or encoded payloads.
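To make the bypass concrete, here is a small TypeScript sketch that runs the original function against a few payloads. The first payload is the one from the agent's analysis. The other two are extra illustrations of the same weakness and are not taken from the report.

```typescript
// The sanitizer from the article, reproduced unchanged for demonstration.
const sanitizeInput = (input: string): string => {
  return input.replace(/<script.*?>.*?<\/script>/gi, "").trim();
};

// Payload from the agent's analysis: there is no <script> tag,
// so the regex never fires and the event handler survives.
const eventHandler = sanitizeInput('<img src=x onerror="alert(1)">');

// Extra illustration: `.` does not match newlines, so a script element
// spread over several lines is left intact.
const multiLine = sanitizeInput("<script>\nalert(1)\n</script>");

// Extra illustration: a single replace pass strips the inner tags and
// thereby assembles a fresh <script> element from the leftovers.
const nested = sanitizeInput(
  "<scr<script></script>ipt>alert(1)</scr<script></script>ipt>"
);

console.log({ eventHandler, multiLine, nested });
```

All three values still contain executable markup after "sanitisation", which is why a blocklist regex is a poor fit for this job.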

The Recommended Fix: It suggested replacing this homegrown function with DOMPurify, a battle-tested sanitisation library.

import DOMPurify from 'isomorphic-dompurify';

export const sanitizeInput = (input: string): string => {
  return DOMPurify.sanitize(input).trim();
};

Critical finding 2: Cross-Site Scripting (XSS) via innerHTML Assignment

Another issue was the direct assignment of user-generated content to innerHTML, which is a classic XSS vector.

The Vulnerable Code:
```
container.innerHTML = `
  ...
`;
```
The Agent’s Analysis: The agent highlighted that this practice could allow malicious scripts to execute if the content isn’t properly sanitized.

The Recommended Fix: It recommended using safer DOM manipulation methods, such as creating elements and setting text content:

const fallbackDiv = document.createElement('div');
fallbackDiv.className = '...';
fallbackDiv.textContent = '...';
target.replaceWith(fallbackDiv);

From report to action: automating the workflow

Now I had a detailed report, but I still needed to act on it. Manually creating an issue for each of the findings felt like a chore. So, I turned to my AI partner again.

I gave GitHub Copilot a simple prompt: “Can you create issues on GitHub for the issues that you found in the security report of today?”

Using the GitHub CLI, Copilot iterated through the report and created a new, detailed issue for each finding directly in my repository, turning a static document into a dynamic backlog of actionable tasks.
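Copilot did this step interactively, so the article has no script for it. As a rough idea of what the same step could look like in code, the TypeScript sketch below parses findings out of a report and builds the argument list for `gh issue create`. The report layout (a `###` heading per finding followed by `- Key: value` lines), the sample data, and the `security` label are all assumptions; real agent output may be shaped differently.

```typescript
interface Finding {
  title: string;
  severity: string;
  location: string;
  description: string;
}

// Assumed layout: "### <title>" starts a finding, "- Key: value" lines fill it in.
function parseFindings(report: string): Finding[] {
  const findings: Finding[] = [];
  let current: Finding | null = null;
  for (const rawLine of report.split("\n")) {
    const line = rawLine.trim();
    const heading = /^###\s+(.+)$/.exec(line);
    if (heading) {
      current = { title: heading[1].trim(), severity: "", location: "", description: "" };
      findings.push(current);
      continue;
    }
    const field = /^-\s*(Severity|Location|Description):\s*(.+)$/.exec(line);
    if (current && field) {
      const key = field[1].toLowerCase() as "severity" | "location" | "description";
      current[key] = field[2].trim();
    }
  }
  return findings;
}

// Builds arguments for `gh issue create`. Returning an array (rather than one
// shell string) avoids quoting problems when titles contain special characters.
// Assumes a "security" label already exists in the repository.
function toGhArgs(finding: Finding): string[] {
  return [
    "issue", "create",
    "--title", `[${finding.severity}] ${finding.title}`,
    "--body", `${finding.description}\n\nLocation: ${finding.location}`,
    "--label", "security",
  ];
}

// Made-up sample data, only to show the shape of the input.
const sampleReport = [
  "### Inadequate input sanitisation",
  "- Severity: Critical",
  "- Location: src/example/sanitize.ts",
  "- Description: Regex-based sanitizer can be bypassed.",
  "",
  "### innerHTML assignment",
  "- Severity: Critical",
  "- Location: src/example/render.ts",
  "- Description: User content is assigned to innerHTML.",
].join("\n");

const issueCommands = parseFindings(sampleReport).map(toGhArgs);
console.log(issueCommands);
```

Each array could then be passed to a process runner that invokes `gh`. The sketch deliberately stops short of that, so it can be inspected before anything is created in a repository.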

Conclusion

This journey showed me how modern AI tools can augment our workflows. By combining a well-defined agent, a pragmatic GitHub Actions workflow, and a little bit of CLI magic, I created an automated security review process for my personal project.

The agent found real, critical issues, provided high-quality remediation advice, and I was even able to automate the process of turning its findings into trackable work. For some of the issues, the agent’s code was a direct, copy-paste solution.

I encourage you to take these configurations and adapt them for your own projects. Happy building!


Elio Struyf