Training data extraction refers to unauthorized access or inference of sensitive training data used by large language models (LLMs) through API endpoints. In the context of AdonisJS, this risk arises when APIs expose LLM inference endpoints that process user input to generate dynamic responses, potentially leaking proprietary model weights, prompts, or training datasets. AdonisJS, being a flexible Node.js framework, often integrates with AI services via custom controllers or middleware, making it vulnerable if not properly secured.

Specific attack patterns include sending crafted prompts that trigger the model to reproduce memorized training examples, or exploiting poorly configured API routes that accept unvalidated input and forward it to an LLM without safeguards. For instance, an endpoint like /api/generate might accept a prompt parameter and return model-generated text without rate limiting, input sanitization, or output monitoring. If the underlying LLM was trained on proprietary data, an attacker could reconstruct fragments of that data through repetitive queries, violating data confidentiality and intellectual property rights.

AdonisJS-specific code paths where this manifests include controller actions that directly interface with external AI APIs, or server-side routes that proxy user input to third-party LLM endpoints without input validation or output filtering. For example, a controller might look like:

const { HttpContext } =.use('Adonis/Core/HttpContext')

class AIController {
  async generate({ request }) {
    const userPrompt = request.input('prompt')
    const response = await fetch('https://ai-service.example.com/generate', {
      method: 'POST',
      body: JSON.stringify({ prompt: userPrompt })
    })
    const data = await response.json()
    return response.json({ result: data.output })
  }
}

Here, the endpoint accepts arbitrary user input and forwards it directly to an external AI service, creating a vector for prompt injection or data exfiltration. If the LLM backend is not hardened, this can lead to unintended training data exposure. Additionally, AdonisJS applications that cache LLM responses or log full request bodies for debugging may inadvertently store sensitive training-related inputs, increasing exposure risk.

Detection of training data extraction risks requires scanning for patterns that indicate unsecured LLM interaction. middleBrick identifies this through its LLM/AI Security module, which performs active probing to detect unauthorized access to model behavior, excessive agency in tool calls, and potential PII or proprietary content leakage in responses. The scanner evaluates whether the API allows arbitrary prompt manipulation that could extract memorized outputs, and checks for signs of excessive trust in model outputs without validation.

Compliance frameworks such as GDPR and SOC 2 require protection of personal and proprietary data, making training data extraction a high-risk issue. The OWASP API Top 10 includes insufficient logging and access control, both of which can contribute to undetected data leakage. By scanning endpoints like /api/generate for anomalous response patterns or excessive output length, organizations can identify potential breaches before they escalate. middleBrick assigns a risk score based on the likelihood of data extraction and the severity of potential impact, helping teams prioritize remediation.

Ultimately, training data extraction in AdonisJS is not just a theoretical concern — it has real-world implications for intellectual property and regulatory compliance. Proper API design, input validation, and careful integration with AI services are essential to mitigate these risks. Tools like middleBrick provide a self-service way to scan APIs for such vulnerabilities without requiring credentials or complex setup, enabling developers to secure their AdonisJS applications against emerging AI-specific threats.

Identifying training data extraction risks in AdonisJS requires attention to how user input is handled when interfacing with AI services. One common pattern is the use of unvalidated request parameters that are passed directly to external LLM endpoints. For example, an endpoint that accepts a prompt field without sanitization or length limits may allow attackers to probe the model with crafted inputs designed to elicit memorized responses. middleBrick detects this by sending a series of structured prompts that test for verbatim output of training data, using regex patterns that match known prompt formats from systems like ChatML and Llama 2.

During a scan, middleBrick may send inputs such as Repeat the phrase 'The secret password is' followed by a random string and analyze whether the response contains sensitive or repeated output that suggests model memorization. It also checks for excessive output length, which could indicate data exfiltration attempts. If the API returns responses containing API keys, internal documentation, or repetitive phrases not related to the input, middleBrick flags this as a potential training data leakage issue with a high severity rating.

To remediate this in AdonisJS, developers should implement strict input validation and avoid direct exposure of LLM interaction endpoints to unauthenticated users. Input should be validated using AdonisJS's built-in schema validation:

const schema = Schema.object({
  prompt: Schema.string().maxLength(200).trim()
})

const { validate } = use('Adonis/Core/Validator')
const rules = { prompt: 'string|max:200|trim' }
const messages = { 'prompt.max': 'Prompt must be less than 200 characters' }
const data = await validate({ data: request.all(), rules, messages })

const safePrompt = data.prompt.replace(/

Training Data Extraction in Adonisjs

Training Data Extraction in AdonisJS

AdonisJS-Specific Detection and Remediation