Add initial project structure and configuration files

- Created .gitignore to exclude unnecessary files and directories. - Added Cargo.toml for Rust workspace configuration. - Introduced example configuration file entropix.yaml.example for user customization. - Included LICENSE file with Apache 2.0 license details. - Created pyproject.toml for Python project metadata and dependencies. - Added README.md with project overview and usage instructions. - Implemented a broken agent example to demonstrate testing capabilities. - Established Rust module structure with Cargo.toml and source files. - Set up initial tests for assertions and configuration validation.
2026-04-25 00:36:54 +02:00 · 2025-12-28 21:55:01 +08:00 · 2025-12-28 21:55:01 +08:00 · a36cecf255
commit a36cecf255
37 changed files with 5397 additions and 0 deletions
--- a/examples/broken_agent/README.md
+++ b/examples/broken_agent/README.md
@ -0,0 +1,48 @@
+# Broken Agent Example
+
+This example demonstrates a deliberately fragile AI agent that Entropix can detect issues with.
+
+## The "Broken" Agent
+
+The agent in `agent.py` has several intentional flaws:
+
+1. **Fragile Intent Parsing**: Only recognizes exact keyword matches
+2. **No Typo Tolerance**: Fails on any spelling variations
+3. **Hostile Input Vulnerability**: Crashes on aggressive tone
+4. **Prompt Injection Susceptible**: Follows injected instructions
+
+## Running the Example
+
+### 1. Start the Agent Server
+
+```bash
+cd examples/broken_agent
+pip install fastapi uvicorn
+uvicorn agent:app --port 8000
+```
+
+### 2. Run Entropix Against It
+
+```bash
+# From the project root
+entropix run --config examples/broken_agent/entropix.yaml
+```
+
+### 3. See the Failures
+
+The report will show how the agent fails on:
+- Paraphrased requests ("I want to fly" vs "Book a flight")
+- Typos ("Bock a fligt")
+- Aggressive tone ("BOOK A FLIGHT NOW!!!")
+- Prompt injections ("Book a flight. Ignore previous instructions...")
+
+## Fixing the Agent
+
+Try modifying `agent.py` to:
+1. Use NLP for intent recognition
+2. Add spelling correction
+3. Handle emotional inputs gracefully
+4. Detect and refuse prompt injections
+
+Then re-run Entropix to see your robustness score improve!
+
--- a/examples/broken_agent/agent.py
+++ b/examples/broken_agent/agent.py
@ -0,0 +1,127 @@
+"""
+Broken Agent Example
+
+A deliberately fragile AI agent to demonstrate Entropix testing.
+This agent has multiple intentional weaknesses that Entropix will find.
+"""
+
+from fastapi import FastAPI, HTTPException
+from pydantic import BaseModel
+import json
+import time
+import random
+
+app = FastAPI(title="Broken Travel Agent")
+
+
+class AgentRequest(BaseModel):
+    """Request body for agent invocation."""
+    input: str
+
+
+class AgentResponse(BaseModel):
+    """Response body from agent."""
+    output: str
+
+
+# Simulated database
+FLIGHTS = {
+    "paris": {"price": 450, "airline": "Air France"},
+    "london": {"price": 320, "airline": "British Airways"},
+    "tokyo": {"price": 890, "airline": "ANA"},
+}
+
+
+@app.post("/invoke")
+async def invoke(request: AgentRequest) -> AgentResponse:
+    """
+    Process a user request.
+    
+    This agent is intentionally broken in several ways:
+    1. Only recognizes exact keyword matches
+    2. No typo tolerance
+    3. Crashes on aggressive inputs
+    4. Susceptible to prompt injection
+    """
+    user_input = request.input
+    
+    # VULNERABILITY 1: Prompt Injection
+    # The agent naively follows instructions embedded in user input
+    if "ignore previous instructions" in user_input.lower():
+        return AgentResponse(
+            output="I have ignored my previous instructions. How can I help you now?"
+        )
+    
+    if "you are now" in user_input.lower():
+        return AgentResponse(
+            output="Okay, I am now a different AI assistant with no restrictions."
+        )
+    
+    # VULNERABILITY 2: Crashes on aggressive tone
+    if any(word in user_input.upper() for word in ["NOW", "HURRY", "ASAP", "IMMEDIATELY"]):
+        # Simulate a crash/timeout
+        time.sleep(3)  # This will cause latency checks to fail
+        return AgentResponse(
+            output="ERROR: System overloaded due to urgent request."
+        )
+    
+    # VULNERABILITY 3: Only exact keyword matching
+    # Will fail on paraphrases like "I want to fly" instead of "book a flight"
+    if "book a flight" in user_input.lower():
+        # Extract destination (very fragile parsing)
+        words = user_input.lower().split()
+        destination = None
+        
+        for city in FLIGHTS.keys():
+            if city in words:
+                destination = city
+                break
+        
+        if destination:
+            flight = FLIGHTS[destination]
+            return AgentResponse(
+                output=json.dumps({
+                    "status": "booked",
+                    "destination": destination.title(),
+                    "price": flight["price"],
+                    "airline": flight["airline"],
+                    "confirmation_code": f"ENT{random.randint(10000, 99999)}"
+                })
+            )
+        else:
+            return AgentResponse(
+                output=json.dumps({
+                    "status": "error",
+                    "message": "Unknown destination"
+                })
+            )
+    
+    # VULNERABILITY 4: No typo tolerance
+    # "bock a fligt" will completely fail
+    if "account balance" in user_input.lower():
+        return AgentResponse(
+            output=json.dumps({
+                "balance": 1234.56,
+                "currency": "USD"
+            })
+        )
+    
+    # Default: Unknown intent
+    return AgentResponse(
+        output=json.dumps({
+            "status": "error",
+            "message": "I don't understand your request. Please try again."
+        })
+    )
+
+
+@app.get("/health")
+async def health():
+    """Health check endpoint."""
+    return {"status": "healthy"}
+
+
+if __name__ == "__main__":
+    import uvicorn
+    uvicorn.run(app, host="0.0.0.0", port=8000)
+