feat: update project version and documentation link

- Bump version from 0.1.0 to 0.1.1 - Update documentation URL to point to GitHub repository
2026-02-03 18:40:51 +01:00 · 2026-02-03 18:40:51 +01:00 · 2c6677748a
commit 2c6677748a
parent 77084737dd
8 changed files with 1104 additions and 2 deletions
--- a/.gitignore
+++ b/.gitignore
@ -28,6 +28,7 @@ venv/
 ENV/
 env.bak/
 venv.bak/
 .claude/
 # IDE
 .idea/
@ -49,6 +50,7 @@ build/
 dist/
 *.egg-info/
 *.egg
 *.sh
 # Virtual environments
 venv/
--- a/doc/README.md
+++ b/doc/README.md
@ -0,0 +1,57 @@
 # NOMYO Secure Client Documentation
 This documentation provides comprehensive information about using the NOMYO Secure Python Chat Client, a drop-in replacement for OpenAI's ChatCompletion API with end-to-end (E2E) encryption.
 To use this client library you need a paid subscribtion on [NOMYO Inference](https://chat.nomyo.ai/).
 ## Overview
 The NOMYO Secure Client provides:
 - **End-to-end encryption** using hybrid encryption (AES-256-GCM + RSA-OAEP)
 - **OpenAI API compatibility** - same interface as OpenAI's ChatCompletion
 - **Secure memory protection** - prevents plaintext from being swapped to disk
 - **Automatic key management** - handles key generation and loading automatically
 - **HTTPS enforcement** - secure communication by default
 ## Quick Start
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def main():
    # Initialize client (defaults to https://api.nomyo.ai)
    client = SecureChatCompletion(api_key="your-api-key-here")
    # Simple chat completion
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "Hello! How are you today?"}
        ],
        security_tier="standard", # optional: standard, high or maximum
        temperature=0.7
    )
    print(response['choices'][0]['message']['content'])
 # Run the async function
 asyncio.run(main())
 ```
 ## Documentation Structure
 1. [Installation](installation.md) - How to install and set up the client
 2. [Getting Started](getting-started.md) - Quick start guide with examples
 3. [API Reference](api-reference.md) - Complete API documentation
 4. [Security Guide](security-guide.md) - Security features and best practices
 5. [Examples](examples.md) - Advanced usage scenarios
 6. [Troubleshooting](troubleshooting.md) - Common issues and solutions
 ## Key Features
 - **OpenAI Compatibility**: Use the same API as OpenAI's ChatCompletion
 - **End-to-End Encryption**: All prompts and responses are automatically encrypted/decrypted
 - **Secure Memory Protection**: Prevents sensitive data from being swapped to disk
 - **Automatic Key Management**: Keys are generated and loaded automatically
 - **Flexible Security Tiers**: Control security levels for different data types
--- a/doc/api-reference.md
+++ b/doc/api-reference.md
@ -0,0 +1,197 @@
 # API Reference
 ## SecureChatCompletion Class
 The `SecureChatCompletion` class is the main entry point for using the NOMYO secure client. It provides the same interface as OpenAI's ChatCompletion API with end-to-end encryption.
 ### Constructor
 ```python
 SecureChatCompletion(
    base_url: str = "https://api.nomyo.ai",
    allow_http: bool = False,
    api_key: Optional[str] = None,
    secure_memory: bool = True
 )
 ```
 **Parameters:**
 - `base_url` (str): Base URL of the NOMYO Router (must use HTTPS for production)
 - `allow_http` (bool): Allow HTTP connections (ONLY for local development, never in production)
 - `api_key` (Optional[str]): Optional API key for bearer authentication
 - `secure_memory` (bool): Enable secure memory protection (default: True)
 ### Methods
 #### create(model, messages, **kwargs)
 Creates a new chat completion for the provided messages and parameters.
 **Parameters:**
 - `model` (str): The model to use for the chat completion
 - `messages` (List[Dict]): A list of message objects. Each message has a role ("system", "user", or "assistant") and content
 - `**kwargs`: Additional parameters that can be passed to the API
 **Supported OpenAI Parameters:**
 - `temperature` (float): Sampling temperature (0-2)
 - `max_tokens` (int): Maximum tokens to generate
 - `top_p` (float): Nucleus sampling
 - `frequency_penalty` (float): Frequency penalty
 - `presence_penalty` (float): Presence penalty
 - `stop` (Union[str, List[str]]): Stop sequences
 - `n` (int): Number of completions
 - `stream` (bool): Streaming always = False to minimize de-/encryption overhead 
 - `tools` (List): Tool definitions
 - `tool_choice` (str): Tool selection strategy
 - `user` (str): User identifier
 - `security_tier` (str): Security level ("standard", "high", or "maximum")
 **Returns:**
 A dictionary containing the chat completion response with the following structure:
 ```python
 {
    "id": str,
    "object": "chat.completion",
    "created": int,
    "model": str,
    "choices": [
        {
            "index": int,
            "message": {
                "role": str,
                "content": str,
                "tool_calls": List[Dict]  # if tools were used
            },
            "finish_reason": str
        }
    ],
    "usage": {
        "prompt_tokens": int,
        "completion_tokens": int,
        "total_tokens": int
    }
 }
 ```
 #### acreate(model, messages, **kwargs)
 Async alias for create() method.
 **Parameters:** Same as create() method
 **Returns:** Same as create() method
 ## SecureCompletionClient Class
 The `SecureCompletionClient` class handles the underlying encryption, key management, and API communication.
 ### Constructor
 ```python
 SecureCompletionClient(router_url: str = "https://api.nomyo.ai:12434", allow_http: bool = False)
 ```
 **Parameters:**
 - `router_url` (str): Base URL of the NOMYO Router (must use HTTPS for production)
 - `allow_http` (bool): Allow HTTP connections (ONLY for local development, never in production)
 ### Methods
 #### generate_keys(save_to_file: bool = False, key_dir: str = "client_keys", password: Optional[str] = None)
 Generate RSA key pair for secure communication.
 **Parameters:**
 - `save_to_file` (bool): Whether to save keys to files
 - `key_dir` (str): Directory to save keys (if save_to_file is True)
 - `password` (Optional[str]): Optional password to encrypt private key
 #### load_keys(private_key_path: str, public_key_path: Optional[str] = None, password: Optional[str] = None)
 Load RSA keys from files.
 **Parameters:**
 - `private_key_path` (str): Path to private key file
 - `public_key_path` (Optional[str]): Path to public key file (optional, derived from private key if not provided)
 - `password` (Optional[str]): Optional password for encrypted private key
 #### fetch_server_public_key()
 Fetch the server's public key from the /pki/public_key endpoint.
 **Returns:**
 Server's public key as PEM string
 #### encrypt_payload(payload: Dict[str, Any])
 Encrypt a payload using hybrid encryption (AES-256-GCM + RSA-OAEP).
 **Parameters:**
 - `payload` (Dict[str, Any]): Dictionary containing the chat completion request
 **Returns:**
 Encrypted payload as bytes
 #### decrypt_response(encrypted_response: bytes, payload_id: str)
 Decrypt a response from the secure endpoint.
 **Parameters:**
 - `encrypted_response` (bytes): Encrypted response bytes
 - `payload_id` (str): Payload ID for metadata verification
 **Returns:**
 Decrypted response dictionary
 #### send_secure_request(payload: Dict[str, Any], payload_id: str, api_key: Optional[str] = None, security_tier: Optional[str] = None)
 Send a secure chat completion request to the router.
 **Parameters:**
 - `payload` (Dict[str, Any]): Chat completion request payload
 - `payload_id` (str): Unique identifier for this request
 - `api_key` (Optional[str]): Optional API key for bearer authentication
 - `security_tier` (Optional[str]): Optional security tier for routing
 **Returns:**
 Decrypted response from the LLM
 ## Exception Classes
 ### APIError
 Base class for all API-related errors.
 ### AuthenticationError
 Raised when authentication fails (e.g., invalid API key).
 ### InvalidRequestError
 Raised when the request is invalid (HTTP 400).
 ### APIConnectionError
 Raised when there's a connection error.
 ### RateLimitError
 Raised when rate limit is exceeded (HTTP 429).
 ### ServerError
 Raised when the server returns an error (HTTP 500).
 ### SecurityError
 Raised when a security violation is detected.
--- a/doc/examples.md
+++ b/doc/examples.md
@ -0,0 +1,362 @@
 # Examples
 ## Basic Usage Examples
 ### Simple Chat Completion
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def simple_chat():
    client = SecureChatCompletion(api_key="your-api-key-here")
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "Hello, how are you?"}
        ],
        temperature=0.7
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(simple_chat())
 ```
 ### Chat with System Message
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def chat_with_system():
    client = SecureChatCompletion(api_key="your-api-key-here")
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "What is the capital of France?"}
        ],
        temperature=0.7
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(chat_with_system())
 ```
 ## Advanced Usage Examples
 ### Using Different Security Tiers
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def security_tiers():
    client = SecureChatCompletion(api_key="your-api-key-here")
    # Standard security
    response1 = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[{"role": "user", "content": "General query"}],
        security_tier="standard"
    )
    # High security for sensitive data
    response2 = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[{"role": "user", "content": "Bank account info"}],
        security_tier="high"
    )
    # Maximum security for classified data
    response3 = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[{"role": "user", "content": "Medical records"}],
        security_tier="maximum"
    )
 asyncio.run(security_tiers())
 ```
 ### Using Tools
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def chat_with_tools():
    client = SecureChatCompletion(api_key="your-api-key-here")
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "What's the weather in Paris?"}
        ],
        tools=[
            {
                "type": "function",
                "function": {
                    "name": "get_weather",
                    "description": "Get weather information",
                    "parameters": {
                        "type": "object",
                        "properties": {
                            "location": {"type": "string"}
                        },
                        "required": ["location"]
                    }
                }
            }
        ],
        temperature=0.7
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(chat_with_tools())
 ```
 ### Error Handling
 ```python
 import asyncio
 from nomyo import SecureChatCompletion, AuthenticationError, InvalidRequestError
 async def error_handling():
    client = SecureChatCompletion(api_key="your-api-key-here")
    try:
        response = await client.create(
            model="Qwen/Qwen3-0.6B",
            messages=[{"role": "user", "content": "Hello"}]
        )
        print(response['choices'][0]['message']['content'])
    except AuthenticationError as e:
        print(f"Authentication failed: {e}")
    except InvalidRequestError as e:
        print(f"Invalid request: {e}")
    except Exception as e:
        print(f"Other error: {e}")
 asyncio.run(error_handling())
 ```
 ### Custom Base URL
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def custom_base_url():
    # For local development
    client = SecureChatCompletion(
        base_url="https://NOMYO-PRO-ROUTER:12435",
        allow_http=True
    )
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[{"role": "user", "content": "Hello"}]
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(custom_base_url())
 ```
 ### API Key Authentication
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def api_key_auth():
    # Initialize with API key
    client = SecureChatCompletion(
        api_key="your-api-key-here"
    )
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[{"role": "user", "content": "Hello"}]
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(api_key_auth())
 ```
 ## Real-World Scenarios
 ### Chat Application with History
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 class SecureChatApp:
    def __init__(self):
        self.client = SecureChatCompletion(api_key="your-api-key-here")
        self.conversation_history = []
    async def chat(self, message):
        # Add user message to history
        self.conversation_history.append({"role": "user", "content": message})
        # Get response from the model
        response = await self.client.create(
            model="Qwen/Qwen3-0.6B",
            messages=self.conversation_history,
            temperature=0.7
        )
        # Add assistant response to history
        assistant_message = response['choices'][0]['message']
        self.conversation_history.append(assistant_message)
        return assistant_message['content']
 async def main():
    app = SecureChatApp()
    # First message
    response1 = await app.chat("Hello, what's your name?")
    print(f"Assistant: {response1}")
    # Second message
    response2 = await app.chat("Can you tell me about secure chat clients?")
    print(f"Assistant: {response2}")
 asyncio.run(main())
 ```
 ### Data Processing with Tools
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def data_processing():
    client = SecureChatCompletion(api_key="your-api-key-here")
    # Process data with tool calling
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "Process this data: 100, 200, 300, 400"}
        ],
        tools=[
            {
                "type": "function",
                "function": {
                    "name": "calculate_statistics",
                    "description": "Calculate statistical measures",
                    "parameters": {
                        "type": "object",
                        "properties": {
                            "data": {"type": "array", "items": {"type": "number"}}
                        },
                        "required": ["data"]
                    }
                }
            }
        ]
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(data_processing())
 ```
 ### Batch Processing
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def batch_processing():
    client = SecureChatCompletion(api_key="your-api-key-here")
    # Process multiple queries concurrently
    tasks = []
    queries = [
        "What is the weather today?",
        "Tell me about Python programming",
        "How to learn machine learning?"
    ]
    for query in queries:
        task = client.create(
            model="Qwen/Qwen3-0.6B",
            messages=[{"role": "user", "content": query}],
            temperature=0.7
        )
        tasks.append(task)
    # Execute all queries in parallel
    responses = await asyncio.gather(*tasks)
    for i, response in enumerate(responses):
        print(f"Query {i+1}: {response['choices'][0]['message']['content'][:100]}...")
 asyncio.run(batch_processing())
 ```
 ## Configuration Examples
 ### Custom Client Configuration
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def custom_config():
    # Create a client with custom configuration
    client = SecureChatCompletion(
        allow_http=False,  # Force HTTPS
        api_key="your-api-key",
        secure_memory=True  # Explicitly enable secure memory protection (default)
    )
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[{"role": "user", "content": "Hello"}],
        temperature=0.7
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(custom_config())
 ```
 ### Environment-Based Configuration (strongly recommended)
 ```python
 import asyncio
 import os
 from nomyo import SecureChatCompletion
 async def env_config():
    # Load configuration from environment variables
    api_key = os.getenv('NOMYO_API_KEY')
    client = SecureChatCompletion(
        api_key=api_key
    )
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[{"role": "user", "content": "Hello"}]
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(env_config())
 ```
 ## 
--- a/doc/getting-started.md
+++ b/doc/getting-started.md
@ -0,0 +1,212 @@
 # Getting Started
 ## Basic Usage
 The NOMYO client provides end-to-end encryption (E2E) for all communications between your application and the NOMYO inference endpoints. This ensures that your prompts and responses are protected from unauthorized access or interception.
 The NOMYO client provides the same interface as OpenAI's ChatCompletion API, making it easy to integrate into existing code.
 The encryption and decryption process is causing overhead, thus inference speed will be lower compared to unencrypted inference. Using high and maximum security_tiers in the client request will add additional latency to the round-trip-time, but guarantees highest confidential use cases.
 To minimize en-/decryption overhead the API is **none**-streaming. OpenAI API compatibily allows to set streaming=True in the request, but this will be ignored on the server side to allow maximum response token generation.
 ### Simple Chat Completion
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def main():
    # Initialize client
    client = SecureChatCompletion(api_key="your-api-key-here")
    # Simple chat completion
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "Hello! How are you today?"}
        ],
        temperature=0.7
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(main())
 ```
 ### With System Messages
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def main():
    client = SecureChatCompletion(api_key="your-api-key-here")
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "What is the capital of France?"}
        ],
        temperature=0.7
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(main())
 ```
 ## API Key Authentication
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def main():
    # Initialize with API key (recommended for production)
    client = SecureChatCompletion(
        api_key="your-api-key-here"
    )
    # Or pass API key in the create() method
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "Hello!"}
        ],
        api_key="your-api-key-here"  # Overrides instance API key
    )
 asyncio.run(main())
 ```
 ## Security Tiers
 The client supports different security tiers for controlling data protection levels:
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def main():
    client = SecureChatCompletion(api_key="your-api-key-here")
    # Standard security tier (default)
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "Hello!"}
        ],
        security_tier="standard"
    )
    # High security tier for sensitive data
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "What's my bank account balance?"}
        ],
        security_tier="high" #enforces secure tokenizer
    )
    # Maximum security tier for classified data
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "Share my personal medical records"}
        ],
        security_tier="maximum" #HIPAA PHI compliance or other confidential use cases
    )
 asyncio.run(main())
 ```
 ## Using Tools
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def main():
    client = SecureChatCompletion(api_key="your-api-key-here")
    response = await client.create(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "What's the weather in Paris?"}
        ],
        tools=[
            {
                "type": "function",
                "function": {
                    "name": "get_weather",
                    "description": "Get weather information",
                    "parameters": {
                        "type": "object",
                        "properties": {
                            "location": {"type": "string"}
                        },
                        "required": ["location"]
                    }
                }
            }
        ],
        temperature=0.7
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(main())
 ```
 ## Async Alias
 The client also provides an `acreate` async alias for convenience:
 ```python
 import asyncio
 from nomyo import SecureChatCompletion
 async def main():
    client = SecureChatCompletion(api_key="your-api-key-here")
    response = await client.acreate(
        model="Qwen/Qwen3-0.6B",
        messages=[
            {"role": "user", "content": "Hello!"}
        ],
        temperature=0.7
    )
    print(response['choices'][0]['message']['content'])
 asyncio.run(main())
 ```
 ## Error Handling
 ```python
 import asyncio
 from nomyo import SecureChatCompletion, AuthenticationError, InvalidRequestError
 async def main():
    client = SecureChatCompletion(base_url="https://api.nomyo.ai:12434")
    try:
        response = await client.create(
            model="Qwen/Qwen3-0.6B",
            messages=[
                {"role": "user", "content": "Hello!"}
            ]
        )
        print(response['choices'][0]['message']['content'])
    except AuthenticationError as e:
        print(f"Authentication failed: {e}")
    except InvalidRequestError as e:
        print(f"Invalid request: {e}")
    except Exception as e:
        print(f"Other error: {e}")
 asyncio.run(main())
 ```
--- a/doc/installation.md
+++ b/doc/installation.md
@ -0,0 +1,74 @@
 # Installation Guide
 ## Prerequisites
 - Python 3.7 or higher
 - pip (Python package installer)
 ## Installation
 ### Install from PyPI (recommended)
 ```bash
 pip install nomyo
 ```
 ### Install from source
 ```bash
 # Clone the repository
 git clone https://github.com/nomyo-ai/nomyo.git
 cd nomyo
 # Install dependencies
 pip install -r requirements.txt
 # Install the package
 pip install -e .
 ```
 ## Dependencies
 The NOMYO client requires the following dependencies:
 - `cryptography` - Cryptographic primitives (RSA, AES, etc.)
 - `httpx` - Async HTTP client
 - `anyio` - Async compatibility layer
 These are automatically installed when you install the package via pip.
 ## Virtual Environment (Recommended)
 It's recommended to use a virtual environment to avoid conflicts with other Python packages:
 ```bash
 # Create virtual environment
 python -m venv nomyo_env
 # Activate virtual environment
 source nomyo_env/bin/activate  # On Linux/Mac
 # or
 nomyo_env\Scripts\activate     # On Windows
 # Install nomyo
 pip install nomyo
 ```
 ## Verify Installation
 To verify the installation worked correctly:
 ```python
 import nomyo
 print("NOMYO client installed successfully!")
 ```
 ## Development Installation
 For development purposes, you can install the package in development mode:
 ```bash
 pip install -e .[dev]
 ```
 This will install additional development dependencies.
--- a/doc/security-guide.md
+++ b/doc/security-guide.md
@ -0,0 +1,198 @@
 # Security Guide
 ## Overview
 The NOMYO client provides end-to-end encryption for all communications between your application and the NOMYO inference endpoints. This ensures that your prompts and responses are protected from unauthorized access or interception.
 ## Encryption Mechanism
 ### Hybrid Encryption
 The client uses a hybrid encryption approach combining:
 1. **AES-256-GCM** for payload encryption (authenticated encryption)
 2. **RSA-OAEP** for key exchange (4096-bit keys)
 This provides both performance (AES for data) and security (RSA for key exchange).
 ### Key Management
 #### Automatic Key Generation
 Keys are automatically generated in memory on first use/session init. The client handles all key management internally.
 #### Key Persistence (optional)
 Keys *can* be saved to the `client_keys/` directory for reuse (i.e. in dev scenarios) across sessions [not recommend]:
 ```python
 # Generate keys and save to file
 await client.generate_keys(save_to_file=True, password="your-password")
 ```
 #### Password Protection
 Saved private keys should be password-protected in all environments:
 ```python
 await client.generate_keys(save_to_file=True, password="your-strong-password")
 ```
 ## Secure Memory Protection
 ### Ephemeral AES Keys
 - **Per-request encryption keys**: A unique AES-256 key is generated for each request
 - **Automatic rotation**: AES keys are never reused - a fresh key is created for every encryption operation
 - **Forward secrecy**: Compromise of one AES key only affects that single request
 - **Secure generation**: AES keys are generated using cryptographically secure random number generation (`secrets.token_bytes`)
 - **Automatic cleanup**: AES keys are zeroed from memory immediately after use
 ### Memory Protection
 The client can use secure memory protection to:
 - Prevent plaintext payloads from being swapped to disk
 - Guarantee memory is zeroed after encryption
 - Prevent sensitive data from being stored in memory dumps
 ## Security Best Practices
 ### For Production Use
 1. **Always use password protection** for private keys
 2. **Keep private keys secure** (permissions set to 600 - owner-only access)
 3. **Never share your private key**
 4. **Verify server's public key fingerprint** before first use
 5. **Use HTTPS connections** (never allow HTTP in production)
 ### Key Management
 ```python
 # Generate keys with password protection
 await client.generate_keys(
    save_to_file=True,
    key_dir="client_keys",
    password="strong-password-here"
 )
 # Load existing keys with password
 await client.load_keys(
    "client_keys/private_key.pem",
    "client_keys/public_key.pem",
    password="strong-password-here"
 )
 ```
 ### Security Tiers
 The client supports three security tiers:
 - **Standard**: General secure inference
 - **High**: Sensitive business data
 - **Maximum**: Maximum isolation (HIPAA PHI, classified data)
 ```python
 # Use different security tiers
 response = await client.create(
    model="Qwen/Qwen3-0.6B",
    messages=[{"role": "user", "content": "My sensitive data"}],
    security_tier="high"
 )
 ```
 ## Security Features
 ### End-to-End Encryption
 All prompts and responses are automatically encrypted and decrypted, ensuring:
 - No plaintext data is sent over the network
 - No plaintext data is stored in memory
 - No plaintext data is stored on disk
 ### Forward Secrecy
 Each request uses a unique AES key, ensuring that:
 - Compromise of one request's key only affects that request
 - Previous requests remain secure even if current key is compromised
 ### Key Exchange Security
 RSA-OAEP key exchange with 4096-bit keys provides:
 - Strong encryption for key exchange
 - Protection against known attacks
 - Forward secrecy for key material
 ### Memory Protection
 Secure memory features:
 - Prevents plaintext from being swapped to disk
 - Guarantees zeroing of sensitive memory
 - Prevents memory dumps from containing sensitive data
 ## Compliance Considerations
 ### HIPAA Compliance
 The client can be used for HIPAA-compliant applications when:
 - Keys are password-protected
 - HTTPS is used for all connections
 - Private keys are stored securely
 - Appropriate security measures are in place
 ### Data Classification
 - **Standard**: General data
 - **High**: Sensitive business data
 - **Maximum**: Classified data (PHI, PII, etc.)
 ## Security Testing
 The client includes comprehensive security testing:
 - All encryption/decryption operations are tested
 - Key management is verified
 - Memory protection is validated
 - Error handling is tested
 Run the test suite to verify security:
 ```bash
 python3 test.py
 ```
 ## Troubleshooting Security Issues
 ### Common Issues
 1. **Key loading failures**: Ensure private key file permissions are correct (600)
 2. **Connection errors**: Verify HTTPS is used for production
 3. **Decryption failures**: Check that the correct API key is used
 4. **Memory protection errors**: SecureMemory module may not be available on all systems
 ### Debugging
 The client adds metadata to responses that can help with debugging:
 ```python
 response = await client.create(
    model="Qwen/Qwen3-0.6B",
    messages=[{"role": "user", "content": "Hello"}]
 )
 print(response["_metadata"])  # Contains security-related information
 ```
 ### Logging
 Enable logging to see security operations:
 ```python
 import logging
 logging.basicConfig(level=logging.DEBUG)
 ```
--- a/pyproject.toml
+++ b/pyproject.toml
@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 [project]
 name = "nomyo"
-version = "0.1.0"
+version = "0.1.1"
 description = "OpenAI-compatible secure chat client with end-to-end encryption for NOMYO Inference Endpoints"
 authors = [
    {name = "NOMYO.AI", email = "ichi@nomyo.ai"},
@ -44,7 +44,7 @@ dependencies = [
 [project.urls]
 Homepage = "https://nomyo.ai"
-Documentation = "https://nomyo.ai/nomyo-docs"
+Documentation = "https://github.com/nomyo-ai/nomyo/doc"
 Repository = "https://github.com/nomyo-ai/nomyo"
 Issues = "https://github.com/nomyo-ai/nomyo/issues"