doc: correction

parent b33bb415dd
commit 3d8e5044d6
2 changed files with 31 additions and 26 deletions

README.md (54)
@@ -2,7 +2,6 @@

**Async semantic caching for LLM API calls — reduce costs with one decorator.**

[](https://pypi.org/project/semantic-llm-cache/)
[](LICENSE)
[](https://pypi.org/project/semantic-llm-cache/)
@@ -21,16 +20,17 @@ LLM API calls are expensive and slow. In production applications, **20-40% of pr

## What changed from the original

| Area                 | Original                  | This fork                                                           |
| -------------------- | ------------------------- | ------------------------------------------------------------------- |
| Backends             | sync (`sqlite3`, `redis`) | async (`aiosqlite`, `redis.asyncio`)                                |
| `@cache` decorator   | sync only                 | auto-detects async/sync                                             |
| `EmbeddingCache`     | sync `encode()`           | adds `async aencode()` via `asyncio.to_thread`                      |
| `CacheContext`       | sync only                 | supports both `with` and `async with`                               |
| `CachedLLM`          | `chat()`                  | adds `achat()`                                                      |
| Utility functions    | sync                      | `clear_cache`, `invalidate`, `warm_cache`, `export_cache` all async |
| `StorageBackend` ABC | sync abstract methods     | all abstract methods are `async def`                                |
| Min Python           | 3.9                       | 3.10 (uses `X \| Y` union syntax)                                   |
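The "auto-detects async/sync" behavior can be made concrete with a minimal sketch. This is a toy dict-backed decorator, not the library's implementation: `inspect.iscoroutinefunction` decides whether to return an `async` wrapper or a plain one.

```python
import asyncio
import functools
import inspect

def cache(func=None, *, enabled=True):
    """Toy decorator: picks an async or sync wrapper based on the
    decorated function. Illustrative only — uses a plain dict store."""
    store = {}

    def decorate(fn):
        if inspect.iscoroutinefunction(fn):
            @functools.wraps(fn)
            async def awrapper(*args):
                if enabled and args in store:
                    return store[args]          # cache hit
                store[args] = result = await fn(*args)
                return result
            return awrapper

        @functools.wraps(fn)
        def swrapper(*args):
            if enabled and args in store:
                return store[args]              # cache hit
            store[args] = result = fn(*args)
            return result
        return swrapper

    # Support both @cache and @cache(enabled=...)
    return decorate(func) if func is not None else decorate

@cache
async def ask(prompt: str) -> str:
    return f"answer to {prompt!r}"

print(asyncio.run(ask("hi")))  # → answer to 'hi'; a second call hits the dict
```

The real decorator layers embeddings, TTLs, and backends on top, but the dispatch idea is the same: one decorator, both calling conventions.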
## Installation
@@ -198,14 +198,15 @@ async def my_llm_function(prompt: str) -> str:

### Parameters

| Parameter    | Type          | Default     | Description                                               |
| ------------ | ------------- | ----------- | --------------------------------------------------------- |
| `similarity` | `float`       | `1.0`       | Cosine similarity threshold (1.0 = exact, 0.9 = semantic) |
| `ttl`        | `int \| None` | `3600`      | Time-to-live in seconds (None = never expires)            |
| `backend`    | `Backend`     | `None`      | Storage backend (None = in-memory)                        |
| `namespace`  | `str`         | `"default"` | Isolate different use cases                               |
| `enabled`    | `bool`        | `True`      | Enable/disable caching                                    |
| `key_func`   | `Callable`    | `None`      | Custom cache key function                                 |
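What the `similarity` threshold compares can be sketched with plain cosine similarity over two hypothetical embedding vectors (illustrative values, not the library's internals):

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity: dot(a, b) / (|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))

# Hypothetical embeddings of two near-duplicate prompts.
emb_a = [0.9, 0.1, 0.4]
emb_b = [0.8, 0.2, 0.4]

score = cosine_similarity(emb_a, emb_b)
# At similarity=1.0 only an identical embedding is a hit;
# at similarity=0.9 this pair (score ≈ 0.99) would be served from cache.
print(round(score, 3))
```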

### Utility Functions
@@ -221,19 +222,21 @@ from semantic_llm_cache.stats import (

## Backends

| Backend         | Description                          | I/O                       |
| --------------- | ------------------------------------ | ------------------------- |
| `MemoryBackend` | In-memory LRU (default)              | none — runs in event loop |
| `SQLiteBackend` | Persistent, file-based (`aiosqlite`) | async non-blocking        |
| `RedisBackend`  | Distributed (`redis.asyncio`)        | async non-blocking        |

## Embedding Providers

| Provider                      | Quality                      | Notes                       |
| ----------------------------- | ---------------------------- | --------------------------- |
| `DummyEmbeddingProvider`      | hash-only, no semantic match | zero deps, default          |
| `SentenceTransformerProvider` | high (local model)           | requires `[semantic]` extra |
| `OpenAIEmbeddingProvider`     | high (API)                   | requires `[openai]` extra   |

Embedding inference is offloaded via `asyncio.to_thread` — model loading is blocking and should be done at application startup, not on first request.
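That offload pattern can be sketched with a hypothetical `SlowModel` (not one of the provider classes above): the blocking `encode` runs in a worker thread, and the model is constructed once at import/startup time.

```python
import asyncio
import time

class SlowModel:
    """Hypothetical stand-in for a blocking embedding model."""

    def encode(self, text: str) -> list[float]:
        time.sleep(0.01)                    # simulate blocking inference
        return [float(ord(c)) for c in text[:4]]

# Load once at startup, not on first request — construction blocks too.
MODEL = SlowModel()

async def aencode(text: str) -> list[float]:
    # to_thread keeps the event loop responsive during the blocking call.
    return await asyncio.to_thread(MODEL.encode, text)

print(asyncio.run(aencode("hi")))  # → [104.0, 105.0]
```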
@@ -250,8 +253,9 @@ embedding = await embedding_cache.aencode("my prompt")

## Performance

| Metric                     | Value                                    |
| -------------------------- | ---------------------------------------- |
| Cache hit latency          | <10ms                                    |
| Embedding overhead on miss | ~50ms (sentence-transformers, offloaded) |
| Typical hit rate           | 25-40%                                   |
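A back-of-the-envelope view of what the typical hit-rate range means for spend, using hypothetical request volume and pricing (not measured figures):

```python
# Hypothetical workload: 100,000 LLM calls/day at $0.002 per call.
calls_per_day = 100_000
cost_per_call = 0.002

for hit_rate in (0.25, 0.40):  # the table's typical hit-rate range
    saved = calls_per_day * cost_per_call * hit_rate
    print(f"hit rate {hit_rate:.0%}: ${saved:.2f}/day saved")
# → hit rate 25%: $50.00/day saved
# → hit rate 40%: $80.00/day saved
```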
@@ -20,6 +20,7 @@ keywords = [
    "openai",
    "anthropic",
    "ollama",
    "llama.cpp",
    "prompt",
    "optimization",
    "cost-reduction",