fix(gateway): accept raw utf-8 text in text-load (#729)

Co-authored-by: nanqinhu <139929317+nanqinhu@users.noreply.github.com>
This commit is contained in:
CommitHu502Craft 2026-03-30 23:58:58 +08:00 committed by Cyber MacGeddon
parent 5a9db2da50
commit 7af1d60db8
4 changed files with 91 additions and 9 deletions

View file

@ -8,8 +8,7 @@ required:
properties:
text:
type: string
description: Text content (base64 encoded)
format: byte
description: Text content, either raw text or base64 encoded for compatibility with older clients
example: VGhpcyBpcyB0aGUgZG9jdW1lbnQgdGV4dC4uLg==
id:
type: string

View file

@ -8,7 +8,7 @@ post:
## Text Load Overview
Fire-and-forget document loading:
- **Input**: Text content (base64 encoded)
- **Input**: Text content (raw UTF-8 or base64 encoded)
- **Process**: Chunk, embed, store
- **Output**: None (202 Accepted)
@ -26,7 +26,14 @@ post:
## Text Format
Text must be base64 encoded:
Text may be sent as raw UTF-8 text:
```
{
"text": "Cancer survival: 2.74× higher hazard ratio"
}
```
Older clients may still send base64 encoded text:
```
text_content = "This is the document..."
encoded = base64.b64encode(text_content.encode('utf-8'))
@ -78,12 +85,12 @@ post:
simpleLoad:
summary: Load text document
value:
text: VGhpcyBpcyB0aGUgZG9jdW1lbnQgdGV4dC4uLg==
text: This is the document text...
id: doc-123
user: alice
collection: research
withMetadata:
summary: Load with RDF metadata
summary: Load with RDF metadata using base64 text
value:
text: UXVhbnR1bSBjb21wdXRpbmcgdXNlcyBxdWFudHVtIG1lY2hhbmljcyBwcmluY2lwbGVzLi4u
id: doc-456