Mirror of https://github.com/MODSetter/SurfSense.git (synced 2026-04-26 17:26:23 +02:00)
feat: Added Speech to Text support.
- Supports audio & video files.
- Will be useful for YouTube videos that don't have transcripts.
parent 57987ecc76
commit a8080d2dc7
8 changed files with 172 additions and 73 deletions
@@ -65,6 +65,7 @@ Edit the `.env` file and set the following variables:
| UNSTRUCTURED_API_KEY | API key for Unstructured.io service |
| FIRECRAWL_API_KEY | API key for Firecrawl service (if using crawler) |
| TTS_SERVICE | Text-to-Speech API provider for Podcasts (e.g., `openai/tts-1`, `azure/neural`, `vertex_ai/`). See [supported providers](https://docs.litellm.ai/docs/text_to_speech#supported-providers) |
| STT_SERVICE | Speech-to-Text API provider for Podcasts (e.g., `openai/whisper-1`). See [supported providers](https://docs.litellm.ai/docs/audio_transcription#supported-providers) |
**Important**: Since LLM calls are routed through LiteLLM, include API keys for the LLM providers you're using:
@@ -86,6 +87,8 @@ Edit the `.env` file and set the following variables:
| FAST_LLM_API_BASE | Custom API base URL for the fast LLM |
| STRATEGIC_LLM_API_BASE | Custom API base URL for the strategic LLM |
| LONG_CONTEXT_LLM_API_BASE | Custom API base URL for the long context LLM |
| TTS_SERVICE_API_BASE | Custom API base URL for the Text-to-Speech (TTS) service |
| STT_SERVICE_API_BASE | Custom API base URL for the Speech-to-Text (STT) service |
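
To illustrate how the `STT_SERVICE` and `STT_SERVICE_API_BASE` variables might be consumed, here is a minimal Python sketch. It is not taken from this commit: the helper names `build_stt_kwargs` and `transcribe` are hypothetical, and the sketch assumes the backend forwards these settings to LiteLLM's `transcription` call, with the API base treated as optional.

```python
import os


def build_stt_kwargs(env=None):
    """Collect speech-to-text settings from environment-style variables.

    Hypothetical helper: the variable names come from the README table,
    but the way the application actually reads them is an assumption.
    """
    env = os.environ if env is None else env
    model = env.get("STT_SERVICE")
    if not model:
        raise ValueError("STT_SERVICE is not set")
    kwargs = {"model": model}
    # The custom API base is optional; only pass it when it is configured.
    api_base = env.get("STT_SERVICE_API_BASE")
    if api_base:
        kwargs["api_base"] = api_base
    return kwargs


def transcribe(path, env=None):
    """Transcribe an audio file using the configured provider (sketch)."""
    # Imported lazily so build_stt_kwargs works without LiteLLM installed.
    import litellm

    with open(path, "rb") as audio:
        return litellm.transcription(file=audio, **build_stt_kwargs(env))
```

With `STT_SERVICE=openai/whisper-1` in `.env` and the provider's API key set, calling `transcribe("talk.mp3")` would send the file to that provider. Keeping the settings lookup separate from the network call makes the configuration easy to check on its own.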
### 2. Install Dependencies