gpt4free/docs
hlohaus b68b9ff6be feat: add audio generation support for multiple providers
- Added new examples for `client.media.generate` with `PollinationsAI`, `EdgeTTS`, and `Gemini` in `docs/media.md`
- Modified `PollinationsAI.py` to default to `default_audio_model` when audio data is present
- Adjusted `PollinationsAI.py` to conditionally construct message list from `prompt` when media is being generated
- Rearranged `PollinationsAI.py` response handling to yield `save_response_media` after checking for non-JSON content types
- Added support in `EdgeTTS.py` to use default values for `language`, `locale`, and `format` from class attributes
- Improved voice selection logic in `EdgeTTS.py` to fallback to default locale or language when not explicitly provided
- Updated `EdgeTTS.py` to yield `AudioResponse` with `text` field included
- Modified `Gemini.py` to support `.ogx` audio generation when `model == "gemini-audio"` or `audio` is passed
- Used `format_image_prompt` in `Gemini.py` to create audio prompt and saved audio file using `synthesize`
- Appended `AudioResponse` to `Gemini.py` for audio generation flow
- Added `save()` method to `Image` class in `stubs.py` to support saving `/media/` files locally
- Changed `client/__init__.py` to fallback to `options["text"]` if `alt` is missing in `Images.create`
- Ensured `AudioResponse` in `copy_images.py` includes the `text` (prompt) field
- Added `Annotated` fallback definition in `api/__init__.py` for compatibility with older Python versions
2025-04-19 06:23:46 +02:00
..
guides Added missing periods. 2024-05-03 11:06:01 -04:00
images Add Authentication Setup Guide 2024-12-07 03:01:43 +01:00
async_client.md Add example for video generation 2025-03-27 09:38:31 +01:00
authentication.md Code review changes 2025-03-30 17:57:42 +02:00
client.md Adding New Models and Enhancing Provider Functionality (#2689) 2025-02-07 13:54:00 +01:00
configuration.md Update model configurations, provider implementations, and documentation (#2577) 2025-01-24 03:47:57 +01:00
docker.md Add nodriver to Gemini provider, 2024-11-17 11:06:37 +01:00
file.md feat: add EdgeTTS audio provider and global image→media refactor 2025-04-19 03:20:57 +02:00
git.md Update (docs/ README.md g4f/client/client.py) 2024-11-12 08:44:48 +02:00
gui.md Support "Think Deeper" in Copilot 2025-03-30 15:40:46 +02:00
interference-api.md Update docs: Using the OpenAI Library 2025-03-22 07:32:30 +01:00
legacy.md Update legacy.md 2025-02-21 09:53:56 +01:00
local.md Updated almost all documentation and added new documentation for the local interface 2024-10-19 20:21:14 +03:00
media.md feat: add audio generation support for multiple providers 2025-04-19 06:23:46 +02:00
providers-and-models.md docs: update providers documentation and enhance support for Blackbox HAR auth 2025-04-04 13:36:28 +03:00
pydantic_ai.md Add ARTA image provider 2025-03-11 02:49:24 +01:00
requests.md Fix invalid escape in requests module 2025-01-06 23:20:29 +01:00
requirements.md Update requirements.md 2025-02-21 09:56:20 +01:00
selecting_a_provider.md Fix generate Images with OpenaiChat 2025-01-24 17:49:07 +01:00
vision.md Add multiple images support (#2478) 2024-12-13 22:20:58 +01:00
webview.md Add Installation Guide for Windows 2024-04-06 10:47:43 +02:00