feat: add audio generation support for multiple providers

mirror of https://github.com/xtekky/gpt4free.git synced 2025-12-06 02:30:41 -08:00

- Added new examples for `client.media.generate` with `PollinationsAI`, `EdgeTTS`, and `Gemini` in `docs/media.md`
- Modified `PollinationsAI.py` to default to `default_audio_model` when audio data is present
- Adjusted `PollinationsAI.py` to conditionally construct message list from `prompt` when media is being generated
- Rearranged `PollinationsAI.py` response handling to yield `save_response_media` after checking for non-JSON content types
- Added support in `EdgeTTS.py` to use default values for `language`, `locale`, and `format` from class attributes
- Improved voice selection logic in `EdgeTTS.py` to fall back to the default locale or language when not explicitly provided
- Updated `EdgeTTS.py` to yield `AudioResponse` with `text` field included
- Modified `Gemini.py` to support `.ogx` audio generation when `model == "gemini-audio"` or `audio` is passed
- Used `format_image_prompt` in `Gemini.py` to create audio prompt and saved audio file using `synthesize`
- Appended `AudioResponse` to `Gemini.py` for audio generation flow
- Added `save()` method to `Image` class in `stubs.py` to support saving `/media/` files locally
- Changed `client/__init__.py` to fall back to `options["text"]` if `alt` is missing in `Images.create`
- Ensured `AudioResponse` in `copy_images.py` includes the `text` (prompt) field
- Added `Annotated` fallback definition in `api/__init__.py` for compatibility with older Python versions
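
The `EdgeTTS.py` fallback described in the list above can be pictured roughly as follows. This is a minimal sketch, not the provider's actual code: `pick_voice`, the `DEFAULT_LOCALE` and `DEFAULT_LANGUAGE` constants, and the `Name`/`Locale` dictionary keys are hypothetical stand-ins, and the order of the fallback steps is an assumption. Only the idea of using class-level defaults when nothing is passed explicitly comes from the commit message.

```python
from typing import Optional

# Hypothetical defaults. The commit says EdgeTTS reads such values from
# class attributes; these names and values are assumptions for illustration.
DEFAULT_LOCALE = "en-US"
DEFAULT_LANGUAGE = "en"


def pick_voice(
    voices: list[dict],
    voice: Optional[str] = None,
    locale: Optional[str] = None,
    language: Optional[str] = None,
) -> str:
    """Return a voice name, falling back to defaults when nothing is given."""
    if voice is not None:
        return voice  # An explicit voice always wins.
    if locale is None and language is None:
        # Nothing was requested, so start from the default locale.
        locale = DEFAULT_LOCALE
    if locale is not None:
        for item in voices:
            if item["Locale"] == locale:
                return item["Name"]
    # No locale match: compare the language prefix, e.g. "de" in "de-DE".
    wanted = language or DEFAULT_LANGUAGE
    for item in voices:
        if item["Locale"].split("-")[0] == wanted:
            return item["Name"]
    raise ValueError(f"No voice for locale={locale!r}, language={language!r}")
```

Keeping the explicit `voice` check first means callers who already know the voice name never depend on the contents of the voice list.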

Author: hlohaus

Date: 2025-04-19 06:23:46 +02:00

Parent: 2f46008228

Commit: b68b9ff6be

8 changed files with 90 additions and 34 deletions

									
g4f/image/copy_images.py (1 addition, 1 deletion)
    @@ -70,7 +70,7 @@ async def save_response_media(response: StreamResponse, prompt: str, tags: list[
         if response.method == "GET":
             media_url = f"{media_url}?url={str(response.url)}"
         if content_type.startswith("audio/"):
    -        yield AudioResponse(media_url)
    +        yield AudioResponse(media_url, text=prompt)
         elif content_type.startswith("video/"):
             yield VideoResponse(media_url, prompt)
         else:
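
The effect of this hunk can be illustrated in isolation. The sketch below is an assumption-laden stand-in, not the library's code: the two dataclasses only mimic the fields used in the hunk, `wrap_media` is a hypothetical helper, and the final branch is a placeholder because the hunk does not show what follows `else:`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AudioResponse:
    # Stand-in for the library class; only the fields used in the hunk.
    url: str
    text: Optional[str] = None


@dataclass
class VideoResponse:
    # Stand-in for the library class.
    url: str
    prompt: str


def wrap_media(content_type: str, media_url: str, prompt: str):
    """Mirror the content-type branch from the hunk (hypothetical helper)."""
    if content_type.startswith("audio/"):
        # After the change, the prompt travels with the audio response.
        return AudioResponse(media_url, text=prompt)
    elif content_type.startswith("video/"):
        return VideoResponse(media_url, prompt)
    # Placeholder: the remaining branch is not visible in the hunk.
    return None
```

With the prompt attached as `text`, a consumer of the audio response can show or store what was spoken without tracking the prompt separately.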
