
Conversation

Kludex (Member) commented Mar 15, 2025

There's still a lot to do and decide... It's still not type safe, and it can't use message_history properly.

The main.py in the files already works, though.

(screenshot attached)

@Kludex Kludex marked this pull request as draft March 15, 2025 13:20
github-actions commented

Docs Preview

commit: e8ba35b
Preview URL: https://6c9c1503-pydantic-ai-previews.pydantic.workers.dev

DouweM (Collaborator) commented Apr 30, 2025

@Kludex Are you planning to work on this or are we better off closing it for now?

Kludex (Member, Author) commented Apr 30, 2025

This is still on my radar; I'd prefer to keep it open.

ollz272 commented Jun 30, 2025

Hi, I would love access to this feature. Is there an ETA?

lshamis commented Jul 7, 2025

I think something like this will be necessary sooner rather than later. Many models can, or soon will, generate interleaved multimodal content.

Slightly philosophical question, but why are the output types of an LLM different from those of ToolCall?

DouweM (Collaborator) commented Jul 7, 2025

Slightly philosophical question, but why are the output types of an LLM different from those of ToolCall?

@lshamis Because the types of data LLMs support as input (whether via the user prompt or as a tool call result) are not the same as the types of data they can output. For example, all models support text input and text output, and many support image, video, audio, and document input, but only a handful support image output, and as far as I know none can output e.g. PDF files. So there's necessarily a difference between the types of things we allow tools to output (anything that can be sent back to the model as input) and what models themselves can output.
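
The asymmetry described above (a model's supported input modalities are typically a superset of its output modalities) can be sketched with a toy capability table. The model names and capability sets below are purely illustrative assumptions, not pydantic-ai's actual API or real model data:

```python
# Toy sketch of the input/output modality asymmetry.
# Model names and capability sets here are hypothetical examples.

# What each illustrative model accepts as input vs. can emit as output.
CAPABILITIES: dict[str, tuple[set[str], set[str]]] = {
    "text-only-model": ({"text"}, {"text"}),
    "multimodal-in-model": ({"text", "image", "audio", "document"}, {"text"}),
    "image-gen-model": ({"text", "image"}, {"text", "image"}),
}


def allowed_tool_return_types(model: str) -> set[str]:
    """A tool result is sent back to the model as input, so any
    supported *input* modality is a valid tool return type."""
    inputs, _ = CAPABILITIES[model]
    return inputs


def allowed_output_types(model: str) -> set[str]:
    """What the model itself can produce is a (usually smaller) set."""
    _, outputs = CAPABILITIES[model]
    return outputs


for name, (inputs, outputs) in CAPABILITIES.items():
    # Every model here outputs only a subset of what it accepts as input,
    # which is why tool return types and model output types differ.
    assert outputs <= inputs
```

This is just a way of stating DouweM's point in code: validating a tool's return type against the input set and a model's own output against the (smaller) output set are necessarily two different checks.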

dorukgezici (Contributor) commented

Hey @Kludex and @DouweM, I would be happy to contribute to this, as our startup is completely focused on media generation (images and video). I would appreciate some guidance, though, since this seems like a core overhaul. We currently have custom tools implemented that use the native Google and OpenAI clients.

@DouweM DouweM assigned DouweM and unassigned Kludex Sep 18, 2025
DouweM (Collaborator) commented Sep 18, 2025

@dorukgezici Much appreciated! However, having just spent some time looking into https://platform.openai.com/docs/guides/tools-image-generation and https://ai.google.dev/gemini-api/docs/image-generation, I think this would take me a few hours to implement in a clean way, and someone less familiar with our architecture a lot longer than that to get it working and then get it through the review cycle 😅 So I'm going to have a crack at this tomorrow or on my flight to our team offsite on Sunday :)

DouweM (Collaborator) commented Sep 19, 2025

Closing in favor of #2970

@DouweM DouweM closed this Sep 19, 2025
@Viicos Viicos deleted the playing-with-gemini-images-output branch November 19, 2025 19:21

Development

Successfully merging this pull request may close these issues.

Support for model Gemini Flash 2.0 Image Generation
