-
Notifications
You must be signed in to change notification settings - Fork 2
feat(gemini): Phase 4 -- vision / multimodal (inlineData) #1596
Copy link
Copy link
Closed
Labels
enhancementNew feature or requestNew feature or request
Description
Part of #1592
Scope
Wire up image input via Gemini's inlineData parts format.
Files to Modify
crates/zeph-llm/src/gemini.rs-- convertMessagePart::ImagetoinlineDatapart in message builder
Key Implementation Details
- Images sent as
inlineDataparts withincontents[].parts[]:{ "inlineData": { "mimeType": "image/jpeg", "data": "base64..." } } - All Gemini 2.0+ models support vision natively (no separate vision model needed)
- Multiple images per message supported
- Mixed text + image parts in single message supported
- Zeph's
ImageData { data: Vec<u8>, mime_type: String }maps directly
Acceptance Criteria
-
supports_vision()returnstrue(already set in Phase 1) -
MessagePart::Imagecorrectly converted toinlineDataformat - Multiple images in single message work
- Mixed text + image parts produce correct parts array
- Unit tests with mock image payloads
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request