
Fix VLM and Chat Documentation Discrepancies #328

Merged
kovtcharov merged 4 commits into main from kalin/fix-vlm-documentation
Feb 10, 2026

Conversation


@kovtcharov kovtcharov commented Feb 9, 2026

Summary

Comprehensive fixes for VLM and RAG documentation discrepancies identified in detailed documentation review.

Changes (6 files)

1. docs/spec/vlm-client.mdx

  • ✅ Update default VLM model: Qwen2.5-VL-7B → Qwen3-VL-4B-Instruct-GGUF (8 occurrences)
  • ✅ Fix timeout: 300s for extraction (was 60s), clarify 60s for loading
  • ✅ Document MIME type auto-detection (PNG, JPEG, GIF, WebP, BMP support)

2. src/gaia/rag/sdk.py (CODE)

  • ✅ Normalize VLM default: Qwen2.5-VL-7B → Qwen3-VL-4B-Instruct-GGUF
  • Ensures consistency with VLMClient and other GAIA components

3. docs/spec/rag-sdk.mdx

  • ✅ Update VLM model reference: Qwen2.5-VL-7B → Qwen3-VL-4B

4. docs/reference/cli.mdx

  • ✅ Fix VLM model in init profiles table: Qwen2.5-VL-7B → Qwen3-VL-4B

5. docs/guides/chat.mdx

  • ✅ Fix ChatConfig assistant_name default: "assistant" → "gaia"
  • ✅ Fix VLM model in PDF indexing note: Qwen2.5-VL-7B → Qwen3-VL-4B

6. src/gaia/vlm/mixin.py (CODE)

  • ✅ Remove invalid Qwen3-VL-8B-Instruct-GGUF example (model doesn't exist)
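For readers curious what "MIME type auto-detection" in change 1 could look like, here is a minimal, hypothetical sketch using magic-byte sniffing for the five formats the docs now list (PNG, JPEG, GIF, WebP, BMP). The function name and fallback behavior are assumptions for illustration; the actual helper lives in src/gaia/llm/vlm_client.py and may differ.

```python
def detect_image_mime(data: bytes) -> str:
    """Guess an image MIME type from the file's leading magic bytes.

    Hypothetical sketch; not the actual GAIA implementation.
    """
    if data.startswith(b"\x89PNG\r\n\x1a\n"):
        return "image/png"
    if data.startswith(b"\xff\xd8\xff"):          # JPEG SOI marker
        return "image/jpeg"
    if data.startswith((b"GIF87a", b"GIF89a")):
        return "image/gif"
    if data.startswith(b"RIFF") and data[8:12] == b"WEBP":
        return "image/webp"
    if data.startswith(b"BM"):                     # BMP "BM" header
        return "image/bmp"
    return "application/octet-stream"              # assumed fallback
```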

Verification

All changes verified against implementation:

  • src/gaia/llm/vlm_client.py:73 - Qwen3-VL-4B default ✓
  • src/gaia/llm/vlm_client.py:253 - 300s timeout ✓
  • src/gaia/llm/vlm_client.py:27-58 - MIME detection ✓
  • src/gaia/chat/sdk.py:39 - "gaia" assistant_name ✓
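The defaults verified above can be summarized in a small sketch. ChatConfig's field name comes from the PR; VLMSettings is a hypothetical grouping invented here purely to show the verified values together (the real defaults are plain values in vlm_client.py, not a dataclass).

```python
from dataclasses import dataclass


@dataclass
class ChatConfig:
    """Sketch of the documented default; other fields omitted."""
    assistant_name: str = "gaia"          # was documented as "assistant"


@dataclass
class VLMSettings:
    """Hypothetical container for the defaults verified in vlm_client.py."""
    model: str = "Qwen3-VL-4B-Instruct-GGUF"  # was Qwen2.5-VL-7B
    load_timeout_s: int = 60                  # model loading
    extraction_timeout_s: int = 300           # extraction (complex forms)
```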

Impact

  • ✅ Consistent VLM model across all GAIA components (Qwen3-VL-4B)
  • ✅ Accurate timeout expectations (5 min for complex forms)
  • ✅ Documented MIME type auto-detection feature
  • ✅ Correct ChatConfig defaults
  • ✅ Removed invalid code examples

Testing

  • Documentation and minor code changes only
  • No breaking changes
  • All updates align with existing implementation

- Update VLM default model to Qwen3-VL-4B-Instruct-GGUF (was Qwen2.5-VL-7B)
- Fix VLM timeout: 300s for extraction, 60s for loading (was 60s for both)
- Fix ChatConfig assistant_name default to "gaia" (was "assistant")
- Remove invalid Qwen3-VL-8B model example

Aligns documentation with implementation in src/gaia/llm/vlm_client.py
and src/gaia/chat/sdk.py
@github-actions github-actions bot added the documentation Documentation changes label Feb 9, 2026
Update RAGConfig default VLM model from Qwen2.5-VL-7B-Instruct-GGUF
to Qwen3-VL-4B-Instruct-GGUF for consistency with VLMClient and other
GAIA components (EMR, SD agents).

This ensures consistent VLM model defaults across the framework.
@github-actions github-actions bot added rag RAG system changes performance Performance-critical changes labels Feb 9, 2026
- Update all Qwen2.5-VL-7B references to Qwen3-VL-4B across docs
- Document VLM MIME type auto-detection (PNG, JPEG, GIF, WebP, BMP)
- Normalize RAG SDK to use Qwen3-VL-4B for consistency

Files updated:
- src/gaia/rag/sdk.py (code)
- docs/spec/rag-sdk.mdx
- docs/reference/cli.mdx
- docs/guides/chat.mdx
- docs/spec/vlm-client.mdx
@kovtcharov kovtcharov added this to the v0.15.4 milestone Feb 9, 2026
@kovtcharov kovtcharov self-assigned this Feb 9, 2026
@kovtcharov kovtcharov force-pushed the kalin/fix-vlm-documentation branch from 73bffff to 0244b48 on February 9, 2026 at 10:05
- Update Qwen3-30B → Qwen3-Coder-30B (full model name)
- Update Qwen2.5-VL → Qwen3-VL-4B

Installer plan now uses correct, current model names.
@kovtcharov kovtcharov force-pushed the kalin/fix-vlm-documentation branch from 0244b48 to f461d5c on February 9, 2026 at 10:06
@kovtcharov kovtcharov enabled auto-merge February 9, 2026 10:07
@kovtcharov kovtcharov added this pull request to the merge queue Feb 9, 2026
Merged via the queue into main with commit 66116fa Feb 10, 2026
51 checks passed
@kovtcharov kovtcharov deleted the kalin/fix-vlm-documentation branch February 10, 2026 00:11