VL-JEPA-inspired pipeline: compress images and text locally via Ollama, then send compact payloads to any LLM API. Cuts token costs by ~80%.
python ai computer-vision python3 gemini openai embedding claude multimodal vision-language cost-reduction local-llm ollama llm-pipeline prompt-compression token-optimization selective-decode vl-jepa api-cost
Updated Jun 17, 2026 - Python