Release

April 16, 20251 Minute Read

Using vision input in Copilot Chat with Claude and Gemini is now in public preview

You now have more choices when chatting with Copilot about images in VS Code, Visual Studio, and on the immersive mode on github.com. Starting today, you can use the vision capability with the Claude Sonnet 3.5, Claude Sonnet 3.7, Gemini 2.0 Flash, Gemini 2.5 Pro, and GPT-4o models.

Some ideas to get you started:

  • Add screenshots of errors with Copilot to have it interpret the image and suggest solutions for the issue.
  • Share mockups of new designs, and Vision will help you bring them to life.
  • Ask questions about architecture diagrams.

Currently, the supported image types are JPEG/JPG, PNG, GIF, and WEBP.

When using Vision on VS Code and Visual Studio, make sure you have the Copilot Editor Preview Features policy enabled to get access. On github.com, get started simply by selecting a Claude or Gemini model from the model picker.

This feature was previously only available for GPT-4o in VS Code and Visual Studio and on github.com.

https://github.blog/wp-content/uploads/2025/04/433574313-ac8ab067-e359-4868-a6b4-0f4d1560e07f.mp4#t=0.001

To learn more, read the documentation about using Vision in Copilot Chat.

Please share your feedback in our community discussions.

Subscribe to our developer newsletter

Discover tips, technical guides, and best practices in our biweekly newsletter just for devs.

By submitting, I agree to let GitHub and its affiliates use my information for personalized communications, targeted advertising, and campaign effectiveness. See the GitHub Privacy Statement for more details.

Using vision input in Copilot Chat with Claude and Gemini is now in public preview - GitHub Changelog