Vision lets Claude interpret images alongside text, such as screenshots, diagrams, and documents.
You include images in the prompt and Claude can describe, extract, or reason about them. It powers document understanding, UI analysis, and chart reading. It expands prompting and tool use beyond plain text.