Azure Vision is a powerful tool within Foundry Tools (formerly Azure AI Services). Azure Vision provides a set of prebuilt APIs that enable applications and agents to visually interpret the world. Vision provides capabilities such as image analysis, object detection, spatial understanding, and optical character recognition (OCR).
Computer vision enables machines to interpret, analyze, and pull meaningful data from images and videos, replicating human sight and cognitive abilities. This AI technology uses deep learning and neural networks to recognize objects, people, and patterns with high degrees of accuracy. Computer vision in AI has many real-world applications, including medical imaging, face recognition, defect ...
The custom vision API from Microsoft Azure learns to recognize specific content in imagery and becomes smarter with training and time. Read on to learn more.
Foundry Tools are Microsoft’s suite of prebuilt AI capabilities—Vision, Speech, Language, Translator, Content Understanding, and Document Intelligence—designed to help organizations quickly add advanced AI features to their apps and agents. These tools are fully managed, scalable, and part of the unified Microsoft Foundry platform, making it easier to innovate and transform business ...
Foundry Tools offers many pricing options for the Computer Vision API. Choose between free and standard pricing categories to get started.
Pricing details for Custom Vision Service from Azure AI Services. Customize state-of-the-art computer vision models for your unique use case.
Foundry Models is a hub for discovering foundation models. The catalog includes some of the most popular large language and vision foundation models curated by Microsoft, OpenAI, Anthropic Claude, DeepSeek, xAI, Hugging Face, Meta, Mistral AI, Cohere, Deci, Stability AI, Nixtla, and NVIDIA. These models are packaged for out-of-the-box implementation and optimized for use in Foundry.