One of the most compelling capabilities of AI is how it interprets visual information — encompassing computer vision, object and label detection, optical character recognition, and contextual image understanding, all working together in real time.
One of the most compelling capabilities of AI is how it interprets visual information — encompassing computer vision, object and label detection, optical character recognition, and contextual image understanding, all working together in real time.
CNNs are a type of deep learning algorithm designed to process and analyse visual data. They use layers of convolutional filters to scan images, detecting features like edges, textures, and shapes. These features are combined to recognise complex patterns and classify image content.
CNNs are the backbone of modern computer vision systems, trained on large labeled datasets using supervised learning.
Microsoft’s Seeing AI app
Microsoft’s Seeing AI app uses computer vision to help blind or low-vision users navigate the world — reading text aloud, recognising faces and emotions, and describing scenes using the same underlying vision APIs. This shows how AI can create deeply human-centered applications that extend independence and dignity.
Google Cloud Vision Drag-and-Drop
Microsoft Azure Computer Vision Playground
Tips
Don't just read—experiment. Browse our hands-on experiments designed to help you apply what you've learned.
Browse Experiments