The Digital Eye: Amazon Rekognition

The Digital Eye: Amazon Rekognition

Master vision in the cloud. Explore image and video analysis, face recognition, and the powerful 'Custom Labels' feature.

Seeing the Visible

Amazon Rekognition is a fully managed service that makes it easy to add image and video analysis to your applications. Using deep learning, it can identify objects, people, text, scenes, and activities in both static images and streaming video.

For the AWS Certified AI Practitioner exam, you need to understand the "Specialized Sub-features" of Rekognition and which business problems they solve.


1. Key Features: The Rekognition Toolbox

Object and Scene Detection (Labels)

Rekognition can identify thousands of objects (bikes, cars) and scenes (wedding, beach).

  • Use Case: Automatically tagging millions of photos in a travel app.

Image Moderation

This is a high-demand exam topic. Rekognition can detect "Inappropriate" or "Offensive" content.

  • Use Case: A social media app automatically blocking adult or violent content before it is shown to other users.

Facial Analysis and Face Comparison

Note the difference:

  • Analysis: Detects attributes (Happy, Sad, Wearing glasses, Male/Female).
  • Comparison: Takes two photos and says, "There is a 99% probability this is the same person."
  • Use Case: Amazon Key—allowing delivery people into a building by comparing their face to their ID photo.

Celebrity Recognition

Identifies hundreds of thousands of famous people in sports, politics, and media.

  • Use Case: A news agency automatically tagging photos of politicians in their archive.

Text-in-Image

Reads text appearing on things like street signs, t-shirts, and license plates.

  • Use Case: A smart city app reading license plates to manage parking traffic.

2. Advanced Feature: Custom Labels

As we mentioned in Module 4, what if you need to detect something "Non-Standard"?

  • Standard Rekognition: Detects "A car."
  • Custom Labels: Detects "A 2024 Tesla Model 3 with a specific bumper dent."

You provide a few dozen images of the specific object, and AWS "fine-tunes" its existing vision model for you. You get the benefit of custom ML without the complexity of SageMaker.


3. Working with Video (Streaming and Stored)

Rekognition isn't just for JPEG files. It can process Kinesis Video Streams in real-time.

  • Use Case: A factory monitor detecting if a worker isn't wearing a hard hat and sounding an alarm instantly.
graph LR
    subgraph Input_Sources
    A[S3 Bucket: Photos]
    B[Kinesis: Live Video Feed]
    end
    
    subgraph Rekognition_Engine
    C{Feature Selected}
    C -->|Detect| D[Object Labels]
    C -->|Identify| E[Celebrities]
    C -->|Compare| F[Faces]
    C -->|Filter| G[Content Moderation]
    end
    
    subgraph Actions
    H[Metadata Stored in DB]
    I[Real-time Security Alert]
    J[Automatic Content Block]
    end
    
    A & B --> C
    D --> H
    E --> H
    F --> I
    G --> J

4. Summary: The Strategic Vision

When you see "Image," "Video," "Moderation," or "Identify Celebrity" in an exam question, your brain should immediately click to Amazon Rekognition.

Remember: Rekognition is API-based. You don't have to manage servers. You just pay per image or per minute of video analyzed.


Exercise: Identify the Rekognition Feature

A dating app wants to ensure that all profile pictures are "Work-Appropriate" and that the person's face is clearly visible and smiling. Which two Rekognition features should they use?

  • A. Object Detection and Celebrity Recognition.
  • B. Content Moderation and Facial Analysis.
  • C. Custom Labels and Text-in-Image.
  • D. Face Comparison and Video Analysis.

The Answer is B! Content Moderation checks for "Appropriateness." Facial Analysis checks for "Smiling" and "Visibility."


Knowledge Check

?Knowledge Check

Which feature of Amazon Rekognition should be used to detect if a person in a photo is wearing a Hard Hat and a Safety Vest?

What's Next?

Vision is one thing, but how do we understand the "Meaning" of text? In our next lesson, we dive into Amazon Comprehend for Natural Language Processing.

Subscribe to our newsletter

Get the latest posts delivered right to your inbox.

Subscribe on LinkedIn