The differences between human vision and computer vision and why you need domain randomization

April 24, 2023

Intro

Most companies believe they can go outside, snap some pictures, and train a robust computer vision (CV) model. As autonomous car companies have shown, that is rarely the case. The reason is that computers learn to identify objects differently than humans do.

Humans excel at recognizing familiar objects, faces, and patterns, even when presented in different orientations or with partial information. Our brain's ability to perceive depth and spatial relationships allows us to easily navigate our environment. While computer vision has made significant strides in object and pattern recognition, it often falls short compared to human capabilities, especially in tasks requiring a deep understanding of context or spatial relationships.

I walk my 2-year-old daughter to daycare every day. Along the way, I point out objects we see and name them: “bicycle,” “bus,” “pumpkin,” “Christmas tree,” and so on. It only takes a handful of examples until she catches on and starts to call out the objects herself. After another dozen positive confirmations, her human cognitive abilities really shine.

She identifies a Christmas tree 15 stories up in an apartment window that I never even noticed. She points out a bus turning a corner six blocks away. It doesn’t matter if it’s dawn, dusk, rainy, snowy, foggy, or windy (we live in Chicago, so she has experienced it all and then some). She can still identify objects with an accuracy that continues to impress me.

At present, CV models can’t learn the same way humans do. They must be exposed to a much greater diversity of examples during training, or they can fall prey to scene-object bias, in which a model relies on background information to infer the presence of an object.

Let’s say you are building a CV model to detect yellow school buses. You go outside and collect a ‘diverse’ set of images, such as the ones below:

Human Vision

To the human mind, that seems like a good variety of angles and perspectives of what a school bus could look like. A CV model, however, may learn that a school bus is ‘a yellow rectangle on a black surface.’
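One rough way to spot this risk before training is to tally which backgrounds co-occur with each label. The sketch below assumes each image carries a hypothetical metadata record with `label` and `background` fields; those field names and the sample records are made up for illustration and do not come from any particular tool.

```python
from collections import Counter


def background_share(samples, label):
    """Return the fraction of each background among samples with the given label."""
    backgrounds = [s["background"] for s in samples if s["label"] == label]
    total = len(backgrounds)
    if total == 0:
        return {}
    counts = Counter(backgrounds)
    return {bg: n / total for bg, n in counts.items()}


# Hypothetical metadata for a small, hand-collected photo set.
samples = [
    {"label": "school_bus", "background": "asphalt"},
    {"label": "school_bus", "background": "asphalt"},
    {"label": "school_bus", "background": "asphalt"},
    {"label": "school_bus", "background": "grass"},
    {"label": "car", "background": "asphalt"},
]

shares = background_share(samples, "school_bus")
for background, share in sorted(shares.items()):
    print(f"{background}: {share:.0%}")
```

If one background accounts for most of a label's examples, as asphalt does here, the model has an easy shortcut available: it can associate the surface with the object instead of learning the object itself.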

Why do you need Domain Randomization?

The challenge arises when the model sees school buses in scenarios like the following. It may fail to identify them correctly because they aren’t ‘yellow rectangles on a black surface.’

Data Vision

Bifrost solves this by allowing developers to randomize both objects and backgrounds: buses can be different colors and appear on various surfaces in different lighting and weather conditions.

This technique is called domain randomization, and it is critical to building robust computer vision models. It mitigates bias and helps the model learn what an object looks like regardless of context.
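To make the idea concrete, here is a minimal sketch of the sampling step behind domain randomization. It is not Bifrost's API: `SceneConfig`, `sample_scene`, `sample_dataset`, and the option lists are all hypothetical names chosen for illustration, and the sketch assumes a separate renderer would turn each sampled configuration into a labeled image.

```python
import random
from dataclasses import dataclass

# Hypothetical option lists; a real pipeline would expose many more factors.
BUS_COLORS = ["yellow", "white", "blue", "red", "green"]
SURFACES = ["asphalt", "gravel", "grass", "snow", "dirt"]
LIGHTING = ["dawn", "midday", "dusk", "night"]
WEATHER = ["clear", "rain", "snow", "fog"]


@dataclass(frozen=True)
class SceneConfig:
    """One randomized scene description to hand to a renderer."""
    bus_color: str
    surface: str
    lighting: str
    weather: str
    camera_yaw_deg: float


def sample_scene(rng: random.Random) -> SceneConfig:
    """Draw every scene factor independently so none correlates with the label."""
    return SceneConfig(
        bus_color=rng.choice(BUS_COLORS),
        surface=rng.choice(SURFACES),
        lighting=rng.choice(LIGHTING),
        weather=rng.choice(WEATHER),
        camera_yaw_deg=rng.uniform(0.0, 360.0),
    )


def sample_dataset(n: int, seed: int = 0) -> list:
    """Sample n scene configurations reproducibly from a seeded generator."""
    rng = random.Random(seed)
    return [sample_scene(rng) for _ in range(n)]


scenes = sample_dataset(1000)
print(scenes[0])
```

Because color, surface, lighting, and weather are drawn independently, no single background or color dominates the bus examples, so the model has less opportunity to lean on context as a shortcut.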


Want to learn more about domain randomization and other techniques to build better computer vision models? Reach out at hello@bifrost.ai.

August 25,2023

Similarity and Diversity: The Core Foundations of Robust Computer Vision Models

In the vibrant field of artificial intelligence (AI), computer vision stands out as one of the most...

August 21,2023

The Business Value of Synthetic Data: Accelerating Growth While Reducing Costs

In the contemporary data-driven business landscape, acquiring quality data for machine learning (ML)...

March 21,2023

It's 2022 and Data Labeling Still Sucks

You've heard it before. Labeling data for machine learning sucks. Labeling is laborious, time consum...

March 16,2023

Modern Strategies for Data Curation in Computer Vision

AI systems are extremely powerful. But when they fail, they often mess up spectacularly. Unsurprisin...

March 22,2023

How Your Data Collection Strategy Influences Your AI's Behavior

In this article, we explain how your choice of data collection method influences AI behavior, and li...

August 29,2023

How to Improve your Models Effectively - Beyond mAP as a Metric

By the end of this blog post, you should be able to: - Understand the best practices behind iterati...

March 21,2023

How I Beat The State-of-the-Art in One Week as an Intern

How we managed to beat the state of the art in synthetic-trained aircraft detection within a week us...

March 13,2023

Why Synthetic Data is the Unfair Advantage for AI

In the last decade, we’ve seen AI create brand new industries to solve some of the world’s most crit...

September 07,2023

Selecting the Correct Class Label Ontology

Discover why choosing the right class labels is pivotal for top-tier computer vision models. Learn w...

June 08,2023

Unlocking the Complexities of Synthetic Data: Challenges, Lessons & The Way Forward

Diverse, well-labeled data has become the biggest bottleneck to building computer vision application...

May 23,2023

How to Generate Synthetic 3D Data with Bifrost

Create a Bifrost.ai account and start generating synthetic data today! Contact us at sales@bifrost.a...
