01. Introduction

Weighting:

Goal: to recognize objects and their motions.

Signal processing typically works on 1D data; computer vision works on 2D data.

Image processing works on still images; computer vision works on both video and still images.
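As a rough illustration of the distinction above, the sketch below represents a 1D signal, a 2D still image, and a short video as nested Python lists, then compares two frames to show the kind of motion information that only video carries. The sample values and the helper names `shape` and `changed_pixels` are made up for this example and do not come from the course material.

```python
# Illustrative data only; all sample values are invented.

def shape(data):
    """Return the nested-list dimensions of `data` as a tuple."""
    dims = []
    while isinstance(data, list):
        dims.append(len(data))
        data = data[0]
    return tuple(dims)

# 1D signal: one sample per time step.
signal = [0, 3, 5, 3, 0]

# 2D still image: rows x columns of grey-level intensities.
frame_a = [
    [0, 0, 0, 0],
    [0, 9, 0, 0],
    [0, 0, 0, 0],
]

# The same scene one time step later: the bright pixel moved one column right.
frame_b = [
    [0, 0, 0, 0],
    [0, 0, 9, 0],
    [0, 0, 0, 0],
]

# Video: a sequence of frames, i.e. time x rows x columns.
video = [frame_a, frame_b]

def changed_pixels(prev, curr):
    """List the (row, column) positions whose intensity differs between frames."""
    return [
        (r, c)
        for r, row in enumerate(curr)
        for c, value in enumerate(row)
        if value != prev[r][c]
    ]

print(shape(signal))                     # (5,)
print(shape(frame_a))                    # (3, 4)
print(shape(video))                      # (2, 3, 4)
print(changed_pixels(frame_a, frame_b))  # [(1, 1), (1, 2)]
```

Comparing frames pixel by pixel is only the crudest possible motion cue; it is shown here to make the extra time dimension concrete, not as a recommended method.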

Difficulties

The sensory gap: the gap between reality and what a recording of the scene can capture.

The semantic gap: lack of contextual knowledge about a scene.

The human visual system is very good:

Recovering 3D information

Several cues available:

Or actual depth hardware:

Labs: Intel RealSense D435.

Processing

The higher the level of processing, the less generic it is and the more domain-specific knowledge it requires.

Low-Level Image Processing:

Approaches