Video Features Extractor

Extracting several kinds of visual representations from videos.

Supported visual features

The following frame-level (*_features) and video-level (*_gloabal) visual representations are supported:

Note: *_sem_* representations are based on the classification level (probability distribution) of respective models.

This package has been tested for extracting visual representations from videos of the following video-caption datasets: