Enhancing video object Detection through feature extraction-based video compression

Projects* | 2023 | Links Demo | Source

In this project, we delve into the realm of video object detection to explore the fascinating intersection of video compression and object detection. Our primary objective is to enhance video object detection by using feature enhancing maps for video compression.

Problem Statement

Problem:

Traditional video compression methods are tailored to human vision.
Applications like automated video surveillance rely on computers to analyze the video frames.
Traditional video compression often compromises object recognition precision.

Proposal

Enhancing video object detection by employing feature extraction-based video compression technique. Methodology:

Step 1: Extract features from RAW video frames.
Step 2: Calculate Object Importance maps from features.
Step 3: Devise Bit Allocation strategy from the Object Importance map.
Step 4: Use Bit allocation strategy to encode video frames.

The original image which is being analyzed.

Methodology

Step 1

Pass the frames to a YOLO model, and extract features from CNN Layers of the model.
Features essentially highlight important areas and offer insightful information of the frame.

Step 2

CNN has 32 kernels in its first layer.
Condense the features from each kernel by multiplying the average of each kernel.
Calculate the L2 Normalized value for each kernel and average it.
This is the object importance value for each pixel.

Step 3

Divide the object importance maps into blocks.
Calculate the QP values for each block.
Store the QP values in a file so it can be used for video compression.

Step 4

Pass the QP values into the FFMPEG video compressor

Results

The successful implementation of this project holds transformative potential in the medical field. Rapid and automated age prediction from chest X-rays can significantly enhance diagnostic workflows, aiding healthcare professionals in making informed decisions promptly. This project stands as a testament to the fusion of technology and healthcare for improved patient outcomes.

Study of the results of our video compressor compared to standard video compressor.

Study of the results of our video compressor compared to standard video compressor in terms of PSNR graph.

Conclusion

The proposed methodology showcases a promising approach to enhance object detection accuracy through customized video compression. The integration improved performance compared to traditional methods in terms of:

Higher confidence in predictions
Increased number of true detections
Decreased number of false detections
Better PSNR vs BitRate

Enhancing video object Detection through feature extraction-based video compression

Problem Statement

Proposal