Automatic Video Summarization Using HEVC and CNN Features

Issa, Obada

dc.contributor.advisor	Shanableh, Tamer
dc.contributor.author	Issa, Obada
dc.date.accessioned	2023-03-02T07:48:03Z
dc.date.available	2023-03-02T07:48:03Z
dc.date.issued	2022-11
dc.identifier.other	35.232-2022.58
dc.identifier.uri	http://hdl.handle.net/11073/25177
dc.description	A Master of Science thesis in Computer Engineering by Obada Issa entitled, “Automatic Video Summarization Using HEVC and CNN Features”, submitted in November 2022. Thesis advisor is Dr. Tamer Shanableh. Soft copy is available (Thesis, Completion Certificate, Approval Signatures, and AUS Archives Consent Form).	en_US
dc.description.abstract	With the incredible surge of the internet and surveillance footage, there is a vast number of digital videos. The need to summarize these videos within databases is very crucial. This is where video summarization comes in handy. Video summarization can be achieved by a number of techniques. This study proposes a novel solution for the detection of key-frames for static video summarization. We preprocessed the well-known video datasets by coding them using the HEVC video coding standard. During coding, 64 proposed features were generated from the coder for each frame. Additionally, we extracted RGB frames from the original raw videos and fed them into pre-trained CNN networks for feature extraction. These include GoogleNet, AlexNet, Inception-ResNet-v2, and VGG16. The modified datasets are made publicly available to the research community. A subset of the proposed HEVC feature set was used to identify duplicate or similar frames and eliminate them from the video. We also propose an elimination solution based on the sum of the absolute differences between a frame and its motion-compensated predecessor. The proposed solutions are compared with existing works based on an SIFT flow algorithm that uses CNN features. Subsequently, an optional dimensionality reduction based on stepwise regression was applied to the feature vectors prior to detecting key-frames. The proposed solution is compared with existing studies that use sparse autoencoders with CNN features for dimensionality reduction. The accuracy of the proposed key-frame detection system was assessed using the Positive Predictive Values, Sensitivity, and F-score metrics. Combining the proposed solution with Multi-CNN features and using a Random Forests classifier, it was shown that the proposed solution achieved an average F-score of 0.98.	en_US
dc.description.sponsorship	College of Engineering	en_US
dc.description.sponsorship	Department of Computer Science and Engineering	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartofseries	Master of Science in Computer Engineering (MSCoE)	en_US
dc.subject	Static Video Summarization	en_US
dc.subject	Convolution Neural Networks (CNN)	en_US
dc.subject	Duplicate Frames	en_US
dc.subject	Sparse Autoencoders	en_US
dc.subject	Random Forests classifier	en_US
dc.subject	Video Coding	en_US
dc.subject	High Efficiency Video Codec (HEVC)	en_US
dc.subject	Motion estimation	en_US
dc.subject	Motion compensation	en_US
dc.subject	Stepwise regression	en_US
dc.title	Automatic Video Summarization Using HEVC and CNN Features	en_US
dc.type	Thesis	en_US

Files in this item

Name:: 35.232-2022.58a Obada Issa.pdf
Size:: 1.404Mb
Format:: PDF

View/ Open

This item appears in the following Collection(s)

Masters Theses

Show simple item record