Show simple item record

dc.contributor.advisorShanableh, Tamer
dc.contributor.authorIssa, Obada
dc.date.accessioned2023-03-02T07:48:03Z
dc.date.available2023-03-02T07:48:03Z
dc.date.issued2022-11
dc.identifier.other35.232-2022.58
dc.identifier.urihttp://hdl.handle.net/11073/25177
dc.descriptionA Master of Science thesis in Computer Engineering by Obada Issa entitled, “Automatic Video Summarization Using HEVC and CNN Features”, submitted in November 2022. Thesis advisor is Dr. Tamer Shanableh. Soft copy is available (Thesis, Completion Certificate, Approval Signatures, and AUS Archives Consent Form).en_US
dc.description.abstractWith the incredible surge of the internet and surveillance footage, there is a vast number of digital videos. The need to summarize these videos within databases is very crucial. This is where video summarization comes in handy. Video summarization can be achieved by a number of techniques. This study proposes a novel solution for the detection of key-frames for static video summarization. We preprocessed the well-known video datasets by coding them using the HEVC video coding standard. During coding, 64 proposed features were generated from the coder for each frame. Additionally, we extracted RGB frames from the original raw videos and fed them into pre-trained CNN networks for feature extraction. These include GoogleNet, AlexNet, Inception-ResNet-v2, and VGG16. The modified datasets are made publicly available to the research community. A subset of the proposed HEVC feature set was used to identify duplicate or similar frames and eliminate them from the video. We also propose an elimination solution based on the sum of the absolute differences between a frame and its motion-compensated predecessor. The proposed solutions are compared with existing works based on an SIFT flow algorithm that uses CNN features. Subsequently, an optional dimensionality reduction based on stepwise regression was applied to the feature vectors prior to detecting key-frames. The proposed solution is compared with existing studies that use sparse autoencoders with CNN features for dimensionality reduction. The accuracy of the proposed key-frame detection system was assessed using the Positive Predictive Values, Sensitivity, and F-score metrics. Combining the proposed solution with Multi-CNN features and using a Random Forests classifier, it was shown that the proposed solution achieved an average F-score of 0.98.en_US
dc.description.sponsorshipCollege of Engineeringen_US
dc.description.sponsorshipDepartment of Computer Science and Engineeringen_US
dc.language.isoen_USen_US
dc.relation.ispartofseriesMaster of Science in Computer Engineering (MSCoE)en_US
dc.subjectStatic Video Summarizationen_US
dc.subjectConvolution Neural Networks (CNN)en_US
dc.subjectDuplicate Framesen_US
dc.subjectSparse Autoencodersen_US
dc.subjectRandom Forests classifieren_US
dc.subjectVideo Codingen_US
dc.subjectHigh Efficiency Video Codec (HEVC)en_US
dc.subjectMotion estimationen_US
dc.subjectMotion compensationen_US
dc.subjectStepwise regressionen_US
dc.titleAutomatic Video Summarization Using HEVC and CNN Featuresen_US
dc.typeThesisen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record