• Login
    View Item 
    •   DSpace Home
    • College of Engineering (CEN)
    • Department of Computer Science and Engineering
    • View Item
    •   DSpace Home
    • College of Engineering (CEN)
    • Department of Computer Science and Engineering
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    CNN and HEVC Video Coding Features for Static Video Summarization

    Thumbnail
    View/ Open
    CNN_and_HEVC_Video_Coding_Features_for_Static_Video_Summarization.pdf (1.072Mb)
    Date
    2022
    Author
    Issa, Obada
    Shanableh, Tamer
    Advisor(s)
    Unknown advisor
    Type
    Article
    Peer-Reviewed
    Published version
    Metadata
    Show full item record
    Abstract
    This study proposes a novel solution for the detection of keyframes for static video summarization. We preprocessed the well-known video datasets by coding them using the HEVC video coding standard. During coding, 64 proposed features were generated from the coder for each frame. Additionally, we converted the original YUVs of the raw videos into RGB images and fed them into pretrained CNN networks for feature extraction. These include GoogleNet, AlexNet, Inception-ResNet-v2, and VGG16. The modified datasets are made publicly available to the research community. Before detecting keyframes in a video, it is important to identify and eliminate duplicate or similar video frames. A subset of the proposed HEVC feature set was used to identify these frames and eliminate them from the video. We also propose an elimination solution based on the sum of the absolute differences between a frame and its motion-compensated predecessor. The proposed solutions are compared with existing works based on an SIFT flow algorithm that uses CNN features. Subsequently, an optional dimensionality reduction based on stepwise regression was applied to the feature vectors prior to detecting key frames. The proposed solution is compared with existing studies that use sparse autoencoders with CNN features for dimensionality reduction. The accuracy of the proposed key-frame detection system was assessed using the positive predictive values, sensitivity, and F-scores. Combining the proposed solution with Multi-CNN features and using a random forest classifier, it was shown that the proposed solution achieved an average F-score of 0.98.
    DSpace URI
    http://hdl.handle.net/11073/24062
    External URI
    https://doi.org/10.1109/ACCESS.2022.3188638
    Collections
    • Department of Computer Science and Engineering

    Related items

    Showing items related by title, author, creator and subject.

    • Signing of Content Distribution Agreement between the American University of Sharjah and Emirates Cable TV & Multimedia L.L.C. [Etisalat] regarding Video On Demand (VOD) service 

      Chancellor's Office, / (2015-02-15)
    • Special Topics: Video Editing for Journalism and Documentaries - MCM 294 

      Smith, Susan (2010)
    • Special Topics: Video Editing - MCM 294 

      Smith, Susan (2011)

    Browse

    All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsCollege/DeptArchive ReferenceSeriesThis CollectionBy Issue DateAuthorsTitlesSubjectsCollege/DeptArchive ReferenceSeries

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    DSpace software copyright © 2002-2016  DuraSpace
    Submission Policies | Terms of Use | Takedown Policy | Privacy Policy | About Us | Contact Us | Send Feedback

    Return to AUS
    Theme by 
    Atmire NV