31, March 2025

Video Text Summarizer

Author(s): 1. Afreen Shah, 2. Rakhi Gupta

Authors Affiliations:

  1. Afreen Shah , MSc. IT Student, K.C College, Churchgate, Mumbai, India
  2. Rakhi Gupta, Head of Department, Department  of IT  K.C College, Churchgate, Mumbai, India

DOIs:10.2017/IJRCS/202503017     |     Paper ID: IJRCS2025030017


Abstract
Keywords
Cite this Article/Paper as
References

In today's digital landscape, video has become a dominant medium for communication and information sharing. Extracting key insights from lengthy videos, such as educational lectures, is often challenging and time-consuming. The proposed Video Text Summarization System addresses this challenge by automating the creation of concise and informative text summaries from video content. The system follows a structured, multi-stage process: audio is first extracted from videos, then converted into text using advanced speech-to-text technology. The resulting transcripts are refined through comprehensive text preprocessing to ensure clarity and coherence. By leveraging both extractive and abstractive summarization methods, the system condenses essential information efficiently. Scalable and adaptable, this solution is ideal for applications in education, professional environments, and content creation.

Video Summarization, Text Summarization, Speech-to-Text, Audio Extraction, Multi-Stage Process, Extractive Summarization, Abstractive Summarization.

Afreen Shah, Rakhi Gupta(2025); Video Text Summarizer, International Journal of Research Culture Society,    ISSN(O): 2456-6683,  Volume – 9,   Issue –  3.,  Pp.131-134.        Available on – https://ijrcs.org/

[1]         Habib Khan, Tanveer Hussain, Samee Ullah Khan, Zulfiqar Ahmad Khan, Sung Wook Baik, Deep multi-scale pyramidal features network for supervised video summarization, Expert Systems with Applications, Volume 237, Part C, 2024

 [2]     Guoqiang Liang, Yanbing Lv, Shucheng Li, Xiahong Wang, Yanning Zhang, Video summarization with a dual-path attentive network,  Neurocomputing, Volume 467, 2022, ISSN 0925-2312

[3]     H. Li, J. Zhu, C. Ma, J. Zhang and C. Zong, “Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video,” in IEEE Transactions on Knowledge and Data Engineering, vol. 31, no. 5, pp. 996-1009, 1 May 2019

[4]    T. Tyagi, L. Dhari, Y. Nigam and R. Nagpal, “Video Summarization using Speech Recognition and Text Summarization,” 2023 4th International Conference for Emerging Technology (INCET), Belgaum, India, 2023

[5]       E. Apostolidis, E. Adamantidou, A. I. Metsai, V. Mezaris and I. Patras, “Video Summarization Using Deep Neural Networks: A Survey,” in Proceedings of the IEEE, vol. 109, no. 11, pp. 1838-1863, Nov. 2021

Loading


Download Full Paper

Download PDF No. of Downloads:4 | No. of Views: 13