Towards a Custom Designed Mechanism for Indexing and Retrieving Video Transcripts

Gabriel Turcu; Stella María Heras Barberá; Javier Palanca Cámara; Vicente J. Julián Inglada; Marian Cristian Mihaescu

Gabriel Turcu ^[2] ; Stella Heras ^[1] ; Javier Palanca ^[1] ; Vicente Julian ^[1] ; Marian Cristian Mihaescu ^[2]
1. [1] Universidad Politécnica de Valencia, Valencia, Spain
2. [2] University of Craiova, Faculty of Automatics, Computers and Electronics
Published in: Hybrid Artificial Intelligent Systems. 14th International Conference, HAIS 2019: León, Spain, September 4–6, 2019. Proceedings / edited by Hilde Pérez García, Lidia Sánchez González, Manuel Castejón Limas, Héctor Quintián Pardo, Emilio Santiago Corchado Rodríguez, 2019, ISBN 978-3-030-29858-6, pp. 299-309
Language: English
Abstract
- Finding appropriate e-Learning resources within a repository of videos is a critical task for students. Given that transcripts are available for the entire set of videos, the problem reduces to obtaining a ranked list of video transcripts for a particular query. The paper presents a custom approach for searching the 16,012 video transcripts available from https://media.upv.es/ at Universitat Politècnica de València. An inherent difficulty of the problem is that the transcripts are in Spanish. The proposed solution embeds all the transcripts using feed-forward Neural-Net Language Models, clusters the embedded transcripts, and builds a Latent Dirichlet Allocation (LDA) model for each cluster. A new query can then be processed to find the transcripts whose LDA results are closest to the LDA results for the query.
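
The retrieval step summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that transcript embeddings, cluster centroids, and per-cluster LDA topic distributions have already been computed, and it makes two choices that the abstract does not specify: the query is routed to the cluster with the most similar centroid (cosine similarity), and "closest LDA results" is measured with the Hellinger distance. The function name `rank_transcripts`, the dictionary layout, and all numbers are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def hellinger(p, q):
    """Hellinger distance between two discrete probability distributions."""
    return math.sqrt(0.5 * sum((math.sqrt(a) - math.sqrt(b)) ** 2
                               for a, b in zip(p, q)))


def rank_transcripts(query_embedding, query_topics, clusters, top_k=3):
    """Return (cluster id, ranked transcript ids) for one query.

    query_topics maps each cluster id to the query's topic distribution
    under that cluster's LDA model (assumed to be precomputed).
    """
    # Assumption: pick the cluster whose centroid is most similar to the query.
    best = max(clusters,
               key=lambda cid: cosine(query_embedding, clusters[cid]["centroid"]))
    # Assumption: rank that cluster's transcripts by Hellinger distance
    # between their topic distribution and the query's.
    scored = sorted(
        (hellinger(query_topics[best], topics), tid)
        for tid, topics in clusters[best]["topics"].items()
    )
    return best, [tid for _, tid in scored[:top_k]]


# Toy data (hypothetical): two clusters, two LDA topics per cluster.
clusters = {
    "c0": {"centroid": [1.0, 0.0],
           "topics": {"t1": [0.9, 0.1], "t2": [0.2, 0.8]}},
    "c1": {"centroid": [0.0, 1.0],
           "topics": {"t3": [0.5, 0.5]}},
}
query_embedding = [0.9, 0.1]
query_topics = {"c0": [0.8, 0.2], "c1": [0.5, 0.5]}

result = rank_transcripts(query_embedding, query_topics, clusters)
```

In a real system the embeddings and topic distributions would come from trained models; other distances between distributions, such as Jensen-Shannon divergence, would fit the same structure.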