Text-based video summarization