Which Words Do You Remember? Temporal Properties of Language Use in Digital Archives

Which Words Do You Remember? Temporal Properties of Language Use in Digital Archives

Abstract

Knowing the behavior of terms in written texts can help us tailor fit models, algorithms and resources to improve access to digital libraries and help us answer information needs in longer spanning archives. In this paper we investigate the behavior of English written text in blogs in comparison to traditional texts from the New York Times, The Times Archive, and the British National Corpus. We show that user generated content, similar to spoken content, differs in characteristics from ‘professionally’ written text and experiences a more dynamic behavior.

Publication
In International Conference on Theory and Practice of Digital Libraries, TPDL 2012
Date
Links
Avatar
Nina Tahmasebi
Associate Professor in Natural Language Processing