Publications

We present Superlim, a multi-task NLP benchmark and analysis platform for evaluating Swedish language models, a counterpart to the …

We present the DURel tool that implements the annotation of semantic proximity between uses of words into an online, open source …

In this paper, we introduce a novel approach to tracing the evolution of word meaning over time.

In this chapter, we provide an overview of computational modeling for semantic change using large and semi-large textual corpora.

In this work we investigate the hypothesis that enriching contextualized models using fine-tuning tasks can improve their capacity to …

In this paper, we describe the creation of the largest resource of graded contextualized, diachronic word meaning annotation in four …

This chapter surveys visualization and user interface solutions for understanding lexical semantic change and potential …

This article provides a comprehensive survey of recent computational techniques to tackle both diachronic conceptual change (semantic …

This volume offers a survey of this exciting new direction in the study of semantic change, a discussion of the many remaining …

Lexical Semantic Change detection, i.e., the task of identifying words that change meaning over time, is a very active research area, …

This data collection contains the post-evaluation data for SemEval-2020 Task 1.

This paper is an overview of the opportunities and challenges of using large-scale text mining to answer research questions that stem …

Aspect-Based Sentiment Analysis constitutes a more fine-grained alternative to traditional sentiment analysis at sentence level. In …

Swedish Test Data for SemEval 2020 Task 1

State-of-the-art models of lexical semantic change detection suffer from noise stemming from vector space alignment. We have …

We use a gold standard under construction for sentiment analysis in Swedish to explore how attitudes towards immigration change across …

We process and visualize Swedish parliamentary data using methods from statistics and machine learning, which allows us to obtain …

The KubHist Corpus is a massive corpus of Swedish historical newspapers, digitized by the Royal Swedish Library and available through …

In this paper, we discuss a data-intensive research methodology for the digital humanities. We highlight the differences and …

This article is a survey of recent computational techniques to tackle lexical semantic change; in particular, we focus on diachronic …

Human language constantly evolves due to the changing world and the need for easier forms of expression and communication. In this …

We created the first ever sense-disambiguated sentiment lexicon (SenSALDO) as an open source resource, freely available from …

In this paper we describe the creation of a gold standard for the sentiment annotation of Swedish terms as a first step towards the …

Detecting word sense changes can be of great interest in the field of digital humanities. Thus far, most investigations and automatic …

In this paper, we study the parameter tuning for both algorithms within the word sense disambiguation problem. The experiments are …

With advances in technology and culture, our language changes. We invent new words, add or change meanings of existing words and change …

We present a method for detecting word sense changes by utilizing automatically induced word senses. Our method works on the level of …

A free and open test set for word sense change

In this paper we present a dataset of contemporary Swedish containing one billion words: the Gigaword corpus.

In this paper we present a Swedish Sentiment Lexicon.

We present a case study on supervised classification of Swedish pseudo-coordination (SPC). The classification is attempted on the …

The concept of culturomics was born out of the availability of massive amounts of textual data and the interest to make sense of …

Advancements in technology and culture lead to changes in our language. These changes create a gap between the language known by users …

With advancements in technology and culture, our language changes. We invent new words, add or change meanings of existing words and …

In this paper, we induce sense information using the curvature clustering algorithm and investigate the effects of OCR errors on the …

The introduction of Social Media allowed more people to publish texts by removing barriers that are technical but also social such as …

High impact events, political changes and new technologies are reflected in our language and lead to constant evolution of terms, …

Knowing the behavior of terms in written texts can help us tailor fit models, algorithms and resources to improve access to digital …

Knowing about the evolution of a term can significantly help when searching for relevant information, especially in case of sudden …

Semantic ambient media are the novel trend in the world of media reaching from the pioneering subareas such as ambient advertising to …

Proxy Credentials serve as a principal for authentication and authorization in the Grid. Despite their limited lifetime, they can be …

Word sense discrimination is the first, important step towards automatic detection of language evolution within large, historic …

The archival of content like publications or web pages is just the first step toward "full" content preservation. It also has to be …

As archives contain documents that span over a long period of time, the language used to create these documents and the language used …

The correspondence between the terminology used for querying and the one used in content objects to be retrieved is a crucial …