Swiss-AL DE Journalistic Corpus (high reach+regional, 2018-2025)

Ref. 3167

  

Dataset Overview

Dataset title

Swiss-AL DE Journalistic Corpus (high reach+regional, 2018-2025)

Canonical DOI

Used to cite the entire dataset, regardless of version updates.

https://doi.org/10.48656/v776-mk85

DOI

Used to cite a specific dataset version.

https://doi.org/10.48656/s1pt-k426

Dataset description language

English

Data Availability

The corpus can be searched on the Swiss-AL Platform (https://swiss-al.zhaw.ch/). While the raw data are not directly available on the platform for data-privacy and copyright reasons, the Swiss-AL Platform provides links to original documents whenever retrieval was possible.

Dataset Description

The corpus comprises texts from high-reach and regional Swiss daily and weekly newspapers in German, obtained through the Swiss Media Database (swissdox@LiRi).

Remarks about the documentation

On the Swiss-AL Platform documentation site, you can find more information about the corpora and the tools that can be used to access and visualize corpus data. Metadata about the corpus can be retrieved from the following website: https://github.zhaw.ch/pages/digitallinguistics/swiss-al-docu/corpora.html#de-journalistic-corpus-high-reach-regional-2018-2025

Version number

1.0

Embargo end date

-

Publication date

06.03.2026

Version notes

Version 1.0

Bibliographical citation

Krasselt, J., Lemmenmeier, D., Geckeler, S., Rothenhäusler, K., & Fluor, M. (2026). Swiss-AL DE Journalistic Corpus (high reach+regional, 2018-2025) (Version 1.0) [Data set]. LaRS - Language Repository of Switzerland. https://doi.org/10.48656/s1pt-k426

DIP MD5 hash

20695da025a183274c49df4e5bfb9ebd

Dataset contents

/
Swiss-AL_DE_Journalistic_Corpus_high_reach_regional_2018-2025.csv
metadata.yaml