What's New, Switzerland? Corpus

Ref. 2579

Dataset Overview

Dataset title

What's New, Switzerland? Corpus

Canonical DOI

Used to cite the entire dataset, regardless of version updates.

https://doi.org/10.48656/1sbx-qr60

DOI

Used to cite a specific dataset version.

https://doi.org/10.48656/pa3t-xh52

Dataset description language

English

Data URL

-

Data Availability

-

Dataset Description

The What's New, Switzerland? Corpus is a dataset of 72 authentic WhatsApp chats between 118 French-speaking users in Switzerland, collected in the framework of the "Evolving Language" NCCR. Chats were donated by users between August and October 2022. The data have been de-identified using a partly automated and partly manual workflow. Each chat is provided in two versions: an XML-TEI version (which includes extensive metadata about chats, users, and messages) and a plain text version. The dataset is available on demand for research purposes, under a restricted license contract.

Remarks about the documentation

The documentation is included in the dataset archive both in PDF format (README.pdf) and markdown format (README.md).

Version number

1.0

Embargo end date

-

Publication date

19.04.2024

Version notes

Version 1.0

Bibliographical citation

Xanthos, A., Gupta, P., Benkais, L., Doudot, L., & Grütter, A. (2024). What's New, Switzerland? Corpus (Version 1.0.0) [Data set]. LaRS - Language Repository of Switzerland. https://doi.org/10.48656/pa3t-xh52

DIP MD5 hash

c0c3bfa19c742dae5ea2f3cb717eb6f8

Dataset contents

swissubase_2579_1_0.zip
metadata.yaml