ArchiMob Release 1 (2016)

Ref. 2268

Dataset Overview

Dataset title

ArchiMob Release 1 (2016)

Canonical DOI

Used to cite the entire dataset, regardless of version updates.

https://doi.org/10.48656/b7nv-1485

DOI

Used to cite a specific dataset version.

https://doi.org/10.48656/4kzf-sp50

Dataset description language

English

Data URL

-

Data Availability

-

Dataset Description

The ArchiMob corpus represents German varieties spoken in Switzerland. It is the first electronic resource containing long samples of transcribed text in Swiss German, and it is intended for studying the spatial distribution of morphosyntactic features and for natural language processing. This first release of the corpus, originally published in 2016, contains 34 transcribed interviews with a total of 528 381 tokens.

Remarks about the documentation

-

Version number

1.0

Embargo end date

-

Publication date

30.08.2023

Version notes

This is the first release of the ArchiMob corpus, originally published on 12 August 2016. It is also available on Zenodo: https://doi.org/10.5281/zenodo.1158572. For most purposes, the larger and more recent Release 2 of the corpus will be more suitable.

Bibliographical citation

Scherrer, Y., Samardzic, T., & Glaser, E. (2023). ArchiMob Release 1 (2016) (Version 1.0.0) [Data set]. LaRS - Language Repository of Switzerland. https://doi.org/10.48656/4kzf-sp50

DIP MD5 hash

52c687ed2bfbbb2b0af0233c5794a566
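
The hash above can be used to check that a downloaded copy is intact. The following is a minimal sketch, not an official verification procedure: it assumes the hash refers to the archive swissubase_2268_1_0.zip listed under Dataset contents and that the file sits in the current working directory. The helper names md5_of_file and matches_expected are hypothetical, chosen only for illustration.

```python
import hashlib
from pathlib import Path

# MD5 value copied from the "DIP MD5 hash" field of this record.
EXPECTED_MD5 = "52c687ed2bfbbb2b0af0233c5794a566"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Compare the file's MD5 digest with the expected hex string."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumption: the hash applies to this archive, downloaded locally.
    archive = Path("swissubase_2268_1_0.zip")
    if archive.exists():
        print("checksum OK" if matches_expected(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

A mismatch may mean the download is incomplete, or that the hash was computed over a different package than the one assumed here.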

Dataset contents

swissubase_2268_1_0.zip
ArchiMob_Release1_20160812_Documentation.pdf