SMILE Continuous DSGS Corpus L2 Phase 2

Ref. 3290

  

Dataset Overview

Dataset title

SMILE Continuous DSGS Corpus L2 Phase 2

Canonical DOI

Used to cite the entire dataset, regardless of version updates.

https://doi.org/10.48656/de29-dm79

DOI

Used to cite a specific dataset version.

https://doi.org/10.48656/x9e4-8z36

Dataset description language

English

Data URL

-

Data Availability

-

Dataset Description

This dataset contains videos of DSGS (Swiss German Sign Language) L2 learner productions from Phase 2/4. The participants completed the same exercises as the native signers included in the L1 subcorpus. The dataset includes the original video recordings, the corresponding MediaPipe videos, and annotations for two exercises. All data were prepared in line with the applicable data management and ethical guidelines, and the preprocessing steps described in the data statements were applied to ensure consistency and usability for research.

Usage Information: This dataset contains .xz compressed files. Their content is safe, but the .xz format may be blocklisted by your institution's IT infrastructure, and handling these files could result in account suspension. Consider recompressing them locally before use.
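
The local recompression suggested above could be sketched as follows. This is a minimal illustration that uses only the Python standard library, not a procedure prescribed by the dataset authors. The function name `recompress_xz_to_gzip`, the choice of gzip as the target format, and the example file name are all assumptions. Run it only in an environment where opening .xz files is permitted.

```python
import gzip
import lzma
import shutil
from pathlib import Path


def recompress_xz_to_gzip(xz_path: Path) -> Path:
    """Rewrite `name.ext.xz` as `name.ext.gz` next to the original.

    Hypothetical helper: gzip is an assumed target format, chosen only
    because it is widely accepted; any other format would work the same way.
    """
    gz_path = xz_path.with_suffix(".gz")  # replaces only the trailing ".xz"
    # Stream the data so large files are never held fully in memory.
    with lzma.open(xz_path, "rb") as src, gzip.open(gz_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return gz_path


if __name__ == "__main__":
    # "annotations.json.xz" is a placeholder, not a file name from the dataset.
    example = Path("annotations.json.xz")
    if example.exists():
        print(recompress_xz_to_gzip(example))
```

The original .xz file is left in place; delete it separately once the recompressed copy has been checked.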

Remarks about the documentation

The dataset contains two documentation files (README L2 and Data Statements) that provide detailed information on the data origin, the file structure, and the available metadata.

Version number

1.0

Embargo end date

-

Publication date

02.06.2026

Version notes

Version 1.0

Bibliographical citation

Battisti, A., & Ebling, S. (2026). SMILE Continuous DSGS Corpus L2 Phase 2 (Version 1.0) [Data set]. LaRS - Language Repository of Switzerland. https://doi.org/10.48656/x9e4-8z36

DIP MD5 hash

c420f50ae99147b95202e58cade47b76
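
A downloaded package can be checked against the MD5 hash above with a short script. The sketch below assumes the hash covers the downloaded package as a single archive file, which this page does not state explicitly. The function name `md5_of_file` and the archive file name are likewise hypothetical.

```python
import hashlib
from pathlib import Path

# Value copied from the "DIP MD5 hash" field of this record.
EXPECTED_MD5 = "c420f50ae99147b95202e58cade47b76"


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # "dataset_package.zip" is a placeholder for the downloaded archive.
    archive = Path("dataset_package.zip")
    if archive.exists():
        actual = md5_of_file(archive)
        print("match" if actual == EXPECTED_MD5 else f"mismatch: {actual}")
```

MD5 is adequate here for detecting a corrupted or incomplete download; it is not a guarantee against deliberate tampering.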

Dataset contents

/
README_L2.txt
SMILE_Continuous_DSGS_Corpus_Data_statements.pdf
metadata.yaml