SMILE Continuous DSGS Corpus L2 Phase 3

Ref. 3291

  

Dataset Overview

Dataset title

SMILE Continuous DSGS Corpus L2 Phase 3

Canonical DOI

Used to cite the entire dataset, regardless of version updates.

https://doi.org/10.48656/7w44-s005

DOI

Used to cite a specific dataset version.

https://doi.org/10.48656/0zm5-kp54

Dataset description language

English

Data URL

-

Data Availability

-

Dataset Description

This dataset contains videos of Phase 3/4 of DSGS L2 learner productions. The participants completed the same exercises as the native signers included in the L1 subcorpus. The dataset includes the original video recordings, corresponding MediaPipe videos, and annotations for two exercises. All data were prepared following appropriate data management and ethical guidelines, and relevant preprocessing steps were applied to ensure consistency and usability for research purposes as described in the data statements. Usage Information: This dataset contains .xz compressed files. While their content is safe, the .xz format may be blocklisted by your institution's IT infrastructure, and handling these files could result in an account suspension. Consider recompressing them locally before use.

Remarks about the documentation

The dataset contains two documentation files (README L2 and Data Statements) that provide detailed information on the data origin, the file structure, and the available metadata.

Version number

1.0

Embargo end date

-

Publication date

03.06.2026

Version notes

Version 1.0

Bibliographical citation

Battisti, A., & Ebling, S. (2026). SMILE Continuous DSGS Corpus L2 Phase 3 (Version 1.0) [Data set]. LaRS - Language Repository of Switzerland. https://doi.org/10.48656/0zm5-kp54

DIP MD5 hash

9bae450cd39c8f2bf4fe17eba6bdfed2

Dataset contents

/
README_L2.txt
SMILE_Continuous_DSGS_Corpus_Data_statements.pdf
metadata.yaml