Published May 13, 2026 | Version v1
Dataset
Open
European Smaller-Language Video Subtitle Sample Set
Description
This dataset contains self-authored subtitle samples for six smaller European languages: Welsh, Irish, Catalan, Basque, Maltese, and Icelandic. The release is structured for repository deposit and multilingual video-localization evaluation, with 144 clip-level records, 432 aligned subtitle segments, 288 SRT files, a manifest, field dictionary, methodology notes, and a machine-readable schema. All distributed text was authored specifically for this release. No source video, source audio, scraped subtitles, or third-party transcripts are included.
The package is intended for subtitle alignment testing, localization workflow review, multilingual file-ingestion checks, and documentation of repository-ready dataset packaging for audiovisual translation scenarios. The distributed files support both human inspection and machine processing: SRT files provide subtitle-like timing structure, while the CSV and JSON files expose clip identifiers, segment alignment, language coverage, and rights metadata in a form suitable for downstream parsing or indexing.
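For the file-ingestion checks mentioned above, a minimal sketch of reading SRT timing structure might look like the following. It assumes the distributed SRT files follow the common SRT layout (a numeric index line, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` timing line, then one or more text lines, with blank lines between cues); the record does not document the exact layout, and the names `Cue` and `parse_srt` are illustrative and not part of the dataset.

```python
import re
from dataclasses import dataclass
from typing import List

# Common SRT timing line: "HH:MM:SS,mmm --> HH:MM:SS,mmm"
_TIMING = re.compile(
    r"(\d{2}):(\d{2}):(\d{2}),(\d{3})\s*-->\s*(\d{2}):(\d{2}):(\d{2}),(\d{3})"
)


@dataclass
class Cue:
    """One subtitle cue (hypothetical structure, not defined by the dataset)."""
    index: int
    start_ms: int
    end_ms: int
    text: str


def _to_ms(hours: str, minutes: str, seconds: str, millis: str) -> int:
    """Convert the four timestamp fields to milliseconds."""
    return ((int(hours) * 60 + int(minutes)) * 60 + int(seconds)) * 1000 + int(millis)


def parse_srt(content: str) -> List[Cue]:
    """Parse SRT text into cues, raising ValueError on a malformed block."""
    # Drop a leading byte-order mark and normalize line endings.
    content = content.lstrip("\ufeff").replace("\r\n", "\n").replace("\r", "\n")
    cues: List[Cue] = []
    # Cues are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", content.strip()):
        if not block.strip():
            continue
        lines = block.split("\n")
        match = _TIMING.match(lines[1].strip()) if len(lines) >= 2 else None
        if match is None or not lines[0].strip().isdigit():
            raise ValueError(f"Malformed SRT block: {block!r}")
        fields = match.groups()
        cues.append(
            Cue(
                index=int(lines[0]),
                start_ms=_to_ms(*fields[:4]),
                end_ms=_to_ms(*fields[4:]),
                text="\n".join(lines[2:]),
            )
        )
    return cues
```

Calling `parse_srt(open(path, encoding="utf-8").read())` on each SRT file and checking that every cue has `start_ms < end_ms` is one simple ingestion check; the UTF-8 encoding is also an assumption here.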
This repository record distributes the dataset files directly. The linked website is provided only as supplementary project context for subtitle translation and multilingual file-processing workflows.
Files
| Name | Size | Checksum | PID |
|---|---|---|---|
| european-smaller-language-video-subtitle-sample-set-v1.0.0.zip | 160.1 kB | md5:d2ba84a51f2a0ddd066f1380f81b742f | http://hdl.handle.net/11304/6ed6c025-c0f4-41bd-a949-7968d0dddd3a |
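The MD5 checksum listed for the archive can be used to confirm that a download is intact. The following is a minimal sketch using Python's standard `hashlib`; the function names `md5_of_file` and `matches_checksum` are illustrative, and the local path in the usage note is an assumption about where the file was saved.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path: str, expected: str) -> bool:
    """Compare a file's MD5 against a value such as 'md5:<hex>' or '<hex>'."""
    expected_hex = expected.lower()
    if expected_hex.startswith("md5:"):
        expected_hex = expected_hex[len("md5:"):]
    return md5_of_file(path) == expected_hex
```

For example, `matches_checksum("european-smaller-language-video-subtitle-sample-set-v1.0.0.zip", "md5:d2ba84a51f2a0ddd066f1380f81b742f")` should return `True` for an unmodified copy saved in the current directory. MD5 is suitable here for detecting transfer corruption, not for establishing authenticity.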
Additional details
Identifiers
- Other
- ESLS-2026-V1
Related works
- Is supplemented by
- https://aitranslatevideo.org/subtitle-translator/ (Other)
Dates
- Submitted
- 2026-05-13
Instruments
- Text authoring and subtitle formatting workflow
- esls-workflow-v1 (Other)
References
- AI Translate Video. Companion project website for subtitle translation and multilingual file-processing workflow context. Available at: https://aitranslatevideo.org/ (accessed 2026-05-13).