Published May 13, 2026 | Version v1
Dataset Open

European Smaller-Language Video Subtitle Sample Set

Creators

  • 1. ROR icon Independent Research Association

Description

This dataset contains self-authored subtitle samples for six smaller European languages: Welsh, Irish, Catalan, Basque, Maltese, and Icelandic. The release is structured for repository deposit and multilingual video-localization evaluation, with 144 clip-level records, 432 aligned subtitle segments, 288 SRT files, a manifest, field dictionary, methodology notes, and a machine-readable schema. All distributed text was authored specifically for this release. No source video, source audio, scraped subtitles, or third-party transcripts are included.
 
The package is intended for subtitle alignment testing, localization workflow review, multilingual file-ingestion checks, and documentation of repository-ready dataset packaging for audiovisual translation scenarios. The distributed files support both human inspection and machine processing: SRT files provide subtitle-like timing structure, while the CSV and JSON files expose clip identifiers, segment alignment, language coverage, and rights metadata in a form suitable for downstream parsing or indexing.
 
This repository record distributes the dataset files directly. The linked website is provided only as supplementary project context for subtitle translation and multilingual file-processing workflows.

Files

european-smaller-language-video-subtitle-sample-set-v1.0.0.zip

Files (160.1 kB)

Name Size Download all
Checksum: md5:d2ba84a51f2a0ddd066f1380f81b742f

PID: http://hdl.handle.net/11304/6ed6c025-c0f4-41bd-a949-7968d0dddd3a
160.1 kB Preview Download

Additional details

Identifiers

Other
ESLS-2026-V1

Related works

Is supplemented by
https://aitranslatevideo.org/subtitle-translator/ (Other)

Dates

Submitted
2026-05-13

Instruments

Text authoring and subtitle formatting workflow
esls-workflow-v1 (Other)

Details

Expansion error: Instrument is not eligible for expansion

References

  • AI Translate Video. Companion project website for subtitle translation and multilingual file-processing workflow context. Available at: https://aitranslatevideo.org/ Accessed 2026-05-13.