更新时间:2026-02-07 GMT+08:00
分享

视频类数据集格式要求

ModelArts Studio大模型开发平台支持创建视频类数据集,创建时可导入多种形式的数据,具体格式要求详见表1

表1 视频类数据集格式要求

文件内容

文件格式

文件要求

视频

mp4或avi

  • 支持mp4、avi视频格式上传,所有视频可以放在多个文件夹下,每个文件夹下可以同时包含mp4或avi格式的视频。
  • 从OBS导入:单个文件/压缩包大小不超过20GB,文件数量不限制。

视频+标注

视频+jsonl

  • 视频格式支持:mp4、avi
  • 标注文件格式:jsonl,jsonl文件仅支持UTF-8编码。

示例如下所示:

具体的jsonl标注文件参考:

{"video_fn": "13/ad098173-af09-48fe-95c3-e72fd629688e.mp4"视频相对路径,
"prompt": "A person pours a clear liquid from a bottle into a shot glass, then lifts the glass to their mouth and drinks the shot. The background includes a red coat and other indistinct background elements."视频摘要生成(简略),
"long_prompt": "A person is seen pouring a clear liquid from a green glass bottle into a small glass. The individual is wearing a white shirt with a lace collar and a beige cardigan. The background appears to be a cozy indoor setting, possibly a cafe or a restaurant, with red and white elements visible, such as a red coat hanging on the wall and a white table. The person carefully pours the liquid, ensuring it is filled to the brim of the glass. The liquid is clear and has some green leaves floating in it. The person then holds the glass up, possibly to show the contents or to prepare for a drink.",视频摘要生成(详细)
}

相关文档