Name	Name	Last commit message	Last commit date
parent directory ..
readme.md	readme.md

Name

Last commit message

Last commit date

HTM-AA Dataset [website]

HTM-AA means Auto-Aligned (AA) version of HowTo100M (HTM) dataset. It is an output of our Temporal Alignment Networks and a final goal of this project.

HTM-AA is a large-scale paired video-text dataset, automatically obtained without any human annotation. In our paper Table 4, we show it can improve the backbone visual representation.

For a video from the HowTo100M dataset, HTM-AA provides:

the visually alignable sentences taken from the YouTube ASR,
their corresponding video timestamps (in second).

Download

HTM-AA-v1(329MB csv) [from Oxford server] [from Google Drive]

Statistics

[website]

How To Load

import pandas as pd
htm_aa = pd.read_csv('htm_aa_v1.csv')

print(htm_aa.iloc[42].to_dict())
# {'vid': '6yooogsTG8k',
#  'timestamp': 284,
#  'text': "and starting with pink i'm just going to knead that a little bit just to make it nice and smooth"}

Reference

If you find this dataset useful for your project, please consider citing our paper:

@InProceedings{Han2022TAN,
    author       = "Tengda Han and Weidi Xie and Andrew Zisserman",
    title        = "Temporal Alignment Networks for Long-term Video",
    booktitle    = "CVPR",
    year         = "2022",
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme.md

HTM-AA Dataset [website]

Download

Statistics

How To Load

Reference

FilesExpand file tree

htm_aa

Directory actions

More options

Directory actions

More options

Latest commit

History

htm_aa

Folders and files

parent directory

readme.md

HTM-AA Dataset [website]

Download

Statistics

How To Load

Reference