The directory for data loaders for TAN experiments (not for end-to-end experiments).
We have shared the pre-processed ASR sentence of the entire HowTo100M dataset (i.e. the output of sentencify-text module for all HTM videos) on VGG server: https://www.robots.ox.ac.uk/~vgg/research/tan/index.html#htm-sentencify
You should download and place the files as
data/
sentencified_htm_370k.json
htm_align.json-
Pre-process HowTo100M ASR text with sentencify-text module
-
Store the processed ASR sentences as separate csv files, e.g.
abcdefghijk.csvcontainsstart,end,text 4.13,6.50,"so we've moved location for our dessert" 6.50,8.36,"and as you can see there's an amazing area" ...
-
Prepare an
vid_to_asr.jsonfile in this directory, containing a dictionary mappingvidto the csv path for ASR, e.g.{'abcdefghijk': 'your_path/abcdefghijk.csv', ...} -
Test your data preparation, run:
python loader_htm.py
You should see a list of strings without error.