[ot][notes] how to automatically tag actions in video data

Undescribed Horrific Abuse, One Victim & Survivor of Many gmkarl at gmail.com
Thu Dec 29 16:42:17 PST 2022


to find better models (again, i don't know if this is the "proper" way):

1. mim can only download models that are associated with installed
packages, so to find models from the dev-1.x branch, it must be
installed

git checkout dev-1.x
pip3 install -e .

2. use the search command to find the best performing model with an
installed config. note: there are also tiny-ram variants of models,
for low end systems, which don't perform as well

mim search mmaction2 --sort kinetics-700/top_1_accuracy --descending

3. for dev-1.x, the model performing best on kinetics-700 is presently
swin-large-p244-w877_in22k-pre_16xb8-amp-32x2x1-30e_kinetics700-rgb ,
which is a 782MB download. for stable 1.x, the model that comes up is
slowonly_imagenet-pretrained-r50_16xb16-8x8x1-steplr-150e_kinetics700-rgb
, although i had to switch to dev-1.x to get the download to succeed,
maybe the url changed.

mim download mmaction2 --config \
  swin-large-p244-w877_in22k-pre_16xb8-amp-32x2x1-30e_kinetics700-rgb --dest .

4. use new model to run demo (maybe?)

python3 demo/demo.py \
  swin-large-p244-w877_in22k-pre_16xb8-amp-32x2x1-30e_kinetics700-rgb.py \
  swin-large-p244-w877_in22k-pre_16xb8-amp-32x2x1-30e_kinetics700-rgb_20220930-f8d74db7.pth
\
  demo/demo.mp4 tools/data/kinetics/label_map_k400.txt --device=cpu
# crashes for me; i don't have the ram on my old system for the large model

sha256sums:

2692d16c712e24994aaa3cfb48f957a521e053ffb81c474e2c0b3e579c888650
tsn_imagenet-pretrained-r50_8xb32-1x1x8-100e_kinetics400-rgb_20220906-2692d16c.pth
690bdf66675aa117e37298de8feb44112f80cc0fecd1290b21a4cfc1433d26b4
tsn_imagenet-pretrained-r50_8xb32-1x1x8-100e_kinetics400-rgb.py
adf57e1194172fe30ee1d44715aa820915f7586997e775aa975d0e7c24de4342
slowonly_imagenet-pretrained-r50_16xb16-8x8x1-steplr-150e_kinetics700-rgb_20220901-4098e1eb.pth
c1640f2a3ae2e962a9a9f8ea2c370c4f34c37f78d96fd77a852239f5587ab619
slowonly_imagenet-pretrained-r50_16xb16-8x8x1-steplr-150e_kinetics700-rgb.py
f8d74db74207c862162ba47b6c9296c0a1641f23621f9dd47555e8d8e56b7494
swin-large-p244-w877_in22k-pre_16xb8-amp-32x2x1-30e_kinetics700-rgb_20220930-f8d74db7.pth
84239680a94b874841c67bdc49739799bac0077437e1578960ddef598a8a3cb3
swin-large-p244-w877_in22k-pre_16xb8-amp-32x2x1-30e_kinetics700-rgb.py


More information about the cypherpunks mailing list