Pulkit Kumar*1 · Shuaiyi Huang*1 · Matthew Walmer1 · Sai Saketh Rambhatla1,2 · Abhinav Shrivastava1
1University of Maryland, College Park 2GenAI, Meta
ICCV 2025
*Equal contribution
This repository contains the official code for the paper "Trokens: Semantic-Aware Relational Trajectory Tokens for Few-Shot Action Recognition".
- Create an environment using either conda or venv:

```bash
conda create -n trokens python=3.10
```

- Activate the environment:

```bash
conda activate trokens
```

- Install all dependencies:

```bash
pip install -r requirements.txt
```
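Before downloading any data, a quick import check can confirm the environment is working. This is only a convenience sketch and assumes the pinned requirements install PyTorch (which the training commands below rely on):

```python
# Optional sanity check for the trokens environment.
# Assumes requirements.txt installs PyTorch; adjust if your setup differs.
import sys

import torch

print(f"Python:  {sys.version.split()[0]}")
print(f"PyTorch: {torch.__version__}")
print(f"CUDA available: {torch.cuda.is_available()}")
```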
The pre-computed Trokens point tracking data is available on Hugging Face at: https://huggingface.co/datasets/pulkitkumar95/trokens_pt_data

To download and set up the point tracking data:
```bash
# Install huggingface_hub if not already installed
pip install huggingface_hub
```

```python
# Download the dataset
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="pulkitkumar95/trokens_pt_data",
    repo_type="dataset",
    local_dir="<LOCAL_PATH_TO_SAVE_TROKENS_PT_DATA>",
)
```

Or using the command line:
```bash
huggingface-cli download pulkitkumar95/trokens_pt_data --repo-type dataset --local-dir <LOCAL_PATH_TO_SAVE_TROKENS_PT_DATA>
```

Once the dataset is downloaded, unzip the individual dataset archives by running:
```bash
cd <LOCAL_PATH_TO_SAVE_TROKENS_PT_DATA>/cotracker3_bip_fr_32
unzip '*.zip'
```

All of the provided point tracking data was extracted using the scripts available in the point_tracking/ directory. For details on the extraction process and how to extract point tracking data for new custom datasets, please refer to point_tracking/README.md.
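After unzipping, you can sanity-check the extraction with a short script like the one below. Only the cotracker3_bip_fr_32 directory name comes from the instructions above; the layout inside each extracted folder is not assumed here, so treat this as a rough check rather than part of the official tooling:

```python
# Rough check that the point tracking archives were extracted.
# Replace the placeholder with the path used during download.
from pathlib import Path

pt_root = Path("<LOCAL_PATH_TO_SAVE_TROKENS_PT_DATA>") / "cotracker3_bip_fr_32"

zips = sorted(pt_root.glob("*.zip"))
extracted = sorted(p for p in pt_root.iterdir() if p.is_dir())

print(f"Found {len(zips)} zip archive(s) and {len(extracted)} extracted folder(s).")
for folder in extracted:
    n_files = sum(1 for f in folder.rglob("*") if f.is_file())
    print(f"  {folder.name}: {n_files} files")
```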
Before running the training, set the following environment variables:
```bash
# Set config name (e.g., ssv2_small, ssv2_full, hmdb, k400, finegym)
export CONFIG_TO_USE=ssv2_small
export DATASET=ssv2
export EXP_NAME=trokens_release
export SECONDARY_EXP_NAME=sample_exp
# Path to store PyTorch models and weights
export TORCH_HOME=<LOCAL_PATH_TO_SAVE_PYTORCH_MODELS>
# Path to dataset directory containing videos
export DATA_DIR=<LOCAL_PATH_TO_SAVE_DATASET>
# Path to pre-computed Trokens point tracking data and few-shot info from Hugging Face
export TROKENS_PT_DATA=<LOCAL_PATH_TO_SAVE_TROKENS_PT_DATA>
# Base output directory for experiments
export BASE_OUTPUT_DIR=<LOCAL_PATH_TO_SAVE_EXPERIMENTS>
```
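These variables are combined into the experiment output path used during training (see OUTPUT_DIR below). The sketch that follows only illustrates that composition in Python; the provided shell script is assumed to handle it for you:

```python
# Illustration of how the experiment output path is composed from the
# environment variables above; mirrors the OUTPUT_DIR passed to training.
import os

required = ["BASE_OUTPUT_DIR", "CONFIG_TO_USE", "EXP_NAME", "SECONDARY_EXP_NAME"]
missing = [name for name in required if name not in os.environ]
if missing:
    raise SystemExit(f"Missing environment variables: {missing}")

output_dir = os.path.join(*(os.environ[name] for name in required))
print(f"Experiments will be written to: {output_dir}")
```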
A sample training script is provided in scripts/trokens.sh. After setting the above environment variables and configuring the paths, you can run:

```bash
bash scripts/trokens.sh <config_name_to_use>
```

For example:
```bash
bash scripts/trokens.sh ssv2_small
```

Alternatively, to train the model manually, you can use the following command:
```bash
torchrun --nproc_per_node=$NUM_GPUS --master_port=$MASTER_PORT \
tools/run_net.py --init_method env:// --new_dist_init \
--cfg configs/trokens/$CONFIG_TO_USE.yaml \
WANDB.ID $WANDB_ID \
WANDB.EXP_NAME $EXP_NAME \
MASTER_PORT $MASTER_PORT \
OUTPUT_DIR $BASE_OUTPUT_DIR/$CONFIG_TO_USE/$EXP_NAME/$SECONDARY_EXP_NAME \
NUM_GPUS $NUM_GPUS \
DATA_LOADER.NUM_WORKERS $NUM_WORKERS \
DATA.USE_RAND_AUGMENT True \
DATA.PATH_TO_DATA_DIR $DATA_DIR \
DATA.PATH_TO_TROKEN_PT_DATA $TROKENS_PT_DATA \
FEW_SHOT.K_SHOT $K_SHOT \
FEW_SHOT.TRAIN_QUERY_PER_CLASS 6 \
FEW_SHOT.N_WAY $N_WAY \
POINT_INFO.NAME $POINT_INFO_NAME \
POINT_INFO.SAMPLING_TYPE cluster_sample \
POINT_INFO.NUM_POINTS_TO_SAMPLE $NUM_POINTS_TO_SAMPLE \
MODEL.FEAT_EXTRACTOR dino \
MODEL.DINO_CONFIG dinov2_vitb14 \
MODEL.MOTION_MODULE.USE_CROSS_MOTION_MODULE True \
MODEL.MOTION_MODULE.USE_HOD_MOTION_MODULE True
```

Key parameters:
- `CONFIG_TO_USE`: Configuration file to use (e.g., ssv2_full, hmdb, k400, finegym)
- `NUM_GPUS`: Number of GPUs to use (e.g., 1)
- `NUM_WORKERS`: Number of data loader workers (e.g., 16)
- `K_SHOT`: Number of support examples per class (e.g., 1)
- `N_WAY`: Number of classes per episode (e.g., 5; see the episode sketch below)
- `POINT_INFO_NAME`: Point tracking method name
- `NUM_POINTS_TO_SAMPLE`: Number of trajectory points to sample
- `WANDB_ID`: Weights & Biases experiment ID
- `EXP_NAME`: Experiment name for wandb tracking
- `OUTPUT_DIR`: Output directory (typically derived as `$BASE_OUTPUT_DIR/$CONFIG_TO_USE/$EXP_NAME/$SECONDARY_EXP_NAME`)
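To make the few-shot arguments concrete, the sketch below samples a single N-way, K-shot episode from toy data. It is purely illustrative and not the repository's episode sampler; sample_episode and the toy clip ids are made up for this example:

```python
# Illustrative N-way, K-shot episode sampling (not the repository's sampler).
# Each episode draws FEW_SHOT.N_WAY classes; every class contributes
# FEW_SHOT.K_SHOT support clips plus FEW_SHOT.TRAIN_QUERY_PER_CLASS query clips.
import random


def sample_episode(videos_by_class, n_way=5, k_shot=1, queries_per_class=6):
    """videos_by_class maps a class name to a list of clip ids (toy data)."""
    classes = random.sample(sorted(videos_by_class), n_way)
    support, query = {}, {}
    for cls in classes:
        clips = random.sample(videos_by_class[cls], k_shot + queries_per_class)
        support[cls] = clips[:k_shot]
        query[cls] = clips[k_shot:]
    return support, query


# Toy usage: 10 classes with 20 fake clip ids each.
toy = {f"class_{i}": [f"clip_{i}_{j}" for j in range(20)] for i in range(10)}
support, query = sample_episode(toy, n_way=5, k_shot=1, queries_per_class=6)
print({cls: len(clips) for cls, clips in support.items()})  # 1 support clip per class
print({cls: len(clips) for cls, clips in query.items()})    # 6 query clips per class
```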
This codebase is under active development. If you encounter any issues or have questions, please feel free to:
- Open an issue in this repository
- Contact Pulkit at pulkit[at]umd[dot]edu
This codebase is built upon the following excellent repositories:
- TATs: Trajectory-aligned Space-time Tokens for Few-shot Action Recognition
- MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition
- ORViT: Object-Region Video Transformers
We thank the authors for making their code publicly available.
If you find this code and our paper useful for your research, please cite our papers:
```bibtex
@inproceedings{kumar2025trokens,
  title={Trokens: Semantic-Aware Relational Trajectory Tokens for Few-Shot Action Recognition},
  author={Kumar, Pulkit and Huang, Shuaiyi and Walmer, Matthew and Rambhatla, Sai Saketh and Shrivastava, Abhinav},
  booktitle={International Conference on Computer Vision},
  year={2025}
}

@inproceedings{kumar2024trajectory,
  title={Trajectory-aligned Space-time Tokens for Few-shot Action Recognition},
  author={Kumar, Pulkit and Padmanabhan, Namitha and Luo, Luke and Rambhatla, Sai Saketh and Shrivastava, Abhinav},
  booktitle={European Conference on Computer Vision},
  pages={474--493},
  year={2024},
  organization={Springer}
}
```