Step-by-step how to run the Angel System pipeline

Local Installation

Follow these steps (optional steps are for active development):

Get required repositories:
git clone [email protected]:PTG-Kitware/angel_system.git
cd angel_system
git submodule update --init --recursive
Create the environment
# IF YOU DON'T ALREADY HAVE PYTHON 3.8.10 AVAILABLE
conda create --name angel_system python=3.8.10
conda activate angel_system
poetry install

# OR JUST
poetry install

Docker Installation

Follow these steps (optional steps are for active development):

Get required repositories:

git clone [email protected]:PTG-Kitware/angel_system.git
cd angel_system
git submodule update --init --recursive

Create the environment

./angel-docker-build.sh -f
./angel-workspace-shell.sh

Inside the docker container:

./workspace_build.sh; source install/setup.sh

Data

See our data "Where things are" document on Google Drive here.

Object Detection Source Data

TODO

Pose Detection Source Data

TODO

TCN Source Data

BBN Medical Datasets

Source data from BBN can be acquired from https://bbn.com/private/ptg-magic/. Consult a team member for login information.

Each task ("skill") has its own sub-page ("In-Lab Data" link) that describes and links to the available sets of data. The data is stored on BBN's SFTP server, which is where the "Click to Download" links point.

Storage of downloaded ZIP archives, and their subsequent extractions, should follow the pattern below.

bbn_data/
├── README.md  # Indicate where we have acquired this BBN data.
└── lab_data-golden/
    ├── m2_tourniquet/
    │   ├── positive/
    │   │   ├── Fri-Apr-21/
    │   │   │   ├── 20230420_122603_HoloLens.mp4
    │   │   │   ├── 20230420_122603_HoloLens.skill_labels_by_frame.txt
    │   │   │   ├── 20230420_123212_HoloLens.mp4
    │   │   │   ├── 20230420_123212_HoloLens.skill_labels_by_frame.txt
    │   │   │   ├── 20230420_124541_HoloLens.mp4
    │   │   │   ├── 20230420_124541_HoloLens.skill_labels_by_frame.txt
    │   │   │   ├── 20230420_125033_HoloLens.mp4
    │   │   │   ├── 20230420_125033_HoloLens.skill_labels_by_frame.txt
    │   │   │   ├── 20230420_125517_HoloLens.mp4
    │   │   │   └── 20230420_125517_HoloLens.skill_labels_by_frame.txt
    │   │   ├── Fri-Apr-21.zip
    │   │   ├── Mon-Apr-17/
    │   │   │   ...
    │   │   └── Mon-Apr-17.zip
    │   │       ...
    │   └── negative/
    │       ├── Fri-Aug-25/
    │       │   ...
    │       └── Fri-Aug-25.zip
    ├── m3_pressure_dressing/
    │   ...
    └── r18_chest_seal/
        ...

A script is provided at scripts/extract_bbn_video_archives.bash to automate the recursive extraction of such ZIP files. To use it, change into a parent directory of the ZIP files to be extracted, then run the script:

cd ${PATH_TO_DATA}/
bash ${PATH_TO_ANGEL_SYSTEM}/scripts/extract_bbn_video_archives.bash
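
For readers who want to see what recursive extraction amounts to, here is a minimal Python sketch. It is not the provided bash script and may differ from it: the function name `extract_archives` is hypothetical, and it assumes each archive such as `Fri-Apr-21.zip` should unpack into a sibling directory named `Fri-Apr-21/`, as the layout above suggests.

```python
import zipfile
from pathlib import Path
from typing import List


def extract_archives(root: Path) -> List[Path]:
    """Extract every ZIP found under `root` into a sibling directory.

    Assumption: `<name>.zip` unpacks to `<name>/` next to the archive.
    The provided bash script may behave differently.
    """
    extracted = []
    # sorted() materializes the listing before anything new is written.
    for archive in sorted(root.rglob("*.zip")):
        target = archive.with_suffix("")  # .../Fri-Apr-21.zip -> .../Fri-Apr-21
        if target.exists():
            continue  # already extracted; leave existing files untouched
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
        extracted.append(target)
    return extracted
```

Calling `extract_archives(Path("bbn_data/lab_data-golden"))` returns the directories it created. Running it a second time extracts nothing, which keeps it safe to re-run after downloading additional archives.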

Golden data should be marked as read-only after downloading and extracting to prevent accidental modification of the files:

chmod a-w -R bbn_data/lab_data-golden/

Extracting Truth COCO and frame image files

BBN archives provide MP4 videos, but the following steps need individual image frames. The script that converts BBN truth data into COCO format therefore also extracts frames, as required by downstream processes, and writes them into a symmetric layout in a separate, writable location:

python-tpl/TCN_HPL/tcn_hpl/data/utils/bbn.py ...
# OR use the console-script entrypoint installed with the package
bbn_create_truth_coco \
  ./bbn_data/lab_data-golden \
  ./bbn_data/lab_data-working \
  ../../config/activity_labels/medical/m2.yaml \
  activity-truth-COCO.json
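
To illustrate what a "symmetric layout" means here, the sketch below maps a golden video path to a frame directory at the same relative location under the working root. The function name `working_frame_dir` and the choice to name the frame directory after the video's stem are assumptions for illustration; the actual naming used by bbn.py may differ.

```python
from pathlib import Path


def working_frame_dir(video: Path, golden_root: Path, working_root: Path) -> Path:
    """Mirror a golden video's relative path under the working root.

    Assumption: frames for `<name>.mp4` go in a directory called `<name>`.
    """
    relative = video.relative_to(golden_root)
    return working_root / relative.with_suffix("")


video = Path(
    "bbn_data/lab_data-golden/m2_tourniquet/positive/"
    "Fri-Apr-21/20230420_122603_HoloLens.mp4"
)
frames = working_frame_dir(
    video, Path("bbn_data/lab_data-golden"), Path("bbn_data/lab_data-working")
)
print(frames)
```

Because the golden tree is read-only, keeping the derived frames in a parallel tree makes it easy to find the frames for any video without ever writing next to the source data.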

Storage on Gyges

  • On gyges, raw data is located at:
    • Zed "Pro" data: /data/PTG/medical/bbn_data/Release_v0.5/v0.56/<task>
    • HoloLens Golden data: /data/PTG/medical/bbn_data/lab_data-golden/
    • HoloLens Working data: /data/PTG/medical/bbn_data/lab_data-working/
  • In this pipeline we are only provided with object detection ground truth training data, which is located at /data/PTG/medical/object_anns/<task>
  • For real-time execution, we store our models in angel_system/model_files/; these are provisioned by Ansible via configuration.

Examples of files and what they are used for:
  • /angel_system/tmux/demos/medical/M3-Kitware.yml: defines complete ROS configuration for all nodes
  • /angel_system/model_files/coco/r18_test_activity_preds.mscoco.json: required to determine the step activation threshold for the GSP
  • /angel_system/model_files/data_mapping/r18_mapping.txt: required to define model activity step classes
  • /angel_system/model_files/models/r18_tcn.ckpt: TCN model checkpoint
  • /angel_system/model_files/models/hands_model.pt: hand detection trained model
  • /angel_system/model_files/models/r18_det.pt: object detection trained model

Train or acquire an Object Detector

Quick-start example for YOLOv7:

python3 python-tpl/yolov7/yolov7/train.py \
  --workers 8 --device 0 --batch-size 4 \
  --data configs/data/PTG/medical/m2_task_objects.yaml \
  --img 768 768 \
  --cfg configs/model/training/PTG/medical/yolov7_m2.yaml \
  --weights weights/yolov7.pt \
  --project /data/PTG/medical/training/yolo_object_detector/train/ \
  --name m2_all_v1_example

Train or acquire Pose Estimator

TODO:

Activity Classifier Training Procedure

We take the following steps:

  1. Generate the activity classification truth COCO file.
  2. Predict objects in the scene.
  3. Predict poses and patient bounding boxes in the scene.
  4. Generate interaction feature vectors for the TCN.
  5. Train the TCN.

The following will use file path and value examples for the Medical M2 Tourniquet use-case.

Generate activity classification truth COCO file

Generate the truth MS-COCO file for per-frame activity truth annotations. This example presumes we are using BBN Medical data as our source (as of 2024/10/15).

python-tpl/TCN_HPL/tcn_hpl/data/utils/bbn.py \
  ~/data/darpa-ptg/bbn_data/lab_data-golden/m2_tourniquet \
  ~/data/darpa-ptg/bbn_data/lab_data-working/m2_tourniquet \
  ~/dev/darpa-ptg/angel_system/config/activity_labels/medical/m2.yaml \
  ~/data/darpa-ptg/bbn_data/lab_data-working/m2_tourniquet-activity_truth.coco.json

Train, validation, and testing splits can be created from COCO files. The kwcoco split tool can create splits at the video level; otherwise, splits may be created manually.

For example:

kwcoco split \
  --src ~/data/darpa-ptg/bbn_data/lab_data-working/m2_tourniquet-activity_truth.coco.json \
  --dst1 TRAIN-activity_truth.coco.json \
  --dst2 REMAINDER-activity_truth.coco.json \
  --splitter video \
  --rng 12345 \
  --factor 2
kwcoco split \
  --src REMAINDER-activity_truth.coco.json \
  --dst1 VALIDATION-activity_truth.coco.json \
  --dst2 TEST-activity_truth.coco.json \
  --splitter video \
  --rng 12345 \
  --factor 2
# Protect your files!
chmod a-w \
  TRAIN-activity_truth.coco.json \
  REMAINDER-activity_truth.coco.json \
  VALIDATION-activity_truth.coco.json \
  TEST-activity_truth.coco.json
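
The point of splitting at the video level is that all frames of one video land in the same split, so near-identical frames never appear in both training and evaluation data. The sketch below shows that idea on a plain COCO-style dictionary. It is not kwcoco's implementation: `split_by_video`, its `fraction` parameter, and the shuffling strategy are hypothetical, and it assumes each image carries a `video_id`.

```python
import random
from typing import Dict, Tuple


def split_by_video(coco: Dict, fraction: float, seed: int) -> Tuple[Dict, Dict]:
    """Split a COCO-style dict in two, keeping each video's images together."""
    video_ids = sorted(v["id"] for v in coco["videos"])
    random.Random(seed).shuffle(video_ids)  # seeded for repeatable splits
    first_ids = set(video_ids[: round(len(video_ids) * fraction)])

    def subset(keep_first: bool) -> Dict:
        videos = [v for v in coco["videos"] if (v["id"] in first_ids) == keep_first]
        images = [
            im for im in coco["images"] if (im["video_id"] in first_ids) == keep_first
        ]
        image_ids = {im["id"] for im in images}
        anns = [a for a in coco.get("annotations", []) if a["image_id"] in image_ids]
        return {**coco, "videos": videos, "images": images, "annotations": anns}

    return subset(True), subset(False)
```

Applying it twice, first to the full set and then to the remainder, mirrors the two-stage train / validation / test procedure shown in the commands above.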

Generate Object Predictions in the Scene

Note that the input COCO file is that which was generated in the previous step. This is to ensure that all represented videos and image frames are predicted on and present in both COCO files.

python-tpl/yolov7/yolov7/detect_ptg.py \
  -i TRAIN-activity_truth.coco.json \
  -o test_det_output.coco.json \
  --model-hands ./model_files/object_detector/hands_model.pt \
  --model-objects ./model_files/object_detector/m2_det.pt \
  --model-device 0 \
  --img-size 768
# Repeat for other relevant activity truth inputs

Additional debug outputs may optionally be generated. See the -h/--help options for more details.

Generate Pose Predictions

Note that the input COCO file is that which was generated in the Generate activity classification truth COCO file section.

python-tpl/TCN_HPL/tcn_hpl/data/utils/pose_generation/generate_pose_data.py \
  -i ~/data/darpa-ptg/bbn_data/lab_data-working/m2_tourniquet-activity_truth.coco.json \
  -o ./test_pose_output.coco.json \
  --det-config ./python-tpl/TCN_HPL/tcn_hpl/data/utils/pose_generation/configs/medic_pose.yaml \
  --det-weights ./model_files/pose_estimation/pose_det_model.pth \
  --pose-config ./python-tpl/TCN_HPL/tcn_hpl/data/utils/pose_generation/configs/ViTPose_base_medic_casualty_256x192.py \
  --pose-weights ./model_files/pose_estimation/pose_model.pth
# Repeat for other relevant activity truth inputs

Configure TCN Training Experiment

Create a new version of an existing experiment configuration, or modify an existing one (the former is preferred), and set the following attributes appropriately for your experiment.

  • task_name -- This may be updated with a unique name that identifies this training experiment. This is mostly encouraged for when the paths:root_dir is shared between many experiments (see below).
  • paths:root_dir -- Update with the path to where the training "logs/" directory should go. This may be shared between training experiments, in which case customizing your "task_name" is important for separation, or set to be a unique directory per experiment.
  • data:coco_* -- Update with the appropriate paths to the COCO input files to be trained over.
  • data:target_framerate -- Update with the target framerate for input data to ensure consistent temporal spacing in the dataset content loading.
  • data:epoch_length -- Update to a length appropriate for the quantity of windows in the dataset. This may be increased further if vector augmentation is being utilized, so that more window variations are seen during a single epoch.
  • data:train_dataset:window_size -- Update with the desired window size for this experiment.
  • data:train_dataset:vectorize -- Update with the type and hyperparameters for the specific vectorizer to utilize for this experiment.
  • data:train_dataset:transform_frame_data:transforms -- Update to include any vector generalized transformations/augmentations that should be utilized during dataset iteration.
    • The transforms utilized for train, validation and testing may be customized independently. By default, the test set shares the validation set's transforms, but the hyperparameters for the test dataset can be specified manually if something different is desired.
    • Currently, the hyperparameters for the test dataset are what the ROS2 node integration utilizes.
  • model:num_classes -- Update with the total number of activity classification classes.
  • model:net:dim -- Update with the dimensionality of the feature vector the configured vectorizer class will produce.
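
As a rough aid for choosing data:epoch_length, the sketch below counts how many windows a dataset contains for a given window size. It assumes dense, stride-1 windows with no padding, which may not match how the TCN-HPL dataset actually builds windows (for example after resampling to data:target_framerate); `count_windows` is a hypothetical helper, not part of the package.

```python
from typing import Iterable


def count_windows(frames_per_video: Iterable[int], window_size: int) -> int:
    """Count dense, stride-1 windows of `window_size` frames across videos.

    Videos shorter than the window contribute no windows.
    """
    return sum(max(0, n - window_size + 1) for n in frames_per_video)


# Three videos with 100, 10 and 25 frames, window size 25:
# 76 + 0 + 1 = 77 windows in total.
print(count_windows([100, 10, 25], window_size=25))
```

A count like this gives a baseline for the epoch length; with augmentation enabled, a larger value exposes more variations of each window per epoch.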

If a "master" activity truth COCO was specifically split for a training setup, then video/frame related object detections and pose estimations may be subset from equivalently "master" prediction COCO files via a utility provided in TCN-HPL:

# Detections
kwcoco_guided_subset \
  object_detections-yolov7_baseline_20241030.coco.json \
  TRAIN-activity_truth.coco.json \
  TRAIN-object_detections.coco.json
kwcoco_guided_subset \
  object_detections-yolov7_baseline_20241030.coco.json \
  VALIDATION-activity_truth.coco.json \
  VALIDATION-object_detections.coco.json
kwcoco_guided_subset \
  object_detections-yolov7_baseline_20241030.coco.json \
  TEST-activity_truth.coco.json \
  TEST-object_detections.coco.json
# Poses
kwcoco_guided_subset \
  pose_estimations-mmpose_baseline_20241030.coco.json \
  TRAIN-activity_truth.coco.json \
  TRAIN-pose_estimations.coco.json
kwcoco_guided_subset \
  pose_estimations-mmpose_baseline_20241030.coco.json \
  VALIDATION-activity_truth.coco.json \
  VALIDATION-pose_estimations.coco.json
kwcoco_guided_subset \
  pose_estimations-mmpose_baseline_20241030.coco.json \
  TEST-activity_truth.coco.json \
  TEST-pose_estimations.coco.json
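
Conceptually, a guided subset keeps only those entries of the "master" prediction file that correspond to images in the guiding truth file. The sketch below shows this on plain COCO-style dictionaries. It is an illustration, not the TCN-HPL utility: `guided_subset` is a hypothetical function, and it assumes both files share image ids, which is plausible here because predictions were generated from the truth COCO file.

```python
from typing import Dict


def guided_subset(master: Dict, guide: Dict) -> Dict:
    """Keep the images of `master` whose ids also appear in `guide`,
    along with the annotations and videos those images reference."""
    keep_ids = {im["id"] for im in guide["images"]}
    images = [im for im in master["images"] if im["id"] in keep_ids]
    video_ids = {im.get("video_id") for im in images}
    videos = [v for v in master.get("videos", []) if v["id"] in video_ids]
    anns = [a for a in master.get("annotations", []) if a["image_id"] in keep_ids]
    return {**master, "videos": videos, "images": images, "annotations": anns}
```

Subsetting from one master prediction file avoids re-running detection or pose estimation every time the train / validation / test split changes.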

Run TCN Training

Quick-start:

train_command \
  experiment=m2/feat_locsconfs \
  paths.root_dir="$PWD" \
  task_name=my_m2_training

The Global Step Predictor (GSP)

How the GSP relates to the TCN Activity Classifier

The TCN activity classifier above, in its current configuration, takes in a second or two of video artifacts and outputs a confidence for each activity in a vector of activities (examples: "labels" in config/activity_labels/medical), assuming at most one activity is occurring "presently." For the "Locs&Confs" version, those artifacts are pose joint pixel coordinates and confidences, the user's hand detection locations and confidences, and pixel coordinates and confidences for other procedure-relevant objects.

Now, the GSP takes as input the confidence vector per frame window, and keeps track over time of which activities ("steps" in the GSP context) have occurred, and which step a user is on.

Basically, if the "next step" at any point has been activated long enough, and with enough confidence, the GSP progresses to that step as the latest "completed" step.

Assumptions:

  • One activity or "background" (none of the listed activities) happens at a time.
  • The activities must happen in a specific linear order.
  • So, if an activity is detected with strong confidence way out of order (e.g. we're on step 3 and we detect step 8), the GSP does not mark step 8 as completed.
  • A single "skipped step" is possible given some criteria. Skipping one step can be allowed unconditionally. Skipping one step can also be allowed if the "skipped step" has been activated with some "easier" criteria (a lower confidence threshold and/or fewer frames above that threshold). We can also configure to skip one step simply given that a threshold number of frames have passed since we completed the most recent step.
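
The core progression rule can be sketched in a few lines. This is a simplified illustration, not the GSP implementation: `track_steps`, `threshold`, and `min_frames` are hypothetical names, there is no background class, column `k` of each confidence vector is assumed to correspond to step `k`, and step skipping is left out.

```python
from typing import List, Sequence


def track_steps(
    confidences: Sequence[Sequence[float]],
    threshold: float,
    min_frames: int,
) -> List[int]:
    """Return the latest completed step after each frame window.

    `confidences[t][k]` is the classifier confidence for step `k` at
    window `t`. A value of -1 means no step has been completed yet.
    """
    completed = -1
    streak = 0  # consecutive windows in which the next step was confident
    history = []
    for conf in confidences:
        nxt = completed + 1
        if nxt < len(conf) and conf[nxt] >= threshold:
            streak += 1
        else:
            streak = 0
        if streak >= min_frames:
            completed = nxt
            streak = 0
        history.append(completed)
    return history


windows = [
    [0.9, 0.1, 0.0],
    [0.9, 0.1, 0.0],
    [0.1, 0.1, 0.95],  # step 2 fires out of order and is ignored
    [0.0, 0.9, 0.0],
    [0.0, 0.9, 0.0],
]
print(track_steps(windows, threshold=0.8, min_frames=2))  # [-1, 0, 0, 0, 1]
```

Only the next expected step is ever checked, which is why the strong out-of-order detection in the third window does not advance the tracker.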

Training the GSP

To "train" the GSP, we simply compute the average true positive output score per class; that is, the average confidence the TCN assigns to each activity in its output vector, taken only over frames where ground truth states that activity is happening. This includes the background class.

To do this, we must run inference on videos for which the TCN has never seen ground truth (and are hopefully quite independent from the training videos). The validation or test splits of your dataset may suffice. Note: If you have run training, test set prediction outputs should have been produced, in a file named tcn_activity_predictions.kwcoco.json.

If you don't have that file, the TCN's training harness can be run with train=false to only run inference and save the test data's output in the needed KWCOCO output format. Example:

# See the TCN docs above for the training data structure expected under paths.root_dir
train_command \
    experiment=r18/feat_locsconfs \
    paths.root_dir=/path/to/my/data/splits/ \
    task_name=r18_my_TCN_i_just_trained \
    train=false \
    ckpt_path=model_files/activity_classifier/r18_tcn.ckpt

This should create a new predictions output file, e.g. tcn_activity_predictions.kwcoco.json.

Then, for each class, we keep only the frames where the ground truth indicates that class's activity is occurring, and average the TCN's output confidence for that activity over those frames.
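
The averaging itself can be sketched as follows. This is an illustration of the computation described above, not the repository's script: `average_tp_activations` is a hypothetical function, and it assumes per-frame truth labels are integer class indices aligned with the rows of the confidence matrix.

```python
from typing import List, Sequence


def average_tp_activations(
    truth: Sequence[int],
    confidences: Sequence[Sequence[float]],
    num_classes: int,
) -> List[float]:
    """Average, per class, the confidence given to the true class.

    Classes that never occur in `truth` get NaN.
    """
    sums = [0.0] * num_classes
    counts = [0] * num_classes
    for label, conf in zip(truth, confidences):
        sums[label] += conf[label]
        counts[label] += 1
    return [s / c if c else float("nan") for s, c in zip(sums, counts)]


truth = [0, 0, 1, 1]
confs = [[0.5, 0.5], [0.75, 0.25], [0.0, 1.0], [0.5, 0.5]]
print(average_tp_activations(truth, confs, num_classes=2))  # [0.625, 0.75]
```

A class with a low average here is one the TCN rarely predicts confidently even when it is truly occurring, so a single fixed threshold across all classes would be a poor fit for it.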

Given the test split ground truth that you provided as input to train_command, and the predictions output file that train_command produced, create the average TP activation NumPy file as follows:

python angel_system/global_step_prediction/run_expirement.py r18 \
     path/to/TEST-activity_truth.coco.json \
     path/to/tcn_activity_predictions.kwcoco.json \
     path/to/tcn_activity_predictions.kwcoco.json

That NumPy file can then be provisioned at the default GSP model_files filepath, e.g. in the case of the R18 task, model_files/task_monitor/global_step_predictor_act_avgs_r18.npy.

Docker local testing

To start the service, run:

./angel-workspace-shell.sh
tmuxinator start demos/medical/Kitware-R18

This will execute the YAML file defined at /angel_system/ros/demos/medical/Kitware-R18.yaml. To use pre-recorded data, make sure the sensor_input line points to a recorded ROS bag, for example:

- sensor_input: ros2 bag play ${ANGEL_WORKSPACE_DIR}/ros_bags/rosbag2_2024_04_02-19_57_59-R18_naked_mannequin_no_blue_tarp-timestamp_unconfident/rosbag2_2024_04_02-19_57_59_0.db3

To end the service, run:

tmuxinator stop demos/medical/Kitware-R18

After any modification to the code, stop the service and rebuild the workspace:

tmuxinator stop demos/medical/Kitware-R18
./workspace_build.sh; source install/setup.sh

Docker real-time

This step requires a user account on the BBN systems in order to log in to the Kitware machine. Once that is set up:

ssh <username>@kitware-ptg-magic.eln.bbn.com
cd angel/angel_system
git pull
git submodule update --init --recursive
./angel-workspace-shell.sh
./workspace_build.sh; source install/setup.sh
tmuxinator start demos/medical/BBN-integrate-Kitware-R18