- Local installation
- Docker installation
- Data and pretrained models
- Training
- Training on lab data
- Docker local testing with pre-recorded data
- Real-time
Follow the following steps (the optional steps are for active development purposes):
git clone [email protected]:PTG-Kitware/angel_system.git
cd angel_system
git submodule update --init --recursive
# IF YOU DON'T ALREADY HAVE PYTHON 3.8.10 AVAILABLE
conda create --name angel_systen python=3.8.10
conda activate angel_test_env
poetry install
# OR JUST
poetry install
Follow the following steps (the optional steps are for active development purposes):
git clone [email protected]:PTG-Kitware/angel_system.git
cd angel_system
git submodule update --init --recursive
./angel-docker-build.sh -f
./angel-workspace-shell.sh
./workspace_build.sh; source install/setup.sh
See our data "Where things are" document on Google Drive here.
TODO
TODO
Source data from BBN can be acquired from https://bbn.com/private/ptg-magic/. Consult a team member for login information.
Each task ("skill") has their own sub-page ("In-Lab Data" link) from which sets of data are described and referred to. Data is stored on their SFTP server, to which the "Click to Download" links refer to.
Storage of downloaded ZIP archives, and their subsequent extractions, should follow the pattern below.
bbn_data/
├── README.md # Indicate where we have acquired this BBN data.
└── lab_data-golden/
├── m2_tourniquet/
│ ├── positive/
│ │ ├── Fri-Apr-21/
│ │ │ ├── 20230420_122603_HoloLens.mp4
│ │ │ ├── 20230420_122603_HoloLens.skill_labels_by_frame.txt
│ │ │ ├── 20230420_123212_HoloLens.mp4
│ │ │ ├── 20230420_123212_HoloLens.skill_labels_by_frame.txt
│ │ │ ├── 20230420_124541_HoloLens.mp4
│ │ │ ├── 20230420_124541_HoloLens.skill_labels_by_frame.txt
│ │ │ ├── 20230420_125033_HoloLens.mp4
│ │ │ ├── 20230420_125033_HoloLens.skill_labels_by_frame.txt
│ │ │ ├── 20230420_125517_HoloLens.mp4
│ │ │ └── 20230420_125517_HoloLens.skill_labels_by_frame.txt
│ │ ├── Fri-Apr-21.zip
│ │ ├── Mon-Apr-17/
│ │ │ ...
│ │ └── Mon-Apr-17.zip
│ │ ...
│ └── negative/
│ ├── Fri-Aug-25/
│ │ ...
│ └── Fri-Aug-25.zip
├── m3_pressure_dressing/
│ ...
└── r18_chest_seal/
...
A script is provided at scripts/extract_bbn_video_archives.bash
to automate
the recursive extraction of such ZIP files.
To operate this script, change directories to be in a parent directory
under which the ZIP files to be extracted are located, and then execute the
script:
cd ${PATH_TO_DATA}/
bash ${PATH_TO_ANGEL_SYSTEM}/scripts/extract_bbn_video_archives.bash
Golden data should be marked as read-only after downloading and extracting to prevent accidental modification of the files:
chmod a-w -R bbn_data/lab_data-golden/
BBN archives provide MP4 videos, however we will need individual image frames for the following steps. The script to convert BBN Truth data into a COCO format will also, by necessity for down-stream processes, extract frames and dump them into a symmetric layout in another writable location:
python-tpl/TCN_HPL/tcn_hpl/data/utils/bbn.py ...
# OR use the console-script entrypoint installed with the package
bbn_create_truth_coco \
./bbn_data/lab_data-golden \
./bbn_data/lab_data-working
../../config/activity_labels/medical/m2.yaml \
activity-truth-COCO.json
- On gyges, raw data is located at
- Zed "Pro" data
/data/PTG/medical/bbn_data/Release_v0.5/v0.56/<task>
- Hololens Golden data:
/data/PTG/medical/bbn_data/lab_data-golden/
- Hololens Working data:
/data/PTG/medical/bbn_data/lab_data-working/
- Zed "Pro" data
- In this pipeline we are only provided with object detection ground truth
training data, which is located
/data/PTG/medical/object_anns/<task>
- For real-time execution, we store our models in
angel_system/model_files/
which are provisioned by Ansible by configuration.
/angel_system/tmux/demos/medical/M3-Kitware.yml
: defines complete ROS configuration for all nodes/angel_system/model_files/coco/r18_test_activity_preds.mscoco.json
: required to determine the step activation threshold for the GSP/angel_system/model_files/data_mapping/r18_mapping.txt
: required to define model activity step classes/angel_system/model_files/models/r18_tcn.ckpt
: TCN model checkpoint/angel_system/model_files/models/hands_model.pt
: hand detection trained model/angel_system/model_files/models/r18_det.pt
: object detection trained model
Quick-start example for Yolo v7:
python3 python-tpl/yolov7/yolov7/train.py \
--workers 8 --device 0 --batch-size 4 \
--data configs/data/PTG/medical/m2_task_objects.yaml \
--img 768 768 \
--cfg configs/model/training/PTG/medical/yolov7_m2.yaml \
--weights weights/yolov7.pt \
--project /data/PTG/medical/training/yolo_object_detector/train/ \
--name m2_all_v1_example
TODO:
We take the following steps:
- Generate activity classification truth COCO file.
- predict objects in the scene
- predict poses and patient bounding boxes in the scene
- generate interaction feature vectors for the TCN
- train the TCN
The following will use file path and value examples for the Medical M2 Tourniquet use-case.
- Generate activity classification truth COCO file
- Generate Object Predictions in the Scene
- Generate Pose Predictions
- Configure TCN Training Experiment
- Run TCN Training
Generate the truth MS-COCO file for per-frame activity truth annotations. This example presumes we are using BBN Medical data as our source (as of 2024/10/15).
python-tpl/TCN_HPL/tcn_hpl/data/utils/bbn.py \
~/data/darpa-ptg/bbn_data/lab_data-golden/m2_tourniquet \
~/data/darpa-ptg/bbn_data/lab_data-working/m2_tourniquet \
~/dev/darpa-ptg/angel_system/config/activity_labels/medical/m2.yaml \
~/data/darpa-ptg/bbn_data/lab_data-working/m2_tourniquet-activity_truth.coco.json
Train, validation, and testing splits can be split from COCO files.
The kwcoco split
tool may be utilized to create splits at the video level,
otherwise splits may be created manually.
For example:
kwcoco split \
--src ~/data/darpa-ptg/bbn_data/lab_data-working/m2_tourniquet-activity_truth.coco.json \
--dst1 TRAIN-activity_truth.coco.json \
--dst2 REMAINDER-activity_truth.coco.json \
--splitter video \
--rng 12345 \
--factor 2
kwcoco split \
--src REMAINDER-activity_truth.coco.json \
--dst1 VALIDATION-activity_truth.coco.json \
--dst2 TEST-activity_truth.coco.json \
--splitter video \
--rng 12345 \
--factor 2
# Protect your files!
chmod a-w \
TRAIN-activity_truth.coco.json \
REMAINDER-activity_truth.coco.json \
VALIDATION-activity_truth.coco.json \
TEST-activity_truth.coco.json
Note that the input COCO file is that which was generated in the previous step. This is to ensure that all represented videos and image frames are predicted on and present in both COCO files.
python-tpl/yolov7/yolov7/detect_ptg.py \
-i TRAIN-activity_truth.coco.json \
-o test_det_output.coco.json
--model-hands ./model_files/object_detector/hands_model.pt \
--model-objects ./model_files/object_detector/m2_det.pt \
--model-device 0 \
--img-size 768 \
# Repeat for other relevant activity truth inputs
Additional debug outputs may optionally be generated.
See the -h
/--help
options for more details.
Note that the input COCO file is that which was generated in the
Generate activity classification truth COCO file
section.
python-tpl/TCN_HPL/tcn_hpl/data/utils/pose_generation/generate_pose_data.py \
-i ~/data/darpa-ptg/bbn_data/lab_data-working/m2_tourniquet/activity_truth.coco.json \
-o ./test_pose_output.coco.json \
--det-config ./python-tpl/TCN_HPL/tcn_hpl/data/utils/pose_generation/configs/medic_pose.yaml \
--det-weights ./model_files/pose_estimation/pose_det_model.pth \
--pose-config ./python-tpl/TCN_HPL/tcn_hpl/data/utils/pose_generation/configs/ViTPose_base_medic_casualty_256x192.py \
--pose-weights ./model_files/pose_estimation/pose_model.pth
# Repeat for other relevant activity truth inputs
Create a new version of, or modify an existing (preferring the former) and modify attributes appropriately for your experiment.
task_name
-- This may be updated with a unique name that identifies this training experiment. This is mostly encouraged for when thepaths:root_dir
is shared between many experiments (see below).paths:root_dir
-- Update with the path to where the training "logs/" directory should go. This may be shared between training experiments, in which case customizing your "task_name" is important for separation, or set to be a unique directory per experiment.data:coco_*
-- Update with the appropriate paths to the COCO input files to be trained over.data:target_framerate
-- Update with the target framerate for input data to ensure consistent temporal spacing in the dataset content loading.data:epoch_length
-- Update to a length appropriate for the quantity of windows in the dataset. This may also be increased yet more if vector augmentation is being utilized to increase the variety of window variations seen during a single epoch.data:train_dataset:window_size
-- Update with the desired window size for this experiment.data:train_dataset:vectorize
-- Update with the type and hyperparameters for the specific vectorizer to utilize for this experiment.data:train_dataset:transform_frame_data:transforms
-- Update to include any vector generalized transformations/augmentations that should be utilized during dataset iteration.- The transforms utilized for train, validation and testing may be customized independently. By default, the test set will share the validation set's transforms, however the hyperparameters for the test dataset can of course be manually specified if something different is desired.
- Currently, the hyperparameters for the test dataset is what will be utilized by the ROS2 node integration.
model:num_classes
-- Update with the total number of activity classification classes.model:net:dim
-- Update with the dimensionality of the feature vector the configured vectorizer class will produce.
If a "master" activity truth COCO was specifically split for a training setup, then video/frame related object detections and pose estimations may be subset from equivalently "master" prediction COCO files via a utility provided in TCN-HPL:
# Detections
kwcoco_guided_subset \
object_detections-yolov7_baseline_20241030.coco.json \
TRAIN-activity_truth.coco.json \
TRAIN-object_detections.coco.json
kwcoco_guided_subset \
object_detections-yolov7_baseline_20241030.coco.json \
VALIDATION-activity_truth.coco.json \
VALIDATION-object_detections.coco.json
kwcoco_guided_subset \
object_detections-yolov7_baseline_20241030.coco.json \
TEST-activity_truth.coco.json \
TEST-object_detections.coco.json
# Poses
kwcoco_guided_subset \
pose_estimations-mmpose_baseline_20241030.coco.json \
TRAIN-activity_truth.coco.json \
TRAIN-pose_estimations.coco.json
kwcoco_guided_subset \
pose_estimations-mmpose_baseline_20241030.coco.json \
VALIDATION-activity_truth.coco.json \
VALIDATION-pose_estimations.coco.json
kwcoco_guided_subset \
pose_estimations-mmpose_baseline_20241030.coco.json \
TEST-activity_truth.coco.json \
TEST-pose_estimations.coco.json
Quick-start:
train_command \
experiment=m2/feat_locsconfs \
paths.root_dir="$PWD" \
task_name=my_m2_training
The above TCN activity classifier in its current configuration takes in a second or two of video artifacts (e.g. for the "Locs&Confs" version, pose joint pixel coordinates and confidences, the user's hand detection locations and confidences, and other procedure-relevant object pixel coordinates and confidences), and outputs confidences for each of a vector of activities (examples: "labels" in config/activity_labels/medical), assuming up to one activity is ocurring "presently."
Now, the GSP takes as input the confidence vector per frame window, and keeps track over time of which activities ("steps" in the GSP context) have occurred, and which step a user is on.
Basically, if the "next step" at any point has been activated long enough, and with enough confidence, the GSP progresses to that step as the latest "completed" step.
Assumptions:
- One activity or "background" (none of the listed activities) happens at a time.
- The activities must happen in a specific linear order.
- So, if an activity is detected with strong confidence way out of order (e.g. we're on step 3 and we detect step 8), the GSP does not mark step 8 as completed.
- A single "skipped step" is possible given some criteria. Skipping one step can be allowed unconditionally. Skipping one step can also be allowed if the "skipped step" has been activated with some "easier" criteria (a lower confidence threshold and/or fewer frames above that threshold). We can also configure to skip one step simply given that a threshold number of frames have passed since we completed the most recent step.
To "train" the GSP, we simply compute the average true positive output scores per class- that is, the average confidence of each TCN activity classification in its output vector, only when ground truth states that activity is happening. This includes the background class.
To do this, we must run inference on videos for which the TCN has never seen ground truth (and are hopefully
quite independent from the training videos). The validation or test splits of your dataset may suffice.
Note: If you have run training, test set prediction outputs should have been produced, in a
file named tcn_activity_predictions.kwcoco.json
.
If you don't have that file, he TCN's training harness can be run with train=false
to only run
inference and save the test data's output in the needed KWCOCO output format. Example:
python train_command \
experiment=r18/feat_locsconfs \
paths.root_dir=/path/to/my/data/splits/ \ # See above TCN docs for training data structure
task_name=r18_my_TCN_i_just_trained \
train=false \
ckpt_path=model_files/activity_classifier/r18_tcn.ckpt \
This should create a new predictions output file, e.g. tcn_activity_predictions.kwcoco.json
.
Then, for each class, we filter the video outputs by those which the ground truth indicates that class activity is occurring. Then we simply average the TCN output for that activity, for those frames.
Given the test dataset split ground truth you just gave as input to the train_command
, and the predictions
output file your train_command
produced, create the average TP activation numpy file as follows:
python angel_system/global_step_prediction/run_expirement.py r18 \
path/to/TEST-activity_truth.coco.json \
path/to/tcn_activity_predictions.kwcoco.json \
path/to/tcn_activity_predictions.kwcoco.json
That numpy file then can be provisioned to load to the default GSP model_files
filepath, e.g. in the case
of the R18 task, model_files/task_monitor/global_step_predictor_act_avgs_r18.npy
.
to start the service run:
./angel-workspace-shell.sh
tmuxinator start demos/medical/Kitware-R18
This will execute the yaml file defined at /angel_system/ros/demos/medical/Kitware-R18.yaml
. We can use the pre-recorded data by ensuring the sensor_input
line points to a pre-recorded data as such:
- sensor_input: ros2 bag play ${ANGEL_WORKSPACE_DIR}/ros_bags/rosbag2_2024_04_02-19_57_59-R18_naked_mannequin_no_blue_tarp-timestamp_unconfident/rosbag2_2024_04_02-19_57_59_0.db3
to end the service run:
tmuxinator stop demos/medical/Kitware-R18
Any modifications to the code would require to stop the service and re-build the workspace
tmuxinator stop demos/medical/Kitware-R18
./workspace_build.sh; source install/setup.sh
This step requires a user on the BBN systems to login to the Kitware machine. After it is set up:
ssh <username>@kitware-ptg-magic.eln.bbn.com
cd angel/angel_system
git pull
git submodule update --init --recursive
./angel-workspace-shell.sh
./workspace_build.sh; source install/setup.sh
tmuxinator start demos/medical/BBN-integrate-Kitware-R18