Assessment

There are two tracks for each of the three tasks, one track for CTA modality and one track for MRA modality. The assessment of algorithms will be the same for both CTA and MRA modalities, and with the following metrics:

Metrics for Task-1-CoW-Segmentation

Seven evaluation metrics with equal weights for the multi-class (CoW vessels) segmentation task:

Class-average Dice similarity coefficient
Centerline Dice (clDice) on merged binary mask
Class-average 0-th Betti number error
Class-average Hausdorff Distance 95% Percentile (HD95)
Average F1 score (harmonic mean of the precision and recall) for detection of the "Group 2 CoW components" (as coined in our summary pre-print):
- Acom
- Pcoms
- 3rd-A2
Variant-balanced graph classification accuracy
- Same as the metric for Task-3!
- Classify the graph by detecting the edges
Variant-balanced topology match rate (as introduced in our summary pre-print)
- Not only do the CoW components need to be correctly detected
- Topology extracted from segmentation should also satisfy strict connectivity and neighborhood conditions:
  - Correct neighbourhood connectivity (connected to correct vessel classes)
  - No 0-th Betti number errors
  - No topological mistakes in connectivity, fragmentation, or crossover
- Not trivial to get a match in our topology matching analysis
- A more advanced and stringent metric than detection (metric #5) and graph classification (metric #6)
  - as it incorporates the detection and classification performance in its evaluation

Metrics for Task-2-CoW-ObjDet

For the CoW object detection task, we ask your algorithm to only provide one box per image, which is then compared with our ground-truth box on two related metrics:

Boundary intersection over union (Boundary IoU)

Please refer to this entry from "Metrics Reloaded" for more information on Boundary IoU.
IoU

Metrics for Task-3-CoW-Classification

Task 3 is to classify the CoW variant topology graph, which are represented by a graph with relevant anterior and posterior edge lists. The presence and absence of the edge of the edge list determine the topology graph class.

Your algorithm is expected to output the predicted anterior and posterior edge lists. We evaluate both lists using the metric:

Variant-balanced accuracy

Evaluation Code

Whatever we use for the challenge evaluation will be released and synchronized to the following repo in a transparent manner. Please refer to this repo for our assessment implementations.

GitHub

Please visit our GitHub repo:

👉 "TopCoW_Eval_Metrics"

for the implementations of our evaluation metric functions.

2024 version now online! 📐

Please feel free to leave an issue or let us know if you have further feedback or questions.