To overcome these challenges, we have assembled a real-world dataset for classifying both ICH and normal brain CT images. This dataset also supports the classification of three ICH types based on hemorrhage location: Deep, Subcortical, and Lobar.
Alongside the dataset, we introduce a novel neural network architecture, the Dual-Task Vision Transformer (DTViT), designed for automated ICH image classification and diagnosis. The DTViT leverages the Vision Transformer (ViT) encoder and its attention mechanisms to extract features from CT images. Our framework integrates two multilayer perceptron (MLP)-based decoders to concurrently detect the presence of ICH and classify the three hemorrhage types.