Contributors: Handwritten forms from 1,000 unique writers.
Resolution Quality: Images scanned at 200, 300, and 600 DPI to accommodate different research needs.
Diversity: Contributors vary by nationality, age, gender, handedness, and educational background.
Writing Styles: Includes natural, unrestricted handwriting styles.
Content Variety:
Unique Texts: 2,000 paragraphs on varied topics such as arts, education, health, nature, and technology, along with their line-segmented images.
Similar Texts: 2,000 paragraphs covering all Arabic characters and shapes, each with line-segmented images.
Free Texts: Paragraphs on topics freely chosen by the writers.
Annotation: All paragraph and line images come with manually verified ground truths and Latin transliterations of Arabic texts.
Dataset Splits: The dataset is organized into training (70%), validation (15%), and testing (15%) sets.