Datset of 3'960.877 images build from GitHub public repositories.
This dataset contains a column product of the automatic classification process of a machine learning convolutional network, with 6 posible categories related to software diagrams.
Label Name
0 None
1 Activity Diagram
2 Sequence Diagram
3 Class Diagram
4 Component Diagram
5 Use Case Diagram
6 Cloud Diagram
It also includes information on the repository from which it was extracted.