figshare
Browse

Git labeled dataset of image diagrams

Download (797.26 MB)
Version 2 2022-09-13, 09:34
Version 1 2022-07-29, 07:14
dataset
posted on 2022-09-13, 09:34 authored by Sergio Andres Rodriguez TorresSergio Andres Rodriguez Torres

Datset of  3'960.877 images build from GitHub public repositories.

This dataset contains a column product of the automatic classification process of a machine learning convolutional network, with 6 posible categories related to software diagrams.

Label        Name                        

0                None                          

1                Activity Diagram          

2                Sequence Diagram      

3                Class Diagram             

4                Component Diagram   

5                Use Case Diagram      

6                Cloud Diagram     

It also includes information on the repository from which it was extracted.


History