figshare
Browse
1/1
3 files

Subject Category Classification

Version 2 2020-10-26, 20:22
Version 1 2020-08-28, 13:35
dataset
posted on 2020-10-26, 20:22 authored by Bharath KandimallaBharath Kandimalla, Jian WuJian Wu, Shaurya Rohatgi
The dataset contains the following files.

* y_test.pkl: Test data set labels.

* y_pred_attention_20.pkl: predicted labels predicted by the attention layer (biGru + FastText + attention) after 30 iterations .

* 2020-kandimalla-citeseerx-subject-areas: classification results of 1 million citeseerx papers.

People can validate the accuracy and micro-F1 by using classification_report, confusion_matrix from sklearn framework.

Funding

National Science Foundation

History