
PubMed classification 202505

Version 8 2025-10-02, 09:36
Version 7 2024-02-13, 12:54
Version 6 2023-06-12, 11:19
Version 5 2023-02-21, 07:14
Version 4 2022-09-16, 09:27
Version 3 2022-04-07, 09:09
Version 2 2021-11-25, 13:36
Version 1 2021-10-14, 09:34
dataset
posted on 2025-10-02, 09:36 authored by Peter Sjögårde
<p dir="ltr">The classification contains about 21 million PubMed publications from 1995 onward. It was created by clustering publications in a citation network.</p>
<p dir="ltr">The January 2024 update is a completely new version of the classification, based on new clustering and labeling.</p>
<h2>File description</h2>
<p dir="ltr">PMID_cluster_relation_[date].csv contains the relation between PMIDs and clusters. Four levels are included:</p>
<ul><li>Level 1 - Topics - Most granular</li><li>Level 2 - Specialties</li><li>Level 3 - Disciplines</li><li>Level 4 - Discipline groups - Coarsest</li></ul>
<p dir="ltr"><u>Labels</u></p>
<p dir="ltr">For each level there is a table with labels (e.g. labels_lev1_[date].csv), linked to the clusters by an id (e.g. lev1_cluster_id).</p>
<p dir="ltr"><u>Stats</u></p>
<p dir="ltr">For each level there is a table with statistics (e.g. lev1_stats). The table includes the columns below. For more information about the "Clinical", "Human", "Animal" and "Molecular/Cellular Biology" categories, see <a href="https://nih.figshare.com/collections/iCite_Database_Snapshots_NIH_Open_Citation_Collection_/4586573" target="_blank">https://nih.figshare.com/collections/iCite_Database_Snapshots_NIH_Open_Citation_Collection_/4586573</a></p>
<ul><li>p - The number of publications in the cluster in the initial clustering.</li><li>pct_clinical - The proportion of clinical articles.</li><li>sum_clinical - The number of clinical articles.</li><li>pct_human - The average of the fractions of MeSH terms that are in the "Human" category.</li><li>sum_human - The sum of the fractions of MeSH terms that are in the "Human" category.</li><li>pct_animal - The average of the fractions of MeSH terms that are in the "Animal" category.</li><li>sum_animal - The sum of the fractions of MeSH terms that are in the "Animal" category.</li><li>pct_molecular_cellular - The average of the fractions of MeSH terms that are in the "Molecular/Cellular Biology" category.</li><li>sum_molecular_cellular - The sum of the fractions of MeSH terms that are in the "Molecular/Cellular Biology" category.</li></ul>
<p dir="ltr"><u>Visualizations</u></p>
<ul><li><a href="https://petersjogarde.github.io/pm_classification/2025/research_areas/index.html" rel="noreferrer" target="_blank">Base map of PubMed 2015-2024</a></li><li><a href="https://petersjogarde.github.io/pm_classification/2025/research_areas_links/index.html" rel="noreferrer" target="_blank">Map of PubMed 2023 - Including hyperlinks to publications</a></li><li><a href="https://petersjogarde.github.io/pm_classification/2025/clinical/index.html" rel="noreferrer" target="_blank">Map colored by % Clinical</a></li><li><a href="https://petersjogarde.github.io/pm_classification/2025/human/index.html" rel="noreferrer" target="_blank">Map colored by % Human</a></li><li><a href="https://petersjogarde.github.io/pm_classification/2025/animal/index.html" rel="noreferrer" target="_blank">Map colored by % Animal</a></li><li><a href="https://petersjogarde.github.io/pm_classification/2025/cell/index.html" rel="noreferrer" target="_blank">Map colored by % Molecular/Cellular</a></li></ul>
<p dir="ltr"><a href="https://figshare.com/collections/PubMed_Classification/5610971" target="_blank">See the figshare collection for further description.</a></p>
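<p dir="ltr">As a rough illustration of how the files described above fit together, the Python sketch below joins a PMID-to-cluster relation with a label table on the shared cluster id, and shows how a sum_* / pct_* pair relates to per-publication fractions. It is a minimal sketch on toy in-memory data, not the author's pipeline. Only lev1_cluster_id is named in the description; the column names pmid and label, the sample rows, and the helper functions are assumptions made for illustration. Whether the real pct_* columns are scaled 0-1 or 0-100 is not stated, so the sketch simply uses a plain average.</p>

```python
import csv
import io

# Toy stand-ins for PMID_cluster_relation_[date].csv and labels_lev1_[date].csv.
# ASSUMPTION: the column names "pmid" and "label" are hypothetical; only
# "lev1_cluster_id" is mentioned in the dataset description.
RELATION_CSV = """pmid,lev1_cluster_id
111,1
222,1
333,2
"""

LABELS_CSV = """lev1_cluster_id,label
1,hypothetical topic A
2,hypothetical topic B
"""


def read_rows(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


def label_publications(relation_rows, label_rows, id_col="lev1_cluster_id"):
    """Attach a cluster label to each PMID via the shared cluster id."""
    labels = {row[id_col]: row["label"] for row in label_rows}
    return [
        {"pmid": row["pmid"], id_col: row[id_col], "label": labels.get(row[id_col])}
        for row in relation_rows
    ]


def fraction_stats(fractions):
    """Return (sum, average) of per-publication fractions for one cluster."""
    total = sum(fractions)
    average = total / len(fractions) if fractions else 0.0
    return total, average


labeled = label_publications(read_rows(RELATION_CSV), read_rows(LABELS_CSV))

# Made-up "Human" fractions for three publications in one cluster.
sum_human, pct_human = fraction_stats([1.0, 0.5, 0.0])
```

<p dir="ltr">With the real files, the same join could be repeated for levels 2 to 4 by changing the id column, after checking the actual header names in each CSV.</p>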
