awesome-scholarly-data-analysis
github.com/napsternxg/awesome-scholarly-data-analysis ↗A curated collection of resources on scholarly data analysis ranging from datasets, papers, and code about bibliometrics, citation analysis, and other scholarly commons resources.
202
GitHub Stars
433
Curated Resources
28
Categories
3 hours ago
Last Refreshed
Publication and CitationPeer ReviewGrants and FundingAcademic GenealogyAuthor ProfilesAuthor name disambiguationThesis datasetsInformation Extraction and NLPNetworksTaxonomies and Ontologies of Research ConceptsAffiliationsAltmetrics and DimensionsUser interface to publication datasets and analysisTools for collecting open access papersTools for classifying research papersVisualizationsLanguage Processing and Information ExtractionCitation and metadata extractionPublication and Publisher InfoAuthor Name DisambiguationJournalsConferencesWorkshopsSummer SchoolsCoursesAssociations & CommunityResearch GroupsBlogs
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me author profiles resources from awesome-scholarly-data-analysis"
Installation instructions →What's inside
Author Profiles
- 100,000 top-scientists that provides standardized information on citations, h-index, co-authorship adjusted hm-index, citations to papers in different authorship positions and a composite indicator
- Author name gender and ethnicity dataset based on PubMed
- Author Profiles of scholarly authors in Wikipedia
- Canadian PhD career survey
- Career long various citation metrics for 100,000 top-scientists
- Career Transitions of CS students
Associations & Community
Publication and Citation
User interface to publication datasets and analysis
Information Extraction and NLP
- Academic PhraseBank
- ACA Wiki - Paper summaries of more than 1600 papers
Paper summaries of more than 1600 papers
- ACL Anthology Corpus - Full Text
Full Text
- ACL Anthology human summaries for 1000 papers
- ACL RD TEC 2.0
- ACM data affiliations
Academic Genealogy
- Academic Tree - Cross discipline academic genealogies
Cross discipline academic genealogies
- A dataset of mentorship in science with semantic and demographic estimations - Used in The academic Great Gatsby Curve paper
Used in The academic Great Gatsby Curve paper
- Chemistry Genealogy - curated at UIUC
curated at UIUC
- Economic Geneology
Taxonomies and Ontologies of Research Concepts
- AckExtract: Acknowledgement and its name entities extraction from scholarly papers
- ACM Computing Classification System
- Australian and New Zealand Standard Research Classification (ANZSRC)
- CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation
- Computer Science Ontology
- CrossRef Open Funder's Registry
Peer Review
- ACL-18 Numerical Peer Review Dataset
- APE: Argument Pair Extraction - Annotated ICLR 2013-2020 review-rebuttal argument pair
Annotated ICLR 2013-2020 review-rebuttal argument pair
- Argument Mining Driven Analysis of Peer-Reviews Dataset
- Argument Mining for Understanding Peer Reviews
- CiteTracked: A Longitudinal Dataset of Peer Reviews and Citations - Contact Author
Contact Author
- eLife Open Peer Review Corpus
Showing a sample of 433 resources. View the full list on GitHub →