About Data
We use the dataset from https://openalex.org,
especially data about articles (scientific papers), their citations by other articles,
their references, authors, and journals. This dataset is available under CC0 1.0 Universal license at
https://openalex.s3.amazonaws.com/....
We take in all the data: 256,088,911 articles. Then we define our cluster of articles from the
Oncology field by using a custom method. Result: 18,085,165 onco-articles.
Additionally, we use the dataset from the authors of the RCR metric at
https://icite.od.nih.gov,
especially data about “etalon” articles used for calibration of the RCR method, and data to
verify our calculations. This dataset is available under CC0 1.0 Universal license at
https://nih.figshare.com/collections/iCite...
Our data license: to be defined.