About GROTOAP2-affiliations
===========================

GROTOAP2-affiliations (GROund Truth for Open Access Publications) is a dataset
for training and evaluating affiliation parsers. GROTOAP2 was built
automatically from PubMed Central Open Access Subset resources. It contains
8267 parsed affiliations, storing both the raw affiliation strings and the
fragments marked as institution, address and country.

The dataset is available under the CC-BY license.

The GROTOAP2-affiliations dataset can be downloaded from:
http://cermine.ceon.pl/grotoap2/affiliations/

Authors
=======

Dominika Tkaczyk
Bartosz Tarnawski

The content of GROTOAP2-affiliations
====================================

GROTOAP2-affiliations consists of:

* 8267 parsed affiliations in XML format,
* the corresponding raw affiliation strings, in the same order.

Warsaw, 30 January 2015
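
As an illustration of the layout described above, the following minimal Python
sketch pairs each parsed affiliation with its raw string by position. It is not
taken from the dataset: the element names (`affs`, `aff`, `institution`,
`addr-line`, `country`), the one-string-per-line raw format and the sample
records are all assumptions made for the example, so check them against the
downloaded files before relying on them.

```python
import xml.etree.ElementTree as ET

# Made-up sample data. The tag names and the one-raw-string-per-line layout
# are assumptions for illustration, not the confirmed dataset schema.
SAMPLE_XML = """<affs>
<aff><institution>Example University</institution>, <addr-line>1 Sample Street, Warsaw</addr-line>, <country>Poland</country></aff>
<aff><institution>Institute of Examples</institution>, <addr-line>Berlin</addr-line>, <country>Germany</country></aff>
</affs>"""

SAMPLE_RAW = """Example University, 1 Sample Street, Warsaw, Poland
Institute of Examples, Berlin, Germany"""

# Fragment types named in the README: institution, address and country.
FRAGMENT_TAGS = ("institution", "addr-line", "country")


def load_pairs(xml_text, raw_text):
    """Pair each parsed affiliation with its raw string, relying on order."""
    affs = ET.fromstring(xml_text).iter("aff")
    raws = [line for line in raw_text.splitlines() if line.strip()]
    pairs = []
    for aff, raw in zip(affs, raws):
        # Collect the text of every marked fragment, grouped by tag name.
        fragments = {
            tag: [(el.text or "").strip() for el in aff.iter(tag)]
            for tag in FRAGMENT_TAGS
        }
        pairs.append((raw, fragments))
    return pairs


pairs = load_pairs(SAMPLE_XML, SAMPLE_RAW)
for raw, fragments in pairs:
    print(raw, fragments)
```

Because the two parts of the dataset are aligned only by order, it is worth
checking that both contain the same number of entries before pairing them.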