-
Notifications
You must be signed in to change notification settings - Fork 3
Description
@nicklynch said
RSC have created a linkset for a 100 K mini dataset for SureChMBL
Linkset download for 100K http://ops.rsc.org/download/RDF-2016.01.12.zipThere is properties in this but we should not load ttl file that for now as not sure of licensing of the ACD/Labs for this
Would it be possible to load this into Dev IMS so we can start some testing on this and the API calls?
I've made https://github.com/openphacts/ops-rsc-surechembl-dataset that captures this and produces the linksets only at http://repository.mygrid.org.uk/artifactory/ops/org/openphacts/data/ops-rsc-linksets/0.20151104.0-SNAPSHOT/
I've validated these with IMS, and found that they all load, except SURE_CHEMBL/LINKSET_EXACT_SURE_CHEMBL20160112.ttl, e.g.
<http://ops.rsc.org/OPS1931331> skos:exactMatch <http://rdf.ebi.ac.uk/resource/chembl/molecule/SCHEMBL170241> .
<http://ops.rsc.org/OPS1931518> skos:exactMatch <http://rdf.ebi.ac.uk/resource/chembl/molecule/SCHEMBL167446> .
<http://ops.rsc.org/OPS1931950> skos:exactMatch <http://rdf.ebi.ac.uk/resource/chembl/molecule/SCHEMBL175877> .This is the most important one as it links the SCHEMBL identifiers to the OPS identifiers - thus linking SureChembl to the rest.
The DataSources.txt in IMS needs to be modified to support the new URI pattern for SureChembl. Nothing is currently resolved from those URIs - @agaulton - are they going to remain as those URIs?