Subscribe to Grand Challenge News Feed




 
 

 

Sample Data

We have had a number of requests regarding more details on the dataset that will be made available to semifinalists: below, we present a description of the full dataset and a few sample items.

EMBASE / MEDLINE
The biomedical platform EMBASE.com, including the EMBASE and MEDLINE databases, will be available for the semifinalists.

XML
We will make available a collection of all the full text of two years (2006 – 2007) of all of our life science journals as ELsevier XML records – the DTD can be found at http://www.sciencedirect.info/implementing/implementing_sdos/dtds/.  (approximately 300,000 articles) and for three journals, we will make the full text available all the way to Volume 1, Issue 1 – in total, we expect to make approximately 500,000 articles available in XML full text.

Scopus Records
For a different collection of articles, we will make available XML records as used in Scopus – a sample will be made available as soon as possible. For now, we include a copy of a Scopus html page, to provide a sample of the type of information available.

PDFs
For a selection of full-text journals, we will also make the pdfs (including images) available – see the samples below. This dataset can also be downloaded as a zip file.

Title + Author names

DOI URL for full text

XML

Scopus Record (in html)

PDF

Video on the Internet: An introduction to the digital encoding, compression, and transmission of moving image data, Boudier, T., Shotton, D.M.

http://dx.doi.org/10.1006/jsbi.1999.4097

-

shotton_structure.htm

shotton_structure_sdarticle.pdf

Retrieval effectiveness of an ontology-based model for information selection, Khan, L. McLeod, D., Hovy, E.

http://dx.doi.org/10.1007/s00778-003-0105-1

-

hovy_ontology.htm

-

Aggregation of bioinformatics data using Semantic Web technology, Susie Stephens, David LaVigna, Mike DiLascio, Joanne Luciano

http://dx.doi.org/10.1016/j.websem.2006.05.004

 

stephens.htm

stephens_sdarticle.pdf

A text-mining perspective on the requirements for electronically annotated abstracts, Leitner, F. Valencia, A.

http://dx.doi.org/10.1016/j.febslet.2008.02.072

valencia_febs.xml

valencia_febs.htm

valencia_febs_sdarticle.pdf

Are promyelocytic leukaemia protein nuclear bodies a scaffold for caspase-2 programmed cell death? Sanchez-Pulido, L., Valencia, A., Rojas, A.M.

http://dx.doi.org/10.1016/j.tibs.2007.08.001

valencia_celldeath.xml

valencia_celldeath.htm

valencia_celldeath_sdarticle.pdf

The Hippo Pathway Regulates the bantam microRNA to Control Cell Proliferation and Apoptosis in Drosophila, Thompson, B.J., Cohen, S.M.

http://dx.doi.org/10.1016/j.cell.2006.07.013

thompson.xml

thompson.htm

thompson_sdarticle.pdf

A Genetic Screen Implicates miRNA-372 and miRNA-373 As Oncogenes in Testicular Germ Cell Tumors, P. M. Voorhoeve, C.M. Schrier, A. J.M. Gillis, H. Stoop, R. Nagel, Y.-P. Liu, J. van Duijse, J. Drost, A. Griekspoor, et. al

http://dx.doi.org/10.1016/j.cell.2006.02.037

voorhoeve.xml

voorhoeve.htm

voorhoeve_sdarticle.pdf

.

Copyright © Elsevier 2008