Crossref journal-article
Oxford University Press (OUP)
Bioinformatics (286)
Abstract

Abstract Motivation: The world-wide community of life scientists has access to a large number of public bioinformatics databases and tools, which are developed and deployed using diverse technologies and designs. More and more of the resources offer programmatic web-service interface. However, efficient use of the resources is hampered by the lack of widely used, standard data-exchange formats for the basic, everyday bioinformatics data types. Results: BioXSD has been developed as a candidate for standard, canonical exchange format for basic bioinformatics data. BioXSD is represented by a dedicated XML Schema and defines syntax for biological sequences, sequence annotations, alignments and references to resources. We have adapted a set of web services to use BioXSD as the input and output format, and implemented a test-case workflow. This demonstrates that the approach is feasible and provides smooth interoperability. Semantics for BioXSD is provided by annotation with the EDAM ontology. We discuss in a separate section how BioXSD relates to other initiatives and approaches, including existing standards and the Semantic Web. Availability: The BioXSD 1.0 XML Schema is freely available at http://www.bioxsd.org/BioXSD-1.0.xsd under the Creative Commons BY-ND 3.0 license. The http://bioxsd.org web page offers documentation, examples of data in BioXSD format, example workflows with source codes in common programming languages, an updated list of compatible web services and tools and a repository of feature requests from the community. Contact:  matus.kalas@bccs.uib.no; developers@bioxsd.org; support@bioxsd.org

Bibliography

Kalaš, M., Puntervoll, P., Joseph, A., Bartaševičiūtė, E., Töpfer, A., Venkataraman, P., Pettifer, S., Bryne, J. C., Ison, J., Blanchet, C., Rapacki, K., & Jonassen, I. (2010). BioXSD: the common data-exchange format for everyday bioinformatics web services. Bioinformatics, 26(18), i540–i546.

Authors 12
  1. Matúš Kalaš (first)
  2. Pål Puntervoll (additional)
  3. Alexandre Joseph (additional)
  4. Edita Bartaševičiūtė (additional)
  5. Armin Töpfer (additional)
  6. Prabakar Venkataraman (additional)
  7. Steve Pettifer (additional)
  8. Jan Christian Bryne (additional)
  9. Jon Ison (additional)
  10. Christophe Blanchet (additional)
  11. Kristoffer Rapacki (additional)
  12. Inge Jonassen (additional)
References 27 Referenced 18
  1. 10.1016/S0022-2836(05)80360-2 / J. Mol. Biol. / Basic local alignment search tool by Altschul (1990)
  2. 10.1038/75556 / Nat. Genet. / Gene ontology: tool for the unification of biology. The gene ontology consortium by Ashburner (2000)
  3. 10.1145/1281700.1281707 / Proceedings of the 2007 Workshop on Experimental Computer Science. / An analysis of XML compression efficiency by Augeri (2007)
  4. 10.1093/protein/gzh013 / Protein Eng. Des. Sel. / Prediction of proprotein convertase cleavage sites by Duckert (2004)
  5. 10.1186/gb-2005-6-5-r44 / Genome Biol. / The Sequence Ontology: a tool for the unification of genome annotations by Eilbeck (2005)
  6. 10.1016/S0076-6879(96)66034-0 / Meth Enzymol. / GOR secondary structure prediction method version IV by Garnier (1996)
  7. 10.1186/1471-2105-8-312 / BMC Bioinformatics / MaxAlign: maximizing usable data in an alignment by Gouveia-Oliveira (2007)
  8. 10.1186/1471-2105-10-356 / BMC Bioinformatics / phyloXML: XML for evolutionary biology and comparative genomics by Han (2009)
  9. 10.1038/nbt926 / Nat. Biotechnol. / The HUPO PSI's Molecular Interaction format—a community standard for the representation of protein interaction data by Hermjakob (2004)
  10. 10.1093/bioinformatics/btg015 / Bioinformatics / The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models by Hucka (2003)
  11. {'key': '2023012508282163000_B11', 'article-title': 'Treating shimantic web syndrome with ontologies', 'volume-title': 'Proceedings of First Advanced Knowledge Technologies Workshop on Semantic Web Services (AKT-SWS04) KMi.', 'author': 'Hull', 'year': '2004'} / Proceedings of First Advanced Knowledge Technologies Workshop on Semantic Web Services (AKT-SWS04) KMi. / Treating shimantic web syndrome with ontologies by Hull (2004)
  12. 10.1093/nar/gkl320 / Nucleic Acids Res. / Taverna: a tool for building and running workflows of services by Hull (2006)
  13. 10.1093/nar/gkp911 / Nucleic Acids Res. / phiSITE: database of gene regulation in bacteriophages by Klucar (2010)
  14. 10.1089/omi.2008.0A10 / OMICS / A standard MIGS/MIMS compliant XML schema: toward the development of the Genomic Contextual Data Markup Language (GCDML) by Kottmann (2008)
  15. 10.1093/protein/gzh062 / Protein Eng. Des. Sel. / Analysis and prediction of leucine-rich nuclear export signals by la Cour (2004)
  16. 10.1093/bioinformatics/btp329 / Bioinformatics / An active registry for bioinformatics web services by Pettifer (2009)
  17. 10.1093/nar/gkq297 / Nucleic Acids Res. / The EMBRACE Web service collection by Pettifer (2010)
  18. 10.1186/1471-2105-8-333 / BMC Bioinformatics / Integrating sequence and structural biology with DAS by Prlić (2007)
  19. 10.1093/bioinformatics/btn528 / Bioinformatics / The Protein Feature Ontology: a tool for the unification of protein feature annotations by Reeves (2008)
  20. 10.1186/1471-2105-7-490 / BMC Bioinformatics / XML schemas for common bioinformatic data types and their application in workflow systems by Seibel (2006)
  21. 10.1186/gb-2002-3-9-research0046 / Genome Biol. / Design and implementation of microarray gene expression markup language (MAGE-ML) by Spellman (2002)
  22. 10.1038/417119a / Nature / Creating a bioinformatics nation by Stein (2002)
  23. 10.1093/bib/bbn029 / Brief. Bioinform. / Experience using web services for biological sequence analysis by Stockinger (2008)
  24. 10.1093/nar/gkp846 / Nucleic Acids Res. / The Universal Protein Resource (UniProt) in 2010 by The UniProt Consortium (2010)
  25. 10.1093/nar/22.22.4673 / Nucleic Acids Res. / CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice by Thompson (1994)
  26. 10.1093/bioinformatics/bti082 / Bioinformatics / PDBML: the representation of archival macromolecular structure data in XML by Westbrook (2005)
  27. 10.1093/bib/3.4.331 / Brief. Bioinform. / BioMOBY: an open source biological web services proposal by Wilkinson (2002)
Dates
Type When
Created 14 years, 11 months ago (Sept. 7, 2010, 1:41 p.m.)
Deposited 2 years, 6 months ago (Jan. 25, 2023, 3:28 a.m.)
Indexed 2 years, 6 months ago (Jan. 28, 2023, 7:32 a.m.)
Issued 14 years, 11 months ago (Sept. 4, 2010)
Published 14 years, 11 months ago (Sept. 4, 2010)
Published Online 14 years, 11 months ago (Sept. 4, 2010)
Published Print 14 years, 11 months ago (Sept. 15, 2010)
Funders 0

None

@article{Kala__2010, title={BioXSD: the common data-exchange format for everyday bioinformatics web services}, volume={26}, ISSN={1367-4803}, url={http://dx.doi.org/10.1093/bioinformatics/btq391}, DOI={10.1093/bioinformatics/btq391}, number={18}, journal={Bioinformatics}, publisher={Oxford University Press (OUP)}, author={Kalaš, Matúš and Puntervoll, Pål and Joseph, Alexandre and Bartaševičiūtė, Edita and Töpfer, Armin and Venkataraman, Prabakar and Pettifer, Steve and Bryne, Jan Christian and Ison, Jon and Blanchet, Christophe and Rapacki, Kristoffer and Jonassen, Inge}, year={2010}, month=sep, pages={i540–i546} }