Taverna, Reloaded

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6187)

Abstract

The Taverna workflow management system is an open-source project with a history of widespread adoption within multiple experimental science communities, and a long-term ambition of effectively supporting the evolving needs of those communities for complex, data-intensive, service-based experimental pipelines. This short paper describes how the recently overhauled technical architecture of Taverna addresses issues of efficiency, scalability, and extensibility, and presents performance results based on a collection of synthetic workflows, as well as a concrete case study involving a production workflow in the area of cancer research.

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Missier, P. et al. (2010). Taverna, Reloaded. In: Gertz, M., Ludäscher, B. (eds) Scientific and Statistical Database Management. SSDBM 2010. Lecture Notes in Computer Science, vol 6187. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13818-8_33

  • DOI: https://doi.org/10.1007/978-3-642-13818-8_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-13817-1

  • Online ISBN: 978-3-642-13818-8

  • eBook Packages: Computer Science, Computer Science (R0)
