skip to main content
10.1145/1966895.1966900acmotherconferencesArticle/Chapter ViewAbstractPublication PagesadConference Proceedingsconference-collections
research-article

An overview of the HDF5 technology suite and its applications

Published:25 March 2011Publication History

ABSTRACT

In this paper, we give an overview of the HDF5 technology suite and some of its applications. We discuss the HDF5 data model, the HDF5 software architecture and some of its performance enhancing capabilities.

References

  1. http://www.hdfgroup.org/HDF5/doc/index.htmlGoogle ScholarGoogle Scholar
  2. HDF5 User's Guide http://www.hdfgroup.org/HDF5/doc/UG/index.html.Google ScholarGoogle Scholar
  3. HDF5 Reference Manual http://www.hdfgroup.org/HDF5/doc/RM/RM_H5Front.html.Google ScholarGoogle Scholar
  4. Koziol, Q. 2011. HDF5 Encyclopedia of Parallel Computing. To appear.Google ScholarGoogle Scholar
  5. Date, C. J. and Darwen, H. 1998. Foundation for Object/Relational Databases -- The Third Manifesto. Addison Wesley. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Thomsen, E. 2002. OLAP Solutions: Building Multidimensional Information Systems. Second Edition. Wiley. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Brown, P. 2001. Object-Relational Database Development. Informix Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. HDF5 File Format Specification Version 2.0 http://www.hdfgroup.org/HDF5/doc/H5.format.htmlGoogle ScholarGoogle Scholar
  9. Performance Analysis and Issues http://www.hdfgroup.org/HDF5/doc/H5.user/Performance.htmlGoogle ScholarGoogle Scholar
  10. Universal File Interface (UFI) http://www.barrodale.com/bcs/universal-file-interface-ufiGoogle ScholarGoogle Scholar
  11. HDF5 Tools http://www.hdfgroup.org/HDF5/doc/RM/Tools.htmlGoogle ScholarGoogle Scholar
  12. HDFView http://www.hdfgroup.org/hdf-java-html/hdfview/Google ScholarGoogle Scholar
  13. MathWorks http://www.mathworks.com/Google ScholarGoogle Scholar
  14. Mathematica http://www.wolfram.com/Google ScholarGoogle Scholar
  15. VisIt https://wci.llnl.gov/codes/visit/Google ScholarGoogle Scholar
  16. EnSight http://www.ensight.com/Google ScholarGoogle Scholar
  17. HDF5 High-level APIs http://www.hdfgroup.org/HDF5/doc/HL/Google ScholarGoogle Scholar
  18. Gosink, L. et al. 2005. HDF5-FastQuery: Accelerating Complex Queries on HDF5 Datasets using Fast Bitmap Indices. http://crd.lbl.gov/~kewu/ps/LBNL-59602.pdfGoogle ScholarGoogle Scholar
  19. Mainzer, J. and Koziol, Q. 2010. RFC: High-Level HDF5 API routines for HPC Applications. http://www.hdfgroup.uiuc.edu/RFC/HDF5/HPC-High-Level-API/H5HPC_RFC-2010-09-28.pdfGoogle ScholarGoogle Scholar
  20. Folk, M. and Heber, G. and Koziol, Q. 2011. HDF5 Information Set. To appear.Google ScholarGoogle Scholar
  21. W3C. 2010. XQuery 3.0: An XML Query Language. http://www.w3.org/TR/xquery-30/Google ScholarGoogle Scholar
  22. Nam, B. and Sussman, A. 2003. Improving Access to Multidimensional Self-describing Scientific Datasets. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.78.2998&rep=rep1&type=pdf Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Altet, F. and Vilata, I. 2007. OPSI: The indexing system of PyTables 2 Professional Edition. http://www.pytables.org/docs/OPSI-indexes.pdfGoogle ScholarGoogle Scholar
  24. Chan, C-Y. and Ioannidis, Y. E. 1998. Bitmap Index Design and Evaluation. SIGMOD. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.87.1270&rep=rep1&type=pdf Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Lahdenmaeki, T. and Leach, M. 2004. Relational Database Index Design and the Optimizers. John Wiley & Sons, Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Alekseyenko, A. V. and Lee, C. J. 2007. Nested Containment List (NCList): a new algorithm for accelerating interval query of genome alignment and interval databases. Bioinformatics Vol. 23, No 11, pp. 1386--1393. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. BioHDF http://www.biohdf.org/Google ScholarGoogle Scholar
  28. HDF5 Image and Palette Specification. 2007. http://www.hdfgroup.org/HDF5/doc/HL/RM_H5IM.htmlGoogle ScholarGoogle Scholar
  29. HDF5 Table Specification. 2002. http://www.hdfgroup.org/HDF5/doc/HL/H5TB_Spec.htmlGoogle ScholarGoogle Scholar
  30. HDF5 Dimension Scale Specification and Design Notes. 2005. http://www.hdfgroup.org/HDF5/doc/HL/H5DS_Spec.pdfGoogle ScholarGoogle Scholar
  31. Howison, M. et al. 2010. Tuning HDF5 for Lustre File Systems. https://secure.nersc.gov/projects/presentations/HDF5_DonofrioNERSC.pdfGoogle ScholarGoogle Scholar
  32. Introduction to the HDF5 Packet Table API. 2005. http://www.hdfgroup.org/HDF5/doc/HL/H5PT_Intro.htmlGoogle ScholarGoogle Scholar
  33. Informix DataBlades. 2010. http://www-01.ibm.com/software/data/informix/blades/Google ScholarGoogle Scholar
  34. Data Cartridge -- Oracle Wiki. 2010. http://wiki.oracle.com/page/Data+CartridgeGoogle ScholarGoogle Scholar
  35. Message Passing Interface. 2010. http://www.mcs.anl.gov/research/projects/mpi/Google ScholarGoogle Scholar
  36. NASA's Earth Observing System. 2010. http://eospso.gsfc.nasa.gov/Google ScholarGoogle Scholar
  37. The ECS SDP Toolkit Home Page. 2010. http://newsroom.gsfc.nasa.gov/sdptoolkit/TKDocuments.htmlGoogle ScholarGoogle Scholar
  38. NESDIS Satellite Information. 2010. http://www.nesdis.noaa.gov/SatInformation.htmlGoogle ScholarGoogle Scholar
  39. NetCDF (network Common Data Form). 2010. http://www.unidata.ucar.edu/software/netcdf/Google ScholarGoogle Scholar
  40. SciDB. Overview of SciDB, Large Scale Array Storage, Processing and Analysis. SIGMOD' 10. http://www.scidb.org/download/sigmod691-brown.pdfGoogle ScholarGoogle Scholar
  41. Kuehn, J. A. 1996. Faster Libraries for Creating Network-Portable Self-Describing Datasets. Cray User GroupGoogle ScholarGoogle Scholar
  42. XML Linking Language (XLink) Version 1.1. 2010. http://www.w3.org/TR/xlink11/Google ScholarGoogle Scholar
  43. Standardizing the Next Generation of Bioinformatics Software Development with BioHDF. BioHDF BoF SC09. 2009.Google ScholarGoogle Scholar
  44. Halpin, T. 2001. Information Modeling and Relational Databases. Morgan Kaufmann Publishers. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. The NetCDF-4 Data Model. 2010.Google ScholarGoogle Scholar
  46. The "Classic" NetCDF Data Model. 2010.Google ScholarGoogle Scholar
  47. Barrodale Computing Services. 2010. http://www.barrodale.com/Google ScholarGoogle Scholar
  48. Aura OMI NO2 Level 3 Global (0.25 deg Grids) Data Product-OMNO2e Version 003. http://disc.sci.gsfc.nasa.gov/Aura/data-holdings/OMI/omno2e_v003.shtmlGoogle ScholarGoogle Scholar
  49. NASA. OMI Data Products and Data Access. 2010. http://disc.sci.gsfc.nasa.gov/Aura/overview/data-holdings/OMI/index.shtmlGoogle ScholarGoogle Scholar
  50. NASA. Aura. 2010. http://disc.sci.gsfc.nasa.gov/AuraGoogle ScholarGoogle Scholar
  51. Ramapriyan, H. K and Moses, J. 2011. NASA's Earth Science Data Systems -- Lessons Learned and Future Directions. To appear.Google ScholarGoogle Scholar
  52. HDF OPeNDAP Project. 2010. http://www.hdfgroup.org/projects/opendap/Google ScholarGoogle Scholar
  53. OPeNDAP: Open-source Project for a Network Data Access Protocol. 2010. http://www.opendap.org/Google ScholarGoogle Scholar
  54. Yang, M. and Lee H. 2009. Using a Friendly OPeNDAP Client Library to access HDF5 data. The 89th AMS annual meeting.Google ScholarGoogle Scholar
  55. Gallagher, J. et al. 2007. The Data Access Protocol -- DAP 2.0. ESE-RFC-004.1.1Google ScholarGoogle Scholar
  56. Kent, W. J., Zweig, A. S., Barber, G., Hinrichs, A. S. and Karolchik, D. 2010. BigWig and BigBed: enabling browsing of large distributed datasets. Bioinformatics, 26, 2204--2207. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Li, H. 2011. Tabix: fast retrieval of sequence features from generic TAB-delimited files. Bioinformatics, 27, 718--719. Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G. and Durbin, R. 2009. The Sequence Alignment/Map format and SAMtools. Bioinformatics, 25, 2078--2079. Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Mason, C. E., Zumbo, P., Sanders, S., Folk, M., Robinson, D., Aydt, R., Gollery, M., Welsh, M., Olson, N. E. and Smith, T. M. 2010. Standardizing the next generation of bioinformatics software development with BioHDF (HDF5). Adv Exp Med Biol, 680, 693--700.Google ScholarGoogle ScholarCross RefCross Ref
  60. Shendure, J. and Ji, H. (2008) Next-generation DNA sequencing. Nat Biotechnol, 26, 1135--1145.Google ScholarGoogle ScholarCross RefCross Ref
  61. Making Science Data Easier to Use with OPeNDAP. 2010. http://wiki.esipfed.org/index.php/Making_Science_Data_Easier_to_Use_with_OPeNDAP#EOS_HDF_Data_Readability_.28without_OPeNDAP.29Google ScholarGoogle Scholar
  62. Open Data Protocol (OData). 2010. http://www.odata.org/Google ScholarGoogle Scholar
  63. Kahn, S. D. 2011. On the future of genomic data, Science, 331, 728--729.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. An overview of the HDF5 technology suite and its applications

                  Recommendations

                  Comments

                  Login options

                  Check if you have access through your login credentials or your institution to get full access on this article.

                  Sign in
                  • Published in

                    cover image ACM Other conferences
                    AD '11: Proceedings of the EDBT/ICDT 2011 Workshop on Array Databases
                    March 2011
                    53 pages
                    ISBN:9781450306140
                    DOI:10.1145/1966895

                    Copyright © 2011 ACM

                    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                    Publisher

                    Association for Computing Machinery

                    New York, NY, United States

                    Publication History

                    • Published: 25 March 2011

                    Permissions

                    Request permissions about this article.

                    Request Permissions

                    Check for updates

                    Qualifiers

                    • research-article

                  PDF Format

                  View or Download as a PDF file.

                  PDF

                  eReader

                  View online with eReader.

                  eReader