An Introduction to OME-Zarr for Big Bioimaging Data

Authors' affiliation: UCL

This book explains the theory and practice of handling large bioimaging datasets using the OME-Zarr data format (Moore et al., 2023).

It is written for anyone working with bioimaging data that is ‘big’, meaning too big to transfer quickly, in one go, from where it is stored to where you need it. Examples include downloading the data over the internet for viewing, or loading it from your hard drive into memory for analysis.
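As a rough illustration of what ‘big’ can mean, the sketch below estimates the uncompressed size of an image volume and compares it with the memory of a typical computer and the time needed to download it. The dataset shape, the 16 GiB of memory, and the 100 Mbit/s connection are all assumed figures chosen for illustration; they do not describe any particular dataset or machine.

```python
import math

# Hypothetical dataset: a 3D volume of 2048 x 4096 x 4096 voxels,
# stored as 16-bit unsigned integers (2 bytes per voxel).
shape = (2048, 4096, 4096)
bytes_per_voxel = 2

# Uncompressed size = number of voxels x bytes per voxel.
nbytes = math.prod(shape) * bytes_per_voxel
size_gib = nbytes / 2**30

# Assumed figures, for illustration only.
ram_gib = 16                         # memory available on the computer
link_bytes_per_second = 100e6 / 8    # a 100 Mbit/s internet connection

fits_in_memory = size_gib <= ram_gib
transfer_hours = nbytes / link_bytes_per_second / 3600

print(f"Uncompressed size: {size_gib:.0f} GiB")
print(f"Fits in {ram_gib} GiB of memory: {fits_in_memory}")
print(f"Time to transfer in one go: {transfer_hours:.1f} hours")
```

Under these assumptions the volume is several times larger than the available memory and takes more than an hour to download, so in this example it counts as ‘big’ in both senses described above.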

The tools and methods in this textbook are tailored to working with big imaging data. Handling data at this scale makes the tools and methods more complex, so if you have smaller data it might be easier to use other file formats and tools. See the Introduction to Bioimage Analysis textbook for other options.

Reading this book

Pre-requisites

This textbook assumes some familiarity with images. The practical examples are written in the Python programming language, but you can still read the book without running or understanding the code. The first two chapters give theoretical background, and code is only used there to explain concepts; the later chapters explain the practice of working with OME-Zarr datasets and contain code that you could copy and adapt for your own use. The Introduction to Bioimage Analysis textbook has a good primer on images in the context of both Python and biology.

Reading the book

Every chapter is designed to be read from start to end in a linear fashion. Each chapter builds upon the previous ones, so they should be read in order. The book is not designed to provide hands-on examples; it should complement other resources that provide interactive lessons, such as the OME-Zarr lesson in the Bioimage Analysis Training Resources. Some of the code examples provided to explain concepts might be a useful starting point for writing your own data management and analysis code.

Running code

If you want to, you can download the Python requirements used to generate the output in this book and use them to create a fresh Python environment. You will also need to download data_helpers.py and the sample dataset. The easiest way to do this is to clone or download the GitHub repo.

Contributing

This is a community resource - everyone is welcome to read and contribute to the textbook! There are lots of different ways you can contribute:

Roadmap

The textbook is not complete; here are chapters we would like to add:

If you would like to write a chapter, or propose another new chapter to add to the roadmap, please get in touch!

Acknowledgments

The creation of this book was funded by the Handling Enormous Files from Tomographic Imaging Experiments (HEFTIE) project. HEFTIE was funded by the OSCARS project, which received funding from the European Commission’s Horizon Europe Research and Innovation programme under grant agreement No. 101129751. Thanks to Alessandro Felder for reviewing and providing helpful improvements and comments.

References
  1. Moore, J., Basurto-Lozada, D., Besson, S., Bogovic, J., Bragantini, J., Brown, E. M., Burel, J.-M., Casas Moreno, X., de Medeiros, G., Diel, E. E., Gault, D., Ghosh, S. S., Gold, I., Halchenko, Y. O., Hartley, M., Horsfall, D., Keller, M. S., Kittisopikul, M., Kovacs, G., … Swedlow, J. R. (2023). OME-Zarr: a cloud-optimized bioimaging file format with international community support. Histochemistry and Cell Biology, 160(3), 223–251. https://doi.org/10.1007/s00418-023-02209-1