Home

Contents

Pooch

A friend to fetch your data files

Just want to download a file without messing with requests and urllib? Trying to add sample datasets to your Python package? Pooch is here to help!

Pooch is a Python library that can manage data by downloading files from a server (only when needed) and storing them locally in a data cache (a folder on your computer).

  • Pure Python and minimal dependencies.

  • Download files over HTTP, FTP, and from data repositories like Zenodo and figshare.

  • Built-in post-processors to unzip/decompress the data after download.

  • Designed to be extended: create custom downloaders and post-processors.

Are you a scientist or researcher? Pooch can help you too!

  • Host your data on a repository and download using the DOI.

  • Automatically download data using code instead of telling colleagues to do it themselves.

  • Make sure everyone running the code has the same version of the data files.


Getting started

New to Pooch? Start here!

Why use Pooch?

Need help?

Ask on our community channels.

Join the conversation

Reference documentation

A list of modules and functions.

List of functions and classes (API)

Using Pooch for research?

Citations help support our work!

Citing Pooch


See also

Pooch is a part of the Fatiando a Terra project.