Does your Python package include sample datasets? Are you shipping
them with the code? Are they getting too big?

Pooch is here to help! It will manage a data registry by downloading
your data files from a server only when needed and storing them
locally in a data cache (a folder on your computer).
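
The "download only when needed" behaviour described above can be sketched with the standard library alone. This is a minimal illustration and not Pooch's actual implementation: the names `sha256_of` and `fetch`, the dictionary-based registry, and the `download` callback are all assumptions made for this example.

```python
import hashlib
from pathlib import Path
from typing import Callable, Dict


def sha256_of(path: Path) -> str:
    """Return the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as stream:
        for chunk in iter(lambda: stream.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fetch(
    name: str,
    registry: Dict[str, str],
    cache_dir: Path,
    download: Callable[[str, Path], None],
) -> Path:
    """Return the cached path of `name`, downloading only when needed.

    `registry` maps file names to expected SHA256 digests (hypothetical
    layout). `download(name, target)` is a caller-supplied function that
    writes the file to `target`.
    """
    expected = registry[name]  # raises KeyError for unregistered files
    target = cache_dir / name
    # Download when the file is missing or its hash no longer matches.
    if not target.exists() or sha256_of(target) != expected:
        cache_dir.mkdir(parents=True, exist_ok=True)
        download(name, target)
        # Verify the integrity of what was just downloaded.
        if sha256_of(target) != expected:
            target.unlink()
            raise ValueError(f"SHA256 mismatch for {name!r}")
    return target
```

Calling `fetch` twice for the same file invokes `download` only once, because the second call finds a cached copy whose hash matches the registry. Passing the downloader as a callback keeps the sketch testable without network access.
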

Here are Pooch's main features:

* Pure Python and minimal dependencies.
* Download a file only if necessary (it's not in the data cache
or needs to be updated).
* Verify download integrity through SHA256 hashes (also used to
check if a file needs to be updated).
* Designed to be extended: plug in custom download (FTP, scp,
etc.) and post-processing (unzip, decompress, rename) functions.
* Includes utilities to unzip/decompress the data upon download
to save loading time.
* Can handle basic HTTP authentication (for servers that require
a login) and print download progress bars.
* Easily set up an environment variable to override the data
cache location.
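
Two of the features listed above, the environment-variable override of the cache location and the post-processing hook, can be illustrated with a short standard-library sketch. It is not Pooch's own code: `cache_location`, `unzip`, and the variable name `MYPKG_DATA_DIR` are hypothetical names chosen for this example.

```python
import os
import zipfile
from pathlib import Path
from typing import Optional


def cache_location(default: Path, env: Optional[str] = None) -> Path:
    """Pick the cache folder, letting an environment variable override it.

    `env` is the NAME of the variable (hypothetical, e.g. "MYPKG_DATA_DIR").
    If it is unset or empty, the default folder is used.
    """
    if env and os.environ.get(env):
        return Path(os.environ[env]).expanduser()
    return default


def unzip(archive: Path) -> Path:
    """Post-processing hook: extract `archive` into a sibling folder.

    Extraction is skipped when the folder already exists, so repeated
    loads do not pay the unpacking cost again.
    """
    out_dir = archive.with_name(archive.name + ".unzip")
    if not out_dir.exists():
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(out_dir)
    return out_dir
```

A package could call `cache_location` once at import time and pass `unzip` as the function to run after each download, so users relocate the cache by exporting a single variable and never unpack the same archive twice.
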