Extract images from .idx3-ubyte file or GZIP via Python

python mnist

Download the training/test images and labels:

train-images-idx3-ubyte.gz: training set images
train-labels-idx1-ubyte.gz: training set labels
t10k-images-idx3-ubyte.gz: test set images
t10k-labels-idx1-ubyte.gz: test set labels

And uncompress them in a workdir, say samples/.

Get the python-mnist package from PyPi:

pip install python-mnist

Import the mnist package and read the training/test images:

from mnist import MNISTmndata = MNIST('samples')images, labels = mndata.load_training()# orimages, labels = mndata.load_testing()

To display an image to the console:

index = random.randrange(0, len(images))  # choose an index ;-)print(mndata.display(images[index]))

You'll get something like this:

.............................................................................................................................................................@@.......................@@@@@.....................@@@@......................@@..........................@............................@...........................@...........................@...@.......................@@@@@.@.....................@@@...@@....................@@.....@...........................@...........................@@..........................@@..........................@..........................@@....................@.....@.....................@....@@......................@@@@.........................@..................................................................................................

Explanation:

Each image of the images list is a Python list of unsigned bytes.
The labels is an Python array of unsigned bytes.

python mnist

(Using only matplotlib, gzip and numpy)
Extract image data:

import gzipf = gzip.open('train-images-idx3-ubyte.gz','r')image_size = 28num_images = 5import numpy as npf.read(16)buf = f.read(image_size * image_size * num_images)data = np.frombuffer(buf, dtype=np.uint8).astype(np.float32)data = data.reshape(num_images, image_size, image_size, 1)

Print images:

import matplotlib.pyplot as pltimage = np.asarray(data[2]).squeeze()plt.imshow(image)plt.show()

Print first 50 labels:

f = gzip.open('train-labels-idx1-ubyte.gz','r')f.read(8)for i in range(0,50):       buf = f.read(1)    labels = np.frombuffer(buf, dtype=np.uint8).astype(np.int64)    print(labels)

python mnist

You could actually use the idx2numpy package available at PyPI. It's extremely simple to use and directly converts the data to numpy arrays. Here's what you have to do:

Downloading the data

Download the MNIST dataset from the official website.
If you're using Linux then you can use wget to get it from command line itself. Just run:

wget http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gzwget http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gzwget http://yann.lecun.com/exdb/mnist/t10k-images-idx3-ubyte.gzwget http://yann.lecun.com/exdb/mnist/t10k-labels-idx1-ubyte.gz

Decompressing the data

Unzip or decompress the data. On Linux, you could use gzip

Ultimately, you should have the following files:

data/train-images-idx3-ubytedata/train-labels-idx1-ubytedata/t10k-images-idx3-ubytedata/t10k-labels-idx1-ubyte

The prefix data/ is just because I've extracted them into a folder named data. Your question looks like you're well done till here, so keep reading.

Using idx2numpy

Here's a simple python code to read everything from the decompressed files as numpy arrays.

import idx2numpyimport numpy as npfile = 'data/train-images-idx3-ubyte'arr = idx2numpy.convert_from_file(file)# arr is now a np.ndarray type of object of shape 60000, 28, 28

You can now use it with OpenCV juts the same way how you display any other image, using something like

cv.imshow("Image", arr[4])

To install idx2numpy, you can use PyPI (pip package manager). Simply run the command:

pip install idx2numpy

CodeHunter

Extract images from .idx3-ubyte file or GZIP via Python

Downloading the data

Decompressing the data

Using idx2numpy

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last