Docker how to run pip requirements.txt only if there was a change? Docker how to run pip requirements.txt only if there was a change? python python

Docker how to run pip requirements.txt only if there was a change?


I'm assuming that at some point in your build process, you're copying your entire application into the Docker image with COPY or ADD:

COPY . /opt/appWORKDIR /opt/appRUN pip install -r requirements.txt

The problem is that you're invalidating the Docker build cache every time you're copying the entire application into the image. This will also invalidate the cache for all subsequent build steps.

To prevent this, I'd suggest copying only the requirements.txt file in a separate build step before adding the entire application into the image:

COPY requirements.txt /opt/app/requirements.txtWORKDIR /opt/appRUN pip install -r requirements.txtCOPY . /opt/app# continue as before...

As the requirements file itself probably changes only rarely, you'll be able to use the cached layers up until the point that you add your application code into the image.


This is directly mentioned in Docker's own "Best practices for writing Dockerfiles":

If you have multiple Dockerfile steps that use different files from your context, COPY them individually, rather than all at once. This will ensure that each step’s build cache is only invalidated (forcing the step to be re-run) if the specifically required files change.

For example:

COPY requirements.txt /tmp/RUN pip install --requirement /tmp/requirements.txtCOPY . /tmp/

Results in fewer cache invalidations for the RUN step, than if you put the COPY . /tmp/ before it.


Alternatively as a quicker means to run requirements.txt file without typing "yes" to confirm installation of libraries, you can re-write as:

COPY requirements.txt ./RUN pip install -y -r requirements.txtCOPY ./"dir"/* .