How to set up a python application with selenium in a docker container

First of all you need a Docker Image with all packages installed. Lets create a Dockerfile for this.

FROM ubuntu:bionicRUN apt-get update && apt-get install -y \    python3 python3-pip \    fonts-liberation libappindicator3-1 libasound2 libatk-bridge2.0-0 \    libnspr4 libnss3 lsb-release xdg-utils libxss1 libdbus-glib-1-2 \    curl unzip wget \    xvfb# install geckodriver and firefoxRUN GECKODRIVER_VERSION=`curl https://github.com/mozilla/geckodriver/releases/latest | grep -Po 'v[0-9]+.[0-9]+.[0-9]+'` && \    wget https://github.com/mozilla/geckodriver/releases/download/$GECKODRIVER_VERSION/geckodriver-$GECKODRIVER_VERSION-linux64.tar.gz && \    tar -zxf geckodriver-$GECKODRIVER_VERSION-linux64.tar.gz -C /usr/local/bin && \    chmod +x /usr/local/bin/geckodriver && \    rm geckodriver-$GECKODRIVER_VERSION-linux64.tar.gzRUN FIREFOX_SETUP=firefox-setup.tar.bz2 && \    apt-get purge firefox && \    wget -O $FIREFOX_SETUP "https://download.mozilla.org/?product=firefox-latest&os=linux64" && \    tar xjf $FIREFOX_SETUP -C /opt/ && \    ln -s /opt/firefox/firefox /usr/bin/firefox && \    rm $FIREFOX_SETUP# install chromedriver and google-chromeRUN CHROMEDRIVER_VERSION=`curl -sS chromedriver.storage.googleapis.com/LATEST_RELEASE` && \    wget https://chromedriver.storage.googleapis.com/$CHROMEDRIVER_VERSION/chromedriver_linux64.zip && \    unzip chromedriver_linux64.zip -d /usr/bin && \    chmod +x /usr/bin/chromedriver && \    rm chromedriver_linux64.zipRUN CHROME_SETUP=google-chrome.deb && \    wget -O $CHROME_SETUP "https://dl.google.com/linux/direct/google-chrome-stable_current_amd64.deb" && \    dpkg -i $CHROME_SETUP && \    apt-get install -y -f && \    rm $CHROME_SETUP# install phantomjsRUN wget https://bitbucket.org/ariya/phantomjs/downloads/phantomjs-2.1.1-linux-x86_64.tar.bz2 && \    tar -jxf phantomjs-2.1.1-linux-x86_64.tar.bz2 && \    cp phantomjs-2.1.1-linux-x86_64/bin/phantomjs /usr/local/bin/phantomjs && \    rm phantomjs-2.1.1-linux-x86_64.tar.bz2RUN pip3 install seleniumRUN pip3 install pyvirtualdisplayRUN pip3 install Selenium-ScreenshotENV LANG C.UTF-8ENV LC_ALL C.UTF-8ENV PYTHONUNBUFFERED=1ENV APP_HOME /usr/src/appWORKDIR /$APP_HOMECOPY . $APP_HOME/CMD tail -f /dev/nullCMD python3 example.py

It will run your program in the end. In my case it is example.py

Now place the example.py in the same directory as Dockerfile. An example for Firefox, Chrome and Phantom JS is given below.

import osimport loggingfrom pyvirtualdisplay import Displayfrom selenium import webdriverlogging.getLogger().setLevel(logging.INFO)BASE_URL = 'http://www.example.com/'def chrome_example():    display = Display(visible=0, size=(800, 600))    display.start()    logging.info('Initialized virtual display..')    chrome_options = webdriver.ChromeOptions()    chrome_options.add_argument('--no-sandbox')    chrome_options.add_experimental_option('prefs', {        'download.default_directory': os.getcwd(),        'download.prompt_for_download': False,    })    logging.info('Prepared chrome options..')    browser = webdriver.Chrome(chrome_options=chrome_options)    logging.info('Initialized chrome browser..')    browser.get(BASE_URL)    logging.info('Accessed %s ..', BASE_URL)    logging.info('Page title: %s', browser.title)    browser.quit()    display.stop()def firefox_example():    display = Display(visible=0, size=(800, 600))    display.start()    logging.info('Initialized virtual display..')    firefox_profile = webdriver.FirefoxProfile()    firefox_profile.set_preference('browser.download.folderList', 2)    firefox_profile.set_preference('browser.download.manager.showWhenStarting', False)    firefox_profile.set_preference('browser.download.dir', os.getcwd())    firefox_profile.set_preference('browser.helperApps.neverAsk.saveToDisk', 'text/csv')    logging.info('Prepared firefox profile..')    browser = webdriver.Firefox(firefox_profile=firefox_profile)    logging.info('Initialized firefox browser..')    browser.get(BASE_URL)    logging.info('Accessed %s ..', BASE_URL)    logging.info('Page title: %s', browser.title)    browser.quit()    display.stop()def phantomjs_example():    display = Display(visible=0, size=(800, 600))    display.start()    logging.info('Initialized virtual display..')    browser = webdriver.PhantomJS()    logging.info('Initialized phantomjs browser..')    browser.get(BASE_URL)    logging.info('Accessed %s ..', BASE_URL)    logging.info('Page title: %s', browser.title)    browser.quit()    display.stop()if __name__ == '__main__':    chrome_example()    firefox_example()    phantomjs_example()

In the end we will create Docker-compose.yml to simplify things up.

selenium:    build: .    ports:        - 4000:4000    volumes:        - ./data/:/data/    privileged: true

Build and run through following command.

docker-compose build && docker-compose up -d

You can also run it through docker command without using docker-compose

docker build -t selenium_docker .docker run --privileged -p 4000:4000 -d -it selenium_docker

Source:

https://github.com/dimmg/dockselpy

CodeHunter

How to set up a python application with selenium in a docker container

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last