Controlling Browser using Python?
Selenium Remote Control is a project that comes very close to what you are after. It is really easy to get working in Python with the selenium.webdriver
subpackage that comes with it. Once upon a time, these were two projects. They've now been unified.
Installation
Simple!
$ pip install -U selenium
Usage
>>> from selenium import webdriver>>> ff = webdriver.Firefox()>>> ff.get("http://stackoverflow.com/q/3369073/395287")>>> ff.save_screenshot("/absolute/path/to/webpage.png")
Notes
The documentation can be slightly confusing for Selenium, because there are two modes to interact with browsers. As well as the webdriver
mode, there is the ability to talk to a "standalone Selenium Remote Control server". That approach is what is documented first in the official documentation, but I would stick with webdriver
for the simple task here.
If you need to take screenshots, then you need to render the pages. I would recommend to use Selenium (as mentioned by Tim), or then spynner.
Here is a sample code of what you need using spynner:
import spynnerbrowser = spynner.Browser()browser.load("http://stackoverflow.com/q/3369073/")browser.snapshot().save('file.png')browser.close()
wxWebConnect is a wxWidgets library for controlling Gecko (Mozilla's rendering engine). Together with wxPython, it would allow you to write your own (minimal) web browser in Python, and hence control clicks.