Using pandas to read downloaded html file

python html import pandas

I think you are on to the right track by using an html parser like beautiful soup. pandas.read_html() reads an html table not an html page.

You would want to do something like this...

from bs4 import BeautifulSoupimport pandas as pdtable = BeautifulSoup(open('C:/age0.html','r').read()).find('table')df = pd.read_html(table) #I think it accepts BeatifulSoup object                         #otherwise try str(table) as input

python html import pandas

first of all install below packages for parsing purpose
- pip install BeautifulSoup4
- pip install lxml
- pip install html5lib

then use 'read_html' to read html table on any html page.

import pandas as pdspds_df = pds.read_html('C:/age0.html')pds_df[0]

I hope this will help.

Good Luck!!

CodeHunter

Using pandas to read downloaded html file

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last