Connect to SQLite in Apache Spark
There are two options you can try:
Use JDBC directly
- Open a separate, plain JDBC connection in your Spark job
- Get the table names from the JDBC metadata
- Feed these into your `for` comprehension
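The steps above can be sketched with plain `java.sql` calls. This is a minimal sketch, assuming the xerial `sqlite-jdbc` driver is on the classpath; the database URL is a placeholder you would replace with the path to your own file:

```scala
import java.sql.DriverManager

// Open a separate, plain JDBC connection (outside Spark's DataFrame reader)
// and list all user tables via the standard JDBC metadata API.
def listSqliteTables(url: String): List[String] = {
  val conn = DriverManager.getConnection(url)
  try {
    // getTables(catalog, schemaPattern, tableNamePattern, types):
    // "%" matches every table name, and we restrict to plain tables.
    val rs = conn.getMetaData.getTables(null, null, "%", Array("TABLE"))
    Iterator
      .continually(rs)
      .takeWhile(_.next())
      .map(_.getString("TABLE_NAME"))
      .toList
  } finally conn.close()
}
```

The names this returns can then drive the `for` comprehension, with each table loaded through `sqlContext.read.format("jdbc")` as shown further down.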
Use a SQL query for the "dbtable" argument
You can specify a query as the value for the `dbtable` argument. Syntactically this query must "look" like a table, so it must be wrapped in a subquery.
In that query, get the metadata from the database:
```scala
val df = sqlContext.read.format("jdbc")
  .options(Map(
    "url" -> "jdbc:postgresql:xxx",
    "user" -> "x",
    "password" -> "x",
    "dbtable" -> "(select * from pg_tables) as t"))
  .load()
```
This example works with PostgreSQL; you have to adapt it for SQLite.
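For SQLite there is no `pg_tables`; the schema lives in the built-in `sqlite_master` table instead. A sketch of the adapted subquery (the Spark usage is shown in comments; the URL is a placeholder):

```scala
// SQLite stores its schema in the built-in sqlite_master table;
// filtering on type = 'table' yields one row per user table.
// The subquery must be aliased so it "looks" like a table to Spark.
val sqliteSchemaQuery =
  "(select tbl_name from sqlite_master where type = 'table') as t"

// Hypothetical usage with Spark's JDBC reader:
// val metaData = sqlContext.read.format("jdbc")
//   .options(Map(
//     "url" -> "jdbc:sqlite:/x.db",
//     "dbtable" -> sqliteSchemaQuery))
//   .load()
```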
Update
It seems that the JDBC driver only supports iterating over one result set at a time. However, if you materialize the list of table names using collect(), the following snippet should work:
```scala
val myTableNames = metaData.select("tbl_name").map(_.getString(0)).collect()

for (t <- myTableNames) {
  println(t.toString)
  val tableData = sqlContext.read.format("jdbc")
    .options(Map(
      "url" -> "jdbc:sqlite:/x.db",
      "dbtable" -> t))
    .load()
  tableData.show()
}
```