Skip to content

The Dataist Posts

Tutorial: Using Apache Zeppelin with MySQL

I’ve been playing with Apache Zeppelin for a little while now, and have been really impressed.  If you aren’t familiar with Zeppelin,  it is a tool for creating interactive notebooks to visualize data.  With the latest version, Zeppelin includes an interpreter for PostgreSQL and I discovered that you can use this interpreter to connect Zeppelin to a MySQL server and quickly visualize your data.

Share the joy
7 Comments

Apache Zeppelin Releases Version 0.5.5

Apache ZeppelinThe developers of Apache Zeppelin just released a new version of Apache Zeppelin.  The release notes are here, but it doesn’t look like anything too exciting.  I’d really like to see an interpreter for Apache Drill that works with Zeppelin as well as either a generic ODBC/JDBC interpreter OR a MySQL interpreter.  Both would be incredibly useful.

Share the joy
Leave a Comment

Querying Apache Drill via the RESTful API

Here is a quick iPython notebook I wrote up which demonstrates how to execute queries in Apache Drill using Drill’s RESTful interface.  I’ve had a lot of difficulties getting Drill to “talk” to Python via JDBC and ODBC.  I think the problems however are related to my computer’s configuration, but in any event, this code works.

Querying Apache Drill via RESTful Interface

Share the joy
1 Comment

Strata talk featured on ProPublica!

It looks like my Strata talk sparked some conversation and an article at ProPublica!

http://www.propublica.org/article/your-smart-home-knows-a-lot-about-you

Smart Home Knows a Lot About YouAfter reflecting on the matter more, I hope that people will start to understand that these home automation d
evices really are data collection devices for the manufacturer of the device.  The Automatic, in my opinion, while it is a very neat device, provides little information that the driver wouldn’t alrea
y know about themselves and hence little benefit to the customer.  However, to the Automatic company, when you start aggregating this data, it provides a wealth of data to them.  Therefore, devices should have some sort of ranking as to benefit to consumer vs. benefit to company.

Share the joy
Leave a Comment