Tuesday, October 10, 2017

Flatten Complex Nested Parquet Files on Hadoop With Herringbone

Herringbone is a suite of tools for working with Parquet files on HDFS and for using them with Impala and Hive. Please visit my GitHub and this documentation for more details.

Installation

Note: Herringbone needs a Hadoop environment, so it must be installed and run on a Hadoop machine.
