I'm not sure where else to ask this question, so I'll ask it here, as I think this might serve as a nice reference for future users who might have a similar question.
Are there any known production deployments of Apache HAWQ (http://hawq.incubator.apache.org/)? I would like to compare it with other engines such as Presto, Spark, and Impala, but I haven't come across any real-world uses of it beyond nice-looking benchmarks. And finally, if you have used it personally, what has your experience been?
Currently there is no independent documentation for Apache HAWQ, but the community is migrating the docs from Pivotal HDB to Apache HAWQ. For now, the docs links on the project page point to the HDB docs (http://hdb.docs.pivotal.io/211/hdb/index.html). You can refer to those first, and you can find the incubator-hawq-docs project at https://github.com/apache/incubator-hawq-docs.
Also, if you do not know where to ask questions, you can subscribe to the dev and user mailing lists: send an email to [email protected] / [email protected] to subscribe, then send your questions to [email protected] / [email protected].
Pivotal HDB (the commercial offering of HAWQ) is deployed at various clients. HAWQ is a fully SQL-compliant engine built on an MPP heritage. It is a unique product with a state-of-the-art query optimizer, dynamic partition elimination, and very robust HDFS data federation features for HBase, Hive, JSON, ORC (beta), and the native Hadoop file system. HAWQ uses the Parquet storage format, so tables created in HAWQ can be used elsewhere in the Hadoop ecosystem. HAWQ can also collect statistics on external tables for faster data access, and it supports ACID transactions (for inserts). On top of all this, the most compelling feature is doing data science with language extensions directly in SQL; it supports R, Python, Java, and Perl. I have seen HAWQ implementations in the automotive, oil and gas, IoT, and healthcare industries. The typical use cases I have encountered are BI on top of Hadoop, data science model training and execution, and interactive SQL on structured data. Because HAWQ was born out of the Greenplum heritage, some of its features are hard to find in competing products. HAWQ complements the Hadoop ecosystem very well.