Originally Posted on https://trino.io/blog/2020/10/20/intro-to-hive-connector.html
TL;DR: The Hive connector is what you use in Presto for reading data from object storage that is organized according to the rules laid out by Hive, without using the Hive runtime code.
One of the most confusing aspects when starting Presto is the Hive connector. Typically, you seek out the use of Presto when you experience an intensely slow query turnaround from your existing Hadoop, Spark, or Hive infrastructure. In fact, the genesis of Presto came about due to these slow Hive query conditions at Facebook back in 2012.
So when you learn that Presto…
This blog was originally posted on my website.
There’s something I have to get off my chest. If you really need to, just read the TLDR and listen to the Justin Bieber parody posted below. If you’re confused by the lingo, the rest of the post will fill in any gaps.
TL;DR: Benchmarketing, the practice of using benchmarks for marketing, is bad. Consumers should run their own benchmarks and ideally open-source them instead of relying on an internal and biased report.
Enjoy the song I wrote about this silly practice.
For the longest time, I have wondered what is…
U.S. Marine turned software engineer and developer 🥑 working to foster the open source Trino community.