jeudi 8 janvier 2015

Presto / PrestoDB - Query .... failed: Catalog hive does not exist

Using Amazon EMR, Hive .13, Hadoop 2.x, and Presto Server 0.89. Trying to set up Presto to query data that is usually queried through Hive. Hive metadata is stored in MySQL. Presto Server is installed set up on all nodes. For the most part everything is set up as is documented on prestodb.io.


I first start the server on all nodes, and then start the CLI on the coordinator/name node. When I try to run a query using the below commands I get a "Query .... failed: Catalog hive does not exist" error:



presto-cli --server aws.internal.ip.of.coordinator:8080 --catalog hive ###also have tried localhost:8080
presto:default> show tables;
"Query .... failed: Catalog hive does not exist"


Configs:


etc/config.properties (only on coordinator)



coordinator=true
node-scheduler.include-coordinator=false
http-server.http.port=8080
task.max-memory=1GB
discovery-server.enabled=true
discovery.uri=http://ift.tt/1Av2TdK


etc/config.properties (only on workers)



coordinator=false
http-server.http.port=8080
task.max-memory=1GB
discovery.uri=http://ift.tt/1Av2TdK


catalog/hive.properties (all nodes)



connector.name=hive-hadoop2
hive.metastore.uri=thrift://aws.internal.ip.of.coordinator:9083


catalog/jmx.properties (all nodes)



connector.name=jmx


etc/jvm.config (all nodes)



-server
-Xmx16G
-XX:+UseConcMarkSweepGC
-XX:+ExplicitGCInvokesConcurrent
-XX:+CMSClassUnloadingEnabled
-XX:+AggressiveOpts
-XX:+HeapDumpOnOutOfMemoryError
-XX:OnOutOfMemoryError=kill -9 %p
-XX:ReservedCodeCacheSize=150M


etc/log.properties



com.facebook.presto=INFO


etc/node.properties



node.environment=production
node.id=master #different per node
node.data-dir=/mnt/presto-data




Aucun commentaire:

Enregistrer un commentaire