Install and configure H2O on Hadoop

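On any one of the cluster nodes, download the H2O build that matches your Hadoop distribution (HDP 2.6 in this example) from
 http://h2o-release.s3.amazonaws.com/h2o/rel-wheeler/2/index.html
 and install it: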
# cd /downloads
# wget http://h2o-release.s3.amazonaws.com/h2o/rel-wheeler/2/h2o-3.16.0.2-hdp2.6.zip
# unzip h2o-3.16.0.2-hdp2.6.zip
# mv h2o-3.16.0.2-hdp2.6 /usr/local/h2o
# chown -R hdfs:hadoop /usr/local/h2o
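
Keeping the version string consistent across the download, unzip, and move steps is easier when every name is derived from one set of values. A minimal sketch: the two function names are hypothetical helpers (not part of H2O), and the URL layout is simply the one seen in the download link above.

```shell
#!/bin/sh
# Hypothetical helper: build the archive name from version and distribution tag.
h2o_zip_name() {
    # $1 = H2O version, $2 = Hadoop distribution tag
    printf 'h2o-%s-%s.zip\n' "$1" "$2"
}

# Hypothetical helper: build the download URL in the layout used by this walkthrough.
h2o_download_url() {
    # $1 = release name, $2 = build number, $3 = version, $4 = distribution tag
    printf 'http://h2o-release.s3.amazonaws.com/h2o/%s/%s/%s\n' \
        "$1" "$2" "$(h2o_zip_name "$3" "$4")"
}

# Values used in this walkthrough.
h2o_download_url rel-wheeler 2 3.16.0.2 hdp2.6
```

The printed URL can be passed to wget, and the output of `h2o_zip_name` to unzip, so both steps always refer to the same file.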

# ls -la /usr/local/h2o
total 98512
drwxr-xr-x   6 hdfs hdfs      4096 Sep 26 15:55 .
drwxr-xr-x. 14 root root      4096 Sep 26 15:39 ..
drwxr-xr-x   3 hdfs hdfs      4096 Sep 23 03:23 bindings
-rw-r--r--   1 hdfs hdfs 100846528 Sep 23 03:23 h2odriver.jar
drwxr-xr-x   2 hdfs hdfs      4096 Sep 23 03:23 python
drwxr-xr-x   2 hdfs hdfs      4096 Sep 23 03:23 R
-rw-r--r--   1 hdfs hdfs      1733 Sep 23 03:23 README.txt
drwxr-xr-x   2 hdfs hdfs      4096 Sep 26 16:11 start-info

Start the H2O cluster as the hdfs user, from the install directory:
# su - hdfs
$ cd /usr/local/h2o
$ hadoop jar h2odriver.jar -details
$ hadoop jar h2odriver.jar -nodes 3 -mapperXmx 7g -notify /usr/local/h2o/start-info/start.txt -disown -output hdfs://<namenode>:8020/user/hdfs/h2o-logs
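
The file given to -notify can be used to find the cluster address once startup has finished. The sketch below assumes the driver writes a single line of the form ip:port to that file when the cluster is up; check the file contents on your version before relying on this. `flow_url_from_notify` is a hypothetical helper, not an H2O command.

```shell
#!/bin/sh
# Hypothetical helper: turn the notify file into the Flow URL.
# Assumption: the file holds one line of the form ip:port.
flow_url_from_notify() {
    # $1 = path to the notify file
    [ -s "$1" ] || return 1   # file missing or empty: cluster not up yet
    printf 'http://%s/flow/index.html\n' "$(head -n 1 "$1" | tr -d '[:space:]')"
}

# Usage:
#   flow_url_from_notify /usr/local/h2o/start-info/start.txt
```

A non-zero return status means the file does not exist yet or is empty, so the call can be retried in a loop while the cluster starts.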
Note: in newer H2O versions, start.txt may not be created, and h2odriver.jar may be named h2o.jar instead.

Once the cluster is up, open H2O Flow in a browser:
http://<IP of installed node>:54321/flow/index.html

Shutting down the H2O instance: all data held by the instance will be lost.
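
Besides the Flow UI, the cluster can be inspected and shut down over H2O's REST API. A minimal sketch, assuming the /3/Cloud and /3/Shutdown endpoints of H2O 3 are reachable on port 54321; the helper names are hypothetical, and the host argument is whatever address the node is reachable on.

```shell
#!/bin/sh
# Hypothetical helper: build an H2O REST URL.
h2o_api_url() {
    # $1 = host, $2 = API path, $3 = port (defaults to 54321)
    printf 'http://%s:%s%s\n' "$1" "${3:-54321}" "$2"
}

# Print cluster status as JSON (assumes the /3/Cloud endpoint).
h2o_cloud_status() {
    curl -s "$(h2o_api_url "$1" /3/Cloud)"
}

# Ask the cluster to shut down (assumes the /3/Shutdown endpoint).
# All data held by the cluster is lost.
h2o_shutdown() {
    curl -s -X POST "$(h2o_api_url "$1" /3/Shutdown)"
}

# Usage:
#   h2o_cloud_status 10.0.0.5
#   h2o_shutdown 10.0.0.5
```

If the REST call does not stop the cluster, the process can still be killed on the server directly.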

To kill the process instead, log in to the server where H2O is running and find the process listening on port 54321:

# netstat -plant | grep 54321
tcp        0      0 0.0.0.0:54321               0.0.0.0:*                   LISTEN      225879/java

Look up the process by its ID:
# ps -ef | grep 225879
yarn     225879 225869 18 Dec01 ?        20:43:40 /mnt/vol1/jdk1.8.0_144//bin/java -server -XX:NewRatio=8 -Djava.net.preferIPv4Stack=true -Dhdp.version=2.6.1.0-129 -Xms7g -Xmx7g -verbose:gc -XX:+PrintGCDetails -XX:+PrintGCTimeStamps -Dlog4j.defaultInitOverride=true -Djava.io.tmpdir=/grid/12/hadoop/yarn/local/usercache/hdfs/appcache/application_1511423370011_8906/container_e37_1511423370011_8906_01_000002/tmp -Dlog4j.configuration=container-log4j.properties -Dyarn.app.container.log.dir=/grid/7/hadoop/yarn/log/application_1511423370011_8906/container_e37_1511423370011_8906_01_000002 -Dyarn.app.container.log.filesize=0 -Dhadoop.root.logger=INFO,CLA -Dhadoop.root.logfile=syslog org.apache.hadoop.mapred.YarnChild 9589 attempt_1511423370011_8906_m_000000_0 40681930227714
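
With the process ID and the YARN application ID in hand, the instance can be stopped. The sketch below pulls both values out of the netstat and ps output shown above; the two function names are hypothetical helpers. Because the process is a YARN container, killing the whole application with `yarn application -kill` is usually cleaner than killing a single Java process, though which one fits depends on your setup.

```shell
#!/bin/sh
# Hypothetical helper: extract the PID from the last column (PID/program)
# of a netstat -p line read on stdin.
pid_from_netstat() {
    awk '{ split($NF, a, "/"); print a[1] }'
}

# Hypothetical helper: extract the first YARN application ID found on stdin.
app_id_from_ps() {
    grep -o 'application_[0-9]*_[0-9]*' | head -n 1
}

# Usage (as root on the node running H2O):
#   pid=$(netstat -plant | grep ':54321 ' | grep LISTEN | pid_from_netstat)
#   app=$(ps -ef | grep "$pid" | app_id_from_ps)
#   yarn application -kill "$app"   # stops the whole H2O cluster
#   kill "$pid"                     # or stop only this process
```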