hadoop2.4.1 安装hue with Yarn模式

Here are some urls about HUE,以下是一些参考链接

http://cloudera.github.io/hue/docs-3.7.0/manual.html
https://github.com/cloudera/hue/tree/branch-3.7.1
https://raw.githubusercontent.com/cloudera/hue/branch-3.7.1/desktop/conf.dist/hue.ini
http://gethue.com/hadoop-tutorial-hive-query-editor-with-hiveserver2-and/
http://gethue.com/category/hive/
首先,check your OS (Ubuntu or CentOS);

Make sure ,different OS should install some different liblary;

$ git clone http://github.com/cloudera/hue.git
$ cd hue
$ make apps
$ /usr/local/hue-3.7.1/build/env/bin/supervisor &


错误0,查看metastore的时候错误:

StructuredThriftTransportException: Could not start SASL: Error in sasl_client_start (-4) SASL(-4): no mechanism available: No worthy mechs found (code THRIFTTRANSPORT): TTransportException('Could not start SASL: Error in sasl_client_start (-4) SASL(-4): no mechanism available: No worthy mechs found',)

You could try NOSASL. I'm not entirely sure what state the hue 3.0 tarball is in... but NONE implies using the PLAIN kerberos plugin, which is missing from your list of installed cyrus sasl plugins. yum install cyrus-sasl-plain should do the trick I think.


在hive-site.xml中增加以下内容:
<property>
        <name>hive.server2.authentication</name>
        <value>NOSASL</value> <!-- 如果出现SLAS那就一定需要增加这个属性-->
</property>


错误一:

2015-03-31 14:54:59,081 INFO  [main]: service.AbstractService (AbstractService.java:start(104)) - Service:HiveServer2 is started.
2015-03-31 14:54:59,158 WARN  [Thread-6]: conf.HiveConf (HiveConf.java:initialize(1383)) - DEPRECATED: Configuration property hive.metastore.local no longer has any effect. Make sure to provide a valid value for hive.metastore.uris if you are connecting to a remote metastore.
2015-03-31 14:54:59,231 INFO  [Thread-6]: thrift.ThriftCLIService (ThriftBinaryCLIService.java:run(88)) - ThriftBinaryCLIService listening on /192.168.2.234:10001
2015-03-31 14:58:54,934 INFO  [pool-7-thread-2]: thrift.ThriftCLIService (ThriftCLIService.java:OpenSession(188)) - Client protocol version: HIVE_CLI_SERVICE_PROTOCOL_V5
2015-03-31 14:58:54,949 WARN  [pool-7-thread-2]: thrift.ThriftCLIService (ThriftCLIService.java:OpenSession(197)) - Error opening session:
org.apache.hive.service.cli.HiveSQLException: Failed to validate proxy privilage of hue for zjj
        at org.apache.hive.service.auth.HiveAuthFactory.verifyProxyAccess(HiveAuthFactory.java:326)
        at org.apache.hive.service.cli.thrift.ThriftCLIService.getProxyUser(ThriftCLIService.java:588)
        at org.apache.hive.service.cli.thrift.ThriftCLIService.getUserName(ThriftCLIService.java:244)
        at org.apache.hive.service.cli.thrift.ThriftCLIService.getSessionHandle(ThriftCLIService.java:258)
        at org.apache.hive.service.cli.thrift.ThriftCLIService.OpenSession(ThriftCLIService.java:191)
        at org.apache.hive.service.cli.thrift.TCLIService$Processor$OpenSession.getResult(TCLIService.java:1253)
        at org.apache.hive.service.cli.thrift.TCLIService$Processor$OpenSession.getResult(TCLIService.java:1238)
        at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:39)
        at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:39)
        at org.apache.hive.service.auth.TSetIpAddressProcessor.process(TSetIpAddressProcessor.java:55)
        at org.apache.thrift.server.TThreadPoolServer$WorkerProcess.run(TThreadPoolServer.java:206)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.hadoop.security.authorize.AuthorizationException: User: hue is not allowed to impersonate zjj
        at org.apache.hadoop.security.authorize.ProxyUsers.authorize(ProxyUsers.java:143)
        at org.apache.hadoop.hive.shims.HadoopShimsSecure.authorizeProxyAccess(HadoopShimsSecure.java:525)
        at org.apache.hive.service.auth.HiveAuthFactory.verifyProxyAccess(HiveAuthFactory.java:322)
        ... 13 more
2015-03-31 15:03:18,263 INFO  [pool-7-thread-3]: thrift.ThriftCLIService (ThriftCLIService.java:OpenSession(188)) - Client protocol version: HIVE_CLI_SERVICE_PROTOCOL_V5
2015-03-31 15:03:18,268 WARN  [pool-7-thread-3]: thrift.ThriftCLIService (ThriftCLIService.java:OpenSession(197)) - Error opening session:
org.apache.hive.service.cli.HiveSQLException: Failed to validate proxy privilage of hue for zjj
        at org.apache.hive.service.auth.HiveAuthFactory.verifyProxyAccess(HiveAuthFactory.java:326)
        at org.apache.hive.service.cli.thrift.ThriftCLIService.getProxyUser(ThriftCLIService.java:588)
        at org.apache.hive.service.cli.thrift.ThriftCLIService.getUserName(ThriftCLIService.java:244)
        at org.apache.hive.service.cli.thrift.ThriftCLIService.getSessionHandle(ThriftCLIService.java:258)
        at org.apache.hive.service.cli.thrift.ThriftCLIService.OpenSession(ThriftCLIService.java:191)
        at org.apache.hive.service.cli.thrift.TCLIService$Processor$OpenSession.getResult(TCLIService.java:1253)
        at org.apache.hive.service.cli.thrift.TCLIService$Processor$OpenSession.getResult(TCLIService.java:1238)
        at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:39)
        at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:39)
        at org.apache.hive.service.auth.TSetIpAddressProcessor.process(TSetIpAddressProcessor.java:55)
        at org.apache.thrift.server.TThreadPoolServer$WorkerProcess.run(TThreadPoolServer.java:206)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.hadoop.security.authorize.AuthorizationException: User: hue is not allowed to impersonate zjj
        at org.apache.hadoop.security.authorize.ProxyUsers.authorize(ProxyUsers.java:143)
        at org.apache.hadoop.hive.shims.HadoopShimsSecure.authorizeProxyAccess(HadoopShimsSecure.java:525)
        at org.apache.hive.service.auth.HiveAuthFactory.verifyProxyAccess(HiveAuthFactory.java:322)
        ... 13 more

zjj用户没有代理的权限去访问httfs服务;


在core-site.xml中增加以下内容:
/app/prog/hadoop/etc/hadoop/core-site.xml
  <property>
                <name>hadoop.proxyuser.httpfs.hosts</name>
                <value>*</value>
        </property>
        <property>
                <name>hadoop.proxyuser.httpfs.groups</name>
                <value>*</value>
        </property>
        <property>
                <name>hadoop.proxyuser.hue.hosts</name>
                <value>*</value>
        </property>
        <property>
                <name>hadoop.proxyuser.hue.groups</name>
                <value>*</value>
        </property>
</configuration>

/app/prog/hadoop/etc/hadoop/hdfs-site.xml 

  <property>
        <name>dfs.webhdfs.enabled</name>
        <value>true</value>
    </property>

错误二:
查看jobHistory的生活出现500错误:
在httpfs-site.xml中增加以下内容,并启动服务  hadoop/sbin/httpfs.sh start


/app/prog/hadoop-2.4.1/etc/hadoop/httpfs-site.xml

/app/prog/hadoop/sbin/httpfs.sh  start 启动httpfs服务 ,保证Job Browser正常运行。
<configuration>
        <property>
                <name>httpfs.proxyuser.hue.hosts</name>
                <value>*</value>
        </property>
        <property>
                <name>httpfs.proxyuser.hue.groups</name>
                <value>*</value>
        </property>
</configuration>

在hive-site.xml 里面增加:
<!-- this property "hive.metastore.local" never supported since hive 0.10-->
<property>
  <name>hive.metastore.local</name>
  <value>false</value>
</property>
<property>
        <name>hive.server2.thrift.bind.host</name>
        <value>192.168.2.234</value>
</property>
<property>
        <name>hive.server2.thrift.port</name>
        <value>10001</value>
        </property>
<property>
        <name>hive.server2.enable.doAs</name>
        <value>true</value>
</property>
<property>
        <name>hive.server2.authentication</name>
        <value>NOSASL</value> <!-- 如果出现SLAS那就一定需要增加这个属性-->
</property>
<property><!--启用hiveserver2-->
<name>hive.server2.enable.impersonation</name>
    <description>Enable user impersonation for HiveServer2</description>
<value>true</value>
</property>

hive的metastore 端口默认是9083,不需要做任何配置,但是在启动hue的时候需要先启动 metastore 和hiveserver 。

启动命令: 
nohup /app/prog/hive/bin/hive --service metastore &
检查metastore 是否启动 : netstat -anp |grep 9083 
nohup /app/prog/hive/bin/hive --service hiveserver2 &
检查hiveserver2 是否正常启动 :netstat -anp |grep 10001

错误三,如果出现以下错误:
File "/opt/app/prog/hue-3.7.1/build/env/lib/python2.6/site-packages/Django-1.4.5-py2.6.egg/django/core/handlers/base.py", line 111, in get_response
    response = callback(request, *callback_args, **callback_kwargs)
  File "/opt/app/prog/hue-3.7.1/apps/metastore/src/metastore/views.py", line 106, in show_tables
    db = dbms.get(request.user)
  File "/opt/app/prog/hue-3.7.1/apps/beeswax/src/beeswax/server/dbms.py", line 57, in get
    DBMS_CACHE[user.username][query_server['server_name']] = HiveServer2Dbms(HiveServerClientCompatible(HiveServerClient(query_server, user)), QueryHistory.SERVER_TYPE[1][0])
  File "/opt/app/prog/hue-3.7.1/apps/beeswax/src/beeswax/server/hive_server2_lib.py", line 307, in __init__
    use_sasl, mechanism, kerberos_principal_short_name, impersonation_enabled, ldap_username, ldap_password = self.get_security()
  File "/opt/app/prog/hue-3.7.1/apps/beeswax/src/beeswax/server/hive_server2_lib.py", line 369, in get_security
    raise Exception(_('%s server authentication not supported. Valid are %s.' % (hive_mechanism, HiveServerClient.HS2_MECHANISMS.keys())))
NameError: global name '_' is not defined
请检查你的hive-site.xml文件,一定是里面有参数没有配置正确。


以下是hue.ini 的完整稳定配置:
# Hue configuration file
# ===================================
#
# For complete documentation about the contents of this file, run
#   $ <hue_root>/build/env/bin/hue config_help
#
# All .ini files under the current directory are treated equally.  Their
# contents are merged to form the Hue configuration, which can
# can be viewed on the Hue at
#   http://<hue_host>:<port>/dump_config




###########################################################################
# General configuration for core Desktop features (authentication, etc)
###########################################################################


[desktop]


  # Set this to a random string, the longer the better.
  # This is used for secure hashing in the session store.
  secret_key=huejustonlyfortest


  # Webserver listens on this address and port
  http_host=0.0.0.0
  http_port=8888


  # Time zone name
  time_zone=Asia/Shanghai


  # Enable or disable Django debug mode.
  django_debug_mode=false


  # Enable or disable backtrace for server error
  http_500_debug_mode=false


  # Enable or disable memory profiling.
  ## memory_profiler=false


  # Server email for internal error messages
  ## django_server_email='[email protected]'


  # Email backend
  ## django_email_backend=django.core.mail.backends.smtp.EmailBackend


  # Webserver runs as this user
   server_user=zjj
   server_group=zjj


  # This should be the Hue admin and proxy user
   default_user=zjj


  # This should be the hadoop cluster admin
   default_hdfs_superuser=zjj


  # If set to false, runcpserver will not actually start the web server.
  # Used if Apache is being used as a WSGI container.
  ## enable_server=yes


  # Number of threads used by the CherryPy web server
  ## cherrypy_server_threads=10


  # Filename of SSL Certificate
  ## ssl_certificate=


  # Filename of SSL RSA Private Key
  ## ssl_private_key=


  # List of allowed and disallowed ciphers in cipher list format.
  # See http://www.openssl.org/docs/apps/ciphers.html for more information on cipher list format.
  ## ssl_cipher_list=DEFAULT:!aNULL:!eNULL:!LOW:!EXPORT:!SSLv2


  # LDAP username and password of the hue user used for LDAP authentications.
  # Set it to use LDAP Authentication with HiveServer2 and Impala.
  ## ldap_username=hue
  ## ldap_password=


  # Default encoding for site data
  ## default_site_encoding=utf-8


  # Help improve Hue with anonymous usage analytics.
  # Use Google Analytics to see how many times an application or specific section of an application is used, nothing more.
  ## collect_usage=true
  
  # Support for HTTPS termination at the load-balancer level with SECURE_PROXY_SSL_HEADER.
  ## secure_proxy_ssl_header=false


  # Comma-separated list of Django middleware classes to use.
  # See https://docs.djangoproject.com/en/1.4/ref/middleware/ for more details on middlewares in Django.
  ## middleware=desktop.auth.backend.LdapSynchronizationBackend


  # Comma-separated list of regular expressions, which match the redirect URL.
  # For example, to restrict to your local domain and FQDN, the following value can be used:
  # ^\/.*$,^http:\/\/www.mydomain.com\/.*$
  ## redirect_whitelist=


  # Comma separated list of apps to not load at server startup.
  # e.g.: pig,zookeeper
  ## app_blacklist=


  # The directory where to store the auditing logs. Auditing is disable if the value is empty.
  # e.g. /var/log/hue/audit.log
  ## audit_event_log_dir=


  # Size in KB/MB/GB for audit log to rollover.
  ## audit_log_max_file_size=100MB


  # Administrators
  # ----------------
  [[django_admins]]
    ## [[[admin1]]]
    ## name=john
    ## [email protected]


  # UI customizations
  # -------------------
  [[custom]]


    # Top banner HTML code
    # e.g. <H2>Test Lab A2 Hue Services</H2>
    ## banner_top_html=


  # Configuration options for user authentication into the web application
  # ------------------------------------------------------------------------
  [[auth]]


    # Authentication backend. Common settings are:
    # - django.contrib.auth.backends.ModelBackend (entirely Django backend)
    # - desktop.auth.backend.AllowAllBackend (allows everyone)
    # - desktop.auth.backend.AllowFirstUserDjangoBackend
    #     (Default. Relies on Django and user manager, after the first login)
    # - desktop.auth.backend.LdapBackend
    # - desktop.auth.backend.PamBackend
    # - desktop.auth.backend.SpnegoDjangoBackend
    # - desktop.auth.backend.RemoteUserDjangoBackend
    # - libsaml.backend.SAML2Backend
    # - libopenid.backend.OpenIDBackend
    # - liboauth.backend.OAuthBackend
    #     (Support Twitter, Facebook, Google+ and Linkedin
    ## backend=desktop.auth.backend.AllowFirstUserDjangoBackend


    # The service to use when querying PAM.
    ## pam_service=login


    # When using the desktop.auth.backend.RemoteUserDjangoBackend, this sets
    # the normalized name of the header that contains the remote user.
    # The HTTP header in the request is converted to a key by converting
    # all characters to uppercase, replacing any hyphens with underscores
    # and adding an HTTP_ prefix to the name. So, for example, if the header
    # is called Remote-User that would be configured as HTTP_REMOTE_USER
    #
    # Defaults to HTTP_REMOTE_USER
    ## remote_user_header=HTTP_REMOTE_USER


    # Synchronize a users groups when they login
    ## sync_groups_on_login=false


    # Ignore the case of usernames when searching for existing users.
    # Only supported in remoteUserDjangoBackend.
    ## ignore_username_case=false


    # Ignore the case of usernames when searching for existing users to authenticate with.
    # Only supported in remoteUserDjangoBackend.
    ## force_username_lowercase=false


    # Users will expire after they have not logged in for 'n' amount of seconds.
    # A negative number means that users will never expire.
    ## expires_after=-1


    # Apply 'expires_after' to superusers.
    ## expire_superusers=true


  # Configuration options for connecting to LDAP and Active Directory
  # -------------------------------------------------------------------
  [[ldap]]


    # The search base for finding users and groups
    ## base_dn="DC=mycompany,DC=com"


    # URL of the LDAP server
    ## ldap_url=ldap://auth.mycompany.com


    # A PEM-format file containing certificates for the CA's that
    # Hue will trust for authentication over TLS.
    # The certificate for the CA that signed the
    # LDAP server certificate must be included among these certificates.
    # See more here http://www.openldap.org/doc/admin24/tls.html.
    ## ldap_cert=
    ## use_start_tls=true


    # Distinguished name of the user to bind as -- not necessary if the LDAP server
    # supports anonymous searches
    ## bind_dn="CN=ServiceAccount,DC=mycompany,DC=com"


    # Password of the bind user -- not necessary if the LDAP server supports
    # anonymous searches
    ## bind_password=


    # Pattern for searching for usernames -- Use <username> for the parameter
    # For use when using LdapBackend for Hue authentication
    ## ldap_username_pattern="uid=<username>,ou=People,dc=mycompany,dc=com"


    # Create users in Hue when they try to login with their LDAP credentials
    # For use when using LdapBackend for Hue authentication
    ## create_users_on_login = true


    # Ignore the case of usernames when searching for existing users in Hue.
    ## ignore_username_case=false


    # Force usernames to lowercase when creating new users from LDAP.
    ## force_username_lowercase=false


    # Use search bind authentication.
    ## search_bind_authentication=true


    # Choose which kind of subgrouping to use: nested or suboordinate (deprecated).
    ## subgroups=suboordinate


    # Define the number of levels to search for nested members.
    ## nested_members_search_depth=10


    [[[users]]]


      # Base filter for searching for users
      ## user_filter="objectclass=*"


      # The username attribute in the LDAP schema
      ## user_name_attr=sAMAccountName


    [[[groups]]]


      # Base filter for searching for groups
      ## group_filter="objectclass=*"


      # The group name attribute in the LDAP schema
      ## group_name_attr=cn


      # The attribute of the group object which identifies the members of the group
      ## group_member_attr=members
    
    [[[ldap_servers]]]


      ## [[[[mycompany]]]]


        # The search base for finding users and groups
        ## base_dn="DC=mycompany,DC=com"


        # URL of the LDAP server
        ## ldap_url=ldap://auth.mycompany.com


        # A PEM-format file containing certificates for the CA's that
        # Hue will trust for authentication over TLS.
        # The certificate for the CA that signed the
        # LDAP server certificate must be included among these certificates.
        # See more here http://www.openldap.org/doc/admin24/tls.html.
        ## ldap_cert=
        ## use_start_tls=true


        # Distinguished name of the user to bind as -- not necessary if the LDAP server
        # supports anonymous searches
        ## bind_dn="CN=ServiceAccount,DC=mycompany,DC=com"


        # Password of the bind user -- not necessary if the LDAP server supports
        # anonymous searches
        ## bind_password=


        # Pattern for searching for usernames -- Use <username> for the parameter
        # For use when using LdapBackend for Hue authentication
        ## ldap_username_pattern="uid=<username>,ou=People,dc=mycompany,dc=com"


        ## Use search bind authentication.
        ## search_bind_authentication=true


        ## [[[[[users]]]]]


          # Base filter for searching for users
          ## user_filter="objectclass=Person"


          # The username attribute in the LDAP schema
          ## user_name_attr=sAMAccountName


        ## [[[[[groups]]]]]


          # Base filter for searching for groups
          ## group_filter="objectclass=groupOfNames"


          # The username attribute in the LDAP schema
          ## group_name_attr=cn


  # Configuration options for specifying the Desktop Database. For more info,
  # see http://docs.djangoproject.com/en/1.4/ref/settings/#database-engine
  # ------------------------------------------------------------------------
  [[database]]
    # Database engine is typically one of:
    # postgresql_psycopg2, mysql, sqlite3 or oracle.
    #
    # Note that for sqlite3, 'name', below is a path to the filename. For other backends, it is the database name.
    # Note for Oracle, options={'threaded':true} must be set in order to avoid crashes.
    # Note for Oracle, you can use the Oracle Service Name by setting "port=0" and then "name=<host>:<port>/<service_name>".
     engine=sqlite3
    ## host=
    ## port=
    ## user=
    ## password=
     name=/usr/local/hue-3.7.1/desktop/desktop.db
    ## options={}


  # Configuration options for specifying the Desktop session.
  # For more info, see https://docs.djangoproject.com/en/1.4/topics/http/sessions/
  # ------------------------------------------------------------------------
  [[session]]
    # The cookie containing the users' session ID will expire after this amount of time in seconds.
    # Default is 2 weeks.
    ## ttl=1209600


    # The cookie containing the users' session ID will be secure.
    # Should only be enabled with HTTPS.
    ## secure=false


    # The cookie containing the users' session ID will use the HTTP only flag.
    ## http_only=false


    # Use session-length cookies. Logs out the user when she closes the browser window.
    ## expire_at_browser_close=false




  # Configuration options for connecting to an external SMTP server
  # ------------------------------------------------------------------------
  [[smtp]]


    # The SMTP server information for email notification delivery
    host=localhost
    port=25
    user=
    password=


    # Whether to use a TLS (secure) connection when talking to the SMTP server
    tls=no


    # Default email address to use for various automated notification from Hue
    ## default_from_email=hue@localhost




  # Configuration options for Kerberos integration for secured Hadoop clusters
  # ------------------------------------------------------------------------
  [[kerberos]]


    # Path to Hue's Kerberos keytab file
    ## hue_keytab=
    # Kerberos principal name for Hue
    ## hue_principal=hue/hostname.foo.com
    # Path to kinit
    ## kinit_path=/path/to/kinit




  # Configuration options for using OAuthBackend (core) login
  # ------------------------------------------------------------------------
  [[oauth]]
    # The Consumer key of the application
    ## consumer_key=XXXXXXXXXXXXXXXXXXXXX


    # The Consumer secret of the application
    ## consumer_secret=XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX


    # The Request token URL
    ## request_token_url=https://api.twitter.com/oauth/request_token


    # The Access token URL
    ## access_token_url=https://api.twitter.com/oauth/access_token


    # The Authorize URL
    ## authenticate_url=https://api.twitter.com/oauth/authorize




###########################################################################
# Settings to configure SAML
###########################################################################


[libsaml]
  # Xmlsec1 binary path. This program should be executable by the user running Hue.
  ## xmlsec_binary=/usr/local/bin/xmlsec1


  # Entity ID for Hue acting as service provider.
  # Can also accept a pattern where '<base_url>' will be replaced with server URL base.
  ## entity_id="<base_url>/saml2/metadata/"


  # Create users from SSO on login.
  ## create_users_on_login=true


  # Required attributes to ask for from IdP.
  # This requires a comma separated list.
  ## required_attributes=uid


  # Optional attributes to ask for from IdP.
  # This requires a comma separated list.
  ## optional_attributes=


  # IdP metadata in the form of a file. This is generally an XML file containing metadata that the Identity Provider generates.
  ## metadata_file=


  # Private key to encrypt metadata with.
  ## key_file=


  # Signed certificate to send along with encrypted metadata.
  ## cert_file=


  # A mapping from attributes in the response from the IdP to django user attributes.
  ## user_attribute_mapping={'uid':'username'}


  # Have Hue initiated authn requests be signed and provide a certificate.
  ## authn_requests_signed=false


  # Have Hue initiated logout requests be signed and provide a certificate.
  ## logout_requests_signed=false


  ## Username can be sourced from 'attributes' or 'nameid'.
  ## username_source=attributes


  # Performs the logout or not.
  ## logout_enabled=true




###########################################################################
# Settings to configure OPENID
###########################################################################


[libopenid]
  # (Required) OpenId SSO endpoint url. 
  ## server_endpoint_url=https://www.google.com/accounts/o8/id


  # OpenId 1.1 identity url prefix to be used instead of SSO endpoint url
  # This is only supported if you are using an OpenId 1.1 endpoint
  ## identity_url_prefix=https://app.onelogin.com/openid/your_company.com/


  # Create users from OPENID on login.
  ## create_users_on_login=true


  # Use email for username
  ## use_email_for_username=true




###########################################################################
# Settings to configure OAuth
###########################################################################


[liboauth]
  # NOTE: 
  # To work, each of the active (i.e. uncommented) service must have 
  # applications created on the social network.
  # Then the "consumer key" and "consumer secret" must be provided here.
  #
  # The addresses where to do so are:
  # Twitter:  https://dev.twitter.com/apps
  # Google+ : https://cloud.google.com/
  # Facebook: https://developers.facebook.com/apps
  # Linkedin: https://www.linkedin.com/secure/developer
  #
  # Additionnaly, the following must be set in the application settings:
  # Twitter:  Callback URL (aka Redirect URL) must be set to http://YOUR_HUE_IP_OR_DOMAIN_NAME/oauth/social_login/oauth_authenticated
  # Google+ : CONSENT SCREEN must have email address
  # Facebook: Sandbox Mode must be DISABLED
  # Linkedin: "In OAuth User Agreement", r_emailaddress is REQUIRED
  
  # The Consumer key of the application
  ## consumer_key_twitter=              
  ## consumer_key_google=               
  ## consumer_key_facebook=             
  ## consumer_key_linkedin=


  # The Consumer secret of the application
  ## consumer_secret_twitter=           
  ## consumer_secret_google=            
  ## consumer_secret_facebook=          
  ## consumer_secret_linkedin=


  # The Request token URL
  ## request_token_url_twitter=https://api.twitter.com/oauth/request_token
  ## request_token_url_google=https://accounts.google.com/o/oauth2/auth
  ## request_token_url_linkedin=https://www.linkedin.com/uas/oauth2/authorization
  ## request_token_url_facebook=https://graph.facebook.com/oauth/authorize


  # The Access token URL
  ## access_token_url_twitter=https://api.twitter.com/oauth/access_token
  ## access_token_url_google=https://accounts.google.com/o/oauth2/token
  ## access_token_url_facebook=https://graph.facebook.com/oauth/access_token
  ## access_token_url_linkedin=https://api.linkedin.com/uas/oauth2/accessToken


  # The Authenticate URL
  ## authenticate_url_twitter=https://api.twitter.com/oauth/authorize
  ## authenticate_url_google=https://www.googleapis.com/oauth2/v1/userinfo?access_token=
  ## authenticate_url_facebook=https://graph.facebook.com/me?access_token=
  ## authenticate_url_linkedin=https://api.linkedin.com/v1/people/~:(email-address)?format=json&oauth2_access_token=


  # Username Map. Json Hash format.
  # Replaces username parts in order to simplify usernames obtained
  # Example: {"@sub1.domain.com":"_S1", "@sub2.domain.com":"_S2"}
  # converts '[email protected]' to 'email_S1'
  ## username_map={}


  # Whitelisted domains (only applies to Google OAuth). CSV format.
  ## whitelisted_domains_google=


###########################################################################
# Settings for the RDBMS application
###########################################################################


[librdbms]
  # The RDBMS app can have any number of databases configured in the databases
  # section. A database is known by its section name
  # (IE sqlite, mysql, psql, and oracle in the list below).


  [[databases]]
    # sqlite configuration.
    ## [[[sqlite]]]
      # Name to show in the UI.
      ## nice_name=SQLite


      # For SQLite, name defines the path to the database.
      ## name=/tmp/sqlite.db


      # Database backend to use.
      ## engine=sqlite


      # Database options to send to the server when connecting.
      # https://docs.djangoproject.com/en/1.4/ref/databases/
      ## options={}


    # mysql, oracle, or postgresql configuration.
    ## [[[mysql]]]
      # Name to show in the UI.
      ## nice_name="My SQL DB"


      # For MySQL and PostgreSQL, name is the name of the database.
      # For Oracle, Name is instance of the Oracle server. For express edition
      # this is 'xe' by default.
      ## name=mysqldb


      # Database backend to use. This can be:
      # 1. mysql
      # 2. postgresql
      # 3. oracle
      ## engine=mysql


      # IP or hostname of the database to connect to.
      ## host=localhost


      # Port the database server is listening to. Defaults are:
      # 1. MySQL: 3306
      # 2. PostgreSQL: 5432
      # 3. Oracle Express Edition: 1521
      ## port=3306


      # Username to authenticate with when connecting to the database.
      ## user=example


      # Password matching the username to authenticate with when
      # connecting to the database.
      ## password=example


      # Database options to send to the server when connecting.
      # https://docs.djangoproject.com/en/1.4/ref/databases/
      ## options={}


###########################################################################
# Settings to configure your Hadoop cluster.
###########################################################################


[hadoop]


  # Configuration for HDFS NameNode
  # ------------------------------------------------------------------------
  [[hdfs_clusters]]
    # HA support by using HttpFs


    [[[default]]]
      # Enter the filesystem uri
      fs_defaultfs=hdfs://192.168.2.234:8020


      # NameNode logical name.
      ## logical_name=


      # Use WebHdfs/HttpFs as the communication mechanism.
      # Domain should be the NameNode or HttpFs host.
      # Default port is 14000 for HttpFs.
       webhdfs_url=http://192.168.2.234:14000/webhdfs/v1


      # Change this if your HDFS cluster is Kerberos-secured
      ## security_enabled=false


      # Default umask for file and directory creation, specified in an octal value.
      ## umask=022


  # Configuration for YARN (MR2)
  # ------------------------------------------------------------------------
  [[yarn_clusters]]


    [[[default]]]
      # Enter the host on which you are running the ResourceManager
       resourcemanager_host=192.168.2.234


      # The port where the ResourceManager IPC listens on
      ## resourcemanager_port=8032


      # Whether to submit jobs to this cluster
      submit_to=True


      # Resource Manager logical name (required for HA)
      ## logical_name=


      # Change this if your YARN cluster is Kerberos-secured
      ## security_enabled=false


      # URL of the ResourceManager API
       resourcemanager_api_url=http://192.168.2.234:8088


      # URL of the ProxyServer API
       proxy_api_url=http://192.168.2.234:8088


      # URL of the HistoryServer API
       history_server_api_url=http://192.168.2.234:19888


    # HA support by specifying multiple clusters
    # e.g.


    # [[[ha]]]
      # Resource Manager logical name (required for HA)
      # logical_name=my-rm-name


  # Configuration for MapReduce (MR1)
  # ------------------------------------------------------------------------
  [[mapred_clusters]]


    [[[default]]]
      # Enter the host on which you are running the Hadoop JobTracker
      ## jobtracker_host=localhost


      # The port where the JobTracker IPC listens on
      ## jobtracker_port=8021


      # JobTracker logical name for HA
      ## logical_name=


      # Thrift plug-in port for the JobTracker
      ## thrift_port=9290


      # Whether to submit jobs to this cluster
      submit_to=False


      # Change this if your MapReduce cluster is Kerberos-secured
      ## security_enabled=false


    # HA support by specifying multiple clusters
    # e.g.


    # [[[ha]]]
      # Enter the logical name of the JobTrackers
      # logical_name=my-jt-name




###########################################################################
# Settings to configure the Filebrowser app
###########################################################################


[filebrowser]
  # Location on local filesystem where the uploaded archives are temporary stored.
  ## archive_upload_tempdir=/tmp


###########################################################################
# Settings to configure liboozie
###########################################################################


[liboozie]
  # The URL where the Oozie service runs on. This is required in order for
  # users to submit jobs. Empty value disables the config check.
   oozie_url=http://192.168.2.234:11000/oozie


  # Requires FQDN in oozie_url if enabled
   security_enabled=false


  # Location on HDFS where the workflows/coordinator are deployed when submitted.
   remote_deployement_dir=/user/zjj/oozie-zjj




###########################################################################
# Settings to configure the Oozie app
###########################################################################


[oozie]
  # Location on local FS where the examples are stored.
   local_data_dir=..../examples


  # Location on local FS where the data for the examples is stored.
   sample_data_dir=...thirdparty/sample_data


  # Location on HDFS where the oozie examples and workflows are stored.
   remote_data_dir=/user/zjj/oozie-expample


  # Maximum of Oozie workflows or coodinators to retrieve in one API call.
  ## oozie_jobs_count=100


  # Use Cron format for defining the frequency of a Coordinator instead of the old frequency number/unit.
  ## enable_cron_scheduling=true




###########################################################################
# Settings to configure Beeswax with Hive
###########################################################################


[beeswax]


  # Host where HiveServer2 is running.
  # If Kerberos security is enabled, use fully-qualified domain name (FQDN).
   hive_server_host=192.168.2.234


  # Port where HiveServer2 Thrift server runs on.
   hive_server_port=10001


  # Hive configuration directory, where hive-site.xml is located
  hive_conf_dir=/app/prog/hive/conf


  # Timeout in seconds for thrift calls to Hive service
  ## server_conn_timeout=120


  # Set a LIMIT clause when browsing a partitioned table.
  # A positive value will be set as the LIMIT. If 0 or negative, do not set any limit.
  ## browse_partitioned_table_limit=250


  # A limit to the number of rows that can be downloaded from a query.
  # A value of -1 means there will be no limit.
  # A maximum of 65,000 is applied to XLS downloads.
  ## download_row_limit=1000000


  # Hue will try to close the Hive query when the user leaves the editor page.
  # This will free all the query resources in HiveServer2, but also make its results inaccessible.
  ## close_queries=false


  # Thrift version to use when communicating with HiveServer2
  ## thrift_version=5


  [[ssl]]
    # SSL communication enabled for this server.
    ## enabled=false


    # Path to Certificate Authority certificates.
    ## cacerts=/etc/hue/cacerts.pem


    # Path to the private key file.
    ## key=/etc/hue/key.pem


    # Path to the public certificate file.
    ## cert=/etc/hue/cert.pem


    # Choose whether Hue should validate certificates received from the server.
    ## validate=true




###########################################################################
# Settings to configure Pig
###########################################################################


[pig]
  # Location of piggybank.jar on local filesystem.
  ## local_sample_dir=/usr/share/hue/apps/pig/examples


  # Location piggybank.jar will be copied to in HDFS.
  ## remote_data_dir=/user/hue/pig/examples




###########################################################################
# Settings to configure Sqoop
###########################################################################


[sqoop]
  # For autocompletion, fill out the librdbms section.


  # Sqoop server URL
  ## server_url=http://localhost:12000/sqoop




###########################################################################
# Settings to configure Proxy
###########################################################################


[proxy]
  # Comma-separated list of regular expressions,
  # which match 'host:port' of requested proxy target.
  ## whitelist=(localhost|127\.0\.0\.1):(50030|50070|50060|50075)


  # Comma-separated list of regular expressions,
  # which match any prefix of 'host:port/path' of requested proxy target.
  # This does not support matching GET parameters.
  ## blacklist=




###########################################################################
# Settings to configure Impala
###########################################################################


[impala]
  # Host of the Impala Server (one of the Impalad)
  ## server_host=localhost


  # Port of the Impala Server
  ## server_port=21050


  # Kerberos principal
  ## impala_principal=impala/hostname.foo.com


  # Turn on/off impersonation mechanism when talking to Impala
  ## impersonation_enabled=False


  # Number of initial rows of a result set to ask Impala to cache in order
  # to support re-fetching them for downloading them.
  # Set to 0 for disabling the option and backward compatibility.
  ## querycache_rows=50000


  # Timeout in seconds for thrift calls
  ## server_conn_timeout=120


  # Hue will try to close the Impala query when the user leaves the editor page.
  # This will free all the query resources in Impala, but also make its results inaccessible.
  ## close_queries=true


  # If QUERY_TIMEOUT_S > 0, the query will be timed out (i.e. cancelled) if Impala does not do any work
  # (compute or send back results) for that query within QUERY_TIMEOUT_S seconds.
  ## query_timeout_s=600




###########################################################################
# Settings to configure HBase Browser
###########################################################################


[hbase]
  # Comma-separated list of HBase Thrift servers for clusters in the format of '(name|host:port)'.
  # Use full hostname with security.
  ## hbase_clusters=(Cluster|localhost:9090)


  # HBase configuration directory, where hbase-site.xml is located.
  ## hbase_conf_dir=/etc/hbase/conf


  # Hard limit of rows or columns per row fetched before truncating.
  ## truncate_limit = 500


  # 'buffered' is the default of the HBase Thrift Server and supports security.
  # 'framed' can be used to chunk up responses,
  # which is useful when used in conjunction with the nonblocking server in Thrift.
  ## thrift_transport=buffered




###########################################################################
# Settings to configure Solr Search
###########################################################################


[search]


  # URL of the Solr Server
  ## solr_url=http://localhost:8983/solr/


  # Requires FQDN in solr_url if enabled
  ## security_enabled=false


  ## Query sent when no term is entered
  ## empty_query=*:*




###########################################################################
# Settings to configure Solr Indexer
###########################################################################


[indexer]


  # Location of the solrctl binary.
  ## solrctl_path=/usr/bin/solrctl


  # Location of the solr home.
  ## solr_home=/usr/lib/solr


  # Zookeeper ensemble.
  ## solr_zk_ensemble=localhost:2181/solr


  # The contents of this directory will be copied over to the solrctl host to its temporary directory.
  ## config_template_path=/../hue/desktop/libs/indexer/src/data/solr_configs




###########################################################################
# Settings to configure Job Designer
###########################################################################


[jobsub]


  # Location on local FS where examples and template are stored.
  ## local_data_dir=..../data


  # Location on local FS where sample data is stored
  ## sample_data_dir=...thirdparty/sample_data




###########################################################################
# Settings to configure Job Browser.
###########################################################################


[jobbrowser]
  # Share submitted jobs information with all users. If set to false,
  # submitted jobs are visible only to the owner and administrators.
  ## share_jobs=true




###########################################################################
# Settings to configure the Zookeeper application.
###########################################################################


[zookeeper]


  [[clusters]]


    [[[default]]]
      # Zookeeper ensemble. Comma separated list of Host/Port.
      # e.g. localhost:2181,localhost:2182,localhost:2183
      ## host_ports=localhost:2181


      # The URL of the REST contrib service (required for znode browsing)
      ## rest_url=http://localhost:9998




###########################################################################
# Settings to configure the Spark application.
###########################################################################


[spark]
  # URL of the REST Spark Job Server.
  ## server_url=http://localhost:8090/




###########################################################################
# Settings for the User Admin application
###########################################################################


[useradmin]
  # The name of the default user group that users will be a member of
  ## default_user_group=default




###########################################################################
# Settings for the Sentry lib
###########################################################################


[libsentry]
  # Hostname or IP of server.
  ## hostname=localhost


  # Port the sentry service is running on.
  ## port=8038


  # Sentry configuration directory, where sentry-site.xml is located.
  ## sentry_conf_dir=/etc/sentry/conf


以上配置是hive 和mysql在同一台机器,线上我们的mysql和hive的分开部署在不同的机器上,减小单台机器压力,配置大同小异,唯一需要修改的就是hue hiveserver2的hostname 和hive-site.xml的配置。









你可能感兴趣的:(hadoop2.4.1 安装hue with Yarn模式)