达达呀

Hadoop配置文件_mapred-default.xml












  mapreduce.jobtracker.jobhistory.location
  
   If job tracker is static the history files are stored 
  in this single well known place. If No value is set here, by default,
  it is in the local file system at ${hadoop.log.dir}/history.
  



  mapreduce.jobtracker.jobhistory.task.numberprogresssplits
  12
   Every task attempt progresses from 0.0 to 1.0 [unless
  it fails or is killed].  We record, for each task attempt, certain 
  statistics over each twelfth of the progress range.  You can change
  the number of intervals we divide the entire range of progress into
  by setting this property.  Higher values give more precision to the
  recorded data, but costs more memory in the job tracker at runtime.
  Each increment in this attribute costs 16 bytes per running task.
  



  mapreduce.job.userhistorylocation
  
   User can specify a location to store the history files of 
  a particular job. If nothing is specified, the logs are stored in 
  output directory. The files are stored in "_logs/history/" in the directory.
  User can stop logging by giving the value "none". 
  



  mapreduce.jobtracker.jobhistory.completed.location
  
   The completed job history files are stored at this single well 
  known location. If nothing is specified, the files are stored at 
  ${mapreduce.jobtracker.jobhistory.location}/done.
  



  mapreduce.job.committer.setup.cleanup.needed
  true
   true, if job needs job-setup and job-cleanup.
                false, otherwise  
  




  mapreduce.task.io.sort.factor
  10
  The number of streams to merge at once while sorting
  files.  This determines the number of open file handles.



  mapreduce.task.io.sort.mb
  100
  The total amount of buffer memory to use while sorting 
  files, in megabytes.  By default, gives each merge stream 1MB, which
  should minimize seeks.



  mapreduce.map.sort.spill.percent
  0.80
  The soft limit in the serialization buffer. Once reached, a
  thread will begin to spill the contents to disk in the background. Note that
  collection will not block if this threshold is exceeded while a spill is
  already in progress, so spills may be larger than this threshold when it is
  set to less than .5



  mapreduce.jobtracker.address
  local
  The host and port that the MapReduce job tracker runs
  at.  If "local", then jobs are run in-process as a single map
  and reduce task.
  



  mapreduce.local.clientfactory.class.name
  org.apache.hadoop.mapred.LocalClientFactory
  This the client factory that is responsible for 
  creating local job runner client



  mapreduce.jobtracker.http.address
  0.0.0.0:50030
  
    The job tracker http server address and port the server will listen on.
    If the port is 0 then the server will start on a free port.
  



  mapreduce.jobtracker.handler.count
  10
  
    The number of server threads for the JobTracker. This should be roughly
    4% of the number of tasktracker nodes.
  



  mapreduce.tasktracker.report.address
  127.0.0.1:0
  The interface and port that task tracker server listens on. 
  Since it is only connected to by the tasks, it uses the local interface.
  EXPERT ONLY. Should only be changed if your host does not have the loopback 
  interface.



  mapreduce.cluster.local.dir
  ${hadoop.tmp.dir}/mapred/local
  The local directory where MapReduce stores intermediate
  data files.  May be a comma-separated list of
  directories on different devices in order to spread disk i/o.
  Directories that do not exist are ignored.
  



  mapreduce.jobtracker.system.dir
  ${hadoop.tmp.dir}/mapred/system
  The directory where MapReduce stores control files.
  



  mapreduce.jobtracker.staging.root.dir
  ${hadoop.tmp.dir}/mapred/staging
  The root of the staging area for users' job files
  In practice, this should be the directory where users' home 
  directories are located (usually /user)
  



  mapreduce.cluster.temp.dir
  ${hadoop.tmp.dir}/mapred/temp
  A shared directory for temporary files.
  



  mapreduce.tasktracker.local.dir.minspacestart
  0
  If the space in mapreduce.cluster.local.dir drops under this, 
  do not ask for more tasks.
  Value in bytes.
  



  mapreduce.tasktracker.local.dir.minspacekill
  0
  If the space in mapreduce.cluster.local.dir drops under this, 
    do not ask more tasks until all the current ones have finished and 
    cleaned up. Also, to save the rest of the tasks we have running, 
    kill one of them, to clean up some space. Start with the reduce tasks,
    then go with the ones that have finished the least.
    Value in bytes.
  



  mapreduce.jobtracker.expire.trackers.interval
  600000
  Expert: The time-interval, in miliseconds, after which
  a tasktracker is declared 'lost' if it doesn't send heartbeats.
  



  mapreduce.tasktracker.instrumentation
  org.apache.hadoop.mapred.TaskTrackerMetricsInst
  Expert: The instrumentation class to associate with each TaskTracker.
  



  mapreduce.tasktracker.resourcecalculatorplugin
  
  
   Name of the class whose instance will be used to query resource information
   on the tasktracker.
   
   The class must be an instance of 
   org.apache.hadoop.util.ResourceCalculatorPlugin. If the value is null, the
   tasktracker attempts to use a class appropriate to the platform. 
   Currently, the only platform supported is Linux.
  



  mapreduce.tasktracker.taskmemorymanager.monitoringinterval
  5000
  The interval, in milliseconds, for which the tasktracker waits
   between two cycles of monitoring its tasks' memory usage. Used only if
   tasks' memory management is enabled via mapred.tasktracker.tasks.maxmemory.
   



  mapreduce.tasktracker.tasks.sleeptimebeforesigkill
  5000
  The time, in milliseconds, the tasktracker waits for sending a
  SIGKILL to a task, after it has been sent a SIGTERM. This is currently
  not used on WINDOWS where tasks are just sent a SIGTERM.
  



  mapreduce.job.maps
  2
  The default number of map tasks per job.
  Ignored when mapreduce.jobtracker.address is "local".  
  



  mapreduce.job.reduces
  1
  The default number of reduce tasks per job. Typically set to 99%
  of the cluster's reduce capacity, so that if a node fails the reduces can 
  still be executed in a single wave.
  Ignored when mapreduce.jobtracker.address is "local".
  



  mapreduce.jobtracker.restart.recover
  false
  "true" to enable (job) recovery upon restart,
               "false" to start afresh
  



  mapreduce.jobtracker.jobhistory.block.size
  3145728
  The block size of the job history file. Since the job recovery
               uses job history, its important to dump job history to disk as 
               soon as possible. Note that this is an expert level parameter.
               The default value is set to 3 MB.
  



  mapreduce.jobtracker.taskscheduler
  org.apache.hadoop.mapred.JobQueueTaskScheduler
  The class responsible for scheduling the tasks.



  mapreduce.job.running.map.limit
  0
  The maximum number of simultaneous map tasks per job.
  There is no limit if this value is 0 or negative.
  



  mapreduce.job.running.reduce.limit
  0
  The maximum number of simultaneous reduce tasks per job.
  There is no limit if this value is 0 or negative.
  



  mapreduce.job.reducer.preempt.delay.sec
  0
  The threshold in terms of seconds after which an unsatisfied mapper 
  request triggers reducer preemption to free space. Default 0 implies that the 
  reduces should be preempted immediately after allocation if there is currently no
  room for newly allocated mappers.
  



    mapreduce.job.max.split.locations
    10
    The max number of block locations to store for each split for 
    locality calculation.
    



  mapreduce.job.split.metainfo.maxsize
  10000000
  The maximum permissible size of the split metainfo file. 
  The JobTracker won't attempt to read split metainfo files bigger than
  the configured value.
  No limits if set to -1.
  



  mapreduce.jobtracker.taskscheduler.maxrunningtasks.perjob
  
  The maximum number of running tasks for a job before
  it gets preempted. No limits if undefined.
  



  mapreduce.map.maxattempts
  4
  Expert: The maximum number of attempts per map task.
  In other words, framework will try to execute a map task these many number
  of times before giving up on it.
  



  mapreduce.reduce.maxattempts
  4
  Expert: The maximum number of attempts per reduce task.
  In other words, framework will try to execute a reduce task these many number
  of times before giving up on it.
  



  mapreduce.reduce.shuffle.fetch.retry.enabled
  ${yarn.nodemanager.recovery.enabled}
  Set to enable fetch retry during host restart.



  mapreduce.reduce.shuffle.fetch.retry.interval-ms
  1000
  Time of interval that fetcher retry to fetch again when some 
  non-fatal failure happens because of some events like NM restart.
  



  mapreduce.reduce.shuffle.fetch.retry.timeout-ms
  30000
  Timeout value for fetcher to retry to fetch again when some 
  non-fatal failure happens because of some events like NM restart.



  mapreduce.reduce.shuffle.retry-delay.max.ms
  60000
  The maximum number of ms the reducer will delay before retrying
  to download map data.
  



  mapreduce.reduce.shuffle.parallelcopies
  5
  The default number of parallel transfers run by reduce
  during the copy(shuffle) phase.
  



  mapreduce.reduce.shuffle.connect.timeout
  180000
  Expert: The maximum amount of time (in milli seconds) reduce
  task spends in trying to connect to a tasktracker for getting map output.
  



  mapreduce.reduce.shuffle.read.timeout
  180000
  Expert: The maximum amount of time (in milli seconds) reduce
  task waits for map output data to be available for reading after obtaining
  connection.
  



  mapreduce.shuffle.connection-keep-alive.enable
  false
  set to true to support keep-alive connections.



  mapreduce.shuffle.connection-keep-alive.timeout
  5
  The number of seconds a shuffle client attempts to retain
   http connection. Refer "Keep-Alive: timeout=" header in
   Http specification
  



  mapreduce.task.timeout
  600000
  The number of milliseconds before a task will be
  terminated if it neither reads an input, writes an output, nor
  updates its status string.  A value of 0 disables the timeout.
  



  mapreduce.tasktracker.map.tasks.maximum
  2
  The maximum number of map tasks that will be run
  simultaneously by a task tracker.
  



  mapreduce.tasktracker.reduce.tasks.maximum
  2
  The maximum number of reduce tasks that will be run
  simultaneously by a task tracker.
  



  mapreduce.map.memory.mb
  1024
  The amount of memory to request from the scheduler for each
  map task.
  



  mapreduce.map.cpu.vcores
  1
  The number of virtual cores to request from the scheduler for
  each map task.
  



  mapreduce.reduce.memory.mb
  1024
  The amount of memory to request from the scheduler for each
  reduce task.
  



  mapreduce.reduce.cpu.vcores
  1
  The number of virtual cores to request from the scheduler for
  each reduce task.
  



  mapreduce.jobtracker.retiredjobs.cache.size
  1000
  The number of retired job status to keep in the cache.
  



  mapreduce.tasktracker.outofband.heartbeat
  false
  Expert: Set this to true to let the tasktracker send an 
  out-of-band heartbeat on task-completion for better latency.
  



  mapreduce.jobtracker.jobhistory.lru.cache.size
  5
  The number of job history files loaded in memory. The jobs are 
  loaded when they are first accessed. The cache is cleared based on LRU.
  



  mapreduce.jobtracker.instrumentation
  org.apache.hadoop.mapred.JobTrackerMetricsInst
  Expert: The instrumentation class to associate with each JobTracker.
  



  mapred.child.java.opts
  -Xmx200m
  Java opts for the task processes.
  The following symbol, if present, will be interpolated: @taskid@ is replaced 
  by current TaskID. Any other occurrences of '@' will go unchanged.
  For example, to enable verbose gc logging to a file named for the taskid in
  /tmp and to set the heap maximum to be a gigabyte, pass a 'value' of:
        -Xmx1024m -verbose:gc -Xloggc:/tmp/@[email protected]
  
  Usage of -Djava.library.path can cause programs to no longer function if
  hadoop native libraries are used. These values should instead be set as part 
  of LD_LIBRARY_PATH in the map / reduce JVM env using the mapreduce.map.env and 
  mapreduce.reduce.env config settings. 
  







  mapred.child.env
  
  User added environment variables for the task processes.
  Example :
  1) A=foo  This will set the env variable A to foo
  2) B=$B:c This is inherit nodemanager's B env variable on Unix.
  3) B=%B%;c This is inherit nodemanager's B env variable on Windows.
  







  mapreduce.admin.user.env
  
  
  Expert: Additional execution environment entries for 
  map and reduce task processes. This is not an additive property.
  You must preserve the original value if you want your map and
  reduce tasks to have access to native libraries (compression, etc). 
  When this value is empty, the command to set execution 
  envrionment will be OS dependent: 
  For linux, use LD_LIBRARY_PATH=$HADOOP_COMMON_HOME/lib/native.
  For windows, use PATH = %PATH%;%HADOOP_COMMON_HOME%\\bin.
  



  mapreduce.map.log.level
  INFO
  The logging level for the map task. The allowed levels are:
  OFF, FATAL, ERROR, WARN, INFO, DEBUG, TRACE and ALL.
  The setting here could be overridden if "mapreduce.job.log4j-properties-file"
  is set.
  



  mapreduce.reduce.log.level
  INFO
  The logging level for the reduce task. The allowed levels are:
  OFF, FATAL, ERROR, WARN, INFO, DEBUG, TRACE and ALL.
  The setting here could be overridden if "mapreduce.job.log4j-properties-file"
  is set.
  



  mapreduce.map.cpu.vcores
  1
  
      The number of virtual cores required for each map task.
  



  mapreduce.reduce.cpu.vcores
  1
  
      The number of virtual cores required for each reduce task.
  



  mapreduce.reduce.merge.inmem.threshold
  1000
  The threshold, in terms of the number of files 
  for the in-memory merge process. When we accumulate threshold number of files
  we initiate the in-memory merge and spill to disk. A value of 0 or less than
  0 indicates we want to DON'T have any threshold and instead depend only on
  the ramfs's memory consumption to trigger the merge.
  



  mapreduce.reduce.shuffle.merge.percent
  0.66
  The usage threshold at which an in-memory merge will be
  initiated, expressed as a percentage of the total memory allocated to
  storing in-memory map outputs, as defined by
  mapreduce.reduce.shuffle.input.buffer.percent.
  



  mapreduce.reduce.shuffle.input.buffer.percent
  0.70
  The percentage of memory to be allocated from the maximum heap
  size to storing map outputs during the shuffle.
  



  mapreduce.reduce.input.buffer.percent
  0.0
  The percentage of memory- relative to the maximum heap size- to
  retain map outputs during the reduce. When the shuffle is concluded, any
  remaining map outputs in memory must consume less than this threshold before
  the reduce can begin.
  



  mapreduce.reduce.shuffle.memory.limit.percent
  0.25
  Expert: Maximum percentage of the in-memory limit that a
  single shuffle can consume



  mapreduce.shuffle.ssl.enabled
  false
  
    Whether to use SSL for for the Shuffle HTTP endpoints.
  



  mapreduce.shuffle.ssl.file.buffer.size
  65536
  Buffer size for reading spills from file when using SSL.
  



  mapreduce.shuffle.max.connections
  0
  Max allowed connections for the shuffle.  Set to 0 (zero)
               to indicate no limit on the number of connections.
  



  mapreduce.shuffle.max.threads
  0
  Max allowed threads for serving shuffle connections. Set to zero
  to indicate the default of 2 times the number of available
  processors (as reported by Runtime.availableProcessors()). Netty is used to
  serve requests, so a thread is not needed for each connection.
  



  mapreduce.shuffle.transferTo.allowed
  
  This option can enable/disable using nio transferTo method in 
  the shuffle phase. NIO transferTo does not perform well on windows in the 
  shuffle phase. Thus, with this configuration property it is possible to 
  disable it, in which case custom transfer method will be used. Recommended 
  value is false when running Hadoop on Windows. For Linux, it is recommended 
  to set it to true. If nothing is set then the default value is false for 
  Windows, and true for Linux.
  



  mapreduce.shuffle.transfer.buffer.size
  131072
  This property is used only if 
  mapreduce.shuffle.transferTo.allowed is set to false. In that case, 
  this property defines the size of the buffer used in the buffer copy code
  for the shuffle phase. The size of this buffer determines the size of the IO
  requests.
  



  mapreduce.reduce.markreset.buffer.percent
  0.0
  The percentage of memory -relative to the maximum heap size- to
  be used for caching values when using the mark-reset functionality.
  



  mapreduce.map.speculative
  true
  If true, then multiple instances of some map tasks 
               may be executed in parallel.



  mapreduce.reduce.speculative
  true
  If true, then multiple instances of some reduce tasks 
               may be executed in parallel.



  mapreduce.job.speculative.speculative-cap-running-tasks
  0.1
  The max percent (0-1) of running tasks that
  can be speculatively re-executed at any time.



  mapreduce.job.speculative.speculative-cap-total-tasks
  0.01
  The max percent (0-1) of all tasks that
  can be speculatively re-executed at any time.



  mapreduce.job.speculative.minimum-allowed-tasks
  10
  The minimum allowed tasks that
  can be speculatively re-executed at any time.



  mapreduce.job.speculative.retry-after-no-speculate
  1000
  The waiting time(ms) to do next round of speculation
  if there is no task speculated in this round.



  mapreduce.job.speculative.retry-after-speculate
  15000
  The waiting time(ms) to do next round of speculation
  if there are tasks speculated in this round.



  mapreduce.job.map.output.collector.class
  org.apache.hadoop.mapred.MapTask$MapOutputBuffer
  
    The MapOutputCollector implementation(s) to use. This may be a comma-separated
    list of class names, in which case the map task will try to initialize each
    of the collectors in turn. The first to successfully initialize will be used.
  

 

  mapreduce.job.speculative.slowtaskthreshold
  1.0
  The number of standard deviations by which a task's
  ave progress-rates must be lower than the average of all running tasks'
  for the task to be considered too slow.
  



  mapreduce.job.jvm.numtasks
  1
  How many tasks to run per jvm. If set to -1, there is
  no limit. 
  



  mapreduce.job.ubertask.enable
  false
  Whether to enable the small-jobs "ubertask" optimization,
  which runs "sufficiently small" jobs sequentially within a single JVM.
  "Small" is defined by the following maxmaps, maxreduces, and maxbytes
  settings. Note that configurations for application masters also affect
  the "Small" definition - yarn.app.mapreduce.am.resource.mb must be
  larger than both mapreduce.map.memory.mb and mapreduce.reduce.memory.mb,
  and yarn.app.mapreduce.am.resource.cpu-vcores must be larger than
  both mapreduce.map.cpu.vcores and mapreduce.reduce.cpu.vcores to enable
  ubertask. Users may override this value.
  



  mapreduce.job.ubertask.maxmaps
  9
  Threshold for number of maps, beyond which job is considered
  too big for the ubertasking optimization.  Users may override this value,
  but only downward.
  



  mapreduce.job.ubertask.maxreduces
  1
  Threshold for number of reduces, beyond which job is considered
  too big for the ubertasking optimization.  CURRENTLY THE CODE CANNOT SUPPORT
  MORE THAN ONE REDUCE and will ignore larger values.  (Zero is a valid max,
  however.)  Users may override this value, but only downward.
  



  mapreduce.job.ubertask.maxbytes
  
  Threshold for number of input bytes, beyond which job is
  considered too big for the ubertasking optimization.  If no value is
  specified, dfs.block.size is used as a default.  Be sure to specify a
  default value in mapred-site.xml if the underlying filesystem is not HDFS.
  Users may override this value, but only downward.
  



    mapreduce.job.emit-timeline-data
    false
    Specifies if the Application Master should emit timeline data
    to the timeline server. Individual jobs can override this value.
    



  mapreduce.input.fileinputformat.split.minsize
  0
  The minimum size chunk that map input should be split
  into.  Note that some file formats may have minimum split sizes that
  take priority over this setting.



  mapreduce.input.fileinputformat.list-status.num-threads
  1
  The number of threads to use to list and fetch block locations
  for the specified input paths. Note: multiple threads should not be used
  if a custom non thread-safe path filter is used.
  



  mapreduce.jobtracker.maxtasks.perjob
  -1
  The maximum number of tasks for a single job.
  A value of -1 indicates that there is no maximum.  



  mapreduce.input.lineinputformat.linespermap
  1
  When using NLineInputFormat, the number of lines of input data
  to include in each split.



  mapreduce.client.submit.file.replication
  10
  The replication level for submitted job files.  This
  should be around the square root of the number of nodes.
  




  mapreduce.tasktracker.dns.interface
  default
  The name of the Network Interface from which a task
  tracker should report its IP address.
  
 
 

  mapreduce.tasktracker.dns.nameserver
  default
  The host name or IP address of the name server (DNS)
  which a TaskTracker should use to determine the host name used by
  the JobTracker for communication and display purposes.
  
 
 

  mapreduce.tasktracker.http.threads
  40
  The number of worker threads that for the http server. This is
               used for map output fetching
  



  mapreduce.tasktracker.http.address
  0.0.0.0:50060
  
    The task tracker http server address and port.
    If the port is 0 then the server will start on a free port.
  



  mapreduce.task.files.preserve.failedtasks
  false
  Should the files for failed tasks be kept. This should only be 
               used on jobs that are failing, because the storage is never
               reclaimed. It also prevents the map outputs from being erased
               from the reduce directory as they are consumed.






  mapreduce.output.fileoutputformat.compress
  false
  Should the job outputs be compressed?
  



  mapreduce.output.fileoutputformat.compress.type
  RECORD
  If the job outputs are to compressed as SequenceFiles, how should
               they be compressed? Should be one of NONE, RECORD or BLOCK.
  



  mapreduce.output.fileoutputformat.compress.codec
  org.apache.hadoop.io.compress.DefaultCodec
  If the job outputs are compressed, how should they be compressed?
  



  mapreduce.map.output.compress
  false
  Should the outputs of the maps be compressed before being
               sent across the network. Uses SequenceFile compression.
  



  mapreduce.map.output.compress.codec
  org.apache.hadoop.io.compress.DefaultCodec
  If the map outputs are compressed, how should they be 
               compressed?
  



  map.sort.class
  org.apache.hadoop.util.QuickSort
  The default sort class for sorting keys.
  



  mapreduce.task.userlog.limit.kb
  0
  The maximum size of user-logs of each task in KB. 0 disables the cap.
  



  yarn.app.mapreduce.am.container.log.limit.kb
  0
  The maximum size of the MRAppMaster attempt container logs in KB.
    0 disables the cap.
  



  yarn.app.mapreduce.task.container.log.backups
  0
  Number of backup files for task logs when using
    ContainerRollingLogAppender (CRLA). See
    org.apache.log4j.RollingFileAppender.maxBackupIndex. By default,
    ContainerLogAppender (CLA) is used, and container logs are not rolled. CRLA
    is enabled for tasks when both mapreduce.task.userlog.limit.kb and
    yarn.app.mapreduce.task.container.log.backups are greater than zero.
  



  yarn.app.mapreduce.am.container.log.backups
  0
  Number of backup files for the ApplicationMaster logs when using
    ContainerRollingLogAppender (CRLA). See
    org.apache.log4j.RollingFileAppender.maxBackupIndex. By default,
    ContainerLogAppender (CLA) is used, and container logs are not rolled. CRLA
    is enabled for the ApplicationMaster when both
    mapreduce.task.userlog.limit.kb and
    yarn.app.mapreduce.am.container.log.backups are greater than zero.
  



  yarn.app.mapreduce.shuffle.log.separate
  true
  If enabled ('true') logging generated by the client-side shuffle
    classes in a reducer will be written in a dedicated log file
    'syslog.shuffle' instead of 'syslog'.
  



  yarn.app.mapreduce.shuffle.log.limit.kb
  0
  Maximum size of the syslog.shuffle file in kilobytes
    (0 for no limit).
  



  yarn.app.mapreduce.shuffle.log.backups
  0
  If yarn.app.mapreduce.shuffle.log.limit.kb and
    yarn.app.mapreduce.shuffle.log.backups are greater than zero
    then a ContainerRollngLogAppender is used instead of ContainerLogAppender
    for syslog.shuffle. See
    org.apache.log4j.RollingFileAppender.maxBackupIndex
  



  mapreduce.job.userlog.retain.hours
  24
  The maximum time, in hours, for which the user-logs are to be 
               retained after the job completion.
  



  mapreduce.jobtracker.hosts.filename
  
  Names a file that contains the list of nodes that may
  connect to the jobtracker.  If the value is empty, all hosts are
  permitted.



  mapreduce.jobtracker.hosts.exclude.filename
  
  Names a file that contains the list of hosts that
  should be excluded by the jobtracker.  If the value is empty, no
  hosts are excluded.



  mapreduce.jobtracker.heartbeats.in.second
  100
  Expert: Approximate number of heart-beats that could arrive 
               at JobTracker in a second. Assuming each RPC can be processed 
               in 10msec, the default value is made 100 RPCs in a second.
  
 


  mapreduce.jobtracker.tasktracker.maxblacklists
  4
  The number of blacklists for a taskTracker by various jobs
               after which the task tracker could be blacklisted across
               all jobs. The tracker will be given a tasks later
               (after a day). The tracker will become a healthy
               tracker after a restart.
  
 


  mapreduce.job.maxtaskfailures.per.tracker
  3
  The number of task-failures on a tasktracker of a given job 
               after which new tasks of that job aren't assigned to it. It
               MUST be less than mapreduce.map.maxattempts and
               mapreduce.reduce.maxattempts otherwise the failed task will
               never be tried on a different node.
  



  mapreduce.client.output.filter
  FAILED
  The filter for controlling the output of the task's userlogs sent
               to the console of the JobClient. 
               The permissible options are: NONE, KILLED, FAILED, SUCCEEDED and 
               ALL.
  


  
    mapreduce.client.completion.pollinterval
    5000
    The interval (in milliseconds) between which the JobClient
    polls the JobTracker for updates about job status. You may want to set this
    to a lower value to make tests run faster on a single node system. Adjusting
    this value in production may lead to unwanted client-server traffic.
    
  

  
    mapreduce.client.progressmonitor.pollinterval
    1000
    The interval (in milliseconds) between which the JobClient
    reports status to the console and checks for job completion. You may want to set this
    to a lower value to make tests run faster on a single node system. Adjusting
    this value in production may lead to unwanted client-server traffic.
    
  

  
    mapreduce.jobtracker.persist.jobstatus.active
    true
    Indicates if persistency of job status information is
      active or not.
    
  

  
  mapreduce.jobtracker.persist.jobstatus.hours
  1
  The number of hours job status information is persisted in DFS.
    The job status information will be available after it drops of the memory
    queue and between jobtracker restarts. With a zero value the job status
    information is not persisted at all in DFS.
  


  
    mapreduce.jobtracker.persist.jobstatus.dir
    /jobtracker/jobsInfo
    The directory where the job status information is persisted
      in a file system to be available after it drops of the memory queue and
      between jobtracker restarts.
    
  

  
    mapreduce.task.profile
    false
    To set whether the system should collect profiler
     information for some of the tasks in this job? The information is stored
     in the user log directory. The value is "true" if task profiling
     is enabled.
  

  
    mapreduce.task.profile.maps
    0-2
     To set the ranges of map tasks to profile.
    mapreduce.task.profile has to be set to true for the value to be accounted.
    
  

  
    mapreduce.task.profile.reduces
    0-2
     To set the ranges of reduce tasks to profile.
    mapreduce.task.profile has to be set to true for the value to be accounted.
    
  

  
    mapreduce.task.profile.params
    -agentlib:hprof=cpu=samples,heap=sites,force=n,thread=y,verbose=n,file=%s
    JVM profiler parameters used to profile map and reduce task
      attempts. This string may contain a single format specifier %s that will
      be replaced by the path to profile.out in the task attempt log directory.
      To specify different profiling options for map tasks and reduce tasks,
      more specific parameters mapreduce.task.profile.map.params and
      mapreduce.task.profile.reduce.params should be used.
  

  
    mapreduce.task.profile.map.params
    ${mapreduce.task.profile.params}
    Map-task-specific JVM profiler parameters. See
      mapreduce.task.profile.params
  

  
    mapreduce.task.profile.reduce.params
    ${mapreduce.task.profile.params}
    Reduce-task-specific JVM profiler parameters. See
      mapreduce.task.profile.params
  

  
    mapreduce.task.skip.start.attempts
    2
     The number of Task attempts AFTER which skip mode 
    will be kicked off. When skip mode is kicked off, the 
    tasks reports the range of records which it will process 
    next, to the TaskTracker. So that on failures, TT knows which 
    ones are possibly the bad records. On further executions, 
    those are skipped.
    
  
  
  
    mapreduce.map.skip.proc.count.autoincr
    true
     The flag which if set to true, 
    SkipBadRecords.COUNTER_MAP_PROCESSED_RECORDS is incremented 
    by MapRunner after invoking the map function. This value must be set to 
    false for applications which process the records asynchronously 
    or buffer the input records. For example streaming. 
    In such cases applications should increment this counter on their own.
    
  
  
  
    mapreduce.reduce.skip.proc.count.autoincr
    true
     The flag which if set to true, 
    SkipBadRecords.COUNTER_REDUCE_PROCESSED_GROUPS is incremented 
    by framework after invoking the reduce function. This value must be set to 
    false for applications which process the records asynchronously 
    or buffer the input records. For example streaming. 
    In such cases applications should increment this counter on their own.
    
  
  
  
    mapreduce.job.skip.outdir
    
     If no value is specified here, the skipped records are 
    written to the output directory at _logs/skip.
    User can stop writing skipped records by giving the value "none". 
    
  

  
    mapreduce.map.skip.maxrecords
    0
     The number of acceptable skip records surrounding the bad 
    record PER bad record in mapper. The number includes the bad record as well.
    To turn the feature of detection/skipping of bad records off, set the 
    value to 0.
    The framework tries to narrow down the skipped range by retrying  
    until this threshold is met OR all attempts get exhausted for this task. 
    Set the value to Long.MAX_VALUE to indicate that framework need not try to 
    narrow down. Whatever records(depends on application) get skipped are 
    acceptable.
    
  
  
  
    mapreduce.reduce.skip.maxgroups
    0
     The number of acceptable skip groups surrounding the bad 
    group PER bad group in reducer. The number includes the bad group as well.
    To turn the feature of detection/skipping of bad groups off, set the 
    value to 0.
    The framework tries to narrow down the skipped range by retrying  
    until this threshold is met OR all attempts get exhausted for this task. 
    Set the value to Long.MAX_VALUE to indicate that framework need not try to 
    narrow down. Whatever groups(depends on application) get skipped are 
    acceptable.
    
  

  
    mapreduce.ifile.readahead
    true
    Configuration key to enable/disable IFile readahead.
    
  

  
    mapreduce.ifile.readahead.bytes
    4194304
    Configuration key to set the IFile readahead length in bytes.
    
  
  


  mapreduce.jobtracker.taskcache.levels
  2
   This is the max level of the task cache. For example, if
    the level is 2, the tasks cached are at the host level and at the rack
    level.
  



  mapreduce.job.queuename
  default
   Queue to which a job is submitted. This must match one of the
    queues defined in mapred-queues.xml for the system. Also, the ACL setup
    for the queue must allow the current user to submit a job to the queue.
    Before specifying a queue, ensure that the system is configured with 
    the queue, and access is allowed for submitting jobs to the queue.
  


  
    mapreduce.job.tags
    
     Tags for the job that will be passed to YARN at submission 
      time. Queries to YARN for applications can filter on these tags.
    
  


  mapreduce.cluster.acls.enabled
  false
   Specifies whether ACLs should be checked
    for authorization of users for doing various queue and job level operations.
    ACLs are disabled by default. If enabled, access control checks are made by
    JobTracker and TaskTracker when requests are made by users for queue
    operations like submit job to a queue and kill a job in the queue and job
    operations like viewing the job-details (See mapreduce.job.acl-view-job)
    or for modifying the job (See mapreduce.job.acl-modify-job) using
    Map/Reduce APIs, RPCs or via the console and web user interfaces.
    For enabling this flag(mapreduce.cluster.acls.enabled), this is to be set
    to true in mapred-site.xml on JobTracker node and on all TaskTracker nodes.
  



  mapreduce.job.acl-modify-job
   
   Job specific access-control list for 'modifying' the job. It
    is only used if authorization is enabled in Map/Reduce by setting the
    configuration property mapreduce.cluster.acls.enabled to true.
    This specifies the list of users and/or groups who can do modification
    operations on the job. For specifying a list of users and groups the
    format to use is "user1,user2 group1,group". If set to '*', it allows all
    users/groups to modify this job. If set to ' '(i.e. space), it allows
    none. This configuration is used to guard all the modifications with respect
    to this job and takes care of all the following operations:
      o killing this job
      o killing a task of this job, failing a task of this job
      o setting the priority of this job
    Each of these operations are also protected by the per-queue level ACL
    "acl-administer-jobs" configured via mapred-queues.xml. So a caller should
    have the authorization to satisfy either the queue-level ACL or the
    job-level ACL.

    Irrespective of this ACL configuration, (a) job-owner, (b) the user who
    started the cluster, (c) members of an admin configured supergroup
    configured via mapreduce.cluster.permissions.supergroup and (d) queue
    administrators of the queue to which this job was submitted to configured
    via acl-administer-jobs for the specific queue in mapred-queues.xml can
    do all the modification operations on a job.

    By default, nobody else besides job-owner, the user who started the cluster,
    members of supergroup and queue administrators can perform modification
    operations on a job.
  



  mapreduce.job.acl-view-job
   
   Job specific access-control list for 'viewing' the job. It is
    only used if authorization is enabled in Map/Reduce by setting the
    configuration property mapreduce.cluster.acls.enabled to true.
    This specifies the list of users and/or groups who can view private details
    about the job. For specifying a list of users and groups the
    format to use is "user1,user2 group1,group". If set to '*', it allows all
    users/groups to modify this job. If set to ' '(i.e. space), it allows
    none. This configuration is used to guard some of the job-views and at
    present only protects APIs that can return possibly sensitive information
    of the job-owner like
      o job-level counters
      o task-level counters
      o tasks' diagnostic information
      o task-logs displayed on the TaskTracker web-UI and
      o job.xml showed by the JobTracker's web-UI
    Every other piece of information of jobs is still accessible by any other
    user, for e.g., JobStatus, JobProfile, list of jobs in the queue, etc.

    Irrespective of this ACL configuration, (a) job-owner, (b) the user who
    started the cluster, (c) members of an admin configured supergroup
    configured via mapreduce.cluster.permissions.supergroup and (d) queue
    administrators of the queue to which this job was submitted to configured
    via acl-administer-jobs for the specific queue in mapred-queues.xml can
    do all the view operations on a job.

    By default, nobody else besides job-owner, the user who started the
    cluster, memebers of supergroup and queue administrators can perform
    view operations on a job.
  



  mapreduce.tasktracker.indexcache.mb
  10
   The maximum memory that a task tracker allows for the 
    index cache that is used when serving map outputs to reducers.
  



  mapreduce.job.token.tracking.ids.enabled
  false
  Whether to write tracking ids of tokens to
    job-conf. When true, the configuration property
    "mapreduce.job.token.tracking.ids" is set to the token-tracking-ids of
    the job



  mapreduce.job.token.tracking.ids
  
  When mapreduce.job.token.tracking.ids.enabled is
    set to true, this is set by the framework to the
    token-tracking-ids used by the job.



  mapreduce.task.merge.progress.records
  10000
   The number of records to process during merge before
   sending a progress notification to the TaskTracker.
  



  mapreduce.task.combine.progress.records
  10000
   The number of records to process during combine output collection
   before sending a progress notification.
  



  mapreduce.job.reduce.slowstart.completedmaps
  0.05
  Fraction of the number of maps in the job which should be 
  complete before reduces are scheduled for the job. 
  



mapreduce.job.complete.cancel.delegation.tokens
  true
   if false - do not unregister/cancel delegation tokens from 
    renewal, because same tokens may be used by spawned jobs
  



  mapreduce.tasktracker.taskcontroller
  org.apache.hadoop.mapred.DefaultTaskController
  TaskController which is used to launch and manage task execution 
  



  mapreduce.tasktracker.group
  
  Expert: Group to which TaskTracker belongs. If 
   LinuxTaskController is configured via mapreduce.tasktracker.taskcontroller,
   the group owner of the task-controller binary should be same as this group.
  



  mapreduce.shuffle.port
  13562
  Default port that the ShuffleHandler will run on. ShuffleHandler 
   is a service run at the NodeManager to facilitate transfers of intermediate 
   Map outputs to requesting Reducers.
  



  mapreduce.job.reduce.shuffle.consumer.plugin.class
  org.apache.hadoop.mapreduce.task.reduce.Shuffle
  
  Name of the class whose instance will be used
  to send shuffle requests by reducetasks of this job.
  The class must be an instance of org.apache.hadoop.mapred.ShuffleConsumerPlugin.
  





  mapreduce.tasktracker.healthchecker.script.path
  
  Absolute path to the script which is
  periodicallyrun by the node health monitoring service to determine if
  the node is healthy or not. If the value of this key is empty or the
  file does not exist in the location configured here, the node health
  monitoring service is not started.



  mapreduce.tasktracker.healthchecker.interval
  60000
  Frequency of the node health script to be run,
  in milliseconds



  mapreduce.tasktracker.healthchecker.script.timeout
  600000
  Time after node health script should be killed if 
  unresponsive and considered that the script has failed.



  mapreduce.tasktracker.healthchecker.script.args
  
  List of arguments which are to be passed to 
  node health script when it is being launched comma seperated.
  







 mapreduce.job.counters.limit
  120
  Limit on the number of user counters allowed per job.
  



  mapreduce.framework.name
  local
  The runtime framework for executing MapReduce jobs.
  Can be one of local, classic or yarn.
  



  yarn.app.mapreduce.am.staging-dir
  /tmp/hadoop-yarn/staging
  The staging dir used while submitting jobs.
  



  mapreduce.am.max-attempts
  2
  The maximum number of application attempts. It is a
  application-specific setting. It should not be larger than the global number
  set by resourcemanager. Otherwise, it will be override. The default number is
  set to 2, to allow at least one retry for AM.




 mapreduce.job.end-notification.url
 
 Indicates url which will be called on completion of job to inform
              end status of job.
              User can give at most 2 variables with URI : $jobId and $jobStatus.
              If they are present in URI, then they will be replaced by their
              respective values.




  mapreduce.job.end-notification.retry.attempts
  0
  The number of times the submitter of the job wants to retry job
    end notification if it fails. This is capped by
    mapreduce.job.end-notification.max.attempts



  mapreduce.job.end-notification.retry.interval
  1000
  The number of milliseconds the submitter of the job wants to
    wait before job end notification is retried if it fails. This is capped by
    mapreduce.job.end-notification.max.retry.interval



  mapreduce.job.end-notification.max.attempts
  5
  true
  The maximum number of times a URL will be read for providing job
    end notification. Cluster administrators can set this to limit how long
    after end of a job, the Application Master waits before exiting. Must be
    marked as final to prevent users from overriding this.
  


  
    mapreduce.job.log4j-properties-file
    
    Used to override the default settings of log4j in container-log4j.properties
    for NodeManager. Like container-log4j.properties, it requires certain
    framework appenders properly defined in this overriden file. The file on the
    path will be added to distributed cache and classpath. If no-scheme is given
    in the path, it defaults to point to a log4j file on the local FS.
    
  


  mapreduce.job.end-notification.max.retry.interval
  5000
  true
  The maximum amount of time (in milliseconds) to wait before
     retrying job end notification. Cluster administrators can set this to
     limit how long the Application Master waits before exiting. Must be marked
     as final to prevent users from overriding this.



  yarn.app.mapreduce.am.env
  
  User added environment variables for the MR App Master 
  processes. Example :
  1) A=foo  This will set the env variable A to foo
  2) B=$B:c This is inherit tasktracker's B env variable.  
  



  yarn.app.mapreduce.am.admin.user.env
  
   Environment variables for the MR App Master 
  processes for admin purposes. These values are set first and can be 
  overridden by the user env (yarn.app.mapreduce.am.env) Example :
  1) A=foo  This will set the env variable A to foo
  2) B=$B:c This is inherit app master's B env variable.  
  



  yarn.app.mapreduce.am.command-opts
  -Xmx1024m
  Java opts for the MR App Master processes.  
  The following symbol, if present, will be interpolated: @taskid@ is replaced 
  by current TaskID. Any other occurrences of '@' will go unchanged.
  For example, to enable verbose gc logging to a file named for the taskid in
  /tmp and to set the heap maximum to be a gigabyte, pass a 'value' of:
        -Xmx1024m -verbose:gc -Xloggc:/tmp/@[email protected]
  
  Usage of -Djava.library.path can cause programs to no longer function if
  hadoop native libraries are used. These values should instead be set as part 
  of LD_LIBRARY_PATH in the map / reduce JVM env using the mapreduce.map.env and 
  mapreduce.reduce.env config settings. 
  



  yarn.app.mapreduce.am.admin-command-opts
  
  Java opts for the MR App Master processes for admin purposes.
  It will appears before the opts set by yarn.app.mapreduce.am.command-opts and
  thus its options can be overridden user. 
  
  Usage of -Djava.library.path can cause programs to no longer function if
  hadoop native libraries are used. These values should instead be set as part 
  of LD_LIBRARY_PATH in the map / reduce JVM env using the mapreduce.map.env and 
  mapreduce.reduce.env config settings. 
  



  yarn.app.mapreduce.am.job.task.listener.thread-count
  30
  The number of threads used to handle RPC calls in the 
    MR AppMaster from remote tasks



  yarn.app.mapreduce.am.job.client.port-range
  
  Range of ports that the MapReduce AM can use when binding.
    Leave blank if you want all possible ports.  
    For example 50000-50050,50100-50200



  yarn.app.mapreduce.am.job.committer.cancel-timeout
  60000
  The amount of time in milliseconds to wait for the output
    committer to cancel an operation if the job is killed



  yarn.app.mapreduce.am.job.committer.commit-window
  10000
  Defines a time window in milliseconds for output commit
  operations.  If contact with the RM has occurred within this window then
  commits are allowed, otherwise the AM will not allow output commits until
  contact with the RM has been re-established.



  mapreduce.fileoutputcommitter.algorithm.version
  1
  The file output committer algorithm version
  valid algorithm version number: 1 or 2
  default to 1, which is the original algorithm

  In algorithm version 1,

  1. commitTask will rename directory
  $joboutput/_temporary/$appAttemptID/_temporary/$taskAttemptID/
  to
  $joboutput/_temporary/$appAttemptID/$taskID/

  2. recoverTask will also do a rename
  $joboutput/_temporary/$appAttemptID/$taskID/
  to
  $joboutput/_temporary/($appAttemptID + 1)/$taskID/

  3. commitJob will merge every task output file in
  $joboutput/_temporary/$appAttemptID/$taskID/
  to
  $joboutput/, then it will delete $joboutput/_temporary/
  and write $joboutput/_SUCCESS

  It has a performance regression, which is discussed in MAPREDUCE-4815.
  If a job generates many files to commit then the commitJob
  method call at the end of the job can take minutes.
  the commit is single-threaded and waits until all
  tasks have completed before commencing.

  algorithm version 2 will change the behavior of commitTask,
  recoverTask, and commitJob.

  1. commitTask will rename all files in
  $joboutput/_temporary/$appAttemptID/_temporary/$taskAttemptID/
  to $joboutput/

  2. recoverTask actually doesn't require to do anything, but for
  upgrade from version 1 to version 2 case, it will check if there
  are any files in
  $joboutput/_temporary/($appAttemptID - 1)/$taskID/
  and rename them to $joboutput/

  3. commitJob can simply delete $joboutput/_temporary and write
  $joboutput/_SUCCESS

  This algorithm will reduce the output commit time for
  large jobs by having the tasks commit directly to the final
  output directory as they were completing and commitJob had
  very little to do.
  



  yarn.app.mapreduce.am.scheduler.heartbeat.interval-ms
  1000
  The interval in ms at which the MR AppMaster should send
    heartbeats to the ResourceManager



  yarn.app.mapreduce.client-am.ipc.max-retries
  3
  The number of client retries to the AM - before reconnecting
    to the RM to fetch Application Status.



  yarn.app.mapreduce.client-am.ipc.max-retries-on-timeouts
  3
  The number of client retries on socket timeouts to the AM - before
    reconnecting to the RM to fetch Application Status.



  yarn.app.mapreduce.client.max-retries
  3
  The number of client retries to the RM/HS before
    throwing exception. This is a layer above the ipc.



  yarn.app.mapreduce.am.resource.mb
  1536
  The amount of memory the MR AppMaster needs.



  yarn.app.mapreduce.am.resource.cpu-vcores
  1
  
      The number of virtual CPU cores the MR AppMaster needs.
  



  yarn.app.mapreduce.am.hard-kill-timeout-ms
  10000
  
     Number of milliseconds to wait before the job client kills the application.
  



  yarn.app.mapreduce.client.job.max-retries
  0
  The number of retries the client will make for getJob and
  dependent calls.  The default is 0 as this is generally only needed for
  non-HDFS DFS where additional, high level retries are required to avoid
  spurious failures during the getJob call.  30 is a good value for
  WASB



  yarn.app.mapreduce.client.job.retry-interval
  2000
  The delay between getJob retries in ms for retries configured
  with yarn.app.mapreduce.client.job.max-retries.



  CLASSPATH for MR applications. A comma-separated list
  of CLASSPATH entries. If mapreduce.application.framework is set then this
  must specify the appropriate classpath for that archive, and the name of
  the archive must be present in the classpath.
  If mapreduce.app-submission.cross-platform is false, platform-specific
  environment vairable expansion syntax would be used to construct the default
  CLASSPATH entries.
  For Linux:
  $HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*,
  $HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*.
  For Windows:
  %HADOOP_MAPRED_HOME%/share/hadoop/mapreduce/*,
  %HADOOP_MAPRED_HOME%/share/hadoop/mapreduce/lib/*.

  If mapreduce.app-submission.cross-platform is true, platform-agnostic default
  CLASSPATH for MR applications would be used:
  {{HADOOP_MAPRED_HOME}}/share/hadoop/mapreduce/*,
  {{HADOOP_MAPRED_HOME}}/share/hadoop/mapreduce/lib/*
  Parameter expansion marker will be replaced by NodeManager on container
  launch based on the underlying OS accordingly.
  
   mapreduce.application.classpath
   



  If enabled, user can submit an application cross-platform
  i.e. submit an application from a Windows client to a Linux/Unix server or
  vice versa.
  
  mapreduce.app-submission.cross-platform
  false



  Path to the MapReduce framework archive. If set, the framework
    archive will automatically be distributed along with the job, and this
    path would normally reside in a public location in an HDFS filesystem. As
    with distributed cache files, this can be a URL with a fragment specifying
    the alias to use for the archive name. For example,
    hdfs:/mapred/framework/hadoop-mapreduce-2.1.1.tar.gz#mrframework would
    alias the localized archive as "mrframework".

    Note that mapreduce.application.classpath must include the appropriate
    classpath for the specified framework. The base name of the archive, or
    alias of the archive if an alias is used, must appear in the specified
    classpath.
  
   mapreduce.application.framework.path
   



   mapreduce.job.classloader
   false
  Whether to use a separate (isolated) classloader for
    user classes in the task JVM.



   mapreduce.job.classloader.system.classes
   
  Used to override the default definition of the system classes for
    the job classloader. The system classes are a comma-separated list of
    patterns that indicate whether to load a class from the system classpath,
    instead from the user-supplied JARs, when mapreduce.job.classloader is
    enabled.

    A positive pattern is defined as:
        1. A single class name 'C' that matches 'C' and transitively all nested
            classes 'C$*' defined in C;
        2. A package name ending with a '.' (e.g., "com.example.") that matches
            all classes from that package.
    A negative pattern is defined by a '-' in front of a positive pattern
    (e.g., "-com.example.").

    A class is considered a system class if and only if it matches one of the
    positive patterns and none of the negative ones. More formally:
    A class is a member of the inclusion set I if it matches one of the positive
    patterns. A class is a member of the exclusion set E if it matches one of
    the negative patterns. The set of system classes S = I \ E.
  





  mapreduce.jobhistory.address
  0.0.0.0:10020
  MapReduce JobHistory Server IPC host:port



  mapreduce.jobhistory.webapp.address
  0.0.0.0:19888
  MapReduce JobHistory Server Web UI host:port



  mapreduce.jobhistory.keytab
  
    Location of the kerberos keytab file for the MapReduce
    JobHistory Server.
  
  /etc/security/keytab/jhs.service.keytab



  mapreduce.jobhistory.principal
  
    Kerberos principal name for the MapReduce JobHistory Server.
  
  jhs/[email protected]



  mapreduce.jobhistory.intermediate-done-dir
  ${yarn.app.mapreduce.am.staging-dir}/history/done_intermediate
  



  mapreduce.jobhistory.done-dir
  ${yarn.app.mapreduce.am.staging-dir}/history/done
  



  mapreduce.jobhistory.cleaner.enable
  true
  



  mapreduce.jobhistory.cleaner.interval-ms
  86400000
   How often the job history cleaner checks for files to delete, 
  in milliseconds. Defaults to 86400000 (one day). Files are only deleted if
  they are older than mapreduce.jobhistory.max-age-ms.
  



  mapreduce.jobhistory.max-age-ms
  604800000
   Job history files older than this many milliseconds will
  be deleted when the history cleaner runs. Defaults to 604800000 (1 week).
  



  mapreduce.jobhistory.client.thread-count
  10
  The number of threads to handle client API requests



  mapreduce.jobhistory.datestring.cache.size
  200000
  Size of the date string cache. Effects the number of directories
  which will be scanned to find a job.



  mapreduce.jobhistory.joblist.cache.size
  20000
  Size of the job list cache



  mapreduce.jobhistory.loadedjobs.cache.size
  5
  Size of the loaded job cache



  mapreduce.jobhistory.move.interval-ms
  180000
  Scan for history files to more from intermediate done dir to done
  dir at this frequency.
  



  mapreduce.jobhistory.move.thread-count
  3
  The number of threads used to move files.



  mapreduce.jobhistory.store.class
  
  The HistoryStorage class to use to cache history data.



  mapreduce.jobhistory.minicluster.fixed.ports
  false
  Whether to use fixed ports with the minicluster



  mapreduce.jobhistory.admin.address
  0.0.0.0:10033
  The address of the History server admin interface.



  mapreduce.jobhistory.admin.acl
  *
  ACL of who can be admin of the History server.



  mapreduce.jobhistory.recovery.enable
  false
  Enable the history server to store server state and recover
  server state upon startup.  If enabled then
  mapreduce.jobhistory.recovery.store.class must be specified.



  mapreduce.jobhistory.recovery.store.class
  org.apache.hadoop.mapreduce.v2.hs.HistoryServerFileSystemStateStoreService
  The HistoryServerStateStoreService class to store history server
  state for recovery.



  mapreduce.jobhistory.recovery.store.fs.uri
  ${hadoop.tmp.dir}/mapred/history/recoverystore
  
  The URI where history server state will be stored if
  HistoryServerFileSystemStateStoreService is configured as the recovery
  storage class.



  mapreduce.jobhistory.recovery.store.leveldb.path
  ${hadoop.tmp.dir}/mapred/history/recoverystore
  The URI where history server state will be stored if
  HistoryServerLeveldbSystemStateStoreService is configured as the recovery
  storage class.



  mapreduce.jobhistory.http.policy
  HTTP_ONLY
  
    This configures the HTTP endpoint for JobHistoryServer web UI.
    The following values are supported:
    - HTTP_ONLY : Service is provided only on http
    - HTTPS_ONLY : Service is provided only on https
  



  yarn.app.mapreduce.am.containerlauncher.threadpool-initial-size
  10
  The initial size of thread pool to launch containers in the
    app master.

你可能感兴趣的:(Hadoop)

hive底层原理 sql执行过程_Hive原理总结（完整版）
目录课程大纲(HIVE增强)31.Hive基本概念41.1Hive简介41.1.1什么是Hive41.1.2为什么使用Hive41.1.3Hive的特点41.2Hive架构51.2.1架构图51.2.2基本组成51.2.3各组件的基本功能51.3Hive与Hadoop的关系61.4Hive与传统数据库对比61.5Hive的数据存储62.Hive基本操作72.1DDL操作72.1.1创建表72.1.
六、深度剖析 Hadoop 分布式文件系统（HDFS）的数据存储机制与读写流程
深度剖析Hadoop分布式文件系统（HDFS）的数据存储机制与读写流程在当今大数据领域当中，Hadoop分布式文件系统（HDFS）作为极为关键的核心组件之一，为海量规模的数据的存储以及处理构筑起了坚实无比的根基。本文将会对HDFS的数据存储机制以及读写流程展开全面且深入的探究，通过将原理与实际的实例紧密结合的方式，助力广大读者更加全面地理解HDFS的工作原理以及其具体的应用场景。一、HDFS概述H
Linux教程（4）----[hive数据仓库工具] .房东的猫 Linux教程（完善中~~）linux
Hive基本概念Hive简介什么是HiveHive是基于Hadoop的一个数据仓库工具，可以将结构化的数据文件映射为一张数据库表，并提供类SQL查询功能。为什么使用Hive直接使用hadoop所面临的问题人员学习成本太高
【Hadoop】onekey_install脚本菜萝卜子 Linux hadoop 大数据分布式
hosts[root@kafka01hadoop-script]#cat/etc/hosts127.0.0.1localhostlocalhost.localdomainlocalhost4localhost4.localdomain4::1localhostlocalhost.localdomainlocalhost6localhost6.localdomain6192.168.100.150k
Hadoop与云原生集成：弹性扩缩容与OSS存储分离架构深度解析
Hadoop与云原生集成的必要性Hadoop在大数据领域的基石地位作为大数据处理领域的奠基性技术，Hadoop自2006年诞生以来已形成包含HDFS、YARN、MapReduce三大核心组件的完整生态体系。根据CSDN技术社区的分析报告，全球超过75%的《财富》500强企业仍在使用Hadoop处理EB级数据，其分布式文件系统HDFS通过数据分片（默认128MB块大小）和三副本存储机制，成功解决了P
Hive简介
文章目录Hive简介Hive特点Hive和RDBMS的对比Hive的架构Hive的数据组织Hive数据类型Hive简介1、Hive由Facebook实现并开源2、是基于Hadoop的一个数据仓库工具3、可以将结构化的数据映射为一张数据库表4、并提供HQL(HiveSQL)查询功能5、底层数据是存储在HDFS上6、Hive的本质是将SQL语句转换为MapReduce任务运行7、使不熟悉MapRedu
python基于Hadoop的NBA球员大数据分析与可视化系统
目录技术栈介绍具体实现截图系统设计研究方法：设计步骤设计流程核心代码部分展示研究方法详细视频演示试验方案论文大纲源码获取/详细视频演示技术栈介绍Django-SpringBoot-php-Node.js-flask本课题的研究方法和研究步骤基本合理，难度适中，本选题是学生所学专业知识的延续，符合学生专业发展方向，对于提高学生的基本知识和技能以及钻研能力有益。该学生能够在预定时间内完成该课题的设计。
大数据技术之集群数据迁移
dfs.namenode.rpc-address.nameservice1.namenode30hadoop104:8020dfs.namenode.rpc-address.nameservice1.namenode37hadoop106:8020dfs.namenode.http-address.nameservice1.namenode30hadoop104:9870dfs.namenode.
HIVE（二） 2301_78012738 hive 数据仓库
目录访问HIVE的三种方式DDLDML数据操作向表中装载数据数据导出常用函数Like和RLike分组Join排序分区表和分桶表访问HIVE的三种方式启动Hive命令，CtrlC退出客户端，执行测试语句，与sql一致[wyc@hadoop102hive]$bin/hive经验小结：在hive中执行语句报错：ExecutionError,returncode2fromorg.apache.hadoop
安全运维的 “五层防护”：构建全方位安全体系 KKKlucifer 安全运维
在数字化运维场景中，异构系统复杂、攻击手段隐蔽等挑战日益突出。保旺达基于“全域纳管-身份认证-行为监测-自动响应-审计溯源”的五层防护架构，融合AI、零信任等技术，构建全链路安全运维体系，以下从技术逻辑与实践落地展开解析：第一层：全域资产纳管——筑牢安全根基挑战云网基础设施包含分布式计算（Hadoop/Spark）、数据流处理（Storm/Flink）等异构组件，通信协议繁杂，传统方案难以全面纳管
Hive 事务表(ACID)问题梳理
文章目录问题描述分析原因什么是事务表概念事务表和普通内部表的区别相关配置事务表的适用场景注意事项设计原理与实现文件管理格式参考博客问题描述工作中需要使用pyspark读取Hive中的数据，但是发现可以获取metastore，外部表的数据可以读取，内部表数据有些表报错信息是：AnalysisException:org.apache.hadoop.hive.ql.metadata.HiveExcept
Docker快速构建Hive测试环境静谧星光 docker hive 容器编程
Docker是一种流行的容器化平台，可以帮助我们快速构建和管理应用程序的环境。在本文中，我们将学习如何使用Docker快速构建Hive测试环境。Hive是一个基于Hadoop的数据仓库基础设施，它提供了一种类似于SQL的查询语言，用于分析和处理大规模数据集。步骤1：安装Docker和DockerCompose首先，我们需要安装Docker和DockerCompose。您可以根据您的操作系统类型，从
HDFS 伪分布模式搭建与使用全攻略（适合初学者 & 开发测试环境） huihui450 hdfs hadoop 大数据
HDFS（HadoopDistributedFileSystem）作为Hadoop生态系统的核心组件，广泛应用于海量数据的分布式存储场景。对于开发者而言，伪分布模式提供了一种低成本、高还原度的学习与测试方式。本文将详细介绍如何在本地搭建并使用HDFS的伪分布模式，包括环境准备、配置过程、常用命令及常见问题排查，帮助你快速入门Hadoop分布式文件系统的实践操作。一、什么是伪分布模式？Hadoop有
YARN container cpu超核如何解决 fzip YARN 超核
在ApacheHadoopYARN中，ContainerCPU超核（即Container使用的CPU资源超过分配量）是一个常见问题，可能导致集群性能下降或不稳定。以下是解决该问题的详细步骤：1.问题诊断1.1确认超核现象查看YARNWebUI：访问http://:8088，检查Container的CPU使用率是否持续超过分配的vCore数。检查NodeManager日志：查看/var/log/ha
Hadoop-Mapreduce入门
Hadoop-Mapreduce入门MapReduce介绍mapreduce设计MapReduce编程规范入门案例WordCountMapReduce介绍MapReduce的思想核心是“分而治之”，适用于大量复杂的任务处理场景（大规模数据处理场景）。知识。Map负责“分”，把复杂的任务分解为若干个“简单的任务”来并行处理。可以进行拆分的前提是这些小任务可以并行计算，彼此间几乎没有依赖关系。Redu
Hadoop MapReduce入门且行且安~ 数据分析进阶之路 Linux命令 hadoop MapReduce入门
入门简介计算过程分为两个阶段Map和ReduceMap阶段并行处理输入数据Reduce阶段对Map结果进行汇总针对python语言来说：map函数或者reduce函数来说，输出的数据格式为元组tuple一个简单的MapReduce程序只需要指定map()reduce()input()output()剩下的由框架完成。Linux常见命令：-读取文件（文本文件，在Windows下使用记事本打开的文件）
Hadoop MapReduce 入门
一、Hadoop3.0.4环境准备1.环境要求Java8（Hadoop3.0.4不支持Java11+）单节点或多节点Linux系统（推荐Ubuntu18.04+）至少4GB内存（建议8GB+）50GB以上磁盘空间2.安装Java#安装Java8sudoapt-getinstallopenjdk-8-jdk#验证安装java-version3.下载与安装Hadoop3.0.4#下载Hadoop3.0
管理大数据存储的十大技巧 weixin_34238633 大数据数据库运维
在1990年，每一台应用服务器都倾向拥有直连式系统(DAS)。SAN的构建则是为了更大的规模和更高的效率提供共享的池存储。Hadoop已经逆转了这一趋势回归DAS。每一个Hadoop集群都拥有自身的——虽然是横向扩展型——直连式存储，这有助于Hadoop管理数据本地化，但也放弃了共享存储的规模和效率。如果你拥有多个实例或Hadoop发行版，那么你就将得到多个横向扩展的存储集群。而我们所遇到的最大挑
MapReduce数据处理过程2万字保姆级教程大模型大数据攻城狮 mapreduce 大数据 yarn cdh hadoop 大数据面试 shuffle
目录1.MapReduce的核心思想：分而治之的艺术2.HadoopMapReduce的架构：从宏观到微观3.WordCount实例：从代码到执行的完整旅程4.源码剖析：Job.submit的魔法5.Map任务的执行：从分片到键值对6.Shuffle阶段：MapReduce的幕后英雄7.Reduce任务的执行：从数据聚合到最终输出8.Combiner的魔法：提前聚合的性能利器9.Partition
Hadoop核心组件最全介绍 Cachel wood 大数据开发 hadoop 大数据分布式 spark 数据库计算机网络
文章目录一、Hadoop核心组件1.HDFS(HadoopDistributedFileSystem)2.YARN(YetAnotherResourceNegotiator)3.MapReduce二、数据存储与管理1.HBase2.Hive3.HCatalog4.Phoenix三、数据处理与计算1.Spark2.Flink3.Tez4.Storm5.Presto6.Impala四、资源调度与集群管
数据仓库技术及应用（Hive 产生背景与架构设计，存储模型与数据类型）娟恋无暇数据仓库笔记 hive
1.Hive产生背景传统Hadoop架构存在的一些问题：MapReduce编程必须掌握Java，门槛较高传统数据库开发、DBA、运维人员学习门槛高HDFS上没有Schema的概念，仅仅是一个纯文本文件Hive的产生：为了让用户从一个现有数据基础架构转移到Hadoop上现有数据基础架构大多基于关系型数据库和SQL查询Facebook诞生了Hive2.Hive是什么官网：https://hive.ap
缺少关键的 MapReduce 框架文件
计算圆周率时提醒Hadoop集群缺少关键的MapReduce框架文件mr-framework.tar.gz在http://master:7180/cmf/services/4/status里直接安装再次运行代码：
大数据 ETL 工具 Sqoop 深度解析与实战指南
一、Sqoop核心理论与应用场景1.1设计思想与技术定位Sqoop是Apache旗下的开源数据传输工具，核心设计基于MapReduce分布式计算框架，通过并行化的Map任务实现高效的数据批量迁移。其特点包括：批处理特性：基于MapReduce作业实现导入/导出，适合大规模离线数据迁移，不支持实时数据同步。异构数据源连接：支持关系型数据库（如MySQL、Oracle）与Hadoop生态（HDFS、H
安装Hadoop集群&入门&源码编译只年大数据 Hadoop hadoop 大数据分布式
安装Hadoop集群完全分布式先决条件准备三台机器NameStaticIPDESCbigdata102192.168.1.102DataNode、NodeManager、NameNodebigdata103192.168.1.103DataNode、NodeManager、ResourceManagerbigdata104192.168.1.104DataNode、NodeManager、Seco
Hadoop之HDFS 只年大数据 Hadoop HDFS hadoop hdfs 大数据
Hadoop之HDFSHDFS的Shell操作启动Hadoop集群（方便后续测试）[atguigu@hadoop102~]$sbin/start-dfs.sh[atguigu@hadoop102~]$sbin/start-yarn.sh-help：输出这个命令参数[atguigu@hadoop102~]$hadoopfs-helprm-ls：显示目录信息[atguigu@hadoop102~]$h
安装Python3.12报错：HTTP 429 TOO MANY REQUESTS for url ＜https://mirrors.ustc.edu.cn/anaconda/pkgs/free/li
安装Python3.12报错(base)[xxx@hadoop104python_shell]$condacreate--namepythonThirteenpython=3.12报错如下：Retrievingnotices:…working…ERRORconda.notices.fetch:get_channel_notice_response(63):Requesterrorforchanne
大数据分析技术的学习路径，不是绝对的，仅供参考水云桐程序员学习大数据数据分析学习方法
阶段一：基础筑基（1-3个月）1.编程语言：Python：掌握基础语法、数据结构、流程控制、函数、面向对象编程、常用库（NumPy,Pandas）。SQL：精通SELECT语句（过滤、排序、分组、聚合、连接）、DDL/DML基础。理解关系型数据库概念（表、主键、外键、索引）。MySQL或PostgreSQL是很好的起点。Java/Scala：深入理解Hadoop/Spark等框架会更有优势。初学者
头歌作业-HBase 开发：使用Java操作HBase http_lizi hbase java python
第一关packagestep1;importjava.io.IOException;importorg.apache.hadoop.conf.Configuration;importorg.apache.hadoop.hbase.HBaseConfiguration;importorg.apache.hadoop.hbase.HColumnDescriptor;importorg.apache.h
HDFS中fsimage和edits究竟是什么清平乐的技术博客大数据运维 hdfs hadoop 大数据
fsimage和edits是HadoopHDFS(Hadoop分布式文件系统)中的两个关键组件，用于存储文件系统的元数据，以确保文件系统的持久性和一致性。在理解它们的作用之前，我们先了解一下HDFS的基本工作原理。HDFS采用了一种分布式文件系统的架构，其中数据被划分成块并分布在不同的数据节点上，而元数据(文件和目录的信息)则由单独的组件进行管理。元数据的持久性和一致性非常重要，因为文件系统的正确
spark处理kafka的用户行为数据写入hive 月光一族吖 spark kafka hive
在CentOS上部署Hadoop（Hadoop3.4.1）和Hive（Hive3.1.2）的详细步骤说明。这份指南面向单机安装（伪集群模式），如果需要搭建真正的多节点集群，各节点间的网络互访、SSH免密登录以及配置同步需进一步调整。注意：本指南假设你已拥有root权限或者具有sudo权限，并且系统连接Internet（用于下载安装包）。步骤中的版本号可根据实际需要进行更改。一、环境准备更新系统软件
VMware Workstation 11 或者 VMware Player 7安装MAC OS X 10.10 Yosemite iwindyforest vmware mac os 10.10 workstation player
最近尝试了下VMware下安装MacOS 系统，安装过程中发现网上可供参考的文章都是VMware Workstation 10以下， MacOS X 10.9以下的文章，只能提供大概的思路，但是实际安装起来由于版本问题，走了不少弯路，所以我尝试写以下总结，希望能给有兴趣安装OSX的人提供一点帮助。写在前面的话：其实安装好后发现，由于我的th
关于《基于模型驱动的B/S在线开发平台》源代码开源的疑虑？ deathwknight JavaScript java 框架
本人从学习Java开发到现在已有10年整，从一个要自学 java买成javascript的小菜鸟，成长为只会java和javascript语言的老菜鸟（个人邮箱：[email protected]）一路走来，跌跌撞撞。用自己的三年多业余时间，瞎搞一个小东西（基于模型驱动的B/S在线开发平台，非MVC框架、非代码生成）。希望与大家一起分享，同时有许些疑虑，希望有人可以交流下平台
如何把maven项目转成web项目 Kai_Ge maven MyEclipse
创建Web工程，使用eclipse ee创建maven web工程 1.右键项目,选择Project Facets,点击Convert to faceted from 2.更改Dynamic Web Module的Version为2.5.(3.0为Java7的,Tomcat6不支持). 如果提示错误,可能需要在Java Compiler设置Compiler compl
主管？？？ Array_06 工作
转载：http://www.blogjava.net/fastzch/archive/2010/11/25/339054.html 很久以前跟同事参加的培训，同事整理得很详细，必须得转！前段时间，公司有组织中高阶主管及其培养干部进行了为期三天的管理训练培训。三天的课程下来，虽然内容较多，因对老师三天来的课程内容深有感触，故借着整理学习心得的机会，将三天来的培训课程做了一个
python内置函数大全 2002wmj python
最近一直在看python的document，打算在基础方面重点看一下python的keyword、Build-in Function、Build-in Constants、Build-in Types、Build-in Exception这四个方面，其实在看的时候发现整个《The Python Standard Library》章节都是很不错的，其中描述了很多不错的主题。先把Build-in Fu
JSP页面通过JQUERY合并行 357029540 JavaScript jquery
在写程序的过程中我们难免会遇到在页面上合并单元行的情况，如图所示如果对于会的同学可能很简单，但是对没有思路的同学来说还是比较麻烦的，提供一下用JQUERY实现的参考代码 function mergeCell(){ var trs = $("#table tr"); &nb
Java基础冰天百华 java基础
学习函数式编程 package base; import java.text.DecimalFormat; public class Main { public static void main(String[] args) { // Integer a = 4; // Double aa = (double)a / 100000; // Decimal
unix时间戳相互转换 adminjun 转换 unix 时间戳
如何在不同编程语言中获取现在的Unix时间戳(Unix timestamp)？ Java time JavaScript Math.round(new Date().getTime()/1000) getTime()返回数值的单位是毫秒 Microsoft .NET / C# epoch = (DateTime.Now.ToUniversalTime().Ticks - 62135
作为一个合格程序员该做的事 aijuans 程序员
作为一个合格程序员每天该做的事 1、总结自己一天任务的完成情况最好的方式是写工作日志，把自己今天完成了什么事情，遇见了什么问题都记录下来，日后翻看好处多多 2、考虑自己明天应该做的主要工作把明天要做的事情列出来，并按照优先级排列，第二天应该把自己效率最高的时间分配给最重要的工作 3、考虑自己一天工作中失误的地方，并想出避免下一次再犯的方法出错不要紧，最重
由html5视频播放引发的总结 ayaoxinchao html5 视频 video
前言项目中存在视频播放的功能，前期设计是以flash播放器播放视频的。但是现在由于需要兼容苹果的设备，必须采用html5的方式来播放视频。我就出于兴趣对html5播放视频做了简单的了解，不了解不知道，水真是很深。本文所记录的知识一些浅尝辄止的知识，说起来很惭愧。视频结构本该直接介绍html5的<video>的，但鉴于本人对视频
解决httpclient访问自签名https报javax.net.ssl.SSLHandshakeException: sun.security.validat bewithme httpclient
如果你构建了一个https协议的站点，而此站点的安全证书并不是合法的第三方证书颁发机构所签发，那么你用httpclient去访问此站点会报如下错误 javax.net.ssl.SSLHandshakeException: sun.security.validator.ValidatorException: PKIX path bu
Jedis连接池的入门级使用 bijian1013 redis redis数据库 jedis
Jedis连接池操作步骤如下： a.获取Jedis实例需要从JedisPool中获取； b.用完Jedis实例需要返还给JedisPool； c.如果Jedis在使用过程中出错，则也需要还给JedisPool； packag
变与不变 bingyingao 不变变亲情永恒
变与不变周末骑车转到了五年前租住的小区，曾经最爱吃的西北面馆、江西水饺、手工拉面早已不在，各种店铺都换了好几茬，这些是变的。三年前还很流行的一款手机在今天看起来已经落后的不像样子。三年前还运行的好好的一家公司，今天也已经不复存在。一座座高楼拔地而起，
【Scala十】Scala核心四：集合框架之List bit1129 scala
Spark的RDD作为一个分布式不可变的数据集合，它提供的转换操作，很多是借鉴于Scala的集合框架提供的一些函数，因此，有必要对Scala的集合进行详细的了解 1. 泛型集合都是协变的，对于List而言，如果B是A的子类，那么List[B]也是List[A]的子类，即可以把List[B]的实例赋值给List[A]变量 2. 给变量赋值(注意val关键字，a，b
Nested Functions in C bookjovi c closure
Nested Functions 又称closure，属于functional language中的概念，一直以为C中是不支持closure的，现在看来我错了，不过C标准中是不支持的，而GCC支持。既然GCC支持了closure，那么 lexical scoping自然也支持了，同时在C中label也是可以在nested functions中自由跳转的
Java-Collections Framework学习与总结-WeakHashMap BrokenDreams Collections
总结这个类之前，首先看一下Java引用的相关知识。Java的引用分为四种：强引用、软引用、弱引用和虚引用。强引用：就是常见的代码中的引用，如Object o = new Object();存在强引用的对象不会被垃圾收集
读《研磨设计模式》-代码笔记-解释器模式-Interpret bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ package design.pattern; /* * 解释器（Interpreter）模式的意图是可以按照自己定义的组合规则集合来组合可执行对象 * * 代码示例实现XML里面1.读取单个元素的值 2.读取单个属性的值 * 多
After Effects操作&快捷键 cherishLC After Effects
1、快捷键官方文档中文版：https://helpx.adobe.com/cn/after-effects/using/keyboard-shortcuts-reference.html 英文版：https://helpx.adobe.com/after-effects/using/keyboard-shortcuts-reference.html 2、常用快捷键
Maven 常用命令 crabdave maven
Maven 常用命令 mvn archetype:generate mvn install mvn clean mvn clean complie mvn clean test mvn clean install mvn clean package mvn test mvn package mvn site mvn dependency:res
shell bad substitution daizj shell 脚本
#!/bin/sh /data/script/common/run_cmd.exp 192.168.13.168 "impala-shell -islave4 -q 'insert OVERWRITE table imeis.${tableName} select ${selectFields}, ds, fnv_hash(concat(cast(ds as string), im
Java SE 第二讲（原生数据类型 Primitive Data Type） dcj3sjt126com java
Java SE 第二讲： 1. Windows: notepad, editplus, ultraedit, gvim Linux: vi, vim, gedit 2. Java 中的数据类型分为两大类： 1）原生数据类型（Primitive Data Type） 2）引用类型（对象类型）（R
CGridView中实现批量删除 dcj3sjt126com PHP yii
1，CGridView中的columns添加 array( 'selectableRows' => 2, 'footer' => '<button type="button" onclick="GetCheckbox();" style=&
Java中泛型的各种使用 dyy_gusi java 泛型
Java中的泛型的使用：1.普通的泛型使用在使用类的时候后面的<>中的类型就是我们确定的类型。 public class MyClass1<T> {//此处定义的泛型是T private T var; public T getVar() { return var; } public void setVa
Web开发技术十年发展历程 gcq511120594 Web 浏览器数据挖掘
回顾web开发技术这十年发展历程： Ajax 03年的时候我上六年级，那时候网吧刚在小县城的角落萌生。传奇，大话西游第一代网游一时风靡。我抱着试一试的心态给了网吧老板两块钱想申请个号玩玩，然后接下来的一个小时我一直在，注，册，账，号。彼时网吧用的512k的带宽，注册的时候，填了一堆信息，提交，页面跳转，嘣，”您填写的信息有误，请重填”。然后跳转回注册页面，以此循环。我现在时常想，如果当时a
openSession()与getCurrentSession()区别： hetongfei java DAO Hibernate
来自 http://blog.csdn.net/dy511/article/details/6166134 1.getCurrentSession创建的session会和绑定到当前线程,而openSession不会。 2. getCurrentSession创建的线程会在事务回滚或事物提交后自动关闭,而openSession必须手动关闭。这里getCurrentSession本地事务(本地
第一章安装Nginx+Lua开发环境 jinnianshilongnian nginx lua openresty
首先我们选择使用OpenResty，其是由Nginx核心加很多第三方模块组成，其最大的亮点是默认集成了Lua开发环境，使得Nginx可以作为一个Web Server使用。借助于Nginx的事件驱动模型和非阻塞IO，可以实现高性能的Web应用程序。而且OpenResty提供了大量组件如Mysql、Redis、Memcached等等，使在Nginx上开发Web应用更方便更简单。目前在京东如实时价格、秒
HSQLDB In-Process方式访问内存数据库 liyonghui160com
HSQLDB一大特色就是能够在内存中建立数据库，当然它也能将这些内存数据库保存到文件中以便实现真正的持久化。先睹为快！下面是一个In-Process方式访问内存数据库的代码示例：下面代码需要引入hsqldb.jar包（hsqldb-2.2.8） import java.s
Java线程的5个使用技巧 pda158 java 数据结构
Java线程有哪些不太为人所知的技巧与用法？　　萝卜白菜各有所爱。像我就喜欢Java。学无止境，这也是我喜欢它的一个原因。日常工作中你所用到的工具，通常都有些你从来没有了解过的东西，比方说某个方法或者是一些有趣的用法。比如说线程。没错，就是线程。或者确切说是Thread这个类。当我们在构建高可扩展性系统的时候，通常会面临各种各样的并发编程的问题，不过我们现在所要讲的可能会略有不同。
开发资源大整合：编程语言篇——JavaScript（1） shoothao JavaScript
概述：本系列的资源整合来自于github中各个领域的大牛，来收藏你感兴趣的东西吧。程序包管理器管理javascript库并提供对这些库的快速使用与打包的服务。 Bower - 用于web的程序包管理。 component - 用于客户端的程序包管理，构建更好的web应用程序。 spm - 全新的静态的文件包管
避免使用终结函数 vahoa.ma java jvm C++
终结函数（finalizer）通常是不可预测的，常常也是很危险的，一般情况下不是必要的。使用终结函数会导致不稳定的行为、更差的性能，以及带来移植性问题。不要把终结函数当做C++中的析构函数（destructors）的对应物。我自己总结了一下这一条的综合性结论是这样的： 1）在涉及使用资源，使用完毕后要释放资源的情形下，首先要用一个显示的方