mapred-site.xml

To the mapred-site.xml file on every host in your cluster, you must add the following information:

 

Table 13.5. mapred-site.xml

Property NameProperty ValueDescriptionFinal
mapreduce.jobtracker.kerberos.principal jt/_HOST@EXAMPLE.COM Kerberos principal name for the JobTracker 
mapreduce.tasktracker.kerberos.principal tt/_HOST@EXAMPLE.COM Kerberos principal name for the TaskTracker. _HOST" is replaced by the host name of the task tracker. 
hadoop.job.history.user.location none true
mapreduce.jobtracker.keytab.file /etc/security/keytabs/jt.service.keytab The keytab for the JobTracker principal  
mapreduce.tasktracker.keytab.file /etc/security/keytabs/tt.service.keytab The keytab for the Tasktracker principal 
mapreduce.jobtracker.staging.root.dir /user The path prefix for the location of the the staging directories. The next level is always the user's name. It is a path in the default file system  
mapreduce.tasktracker.group hadoop The group that the task controller uses for accessing the task controller. The mapred user must be a member and users should not be members. <question?> 
mapreduce.jobtracker.split.metainfo.maxsize 50000000 If the size of the split metainfo file is larger than this value, the JobTracker will fail the job during initialization. true
mapreduce.history.server.embedded false Should the Job History server be embedded within the JobTracker process true

mapreduce.history.server.http.address

Note: cluster variant

Example:

ip-10-111-59-170.ec2.internal:51111

  

mapreduce.jobhistory.kerberos.principal

Note: cluster variant

jt/_HOST@EXAMPLE.COM Kerberos principal name for JobHistory. This must map to the same user as the JT user. true

mapreduce.jobhistory.keytab.file

Note: cluster variant

/etc/security/keytabs/jt.service.keytab The keytab for the JobHistory principal 
mapred.jobtracker.blacklist.fault-timeout-window

Example:

180

3-hour sliding window - the value is specified in minutes.  
mapred.jobtracker.blacklist.fault-bucket-width

Example:

15

15-minute bucket size - the value is specified in minutes.  
mapred.queue.names default Comma separated list of queues configured for this jobtracker.  

The XML for these entries:

<property>  
        <name>mapreduce.jobtracker.kerberos.principal</name>  
        <value>jt/_HOST@EXAMPLE.COM</value>  
        <description> JT
        user name key.  </description> 
</property>   

<property>  
        <name>mapreduce.tasktracker.kerberos.principal</name>   
        <value>tt/_HOST@EXAMPLE.COM</value>  
        <description>tt
        user name key. "_HOST" is replaced by the host name of the task tracker.   
        </description> 
</property>      

<property>    
        <name>hadoop.job.history.user.location</name>    
        <value>none</value>    
        <final>true</final>  
</property>      

<property>   
        <name>mapreduce.jobtracker.keytab.file</name>   
        <value>/etc/security/keytabs/jt.service.keytab</value>   
        <description>       
        The keytab for the jobtracker principal.   
        </description>   
</property>    

<property>   
        <name>mapreduce.tasktracker.keytab.file</name>   
        <value>/etc/security/keytabs/tt.service.keytab</value>    
        <description>The filename of the keytab for the task
        tracker</description>  
</property>    

<property>   
        <name>mapreduce.jobtracker.staging.root.dir</name>   
        <value>/user</value>  
        <description>The Path prefix for where the staging
        directories should be placed. The next level is always the user's name. It
        is a path in the default file system.</description>  
</property>    

<property>      
        <name>mapreduce.tasktracker.group</name>      
        <value>hadoop</value>      
        <description>The group that the task controller uses for accessing the task controller.
        The mapred user must be a member and users should *not* be
        members.</description>    
</property>    

<property>    
        <name>mapreduce.jobtracker.split.metainfo.maxsize</name>    
        <value>50000000</value>     
        <final>true</final>     
        <description>If the size of the split metainfo file is larger than this, the JobTracker
        will fail the job during    
        initialize.   
        </description>  
</property>  

<property>    
        <name>mapreduce.history.server.embedded</name>     
        <value>false</value>    
        <description>Should job history server be embedded within Job tracker process</description>    
        <final>true</final>  
</property>    

<property>    
        <name>mapreduce.history.server.http.address</name>     
        <!--cluster variant -->     
        <value>ip-10-111-59-170.ec2.internal:51111</value>    
        <description>Http address of the history server</description>    
        <final>true</final>  
</property>    

<property>    
        <name>mapreduce.jobhistory.kerberos.principal</name>     
        <!--cluster variant -->  
        <value>jt/_HOST@EXAMPLE.COM</value>    
        <description>Job history user name key. (must map to same user as JT user)</description>  
</property>    

<property>   
        <name>mapreduce.jobhistory.keytab.file</name>     
        <!--cluster variant -->   
        <value>/etc/security/keytabs/jt.service.keytab</value>   
        <description>The keytab for the job history server
        principal.</description>  
</property>   

<property>  
        <name>mapred.jobtracker.blacklist.fault-timeout-window</name>  
        <value>180</value>  
        <description>     3-hour
        sliding window (value is in minutes)  
        </description> 
</property>   

<property>  
        <name>mapred.jobtracker.blacklist.fault-bucket-width</name>  
        <value>15</value>  
        <description>    
        15-minute bucket size (value is in minutes)  
        </description> 
</property>   

<property>  
        <name>mapred.queue.names</name>  
        <value>default</value>   <description>
        Comma separated list of queues configured for this jobtracker.</description> 
</property>    
    

loading table of contents...