fair_scheduler_design_doc.pdf

Hadoop Fair Scheduler Design Document

August 15, 2009

1 Introduction

The Hadoop Fair Scheduler started as a simple means to share MapReduce clusters. Over time, it has grown in functionality to support hierarchical scheduling, preemption, and multiple ways of organizing and weighing jobs. This document explains the goals and features of the Fair Scheduler and its internal design.

2 Fair Scheduler Goals

The Fair Scheduler was designed with four main goals:

1. Run small jobs quickly even if they are sharing a cluster with large jobs. Unlike Hadoop’s built-in FIFO scheduler, fair scheduling lets small jobs make progress even if a large job is running, without starving the large job.

2. Provide guaranteed service levels to “production” jobs, to let them run alongside experimental jobs in a shared cluster.

3. Be simple to administer and configure. The scheduler should do something reasonable “out of the box,” and users should only need to configure it as they discover that they want to use more advanced features.

4. Support reconfiguration at runtime, without requiring a cluster restart.

3 Scheduler Features

This section provides a quick overview of the features of the Fair Scheduler. A detailed usage guide is available in the Hadoop documentation in build/docs/fairscheduler.html.

3.1 Pools

The Fair Scheduler groups jobs into “pools” and performs fair sharing between these pools. Each pool can use either FIFO or fair sharing to schedule jobs internal to the pool. The pool that a job is placed in is determined by a JobConf property, the “pool name property”. By default, this is user.name, so that there is one pool per user. However, different properties can be used, e.g. group.name to have one pool per Unix group.

A common trick is to set the pool name property to an unused property name such as pool.name and make this default to user.name, so that there is one pool per user but it is also possible to place jobs into “special” pools by setting their pool.name directly. The mapred-site.xml snippet below shows how to do this:

    <property>
      <name>mapred.fairscheduler.poolnameproperty</name>
      <value>pool.name</value>
    </property>
    <property>
      <name>pool.name</name>
      <value>${user.name}</value>
    </property>

3.2 Minimum Shares

Normally, active pools (those that contain