Too many fetch faliuers

akash sabarad

I have a setup, 2 node hadoop cluster on Ubuntu 12.04 and Hadoop 1.2.1. While I am trying to run hadoop word count example I am gettig "Too many fetch faliure error". I have referred many articles but I am unable to figure out what should be the entries in Masters,Slaves and /etc/hosts file. My nodes names are "master" with ip 10.0.0.1 and "slaveone" with ip 10.0.0.2.

I need assistance in what should be the entries in masters,slaves and /etc/hosts file in both master and slave node?

Pradeep Gollakota

If you're unable to upgrade the cluster for whatever reason, you can try the following:

  1. Ensure that your hostname is bound to the network IP and NOT 127.0.0.1 in /etc/hosts
  2. Ensure that you're using only hostnames and not IPs to reference services.
  3. If the above are correct, try the following settings:

set mapred.reduce.slowstart.completed.maps=0.80
set tasktracker.http.threads=80
set mapred.reduce.parallel.copies=(>= 10)(10 should probably be sufficient)

Also checkout this SO post: Why I am getting "Too many fetch-failures" every other day

And this one: Too many fetch failures: Hadoop on cluster (x2)

And also this if the above don't help: http://grokbase.com/t/hadoop/common-user/098k7y5t4n/how-to-deal-with-too-many-fetch-failures For brevity and in interest of time, I'm putting what I found to be the most pertinent here.

The number 1 cause of this is something that causes a connection to get a map output to fail. I have seen: 1) firewall 2) misconfigured ip addresses (ie: the task tracker attempting the fetch received an incorrect ip address when it looked up the name of the tasktracker with the map segment) 3) rare, the http server on the serving tasktracker is overloaded due to insufficient threads or listen backlog, this can happen if the number of fetches per reduce is large and the number of reduces or the number of maps is very large.

There are probably other cases, this recently happened to me when I had 6000 maps and 20 reducers on a 10 node cluster, which I believe was case 3 above. Since I didn't actually need to reduce ( I got my summary data via counters in the map phase) I never re-tuned the cluster.

EDIT: Original answer said "Ensure that your hostname is bound to the network IP and 127.0.0.1 in /etc/hosts"

本文收集自互联网,转载请注明来源。

如有侵权,请联系[email protected] 删除。

编辑于
0

我来说两句

0条评论
登录后参与评论

相关文章

来自分类Dev

ArangoDB Too many open files

来自分类Dev

Entity Framework applying too many migrations

来自分类Dev

MYSQL Too many connections error will not go away

来自分类Dev

Traefik / Portainer-ERR_TOO_MANY_REDIRECTS

来自分类Dev

wordpress:ERR_TOO_MANY_REDIRECTS

来自分类Dev

ERR_TOO_MANY_REDIRECTS - Laravel

来自分类Dev

BluetoothLeAdvertiser AdvertiseCallback引发ADVERTISE_FAILED_TOO_MANY_ADVERTISERS

来自分类Dev

ERR_TOO_MANY_REDIRECTS错误的解决方案

来自分类Dev

Express.js ERR_TOO_MANY_REDIRECTS错误

来自分类Dev

无法加载资源:net :: ERR_TOO_MANY_REDIRECTS

来自分类Dev

PDFjs ERR_TOO_MANY_REDIRECTS请求。Cookies

来自分类Dev

错误:gRPC服务调用中too_many_pings错误

来自分类Dev

春季安全性ERR_TOO_MANY_REDIRECTS

来自分类Dev

无法加载资源jquery ERR_TOO_MANY_REDIRECTS

来自分类Dev

在 HandleUnauthorizedRequest 中获取 err_too_many_redirects

来自分类Dev

添加 SSL 后的 laravel ERR_TOO_MANY_REDIRECTS

来自分类Dev

IIS 在重写规则上得到 ERR_TOO_MANY_REDIRECTS

来自分类Dev

Nginx 和 Cloudflare:err_too_many_redirects

来自分类Dev

如何修复 SoundEffect qml 中的“E libOpenSLES: Too many objects”

来自分类Dev

将行添加到安全配置后,ERR_TOO_MANY_REDIRECTS

来自分类Dev

Android Augmented Reality Pro AR 9: activity gets back when displays too many markers

来自分类Dev

Linux system does not allow creation of sockets because of "Too many open files" *even after a reboot*

来自分类Dev

Grails-Spring Security REST-302响应网:: ERR_TOO_MANY_REDIRECTS

来自分类Dev

如何在VS Code中启用“ too-many-locals” Pylint消息?

来自分类Dev

如何在VS Code中启用“ too-many-locals” Pylint消息?

来自分类Dev

Google Cloud Platform负载平衡器-ERR_TOO_MANY_REDIRECTS

来自分类Dev

COM + VB6应用程序:RM_ENLIST_FAILED_TOO_MANY_ENLISTS错误

来自分类Dev

elasticsearch:cluster_block_exception TOO_MANY_REQUESTS / 12 / index只读/允许删除(api)

来自分类Dev

错误310(net :: ERR_TOO_MANY_REDIRECTS):本地wordpress网站的重定向过多

Related 相关文章

  1. 1

    ArangoDB Too many open files

  2. 2

    Entity Framework applying too many migrations

  3. 3

    MYSQL Too many connections error will not go away

  4. 4

    Traefik / Portainer-ERR_TOO_MANY_REDIRECTS

  5. 5

    wordpress:ERR_TOO_MANY_REDIRECTS

  6. 6

    ERR_TOO_MANY_REDIRECTS - Laravel

  7. 7

    BluetoothLeAdvertiser AdvertiseCallback引发ADVERTISE_FAILED_TOO_MANY_ADVERTISERS

  8. 8

    ERR_TOO_MANY_REDIRECTS错误的解决方案

  9. 9

    Express.js ERR_TOO_MANY_REDIRECTS错误

  10. 10

    无法加载资源:net :: ERR_TOO_MANY_REDIRECTS

  11. 11

    PDFjs ERR_TOO_MANY_REDIRECTS请求。Cookies

  12. 12

    错误:gRPC服务调用中too_many_pings错误

  13. 13

    春季安全性ERR_TOO_MANY_REDIRECTS

  14. 14

    无法加载资源jquery ERR_TOO_MANY_REDIRECTS

  15. 15

    在 HandleUnauthorizedRequest 中获取 err_too_many_redirects

  16. 16

    添加 SSL 后的 laravel ERR_TOO_MANY_REDIRECTS

  17. 17

    IIS 在重写规则上得到 ERR_TOO_MANY_REDIRECTS

  18. 18

    Nginx 和 Cloudflare:err_too_many_redirects

  19. 19

    如何修复 SoundEffect qml 中的“E libOpenSLES: Too many objects”

  20. 20

    将行添加到安全配置后,ERR_TOO_MANY_REDIRECTS

  21. 21

    Android Augmented Reality Pro AR 9: activity gets back when displays too many markers

  22. 22

    Linux system does not allow creation of sockets because of "Too many open files" *even after a reboot*

  23. 23

    Grails-Spring Security REST-302响应网:: ERR_TOO_MANY_REDIRECTS

  24. 24

    如何在VS Code中启用“ too-many-locals” Pylint消息?

  25. 25

    如何在VS Code中启用“ too-many-locals” Pylint消息?

  26. 26

    Google Cloud Platform负载平衡器-ERR_TOO_MANY_REDIRECTS

  27. 27

    COM + VB6应用程序:RM_ENLIST_FAILED_TOO_MANY_ENLISTS错误

  28. 28

    elasticsearch:cluster_block_exception TOO_MANY_REQUESTS / 12 / index只读/允许删除(api)

  29. 29

    错误310(net :: ERR_TOO_MANY_REDIRECTS):本地wordpress网站的重定向过多

热门标签

归档