Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-4749

reducer should output input data size when shuffling is done

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.19.0
    • 0.20.0
    • None
    • None
    • Reviewed
    • Added a new counter REDUCE_INPUT_BYTES.

    Description

      Sometimes we see a single slow reducer because of the load balancing problem. This information will be very useful to understand how imbalanced the load is.

      Should be easy to fix I guess, since reducer should have all information needed at the end of the shuffling phase.

      Attachments

        1. 4749.patch
          2 kB
          He Yongqiang

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            he yongqiang He Yongqiang
            zshao Zheng Shao
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment