[HBASE-12590] A solution for data skew in HBase-Mapreduce Job - ASF JIRA

1. Motivation: In production environments, data skew is very common. An HBase table may contain a lot of small regions and several large regions. Small regions waste a lot of computing resources. If we use a job to scan a table with 3000 small regions


According to the link above, setting the option

hbase.mapreduce.input.autobalance = true

is said to auto-balance the data across the job's input splits.
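
As a rough sketch of where this option could go, the snippet below sets it on the job configuration from a Java driver. It assumes the standard Hadoop and HBase client libraries are on the classpath. The class name and job name are made up for illustration, and the table, scan, and mapper setup is left out. I have not measured the effect myself, so treat this as an untested configuration example rather than a recommended setup.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.mapreduce.Job;

// Hypothetical driver class, for illustration only.
public class AutoBalanceJobSketch {
    public static void main(String[] args) throws Exception {
        // Loads hbase-default.xml / hbase-site.xml from the classpath.
        Configuration conf = HBaseConfiguration.create();

        // The option named in HBASE-12590; it must be set before the Job is created,
        // because Job.getInstance() takes a copy of the configuration.
        conf.setBoolean("hbase.mapreduce.input.autobalance", true);

        // Hypothetical job name. Table, Scan, and mapper setup is omitted here.
        Job job = Job.getInstance(conf, "autobalance-scan-sketch");

        // Read the value back from the job's own configuration as a sanity check.
        System.out.println("autobalance = "
                + job.getConfiguration().getBoolean("hbase.mapreduce.input.autobalance", false));
    }
}
```

If the driver goes through ToolRunner, the same property can presumably be passed on the command line as `-Dhbase.mapreduce.input.autobalance=true` instead of being hard-coded.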




