https://issues.apache.org/jira/browse/HBASE-12590

 

[HBASE-12590] A solution for data skew in HBase-Mapreduce Job - ASF JIRA

1, Motivation In production environment, data skew is a very common case. A HBase table may contains a lot of small regions and several large regions. Small regions waste a lot of computing resources. If we use a job to scan a table with 3000 small regions

issues.apache.org

 

위의 링크에 따르면 

 

hbase.mapreduce.input.autobalance = true 

 

로 옵션을 지정하면

 

data 의 auto balance 가 된다고 한다.

 

 

 

 

+ Recent posts