Hadoop version: 2.7.3, DataNodes: 32, block size: 512 MB, replication: 3
I cleared all the data in HDFS using the command
hdfs dfs -rm -r /*
After clearing the HDFS cluster, a few DataNodes still show a nonzero Block pool used, even though the number of blocks is zero.
The output of hdfs fsck / is as follows:
Connecting to namenode via http://ip-10-0-2-7:50070/fsck?ugi=ubuntu&path=%2F
FSCK started by ubuntu (auth:SIMPLE) from /10.0.2.7 for path / at Tue Jul 18 04:34:19 UTC 2017
Status: HEALTHY
Total size: 0 B
Total dirs: 1
Total files: 0
Total symlinks: 0
Total blocks (validated): 0
Minimally replicated blocks: 0
Over-replicated blocks: 0
Under-replicated blocks: 0
Mis-replicated blocks: 0
Default replication factor: 3
Average block replication: 0.0
Corrupt blocks: 0
Missing replicas: 0
Number of data-nodes: 32
Number of racks: 1
FSCK ended at Tue Jul 18 04:34:19 UTC 2017 in 1 milliseconds
The filesystem under path '/' is HEALTHY
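I know that block deletion is an asynchronous process, but the DataNodes whose Block Pool Used has not dropped to 0 are taking a long time to free their blocks. Can anyone help me figure out why block deletion is slow on these servers, or whether some other issue is keeping these blocks in use?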