Copy Hadoop Data From One HDFS to Another

If you have two HDFS cluster operating on two different places (production vs alpha for example), sometimes you might want to copy some data from one cluster to another. To do it is easy using Hadoop’s internal “distcp” command: hadoop distcp hdfs://hadoop-namenode/data/2013/01 hdfs:///data/2013/ We have the following directory structure in …