Input data is partitioned in such a way that it can be distributed among a cluster of machines for processing in parallel.
输入数据使用这样一种方法进行分区,即在并行处理的计算机集群中分区的方法。
Parallel Tasks - Used when one needs to apply different operations, each with its own data input.
并行任务(Parallel Tasks)——在需要执行每个都具有自己数据输入的不同操作的时候使用。
Loading time also depended on whether data were written to intermediate files or not, and on whether input files were processed in sequence or in parallel.
载入时间也依赖于数据是否写入到中间文件,及依赖于输入文件是顺序处理或并行处理。
应用推荐