Parallel Processing

Article by dmitribagh · Oct 08, 2015 at 07:15 PM · mark2atsafe edited · Feb 26, 2019 at 07:26 PM

Article created with FME Desktop 2015.0

Introduction

Each FME translation usually runs as a single process on your computer. However, FME can be set up to take advantage of multi-core processors and run computations in parallel (doing multiple tasks at once). FME also makes use of hyper-threading, a technology that makes each physical core appear as two logical processors to the host operating system.

By using parallel processing, performance may be improved significantly over a single process.

Notes:

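As of FME 2019, the parallel processing option has been removed from most transformers and exists only in the custom transformer infrastructure. A separate article explains setting up parallel processing in a custom transformer.
For a more basic introduction to parallel processing, including a step-by-step tutorial exercise, please see the article How to Use Parallel Processing in FME.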

Setting Up Parallel Processing

As a brief introduction, note that each parallel process in FME uses its own set of data, and data cannot be passed between processes. Therefore you must divide data into groups using a Group-By parameter, and set each group to be handled by a different process.

Here a user is calculating statistics about the number of visitors to parks in the city of Vancouver, using FME's StatisticsCalculator transformer. Each park has an attribute that defines which neighborhood it resides in. That neighborhood attribute is used to group the data, and by setting a Parallel Processing Level, each group is handled by a separate process, potentially improving performance.
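The idea of grouping features and handing each group to an independent worker can be sketched in plain Python. This is only an illustration, not FME's implementation: the park records are hypothetical sample data, and threads stand in for FME's separate processes purely to keep the sketch short.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from statistics import mean

# Hypothetical sample records: (park name, neighborhood, visitor count).
PARKS = [
    ("Park A", "Kitsilano", 1200),
    ("Park B", "Kitsilano", 800),
    ("Park C", "Mount Pleasant", 450),
    ("Park D", "West End", 3000),
    ("Park E", "West End", 1000),
]


def group_by(records, key_index):
    """Split records into groups keyed by the value at key_index."""
    groups = defaultdict(list)
    for record in records:
        groups[record[key_index]].append(record)
    return dict(groups)


def visitor_stats(group):
    """Compute statistics for one group, using only that group's data."""
    visitors = [record[2] for record in group]
    return {"count": len(visitors), "total": sum(visitors), "mean": mean(visitors)}


def stats_per_group(records, max_workers=4):
    """Group by neighborhood, then process every group in its own worker."""
    groups = group_by(records, key_index=1)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(visitor_stats, groups.values()))
    return dict(zip(groups.keys(), results))


if __name__ == "__main__":
    for neighborhood, stats in stats_per_group(PARKS).items():
        print(neighborhood, stats)
```

Each worker sees only its own group, which mirrors the rule that data cannot be passed between processes.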

Parallel Processing Levels

The processing level determines how many processes run in parallel. Minimal creates the fewest processes; Extreme creates the most. The exact number depends on the number of cores and processors on the computer being used.

However, there is a limit to the number of processes that FME will create. This limit is tied to the FME license level: FME Base Edition allows a maximum of 4 processes, Professional Edition 8, and all other editions 16.
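The license cap can be expressed as a small calculation. The sketch below is an assumption about how the limit combines with a requested process count, not FME's internal logic; the function name and edition keys are hypothetical, and only the limits themselves (4, 8, 16) come from the text above.

```python
# Process limits per FME edition, as listed in the paragraph above.
EDITION_PROCESS_LIMITS = {"base": 4, "professional": 8}
DEFAULT_PROCESS_LIMIT = 16  # all other editions


def effective_process_count(requested, edition):
    """Cap a requested number of processes at the edition's limit.

    Hypothetical helper: 'requested' is whatever number the chosen
    processing level would produce on the machine in question.
    """
    limit = EDITION_PROCESS_LIMITS.get(edition.lower(), DEFAULT_PROCESS_LIMIT)
    return max(1, min(requested, limit))


if __name__ == "__main__":
    for edition in ("Base", "Professional", "other"):
        print(edition, effective_process_count(12, edition))
```

Under this reading, asking for 12 processes gives 4 on Base Edition, 8 on Professional Edition, and 12 on any other edition.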

Parallel Processing Tips

Here are some general tips for parallel processing:

Data must be divided into groups. If no group attribute is selected in the transformer's Group By setting, there is no parallel processing.
Groups are processed independently. In the example above, each neighborhood gets its own set of statistics. If features in one group depend on features in another, then processing them separately will produce incorrect results.
If you do not have an attribute that defines groups, then groups can be created using other transformers such as the ModuloCounter. This blog post explores different techniques for creating artificial groups.
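The ModuloCounter approach to artificial groups amounts to numbering features in a repeating cycle. The following is a minimal sketch of that idea in Python, not the transformer's actual code; the function name and the use of list positions as the counter are assumptions.

```python
from collections import defaultdict


def assign_modulo_groups(features, group_count):
    """Assign each feature to group (position % group_count).

    Features are dealt out in round-robin order, so the groups end up
    nearly equal in size regardless of the features' attributes.
    """
    if group_count < 1:
        raise ValueError("group_count must be at least 1")
    groups = defaultdict(list)
    for index, feature in enumerate(features):
        groups[index % group_count].append(feature)
    return dict(groups)


if __name__ == "__main__":
    for group_id, members in assign_modulo_groups(list(range(10)), 4).items():
        print(group_id, members)
```

Because the grouping ignores the data itself, it is only safe when features do not depend on one another, as the second tip warns.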

Parallel Processing and Performance

In theory, parallel processing should produce results faster than a single process. However, there are instances where that might not be the case:

Parallel processing only makes sense when the data volumes are big enough. For smaller datasets, the overhead of running multiple FME processes can easily make the translation slower than a single process. For example, if there are only a handful of parks in the above example, the benefits of parallel processing can be negated by the cost of starting multiple processes.
A greater number of parallel processes does not always correlate with better performance. For example, in "Aggressive" or "Extreme" mode, there might be so many processes that they are fighting each other (or the operating system) for system resources.
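The first point can be illustrated with a toy cost model. This is a deliberately simplified assumption (a fixed startup cost plus evenly divided work), not a measurement of FME, and every number in it is hypothetical. It also ignores the resource contention described in the second point.

```python
def estimated_seconds(work_seconds, processes, startup_seconds):
    """Toy estimate: fixed startup cost plus evenly divided work."""
    if processes <= 1:
        return work_seconds
    return startup_seconds + work_seconds / processes


def parallel_pays_off(work_seconds, processes, startup_seconds):
    """True when the toy estimate beats the single-process time."""
    return estimated_seconds(work_seconds, processes, startup_seconds) < work_seconds


if __name__ == "__main__":
    # Hypothetical figures: 5 s to launch the extra processes.
    print(parallel_pays_off(4, processes=4, startup_seconds=5))    # small job
    print(parallel_pays_off(600, processes=4, startup_seconds=5))  # large job
```

In this model a 4-second job is slower with four processes, while a 10-minute job is clearly faster, which matches the general pattern described above.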

Parallel Processing and Custom Transformers

Rather than have a single transformer carry out parallel processing, it's possible to enable parallel processing on a whole group of transformers.

This is done by creating a Custom Transformer from that group. A custom transformer has its own parameters for parallel processing, and it does not have to be limited to a single transformer within it.

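Using a custom transformer like this also means that the "Group By" and "Parallel Process" settings can be different (for example, I might group my parks together by neighborhood, but parallel process them on the basis of city).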
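Using one attribute to split work across processes and another to group within each process can be sketched as follows. This is an illustrative Python analogy under stated assumptions: the records are hypothetical, and threads stand in for separate FME processes.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

# Hypothetical records: (city, neighborhood, visitor count).
CITY_PARKS = [
    ("Vancouver", "Kitsilano", 1200),
    ("Vancouver", "Kitsilano", 800),
    ("Vancouver", "West End", 3000),
    ("Burnaby", "Metrotown", 500),
    ("Burnaby", "Metrotown", 700),
]


def totals_by_neighborhood(city_records):
    """Inner grouping ("Group By"): total visitors per neighborhood."""
    totals = defaultdict(int)
    for _city, neighborhood, visitors in city_records:
        totals[neighborhood] += visitors
    return dict(totals)


def process_by_city(records, max_workers=4):
    """Outer split ("Parallel Process"): one worker per city."""
    by_city = defaultdict(list)
    for record in records:
        by_city[record[0]].append(record)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(totals_by_neighborhood, by_city.values()))
    return dict(zip(by_city.keys(), results))


if __name__ == "__main__":
    print(process_by_city(CITY_PARKS))
```

The outer key decides how many workers there are; the inner key decides how results are grouped inside each worker.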

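Parallel Processing Examples

This is a set of examples in which parallel processing is used. For a tutorial to carry out yourself, see this page.

All of the examples here were run on a 64-bit Windows machine with a quad-core processor (8 virtual processors) and 4GB of RAM. Keep in mind that results may vary depending on hardware configuration and FME version.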

RasterDEMGenerator

Workspace as a Template

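Because surface modelling is such an intensive process, using parallel processing can be very beneficial.

This example generates a DEM from point cloud input:

(Image: RasterDEMGenerator workspace)

The RasterDEMGenerator Group By parameter is set to fme_basename, so that each point cloud is processed as its own group.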

No parallelism: 1m10s
Minimal parallelism: 44s
Moderate parallelism: 33s
Aggressive parallelism: 37s
Extreme parallelism: 37s

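Moderate gives the best result here, more than twice as fast as no parallelism. Minimal parallelism is slower because it does not use the full processing capability. Aggressive and Extreme modes are slower because their processes use the full processing capability at the expense of each other.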
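The "more than twice as fast" figure can be checked directly from the timings listed above. The helper names below are hypothetical; the durations are the ones reported for this test.

```python
import re

UNIT_SECONDS = {"h": 3600, "m": 60, "s": 1}


def to_seconds(duration):
    """Convert strings such as '1m10s', '1m 01s', '2h20m' or '44s' to seconds."""
    parts = re.findall(r"(\d+)\s*([hms])", duration)
    if not parts:
        raise ValueError(f"unrecognised duration: {duration!r}")
    return sum(int(value) * UNIT_SECONDS[unit] for value, unit in parts)


# Timings from the first RasterDEMGenerator test.
TIMINGS = {
    "None": "1m10s",
    "Minimal": "44s",
    "Moderate": "33s",
    "Aggressive": "37s",
    "Extreme": "37s",
}


def speedups(timings, baseline="None"):
    """Speedup of each level relative to the baseline run."""
    base = to_seconds(timings[baseline])
    return {level: round(base / to_seconds(t), 2) for level, t in timings.items()}


if __name__ == "__main__":
    for level, factor in speedups(TIMINGS).items():
        print(f"{level}: {factor}x")
```

Moderate works out to roughly 2.1 times faster than the single-process run, with Minimal at about 1.6 and Aggressive and Extreme at about 1.9.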

In the second experiment, the point cloud files are grouped by the first letter of their names (this is what the SubstringExtractor transformer is for).

No parallelism: 1m52s
Minimal parallelism: 1m 01s
Moderate parallelism: 42s
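The first-letter grouping used in the second experiment can be sketched in Python. The file names are hypothetical, and the sketch assumes that the name being split is the file name without its path or extension.

```python
from collections import defaultdict
from pathlib import PurePath


def group_by_first_letter(paths):
    """Group file paths by the first character of their base name."""
    groups = defaultdict(list)
    for path in paths:
        basename = PurePath(path).stem  # file name without extension
        if not basename:
            continue  # skip empty names rather than guess a group
        groups[basename[0]].append(path)
    return dict(groups)


if __name__ == "__main__":
    # Hypothetical point cloud files.
    files = ["data/alpha_01.las", "data/alpha_02.las", "data/bravo_01.las"]
    print(group_by_first_letter(files))
```

Grouping this way produces fewer, larger groups than one group per file, which changes how the work is shared between processes.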

Using a larger test dataset shows that the results do scale with data size:

No parallelism: 2h20m
Minimal parallelism: 54m

TINGenerator

Workspace as a Template

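TINGenerator is, like RasterDEMGenerator, another subset of the SurfaceModeller.

In this example a single TINGenerator takes 5 minutes to generate the surface. However, even before parallelism, we can use a trick in which one TINGenerator creates small surfaces (one from each source LAS file) and a second TINGenerator combines these small surfaces into a single surface.

Coupling this two-stage surface generation with parallel processing gives excellent results: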

Single TINGenerator: No parallelism: 5m01s
Two TINGenerators: No parallelism: 55s
Two TINGenerators: Minimal parallelism: 30s
Two TINGenerators: Moderate parallelism: 28s
Two TINGenerators: Aggressive parallelism: 26s
Two TINGenerators: Extreme parallelism: 28s

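Based on the results of the tests above, we can conclude that parallel processing allows faster surface modelling, and it can be recommended on machines that support multithreading.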

Buffering

Workspace as a Template

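This example uses a shapefile dataset containing US major roads, where the intention is to buffer each road with a 25 metre buffer. The process wraps the Bufferer transformer inside a custom transformer, so that the Group By parameter can use a different attribute from the Parallel Process By parameter.

This is very useful because it lets us create the optimum number of groups, which here is between 8 and 16. The ModuloCounter transformer is used to do this, and its "Max Count" parameter is the number of groups to create.

For 450,000 original road segments the buffering numbers are as follows: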

No parallelism: 2m51s
Moderate parallelism (4 groups): 1m29s
Moderate parallelism (8 groups): 1m30s
Moderate parallelism (16 groups): 1m33s
Moderate parallelism (24 groups): 1m36s
Moderate parallelism (50 groups): 1m54s

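We can see that in the last tests the smaller group sizes do not repay the multi-processing overhead (launching FME sessions and sending features between FME instances). This would be less of a problem if each group contained a much larger number of features.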
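A quick calculation shows how small the groups become as the group count rises. The sketch assumes the 450,000 segments are split evenly between groups; the timings are the Moderate runs listed above, converted to seconds.

```python
TOTAL_FEATURES = 450_000  # road segments in the buffering test

# (number of groups, measured time in seconds) for the Moderate runs.
RUNS = [(4, 89), (8, 90), (16, 93), (24, 96), (50, 114)]


def features_per_group(total, groups):
    """Features in each group, assuming an even split."""
    return total // groups


for groups, seconds in RUNS:
    size = features_per_group(TOTAL_FEATURES, groups)
    print(f"{groups:>2} groups: {size:>7} features per group, {seconds}s")
```

Going from 4 to 50 groups shrinks each group from 112,500 to 9,000 features, while the run time rises from 89 to 114 seconds.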

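Line Joining

Parallel processing can be used with any transformer in a workspace. With the data from the example above, running the LineJoiner transformer (wrapped in a custom transformer) in parallel gives the following: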

No parallelism: 1m11s
Moderate parallelism: 1m14s

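We might conclude that parallel processing has no advantage over normal line joining. However, when the data is about 5 times larger, the results are quite different: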

No parallelism: 10m 23s
Minimal parallelism: 6m 06s
Moderate parallelism: 5m11s
Aggressive parallelism: 5m09s

Without parallel processing, a single process can hog resources, crippling the computer while FME optimizes memory usage and caches data to disk.

Clipping

Workspace as a Template

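Clipping is another operation where multiple processes are beneficial. In this example we take the US major roads, which have already been joined within each state, and clip them to county boundaries. Again, we use the second digit of the FIPS number to create the processing groups.

The results for a large dataset (~450,000 features) are as follows: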

No parallelism: 1m 44s
Minimal parallelism: 1m 17s
Moderate parallelism: 1m 17s
Aggressive parallelism: 1m 18s

With a larger dataset (~2,250,000 features) the results are more pronounced:

No parallelism: 27m 01s
Moderate parallelism: 7m 33s

Point Cloud Manipulation: 3D Clipping

Workspace as a Template

Clipping a point cloud in 3D can be useful as a simple surface filter.

The results for a relatively small point cloud are as follows:

No parallelism: 1m 45s
Aggressive: 1m 12s
