Stage2 Docs

Selecting Tools for Operations

When creating an operation in Stage2, choosing the right tool is a critical step. While Stage2 can operationalize any tool, the complexity of this process can vary greatly depending on the tool you select. Below, we outline key considerations for selecting a tool, and how these influence the ease of operationalization.

Key Considerations for Tool Selection

Support for Scripting
Tools with integrated scripting support (e.g., Python APIs, command-line utilities) are often easier to operationalize. This allows for programmatic control and seamless integration with Stage2.
Command Line Interface (CLI)
A robust CLI can simplify operationalization, as Stage2 can easily execute CLI commands within a containerized environment.
Licensing and Subscription Requirements
Free tools are generally more accessible unless your team already uses a licensed tool. Avoid introducing unnecessary costs for tools that are not integral to your workflow.
Familiarity with the Tool
Consider your familiarity with the tool and the learning curve associated with it. Tools you or your team are already comfortable with can save time during integration and troubleshooting.
Tool Size Relative to the Problem
The tool should be appropriately scaled to the task. Using overly complex or heavyweight software for a simple problem can increase overhead without adding value.

For example, using Autodesk Revit to generate a 2D image of a cube would be overkill.
Availability of Existing Docker Images
Stage2 relies on Docker for containerization and does not add any proprietary layers to this process. If a tool already has a base Docker image available on Docker Hub, it can significantly simplify the operationalization process.

Tool Selection for Image Preprocessing

For our problem—preprocessing images for machine learning—several tools could be considered:

1. Microsoft Paint

Pros: Extremely simple and user-friendly.
Cons: Limited functionality and lacks programmatic support. Not suitable for automation or integration in workflows.

2. GIMP

Pros: Open-source, free, and offers scripting support through Python or Scheme. It’s a flexible tool for image editing.
Cons: While powerful, the scripting interface can be somewhat complex for beginners.

3. Photoshop

Pros: Industry standard for image editing, offering extensive scripting capabilities and powerful features.
Cons: Requires a subscription, adding ongoing costs. Not ideal unless you already use Photoshop in your workflow.

4. ImageMagick (Chosen Tool)

Pros: Lightweight, open-source, and specifically designed for programmatic image manipulation. It has a powerful CLI that supports all the preprocessing tasks we need, including grayscale conversion, cropping, contrast adjustment, and normalization.
Cons: Requires some familiarity with command-line operations, but this is minimal compared to its flexibility and power.

Why ImageMagick?

For this example, ImageMagick is the ideal choice. Its command-line interface makes it easy to containerize and integrate into Stage2. Additionally, its lightweight design and ability to handle all the required preprocessing tasks efficiently make it perfect for the problem we are solving. Finally, being open-source, it avoids licensing costs and includes prebuilt Docker images available on Docker Hub, simplifying the operationalization process.

With the tool selected, the next step is to define how we’ll interact with it through parameters and outputs.
Continue to the next section: Designing an Operation Interface.

Selecting Tools for Operations​

Key Considerations for Tool Selection​

Tool Selection for Image Preprocessing​

1. Microsoft Paint​

2. GIMP​

3. Photoshop​

4. ImageMagick (Chosen Tool)​

Why ImageMagick?​