Container images

The image parameter of the TaskEnvironment to specify a container image. Every task defined using that TaskEnvironment will run in a container based on that image.

If a TaskEnvironment does not specify an image, it will use the default Flyte image ( ghcr.io/unionai-oss/flyte:latest).

Specifying your own image directly

You can directly reference an image by URL in the image parameter, like this:

env = flyte.TaskEnvironment(
    name="my_task_env",
    image="docker.io/myorg/myimage"
)

This works well if you have a pre-built image available in a public registry like Docker Hub or in a private registry that your Union/Flyte instance can access.

Specifying your own image with the flyte.Image object

You can also construct an image programmatically using the flyte.Image object.

The flyte.Image object provides a fluent interface for building container images with specific dependencies.

You start building your image with on of the from_ methods:

You can then layer on additional components using the with_ methods:

You can also specify an image in one shot (with no possibility of layering) with:

Additionally, the Image class provides:

Here are some examples of the most common patterns for building images with flyte.Image.

Example: Defining a custom image with Image.from_debian_base

The Image.from_debian_base() provides the default Flyte image as the base. This image is itself based on the official Python Docker image (specifically python:{version}-slim-bookworm) with the addition of the Flyte SDK pre-installed. Starting there, you can layer additional features onto your image. For example:

import flyte

# Define the task environment
env = flyte.TaskEnvironment(
    name="my_env",
    image = (
        flyte.Image.from_debian_base(
            name="my-image"
            python_version=(3, 13),
            registry="ghcr.io/my_gh_org" # Only needed for local builds
        )
        .with_apt_packages("git", "curl")
        .with_pip_packages("numpy", "pandas", "scikit-learn")
        .with_env_vars({"MY_CONFIG": "production"})
    )
)


# Supporting task definitions
...

# Main task definition
@env.task
def main(x_list: list[int] = list(range(10))) -> float:
    ...

# Init and run
if __name__ == "__main__":
    flyte.init_from_config("config.yaml")
    run = flyte.run(main, x_list=list(range(10)))
    print(run.name)
    print(run.url)
    run.wait(run)

The registry parameter is only needed if you are building the image locally. It is not required when using the Union backend ImageBuilder. See Image building for more details.

Example: Defining an image based on uv script metadata

Another common technique for defining an image is to use uv inline script metadata to specify your dependencies right in your Python file and then use the flyte.Image.from_uv_script() method to create a flyte.Image object. The from_uv_script method starts with the default Flyte image and adds the dependencies specified in the uv metadata. For example:

# /// script
# requires-python = ">=3.13"
# dependencies = [
#    "flyte",
#    "numpy",
#    "pandas",
#    "scikit-learn"
# ]
# ///

...

env = flyte.TaskEnvironment(
    name="my_env",
    image=flyte.Image.from_uv_script(
            __file__,
            name="my_image",
            registry="ghcr.io/my_gh_org" # Only needed for local builds
        )
)

# Supporting task definitions
...

# Main task definition
@env.task
def main(x_list: list[int] = list(range(10))) -> float:
    ...

# Init and run
if __name__ == "__main__":
    # Init for remote run on backend
    flyte.init_from_config("config.yaml")

    # Init for local run
    # flyte.init()

    run = flyte.run(main, x_list=list(range(10)))
    print(run.name)
    print(run.url)
    run.wait(run)

The advantage of this approach is that the dependencies used when running a script locally and when running it on the Flyte/Union backend are always the same (as long as you use uv to run your scripts locally). This means you can develop and test your scripts in a consistent environment, reducing the chances of encountering issues when deploying to the backend.

In the above example you can see how to use flyte.init_from_config() for remote runs and flyte.init() for local runs. Uncomment the flyte.init() line (and comment out flyte.init_from_config()) to enable local runs. Do the opposite to enable remote runs.

When using uv metadata in this way, be sure to include the flyte package in your uv script dependencies. This will ensure that flyte is installed when running the script locally using uv run. When running on the Flyte/Union backend, the flyte package from the uv script dependencies will overwrite the one included automatically from the default Flyte image.

Image building

There are two ways that the image can be built:

  • If you are running a Flyte OSS instance then the image will be built locally on your machine and pushed to the container registry you specified in the Image definition.
  • If you are running a Union instance, the image can be built locally, as with Flyte OSS, or using the Union ImageBuilder, which runs remotely on Union’s infrastructure.

Configuring the builder

In Earlier, we discussed the image.builder property in the config.yaml.

For Flyte OSS instances, this property must be set to local.

For Union instances, this property can be set to remote to use the Union ImageBuilder, or local to build the image locally on your machine.

Local image building

When image.builder in the config.yaml is set to local, flyte.run() does the following:

  • Builds the Docker image using your local Docker installation, installing the dependencies specified in the uv inline script metadata.
  • Pushes the image to the container registry you specified.
  • Deploys your code to the backend.
  • Kicks off the execution of your workflow
  • Before the task that uses your custom image is executed, the backend pulls the image from the registry to set up the container.

Above, we used registry="ghcr.io/my_gh_org".

Be sure to change ghcr.io/my_gh_org to the URL of your actual container registry.

You must ensure that:

  • Docker is running on your local machine.
  • You have successfully run docker login to that registry from your local machine (For example GitHub uses the syntax echo $GITHUB_TOKEN | docker login ghcr.io -u USERNAME --password-stdin)
  • Your Union/Flyte installation has read access to that registry.

If you are using the GitHub container registry (ghcr.io) note that images pushed there are private by default. You may need to go to the image URI, click Package Settings, and change the visibility to public in order to access the image.

Other registries (such as Docker Hub) require that you pre-create the image repository before pushing the image. In that case you can set it to public when you create it.

Public images are on the public internet and should only be used for testing purposes. Do not place proprietary code in public images.

Remote ImageBuilder

When image.builder in the config.yaml is set to remote (and you are running Union.ai), flyte.run() does the following:

  • Builds the Docker image on you Union instance with ImageBuilder, installing the dependencies specified in the uv inline script metadata.
  • Pushes the image to the internal container registry of your Union instance.
  • Deploys your code to the backend.
  • Kicks off the execution of your workflow.
  • Before the task that uses your custom image is executed, the backend pulls the image from the internal registry to set up the container.

There is no set up of Docker nor any access control configuration required on your part.