Server environment variables

This page documents the environment variables that you can use to configure the Vivaria server.

Unless explicitly specified, all environment variables are optional.

API and UI

Variable Name	Description	Required?
`MACHINE_NAME`	Your machine name, e.g. from running `hostname`. Must be lower-case, e.g. johns-macbook or joans-system-76.	True
`API_IP`	Tells pyhooks inside agent containers where to find the Vivaria server (this server).	True
`PORT`	What port to serve the Vivaria API on.	True
`UI_URL`	The URL on which Vivaria is serving its UI.	False
`NODE_ENV`	Controls several Vivaria features. For example, Vivaria uses sequential instead of random run IDs if `NODE_ENV` is 'production'.	False

Sentry

Variable Name	Description
`SENTRY_ENVIRONMENT`	Configures what environment the server/UI/pyhooks are running in, for Sentry.	False
`SENTRY_DSN`	Enables Sentry reporting in the server and specifies its DSN.	False
`SENTRY_DSN_REACT`	Enables Sentry reporting in the UI and specifies its DSN.	False
`SENTRY_DSN_PYTHON`	Enables Sentry reporting in pyhooks and specifies its DSN.	False

Datadog

Variable Name	Description
`DD_ENV`	Configures what environment the server is running in, for Datadog.

Database

Variable Name	Description	Required?
`PGHOST`	The host name or IP address of the PostgreSQL server.	True
`PGPORT`	The port number on which the PostgreSQL server is listening.	True
`PGDATABASE`	The name of the PostgreSQL database.	True
`PGUSER`	The username to connect to the PostgreSQL database.	True
`PGPASSWORD`	The password to authenticate the PostgreSQL user.	True
`PGSSLMODE`	The SSL mode to use when connecting to the PostgreSQL server. NOTE: `PGSSLMODE` is not accurately passed to the pg javascript client; the only useful alternative value here is "disabled".	True
`DB_CA_CERT_PATH`	A path to a CA certificate to use when connecting to the database.	False
`PG_READONLY_USER`	The username for a read-only user with access to the PostgreSQL database.	True
`PG_READONLY_PASSWORD`	The password for the read-only user.	True
`VIVARIA_PG_READONLY_HOST`	An optional separate host for read-only database connections. If not set, the read-only connection will use the same host as `PGHOST`.	False
`VIVARIA_PG_READONLY_PORT`	An optional separate port for read-only database connections. If not set, the read-only connection will use the same port as `PGPORT`.	False
`MAX_DATABASE_CONNECTIONS`	The maximum number of database connections that each Vivaria process is allowed to use.	False
`ACCESS_TOKEN_SECRET_KEY`	Used to encrypt and decrypt runs_t."encryptedAccessToken".	True

AWS and aux VMs

Variable Name	Description
`TASK_AWS_REGION`	Vivaria will create VMs for task environments in this AWS region.
`TASK_AWS_ACCESS_KEY_ID`	Vivaria can use this AWS access key to create VMs for task environments.
`TASK_AWS_SECRET_ACCESS_KEY`	Vivaria can use this AWS secret access key to create VMs for task environments.
`AUX_VM_HAS_PUBLIC_IP`	If 'true', aux VMs will have public IPs. Otherwise, access is only possible from within the aux VM's VPC. If you set this to false, be sure to set the subnet ID appropriately (i.e. choose a private subnet).
`AUX_VM_SUBNET_ID`	If set, Vivaria will create aux VMs in this subnet.
`AUX_VM_SECURITY_GROUP_ID`	Security group for the aux VM. If not set, Vivaria will create a new security group. Note: It is wise to finish all long-running aux VM tasks if you change this from being set to unset, or vice versa. Otherwise, the code is going to either try to delete a security group that's in use by aux VMs (and fail) or it will fail to clean up a security group.
`AUX_VM_EXTRA_TAGS`	Extra tags added to resources created for the aux VM. The string is parsed in a naive way, so don't put "=" or "," in the tag names or values.

Docker and the primary VM host

Vivaria communicates with VM hosts using the Docker CLI and will pass environment variables along to it. Use DOCKER_HOST or DOCKER_CONTEXT to configure how Vivaria connects to the primary VM host's Docker daemon. Use DOCKER_TLS_VERIFY to tell the Docker to use a provided TLS client certificate to authenticate the primary VM host's Docker daemon.

Variable Name	Description
`DOCKER_BUILD_PLATFORM`	If set, Vivaria will pass `DOCKER_BUILD_PLATFORM` to the --platform argument of docker build when building images.
`VIVARIA_DOCKER_BUILD_OUTPUT`	One of `load`, `save`, or `push`. Passed to `docker build` (e.g. `docker build --save`) to control if images are pushed to a remote registry.
`VIVARIA_DOCKER_IMAGE_NAME`	If set, Vivaria will build all task/run images as tags under this Docker image.
`VIVARIA_DOCKER_REGISTRY_TOKEN`	If set, Vivaria will check if images exist in a private Docker registry using a version check (`HEAD v2/${REPO_NAME}/manifests/${TAG}`)
`MP4_DOCKER_USE_GPUS`	Whether there are local GPUs that Vivaria can attach to task environments and agent containers.
`VM_HOST_LOGIN`	Used by Vivaria to connect to the VM host over SSH. This
`VM_HOST_HOSTNAME`	Should be the same as the hostname in `DOCKER_HOST`. Used by Vivaria to connect to the VM host over SSH, to set up iptables rules for no-internet task environments on the VM host and to grant users SSH access to the VM host. If unset, Vivaria will assume you want to use a Docker host running on the same machine as the Vivaria server. TODO: This is redundant with `VM_HOST_LOGIN` and should be removed.
`VM_HOST_SSH_KEY`	Path to an SSH key with root access on the VM host. If not set, Vivaria will fall back to the default SSH behaviour: using keys available in ssh-agent.
`FULL_INTERNET_NETWORK_NAME`	Vivaria will connect full-internet task environments to this Docker network.
`NO_INTERNET_NETWORK_NAME`	Vivaria will connect no-internet task environments to this Docker network.
`VM_HOST_MAX_CPU`	If the VM host's CPU usage is greater than this, Vivaria won't start any new runs.
`VM_HOST_MAX_MEMORY`	If the VM host's memory usage is greater than this, Vivaria won't start any new runs.

Kubernetes and EKS

You can configure Vivaria to run task environments and agent containers in:

A Kubernetes cluster using Amazon EKS, and/or
A Kubernetes cluster with machine that have GPUs, e.g. on a cloud provider like Voltage Park or FluidStack.

Variable Name	Description
`K8S_POD_CPU_COUNT_REQUEST`	Vivaria will start pods with this CPU request, unless a task's `manifest.yaml` explicitly requests a different amount.
`K8S_POD_RAM_GB_REQUEST`	Vivaria will start pods with this RAM request, unless a task's `manifest.yaml` explicitly requests a different amount.
`K8S_POD_DISK_GB_REQUEST`	Vivaria will start pods with this disk request, unless a task's `manifest.yaml` explicitly requests a different amount.
`VIVARIA_K8S_RUN_QUEUE_BATCH_SIZE`	When a user requests that Vivaria start a k8s run, Vivaria puts the run in a queue. This controls how many k8s runs Vivaria will pull from the queue at once. `VIVARIA_K8S_RUN_QUEUE_INTERVAL_MS` controls how often Vivaria will check the queue for new runs. For non-k8s runs, Vivaria will always pull one run from the queue at a time and `VIVARIA_RUN_QUEUE_INTERVAL_MS` controls how often Vivaria will check the queue for new runs.
`VIVARIA_K8S_RUN_QUEUE_INTERVAL_MS`	How often Vivaria will check the queue for new k8s runs, in milliseconds.

Kubernetes

Variable Name	Description
`VIVARIA_K8S_CLUSTER_URL`	The URL of the Kubernetes cluster used by Vivaria.
`VIVARIA_K8S_CLUSTER_CA_DATA`	Vivaria uses this to verify the Kubernetes cluster's identity, to prevent man-in-the-middle attacks. Vivaria puts this in the cluster's `certificate-authority-data` field in its kubeconfig object.
`VIVARIA_K8S_CLUSTER_NAMESPACE`	The namespace in the Kubernetes cluster where Vivaria will create resources. Defaults to 'default'.
`VIVARIA_K8S_CLUSTER_IMAGE_PULL_SECRET_NAME`	If you're pulling images from a private registry, put credentials for the registry in a Kubernetes secret as specified here: https://kubernetes.io/docs/tasks/configure-pod-container/pull-image-private-registry/ Then, set this to the name of the secret.
`VIVARIA_K8S_CLUSTER_CLIENT_CERTIFICATE_DATA`	The client certificate for the Kubernetes cluster. Vivaria puts this in the `client-certificate-data` field of the user it uses to authenticate to the cluster. Not needed if using EKS.
`VIVARIA_K8S_CLUSTER_CLIENT_KEY_DATA`	The client key for the Kubernetes cluster. Vivaria puts this in the `client-key-data` field of the user it uses to authenticate to the cluster. Not needed if using EKS.
`VIVARIA_EKS_CLUSTER_ID`	If using EKS, the name of the EKS cluster used by Vivaria.
`VIVARIA_EKS_CLUSTER_AWS_REGION`	If using EKS, the AWS region where the EKS cluster is located.
`VIVARIA_AWS_ACCESS_KEY_ID_FOR_EKS`	If using EKS, an AWS access key ID for an IAM user with permission to create and delete Pods in the EKS cluster.
`VIVARIA_AWS_SECRET_ACCESS_KEY_FOR_EKS`	If using EKS, the AWS secret access key for the IAM user with permission to create and delete Pods in the EKS cluster.

Kubernetes cluster with GPUs

Variable Name	Description
`VIVARIA_K8S_GPU_CLUSTER_URL`	The URL of the Kubernetes cluster with GPUs used by Vivaria.
`VIVARIA_K8S_GPU_CLUSTER_CA_DATA`	Vivaria uses this to verify the Kubernetes cluster's identity, to prevent man-in-the-middle attacks. Vivaria puts this in the cluster's `certificate-authority-data` field in its kubeconfig object.
`VIVARIA_K8S_GPU_CLUSTER_NAMESPACE`	The namespace in the Kubernetes cluster with GPUs where Vivaria will create resources. Defaults to 'default'.
`VIVARIA_K8S_GPU_CLUSTER_IMAGE_PULL_SECRET_NAME`	If you're pulling images from a private registry, put credentials for the registry in a Kubernetes secret as specified here: https://kubernetes.io/docs/tasks/configure-pod-container/pull-image-private-registry/ Then, set this to the name of the secret.
`VIVARIA_K8S_GPU_CLUSTER_CLIENT_CERTIFICATE_DATA`	The client certificate for the Kubernetes cluster with GPUs. Vivaria puts this in the `client-certificate-data` field of the user it uses to authenticate to the cluster.
`VIVARIA_K8S_GPU_CLUSTER_CLIENT_KEY_DATA`	The client key for the Kubernetes cluster with GPUs. Vivaria puts this in the `client-key-data` field of the user it uses to authenticate to the cluster.
`VIVARIA_API_IP_FOR_K8S_GPU_CLUSTER`	An IP address or hostname at which pods in the Kubernetes cluster with GPUs can find the Vivaria server.

Agent sandboxing

Variable Name	Description
`NON_INTERVENTION_FULL_INTERNET_MODELS`	A comma-separated list of model name regexes that Vivaria allows in fully automatic full-internet runs with no human supervision.
`AGENT_CPU_COUNT`	CPU limit for task environment Docker containers used in runs and task environments started by `viv task start`.
`AGENT_RAM_GB`	RAM limit in GiB for task environment Docker containers used in runs and task environments started by `viv task start`.
`TASK_ENVIRONMENT_STORAGE_GB`	Disk usage limit in GiB for task environment Docker containers used in runs and task environments started by `viv task start`. This only works if the Docker storage driver meets certain conditions: https://docs.docker.com/reference/cli/docker/container/run/#storage-opt. Use a value of `-1` to disable this functionality if your Docker storage driver doesn't meet those conditions, otherwise task environment creation will fail.
`TASK_OPERATION_TIMEOUT_MINUTES`	Maximum time allowed for a task operation (e.g. start, score, teardown). If an operation takes longer than this, an error will be thrown. Useful for limiting the impact of infinite loops and similar bugs in task code.
`NO_INTERNET_TASK_ENVIRONMENT_SANDBOXING_MODE`	If set to `iptables`, Vivaria will attempt to sandbox no-internet task environments using iptables rules. If set to `docker-network`, Vivaria won't attempt to sandbox no-internet task environments. Instead, it'll assume that it's running in a Docker container that's connected to no-internet task environments by an internal Docker network.
`SKIP_SAFETY_POLICY_CHECKING`	If set to true, Vivaria does NOT check agent-submitted actions in non-intervention full-internet actions using an LLM. Otherwise, Vivaria will check these actions using an LLM.
`JWT_DELEGATION_TOKEN_SECRET`	Secret for generating JWT delegation tokens for agent actions. For example, when a user uses the "Generate options" feature, Vivaria generates a delegation token, provides it to the agent, and uses the token to authenticate the agent's generation requests. This allows the agent to generate rating options even when the agent branch is paused, but only for 15 seconds and for one specific generation request.

Middleman

Middleman is an internal, unpublished web service that METR uses as a proxy between Vivaria and LLM APIs. Vivaria can either make LLM API requests directly to LLM providers or via Middleman.

Variable Name	Description
`VIVARIA_MIDDLEMAN_TYPE`	If this is set to `builtin`, Vivaria will make LLM API requests directly to LLM APIs (e.g. the OpenAI API). If set to `remote`, Vivaria will make LLM API requests to the Middleman service. If set to `noop`, Vivaria will throw if when asked to make an LLM API request. Note that if `VIVARIA_IS_READ_ONLY` is `true`, this value is ignored and treated as `noop`.
`CHAT_RATING_MODEL_REGEX`	A regex that matches the names of certain rating models. Instead of using these models' logprobs to calculate option ratings, Vivaria will fetch many single-token rating prompt completions and calculate probabilities from them.

If VIVARIA_MIDDLEMAN_TYPE is builtin, Vivaria can talk to one of several LLM API provider APIs:

OpenAI

Variable Name	Description
`OPENAI_API_URL`	The URL of the OpenAI API.
`OPENAI_API_KEY`	The API key for the OpenAI API.

Anthropic

Variable Name	Description
`ANTHROPIC_API_KEY`	The API key for the Anthropic API.
`ANTHROPIC_API_URL`	The URL of the Anthropic API, not including version.

Google GenAI

Variable Name	Description
`GEMINI_API_KEY`	The API key for the Gemini API.
`GEMINI_API_VERSION`	The version of the API, e.g. `v1beta`.

Additional providers supported by LangChain can be added pretty easily.

If VIVARIA_MIDDLEMAN_TYPE is remote:

Variable Name	Description
`MIDDLEMAN_API_URL`	The URL of the Middleman service.
`OPENAI_API_URL`	You may also set `OPENAI_API_URL` to change where the OpenAI clone API will forward requests to.

Authentication

Variable Name	Description
`USE_AUTH0`	Controls whether or not Vivaria will use Auth0 to authenticate users. If Auth0 is disabled, Vivaria will use static access and ID tokens.
`VIVARIA_IS_READ_ONLY`	If set to `true`, Vivaria will not require any authentication but will also only allow GET requests, creating a public-access read-only instance of Vivaria. `ACCESS_TOKEN` must also be configured in this case.
`VIVARIA_ACCESS_TOKEN_MIN_TTL_MS`	Optional. Vivaria will refuse to start runs using access tokens that expire sooner than this time-to-live.

See here for more information on how to set up Auth0.

If USE_AUTH0 is true:

Variable Name	Description
`ID_TOKEN_AUDIENCE`	The Client ID from the Settings tab on your Single Page Application's page in the Auth0 admin dashboard.
`ACCESS_TOKEN_AUDIENCE`	The Identifier on your Auth0 API page in the Auth0 admin dashboard.
`ISSUER`	The Domain from the Settings tab on your Auth0 application page in the Auth0 admin dashboard, converted to an HTTPS URL with a trailing slash.
`JWKS_URI`	`ISSUER` plus `.well-known/jwks.json`, e.g. https://test.us.auth0.com/.well-known/jwks.json.
`VIVARIA_AUTH0_CLIENT_ID_FOR_AGENT_APPLICATION`	Optional. The Client ID from the Settings tab on your Machine to Machine application's page in the Auth0 admin dashboard.
`VIVARIA_AUTH0_CLIENT_SECRET_FOR_AGENT_APPLICATION`	Optional. The Client Secret from the Settings tab on your Machine to Machine application's page in the Auth0 admin dashboard.

If USE_AUTH0 is false, set ID_TOKEN and ACCESS_TOKEN to unique, randomly-generated values for each Vivaria deployment that doesn't use Auth0. Vivaria gives ACCESS_TOKEN to both agents and users but gives ID_TOKEN only to users. If agents can access ID_TOKEN as well as ACCESS_TOKEN, then they can use it to call any Vivaria API endpoint.

Git operations

Variable Name	Description
`ALLOW_GIT_OPERATIONS`	When false, Vivaria will throw an error if a user tries to use functionality that requires local Git operations (e.g. cloning or fetching a repo from GitHub).

If ALLOW_GIT_OPERATIONS is true:

Variable Name	Description
`GITHUB_AGENT_ORG`	The GitHub organization that contains the agent repos.
`GITHUB_AGENT_HOST`	Can be used to override the default host for cloning agent repos, e.g. to use SSH or an access token.
`GITHUB_TASK_HOST`	Can be used to override the default host for cloning task repos, e.g. to use SSH or an access token.
`VIVARIA_DEFAULT_TASK_REPO_NAME`	Organization and repository (e.g. `METR/mp4-tasks`) of primary task repo.
`TASK_REPO_HTTPS_HOST`	HTTPS URL used to construct links to the task repo in the Vivaria UI.

Slack

Variable Name	Description
`SLACK_TOKEN`	OAuth token for Vivaria Slack Notifications app.

Other configuration

Variable Name	Description
`DONT_JSON_LOG`	If `DONT_JSON_LOG` is set to 0, Vivaria will log JSONL-formatted logs to a log file.
`SSH_PUBLIC_KEYS_WITH_ACCESS_TO_ALL_AGENT_CONTAINERS`	A list of SSH public keys that will be added to `.ssh/authorized_keys` in all agent containers. The list separator is a space, then three pipes, then another space. If this environment variable is unset, then by default the list is empty.
`DEFAULT_RUN_BATCH_CONCURRENCY_LIMIT`	If a user creates a run but doesn't specify a run batch, Vivaria automatically creates a default run batch for the user. The goal is to prevent users from accidentally starting hundreds or thousands of runs without specifying a concurrency limit for them. This environment variable sets the concurrency limit of the default run batch.
`VIVARIA_RUN_QUEUE_INTERVAL_MS`	When a user requests that Vivaria start a non-k8s run, Vivaria puts the run in a queue. This controls how often Vivaria will check the queue for new runs, in milliseconds. Vivaria will always pull one non-k8s run from the queue at a time. For k8s runs, `VIVARIA_K8S_RUN_QUEUE_INTERVAL_MS` controls how often Vivaria will check the queue for new runs and `VIVARIA_K8S_RUN_QUEUE_BATCH_SIZE` controls how many k8s runs Vivaria will pull at once.
`RUN_SUMMARY_GENERATION_MODEL`	The model to use for generating run summaries using the "Summary" tab on the runs page.
`RUNS_PAGE_QUERY_GENERATION_MODEL`	The model to use for generating queries in the runs page query editor.
`RUNS_PAGE_QUERY_GENERATION_MAX_TOKENS`	The maximum number of tokens that Vivaria will use for generating queries in the runs page query editor. Default is no limit.