Skyvern Changelog — April & May 2026

Skyvern Changelog — April & May 2026

Two months of shipping in one email — a draggable editor, cleaner MCP OAuth, stronger run controls, and plenty more. Headliners first, the full list at the end.


April 2026

Workflow Copilot v2

The workflow copilot got a lot more hands-on. Hard-cancel a run mid-flight instead of waiting it out, and follow it live as it streams status narration and surfaces block-level events (starts, completions, errors) while your workflow runs. When a test trips, it hands back the work-in-progress workflow so you can grab the draft and keep building. Self-healed retries are now visually separated from real failures, and a quick /discover in chat passes the whole build straight to the copilot.

Workflow Copilot v2 — live status narration with a streaming sub-event timeline, a self-healed retry shown apart from errors, and a hard-cancel button

MCP OAuth, Across Clients

Connecting Skyvern over MCP is now a clean OAuth flow. Multi-org users get an organization picker right in the authorize step. Skyvern runs as a native OpenAI Apps SDK app, ships safety hints for the ChatGPT connector, documents Codex remote OAuth end-to-end, and adds OpenClaw as a new target (skyvern setup openclaw). Under the hood: production OAuth callbacks, plus per-tool latency and error telemetry.

MCP OAuth — an authorize dialog with an organization picker for multi-org users, alongside supported clients: Claude, ChatGPT/Apps SDK, Codex, and OpenClaw

skyvern schedule + skyvern config CLI

Your terminal is now a control panel. skyvern schedule creates, lists, toggles, and deletes workflow schedules without leaving the command line. skyvern config reads and writes org-level settings — timeouts, concurrency, default proxy — the same way. Both run over the MCP protocol, so Claude Code, Codex, and other MCP clients can use them through the same interface.

Terminal showing the new skyvern schedule and skyvern config CLI commands — listing schedules in a table, creating a cron schedule, and reading an org-level config value

AI Summarize for Workflow Outputs

Stop squinting at nested JSON. A new Summarize with AI button sits next to every block and workflow output — one click turns raw data into a plain-language summary of what your workflow extracted.

AI Summarize for workflow outputs — raw JSON on the left and a generated plain-language summary on the right, produced by a "Summarize with AI" button

Workflow-Level Error Code Mapping

Set your error handling once — not on every block. Workflows now take an error_code_mapping field that every block inherits automatically. Define the defaults at the top, and override per-block only where you actually need to.

Workflow-level error code mapping — error codes mapped to messages once at the workflow level, with each block inheriting them and one block adding an override

Debugger Remembers Last-Used Values

Less busywork between test runs. The debugger now pre-fills the Run dialog with the values from your last run, so you're not re-typing the same inputs every time.

Debugger Run dialog pre-filled with parameter values from the last run, marked by a "Pre-filled from last run" badge

May 2026

Draggable Workflow Editor

The editor is now drag-and-drop — reorder your whole workflow, not just tweak blocks in place. Grab any block by its handle and drop it where it belongs: top level, inside a branch, deep in a loop. Collapse blocks and entire nested containers to focus, and your layout is remembered per workflow across sessions. It's fully keyboard-driven with screen-reader announcements, and every block's config autosaves as you type. (Recording mode still freezes drag, so a mid-record reorder can't shift block identities under the recorder.)

Draggable workflow editor — a workflow canvas with a grip handle on a block mid-drag, a drop indicator between blocks, several blocks collapsed to compact headers, and a global expand/collapse control

Redesigned Run Page

We redesigned the run page around a compact timeline sidebar and a dedicated detail panel — scan every block at a glance, then open any one for its inputs, output, and diagnostics. A live N-of-M completion counter tracks progress and visibly flags blocks that never ran. The app sidebar got a refresh in the same pass, for cleaner navigation everywhere.

Redesigned workflow run page — a compact timeline sidebar listing blocks with a live "7/12 blocks" completion counter, next to a detail panel showing the selected block's inputs, output, and diagnostics

While Loop Block

Loop until you're done — not a fixed number of times. The new While Loop block repeats a sub-workflow until your condition is met: paginate until there's no Next button, retry until a status flips, drain a queue until it's empty.

While Loop block — a workflow containing a While Loop block with a condition expression, wrapping a sub-workflow that repeats until the condition turns false

Analytics Dashboard

Your automations, by the numbers. The new /analytics page breaks down run volume, outcomes, and trends over time in one view — no exports, no spreadsheets. (Gated per-org, so it lights up only where it's enabled.)

Analytics dashboard — an /analytics page with charts of workflow run volume and success rate over time, plus per-workflow performance breakdowns

Workflow Copilot, Steadier

The copilot is steadier when sessions get long or messy. A persistent turn-narrative bubble tracks what the agent is doing — and survives page reloads, so you never lose your place. Hit an unexpected error? It's recoverable now: instead of a frozen chat, you get a clear explanation and a specific follow-up question drawn from what it actually found missing. The diff view finally catches changes nested inside conditionals and loops, and we cleared a batch of edge cases — credential scope leaking after an edit, session context lost across turns, accepted proposals getting dropped.

Workflow Copilot — a chat thread showing a persistent turn-narrative bubble summarizing the agent's current action, a recovered error with a clear explanation, and a diff panel highlighting a change nested inside a loop block

More Control Over Your Runs

Manage runs in bulk — cancel, retry, cap. The new Retry API re-runs a workflow from the start or from any block — no rebuilding inputs. POST /runs/cancel cancels many runs in a single request and tells you exactly which succeeded and which didn't. And two new guardrails — a runtime limit and a per-workflow step cap — stop runaway executions before they burn resources.

Run control APIs — an API panel showing POST /runs/cancel with a list of run IDs returning cancelled and failed arrays, a retry-from-block call, and workflow settings for a max runtime and a total step cap

Self-Hosting: Custom Proxies & CDP Auth Headers

Route any run through your own proxy — just pass {"proxy_location": {"url": "http://user:pass@host:port"}} on any task, workflow, or browser-session request; no managed proxy infrastructure required. And CDP connect headers are now a first-class workflow setting — your provider's auth headers are masked, stored, and applied to the CDP connection only, never forwarded to target sites. (Plus: msedge is the new default browser for self-hosted templates, and the Docker Compose quickstart bootstraps your .env files on first run.)

Self-hosting upgrades — a workflow settings panel with a custom proxy URL field and a masked CDP Connect Headers field, beside a terminal running the Docker Compose quickstart that auto-creates .env files

Browser Profiles, End to End

Saved browser profiles — the logged-in sessions your workflows reuse — are easier to set up and manage. A guided, step-by-step setup walks you through creating one, a dedicated management page lets you search, rename, and organize, and saving now gives you instant feedback with an in-progress row the moment you hit save.

Browser profiles — a dedicated browser profiles management page with search and rename controls, next to a guided step-by-step "create a profile" flow and an in-progress placeholder row appearing on save

Quick List — April & May

New features

  • Credits per run & run-level cost breakdown — a Credits badge on run pages, and task records now track LLM, proxy, and captcha costs as separate fields via the API
  • Credential Folders — organize credentials into named folders, with a "Move to folder" control and a folder filter that persists across credential tabs
  • Multiple schedules per workflow — independent schedules on one workflow (e.g. weekdays at 9am and weekends at noon)
  • Google Sheets connector (open source) — the Sheets auth flow is now part of the OSS package
  • Tagging API — tag CRUD, tag history, tag-key management, and batch tag endpoints
  • Blank-canvas workflow entry point — a "Blank Workflow" option plus an "Or start from a description" handoff to Discover
  • MCP setup in cloud settings — a dedicated MCP configuration section in account settings
  • Action Block selector + AI fallback fields — direct control over element targeting and fallback behavior
  • Block-scoped parameter prompts in the debugger — prompts scoped to the specific block under test
  • Runs tab on the browser session page — every run tied to a given session, in one place
  • Smarter element targeting — deterministic HTML element-tree compression for more reliable selectors
  • Saudi Arabia proxyRESIDENTIAL_SA added as a geolocation option
  • Password manager migration notice — UI banner when password manager settings need updating
  • Loop block failure-handling UI — clearer display of how failures are handled within loop blocks
  • OpenAI CUA model now configurable via feature flag

Improvements

  • Cross-run extraction cache — recurring scheduled workflows reuse extraction results across runs when page content hasn't changed, cutting redundant LLM calls
  • MCP runs are now classified as manual, matching UI-triggered model routing and queueing
  • MCP tool call performance telemetry tracks latency and errors across all MCP tool invocations
  • Generic remote browser vendor support — evaluate third-party CDP providers via a JSON config, including HTTP/HTTPS DevTools endpoints
  • Scheduled-trigger badge on the global Runs page
  • Faster GET workflow run API
  • Faster script-mode runs (skip speculative extract-action steps) and faster proxy selection in script mode
  • Reduced scroll-into-view settle time for snappier agent actions
  • LLM router retries with a fallback model when a response is cut off by a length limit
  • Prompt token cap prevents context-window overflows on high-complexity workflows
  • Extraction prompts capped in size before LLM calls, reducing 429 RESOURCE_EXHAUSTED errors on high-volume workflows
  • Hardened prompt input sanitization — untrusted page content is sanitized consistently at the template layer
  • Automatic webhook retry on transient infrastructure failures
  • Signed artifact content URLs with configurable expiry, plus presigned-URL fallback for self-hosted downloads
  • Archived-artifact status indicator on runs
  • Output-parameter size cap to avoid oversized payloads
  • Double-click action support and PDF embed detection inside multi-frame pages
  • Content blocking disabled for authenticated browser launches, improving login success rates
  • Browser dialogs (alert/confirm/prompt) surfaced to the agent so it can adapt
  • TOTP scoped to the active credential, and shown as placeholder text instead of pre-filled
  • Clearer Browsers vs. Browser Profiles descriptions, and a toast action-button polish pass
  • msedge is the default browser for self-hosted templates; quickstart bootstraps .env files
  • Fewer screenshots captured for API-triggered runs, lowering storage overhead
  • Auto-route browser type from browser profile source
  • Reduced scrape-phase mouse movement on recaptcha-protected sites
  • Renamed browser profile reset endpoint to /browser_session/reset_profile for clarity
  • Removed deprecated Anthropic and Bedrock Claude model configs