GPU access is expensive and fragile
AWS and GCP GPU instances cost $1–30/hr with 10-minute cold starts. You're paying for idle time while CUDA installs.
Your ML workflow deserves better than SSH tunnels and dead notebooks. Provision A100s in 30 seconds. Sync files in milliseconds. Run jobs that survive your laptop closing.
The problem
You've felt this. The GPU quota request. The notebook that worked yesterday. The training job that died when your WiFi dropped.
AWS and GCP GPU instances cost $1–30/hr with 10-minute cold starts. You're paying for idle time while CUDA installs.
16GB RAM doesn't train transformers. You need 64GB+ and GPUs — provisioned on demand, not bought upfront.
Every new machine means reinstalling CUDA, PyTorch, drivers. Days lost to dependency hell before you write a line of code.
Sharing a Jupyter server means SSH tunnels or JupyterHub configs. Each teammate needs isolated environments, not shared passwords.
SSH disconnects kill your training run at hour 47. Detached jobs keep running — results are waiting when you come back.
Notebook execution isn't reproducible across machines. Sandboxed environments with pinned runtimes eliminate the guesswork.
Features
One CLI. GPU environments. File sync. Detached jobs. No YAML required.
Jupyter and VS Code in the cloud with instant GPU access. T4, A100, or H100 — ready in ~30 seconds.
<50ms latency syncing local files to cloud workspaces. Edit locally, execute on GPU — no git push dance.
Run .ipynb files as background jobs. Stream logs, collect artifacts — training survives your commute.
PVC-backed workspaces with checkpoint and restore. Your datasets and models persist across sessions.
Environments auto-suspend after 30min idle. Resume in <10 seconds. Pay nothing while you sleep.
Per-second compute billing. No monthly GPU reservations. See exactly what each job cost before you scale.
How it works
No Terraform. No Kubernetes manifests. Just a CLI that speaks your language.
Spin up a GPU environment from your terminal. Jupyter or VS Code, pre-configured with CUDA and PyTorch.
$ flexx up --project image-classifier --gpu a100 Provisioning environment... ✓ Environment ready in 28s Jupyter: https://env.flexx.dev/abc123
Stream local files to your cloud workspace with sub-50ms latency. Edit in your IDE, run on GPU.
$ flexx sync . ✓ Synced 847 files (42ms avg latency)
Execute notebooks as detached jobs. Stream logs, download artifacts. Suspend when done — resume in seconds.
$ flexx run train.ipynb --gpu a100 Job started: job_7x9k2m $ flexx artifacts job_7x9k2m --download Job completed · 2.4 GPU-hours · $1.87
Pricing
Per-second billing. Scale to zero. No surprise bills from forgotten instances.
Planned for launch — join early access to lock in these rates
$10 credits included
Explore flexx with no credit card required.
+ pay-as-you-go credits
For individual ML engineers shipping real work.
Volume discounts available
For teams with shared billing and isolated environments.
Billed per second. Environments auto-suspend after 30min idle — storage billed separately at $0.10/GB/mo.
Early Access
We're building flexx for engineers who ship. Sign up for early access — if there's demand, we ship.