Reducing infrastructure lock-in: a pragmatic approach
Vendor lock-in has become a hot topic since Broadcom. Some teams talk about it with a sense of principled urgency, as if depending on a vendor were inherently a governance failure. It's not that simple.
Every infrastructure depends on something: a hypervisor, an OS, a network protocol, a backup solution. Zero lock-in is an illusion. The real question is: what do you depend on, to what degree, and what does changing cost?
Understanding your real dependency before wanting to reduce it
The first step isn't a migration plan. It's an honest inventory.
VMware dependency, for example, isn't uniform. An environment using only vSphere with external NFS storage and Linux VMs has relatively limited hypervisor dependency — the hypervisor can be replaced without touching the applications or most of the network.
An environment using NSX for microsegmentation, vSAN for distributed storage, Aria for automation, and vCenter with custom PowerCLI scripts for all routine operations — there, the dependency is deep and the exit cost is real.
The inventory covers:
- "Pure" hypervisor dependencies (VMs running on vSphere)
- Network dependencies (NSX, vDS, custom port groups)
- Storage dependencies (vSAN, VMDK-linked processes)
- Tooling dependencies (scripts, APIs, ITSM/monitoring integrations)
- Human dependencies (certifications, reflexes, internal documentation)
Progressive reduction: by component, not by revolution
The best lock-in reduction trajectory isn't "migrate everything now." It's reducing dependency component by component, starting with the least risky migrations where the gain is measurable.
Typical priority order in a mixed environment:
1. New workloads first on the target — don't deploy new VMs on VMware if the goal is exit. Every new VM deployed on VMware is one more VM to migrate later.
2. Non-critical workloads first — development, test, staging. These environments can absorb a cutover without business impact, and they allow you to refine procedures.
3. Tools and scripts — replace PowerCLI scripts with equivalent solutions on the target platform. This work can be done in parallel with migrations, and it will reduce operational dependency before all VMs have migrated.
4. Critical production last — with validated rollback, defined go/no-go criteria, and the organization capable of absorbing a high-risk operation.
What you can preserve from VMware during the transition
Just because you're reducing VMware dependency doesn't mean you need to migrate everything simultaneously.
Keeping some VMware components temporarily can be rational:
- vCenter as a hybrid management interface during coexistence
- Application components deeply integrated with NSX — until the network redesign is planned and separately budgeted
- Some critical workloads where migration timing cannot be constrained by the overall program pace
Coexistence isn't failure. It's an architectural posture that acknowledges that transformations take time and production cannot be endangered in the name of architectural coherence.
Change governance: the forgotten dimension
Lock-in reduction is a transformation program. It needs a clear sponsor, a calendar, dedicated resources, and a decision process for exceptions.
Without formal governance, lock-in naturally reconstitutes itself. Teams revert to familiar tools. New workloads are deployed on the platform they know. Architectural debt accumulates.
What must be formalized:
- The target platform by workload type — documented and validated by IT leadership
- The exception process — how to deviate if a constraint justifies it
- Program completion criteria — when can you consider dependency reduced to an acceptable level?
Mistakes to avoid
Confusing lock-in reduction with cost reduction — the two can coincide, but they're not the same objective. A VMware → Proxmox migration can reduce vendor dependency and cost more in the first year (if you include coexistence license costs, training, and residual incidents).
Migrating without maintaining rollback capability — reducing VMware lock-in by putting yourself in a situation where you can't return if necessary isn't autonomy. It's uncontrolled risk.
Forgetting human dependency — replacing VMware with Proxmox changes the lock-in profile, not its nature. If all Proxmox expertise rests on a single engineer, the lock-in is simply shifted from a vendor to a person.
What it looks like when done well
The organization that has managed its lock-in reduction well doesn't miss maintenance windows waiting for an architecture decision. It operates its platform with its own tools, its own tested procedures, and a team that understands what it operates.
It can also, if a new vendor presents a better proposition in 3 years, evaluate that proposition on its technical and economic merits — without being forced to stay or leave for uncontrolled reasons.
That's what sovereign architecture looks like.