Cloud Architecture & Migration Services
Updated81% of organizations overspend on cloud. 37% are moving workloads back on-prem. Independent cost benchmarks, vendor rankings, and migration strategy data from 200+ real-world projects. Stop paying 3× for lift-and-shift.
Key Finding: Cloud Cost Overruns
81% of organizations overspend on cloud by >30%. 37% are moving workloads BACK on-prem due to bill shock. The "cloud savings" promise is a myth unless you refactor — and vendors know you won't budget for that upfront.
Cloud migration encompasses everything from simple VM rehosting to full cloud-native refactoring. As of 2026, 35% of enterprise workloads still run on-premise; the remaining 65% is split across AWS (32%), Azure (23%), and GCP (10%). The strategy you choose determines whether you save money or spend 3× more for the same performance.
+ Read full background
As of 2026, the narrative that "everything should be in the cloud" is fracturing: 37% of organizations report moving at least some workloads back on-premise due to cost overruns, data sovereignty requirements, or performance issues. Cloud repatriation is no longer a fringe position — it's a rational response to misaligned migration strategies.
This research hub covers the full cloud architecture landscape: migration strategies, cost benchmarks from verified implementations, vendor intelligence, and the decision frameworks that separate successful migrations from expensive failures. All data is drawn from our analysis of 200+ real-world cloud migration projects.
Why Cloud Migration Matters Now
The business case for cloud migration is driven by four forces converging simultaneously. First, datacenter economics are shifting. Colocation costs have risen 15-25% since 2022 due to power density demands, and hardware refresh cycles (every 4-5 years) create capital expenditure cliffs that boards increasingly refuse to fund. Organizations that delay cloud migration don't save money - they accumulate infrastructure debt that compounds with each hardware generation.
Second, talent availability favors cloud skills. Engineers who can manage VMware vSphere clusters are aging out of the workforce, while new graduates enter with AWS certifications and Kubernetes experience. The salary premium for on-premise infrastructure specialists now exceeds cloud engineers by 20-30% - and the gap is widening. Staying on-premise means paying more for a shrinking talent pool.
Third, compliance frameworks are evolving. SOC 2, HIPAA, and PCI DSS auditors increasingly accept cloud-native security controls (IAM policies, encryption at rest, VPC isolation) as equivalent to or better than physical controls. What was once a reason to stay on-premise - "our auditors require it" - is now a reason to migrate, since cloud providers invest billions in compliance infrastructure that no individual organization can match.
Fourth, competitive pressure is real. Organizations that have successfully migrated to cloud-native architectures can deploy new features in hours instead of months, scale to meet demand spikes without capital planning, and experiment with new markets at a fraction of the cost. If your competitors are iterating 10x faster because they're not waiting on procurement cycles, the business case writes itself.
Assessment & Cloud Readiness
The most common cloud migration failure starts before a single workload moves: skipping the assessment phase. Organizations that jump straight to migration without evaluating their portfolio spend 40-60% more than those who assess first, because they discover incompatibilities, licensing traps, and dependency tangles mid-migration - when changing course is expensive.
A proper cloud readiness assessment evaluates every application across five dimensions: technical compatibility (does it run on Linux? does it require specific hardware?), data gravity (how much data does it produce and consume, and where does that data need to live?), licensing exposure (will Oracle or Microsoft licensing costs explode in the cloud?), team capability (does your operations team know how to manage this in AWS/Azure?), and business criticality (what's the cost per hour of downtime during migration?).
Cloud Readiness Assessment Framework
Migrate immediately
- Stateless web applications
- Containerized services
- Dev/test environments
- Applications with elastic demand patterns
Migrate with caution
- Oracle/SQL Server databases (licensing traps)
- Monolithic applications with tight coupling
- Workloads with data sovereignty requirements
- High-throughput, low-latency systems
Keep on-premise
- Mainframe workloads (migrate separately)
- Hardware-dependent applications (SCADA, IoT gateways)
- Applications scheduled for retirement (<2 years)
Replace entirely
- Custom CRM → Salesforce
- Custom HR → Workday
- Legacy ERP → cloud-native ERP
- Any app where SaaS exists and fits
The output of a readiness assessment is a prioritized migration wave plan: which applications move first (quick wins with low risk), which need refactoring before migration, which stay on-premise, and which get replaced with SaaS. Without this, you're migrating chaos - and cloud doesn't fix chaos, it amplifies it.
Migration Strategy Options: The 7 Rs
Every application in your portfolio maps to one of seven migration strategies. The right choice depends on the application's business value, technical debt, and your organization's cloud maturity. Choosing wrong costs 2-5x more than choosing right - so this decision framework is worth getting right.
Rehost (Lift & Shift)
Move VMs to cloud as-is. Fastest approach (weeks), lowest upfront cost ($400K), but highest long-term operating cost. Monthly cloud bills typically 2-4x higher than on-premise because you're paying cloud prices for on-premise architecture that wasn't designed for elastic pricing.
Best for: datacenter exit deadlines <6 months
Replatform (Lift & Optimize)
Move to cloud with minor optimizations - swap self-managed databases for RDS, use managed load balancers, adopt cloud-native logging. Moderate cost ($650K), 30-40% savings over pure rehost. The sweet spot for most enterprise applications.
Best for: standard enterprise applications, 6-12 month timeline
Refactor (Cloud-Native)
Rearchitect the application for cloud-native services - serverless, containers, event-driven patterns. Highest upfront cost ($1.8M) but 60-80% lower operating costs long-term. Break-even in 2-4 years. Only justified for strategic applications with 10+ year lifespans.
Best for: core revenue-generating applications, high-traffic systems
Repurchase (Replace with SaaS)
Replace custom-built software with a commercial SaaS product. Lowest total cost ($150K migration + subscription) for applications where commercial alternatives exist. Eliminates maintenance burden entirely.
Best for: CRM, HR, ERP, email - anything a SaaS vendor does better
R5: Retain
Keep on-premise. Valid for hardware-dependent, data-sovereign, or soon-to-retire applications.
R6: Retire
Decommission the application entirely. 20-30% of portfolio is often zombie software nobody uses.
R7: Relocate
Move VMware workloads to VMware Cloud on AWS/Azure. Fastest path but highest lock-in risk.
Decision Framework: When to Choose What
The strategy decision comes down to three variables: business criticality (how much revenue does this application generate?), technical debt (how much rework is needed?), and timeline pressure (when does the datacenter contract expire?).
| SCENARIO | STRATEGY | COST | TIMELINE |
|---|---|---|---|
| Datacenter exit in <6 months | Rehost | $400K | 2-4 months |
| Standard enterprise app, no urgency | Replatform | $650K | 6-12 months |
| Core revenue app, 10+ year lifespan | Refactor | $1.8M | 12-24 months |
| CRM, HR, or ERP replacement | Repurchase | $150K | 3-6 months |
| VMware estate, Broadcom pricing pressure | Relocate or Replatform | $500K-$2.5M | 6-18 months |
| Nobody uses it | Retire | $0 | Immediately |
The most expensive mistake is applying the same strategy to every application. A 300-application portfolio typically breaks down as: 20% rehost (datacenter exit urgency), 40% replatform (standard apps), 15% refactor (strategic apps), 15% repurchase (SaaS replacements), and 10% retire (zombie software). Organizations that rehost everything save nothing; organizations that try to refactor everything never finish.
Risk Factors & Common Failure Modes
Our analysis of 200+ cloud migration projects reveals five failure patterns that account for 80% of cost overruns and missed timelines.
1. Licensing landmines
Oracle, Microsoft SQL Server, and SAP licensing terms change dramatically in the cloud. Oracle charges per-vCPU in cloud (not per-core), potentially doubling costs. SQL Server licensing on AWS requires dedicated hosts or BYOL - miscalculating this has bankrupted migration budgets. Always audit licensing before migration, not during.
2. Data gravity underestimation
Applications that produce or consume large datasets (analytics, data warehouses, ML pipelines) cost dramatically more to migrate than expected because of data transfer time and egress fees. A 50TB database takes 4-6 weeks to migrate over a 1Gbps connection. Organizations that don't plan for data gravity end up running hybrid architectures indefinitely - the worst of both worlds.
3. Operational skill gap
Migrating workloads to AWS doesn't automatically mean your ops team knows how to manage them there. CloudWatch is not Nagios. IAM policies are not firewall rules. The #1 post-migration issue isn't performance - it's incidents taking 3x longer to resolve because the team is learning production AWS in real-time.
4. Network architecture mismatch
On-premise networks are flat; cloud networks are VPCs with security groups, NACLs, transit gateways, and peering connections. Applications that rely on broadcast traffic, multicast, or low-latency inter-service communication often break in cloud networking models. This is discovered in testing (if you're lucky) or production (if you're not).
5. Premature microservices decomposition
68% of teams that decompose monoliths into microservices during cloud migration report increased complexity, not improved velocity. The migration itself is risky enough without simultaneously rearchitecting the application. Migrate first, then refactor - not both at once.
Cost & Timeline Data
Real cost data from verified cloud migration implementations. See full cost data for all migration types.
| MIGRATION PATH | COST RANGE | MEDIAN COST | TIMELINE | SUCCESS RATE |
|---|
True Cost of Cloud Migration Approaches
Serverless vs. EC2: The Real TCO
| WORKLOAD TYPE | EC2 (t3.xlarge) | LAMBDA | WINNER |
|---|---|---|---|
| Low Traffic API (5% util) | $120/mo | $8/mo | Lambda (93% ↓) |
| Scheduled Jobs (1hr/day) | $120/mo | $2/mo | Lambda (98% ↓) |
| Bursty Traffic (30% avg util) | $120/mo | $42/mo | Lambda (65% ↓) |
| 24/7 High Traffic (80% util) | $120/mo | $290/mo | EC2 (58% ↓) |
| Rule: Lambda wins if utilization <50%. EC2 wins if >70%. | |||
* Hidden cost: API Gateway adds $3.50 per million requests. Use Lambda Function URLs for internal APIs to avoid this "tax."
Cloud Provider Market Share 2026
Implementation Patterns
Successful cloud migrations follow predictable patterns. The organizations that finish on time and on budget share three common practices.
Wave-based migration
Move applications in waves of 5-15, starting with low-risk, low-dependency workloads. Wave 1 is always the "learning wave" - expect it to take 2x longer than planned. By Wave 3, your team has established runbooks, automated deployment patterns, and resolved the 80% of issues that repeat across workloads. Organizations that try to migrate everything at once ("big bang") have a 70% failure rate.
Strangler fig for monoliths
For monolithic applications that need refactoring, the strangler fig pattern routes new traffic to cloud-native services while the legacy system continues handling existing functionality. Over 12-18 months, the monolith shrinks as features migrate to the new architecture. This pattern has a 85% success rate compared to 45% for full rewrites.
Landing zone first
Before migrating any workload, establish a cloud landing zone: VPC architecture, IAM policies, logging and monitoring, cost allocation tags, and security baselines. This takes 4-8 weeks but prevents the "snowflake account" problem where every team creates their own AWS account with different security settings, naming conventions, and cost structures. AWS Control Tower, Azure Landing Zones, and GCP Organization Hierarchy all provide blueprints.
Success metrics
Measure cloud migration success across four dimensions, not just cost:
- Cost efficiency: Monthly cloud spend vs. on-premise baseline (target: 20-40% reduction within 18 months post-migration)
- Deployment velocity: Time from code commit to production (target: hours, not weeks)
- Reliability: Mean time to recovery (MTTR) and uptime percentage (target: <5 min MTTR, 99.9% uptime)
- Operational maturity: Percentage of infrastructure managed as code (target: >90% within 12 months)
Migration Guides
Definitive technical guides for specific cloud migration paths.
Cloud Migration Services
Get a cloud readiness assessment, optimize with FinOps cost optimization, or plan a Kubernetes migration.
Vendor Intelligence
Independent comparison of cloud migration implementation partners. Search all 170+ vendors.
Top Cloud Migration Services Companies
| Company | Specialty | Cost | Our Rating ↓ | Case Studies |
|---|---|---|---|---|
| Thoughtworks | Cloud-Native Refactoring | $$$ | ★4.6 | 8 |
| Google Cloud Consulting | Kubernetes & GKE | $$$ | ★4.5 | 12 |
| Onica (Rackspace) | AWS Migration Factory | $$$ | ★4.4 | 16 |
| DoiT International | Cloud Cost Management | $$ | ★4.4 | 8 |
| Microsoft Azure Consulting | Enterprise Hybrid Cloud | $$$$ | ★4.3 | 18 |
| AWS Professional Services | Serverless & Event-Driven | $$$$ | ★4.2 | 15 |
| VMware Tanzu Labs | Platform Engineering | $$$ | ★4.0 | 11 |
| Deloitte | FinOps & Cost Optimization | $$$$ | ★3.9 | 14 |
| Accenture | Large-Scale Transformation | $$$$ | ★3.8 | 22 |
| IBM Consulting | Mainframe-to-Cloud | $$$$ | ★3.7 | 10 |
Thoughtworks
Cloud-Native Refactoring
Kubernetes & GKE
AWS Migration Factory
DoiT International
Cloud Cost Management
Enterprise Hybrid Cloud
Serverless & Event-Driven
Platform Engineering
Deloitte
FinOps & Cost Optimization
Accenture
Large-Scale Transformation
IBM Consulting
Mainframe-to-Cloud
Cloud Migration FAQ
Q1 How much does cloud migration really cost?
$400K to $1.8M for enterprise workloads, depending on approach. Lift-and-shift (rehost) costs $400K but increases monthly bills 2-4x. Cloud-native refactoring costs $1.8M upfront but saves 60-80% long-term. The 'cheap' option isn't cheap.
Q2 Why are 37% of companies moving workloads BACK from cloud?
Bill shock from lift-and-shift. They migrated VMs to EC2 without refactoring, paid cloud prices (opex) for on-prem architecture (designed for capex). Monthly bills went from $180K/year on-prem to $400K/month in cloud. Repatriation breaks even in 18 months.
Q3 Is serverless actually cheaper than EC2?
Only if utilization is <50%. A t3.xlarge EC2 costs $120/month. Same workload on Lambda at 30% utilization: $42/month (65% savings). But at 80% utilization, Lambda costs $290/month (2.4x MORE than EC2). Rule: Lambda for bursty/low-traffic, EC2 for 24/7 high-traffic.
Q4 What's the hidden cost vendors don't tell you about?
Data egress charges. AWS charges $0.09/GB to move data OUT of the cloud. A 500GB/month app costs $540/year just in egress. Multi-cloud strategies can cost $15K-$50K/year in cross-cloud traffic. Vendors hide this in fine print because it's pure profit.
Q5 Should I hire AWS/Azure/GCP or a consulting firm like Accenture?
Hyperscalers (AWS/Azure/GCP) are biased toward their services and rated 4.2-4.3. Specialists like Thoughtworks (4.6) or Onica (4.4) are vendor-neutral and 20-40% cheaper. Big 4 consultants (Accenture 3.8, IBM 3.7) cost $$$$, slower, but needed for Fortune 100 governance.
Q6 When does 'lift and shift' make sense?
ONLY for datacenter exit deadlines <6 months. Example: Your colocation contract ends in 4 months, no time to refactor. Lift-and-shift buys you 12-24 months to plan a proper cloud-native migration. Otherwise, you're paying 3x for the same architecture.
Q7 What's the break-even point for refactoring to cloud-native?
2-4 years. Refactoring a 500K LOC monolith costs $1.2M-$4M. Monthly savings: $4K-$8K from autoscaling and serverless. Break-even at $1.2M ÷ $6K/month = 16.7 months (best case). Worst case: 4 years. Only worth it for strategic apps you'll run 10+ years.
Q8 Why do most microservices projects fail?
68% report increased complexity, not improved velocity. Root cause: too fine-grained (140 services), network latency (47 cross-service calls per request), and data consistency nightmares (customer address in 5 places). Start with a modular monolith. Break apart only when you have 50+ engineers.