Re: Cloud versus on-prem in depth.
Trevor,
yes you come strong, and dont have all the facts straight, i happen to be deeply involved in those Clouds and know how they really work. i'm not a fan of public cloud, but think they have better architecture than traditional ones, and we should adopt their approach regardless if its Public/SP/on-prem. we can learn a lot from their innovation, most Enterprise tech haven't changed for 20 years.
if so many ppl left the cloud, it doesn't explain the surge in AWS and Azure revenue, i attended re:invent can tell you many Enterprise guys were there, some have a Cloud ONLY strategy ("CEO said we wont own any HW")
Fact #1, Azure doesnt use SAN/HCI, they use distributed FS (SMB3) and Blob store over 40/50GbE & RDMA, can point you to public links, VM images are files but many SaaS/PaaS offerings use files directly, they perform faster than most block storage, on the storage you can see their design is lots of NVMe & JBODs. millions of Office 365 users including myself and it works like a charm, better than the Exchange i used to have running over NetApp.
Fact #2, most data in the cloud is in those PaaS, SaaS and object not AWS EBS which is more expensive and limited, and all those file/object/db/.. PaaS & SaaS are using DAS or object for capacity. again hardly any SAN or HCI.
Fact #3, 80-100 bay 3.5" JBODs cost ~$5K attach 4 of those to couple of $5K servers and you have a $30K overhead on 3-4PB HDD storage, can add NVMe in those 2 servers for Metadata & Cache or use 2U 2.5" 60-72 bay JBOFs if you are after performance. this is probably 50-25% the raw HW cost of those HCI/Scale-out storage (Coho, Cohesity, Rubric, ..) with 2-4 servers in a 2U box with only 12-24 drives. we can go over the Excel offline, i did it so many times.
Fact #3, File and object doesn't have to be slow, MS can demo more than 1M IOPs FS, iguazio does >2M IOPs file access and 700K IOPs S3 (Chris saw that 1st hand), traditional all flash NAS guys are <200K IOPs and traditional S3 is ~10K IOPs but its an implementation issue. dont know how many block storage providers do 2M IOPs on 1:1 client:server.
Fact #4, Databases have evolved quite a bit, no more fixed schema rather dynamic + column compression, more use of Flash and tiering at the DB level, you will soon see others adopting our approach of using NVRAM for DB Journals, hot-metadata & Cache, how do you snapshot something that store data in 3-4 independent tiers ? how can you compress/dedup data that is already compressed at the DB level? Traditional DBs dont scale across more than few nodes simply since they work in lock-steps with the SAN, but new DBs from ALL vendors use DAS, distribute table spaces and coordination at the DB transaction level which is the only way to scale, yes you can shout as much as you want but seem to miss a lot of background.
Fact #5, Vertically integrated Databases are faster than over SAN or HCI, Checkout Exadata, Vertica, AWS Aurora as example, or iguazio . 1. with HCI you have two layers of coordination and network chatter 2. new Databases organize data in disk optimal fashion, striping and other storage smarts just degrade perf. yes, if you use MySQL which has a miserable architecture (real all rows/cols in a range for every query) it wont matter much, perf will always suck, just give it some flash so updates and indexes wont kill you.
MySQL will give you 10Ks TPs even if you give it a 1M IOPs flash, OTOH iguazio will get you 2M TPs (100x faster) because we manage the transaction all the way from the incoming RPC to the low-level memory page or sector w/o any Page-Cache/FS/Array serialization.
Yes, i agree orgs have many "Pets" and cant transition so fast, but they do it in steps e.g. move files from NAS to object, move to consume DB as a service with ODBC/JDBC, etc. some use consultants that do "App Modernization" (BTW not sure you heard but today most clouds offer non-disruptive live data migration for your DBs & Apps).
on the same time they build new strategic Apps that focus on Cloud and Mobile First, continuous customer engagement, real-time analytics. they do it because they are so afraid a startup like Uber or pay-pal or their competitor will disrupt their business. for the new Apps they need agility and elasticity, most IT shops cant handle those new "Cattle" Apps at the agility they need so some do it in the cloud.
would love to give you a deep dive 1:1 to show you why some of the myths you got used to are just myths.
Yaron