Cloudops and cloudops tools (such as AIops) are important, but you can have too much of a good thing Credit: rclassenlayouts / Aleksei Derin / Getty Images Cloud operations, or cloudops, is becoming the problem to solve as enterprises operationalize their cloud and multicloud deployments. The ideal pattern of management involves a pragmatic approach to operations by leveraging tools to automate predetermined procedures and runbooks. Right now, many overwhelmed managers toss cloudops tools at the problems and hope for the best. The issue is one of balance. To put cloudops solutions and tools to their best uses, you need to define why and what before you determine how. It’s not that the tools are unimportant; rather, the tools should meet specific requirements, such as performance, security, cost management, governance, automation, and self-healing. The operational aspects of cloud computing are the highest-priority concepts in my mind. Almost all other components of cloud computing have dependencies into cloudops. For example, security breaches can be spotted by process saturation monitoring from an AIops tool. I’ve defended against breach attempts in the past. The first indications came from cloudops tools, not from security systems. The focus needs to be on the optimal suite of cloudops tools that will best meet the optimal requirements. I’ve been calling this a “minimal viable cloudops toolset,” or MVCOT. The ideal MVCOT is a streamlined collection of tools that are optimized for the cloud and noncloud systems under management. How do you find this MVCOT nirvana? I would suggest the following: Define the current, short-term future, and long-term future state of the systems that will be under management. This seems like a no-brainer, but traditional on-premises systems are often left out of the mix and lack representation in the growth of system capacity. This will give you 65 percent of what you need to pick a MVCOT. Define the runbooks for each system. These will explain how each system needs to be managed to provide optimal service levels and minimize outages. If you think this is a lot of work, it is. You need to think about business continuity and disaster recovery issues, maintenance, scaling, security processes, etc. Now it’s time to select the right tools to make up your MVCOT. As your knowledge and experience evolves, you’ll have to replace or update tools on your initial optimal tools list. Although cloud architects and cloudops engineers are getting more informed about the use of cloudops tools, the temptation to never see a cloudops tool you didn’t like is still enticing. I urge you to put some additional thought into your purchase strategy for cloudops tools. Bonus: The rework you do will be minimal. See what I did there? Related content analysis Azure AI Foundry tools for changes in AI applications Microsoft’s launch of Azure AI Foundry at Ignite 2024 signals a welcome shift from chatbots to agents and to using AI for business process automation. By Simon Bisson Nov 20, 2024 7 mins Microsoft Azure Generative AI Development Tools analysis Succeeding with observability in the cloud Cloud observability practices are complex—just like the cloud deployments they seek to understand. The insights observability offers make it a challenge worth tackling. By David Linthicum Nov 19, 2024 5 mins Cloud Management Cloud Computing news Akka distributed computing platform adds Java SDK Akka enables development of applications that are primarily event-driven, deployable on Akka’s serverless platform or on AWS, Azure, or GCP cloud instances. By Paul Krill Nov 18, 2024 2 mins Java Scala Serverless Computing analysis Strategies to navigate the pitfalls of cloud costs Cloud providers waste a lot of their customers’ cloud dollars, but enterprises can take action. By David Linthicum Nov 15, 2024 6 mins Cloud Architecture Cloud Management Cloud Computing Resources Videos