Imagine losing power for a few hours or even the whole day because of some issues at a power station due to weather or a fuse. You cannot watch TV, no network connection, no heater, no ability to cook anything, the food in the refrigerator goes bad and the worst thing is - you cannot charge your phone! Now you stumble through the darkness looking for a torch. Doesn’t that feel like being whisked back to the stone Age? No wonder losing power is called a blackout. This is usually the point in your life when you take an oath to start talking to solar companies for a roof panel and purchase a standby power generator for tertiary backup. Just as power is fundamental to the 21st-century lifestyle, uptime is crucial for companies undergoing digitization with cloud-managed networking. Don’t you wonder how Extreme designs for CloudOps?
Image courtesy giphy.com
Let us start by defining a few terms. The popular term DevOps streamlines the processes associated with software development(Dev) and IT Operations(Ops) and aims to accelerate development lifecycles without compromising quality. CloudOps, on the other hand, is short for Cloud Operations which combines DevOps and the optimization of IT services for cloud-based solutions. Continuous operation means that we keep the cloud services running while providing you frequent new innovations. How often is regular - how about hundreds of updates in 365 days! Every single process from onboarding your devices, authenticating users, setting your network configuration, ongoing observability of your network, and user experience is built in the cloud. To properly architect a highly available cloud-managed solution with various services is a herculean task and takes talent as well as effort. To keep this blog short; let us focus on how we design the authentication mechanism. With millions of worldwide customers logging daily to ExtremeCloud IQ to manage their networks, this service needs to support large scale, low-latency, and high availability. Additionally, the database needs to be backed-up for disaster recovery. Now let us dig into the authentication architecture.
The Lego blocks of availability begins with the placement of the GDC, or the Global Data Center that is geographically disbursed between the US and Europe and load balanced. To maintain data localization or residency, the login information for EU customers exists only in the European instance. When users login from Asia Pacific region, the appropriate GDC to be utilized is determined based upon the measured latency. In addition to serving as the primary authentication mechanism to the ExtremeCloud IQ, the GDC also performs device redirection and other global services as required. All instances of the GDC are hosted within Amazon AWS. The RDC, or Regional Data Center, is hosted among various cloud providers depending on data retention time and location. For more details about the architecture don’t forget to check out this whitepaper.
The authentication service uses two types of data storage services. When you log in once to the GDC, the token is stored in the local GDC and it replicates to multiple remote GDCs. Some data is stored in a database while other is stored in an in-memory cache.
To recap, designing for Continuous Operations is a Herculean effort and this only covers the Authentication Server part of the GDC. If you want to monitor the availability of all the services in your own region, you can see them like this. To learn more about ExtremeCloud IQ, keep monitoring this space!
This blog was originally authored by Jeevan Patil, Senior Director, Product Management