Week 2 - Cloud Technologies

 

By the end of this module, you will know what cloud computing can be used for, how to enable a cloud API, and how to navigate the cloud shell.

Module videos:


Note: there is a single lab manual associated for this module in Blackboard. You’ll be doing a few Codelabs to give you experience with the technologies, however for credit they are aggregated into a single assignment.

Cloud Technologies

The benefit of using cloud technologies are that you are effectively using somebody else’s servers. The downside is that you are limited to the technologies supported by that provider. For instance, you will probably be hard-pressed to try and use BigQuery (Google-specific technology for sifting through massive datasets) on DigitalOcean. However, you may not need that technology! Again, consider all tradeoffs when selecting a provider.

Since we are using Google Cloud for this course, we are going to focus on the technologies supported by Google. Here is a page with the full list of Google Cloud services.

For reference, here are the products supported by AWS and Microsoft Azure.

You will notice a lot of overlap! Consider that each are competing for cloud dominance and that each provider wants you to pay for their service.

Class Assignment: Check Blackboard for Assignment 2 (select and describe 5 separate services)

IaaS vs. PaaS vs. SaaS

Module video: Cloud Infrastructure [11:53]

There are three different (main) paradigms when it comes to cloud computing. You more than likely have seen these acronyms before, however we should still define them for propriety’s sake. First, we have Infrastructure as a Service (IaaS), then Platform as a Service (PaaS), and Software as a Service (SaaS). Each of these terms effectively means transitioning your thing to become a cloud-based service.

If you are getting lost with these acronyms, reflect on that first word. It is the key to what is being turned into a service.

Figure 1 presents a sample (c/o Google) of some of their different categories of cloud services. On the left you see Compute Engine, which is their IaaS platform. There are a lot of options available within Compute Engine, however the most prominent/well-known tends to be virtual machines. Next you have App Engine, their PaaS platform. App Engine enables you to create applications without having to worry about pesky things like servers or virtual machines. On the right are the SaaS applications, where these tend to be the “other” cloud services you have (and will vary from provider to provider). Here, the main consideration is not the service but the software – effectively you put data in, have some sort of “transformation” occur, and receive changed data out. Consider applications such as translate (e.g., Google Translate) or machine learning (e.g., TensorFlow) and you have SaaS. No servers to configure, no applications to develop, simply configure the application to your needs and feed it data.

Google Cloud Service Options

Figure 1: Google Cloud Service Options

Next, we’ll highlight some of the more common aspects of cloud computing that you are most likely to run into.

Want a different perspective? Check out this post: https://www.bigcommerce.com/blog/saas-vs-paas-vs-iaas/

Virtual Machines [IaaS]

One of the more common cloud services that are available are virtual machines (VM). These are effectively computers that you can setup that are accessible anywhere you have a network connection (and the necessary technology to access it – for Google Cloud it is simply a browser). Figure 2 shows you a common VM concept, normal vs. sole tenancy. The concept here is that sole tenant nodes are strictly provisioned to you and your workloads – there are no other users sharing that host.

Google Cloud Normal Host vs. Sole-tenant node

Figure 2: Normal vs. Sole-tenant nodes

Figure 3 then shows off a sample VM that I created for one of the labs we’re working through. Here, you see the instance name, a bit of information about the VM that is relevant (machine type, logging availability, CPU, etc.). Effectively, you can pick what type of machines you want, where you want them located, and what operating systems to load onto them. These can be provisioned as necessary for your environment, including scaling as needed.

Sample Virtual Machine

Figure 3: Sample Virtual Machine

With VMs comes other options as well, such as virtual networks, persistent storage, and everything you need to make a fully virtualized ecosystem. For example, Figure 4 shows off a virtual private cloud (VPC) network architecture (c/o Google). Here, you can specify firewalls, subnets, etc.:

VPC network

Figure 4: VPC network

Note there are different regions specified as well! These can correlate to geographic locations of your customers/clients and can provide a better quality of service.

The following video will go over how to edit files in both the Cloud Shell and a (Linux) VM, as well as how to upload/download files with your VMs:

Module Video - Editing and Uploading [11:05]

AutoScaling

Everything we’ve talked about so far, really, could be accomplished in a simple virtual environment. One of the reasons to move to the cloud though are the elastic benefits. One example is autoscaling. Figure 5 (c/o Google) demonstrates an architecture for enabling autoscaling. Here, the Compute Engine is used in combination with an operating system image to provide templated VMs. Together with Cloud Storage for scripting capabilities, these form an instance group that is tied to the Autoscaler.

Autoscaling

Figure 5: Autoscaling

To use Autoscaling, you require a policy to specify how scaling should work. If you define a Scale-out policy as shown in Figure 6 (c/o Google), you will see that VMs can be provisioned based on the target CPU utilization. On the left are three VMs running at 100% and one at 85%. On average, CPU utilization is at 96.25% (385/4), meaning that it is above the target value of 75% utilization. When detected, the Autoscaler will provision additional machines as necessary to be under that threshold (in this case, 385/6 = 64.16%)!

Scale-out policy

Figure 6: Scale-out policy

Note: scale-in policies are available as well, however I assume you can make the mental leap that you may also want your VMs to be over-utilized as well.

Codelab - Autoscaling

Internet of Things [IaaS/PaaS]

Ah, the Internet of Things (IoT), what may be considered to be the latest in moving infrastructure up to the cloud. All of the tiny networked devices that we have available (smart watches, sensors, monitors, etc.) have been turned into a cloud service as well. However, the devices themselves are not necessarily made into a virtual infrastructure service. The telemetry being sent between devices has been cloudified, turning this topic into something that falls between IaaS and PaaS (at least, to me).

There are multiple components available, however at its core you are transmitting, analyzing, and receiving the data between devices. This particular page lists out the highlights of Google Cloud IoT. Note that competing providers have similar services as well!

Google Cloud IoT Technologies

Figure 7: Google Cloud IoT Technologies

Microservices [PaaS/SaaS/Somewhere In-Between]

Microservices tend to fall into the buzzwordy camp (for me at least), but at their core they tend to be the result of breaking up larger applications into more bite-sized pieces. Typically the end up being small portions of an application that have been migrated to functions that accept data and transform it (or relay it) in some fashion. There are multiple approaches for creating microservices, however in the cloud realm you may wish to consider using a serverless technology such as Cloud Functions (or Lambda/Azure Functions) or an App Engine (PaaS) type of environment.

Both approaches are completely feasible and both rely on code development. Figure 8 shows an example of a microservice architecture (c/o microservices.io). Here, you see multiple aspects of what typically would be a monolithic application broken down into manageable services, each accessible via some sort of endpoint (e.g., web, API, RESTful call, etc.).

Microservice Architecture

Figure 8: Microservice Architecture

Containers / Orchestration

Containers and Kubernetes (or K8s) are a discussion in and of themselves, however they offer an extremely attractive proposition for both IT and developers alike: a sane environment that can be deployed anywhere without needing to worry about specifics like machine type or operating system. Effectively containers become a light VM environment that enables cross-platform deployments. Kubernetes offers orchestration of containers, in essence, container management at scale. Figure 9 (c/o DZone) shows a sample Kubernetes architecture, including multiple worker nodes that contain Docker containers.

Kubernetes Sample Architecture

Figure 9: Kubernetes Sample Architecture

Analytics / Artificial Intelligence / Machine Learning [SaaS]

There are a plethora of available SaaS technologies available in Google Cloud, ranging from the most basic data analytics to advanced TensorFlow capabilities that can make accurate predictions based on an input dataset. Tools such as these will enable you to create highly-targeted applications and microservices with a highly-cohesive goal. We will be digging into these later on, however peruse this page to get a feel for some of the different technologies available.

Why are these categorized as SaaS you may ask? Well, you’re not writing the application right? You may be tweaking it, configuring it, etc., however you’re mainly feeding it data and expecting data out. Software as a service means that the software itself has been cloudified (naturally, along with the platform and infrastructure as well)!

Data Management [SaaS]

Many of the APIs available in the cloud provide some sort of data management and/or transformational capabilities. When we talk about management however, we generally are discussing some form of data storage. In Google Cloud this is BigQuery, a data management system that scales up to the big data range. As you’ll see later, there is also a specific Cloud shell program (bq) to interact directly with your data. With BigQuery comes application interaction, varying forms of database commands (both SQL and BigQuery specific), and analytics capabilities. We’ll go into more detail on BigQuery (and its associated storage technique BigTable) later on.

Demos - website hosting, failover network, API, cloud shell

Cloud Shell

Cloud Shell (or the Cloud SDK) is a command-line interface (CLI) for interacting with Google Cloud services, plus it provides a Linux-like interface that you may be familiar with. Here are the Google-specific commands to become familiar with:

  • gcloud

gcloud is your main interface with Google Cloud via the CLI. You can perform authentication, change your local configuration (e.g., project ID, user account, etc.), and interact with the various APIs. For instance, one of your labs will have you using gcloud to deploy and re-deploy App Engine programs.

  • gsutil

gsutil allows you to work with Cloud Storage buckets and objects. Effectively, this is your program for working with files and data that you are keeping in persistent storage.

Note that this command is being deprecated in favor of gcloud storage.

  • bq

bq interacts with BigQuery, Google’s service for interacting with large datasets. Here, you can run queries, manipulate datasets/tables, and interact with other entities that are part of the BigQuery ecosystem.

Figure 10 shows a sample screenshot of my Google Cloud console, with Cloud Shell activated. Note that you can open it by clicking the little terminal icon at the top right, and it will pop open the shell in the bottom of your screen.

Cloud Shell Screenshot

Figure 10: Cloud Shell Screenshot

For each of these commands, you can add the --help parameter to get more information about the tool and how it can be used. For example, entering gcloud --help shows information about the gcloud command itself, and gcloud app --help shows information about how to interact with App Engine via gcloud. Further note, as this is essentially a glorified Linux terminal, hitting q will escape the manual page that pops up whenever you look at the help files.

Text Editing

One thing you will be doing often in this course is writing/updatining files within the Cloud shell. You can use your favorite text editor of choice (vim, nano, emacs, etc.), or you can use the Cloud shell editor. To launch this, you can either click the Open Editor button in the Cloud shell interface, or type cloudshell edit <file>, where you replace <file> with whatever file you’re trying to edit. This action gives you a Notepad-like environment with a file browser on the side. Use this if you’re unfamiliar with CLI-based text editing.

Note: I will be using vim in all my demos. Don’t feel obligated to learn it, just be aware that’s how I prefer to edit files.

Codelabs, QwikLabs, and Colabs

This is a slight aside but I wanted to clarify the difference between these three “lab-ish” technologies we’ll be mainly using. You already have experience with Codelabs already if you did the VM setup assignment, however…

Codelabs are tutorial-style manuals that walk you through a task. These can be developed by anybody (and the tools for doing so have been made open source) and focus on a particular topic. We will be using a lot of these for our assignments. Of note is that these are free to participate in.

Qwiklabs (talked about last time) are paid-for labs that are similar in style to Codelabs, however they are better structured and generally will focus around a theme (called Quests). Qwiklabs can be used to get badges that you can share on social media noting that you have experience with a particular topic. There also tend to be videos associated with Quests as well. Of note is that these are not free, however if you are enrolled in class with me you will receive credits to apply to Qwiklabs as needed.

Colabs are an interesting aside in that they are effectively Jupyter notebooks demonstrating some topic. A Colab will typically have Markdown and executable code interspersed in a single document that you can interact with. These are free to the point that you will be charged cloud credits to use (i.e., the executing code runs against your billing account).

Additional Resources


Where noted, the original content was provided by Google LLC and modified for the purpose of the course, without input or endorsement from Google LLC.