CS 498 Cloud Computing Applications Midterm 1
Which of the following is NOT considered a PaaS? 1 point Microsoft Azure App Service Amazon AWS Lambda Amazon Elastic BeanStalk Google AppEngine
Amazon AWS Lambda - Correct! Lambda is an example of a FaaS
Which of these companies is least involved in providing IaaS? 1 / 1 point Amazon Microsoft Google Apple
Apple
Which of the following are not advantages of VPC 1 point Security Bandwidth Flexibility Data control
Bandwidth
Which is not an advantage of resilient distributed datasets? 1 point Support a wide range of applications Bandwidth is infinite Retain the attractive properties of MapReduce Allow apps to keep working sets in memory for efficient reuse
Bandwidth is infinite
Select all of the solutions provided by Hadoop. 1 point Caching for loop-invariant data Caching for fixpoint evaluation Data parallelism Loop-aware task scheduling
Data parallelism
A commercial off the shelf laptop, disconnected from the internet, has the storage, memory, and computational capacity to process Big Data in a timely manner. 1 / 1 point True False
False - "Correct! Big Data, by definition, cannot be processed in a timely manner by a single standard computer."
Google's cluster architecture uses index servers as frontend load-balancers, and document servers to hold the indexed information. True False
False - Correct! Index servers and document servers are backend workers, and the Google Web server is the frontend.
Which storage technology is the best for the following scenario? "An application that archives 1,000 TB of data for two years for compliance reasons" 1 point Glacier Deep Archive Dropbox AWS S3
Glacier Deep Archive
Which technology is the best suited for the following use case? "Finding the set of words utilized in the Wikipedia website" 1 point HDFS Spark Hadoop YARN
Hadoop
Which approach is more economical for the following scenario? "A long-running business serves 1,000 users daily but 1,000,000 during the holiday season." 1 / 1 point Cloud computing Hybrid approach In-house servers
Hybrid approach - Correct! In-house servers are more economical on most normal days, while cloud computing is needed to absorb the holiday peak.
Which of the following protocols lie in the Application Layer? 1 point IMAP DNS HTTP UDP
IMAP, DNS, HTTP
Which approach is more economical for the following scenario? "An established, mature business serves 10,000 users during business hours (9am-5pm) and 100 users outside of business hours each day." 1 / 1 point In-house servers Cloud computing Hybrid approach
In-house servers - Correct! The servers are regularly and intensively utilized by a large number of users during (long) business hours.
Which of the following statements about private subnets are true? 1 point Outgoing traffic from a private subnet can access the Internet Incoming traffic from the Internet cannot access the private subnets Incoming traffic from the Internet can access the private subnets Outgoing traffic from a private subnet cannot access the Internet
Incoming traffic from the Internet cannot access the private subnets, Outgoing traffic from a private subnet cannot access the Internet
It is likely that a move towards Internet of Things (IoT) in the near future will 1 / 1 point increase the utility of clouds, because the IoT devices will generate a huge amount of data that will be processed by the cloud decrease the utility of clouds, because IoT brings computational power closer to the user, while cloud computing is far from the user have no effect on clouds, since they are totally unrelated technologies
Increase - Correct! IoT devices generate a huge amount of data but lack the needed processing capability.
What is the best model of delivery for the following scenario? "A custom, lightning-fast storage solution for a gigantic amount of data" 1 / 1 point Infrastructure-as-a-Service Platform-as-a-Service Software-as-a-Service
Infrastructure-as-a-Service - Correct! A customized operating system environment is needed in this storage solution.
Which are the benefits of service-oriented architecture (SOA)? 1 point Interaction Reduce costs Scalability Reusable Code
Interaction, Reduce Costs, Scalability, Reusable Code
Which technology will address the following need? "Data representation using a dictionary with key and value" 1 point TXT XML HTML JSON
JSON
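As a small illustration of why JSON fits the "dictionary with key and value" description, the sketch below round-trips a Python dictionary through JSON text using the standard `json` module. The record contents are made up for the example.

```python
import json

# Hypothetical record: a plain dictionary of keys and values.
record = {"course": "CS 498", "topic": "cloud", "week": 1}

# Serialize the dictionary to JSON text, then parse it back.
encoded = json.dumps(record)
decoded = json.loads(encoded)

print(encoded)
print(decoded == record)
```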
Which technology will address the following need? "Human readable representation of data" 1 point RPC REST JSON RMI
JSON
Which of the following about JSON and XML is correct? 1 point JSON has no namespaces while XML does JSON does not have security issues while XML does JSON and XML do not have security issues JSON has namespaces while XML has
JSON has no namespaces while XML does
Apache Spark was created to solve the shortcomings of which technology? 1 point MapReduce Juju Apache Storm YARN
MapReduce
Cloud providers prefer variable jobs that have 1 / 1 point Positive Correlation Negative Correlation
Negative Correlation - Correct! Jobs with negative correlation have a small coefficient of variation, which can lead to higher utilization.
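The effect of correlation on the coefficient of variation (standard deviation divided by mean) can be checked numerically. The sketch below uses made-up demand series for two jobs; the numbers are assumptions chosen only to make the contrast obvious.

```python
from statistics import mean, pstdev

def coefficient_of_variation(demand):
    """Population standard deviation divided by the mean."""
    return pstdev(demand) / mean(demand)

# Hypothetical per-hour demand (in servers) for two jobs.
job_a = [10, 30, 10, 30]
job_b_negative = [30, 10, 30, 10]  # peaks when job_a is idle
job_b_positive = [10, 30, 10, 30]  # peaks together with job_a

# Aggregate demand seen by the provider in each pairing.
combined_negative = [a + b for a, b in zip(job_a, job_b_negative)]
combined_positive = [a + b for a, b in zip(job_a, job_b_positive)]

cv_negative = coefficient_of_variation(combined_negative)
cv_positive = coefficient_of_variation(combined_positive)
print(cv_negative, cv_positive)
```

With the negatively correlated pair the aggregate demand is flat, so provisioned capacity stays fully used; with the positively correlated pair the provider must provision for the joint peak and leaves capacity idle the rest of the time.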
What is the bottleneck of the MapReduce programming model? 1 point Combine phase can't start until Map phase is completely finished Reduce phase can't start until Map phase is completely finished Combine phase can't start until Reduce phase is completely finished Map phase can't start until Reduce phase is completely finished
Reduce phase can't start until Map phase is completely finished
You want to build a word count program. Which of the following pseudo-code is the proper Reduce function for this program? Note that the indenting is not accurate. "Word Count Program: You have a huge text file that consists of many lines. The goal is to count the number of times each distinct word appears in the file." 1 point Reduce(key = line, value = contents): for each word in value: emit intermediate (word, 1) Reduce(key = word, values = uniq_counts): Sum all 1's in values list Emit result (word, sum) Reduce(key, values): for each value in intermediate values: value += 1; emit intermediate(key, values) Reduce(key = line, value = contents): result = 0; for each word in value: result += value; emit(key, result)
Reduce(key = word, values = uniq_counts): Sum all 1's in values list Emit result (word, sum)
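A minimal Python sketch of that Reduce step is shown below. It assumes the shuffle phase has already grouped the emitted 1's by word; the `grouped` dictionary and the function name are hypothetical stand-ins for what a MapReduce runtime would supply.

```python
from typing import Iterable, Tuple

def reduce_word_count(word: str, counts: Iterable[int]) -> Tuple[str, int]:
    """Reduce step: sum all the 1's emitted for a single word."""
    return word, sum(counts)

# Hypothetical grouped input, as the shuffle phase would deliver it.
grouped = {"cloud": [1, 1, 1], "spark": [1]}

results = dict(reduce_word_count(word, counts) for word, counts in grouped.items())
print(results)
```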
Amazon S3 BLOB Storage uses a weak consistency model. 1 point True False
True
Multiplexing demand in a cloud infrastructure leads to higher utilization. 1 / 1 point True False
True
Software as a Service, typically provides load balancing automatically for you. 1 point True False
True
Which is one of the differences between Physical Ethernet Network and VPC network? 1 point Both of them intercept ARP request Physical Ethernet Network does not use ARP while VPC network implements ARP VPC network does not use ARP while Physical Ethernet Network implements ARP
VPC network does not use ARP while Physical Ethernet Network implements ARP
UDP is best suited for which of the following tasks? 1 point Reliably transmitting a medium-sized file over a congested link Voice over IP (VoIP) Encrypted communications Communication between computers on the same rack in a data center
Voice over IP (VoIP)
Which technology will address the following need? "Data representation for (un)marshalling on different machines and programming languages" 1 point XML MBaaS RMI REST
XML
Which of the following are main formats of data representation? 1 point XML JSON HTTP
XML, JSON
Which technology is the best suited for the following use case? "Assigning resources to a highly parallel application" 1 point YARN Spark Hadoop HDFS
YARN
What is YARN? 1 point Yet Another Resource Negotiator Your Adept Resource Negotiator Your Applicable Resource Negotiator Your Adept Reason Negotiator
Yet Another Resource Negotiator
Which of the following is not a transport method? 1 point fetch write read flush
fetch
WebSockets 1 point require the users to "poll" to receive data have longer latency than RESTful approaches have three phases: opening handshake, data transfer, closing handshake are built on top of UDP to minimize latency
have three phases: opening handshake, data transfer, closing handshake
When compared to Amazon EBS PIOPS, Amazon EFS (Elastic File Store) 1 point generally has lower per-operation latency is well suited for NoSQL databases generally has lower throughput is stored across multiple availability zones (AZs) rather than one
is stored across multiple availability zones (AZs) rather than one
Select all of the transformations which create an RDD. 1 point extract join groupBy filter map
join, groupBy, filter, map
Choose all that apply: Which are examples of clustered file systems? 1 point Ceph NFS SMB
All of the above
Select all of which are properties of HDFS: 1 point Throughput scales with attached HDs Massive throughput Synergistic with Hadoop Optimized for reads, sequential writes, and appends
All of the above
Given the IPv4 CIDR block 10.10.1.16/29, which of the following IPv4 addresses is out of range? 1 point 00001010.00001010.00000001.00010010 00001010.00001010.00000001.00011000 00001010.00001010.00000001.00010001 00001010.00001010.00000001.00010110
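00001010.00001010.00000001.00011000 - Correct. A /29 prefix fixes the first 29 bits, so only the last three bits vary, from 000 to 111 (10.10.1.16 to 10.10.1.23). This address is 10.10.1.24, whose 29th bit differs from the prefix, so it falls outside the block.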
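The range check can be reproduced with Python's standard `ipaddress` module. In the sketch below, `from_binary` is a hypothetical helper that converts the dotted-binary notation used in the question into an address object.

```python
import ipaddress

network = ipaddress.ip_network("10.10.1.16/29")

def from_binary(dotted_bits: str) -> ipaddress.IPv4Address:
    """Convert dotted-binary notation (e.g. 00001010.…) into an IPv4 address."""
    octets = [str(int(bits, 2)) for bits in dotted_bits.split(".")]
    return ipaddress.IPv4Address(".".join(octets))

candidates = [
    "00001010.00001010.00000001.00010010",
    "00001010.00001010.00000001.00011000",
    "00001010.00001010.00000001.00010001",
    "00001010.00001010.00000001.00010110",
]

# Map each candidate to whether it lies inside the /29 block.
in_range = {bits: from_binary(bits) in network for bits in candidates}

print(network[0], "to", network[-1])
for bits, inside in in_range.items():
    print(from_binary(bits), "in range" if inside else "out of range")
```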
The primary reason most commonly-used load-balancing algorithms are relatively simple is 1 point Because an exhaustive theoretical analysis of the algorithms has determined that complex load balancing algorithms all have exponential runtimes To save the developer time writing and maintaining code, since load-balancing isn't very important Because load balancers must make quick decisions, empirically more complex algorithms are likely to be slower than simpler algorithms Tradition. There are well-known complex load balancing algorithms that work better than simple algorithms (like round-robin) in almost all situations.
Because load balancers must make quick decisions, empirically more complex algorithms are likely to be slower than simpler algorithms
Which technology will address the following need? "Send requests to a remote object" 1 point CORBA XML SSH Juju
CORBA
Which approach is more economical for the following scenario? "A long-running business needs 10,000 computers for one-time data processing." 1 / 1 point Cloud computing In-house servers Hybrid approach
Cloud computing - Correct! Purchasing 10,000 computers is too expensive for one-time data processing.
Which approach is the most sensible for the following scenario? "A new startup needs to quickly scale their infrastructure to serve 1,000,000 customers, or risk losing market share to their competitors." 1 / 1 point Cloud computing In-house servers Hybrid approach
Cloud computing - Correct. Setting up the infrastructure to serve 1,000,000 customers is time-consuming. Cloud computing should allow the startup to start serving customers much more quickly, beating their competitors to the market.
Which of these frameworks is not built on Spark? 1 point GraphX SparkSQL Mllib D-Streams
D-Streams
Amazon DynamoDB is an example of a 1 point Distributed NOSQL key/value storage service Centralized Big-Data blob storage Cloud-optimized SQL database Function as a Service (FaaS) dynamic container offering
Distributed NOSQL key/value storage service
Which storage technology is the best for the following scenario? "Sync files on a few personal devices" 1 point Swift AWS Glacier AWS S3 Dropbox
Dropbox
Which of the following state the purpose of Apache Spark? 1 point Enhance programmability Add concurrency to the MapReduce model Extend the MapReduce model to better support two common classes of analytic applications Eliminate locking found in the MapReduce model
Enhance programmability, Extend the MapReduce model to better support two common classes of analytic applications
Which is NOT one of the three layers of a file system? 1 point Physical file system Ephemeral file system Logical file system Virtual file system
Ephemeral file system
Apache Spark cannot read from any Hadoop input. 1 point False True
False
Cloud services need to be cheaper (e.g. through economies of scale) to be economical compared to in-house servers. 1 / 1 point True False
False - Correct! Cloud services do not need to be cheaper to be economical, as long as the utility premium is less than the ratio between peak demand and average demand.
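The rule in that explanation can be written as a one-line comparison. The sketch below is only an illustration of the stated inequality (utility premium less than peak demand divided by average demand); the function name and the sample numbers are assumptions.

```python
def cloud_is_economical(utility_premium: float,
                        peak_demand: float,
                        average_demand: float) -> bool:
    """Cloud pays off when the utility premium is below the peak-to-average ratio."""
    return utility_premium < peak_demand / average_demand

# Hypothetical workloads, with cloud capacity costing 2x in-house per unit.
spiky = cloud_is_economical(utility_premium=2.0, peak_demand=100, average_demand=20)
steady = cloud_is_economical(utility_premium=2.0, peak_demand=30, average_demand=20)
print(spiky, steady)
```

The spiky workload has a peak-to-average ratio of 5, so paying a 2x premium only when capacity is needed still beats owning servers sized for the peak. The steady workload has a ratio of 1.5, so the same premium makes in-house servers the cheaper choice.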
Amazon S3 BLOB Storage's consistency model guarantees that once an object is deleted by a process it cannot be read by any other process. 1 point True False
False - Correct! Once an object is deleted, it may take some time before all of the replicated copies of the object are deleted. Other processes may still read those replicated copies before they are all deleted.
Experiments have shown that AWS Instance Stores perform at least an order of magnitude better than AWS Elastic Block Stores in throughput tests in comparable settings. 1 point True False
False - Even though AWS Instance Stores can handle higher throughput because they are located on disks that are physically attached to the host computer, experiments show that their edge over EBS in throughput tests is not orders of magnitude, a testament to the efficiency of NVMe over Fabrics technologies and data center networking designs.
Ceph achieves high performance in part by storing data and its accompanying metadata in the same server for faster access. True False
False - In Ceph, the data and metadata are decoupled. Ceph improves performance by: limiting interaction between clients and servers; leveling metadata load; and offloading decision making to the many data servers.
Internet-level Personal Filesystems (like Dropbox) have strict access pattern limits, ensuring that they make a profit on each customer. True False
False - This market segment does not accept access pattern limits, so cloud providers use statistics to extract average access patterns and set pricing. On average, they make a profit but may lose money on some users with high access patterns.
AWS Lambda is a good technology to use when you have a function that will take several days to run. 1 point True, AWS Lambda is optimized for long-running jobs False, AWS Lambda functions are not allowed to run for more than a few minutes
False, AWS Lambda functions are not allowed to run for more than a few minutes
NIST developed an official standard for REST, because it recognized the importance of interoperability in cloud environments. 1 point True, and the success of the standard directly led to the recent explosive growth of the cloud industry False, NIST is opposed to the goal of interoperability False, there is no "official standard" True, although IBM later developed their own competing standard
False, there is no "official standard"
Which of the following HTTP verbs are used in REST APIs? 1 point GET DELETE APPEND POST
GET, DELETE, POST
Which of these is an example of Software as a Service (SaaS) 1 / 1 point Juju Google AppEngine VMWare vCloud GMail
GMail - Correct! GMail is a specific cloud application.
A social media company wants to use DynamoDB for storing posts of users. Which secondary indexing method should it use? 1 point LSI GSI
GSI - It is tolerable if a user's posts show up on one client and do not show up on another client immediately. GSI follows an eventual consistency model and ensures that the clients' views will eventually be the same. GSI also does not constrain table size and hence can be used to store a large number of posts.
Which of the following RPC frameworks was created by Google to deal with communication in big data deployments? JVM Apache Thrift Google Protocol Buffer
Google Protocol Buffer
Which approach is feasible for the following scenario with minimum effort? "ACME company needs to be able to change the cloud provider frequently." 0 / 1 point Packaged software Infrastructure-as-a-Service Platform-as-a-Service Software-as-a-Service
Infrastructure-as-a-Service - Correct! They need minimal dependency on any one cloud platform to enable frequent switching.
Which approach is feasible for the following scenario with minimum effort? "ACME company needs to deploy a system with a modified OS." 1 / 1 point Packaged software Platform-as-a-Service Infrastructure-as-a-Service Software-as-a-Service
Infrastructure-as-a-Service - Correct! They need to deploy a customized OS.
In a VPC, how is the optimal route for network traffic selected? 1 point Shortest prefix match Any prefix match Longest prefix match
Longest prefix match
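Longest prefix match means that, among all routes whose CIDR block contains the destination, the most specific one (the largest prefix length) wins. The sketch below shows the idea with Python's `ipaddress` module; the route table and target names are hypothetical and do not reflect any real VPC configuration.

```python
import ipaddress
from typing import Dict, Optional

def select_route(route_table: Dict[str, str], destination: str) -> Optional[str]:
    """Return the target of the most specific route containing the destination."""
    dest = ipaddress.ip_address(destination)
    matches = []
    for cidr, target in route_table.items():
        network = ipaddress.ip_network(cidr)
        if dest in network:
            matches.append((network.prefixlen, target))
    if not matches:
        return None
    # The longest prefix is the match with the highest prefix length.
    return max(matches)[1]

# Hypothetical route table: CIDR block -> target.
routes = {
    "0.0.0.0/0": "internet-gateway",
    "10.0.0.0/16": "local",
    "10.0.1.0/24": "peering-connection",
}

print(select_route(routes, "10.0.1.5"))
print(select_route(routes, "10.0.2.5"))
print(select_route(routes, "8.8.8.8"))
```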
Which technology will address the following need? "Provides a way for mobile web applications to link to backend storage" 0 / 1 point JSON MBaaS XML REST
MBaaS - Correct! MBaaS allows common services to be shared among apps without too much customization.
What is the function of VPC peering? 1 point Enable communication between VPCs, whether the two VPCs belong to the same account or to different accounts Enable communication only between VPCs in different accounts Enable communication only between VPCs within the same account
Enable communication between VPCs, whether the two VPCs belong to the same account or to different accounts
You want to build a word count program. Which of the following pseudo-code is the proper Map function for this program? Note that the indenting is not accurate. "Word Count Program: You have a huge text file that consists of many lines. The goal is to count the number of times each distinct word appears in the file." 1 point Map(key = line, values = uniq_counts): Sum all 1's in values list Emit result (word, sum) Map(key, values): for each value in intermediate values: value += 1; emit intermediate(key, values) Map(key = line, value = contents): for each word in value: emit intermediate (word, 1) Map(key = line, value = contents): result = 0; for each word in value: result += value; emit(key, result)
Map(key = line, value = contents): for each word in value: emit intermediate (word, 1)
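A minimal Python sketch of that Map step follows. It assumes words are separated by whitespace and that the key is a line number; both are illustrative choices, and the function name is hypothetical.

```python
from typing import Iterator, Tuple

def map_word_count(line_no: int, contents: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit an intermediate (word, 1) pair for each word in the line."""
    for word in contents.split():
        yield word, 1

pairs = list(map_word_count(0, "the cloud the data"))
print(pairs)
```

Note that the mapper does no counting itself: repeated words simply produce repeated pairs, and the summing is left to the Reduce step.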
You want to build an image smoother program. Which of the following is the proper Map function for this program? Note that the indenting is not accurate. "Image Smoother Program: To smooth an image, use a sliding mask and replace the value of each pixel." 1 point Map(key = x,y value = list of R,G,B) compute average of R,G,B emit intermediate(key, average R,G,B) Map(key = x,y, value = R,G,B) emit intermediate(key, value)
Map(key = x,y, value = R,G,B) emit intermediate(key, value)
Which of these is not considered serverless computing? 1 point Platform as a Service (PaaS) Function as a Service (FaaS) Metal as a Service (MaaS)
Metal as a Service (MaaS) - Correct! MaaS requires its users to run services on bare metal (i.e., configure the server themselves) and hence is not serverless.
Choose all that apply: Which of these are Internet-Level Personal Filesystems? 1 point Redis Microsoft OneDrive Glacier Dropbox Apple iCloud Drive
Microsoft OneDrive, Dropbox, Apple iCloud Drive
Which of the following technologies can help a private subnet access the Internet? 1 point CIDR Internet Gateway NAT
NAT
A company determines that the most economical decision is to use in-house servers. Over time the company's peak demand for computing resources decreases sharply, while its average demand remains steady. Should the company consider switching from in-house servers to a cloud approach? 1 / 1 point Yes No
No - "Correct! Cloud computing only makes economic sense when the Utility Premium is less than the ratio of Peak demand to Average demand. If Peak demand falls while Average demand remains steady, cloud computing becomes less economically attractive than before."
What three phases does WebSocket have? 1 point Opening SSH, Sending message, Closing SSH Opening handshake, Data transfer, Closing Handshake Opening handshake, Sending message, Closing Handshake
Opening handshake, Data transfer, Closing Handshake
Which *aaS is best described by "The unit of compute is a full app"? 1 point PaaS MaaS IaaS FaaS
PaaS - Correct! PaaS enables customers to develop, run, and manage business applications without the need to build and maintain the infrastructure. It enables consumers to create custom applications that run computation with ease.
Which feature of Spark Scheduler avoids extra shuffles? 1 point Cache-aware work reuse and locality Dryad-like DAG Pipelining functions within a stage Partitioning-aware
Partitioning-aware
What is the best model of delivery for the following scenario? "A web hosting solution for PHP web applications" Infrastructure-as-a-Service Platform-as-a-Service Software-as-a-Service
Platform-as-a-Service - Correct! A web hosting solution lets users run different web applications, but they don't need to build their own operating system environment.
Which technology will address the following need? "Create, Update, Read, and Remove objects over the web" 1 point JSON XML REST RMI
REST
You want to build an image smoother program. Which of the following is the proper Reduce function for this program? Note that the indenting is not accurate. "Image Smoother Program: To smooth an image, use a sliding mask and replace the value of each pixel." 1 point Reduce(key = x,y value = list of R,G,B) compute average of R,G,B emit (key, average R,G,B) Reduce(key = x,y, value = R,G,B) emit (key, value)
Reduce(key = x,y value = list of R,G,B) compute average of R,G,B emit (key, average R,G,B)
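A minimal Python sketch of that Reduce step is given below. It assumes the values arriving for each pixel key are the R,G,B tuples covered by the sliding mask; how those values get grouped under the key is not specified in the question, so the sample input is hypothetical.

```python
from typing import List, Tuple

Pixel = Tuple[int, int]
RGB = Tuple[float, float, float]

def reduce_smooth(pixel: Pixel, rgb_values: List[RGB]) -> Tuple[Pixel, RGB]:
    """Reduce step: average each color channel over the values for one pixel."""
    count = len(rgb_values)
    # zip(*rgb_values) regroups the tuples into one sequence per channel.
    red, green, blue = (sum(channel) / count for channel in zip(*rgb_values))
    return pixel, (red, green, blue)

# Hypothetical values gathered for pixel (1, 1).
key, smoothed = reduce_smooth((1, 1), [(0, 0, 0), (30, 60, 90), (60, 120, 180)])
print(key, smoothed)
```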
Which of the following is not an event type of WebSocket? 1 point Open Message Resend Error
Resend
What evolved as the successor to XML-RPC? 1 point JSON REST SOAP HTTP/2 Push
SOAP
Applications are managed for you when using 1 / 1 point Platform as a Service (PaaS) Infrastructure as a Service (IaaS) Metal as a Service (MaaS) Software as a Service (SaaS)
SaaS - Correct! Only SaaS provides application-level management.
Which approach is feasible for the following scenario with minimum effort? "ACME company needs to provide a widely used application for its marketing team." Infrastructure-as-a-Service Packaged software Platform-as-a-Service Software-as-a-Service
Software-as-a-Service - Correct! They only provide a specific application
What is the best model of delivery for the following scenario? "An Electronic Health Record system for clinics and doctors" Software-as-a-Service Infrastructure-as-a-Service Platform-as-a-Service
Software-as-a-Service - An Electronic Health Record system is a concrete cloud application.
Which technology is the best suited for the following use case? "Exploring a new large dataset" 1 point YARN HDFS Hadoop Spark
Spark
Which technology is the best suited for the following use case? "Training a machine learning model on a large dataset with several iterations" 1 point HDFS YARN Spark HADOOP
Spark
Which layer does WebSocket run on top of? 1 point TCP UDP IP
TCP
Which of the following protocols lie in the Transport Layer? 1 point TCP HTTP UDP
TCP, UDP
BLOB stands for "Binary Large OBjects" 1 point True False
True
In the context of cloud computing, multi-tenancy means 1 / 1 point many users may share the same physical computer and database many computers may share the same rack many cloud providers may share the same data center many data centers may share the same electric plant
many users may share the same physical computer and database - Correct! Multi-tenancy refers to multiple users on the same platform or device.
Once you upload your code, AWS Lambda does NOT automatically handle 1 point autoscaling permanently saving state fault tolerance capacity provisioning
permanently saving state
Amazon S3 BLOB Storage aims to provide high availability primarily by 1 point using proprietary, expensive, high quality storage hardware that rarely fails only offering the service to users with predictable, regular workloads to limit congestion relying on subcontractors to provide excess capacity replicating the data to multiple machines
replicating the data to multiple machines
Check all that apply: AWS Instance Store is 1 point well-suited for caching located on disks that are physically separated from the host computer well-suited for temporary logs located on disks that are physically attached to the host computer well-suited for long-term persistent storage
well-suited for temporary logs, located on disks that are physically attached to the host computer, well-suited for caching