Practice Assessment for Exam DP-900: Microsoft Azure Data Fundamentals

Which two DML statements are used to modify the existing data in a table? Each correct answer presents a complete solution. Select all answers that apply. SELECT UPDATE INSERT MERGE

UPDATE and MERGE modify the existing data in a table. SELECT and INSERT do not affect the existing data in a table.
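
The difference between these statements can be sketched with Python's built-in sqlite3 module. This is only an illustration under stated assumptions: the `products` table and its rows are hypothetical, and SQLite stands in for a full database engine. SQLite has no MERGE statement, so only INSERT, UPDATE, and SELECT are shown.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (id INTEGER PRIMARY KEY, name TEXT, price REAL)")
conn.execute("INSERT INTO products VALUES (1, 'Widget', 9.99)")

# INSERT adds a new row; the existing row (id 1) is left untouched.
conn.execute("INSERT INTO products VALUES (2, 'Gadget', 19.99)")

# UPDATE modifies data that is already in the table.
conn.execute("UPDATE products SET price = 12.50 WHERE id = 1")

# SELECT only reads; it changes nothing.
rows = conn.execute("SELECT id, name, price FROM products ORDER BY id").fetchall()
print(rows)
```

Only the UPDATE changed a value that was already stored; the INSERT left row 1 as it was.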

You have a folder that contains documents, images, and audio files. Which type of data do you have? Select only one answer. structured semi-structured unstructured

Unstructured data contains data such as documents, images, audio data, video data, and binary files.

Which job role is responsible for building data models and finding hidden data patterns? Select only one answer. database administrator data engineer data analyst

A data analyst is responsible for building data models, cleaning and transforming data, and finding hidden data patterns. A data engineer manages data privacy and monitors data stores and data pipelines. A database administrator manages data security, implements backup and recovery plans, and monitors the performance of database solutions.

Which job role is responsible for managing the security of the data in a database, implementing backup and recovery plans, and monitoring the performance of database solutions? Select only one answer. database administrator data engineer data analyst

A database administrator manages data security, implements backup and recovery plans, and monitors the performance of database solutions. A data engineer manages data privacy and monitors data stores and data pipelines. A data analyst is responsible for building data models, cleaning and transforming data, and finding hidden data patterns.

Which two keys are needed to create a one-to-many relationship between two tables in a relational database? Each correct answer presents part of the solution. Select all answers that apply. a primary key a unique key a foreign key an index key

A primary key and a foreign key are needed for a one-to-many relationship. The primary key on the "one" side must be unique, and the foreign key on the "many" side references it. A unique key alone does not define the relationship. Index keys are used to speed up searches.
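
A one-to-many relationship of this kind can be sketched with Python's built-in sqlite3 module. The `customers` and `orders` tables are hypothetical names chosen for illustration, and SQLite is only a stand-in for whichever relational engine is in use.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled

# "One" side: each customer is identified by a primary key.
conn.execute("CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT)")
# "Many" side: each order carries a foreign key that points at one customer.
conn.execute(
    "CREATE TABLE orders ("
    " order_id INTEGER PRIMARY KEY,"
    " customer_id INTEGER NOT NULL REFERENCES customers(customer_id))"
)

conn.execute("INSERT INTO customers VALUES (1, 'Contoso')")
conn.execute("INSERT INTO orders VALUES (100, 1)")
conn.execute("INSERT INTO orders VALUES (101, 1)")

# An order that references a customer that does not exist is rejected.
try:
    conn.execute("INSERT INTO orders VALUES (102, 999)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

order_count = conn.execute(
    "SELECT COUNT(*) FROM orders WHERE customer_id = 1"
).fetchone()[0]
print(order_count, orphan_rejected)
```

One customer row ends up with two related order rows, and the foreign key prevents an order from pointing at a missing customer.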

Which type of data structure should you use to optimize create, read, update, and delete (CRUD) operations for data saved in a multi-column tabular format? Select only one answer. relational database key/value store document database graph database

A relational database is the best option for CRUD operations and uses the least amount of storage space. A key/value store is used for simple lookups based on a single key to obtain a single value. A document database stores semi-structured data such as JSON, and is optimized for retrieval, not CRUD operations. A graph database is used to store hierarchical data, such as organizational charts that have nodes and edges.

You need to recommend a solution that meets the following requirements:
- Encapsulates business logic that can rename the products in a database
- Adds entries to tables
What should you include in the recommendation? Select only one answer. a stored procedure; an inline function; a view; a table-valued function

A stored procedure can encapsulate any type of business logic that can be reused in the application. It can modify existing data as well as add new entries to tables, and it can be run from an application as well as from the server. An inline function, a view, and a table-valued function cannot be used to complete the task because they cannot modify or create objects; they can only be used to query a database.

You have a complex query that selects data from multiple tables. Which three database objects should allow you to reuse the query definition? Each correct answer presents a complete solution. Select all answers that apply. a table a view a function a stored procedure a temporary table

A view, a function, and a stored procedure allow you to reuse the query definition for a complex query that selects data from multiple tables.
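
As a rough sketch of query reuse, the example below stores a multi-table query as a view by using Python's built-in sqlite3 module. The table names, the view name `customer_totals`, and the data are hypothetical. SQLite has no stored procedures or user-defined table functions, so only the view is shown.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
INSERT INTO customers VALUES (1, 'Contoso'), (2, 'Fabrikam');
INSERT INTO orders VALUES (100, 1, 50.0), (101, 1, 25.0), (102, 2, 10.0);

-- The join and aggregation are defined once and stored under a name.
CREATE VIEW customer_totals AS
SELECT c.name AS name, SUM(o.amount) AS total
FROM customers AS c
JOIN orders AS o ON o.customer_id = c.customer_id
GROUP BY c.name;
""")

# Callers query the view like a table, without repeating the join logic.
totals = conn.execute("SELECT name, total FROM customer_totals ORDER BY name").fetchall()
print(totals)
```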

Which type of data workload is optimized for read operations and stores large amounts of aggregated data? Select only one answer. transactional analytical time series graph

Analytical databases are highly denormalized to store large amounts of aggregated data and are optimized for reads. Time series workloads do not store large amounts of aggregated data. Graph workloads store hierarchical data. Transactional workloads are optimized for create, read, update, and delete (CRUD) operations.

Which storage solution allows you to aggregate data stored in JSON files for use in analytical reports without additional development effort? Select only one answer. Azure Cosmos DB Azure Blob storage Azure Data Lake Storage Azure SQL Database

Azure Cosmos DB allows you to aggregate data in analytical reports without additional development. Azure SQL Database does not store data in JSON files. Blob storage and Data Lake Storage do not allow you to aggregate data for analytical reports without additional development effort.

Which service allows you to store data as a graph database? Select only one answer. Azure Cosmos DB Azure Synapse Analytics Azure SQL Managed Instance Azure SQL Database

Azure Cosmos DB allows you to store data as a graph database. Azure Synapse Analytics, SQL Managed Instance, and Azure SQL Database do not.

You need to aggregate and store multiple JSON files that contain records for sales transactions. The solution must minimize the development effort. Which storage solution should you implement? Select only one answer. Azure Cosmos DB Azure Files Azure Blob storage Azure SQL Database

Azure Cosmos DB has a NoSQL API (formerly named the SQL API) that is optimized to store and process (transform) JSON documents. The API allows you to query the documents by using a SQL-like language, so there is no additional learning curve to complete the task. Azure Files is used as storage for any type of file, and it has no built-in methods to query the files and aggregate data. Blob storage allows you to store any type of data, but you must use Azure Synapse Analytics or Azure Databricks to query and aggregate the data. Azure SQL Database is a relational database that keeps data in tables, so you must create a process that queries the JSON files and stores them in a relational format.

Which service is built on Apache Spark and is compatible with other cloud providers? Select only one answer. Azure Databricks Azure Data Factory Azure Synapse Analytics Azure HDInsight

Azure Databricks is built on Apache Spark, is used for processing large amounts of data, and is supported by multiple cloud providers. Data Factory is used to run ETL pipelines. Azure Synapse Analytics is an Azure-native service built on Apache Spark. HDInsight is used to process large amounts of data by using Apache Hadoop.

Which type of Azure Storage is used to store large amounts of files to be shared with virtual machines by using SMB? Select only one answer. Azure Files Azure Data Lake Storage Gen2 Azure Storage page blobs Azure Table storage

Azure Files is used to share files by using NFS and SMB. Data Lake Storage Gen2 is used for storing huge amounts of data to be processed, not to be shared among virtual machines. Page blobs are used for VHDs. Table storage is used for two-dimensional tables.

Which service is managed and serverless, avoids the use of Windows Server licenses, and allows for each workload to have its own instance of the service being used? Select only one answer. Azure SQL Database; Azure SQL Managed Instance; SQL Server on Azure Virtual Machines running Windows Server; SQL Server on Azure Virtual Machines running Linux

Azure SQL Database is a serverless platform as a service (PaaS) SQL instance. SQL Managed Instance is a PaaS service, but databases are maintained in the same SQL Managed Instance cluster. SQL Server on Azure Virtual Machines running Windows or Linux are not serverless options.

What is fully managed and does not require changes to a database for migration? Select only one answer. Azure SQL Database; Azure SQL Managed Instance; SQL Server on Azure Virtual Machines running Windows; SQL Server on Azure Virtual Machines running Linux

Azure SQL Managed Instance is a platform as a service (PaaS) offering that supports most of the same features as SQL Server. Azure SQL Database does not support as many SQL Server features as SQL Managed Instance. SQL Server on Azure Virtual Machines running Windows and Linux are not fully managed.

Which native Azure service can you use to process data by running Apache Spark jobs? Select only one answer. Azure Data Lake Storage Azure Cosmos DB Azure Synapse Analytics

Azure Synapse Analytics is an Azure native service built on Spark. Azure Data Lake Storage is used to store data. Azure Cosmos DB does not use a Spark platform.

Which two visuals in Microsoft Power BI allow you to visually compare numeric values for discrete categories? Each correct answer presents a complete solution. Select all answers that apply. a bar chart a column chart a matrix a card

Bar charts and column charts allow you to compare numeric values for discrete categories. A card is used to track a single number or value. A matrix makes it easier to view data across multiple dimensions.

Which two storage solutions can be mounted in Azure Synapse Analytics and used to process large volumes of data? Each correct answer presents a complete solution. Select all answers that apply. Azure Blob storage Azure Data Lake Storage Azure Files Azure Table storage

Blob storage and Data Lake Storage can be used to store massive amounts of data and can be mounted in Azure Synapse Analytics. Azure Files and Table storage cannot be mounted in Azure Synapse Analytics.

Which type of database can be used for semi-structured data that will be processed by an Apache Spark pool in Azure Synapse Analytics? Select only one answer. column-family graph relational

Column-family databases are used to store semi-structured, tabular data comprising rows and columns. Azure Synapse Analytics Spark pools do not directly support graph or relational databases.

Which service allows you to perform on-demand analysis of large volumes of data from text logs, websites and IoT devices by using a common querying language for all the data sources? Select only one answer. Azure Data Explorer Azure Data Lake Storage Gen2 Azure Stream Analytics Azure Cosmos DB

Data Explorer is used for the analysis of large amounts of data from text logs, websites, and IoT devices, and it uses a common querying language. Data Lake Storage Gen2 is a data source. Azure Stream Analytics is used to define streaming jobs, apply a perpetual query, and write the results to an output. Azure Cosmos DB stores data.

In a stream processing architecture, what can you use to persist the processed results as files? Select only one answer. Azure Event Hubs Azure Data Lake Storage Gen2 Azure Synapse Analytics Azure Databricks

Data Lake Storage Gen2 can be used to store files. Azure Synapse Analytics and Databricks can be used to persist the data in a database for further querying and analysis. Event Hubs is a data ingestion service.

Which type of Azure Storage is used to store large amounts of data that must be processed by services such as Azure Databricks, Azure Synapse Analytics, and Azure HDInsight? Select only one answer. Azure Files; Azure Data Lake Storage Gen2; Azure Storage page blobs; Azure Table storage

Data Lake Storage Gen2 is used for storing huge amounts of data to be processed by services such as Databricks, Azure Synapse Analytics, and HDInsight. Page blobs are used for VHDs. Table storage is used for two-dimensional tables. Azure Files is not optimized for large amounts of data to be processed by services such as HDInsight.

Which three services can be used to ingest data for stream processing? Each correct answer presents a complete solution. Azure Data Lake Storage Azure Event Hubs Azure IoT Hub Azure SQL Database Azure Functions

Data Lake Storage, Event Hubs, and IoT Hub are sources commonly used to ingest data for stream processing. Azure SQL Database and Azure Functions are outputs.

Which type of data store uses star schemas, fact tables, and dimension tables? Select only one answer. relational database data warehouse cube data lake

Data warehouses use fact and dimension tables in a star/snowflake schema. Relational databases do not use fact and dimension tables. Cubes are generated from a data warehouse but are a table themselves. Data lakes store files.
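
A star schema can be sketched in miniature with Python's built-in sqlite3 module. The table names (`fact_sales`, `dim_product`, `dim_date`) and the sample rows are hypothetical, and SQLite only stands in for a real data warehouse engine.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension tables describe the entities by which facts are analyzed.
CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE dim_date (date_key INTEGER PRIMARY KEY, year INTEGER);

-- The fact table holds numeric measures plus a key into each dimension.
CREATE TABLE fact_sales (
    product_key INTEGER REFERENCES dim_product(product_key),
    date_key INTEGER REFERENCES dim_date(date_key),
    amount REAL
);

INSERT INTO dim_product VALUES (1, 'Bikes'), (2, 'Helmets');
INSERT INTO dim_date VALUES (20230101, 2023), (20240101, 2024);
INSERT INTO fact_sales VALUES (1, 20230101, 500.0), (1, 20240101, 700.0), (2, 20240101, 40.0);
""")

# A typical analytical query joins the fact table to its dimensions and aggregates.
query = """
SELECT p.category, d.year, SUM(f.amount)
FROM fact_sales AS f
JOIN dim_product AS p ON p.product_key = f.product_key
JOIN dim_date AS d ON d.date_key = f.date_key
GROUP BY p.category, d.year
ORDER BY p.category, d.year
"""
summary = conn.execute(query).fetchall()
print(summary)
```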

Which job role is responsible for troubleshooting index performance, provisioning access to databases, and backing up databases? Select only one answer. database administrator database engineer database analyst database user

Database administrators maintain existing databases. Database engineers create databases. Database analysts use data from a database to make business decisions. Database users use a database.

Which job role is responsible for creating reports from a database and using OLAP cubes? Select only one answer. database administrator database engineer database analyst database user

Database analysts use data from a database to make business decisions. Database users use a database. Database administrators maintain existing databases. Database engineers create databases.

Which job role is responsible for designing database solutions, creating databases, and developing stored procedures? Select only one answer. database administrator database engineer database analyst database user

Database engineers create databases. Database analysts use data from a database to make business decisions. Database users use a database. Database administrators maintain existing databases.

Which two services allow you to pre-process a large volume of data by using Scala? Each correct answer presents a complete solution. Azure Databricks; a serverless Apache Spark pool in Azure Synapse Analytics; Azure Synapse Analytics serverless SQL pool; Azure Synapse Analytics dedicated SQL pool

Databricks and the Spark pool in Azure Synapse Analytics run data processing for large amounts of data by using Scala.

You are processing data by using batches and streams. What should you use to ensure that transactional consistency is enforced during the processing? Select only one answer. Azure Delta Lake; Azure Blob storage; Azure Table storage; Azure SQL Database

Delta Lake is an open-source storage layer that adds support to Data Lake Storage for transactional consistency.

Which Azure Cosmos DB API allows you to work with vertices and edges? Select only one answer. NoSQL Table Apache Cassandra Apache Gremlin

Gremlin is used for data in a graph. Entities are defined as vertices that form nodes. Nodes are connected by edges that represent relationships.
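
The vertex and edge model can be illustrated with a small plain-Python sketch. This is not Gremlin and does not use any Azure Cosmos DB SDK; the vertex names, edge labels, and the `neighbors` helper are hypothetical and exist only to show how entities and relationships fit together.

```python
from collections import defaultdict

# Vertices are the entities; each one can carry its own properties.
vertices = {
    "alice": {"label": "employee"},
    "bob": {"label": "employee"},
    "sales": {"label": "department"},
}

# Each edge is (source vertex, relationship label, target vertex).
edges = [
    ("alice", "manages", "bob"),
    ("alice", "works_in", "sales"),
    ("bob", "works_in", "sales"),
]

# Index the edges by source vertex so that traversals are simple lookups.
outgoing = defaultdict(list)
for source, label, target in edges:
    outgoing[source].append((label, target))


def neighbors(vertex, label):
    """Return the vertices reached from `vertex` over edges with the given label."""
    return [target for edge_label, target in outgoing[vertex] if edge_label == label]


print(neighbors("alice", "manages"))
print(neighbors("bob", "works_in"))
```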

Which two services can be used as a source for stream processing? Each correct answer presents a complete solution. Azure Event Hubs Azure IoT Hub Azure Databricks Azure SQL Database

IoT Hub and Event Hubs can be used as sources for stream processing.

Which feature of transactional data processing guarantees that concurrent processes cannot see the data in an inconsistent state? Select only one answer. atomicity consistency isolation durability

Isolation in transactional data processing ensures that concurrent transactions cannot interfere with one another and must result in a consistent database state.
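
Isolation can be observed in a small experiment with Python's built-in sqlite3 module. This is a sketch under stated assumptions: the `accounts` table is hypothetical, SQLite's default journal mode is assumed, and other engines implement isolation with different mechanisms and levels.

```python
import os
import sqlite3
import tempfile

# A file-backed database so that two separate connections share the same data.
path = os.path.join(tempfile.mkdtemp(), "bank.db")

writer = sqlite3.connect(path)
writer.execute("CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance INTEGER)")
writer.execute("INSERT INTO accounts VALUES (1, 100)")
writer.commit()

reader = sqlite3.connect(path)

# The writer changes the balance inside a transaction that is not yet committed.
writer.execute("UPDATE accounts SET balance = 40 WHERE id = 1")

# The reader still sees the last committed value, not the in-flight change.
before_commit = reader.execute("SELECT balance FROM accounts WHERE id = 1").fetchall()

writer.commit()

# Once the transaction is committed, the new value becomes visible.
after_commit = reader.execute("SELECT balance FROM accounts WHERE id = 1").fetchall()

print(before_commit, after_commit)
writer.close()
reader.close()
```

The second connection never observes the half-finished update; it sees either the state before the transaction or the state after it.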

Which open-source database has built-in support for temporal data? Select only one answer. MySQL MariaDB PostgreSQL Oracle Database

MariaDB has built-in support for temporal data. It enables applications to query data as the data appeared in previous points in time.

What should you use to define an analytical model for Microsoft Power BI? Select only one answer. Power BI Desktop Power BI Phone App the Power BI service Azure Data Factory

Power BI Desktop is used to define analytical models. The Power BI phone app is used to view Power BI reports. The Power BI service is used to serve data. Data Factory is used to run ETL jobs.

Select the answer that correctly completes the sentence. [Answer choice] is a process to reduce duplicate data in a database and ensure data integrity. Select only one answer. Indexing Normalization Projecting Refactoring

Normalization is a process to reduce data duplication. This can be done by separating entities into their own tables and establishing relationships between the tables.
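
The idea of separating entities into their own tables can be sketched in plain Python. The field names and sample rows below are hypothetical; the point is only that the customer's city is stored once instead of being repeated on every order.

```python
# Denormalized rows: the customer's city is repeated on every order.
orders_flat = [
    {"order_id": 1, "customer": "Contoso", "city": "Seattle", "amount": 50},
    {"order_id": 2, "customer": "Contoso", "city": "Seattle", "amount": 25},
    {"order_id": 3, "customer": "Fabrikam", "city": "Denver", "amount": 10},
]

customers = {}  # customer name -> {"customer_id", "city"}, stored once per customer
orders = []     # each order keeps only a reference to its customer

for row in orders_flat:
    if row["customer"] not in customers:
        customers[row["customer"]] = {
            "customer_id": len(customers) + 1,
            "city": row["city"],
        }
    orders.append({
        "order_id": row["order_id"],
        "customer_id": customers[row["customer"]]["customer_id"],
        "amount": row["amount"],
    })

print(customers)
print(orders)
```

After the split, changing a customer's city means updating one entry in `customers` rather than every matching order row.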

You design an application that needs to store data based on the following requirements:
- Store historical data from multiple data sources
- Load data on a scheduled basis
- Use a denormalized star or snowflake schema
Which type of database should you use? Select only one answer. OLTP; OLAP; Azure Table storage; graph

OLAP databases are used for snowflake schemas with historical data. Table storage is not suited for this type of data. Graph databases are used to store hierarchical data, such as organizational charts that have nodes and edges. OLTP databases are used for create, read, update, and delete (CRUD) operations, use the least amount of storage space, and are not for historical data in a snowflake schema.

Which type of visual in Microsoft Power BI should you use to compare categorized values as the proportions of a total value? Select only one answer. pie chart line chart bar chart scatter plot

Pie charts visually compare different values as a proportion of a total. Line charts are used to examine trends, usually over time. Bar charts are used to compare different values for discrete categories. Scatter plots are used to determine a relationship or correlation between two numeric values.

Which open-source database is a hybrid relational-object database? Select only one answer. MySQL MariaDB PostgreSQL Oracle Database

PostgreSQL is a hybrid relational-object database. You can store data in relational tables, or you can store custom data types with non-relational properties.

What are two characteristics of Azure Table storage? Each correct answer presents a complete solution. Select all answers that apply. Each RowKey value is unique within a table. Each RowKey value is unique within a table partition. Items in the same partitions are stored in a row key order. Tables use indexes to speed up queries.

RowKey is unique within a partition, not within a table. Items in the same partition are stored in row key order. Tables cannot have indexes to speed up queries.
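
The two key rules can be modeled with a toy in-memory class. This is a hypothetical illustration, not the Azure SDK or the service's actual implementation: `ToyTable`, `insert`, and `scan_partition` are names invented here to show that a row key only has to be unique inside its partition and that a partition is read back in row key order.

```python
class ToyTable:
    """Toy in-memory model of a partitioned key/value table (not the Azure SDK)."""

    def __init__(self):
        self._partitions = {}  # partition key -> {row key -> entity}

    def insert(self, partition_key, row_key, entity):
        partition = self._partitions.setdefault(partition_key, {})
        if row_key in partition:
            raise KeyError(
                f"RowKey {row_key!r} already exists in partition {partition_key!r}"
            )
        partition[row_key] = entity

    def scan_partition(self, partition_key):
        """Return (row key, entity) pairs of one partition in row key order."""
        partition = self._partitions.get(partition_key, {})
        return [(row_key, partition[row_key]) for row_key in sorted(partition)]


table = ToyTable()
table.insert("seattle", "002", {"name": "Bob"})
table.insert("seattle", "001", {"name": "Alice"})
table.insert("denver", "001", {"name": "Carol"})  # same RowKey, different partition: allowed

# Reusing a RowKey inside the same partition is rejected.
try:
    table.insert("seattle", "001", {"name": "Duplicate"})
    duplicate_rejected = False
except KeyError:
    duplicate_rejected = True

print(table.scan_partition("seattle"))
print(duplicate_rejected)
```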

Which SQL engine is optimized for IoT scenarios? Select only one answer. SQL Server on Azure Virtual Machines Azure SQL Database Azure SQL Edge Azure SQL Managed Instance

SQL Edge is optimized for IoT scenarios that must work with streaming time series data. SQL Server on Azure Virtual Machines is best used when you want to retain control over the server and database configuration. SQL Managed Instance is ideal for cloud migrations where you need minimal change to existing apps. Azure SQL Database is best used for new cloud solutions.

Which data service allows you to migrate an entire Microsoft SQL server to the cloud without requiring that you manage the infrastructure after the migration? Select only one answer. SQL Server on Azure Virtual Machines Azure SQL Database Azure SQL Managed Instance

SQL Managed Instance allows you to migrate an entire SQL server to the cloud without requiring that you manage the infrastructure after the migration. You must manage all aspects of SQL Server on Azure Virtual Machines. Azure SQL Database supports most, but not all, core database-level capabilities of SQL Server.

Which data service allows you to use every feature of Microsoft SQL Server in the cloud? Select only one answer. Azure SQL Database; Azure SQL Managed Instance; SQL Server on Azure Virtual Machines running Windows; SQL Server on Azure Virtual Machines running Linux

SQL Server on Azure Virtual Machines running Windows is the only option that supports all the SQL Server features in the cloud. Azure SQL Database, SQL Managed Instance, and SQL Server on Azure Virtual Machines running Linux do not support all the SQL Server features.

You need to process many JSON files every minute, while keeping the data from the files accessible by using native queries. Which Azure Cosmos DB API should you use? Select only one answer. NoSQL Table Apache Cassandra Apache Gremlin

NoSQL (formerly named the SQL API) is the native API in Azure Cosmos DB. It manages data in the JSON format and is queried by using a SQL-like language. The Cassandra API uses a column-family storage structure. The Table API is used to work with data in key/value tables. The Gremlin API is used with data in a graph structure.

Which service can you use to perpetually retrieve data from a Kafka queue, process the data, and write the data to Azure Data Lake? Select only one answer. Azure Synapse Analytics Azure Stream Analytics Azure Cosmos DB Azure Data Factory

Stream Analytics can handle stream processing from Kafka to Data Lake. Azure Synapse Analytics does not process streaming data. Azure Cosmos DB does not handle data streaming. Data Factory does not handle streams.

What is a characteristic of stream processing? All the data in a dataset is processed. Latency is measured in seconds or milliseconds. Stream processing is used to perform complex analysis. Stream processing can handle large datasets.

Stream processing is fast. It processes data as it arrives. Stream processing executes simple analytics or simply writes data to a sink. Stream processing handles small chunks of data.
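
The "process data as it arrives" idea can be sketched with Python generators. The event source `sensor_events` and its readings are hypothetical; a real stream would be unbounded and would come from an ingestion service rather than a hard-coded function.

```python
def sensor_events():
    """Hypothetical event source; a real stream would be unbounded."""
    yield {"device": "a", "temperature": 20.0}
    yield {"device": "a", "temperature": 22.0}
    yield {"device": "a", "temperature": 27.0}


def running_average(events):
    """Emit an updated average after every event instead of waiting for the full dataset."""
    total = 0.0
    count = 0
    for event in events:
        total += event["temperature"]
        count += 1
        yield total / count


averages = list(running_average(sensor_events()))
print(averages)
```

Each event produces a result immediately, which is why latency stays low. A batch job would instead wait for the whole dataset before computing a single average.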

Which Azure Blob storage access tier should you use for data that will be used once per year and can have an access time that takes more than an hour? Select only one answer. Hot Archive Cool

The Archive access tier is intended for data that must be retained but not frequently accessed. It can take more than one hour for this data to be available when accessed.

Which Azure Cosmos DB API should you use for data in a column-family storage structure? Select only one answer. MongoDB Table Apache Cassandra Apache Gremlin

The Cassandra API is used for tabular data in a column-family storage. The Gremlin API is used for graph databases. MongoDB API stores data in the Binary JSON (BSON) format. Table is used to retrieve key-value pairs.

Which Azure Cosmos DB API should you use for data in key/value tables? Select only one answer. MongoDB Table Apache Cassandra Apache Gremlin

The Table API is used for key/value pairs. The Cassandra API is used for tabular data in a column-family storage. The Gremlin API is used for graph databases. The MongoDB API stores data in the BSON format.

Which type of database should you use to store sequential data in the fastest way possible? Select only one answer. Azure SQL Database time series Azure Table storage graph

Time series databases are used to store sequential data. Table storage is not suited for time series. Graph databases are used to store hierarchical data, such as organizational charts that have nodes and edges. Azure SQL Database is the best option for create, read, update, and delete (CRUD) operations, uses the least amount of storage space, and is not optimized for time series.

