DP-203T00: Data Engineering on Microsoft Azure Quiz Questions and Answers

You are designing a data storage solution for a database that is expected to grow to 50 TB. The usage pattern is singleton inserts, singleton updates, and reporting. Which storage solution should you use?

Answer :
  • Azure SQL Database Hyperscale

Explanation :

A Hyperscale database is an Azure SQL database in the Hyperscale service tier that is backed by the Hyperscale scale-out storage technology. A Hyperscale database supports up to 100 TB of data and provides high throughput and performance, as well as rapid

What is a lambda architecture, and what does it try to solve?

Answer :
  • An architecture that splits incoming data into two paths - a batch path and a streaming path. This architecture helps address the need to provide real-time processing in addition to slower batch computations.

Explanation :

An architecture that splits incoming data into two paths - a batch path and a streaming path. This architecture helps address the need to provide real-time processing in addition to slower batch computations

You need to recommend a storage solution for a sales system that will receive thousands of small files per minute. The files will be in JSON, text, and CSV formats. The files will be processed and transformed before they are loaded into a data warehouse in Azure Synapse Analytics. The files must be stored and secured in folders. Which storage solution should you recommend?

Answer :
  • Azure Data Lake Storage Gen2

Explanation :

Azure provides several solutions for working with CSV and JSON files, depending on your needs. The primary landing place for these files is either Azure Storage or Azure Data Lake Store

You are designing an application. You plan to use Azure SQL Database to support the application. The application will extract data from the Azure SQL Database and create text documents. The text documents will be placed into a cloud-based storage solution. The text storage solution must be accessible from an SMB network share. You need to recommend a data storage solution for text documents. Which Azure data storage type should you recommend?

Answer :
  • Files

Explanation :

Azure Files enables you to set up highly available network file shares that can be accessed by using the standard Server Message Block (SMB) protocol.

What is the difference between a star schema and a snowflake schema?

Answer :
  • All dimensions in a star schema join directly to the fact table (denormalized) while some dimension tables in a snowflake schema are normalized

Explanation :

All dimensions in a star schema join directly to the fact table (denormalized) while some dimension tables in a snowflake schema are normalized

You are designing an Azure Cosmos DB database that will support vertices and edges. Which Cosmos DB API should you include in the design?

Answer :
  • Gremlin

Explanation :

The Azure Cosmos DB Gremlin API can be used to store massive graphs with billions of vertices and edges.