DAMA Practice Exam (150 Q's)
The role of the conceptual data model in the Metadata repository is?
To summarize the key data subject areas for a business area at a high level of abstraction to enable major data concepts to be understood.
25. How does the DMBOK refer to an organization that values data as an asset and manages data through all phases of its lifecycle? a. A data-centric organization b. A data management organization c. A data integration organization d. A data support organization e. A data legislative organization
a. A data-centric organization
12. What is the definition of an entity? a. A thing about which an organization collects information b. A thing that exists separate from other things c. A thing about which an organization processes information d. All of the above e. None of the above
a. A thing about which an organization collects information
7. What is a hash? a. An algorithm that converts text into encoded values b. A public key that is freely available and used to encode data along with a receiver's private key c. A clearinghouse for encrypted data d. A symmetric encryption method e. A meetup for Data Strategists
a. An algorithm that converts text into encoded values Topic: Data Security Explanation: A hash is a form of encryption that uses algorithms to convert data into a mathematical representation. The exact algorithms and the order of their application must be known in order to reverse the encryption process and reveal the original data (ch.7, section 1.3.8.1 Hash, p. 227).
9. Taxonomy refers to... a. Any classification or controlled vocabulary b. Classification of organizational resources c. Arrangement of controlled vocabulary d. Constrained set of organizational vocabulary e. Categorization of controlled phrases
a. Any classification or controlled vocabulary Topic: Document & Content Management Explanation: Taxonomy is an umbrella term referring to any classification or controlled vocabulary. The best-known example of a taxonomy might be the classification system for all living things that was developed by Swedish biologist Carl Linnaeus (ch. 9, section 1.3.2.6 Taxonomies, p. 312).
23. Which of the following is the best definition of a Data Warehouse? a. Any data stores or extracts used to support the delivery of Business Intelligence b. An explanation of how data is linked to physical products in a warehouse c. A data system based on incremental updates from Operational Systems d. A data system in which data is stored in normal form e. Any data store that can be accessed by business users and data analysts
a. Any data stores or extracts used to support the delivery of Business Intelligence Topic: Data Warehousing & Business Intelligence Explanation: A Data Warehouse is a data storage system that provides decision support to aid data practitioners in reporting and analysis (ch. 11, section 1.3.2 Data Warehouse, p. 384).
27. Which international initiative established a capital standard for banking institutions? a. Basel II b. Basel IV c. Basel I d. Basel III e. Basel
a. Basel II Topic: Data Security Explanation: The regulatory compliance requirement Basel II refers to the Principles for Effective Risk Data Aggregation and Risk Reporting. Since 2006, financial institutions doing business in European Union countries are required to report standard information providing liquidity (ch. 7, section 2.1.2 Regulatory Requirements, p. 246).
26. Which components are included in the three-schema approach to database management developed by SPARC? a. Conceptual, external, internal b. Conceptual, relational, dimensional c. Logical, physical, canonical d. Conceptual, local, physical e. Logical, physical, abstract Topic: Data Modeling & Design
a. Conceptual, external, internal
32. Which principles focus on ideas such as fairness, transparency, and reliability? a. Data Ethics principles b. Data Quality principles c. Data Governance principles d. Business Intelligence principle e. Data Management principles Topic: Data Ethics
a. Data Ethics principles
44. What is an alternate key? a. A key in a relational database that is unique for each record and used as the primary identifier for that record b. A sequentially generated unique number is attached with each record c. A candidate key not selected to be the primary key d. A key in a relational database that links records to other tables e. Another term for a surrogate key
c. A candidate key not selected to be the primary key Topic: Data Modeling & Design
14. Depending on the database technology or data modeling tool and whether the entities involved have mutual dependencies, when a relationship is defined between two entities, which of the following may be created implicitly? a. An attribute b. An identifier c. A foreign key d. A data model e. A simple key
c. A foreign key Topic: Data Modeling & Design Explanation: A foreign key is used in physical (and sometimes logical) data modeling to represent the relationship of records in one table to records in another table (ch. 5, section 1.3.3.2.5 Foreign Key, p. 132).
41. A specific type of metadata that focuses on the context, meaning, and relevance of data within an organization is called... a. Administrative Metadata b. Technical Metadata c. Business Metadata d. Operational Metadata e. Descriptive Metadata
c. Business Metadata Topic: Metadata Management
5. Which position should be responsible for leading the Data Governance Council (DGC)? a. DGC Chair should rotate among executive positions b. Chief Data Architect or Chief Data Modeler c. Chief Data Steward or Chief Data Officer d. Chief Information Officer or Chief Technology Officer e. Any executive can chair the DGC
c. Chief Data Steward or Chief Data Officer Topic: Data Governance Explanation: Data Stewards are accountable and responsible for Data Governance activities. They should have a portion of their work time dedicated to these activities, which could include leading the Data Governance Council. Alternatively, the Chief Data Officer could take on this responsibility (ch. 3, section 1.3.5 Types of Data Stewards, p. 76).
3 Within the Data Governance framework, which component incorporates education, training, and raising awareness? a. Roles b. Processes c. Communication d. Data e. Tools
c. Communication Topic: Data Governance Explanation: Communication helps convey the principles of Data Governance, create awareness about the importance of data quality, security, and compliance, and ensure that individuals are trained to follow the established processes (ch. 3, section 2.9 Engage Change Management, p. 85
10. Which position is responsible for the quality and use of the organization's data assets? a. Data Architect b. Data Modeler c. Data Steward d. Chief Information Officer e. Data Scientist
c. Data Steward Topic: Data Governance Explanation: The Data Architect and Data Modeler roles are related to database architecture. The Chief Information Officer is responsible for Information Technology across the enterprise. A Data Scientist uses an organization's data to derive insights. The Data Steward role is responsible for the quality and use of data assets within their specified domain (ch. 3, section 1.3.4 Data Stewardship, p. 75).
1. Which of the following Data Storage systems supplies advanced analytics? a. Operational Data Store b. Document Database c. Data Warehouse d. Transactional Data Store e. Conceptual Data Store Topic: Data Warehouse & Business Intelligence
c. Data Warehouse
45. What is the most common definition of metadata? a. Tracking data b. Vital management information c. Data about data d. Reliable data e. Risk management data
c. Data about data
18. What is the definition of a business rule in the context of Data Governance? a. Outlines the steps to take when a business disruption occurs b. Clarifies an organization's objectives c. Defines constraints on what can and cannot be done in the organization d. Contains an organization's contingency plans e. Clarifies an organization's goal
c. Defines constraints on what can and cannot be done in the organization Topic: Data Quality Explanation: Business rules describe how businesses should operate internally in order to be successful. Data-related business rules characterize data so that it is fit for its purpose within an organization (ch. 13, section 1.3.7 Data Quality Business Rule Types, p. 464).
46. The Physical Data Model does NOT include which of the following? a. Storage allocation details b. Indexes c. Indicator of Data Steward owner d. Table structures e. Primary and foreign keys Topic: Data Modeling & Design
c. Indicator of Data Steward owner
34. Which of the following statements is NOT true of DAMA's Environmental Factors Hexagon? a. It provides a key for reading the DMBOK context diagrams b. It shows the relationship between people, processes, and technology c. It elaborates on the relationship between knowledge areas d. It puts goals and principles at the center e. All of the statements are true
c. It elaborates on the relationship between knowledge areas ch.1
8. Which of the following is most crucial for risk management? a. Data Stewards b. Enterprise Data Model c. Metadata d. Normalization e. Data Warehouse
c. Metadata
27. Which of the following terms refers to multiple processes that are organized and executed in a system? a. Integration b. Development c. Orchestration d. Implementation e. Performance Topic: Data Integration & Interoperability
c. Orchestration
19. A dataset composed of customer credit reports purchased from a third-party vendor would be an example of: a. Transactional Data b. Protected Data c. Reference Data d. Master Data e. Metadata
c. Reference Data Topic: Master & Reference Data Management Explanation: Reference Data is any data used to provide context or relate a dataset to information external to an organization. Sometimes, Reference Data originates outside of the organization. In this case, credit reports from a third-party vendor would represent Reference Data (ch. 10, section 1.3.1 Differences Between Master & Reference Data, p. 352).
39. Which of the following statements regarding the principles of Data Management is NOT accurate? a. Data is not diminished when used b. Organizations should measure costs and benefits associated with data c. The primary goal is to ensure the organization understands stakeholders' requirements for the quality of data d. Metadata is used to manage and use the organization's data e. Data Management involves collaborations between teams Topic: Data Management
c. The primary goal is to ensure the organization understands stakeholders' requirements for the quality of data
30. Why would an organization choose to purchase Reference Data? a. To summarize basic information about their enterprise data b. To set up data compliance and governance processes c. To enhance data quality and to facilitate analysis across the organization d. To define how data will be captured and tracked e. To document transactional data systems
c. To enhance data quality and to facilitate analysis across the organization Topic: Master & Reference Data Management Explanation: An organization would choose to purchase Reference Data and make it available across the organization in order to ensure that its internal data is valid and consistent with external context (ch. 10, section 1.3.2 Reference Data, p. 353).
44. Which of the following is NOT a goal of Data Quality? a. Develop a governed approach to make data fit for purpose based on data consumers' requirements b. Identify and advocate for opportunities to improve the quality of data, through process and system improvements c. Understand and comply with all relevant regulations and policies for privacy, protection, and confidentiality d. Define and implement processes to measure, monitor, and report on data quality levels e. Define standards, requirements, and specifications for data quality controls as part of the data lifecycle Topic: Data Quality
c. Understand and comply with all relevant regulations and policies for privacy, protection, and confidentiality
7. What is the definition of Industry Reference Data? a. Datasets that are created and maintained by industry associations to provide a common standard for codifying important concepts b. A set of data that is used to calibrate, validate, or test a computational model c. A type of data that captures the physical features of a place and has an association with a location relative to Earth d. A type of data that is owned by an organization and is used for internal purposes only e. A set of data that provides information about the content, quality, condition, and other characteristics of a dataset
a. Datasets that are created and maintained by industry associations to provide a common standard for codifying important concepts Explanation: Industry Reference Data is a broad term to describe datasets that are created and maintained by industry associations or government bodies, rather than by individual organizations, in order to provide a common standard for codifying important concepts (ch. 10, section 1.3.2.3 Industry Reference Data, p. 356).
34. Which of the following is one of the responsibilities of a Data Architect? a. Describe the current state of data and information delivery throughout the enterprise b. Create frameworks and accountability diagrams c. Implement roadmaps d. Plan for operational success e. Reduce the risks associated with low quality data Topic: Data Architecture
a. Describe the current state of data and information delivery throughout the enterprise
4. What is the main objective when implementing a Data Warehouse? a. Enable effective business analysis and decision-making b. Determine the monetary value of data assets c. Align data with the priorities of the organization d. Improve a product or service e. Assess market trends in order to develop a better business strategy
a. Enable effective business analysis and decision-making
40. What is the purpose of ISO 8000? a. Enable the exchange of complex information in an application-neutral form b. Create, collect, store, maintain, transfer, process, and present metadata c. Promote timely and cost-effective data integration d. Report on potential data security risk e. Ensure that data can only be used or read using a specific licensed software application
a. Enable the exchange of complex information in an application-neutral form Topic: Data Quality
2. How is Data Governance defined? a. Exercise of authority and control over the management of data assets b. Set of interdependent functions, each with its own goals, activities, and responsibilities c. Data governance assists in representing information consistently and protecting sensitive information d. Evaluation of the current state of critical data management activities in order to plan for improvement e. Planning, implementation, and control activities for lifecycle management of data and information found in any form or medium
a. Exercise of authority and control over the management of data assets
13. Which of the following processes refers to the selection of required data from its source? a. Extract b. Transform c. Targeting d. Load e. Map
a. Extract Topic: Data Integration & Interoperability Explanation: The extract process involves selecting the required data and extracting it from its source. Then, extracted data is staged in a physical data store, on disk, or in memory (ch. 8, section 1.3.1.1 Extract, p. 273).
48. Which of the following is NOT a key? a. Logical key b. Surrogate key c. Primary key d. Alternate key e. Foreign key
a. Logical key Topic: Data Modeling & Design Explanation: An identifier (also called a key) is a set of one or more attributes that uniquely define an instance of an entity. Keys can be defined based on how they are constructed (e.g., simple, compound, composite, surrogate) or their function (e.g., candidate, primary, alternate). A logical key is NOT a type of key (ch. 5, section 1.3.3.3.2.1 Construction-type Keys, p. 133).
18. Which of the following provides a means of standardizing the possible values of a data element? a. A data modeling scheme b. A database c. A dimensional modeling d. A domain e. A dimension table
d. A domain Topic: Data Modeling & Design Explanation: In Data Modeling, a domain refers to the complete set of possible values that a data attribute can be assigned (ch. 5, section 1.3.3.4 Domain, p. 135).
41. What is Data Stewardship? a. The role responsible for creating policies, procedures, and rules that govern data in the organization b. A collection of tools that ensure an organization's privacy policy c. The creation of compelling vision for Data Management across the enterprise d. A position accountable and responsible for data within a specified domain e. A prioritized program of work with scoped boundaries Topic: Data Governance
d. A position accountable and responsible for data within a specified domain
35. What is the definition of a surrogate key? a. A unique alphanumeric sequence is attached to each record in a dataset b. A key that links records in a dataset to other data tables c. A set of data records that are independent of any other data d. A unique identifier attached to each record, which may be used as a primary key e. A document that identifies how data is linked to business rules
d. A unique identifier attached to each record, which may be used as a primary key ch.5
21. Which of the following is a Data Quality principle? a. Prevention b. Governance c. Criticality d. All of the above
d. All of the above
29. Which of the following is a valid description of Data Management? a. Data Management is lifecycle management b. Data Management is cross-functional c. Data Management is the development, execution, and supervision of plans d. All of the above e. None of the above
d. All of the above
20. Which of the following is true of a recursive relationship? a. It is unary b. It is also referred to as self-referencing c. It involves only one entity d. All of the above e. None of the above
d. All of the above Topic: Data Modeling & Design Explanation: A recursive relationship could be used to represent a hierarchy or a network where only one entity is involved (for example, the relationship between a course and prerequisites, which are themselves courses) (ch. 5, section 1.3.3.2.4.1 Unary (Recursive) Relationship, p. 131).
6. Which of the following represents the type of data analysis aimed at understanding organizational activities and opportunities? a. Operational Data Store b. Data Warehouse c. Operational Data Mart d. Business Intelligence e. Dimensional Data Warehouse
d. Business Intelligence Topic: Data Warehouse & Business Intelligence Explanation: Business Intelligence (BI) refers to the type of data analysis aimed at understanding organizational activities and opportunities. The term can also be used to refer to the set of technologies that support this kind of data analysis (ch. 11, section 1.3.2 Data Warehouse, p. 384).
8. Which type of metadata focuses on the content and condition of data and includes details related to Data Governance? a. Technical Metadata b. Operational Metadata c. Structural Metadata d. Business Metadata e. Rights Metadata
d. Business Metadata Topic: Metadata Management Explanation: Business Metadata includes the non-technical names and definitions of concepts, subject areas, entities, and attributes; attribute data types and other attribute properties; range descriptions; calculations; algorithms and business rules; and valid domain values and their definitions (ch. 12, section 1.3.2.1 Business Metadata, p. 422).
15. Which kind of metadata provides information about content and condition of the data? a. Technical metadata b. Operational metadata c. Logical metadata d. Business metadata e. Technical metadata
d. Business metadata Topic: Metadata Management Explanation: Business Metadata includes the non-technical names and definitions of concepts, subject areas, entities, and attributes; attribute data types and other attribute properties; range descriptions; calculations; algorithms and business rules; valid domain values and their definitions (ch. 12, section 1.3.2.1 Business Metadata, p. 422, 423).
43. Which process focuses on enabling an integrated and historical business context on operational data? a. Data Asset Valuation b. Data Modeling c. Issue Management d. Data Warehousing e. Normalization Topic: Data Warehousing & Business Intelligence
d. Data Warehousing
28. Which term refers to the technical team that will ensure data meets organizational needs, define technical architecture, and install and administer technology? a. Metadata support b. Data architecture support c. Data interoperability support d. Database technology support e. Data security support Topic: Data Storage & Operations
d. Database technology support
35. Which of the following is a guiding principle of Data Storage & Operations? a. Decentralize all data physically and virtually into data hubs b. Prioritize local data needs over enterprise data needs c. Lower cost and complexity of managing solutions by developing non-shared models and interfaces d. Identify and act on automation opportunities e. Take a myopic perspective in design to ensure future extensibility Topic: Data Storage & Operations
d. Identify and act on automation opportunities
42. What is the difference between metadata and data? a. There is no difference between metadata and data b. Metadata refers to non-processed information while data is the term for information as it is being processed c. Metadata is data owned by Meta d. Metadata provides meaningful information about data e. Metadata is data that is external to the organization Topic: Metadata Management
d. Metadata provides meaningful information about data
48. Which of the following refers to the boundary between an organization's environments and exterior systems? a. Firewall b. Backdoor c. DMZ d. Perimeter e. Key logger
d. Perimeter
15. When Metadata activities are focused on documents, the information is referred to as... a. Legal Metadata b. Reference Metadata c. Administrative Metadata d. Preservation Metadata e. Descriptive Metadata
d. Preservation Metadata Topic: Metadata Management Explanation: Preservation Metadata represents information about document storage, archival conditions, and rules for conservation (ch. 12, section 1.3.4 Metadata for Unstructured Data, p. 425).
36. What term is used to describe how many instances of one entity are involved in the relationship with instances of another entity? a. Foreign Keys b. Arity of Relationships c. Relationship d. Relationship Cardinality e. Binary Relationship Topic: Data Modeling & Design
d. Relationship Cardinality
19. When data values must be associated in chronological order and with specific time values, how does the DMBOK refer to the type of patterns that are used? a. Chronological patterns b. Sequence patterns c. Organizational patterns d. Time-based patterns e. Time-oriented patterns
d. Time-based patterns Topic: Data Modeling & Design Explanation: Time-based patterns are used when data values must be associated in chronological order and with specific time values (ch. 5, section 1.3.4.5 Time-Based, p. 142).
24. What is the purpose of a data lineage tool? a. To list potential data integration opportunities b. To provide a formal Data Quality assessment of a dataset c. To describe the dataset's current phase in the data lifecycle d. To track historical changes to a dataset e. To collect all the metadata related to a dataset
d. To track historical changes to a dataset Topic: Data Quality Explanation: Data lineage provides information regarding historical tracking and can help explain the state of data at a given point in the data flow (ch. 13, section 1.3.10.2 Data Enhancement, p. 471).
39. What are relationship labels in database technology? a. Verb phrases describing how to take action on the organization's mission statement b. Verb phrases comparing business rules c. Verb phrases linking business rules with technical specifications d. Verb phrases describing business rules in each direction between two entities e. Verb phrases describing relationships between data tables
d. Verb phrases describing business rules in each direction between two entities Topic: Data Modeling & Design
12. What is the correct definition of change data capture? a. A Data Integration approach that updates a Data Warehouse with big changes from Operational Systems b. A Data Warehousing approach to transforming Operational Systems into Data Marts c. A Data Quality initiative that assesses any discrepancies between data stored in a Data Warehouse and data in Operational Systems d. A Metadata document that describes how data is transformed in the Data Warehouse e. A Data Integration approach that updates a Data Warehouse with small changes from Operational Systems
e. A Data Integration approach that updates a Data Warehouse with small changes from Operational Systems Topic: Data Integration & Interoperability Explanation: Change data capture (CDC) refers to the process of detecting that data has changed and ensuring that only the changed information is updated appropriately across the data storage system (ch. 6, section 1.3.10.3 Change Data Capture, p. 190).
40. A Data Governance program should be... a. Sustainable b. Measured c. Multi-layered d. Business-driven e. All of the above Topic: Data Governance
e. All of the above
49. What can an organization NOT do without Metadata? a. Manage its data as an asset b. Manage its data at all c. Understand what data it has, what the data represents, and where it originates d. Understand who has access to its data e. All of the above Topic: Metadata
e. All of the above
17. What causes data redundancy or data rot? a. Poor assimilation of collected data b. Poor data management practices c. Server and human error d. Dataset inaccuracies developed over time e. All of the above
e. All of the above Topic: Data Management Explanation: All of the above would create Data Quality issues contributing to data rot (ch. 1, section 2.5.9 The Data Lifecycle, p. 29).
23. The data modeling technique suited for information that changes over time in both structure and content is... a. Data modeling b. Dimensional modeling c. Scenario modeling d. Metadata modeling e. Anchor modeling
e. Anchor modeling Topic: Data Modeling & Design Explanation: Anchor Modeling provides graphical notation used for conceptual modeling. It is similar to traditional data modeling with extensions for working with temporal data. Therefore, it's well suited to data that may change in structure and content over time (ch. 5, section 1.3.4.5.1 Data Vault, p. 142).
20. Bandwidth can be reduced by filtering an update to include only data that has been changed within a defined timeframe. What is the name for this process? a. Judicious data selection b. Biased sampling methodology c. Waterfall method d. Database-as-a-service (DaaS) e. Change data capture
e. Change data capture Topic: Data Integration & Interoperability Explanation: Change data capture (CDC) is a method of filtering to include only data that has been changed within a defined timeframe. CDC monitors a dataset for changes (such as inserts, changes, deletes) and then passes those changes (referred to as the deltas) to other datasets, applications, and organizations that consume the data (ch. 8, section 1.3.2.2 Change Data Capture, p. 276).
45. Which process involves creating artifacts to describe the existing state of data across an organization? a. Data Governance b. Metadata Management c. Master Data Management d. Data Orchestration e. Data Architecture Topic: Data Architecture
e. Data Architecture
33. Which organization should be responsible for leading Data Governance? a. Data Governance Office b. Project Management Office c. Data Quality Board d. IT Leadership Team e. Data Governance Council
e. Data Governance Council
4. In the Data Management Practices Hierarchy, advanced data practices include the following except: a. Analytics b. Mining c. Big Data d. Warehousing e. Data Quality
e. Data Quality Topic: Data Management Explanation: Data Quality is a basic, not an advanced, data practice. Poor Data Quality leads to ambiguity, which introduces risk to an organization (ch. 1, section 2.5.3 Data Quality, p. 25).
38. Which of the following is a core assumption underlying Data Ethics? a. Data should support Business Intelligence b. Before it can be used, data must be aggregated to protect the privacy of individuals c. The organization must ensure that data is managed behind a firewall d. The organization should think and design globally when it comes to Data Management e. Data has economic value Topic: Data Ethics
e. Data has economic value
28. What is the definition of cardinality? a. Qualitative description of the relationship of elements across datasets b. Measurement specifications for elements in a dataset c. Count of data tables in a system d. Classifies variables within a dataset e. Defines how many instances of one entity are related to instances of another entity
e. Defines how many instances of one entity are related to instances of another entity Topic: Data Modeling & Design Explanation: In a relationship between two entities, cardinality captures how many instances of one entity participate in the relationship with how many of the other entity (ch. 5, section 1.3.3.2.3 Relationship Cardinality, p. 130).
50. What is the name of the legislation that protects educational records in the United States? a. SOX b. BASEL II c. EPA d. GDPR e. FERPA
e. FERPA Topic: Data Security Explanation: In the US, all information regarding a person's education is covered by FERPA, the Family Educational Rights and Privacy Act (ch. 7, section 1.3.12.2.1 Sample Regulatory Families, p. 237).
9. In addition to ongoing updates, what is the other main Data Integration process that takes place in a Data Warehouse? a. Data quality issue remediation b. Business Intelligence activity c. Metadata update d. Reporting strategy e. Historical load
e. Historical load
43. Which is the most important Data Quality dimension? a. Completeness b. Consistency c. Integrity d. Timeliness e. It is impossible to identify the most important Data Quality dimension
e. It is impossible to identify the most important Data Quality dimension Topic: Data Quality
46. Which of the following allows the capture and maintenance of the source structures for each attribute on the data model? a. Data profiling tools b. Data modeling tools c. Industry data models d. Data model repositories e. Lineage tools
e. Lineage tools
6. Which of the following Knowledge Areas provides version control information about an organization's datasets? a. Data Quality b. Master Data c. Reference Data d. Data Modeling e. Metadata
e. Metadata Topic: Metadata Management Explanation: Metadata entails the management of information related to the data lifecycle including maintaining a historical record (i.e., data lineage) that includes version history (ch. 12, section 1.3.2 Types of Metadata, p. 422).
47. A company wants to show the growth of their sales from the past year to the current year. They create two bar charts that show the sales for each year. However, they use a different scale for the current year's sales that makes it look like the sales have grown much more than they actually have. Which unethical data handling risk is presented in this case study? a. Timing b. Bias c. Unclear definitions or invalid comparisons d. Transforming and integrating data e. Misleading visualizations
e. Misleading visualizations
10. Which of the following data masking methods replaces the original data with a substitute value that is a random value or standard pattern similar to the original data? a. Shuffling b. Randomization c. Temporal variance d. Encryption e. Substitution
e. Substitution
5. Which of the following is a directive that codifies principles and management intent into fundamental rules governing the creation, acquisition, integrity, security, quality, and use of data? a. Data asset valuation b. Data audit principle c. Data Management d. Data Governance e. Data policy
e. Data policy Topic: Data Governance Explanation: Data policies support data standards across the organization. Moreover, they describe expected behaviors related to Data Management, for example, conducting root cause remediation of Data Quality issues. Data policies describe the "what" of Data Governance (i.e., what to do and what not to do). By comparison, standards and procedures describe the "how" of Data Governance. For data policies to be most effective, there should be relatively few of them, and they should be brief and direct (ch. 3, section 1.3.6 Data Policies, p. 77).
38. Which of the following represents the number of connections in a point-to-point Data Integration system, where n represents the number of databases to be integrated? a. (n -1)! b. (n (n - 1) ) / 2 c. (n -1) / 2 d. n / 2 e. (n -1) 2
b. (n (n - 1) ) / 2 Topic: Data Integration & Interoperability Explanation: In a data storage system connected using the point-to-point approach, the number of integrations will be the number of unique pairs that can be formed from n items: or "n choose 2" (ch. 8, 1.3.6.1 Point-to-point, p. 280).
26. What are the two perspectives on an organization's data? a. Social and business perspectives b. Business and technical perspectives c. Internal and external perspectives d. Technical and social perspectives e. Social and technological perspectives
b. Business and technical perspectives Topic: Data Governance Explanation: Data Governance must take into account the business and technical implications of Data Strategy (ch. 3, section 2.11 Assess Regulatory Compliance Requirements, p. 87).
11. Which of the following are categories of metadata? a. Public, internal-only, confidential, and restricted b. Business, technical, and operational c. Conceptual data, logical data, physical data d. Content-based, Context-based, User-based e. Content based, logical data, physical data
b. Business, technical, and operational
37. Which of the following is NOT a responsibility of the Data Quality Oversight Board? a. Producing certifications and compliance records b. Conducting feedback interviews with customers c. Establishing communications and feedback mechanisms d. Developing and maintaining Data Quality e. Setting Data Quality improvement priorities
b. Conducting feedback interviews with customers Topic: Data Quality Explanation: The Data Quality Oversight Board is the program team responsible for engaging both business and technical data professionals to ensure data is fit for consumption for a variety of purposes. As part of their work, they are responsible for all of the listed functions except conducting feedback interviews with customers (ch. 13, section 2.7 Develop and Deploy Data Quality Operations, p. 477).
31. Which of the following is at the core of the DAMA wheel? a. Metadata b. Data Governance c. Goals and Principles d. Data quality and Architecture e. Data Management
b. Data Governance (ch. 1, section 3.3 The DAMA-DMBOK Framework, p. 35).
30. The purpose of a business glossary is to... a. Move data from one system to another or between various modules within the same system b. Document an organization's business concepts and terminology, definitions, and the relationships between those terms c. Manage the names, descriptions, structure, characteristics, and other attributes of every data element in a model d. Describe the content of databases, along with sizing information, software versions, deployment status, and many other operational Metadata attributes e. Produce various types of Metadata relevant to the Business Intelligence design Topic: Metadata
b. Document an organization's business concepts and terminology, definitions, and the relationships between those terms
37. What is the purpose of Data Governance? a. Encompass the entire lifecycle of a data asset b. Ensure that data is managed properly, according to policies and best practices c. Establish processes and functions through which data can be enabled for use and also maintained d. Ensure that financial performance of the company is improved e. Ensure an organization gets value out of its data Topic: Data Governance
b. Ensure that data is managed properly, according to policies and best practices
36. Which of the following is the correct definition of data mining? a. The process of collecting data elements to help organizations formally manage and gain better control over data assets b. The process of finding correlated features in a dataset c. The process of manipulating data using a computer d. The process of describing the results of various operations in a data warehouse e. The process of finding anomalies, patterns, and correlations within large datasets to predict outcomes
e. The process of finding anomalies, patterns, and correlations within large datasets to predict outcomes Topic: Big Data Explanation: Data mining is a key activity during the exploration phase as it facilitates rapid identification of studied data elements, identifies new relationships previously unknown, unclear, or unclassified, and provides structure for the classification of studied data elements (ch. 14, section 1.3.10 Data and Text Mining, p. 507).
21. What is the purpose of the Physical Data Model? a. To provide a data-centric perspective of the organization by documenting how business entities relate to one another b. Documents how data are to be discarded from the storage media of computer hardware c. Documents how data are to be stored and accessed in a database d. To define the structure of data elements and to set relationships between them e. To document how data should be stored and accessed
e. To document how data should be stored and accessed Topic: Data Modeling & Design Explanation: The Physical Data Model represents the most granular level of detail in the Data Modeling process. It provides a detailed technical solution that specifies a set of hardware, software, and network tools within which the data will be stored and accessed (ch. 5, section 1.3.5.3 Physical, p.148).
Data Standards used by enterprise must?
Promote consistent results but periodically be reviewed and updated.
What are the stages of the Data Quality Management Cycle and or the Shewhart chart?
Plan, Do, Check, Act
42. Which of the following statements describes Master & Reference Data? a. Master Data provides the context for business activity data in the form of common and abstract concepts while Reference Data is used solely to relate data in a database to information beyond the boundaries of the organization b. Master Data includes code and description tables while Reference Data includes customer identifiers c. Master Data is a set of data that describes and gives information about other data while Reference Data records details about transactions d. Master Data includes mappings and hierarchies while Reference Data includes internal and external objects involved in business transactions e. Master Data is used solely to characterize other data in an organization while Reference Data is an aggregation of enterprise structure data and transaction structure data
a. Master Data provides the context for business activity data in the form of common and abstract concepts while Reference Data is used solely to relate data in a database to information beyond the boundaries of the organization
3. How does metadata help an organization? a. Reduce data-oriented research time b. Support conceptual data modeling c. Highlight the value of information by sharing it with more stakeholders d. Ensure the organization complies with accounting standards e. Indicate how data should be represented through data visualization
a. Reduce data-oriented research time
14. A dataset composed of county-level statistics provided by the national government would be an example of: a. Reference Data b. Master Data c. Historical Data d. Metadata e. Transactional Data
a. Reference Data Topic: Master & Reference Data Management Explanation: Reference Data typically comes from outside the organization. It is used to provide additional context on the organization's data. It can help relate the organization's data to information beyond the boundaries of the organization (ch. 10, section 1.3.1 Difference Between Master & Reference Data, p. 350).
49. What are the Belmont Principles? a. Respect for Persons, Beneficence, Justice b. Human Dignity, Civil Rights, Respect for Privacy c. Justice, Fairness, Equality d. Right to Privacy, Right to Data Access, Right to be Forgotten e. Data Quality, Data Adequacy, Data Management
a. Respect for Persons, Beneficence, Justice Topic: Data Ethics Explanation: The Belmont Principles were developed by the U.S. HSS in 1979 to guide the ethics of medical research, and they are also applicable as guiding principles within the field of Data Management (ch. 2, section 3.1 Ethical Principles for Data, p. 52).
2. Which of the following is NOT included in the 2015 opinion of the European Data Protection Supervisor (EDPS) on digital ethics? a. Right to request removal of personal data b. Accountable controllers who determine personal information processing c. Privacy-conscious engineering and design of data processing products and services d. Future-oriented regulation of data processing and respect for the rights to privacy and to data protection e. Empowered individuals
a. Right to request removal of personal data Topic: Data Ethics Explanation: In 2015, the European Data Protection Supervisor published an opinion on digital ethics highlighting the "engineering, philosophical, legal, and moral implications" of developments in data processing and Big Data. It called for a focus on data processing that upholds human dignity, and set out four pillars required for an information ecosystem that ensures ethical treatment of data. The four pillars are answer choices B through E (ch. 2, section 3.1 Ethical Principles for Data, p. 52).
16. A combined set of attributes that uniquely identify an entity instance is a... a. Super key b. Candidate key c. Primary key d. Business key e. Alternate key
a. Super key Topic: Data Modeling & Design Explanation: A super key is created when a set of attributes is used to uniquely identify an entity instance (ch. 5, section 1.3.3.3.2.2. Function-type Keys, p. 134).
31. What is the most significant business driver for developing organizational capabilities around Big Data and Data Science? a. The desire to find and act on business opportunities that may be discovered through large datasets b. The definition of high quality data c. The reduction of risk related to Data Security issues d. The business imperative to ensure efficiency in development projects e. The determination to act as an agent for change, transformation, and agility Topic: Big Data
a. The desire to find and act on business opportunities that may be discovered through large datasets
11. Which is the most accurate definition of the term data lifecycle? a. The path along which data moves from its point of origin to its point of storage, usage, or disposal b. Management of the risks associated with data c. Data that is used to manage and use data d. A range of perspectives on how to approach Data Management e. A representation of the cross-functional nature of data
a. The path along which data moves from its point of origin to its point of storage, usage, or disposal Topic: Data Storage & Operations Explanation: Data lifecycle management includes the implementation of policies and procedures for the acquisition, migration, retention, expiration, and disposal of data (ch. 6, section 1.3.2 Data Lifecycle Management, p. 173).
25. What is the purpose of the Logical Data Model? a. To define the structure of data elements and the relationships between them b. To compare data elements c. To define data elements d. To document how data should be stored and accessed e. To provide a data-centric perspective of the organization
a. To define the structure of data elements and the relationships between them Topic: Data Modeling & Design Explanation: A Logical Data Model is a detailed representation of data requirements, typically in support of a specific usage context, such as application requirements (ch. 5, section 1.3.5.2 Logical, p. 146).
17. Which of the following statements accurately characterizes the difference between data processing in a traditional Data Warehouse and a Big Data environment? a. In a traditional Data Warehouse, data is extracted, loaded, and then transformed; while in a Big Data environment, data is extracted, transformed, and then loaded b. In a traditional Data Warehouse, data is extracted, transformed, and then loaded; while in a Big Data environment, data is extracted, loaded, and then transformed c. In a traditional Data Warehouse, data is extracted, loaded, and then integrated; while in a Big Data environment, data is extracted, integrated, and then loaded d. In a traditional Data Warehouse, data is extracted, integrated, and then loaded; while in a Big Data environment, data is extracted, loaded, and then integrated e. There is no difference
b. In a traditional Data Warehouse, data is extracted, transformed, and then loaded; while in a Big Data environment, data is extracted, loaded, and then transformed Topic: Big Data Explanation: The biggest difference between Data Warehousing & Business Intelligence and Big Data processing is that in a traditional Data Warehouse, data is integrated as it is brought into the warehouse (extract, transform, load); while in a Big Data environment, data is ingested and loaded before it is integrated (extract, load, transform) (ch. 14, section 1.3.4 Big Data Architecture Components, p. 504).
16. A dataset composed of the X, Y coordinates of company store locations would be an example of: a. Metadata b. Master Data c. Reference Data d. Temporary Data e. Historical Data
b. Master Data Topic: Master & Reference Data Management Explanation: Master Data is the consistent and uniform set of identifiers and extended attributes that describes the core entities of the enterprise (e.g., customers, prospects, citizens, suppliers, sites, vendors, etc.) (ch. 10, section 1.3.3.3 Master Data Management, p. 359).
32. A Business Continuity Plan should include which of the following? a. Description of unplanned disruptions that may occur b. Outline of how a business will continue operating during an unplanned disruption in service c. Complete inventory of business rules d. Explanation to external stakeholders of why performance expectations are not being met e. Communication to customers during an unplanned disruption in service
b. Outline of how a business will continue operating during an unplanned disruption in service Topic: Document & Content Management
33. Ensuring the performance and reliability of the database through performance tuning, monitoring, error reporting, and other activities is a responsibility of... a. Agile Data Management Team b. Production DBAs c. Security Administrators d. IT Group e. Dedicated Information Security Group Topic: Data Storage & Operations
b. Production DBAs
22. Which of the following is the best definition of referential integrity? a. Rules that ensure data completeness b. Rules that ensure data validity c. Rules that ensure data accuracy d. Rules that ensure data is fit for organizational needs e. Rules that ensure data accessibility
b. Rules that ensure data validity Topic: Data Quality Explanation: Referential integrity is necessary to ensure high-quality data at an application or system level through ensuring data validity (i.e., consistency within the defined domain) (ch. 13, section 1.3.8.4 Issues Caused by System Design, p. 468).
22. Applications often compete for access to the same dataset. How can this issue be remedied? a. Conducting data archiving b. Setting up data replication c. Creating a stronger network connection d. Improving Data Security through the Installation of new antivirus software e. Installing a modern firewall
b. Setting up data replication Topic: Data Integration & Interoperability Explanation: Replication solutions usually monitor the log of changes to the dataset, not the data set itself. They minimize the impact of competing requests on operational applications (ch. 8, section 1.3.3 Replication, p. 278).
29. Which of the following is NOT a part of the Strategic Alignment Model? a. Business Strategy b. Stakeholder Management c. IT Strategy d. Information Systems e. Organization and Process
b. Stakeholder Management Topic: Data Management Explanation: Stakeholder Management is not a part of the Strategic Alignment Model. Data Strategy should include business strategy, IT strategy, Information Systems, Organization and Process in order to best use organizational information to achieve competitive advantage (ch. 1, section 3.1 Strategic Alignment Model, p. 34).
47. What are three qualities of an effective Data Governance process? a. Measured, objective, embedded b. Sustainable, embedded, measured c. Optimism, objective, sustainable d. Embedded, systematic, sustainable e. Sustainable, realistic, embedded
b. Sustainable, embedded, measured
13. What is the purpose of the Conceptual Data Model? a. To provide a comprehensive overview of the organization by documenting how different business entities create and share data with one another b. To capture high-level data requirements as a collection of related concepts c. To provide a detailed representation of data requirements, usually in support of a specific usage context d. To offer a detailed technical solution for how data will be implemented within a particular technology e. To document how data is to be stored and accessed in a particular data system
b. To capture high-level data requirements as a collection of related concepts Topic: Data Modeling & Design Explanation: A Conceptual Data Model captures the high-level data requirements as a collection of related concepts. It contains only the basic and critical business entities within a given realm and function, showing a description of each entity and the relationships between entities (ch. 5, section 1.3.5.1 Conceptual, p. 145).