ISBB Chapter 4 - Data and Databases

Ace your homework & exams now with Quizwiz!

MetaData

Data that describes other data. ex: MP3 file, length of song, artist, album, file size, album cover -data dictionary hold to metadata, defining the fields and structure of the database

Personal database-management systems

-Access and Open Office -develop and analyze single-user databases -not meant to be shared across a network or the Internet, but instead installed on a particular device and work with a single user at a time

Data Warehouse

-Extract data from one or more of the organization's databases and load it into the data warehouse (which is itself another database) for storage and analysis.

relational enterprise DBMS

-can be accessed by many people -sometimes installed on a single computer to be access by a group of people at a single location -they are installed over several servers worldwide, meant to be accessed by millions -built and supported by Oracle, Microsoft, and IBM -open-source MySQL is also an enterprise system

3.

-data is standardized -data in a data warehouse usually comes from several different sources, it is possible that the data does not use the same definitions or units -standard format would have to be agreed upon, all data loaded into the data warehouse would have to be converted to use this standard format [Extraction-transformation-load]

2.

-data is time-variant -whenever data is loaded into the data warehouse, it receives a time stamp, allows for comparisons between different time periods

Field

-each table has a set of fields, which define the nature of the data stored in the table

Number

-few different number types that can be selected, depending on how large the largest number will be

record-locking

-for a relational database to work properly, it is important that only one person be able to manipulate a piece of data at a time -not possible w Google and Amazon

Hierarchical database model

-popular in the 1960s and 1970s -connected data together in a hierarchy, allowing for a parent/child relationship between data

What are the benefits of Data Warehouses?

-process of developing a data warehouse forces an org to better understand the data that is currently collecting and equally important, what data is not being collected -provides a centralized view of all data collected across the enterprise and provides a means for determining data that is inconsistent -once all data is identified as consistent, an org can generate one version of the truth -snapshots of data can be taken over time, creates a historical record of data, allows of analyze of trends -provides tools to combine data, which can provide new information and analysis

Data

-raw bits and pieces of info with no context -can be quantitative/qualitative -ex: doc, mp3 file, video file, spreadsheet, webpage, e-book

Top-down

-should start by creating an enterprise-wide data warehouse and then -as specific business needs are identified, create smaller data marts from the data warehouse

Bottom-Up

-starts by creating small data warehouse (data marts) to solve specific business problems -as these data marts are created, they can be combined into larger data warehouse

Business analytics

-the internal company data to improve business processes and practices

Criteria of data warehouse design

-uses non operational data, data warehousing is using a copy of data from the active databases that the company uses in its day-to-day ops, data warehouse must pull data from the existing databases on a regular, scheduled basis

What are the steps?

1. system must be able to take data 2. put the data into context 3. provide tools for aggregation and analysis ^database designed for this

Big Data

A broad term for data sets so large or complex that traditional data processing applications are inadequate -massively large data sets that conventional database tools do not have the processing power to analyze them ex: Walmart

Structured Query Languate (SQL)

A computer language that lets people, database developers and others extract information from the database. -primary way to work with a relational database -almost all applications that work with databases ex: database management systems -use SQL as a way to analyze and manipulate relational data -language that can be used to work with relational dayabase

Information

Adding context to data

Amazon.com database services for enterprise use

Amazon RDS, relational database service' -Amazon DYnamoDB, noSQL enterprise solution

Data Mining

An analytic process designed to explore large amounts of data. -process of analyzing data to find previously unknown trends, patterns, and associations in order to make decisions -accomplished through automated means against extremely large data sets, such as data warehouse ex: An analysis of sales from a large grocery chain might determine that milk is purchased more frequently the day after it rains in cities with a population of less than 50,000. -sometimes begun with a hypothetical result

Database

An organized collection of related information -allows data from several entities (such as students, clubs, memberships, and events) to be all related together into one whole -more intitutive and powerful ways to define the types of data that go into each field, reducing errors allowing easier analysis -can handle several types of data mixed, relationships between types of data are complex

Data, Information & Knowledge

Data are the raw bits and pieces of information with no context. By adding the context - the raw bits and pieces become information. the data is put into context, aggregated and analyzed, it can be used to make decisions for an organization. The consumption of information produces knowledge.

Organizations want to analyze data in a historical sense: How does the data we have today compare with the same set of data this time last month, or last year?

Concept of data warehouse

Field(s)

Database tables contain fields(columns) and records (rows). Example fields are firstname, lastname, studentID, GPA. It is possible to connect all the tables in the database through the field(s) they have in common.

Table

Each table has a set of fields, which define the nature of the data stored in the table. Database tables contain fields(columns) and records (rows).

What are examples of personal database-management systems?

Microsoft Access, Open Office Base -primarily used to develop and analyze single-user databases -not meant to be shared across a network or the Internet, installed on a patricular device and work with a single user at a time

NoSQL Databases

Not Only SQL Database. A type of database that operates using means other than relational tables. NoSQL became popular with the growth of Web 2.0 and the need for faster data retrieval. -arose from the need to solve the problem of large-scale databases spread over several servers or even across the world -can work with data in a looser way, allowing for a more unstructured environment, communicating changes to the data over time to all the servvers that are part of the database

Relational Database

One in which data is organized into one or more related tables. -all tables are related by one or more fields, so that it is possible to connect all the tables in the database through the field(s) they have in common -ex: Microsoft Access, My SQL, and Oracle

Database Management System (DBMS)

Programs that provide user-friendly, interfaces to view and change a database, create queries, and develop reports. -software must access a database in order for the data to be read, changed, added, removed -most packages are designed to work with a specific type of database, but generally compatible with a wide range of databases

Business Intelligence

The set of techniques and tools for the transformation of raw data into meaningful and useful information for business analysis purposes.

Record

Records as the rows of the table. -one instance of a set of fields in a table

Busieness Analytics

Refers to the skills, technologies, practices for continuous iterative exploration and investigation of past business performance to gain insight and drive business planning.

Data Types

Text, Number, Yes/No, Date/Time, Currency, Paragraph Text and Object are amongst the most common data types to be stored. -when defining the fields in a database table we must give each field a data type -tells the database what functions can be performed -proper amount of storage allocated for data

What is the goal of many information systems?

The goal is to transform data into information in order to generate knowledge that can be used for decision making

Knowledge Management

The process of capturing, developing, sharing, and effectively using organizational knowledge. -much of the knowledge is not written down -process of formalizing the capture, indexing, and storing the company's knowledge in order to benefit from the experiences and insights that the company has captured during its existence

Primary Key

The unique identifier for each record in a table. For example, social security number or student ID.

Normalization

To normalize a database means to design it in a way that: reduces duplication of data between tables and gives the table as much flexibility as possible.

Document-centric model

allowed for more unstructured data storage by placing data into "documents" that could then be manipulated

Paragraph Text

allows for text longer than 256 characyers

Google App Engine Datastore

based on NoSQL -developers can use this to develop applications that access data from anywhere in the world

Apache OpenOffice.org Base

can be used to create, modify, and analyze databases in open-database (ODB) format -Microsoft Access DBMS is used to work with databases in its own Microsoft Access Database format -both Access and Base have the ability to read and write to other database formats as well

Data brokers

combine publicly accessible data with information obtained from the government and other sources to create vast warehouses of data about people and companies that they can then sell

Qualitative data

descriptive, can be a number because its descriptive not the result of a measurement or mathematical calculation

Text

non-numeric data, brief gen under 256 characters

Quantitative data

numeric, the result of measurement count

Knowledge

once we have put our data into context, aggregated and analyzed it, we can use it to make decisions for our organization. -can be used to make decisions, set policies, and spark innovation

Business intelligence

process that organizations use to take data they are collecting and analyze it in the hopes of obtaining a competitive advantage -besides using data from their internal data bases, firms often purchase info from data brokers to get a big-picture understanding of their industries

What about applications to create or management a database? What software can you use to create a database, change a database's structure, or simply do analysis?

purpose of a category of software applications called database management systems

Yes/No

special form of the number data type that is (usually) one byte long, with a 0 for "no" or "false" and 1 for "yes" or "true"

Object

storage of data that cannot be entered via keyboard, such as an image/music file


Related study sets

olecranon bursitis , repetitive strain injury ,Epicondylitis, De Quervain’s tenosynovitis i

View Set

Chapter 8 Chemical Equations and Reactions

View Set

Principles on Investigation Revel Chapter 1, 2, 3, 4, 5, 19, 6, 7, 8, 9

View Set