Info 101 week 5
Normalization
The process of breaking the database down into related tables in order to reduce redundant data
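As a rough illustration of the idea, the sketch below splits a flat list of orders, where the customer's details repeat on every row, into a customers table and an orders table. The field names and sample data are made up for this example.

```python
# Flat, unnormalized rows: the customer's name and city repeat on every order.
flat_orders = [
    {"order_id": 1, "customer": "Ana", "city": "Lima", "item": "pen"},
    {"order_id": 2, "customer": "Ana", "city": "Lima", "item": "ink"},
    {"order_id": 3, "customer": "Ben", "city": "Oslo", "item": "pad"},
]

customers = {}  # customer name -> customer record, stored only once
orders = []     # each order keeps just a customer_id reference

for row in flat_orders:
    if row["customer"] not in customers:
        customers[row["customer"]] = {
            "customer_id": len(customers) + 1,
            "name": row["customer"],
            "city": row["city"],
        }
    orders.append({
        "order_id": row["order_id"],
        "customer_id": customers[row["customer"]]["customer_id"],
        "item": row["item"],
    })

customer_table = list(customers.values())
print(customer_table)
print(orders)
```

After the split, a change to a customer's city only has to be made in one place.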
one to many relationship
exists between a pair of tables when a single record in the first table can be related to many records in the second table, but a single record in the second table can be related to only one record in the first table.
data warehousing
is the process of transforming transient transactional data into long-term storage data.
Structured Query Language (SQL)
is the standard language used to create, modify, maintain, and query relational databases
XML
stands for Extensible Markup Language. This format is used extensively to share data on the World Wide Web.
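A minimal sketch of what XML data can look like and how a program might read it, using Python's standard `xml.etree.ElementTree` module. The `students` document and its tag names are invented for illustration.

```python
import xml.etree.ElementTree as ET

# A small, made-up XML document: tags describe the data they contain.
xml_text = """
<students>
  <student id="1"><name>Ana</name></student>
  <student id="2"><name>Ben</name></student>
</students>
""".strip()

root = ET.fromstring(xml_text)

# Collect the text of each <name> element and the id attribute of each <student>.
names = [s.find("name").text for s in root.findall("student")]
ids = [s.get("id") for s in root.findall("student")]
print(names, ids)
```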
Where statement
specifies filtering criteria for the query
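A small sketch of filtering with WHERE, run against an in-memory SQLite database through Python's standard `sqlite3` module. The `students` table and its contents are assumptions made for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE students (name TEXT, grade INTEGER)")
conn.executemany(
    "INSERT INTO students VALUES (?, ?)",
    [("Ana", 91), ("Ben", 78), ("Cy", 85)],
)

# Only rows that satisfy the WHERE condition are returned.
where_rows = conn.execute(
    "SELECT name FROM students WHERE grade >= 85"
).fetchall()
print(where_rows)
```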
The NoSQL family
stands for "Not SQL" or "Not Only SQL". These data management formats are called NoSQL because you need a different scheme from Structured Query Language to access data stored using these formats. Many of the NoSQL databases are accessible in a RESTful format. REST stands for REpresentational State Transfer. Most commonly, the database connection is made through a URL, and the queries and commands are HTTP calls.
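To make the "queries are HTTP calls" idea concrete, the sketch below only builds two HTTP requests with Python's standard `urllib.request` module; nothing is sent. The host `db.example.com`, the `/students/42` path, and the document body are hypothetical, and real NoSQL products each define their own URL layout.

```python
import json
from urllib.request import Request

base_url = "http://db.example.com"  # hypothetical database address

# Reading a record: an HTTP GET on the record's URL.
read_request = Request(f"{base_url}/students/42", method="GET")

# Writing a record: an HTTP PUT with the data in the request body.
write_request = Request(
    f"{base_url}/students/42",
    data=json.dumps({"name": "Ana"}).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="PUT",
)

print(read_request.get_method(), read_request.full_url)
print(write_request.get_method(), write_request.full_url)
```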
one to one relationship
when a single record in the first table is related to only one record in the second table, and a single record in the second table is related to only one record in the first table.
graph database model
Data are stored in graph format with nodes, edges and properties. Nodes represent entities such as people, businesses, accounts, or any other item to be tracked. They are equivalent to the record, relation, or row in a relational database, or the document in a document database. Edges, also called relationships, are the lines that connect nodes to other nodes; they represent the relationship between them.
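A toy sketch of nodes, edges and properties using plain Python dictionaries and tuples. This is not how a real graph database stores data; the node ids, property names, and relationship labels are made up for illustration.

```python
# Nodes: each entity has an id and a set of properties.
nodes = {
    "ana": {"type": "person", "name": "Ana"},
    "ben": {"type": "person", "name": "Ben"},
    "acme": {"type": "business", "name": "Acme"},
}

# Edges: (source node, target node, relationship label).
edges = [
    ("ana", "ben", "KNOWS"),
    ("ana", "acme", "WORKS_AT"),
    ("ben", "acme", "WORKS_AT"),
]

def neighbors(node_id, relationship):
    """Follow edges with the given label that start at node_id."""
    return [dst for src, dst, rel in edges
            if src == node_id and rel == relationship]

# Who works at Acme? Follow WORKS_AT edges that point to the "acme" node.
acme_staff = [src for src, dst, rel in edges
              if dst == "acme" and rel == "WORKS_AT"]

print(neighbors("ana", "KNOWS"))
print(acme_staff)
```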
Persistent storage
Data is retained in the absence of power
Careers in Database Management
Database administrator, analyst, programmer, web designer, data mining
.slk
is the official extension of the SYLK (Symbolic Link) format, a format created by Microsoft for data exchange among its applications
Availability
If it is more important that some version of the data is available, and not the end of the world if the data are not completely current, a different format may be faster.
Automation
Small customized programs, or macros, that do jobs not specifically provided in the DBMS tools
database management system (DBMS)
contains tools for doing most of what you may need to do with data in your database. Elements: data tables, queries, reports, automation.
parent child relationship (hierarchical database model)
a parent table can be associated with one or more child tables, but a single child table can be associated with only one parent table.
composite primary key
a primary key composed of two or more fields
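A minimal sketch of a composite primary key in SQLite via Python's `sqlite3` module. The `enrollments` table and its columns are assumptions for this example: neither `student_id` nor `course_id` is unique on its own, but the pair is.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE enrollments (
        student_id INTEGER,
        course_id  INTEGER,
        grade      TEXT,
        PRIMARY KEY (student_id, course_id)  -- two fields form the key
    )
""")

conn.execute("INSERT INTO enrollments VALUES (1, 101, 'A')")
# Same student, different course: the pair is still unique, so this is allowed.
conn.execute("INSERT INTO enrollments VALUES (1, 102, 'B')")

# Same student AND same course: the pair repeats, so the insert is rejected.
try:
    conn.execute("INSERT INTO enrollments VALUES (1, 101, 'C')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

print(duplicate_rejected)
```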
Queries
Mechanisms to access subsets of fields and records of the database
Reports
Mechanisms to output the data in a desired format. Can be based on tables or queries.
.xlsx
Microsoft Excel storage data format
analytical databases
are primarily used in online analytical processing (OLAP) scenarios, where there is a need to store and track historical and time-dependent data. An analytical database is a valuable asset when there is a need to track trends, view statistical data over a long period of time, and make tactical or strategic business projections
Databases
are the mechanism for holding items of information that need to be remembered after the program ends - even after the computer is turned off.
Network database model
for the most part, developed as an attempt to address some of the problems of the hierarchical database. The structure was represented in terms of nodes and set structures
Consistency
A system is consistent if an update is applied to all relevant nodes at the same logical time. If consistency is the most critical requirement, then you need to go with a relational database.
Partition Tolerance
means that the database is designed so that the system will still work if communication between two nodes is not available.
Order by statement
specifies sorting order
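A short sketch of sorting with ORDER BY, again using an in-memory SQLite database from Python. The `students` table and its rows are made up for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE students (name TEXT, grade INTEGER)")
conn.executemany(
    "INSERT INTO students VALUES (?, ?)",
    [("Ben", 78), ("Ana", 91), ("Cy", 85)],
)

# DESC sorts from highest to lowest; ASC (the default) sorts lowest to highest.
sorted_rows = conn.execute(
    "SELECT name, grade FROM students ORDER BY grade DESC"
).fetchall()
print(sorted_rows)
```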
.csv
stands for "comma-separated values". The file is a text file which can be opened and viewed with a text editor
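A minimal sketch of reading comma-separated values with Python's standard `csv` module. The text stands in for the contents of a small .csv file; the column names and values are invented.

```python
import csv
import io

# What a tiny .csv file looks like in a text editor: one record per line,
# fields separated by commas, with the field names on the first line.
csv_text = "name,grade\nAna,91\nBen,78\n"

rows = list(csv.DictReader(io.StringIO(csv_text)))
print(rows)
```

Note that every value comes back as text; converting `grade` to a number is left to the program.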
operational databases
used everywhere in companies, organizations, and institutions, are the mechanism for Online Transaction Processing (OLTP) scenarios, that is, situations where there is a need to collect, modify, and maintain data on a daily basis
distributed storage of data
was always important for big companies, but in today's global business environment, it is more important than ever
Hierarchical Database Model
was created by IBM in the 1960s. Data fields in this type of database were structured hierarchically and the database was typically diagrammed as an inverted tree. A single table in the database acted as the "root" of the inverted tree and other tables acted as the branches flowing from the root.
The relational database
was presented in June 1970 by Dr. Edgar F. Codd, an IBM research scientist. Complexity is managed and redundancy is eliminated through the use of key fields. These are fields in the data table that can be used to connect records in multiple tables. The key fields hold unique identifiers for the entities in the records
foreign key
A primary key from one table that is used in another table.
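A sketch of a foreign key in SQLite via Python's `sqlite3` module, assuming a made-up `customers` table and an `orders` table whose `customer_id` column refers back to it. It also happens to show a one to many relationship: one customer, many orders.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite only enforces foreign keys when this setting is switched on.
conn.execute("PRAGMA foreign_keys = ON")

conn.execute(
    "CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT)"
)
conn.execute("""
    CREATE TABLE orders (
        order_id    INTEGER PRIMARY KEY,
        customer_id INTEGER REFERENCES customers(customer_id),  -- foreign key
        item        TEXT
    )
""")

conn.execute("INSERT INTO customers VALUES (1, 'Ana')")
conn.execute("INSERT INTO orders VALUES (10, 1, 'pen')")
conn.execute("INSERT INTO orders VALUES (11, 1, 'ink')")

# The foreign key lets a query connect each order to its customer.
joined = conn.execute("""
    SELECT customers.name, orders.item
    FROM orders
    JOIN customers ON orders.customer_id = customers.customer_id
    ORDER BY orders.order_id
""").fetchall()

# An order that points at a customer who does not exist is rejected.
try:
    conn.execute("INSERT INTO orders VALUES (12, 99, 'pad')")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

print(joined, orphan_rejected)
```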
many to many relationship
when a single record in the first table can be related to many records in the second table and a single record in the second table can be related to many records in the first table. You establish this relationship with a linking table.
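A minimal sketch of a linking table, assuming made-up `students` and `courses` tables. The `enrollments` table holds one row per student-course pair, which is what allows many students per course and many courses per student.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE students (student_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE courses  (course_id  INTEGER PRIMARY KEY, title TEXT);

    -- The linking table: each row connects one student to one course.
    CREATE TABLE enrollments (
        student_id INTEGER REFERENCES students(student_id),
        course_id  INTEGER REFERENCES courses(course_id),
        PRIMARY KEY (student_id, course_id)
    );

    INSERT INTO students VALUES (1, 'Ana'), (2, 'Ben');
    INSERT INTO courses  VALUES (101, 'Databases'), (102, 'Networks');
    INSERT INTO enrollments VALUES (1, 101), (1, 102), (2, 101);
""")

# One student related to many courses.
ana_courses = [title for (title,) in conn.execute("""
    SELECT c.title FROM courses c
    JOIN enrollments e ON e.course_id = c.course_id
    WHERE e.student_id = 1
    ORDER BY c.title
""")]

# One course related to many students.
db_students = [name for (name,) in conn.execute("""
    SELECT s.name FROM students s
    JOIN enrollments e ON e.student_id = s.student_id
    WHERE e.course_id = 101
    ORDER BY s.name
""")]

print(ana_courses, db_students)
```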
data tables
Store data about an entity type. a) Fields: store an individual element of data about an entity. b) Records: store data about one unit of an entity type. c) Keys: elements that relate data tables. d) Relationships: the connection between related data tables.
The Consistency, Availability and Partition Tolerance (CAP) theorem
When working in distributed systems, you can only have two of the three guarantees (the C, the A or the P), so you have to pick what's important
document database model
involves a collection of data storage formats, document-like in nature
primary key
is a field or group of fields that uniquely identifies each record within a table
JSON
stands for "JavaScript Object Notation".
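A small sketch of what JSON text looks like and how a program might read and write it with Python's standard `json` module. The record shown is invented; it pairs keys (such as `name`) with values.

```python
import json

# JSON text: keys in double quotes, each paired with a value.
json_text = '{"name": "Ana", "courses": ["Databases", "Networks"], "active": true}'

# Parse the text into Python data (dict, list, str, bool, ...).
record = json.loads(json_text)

# Turn the Python data back into JSON text.
round_trip = json.dumps(record, sort_keys=True)

print(record["name"], record["courses"])
print(round_trip)
```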