Information Systems Midterm 1

Online Analytic Processing (OLAP)

A method to analyze multidimensional data from many different perspectives. It enables users to identify issues and opportunities as well as perform trend analysis. Databases built to support OLAP processing consist of *data cubes* that contain numeric facts called measures, which are categorized by dimensions, such as time and geography.
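As a rough illustration of measures categorized by dimensions, the sketch below rolls up one measure (sales) along two dimensions (year and region). The rows and the `roll_up` helper are hypothetical and are not taken from any OLAP product.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, region, sales).
# Year and region are dimensions; sales is the measure.
facts = [
    (2023, "East", 100),
    (2023, "West", 150),
    (2024, "East", 120),
    (2024, "West", 180),
]

def roll_up(rows, dim_index):
    """Sum the sales measure along one dimension (0 = year, 1 = region)."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[dim_index]] += row[2]
    return dict(totals)

by_year = roll_up(facts, 0)    # view the same data from the time perspective
by_region = roll_up(facts, 1)  # view the same data from the geography perspective
```

The same facts answer two different questions depending on which dimension is used for grouping, which is the idea behind viewing a cube from different perspectives.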

Business Process

A set of coordinated and related activities that takes one or more kinds of input and creates an output of value to the customer of that process

Product lifecycle management (PLM)

An enterprise business strategy that creates a common repository of product information and processes to support the collaborative creation, management, dissemination, and use of product and packaging definition information.

Hybrid cloud

Composed of both private and public clouds integrated through networking. Organizations typically use the public cloud to run applications with less sensitive security requirements and highly fluctuating capacity needs, but run more critical applications, such as those with significant compliance requirements, on the private portion of their hybrid cloud.

Transaction Processing System (TPS) might be used for: A. Reporting for evaluating revenue in an e-commerce sales system B. Input into a separate system C. Developing a predictive model for future student enrollment D. All of the above

D. All of the above

Which of the following is required by the Google Chrome Web Scraper extension before you can actually scrape the data: a. a sitemap b. a starting point URL c. an Excel file to hold the scraped data d. a SelectorLink selector type e. 2 of the above are correct

E (A and B)

Drill down analysis

Involves the interactive examination of high-level summary data in increasing detail to gain insight into certain elements—sort of like slowly peeling off the layers of an onion.

Tables

A collection of records about the things we store information about •Master Data: Customers, Products •Transaction Data: Orders, Cash Receipts, Deliveries

Database

collection of related tables

Row/Record/Instance

group of fields that describe same object/person/event

Transaction Processing System (TPS)

monitors, collects, and stores transaction data in real time

Dashboard

provide up-to-date summaries of important info for all levels of management

Platform as a Service (PaaS)

provides users with a computing platform, typically including operating system, programming language execution environment, database services, and a Web server. The user can create an application or service using tools and/or libraries from the provider. The user also controls software deployment and configuration settings. The PaaS provider provides the networks, servers, storage, and other services required to host the consumer's application. PaaS enables application developers to develop, test, and run their software solutions on a cloud platform without the cost and complexity of buying and managing the underlying hardware and software.

Decision Support System (DSS)

simulate multiple scenarios to select best outcome ("What-If" analysis)
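A minimal sketch of "what-if" analysis, assuming a made-up profit model: each scenario pairs a price with an assumed sales volume, and the best outcome is the scenario with the highest profit. All figures, names, and the `profit` function are hypothetical.

```python
def profit(price, units, unit_cost=6.0, fixed_cost=1000.0):
    """Hypothetical profit model: margin per unit times volume, minus fixed cost."""
    return (price - unit_cost) * units - fixed_cost

# Each scenario is (price, assumed units sold at that price).
scenarios = {
    "low price": (8.0, 900),
    "mid price": (10.0, 600),
    "high price": (12.0, 350),
}

# Evaluate every scenario, then select the best outcome.
results = {name: profit(price, units) for name, (price, units) in scenarios.items()}
best = max(results, key=results.get)
```

Changing an input such as `unit_cost` and re-running the comparison is the "what if" part of the analysis.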

A TPS provides valuable input to:

•Management information systems •Decision support systems •Knowledge management systems

Information System

•Technology that collects, processes, stores, analyzes, and disseminates information for a specific purpose •Example: Expedia.com collects information on flights, analyzes customer requests, and returns info on matching flights so that customer can make a purchase decision •Solve business problems, perform data analysis, and support decision-making

The Internet of Things (IoT) - IDOT and GIS

•The IoT is a network that allows sensors and Web-enabled devices and systems (the things) to communicate with one another. •IoT networks transmit real-time information between systems and other "things." Snow fleets: •GPS units transmit their location in real time. •Sensors show those back at the office whether the plow is up or down. •Machines gauge the amount of salt they dispense. •Sensor data from the fleets can be fed into a digital map so that managers can know more about salt distribution in relation to snow-impacted areas.

The Internet of Things (IoT) - Retail [Optimizing Supply Chain Management]

•While retail stores can already track products without the help of IoT, that tracking information is pretty limited. •With RFID and GPS sensors, you can use IoT to obtain more precise data, like the temperature at which an item is being stored, or how long it spent in transit. •You can use that data to improve the quality of transportation moving forward -- you can also act in real-time if a product is being kept at temperatures too low or too high, avoiding a substantial loss.

The Internet of Things (IoT) - Retail [Personalized Discounts]

•With IoT, you can set up sensors around the store that send loyalty discounts to certain customers when they stand near products with their smartphones, if those customers sign up for a loyalty program in advance. •Additionally, you can use IoT to track items a customer has been looking at online, and send that customer a personalized discount when she's in-store.

Strengths of a Database

•You can query data in a database. •Can sort, filter, find, and do calculations •Can do CRUD •SQL used for relational dbs •You can look up data from a database relatively rapidly. •You can create meaningful reports from data in a database. •Your data has a built-in structure to it. •Information of a given type is always stored only once. •Databases are concurrent; multiple users can use them at the same time without corrupting the data.

computer-aided design (CAD)

(CAD) is the use of software to assist in the creation, analysis, and modification of the design of a component or product. Its use can increase the productivity of the designer, improve the quality of design, and create a database that describes the item.

Web Scraping

- Process of extracting data from websites - You could go website to website and copy and paste data into Excel or Word; clean it; then use it. - You could automate the extraction with software; clean it; then use it.
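The automated route can be sketched with Python's standard `html.parser` module. The HTML snippet, the `price` class name, and the `PriceParser` class are all made up for illustration; a real page would be downloaded first and would need its own extraction rules.

```python
from html.parser import HTMLParser

class PriceParser(HTMLParser):
    """Collect the text of every <span class="price"> element as a float."""

    def __init__(self):
        super().__init__()
        self.in_price = False
        self.prices = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs.
        if tag == "span" and ("class", "price") in attrs:
            self.in_price = True

    def handle_data(self, data):
        if self.in_price:
            # Cleaning step: drop whitespace and the currency symbol.
            self.prices.append(float(data.strip().lstrip("$")))

    def handle_endtag(self, tag):
        if tag == "span":
            self.in_price = False

# Hypothetical page content standing in for a downloaded web page.
page = (
    '<ul><li>Widget <span class="price">$9.99</span></li>'
    '<li>Gadget <span class="price">$24.50</span></li></ul>'
)

parser = PriceParser()
parser.feed(page)
parser.close()
prices = parser.prices
```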

Benefits of BI and Analytics

- detect fraud - improve forecasting - increase sales - optimize operations - reduce costs

Scraping with google chrome extension

- Web Scraper is an extension for the Chrome browser made exclusively for web data scraping - You can set up a plan (sitemap) on how to navigate a website and specify the data to be extracted. - The scraper will traverse the website according to the setup and extract the relevant data. - It lets you export the extracted data to CSV

SQL databases conform to ACID properties

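- atomicity - consistency - isolation - durability These properties guarantee that database transactions are processed reliably and ensure the integrity of data in the database. Atomicity means each transaction is treated as a single unit: either all of its changes are applied or none are. (This is different from storing atomic values, that is, values that have no component parts, which is a table design rule rather than an ACID property.)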
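Atomicity, the first ACID property, can be sketched with Python's built-in `sqlite3` module: a transfer that fails halfway is rolled back as a whole. The `accounts` table, the names, and the `transfer` helper are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"  # balances may never go negative
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 50)])
conn.commit()

def transfer(conn, src, dst, amount):
    """Move money between accounts as one all-or-nothing transaction."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?", (amount, dst)
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?", (amount, src)
            )
        return True
    except sqlite3.IntegrityError:
        return False

# Alice only has 100, so the debit violates the CHECK constraint.
ok = transfer(conn, "alice", "bob", 500)
balances = dict(conn.execute("SELECT name, balance FROM accounts"))
```

Although the credit to bob ran first, it is undone when the debit fails, so the balances are unchanged.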

Web has...

-60 trillion pages and growing -no central control on information quality -no editing standards -no single focus (business, education, sports,...) -no single location -no single language (English, Chinese,...)

Things a Business Intelligence (BI) system can do:

1. Data Extraction 2. Visualization 3. Predictive Modeling

Data Analysis: Extract

1. Understanding the business process •Walk-through process 2. Knowing the data •Use Entity Relationship Diagrams (ERD) 3. Get the data you need for your analysis. Accessing Business Cycle Data: •Obtaining data is the first step in data analytics. •Data is often stored in different systems and in multiple formats. •Canned reports often lack complete data, such as date, time, etc. Be sure to include all the fields you will need to get a complete set of data. •Query Design Tool (QBE) •During the extract step, data that fails to meet expected patterns or values may be rejected from further processing.

A Google Chrome Web Scraper extension sitemap uses which of the following to grab data from a page: a. a SelectorLink selector type b. a CSV selector type c. a CardSelector selector type d. a DetailSelector selector type e. 2 of the above are correct

A

The Google Chrome Web Scraper extension might allow us to: a. move from one page on a website to another page on a website in order to collect related data b. bring in multiple distinct website URLs for a single sitemap so we can scrape multiple websites from one sitemap c. remove data from a website as part of a web scraper operation d. scrape the data from an entire website by providing the sitemap a list of the specific pages we want scraped e. 2 of the above are correct

A

Data mining

A BI analytics tool used to explore large amounts of data for hidden patterns to predict future trends and behaviors for use in decision making. Used appropriately, data mining tools enable organizations to make predictions about what will happen so that managers can be proactive in capitalizing on opportunities and avoiding potential problems. 3 common techniques for data mining: 1. Association analysis 2. Neural computing 3. Case based reasoning

Conversion funnel

A graphical representation that summarizes the steps a consumer takes in making the decision to buy your product and become a customer. It provides a visual representation of the conversion data between each step and enables decision makers to see what steps are causing customers confusion or trouble.
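The conversion data between steps can be computed directly from step counts. The sketch below uses made-up visitor counts and a hypothetical `step_rates` helper; each rate is the share of people from the previous step who reached the next one.

```python
# Hypothetical funnel: (step name, number of visitors who reached it).
funnel = [
    ("Visited site", 10000),
    ("Viewed product", 4000),
    ("Added to cart", 1000),
    ("Purchased", 250),
]

def step_rates(steps):
    """Return (step name, conversion rate from the previous step) pairs."""
    return [
        (name, count / prev_count)
        for (_, prev_count), (name, count) in zip(steps, steps[1:])
    ]

rates = step_rates(funnel)
overall = funnel[-1][1] / funnel[0][1]  # share of visitors who became customers
```

A step with an unusually low rate is a candidate for the kind of confusion or trouble the funnel is meant to expose.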

Enterprise Resource Planning (ERP)

A set of integrated programs that manage a company's vital business operations for an entire organization

Referential Integrity

A set of rules that Access uses to ensure that the data between related tables is valid •any field in a table that is declared a foreign key can contain either a null value, or only values from a parent table's primary key. •In other words, when a foreign key value is used; it must reference a valid, existing primary key in the parent table.
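The same rule can be sketched outside Access with Python's built-in `sqlite3` module (an assumption made so the example is runnable; the `customers` and `orders` tables are hypothetical). A foreign key may be null or match an existing primary key; anything else is rejected.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite only enforces foreign keys when enabled
conn.execute("CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE orders ("
    "order_id INTEGER PRIMARY KEY, "
    "customer_id INTEGER REFERENCES customers(customer_id))"
)

conn.execute("INSERT INTO customers VALUES (1, 'Ada')")
conn.execute("INSERT INTO orders VALUES (10, 1)")     # references an existing customer
conn.execute("INSERT INTO orders VALUES (11, NULL)")  # a null foreign key is allowed

try:
    conn.execute("INSERT INTO orders VALUES (12, 99)")  # no customer 99 exists
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

order_count = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
```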

Private cloud

A single tenant cloud. Organizations that implement a private cloud often do so because they are concerned that their data will not be secure in a public cloud. Private clouds can be divided into two distinct types. Some organizations build their own on-premise private cloud, and others elect to have a service provider build and manage their private cloud (sometimes called a virtual private cloud).

Cross-Industry Standard Process for Data Mining (CRISP-DM)

A six-phase structured approach for the planning and execution of a data mining project. It is a robust and well-proven methodology, and although it was first conceived in 1999, it remains the most widely used methodology for data mining projects.

SQL

A special-purpose programming language for accessing and manipulating data stored in a relational database.

Business Analytics Requirements:

Business or Domain Knowledge: •Understanding of the business across the value chain (marketing, sales, distribution, operations, pricing, products, finance, risk, etc.) •Ability to articulate how information, insights, and analytics can help business leadership answer key questions, and even determine which questions need to be answered. Statistics: •Understanding of statistical techniques (e.g., regression analysis, cluster analysis, and optimization techniques). Programming. Database and Data Sources: •External and internal data sources •Web scraping and data transfer APIs and other protocols •Structured and unstructured data •Extract, transform and load data stores

A(n) ___ can replace many applications with one unified set of programs; an organization can then use this system to manage all its vital business operations. A. Materials resource planning system B. Management information system C. Enterprise resource planning system D. Decision support system E. None of the above

C. Enterprise resource planning system

Autonomic Computing

The ability of IT systems to manage themselves and adapt to changes in the computing environment, business policies, and operating objectives. - The goal of autonomic computing is to create complex systems that run themselves, while keeping the system's complexity invisible to the end user. - Autonomic computing addresses four key functions: 1. self-configuring 2. self-healing 3. self-optimizing 4. self-protecting

Key performance indicators (KPIs)

Consist of a direction, measure, target, and time frame. To enable comparisons over different time periods, it is also important to define the KPIs and to use the same definition from year to year.
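The four parts of a KPI can be captured in a small record so the same definition is reused from period to period. The `KPI` class, its `is_met` method, and the retention example are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class KPI:
    direction: str   # "increase" or "decrease"
    measure: str     # what is being measured
    target: float    # the value to reach
    time_frame: str  # the period the target applies to

    def is_met(self, actual):
        """Compare an actual value to the target, respecting the direction."""
        if self.direction == "increase":
            return actual >= self.target
        return actual <= self.target

# Hypothetical KPI: increase customer retention to 90% during fiscal year 2025.
kpi = KPI("increase", "customer retention rate (%)", 90.0, "fiscal year 2025")
```

Making the record immutable (`frozen=True`) reflects the point about keeping the definition unchanged so results stay comparable over time.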

Key features of customer relationship management (CRM)

•Contact management: The ability to track data on individual customers and sales leads and then access that data from any part of the organization. •Sales management: The ability to organize data about customers and sales leads and then to prioritize the potential sales opportunities and identify appropriate next steps. •Customer support: The ability to support customer service representatives so that they can quickly, thoroughly, and appropriately address customer requests and resolve customer issues while collecting and storing data about those interactions. •Marketing automation: The ability to capture and analyze all customer interactions, generate appropriate responses, and gather data to create and build effective and efficient marketing campaigns. •Analysis: The ability to analyze customer data to identify ways to increase revenue and decrease costs, identify the firm's "best customers," and determine how to retain and find more of them. •Social networking: The ability to create and join sites such as Facebook, where salespeople can make contacts with potential customers. •Access by mobile devices: The ability to access Web-based customer relationship management software by smartphones, tablets, and other mobile devices. •Import contact data: The ability for users to import contact data from various data service providers that can be downloaded for free directly into the CRM application.

Multi-valued fields in relational databases are: A. Allowed in line item tables B. Allowed because they make update queries faster C. Allowed because they make retrieving of data in a query D. Not allowed E. 2 of the above are correct

D. Not allowed

TPS's that support a large enterprise typically store data in a: A. NoSQL database B. Excel sheet C. Access database D. Relational database E. None of the above

D. Relational database

A business intelligence (BI) predictive model that predicts whether a student accepted to the Tippie COB will actually enroll in the Tippie COB might use: A. Structured data only B. Unstructured data only C. Internal unstructured data only D. Structured and Unstructured data E. None of the above are used in a BI model

D. Structured and Unstructured data

Which of the following is an example that would need to use an ETL process: f. moving data from the old system into a newly developed and deployed system a. entering a Sale order in the transaction processing system b. entering a car rental in an ERP system c. merging sales data with advertising data and deploying the merged data into a business intelligence tool d. 2 of the above are correct

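D. 2 of the above are correct (moving data from the old system into the new system, and merging sales data with advertising data for a BI tool)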

Data used in a Business intelligence (BI) analysis could come from which of the following sources. A. An OLTP internal system B. Unstructured data from sources outside the company C. Unstructured data from sources inside the company E. All of the above are correct

E. All of the above

Types of data that can be moved from the source of Access include: a. transaction data b. unstructured data like web ad data c. data stored in a different relational database d. sentiment analysis customer scores stored in your ERP system e. All of the above

E. All of the above

Data Storage

Involves updating one or more databases with new transactions •After being updated, this data can be further processed and manipulated by other systems or modules •Analytics and Reports

Attributes/Fields

Pieces of information about an entity

Primary Key

A unique attribute used to identify a single instance of an entity (i.e., a single record in a table) •Often a table has many instances (records or rows): thousands, millions, or more •For example, an online site may have served millions of customers. The data captured for the customers may be stored in a database table called "Customers" •Therefore, there is a need to uniquely identify each instance, which becomes a record in a table •This identification is accomplished via a Primary Key •It is a Unique Identifier for each record

Product lifecycle management (PLM) software

Provides a means for managing the data and processes associated with the various phases of the product life cycle, including sales and marketing, research and development, concept development, product design, prototyping and testing, manufacturing process design, production and assembly, delivery and product installation, service and support, and product retirement and replacement.

NoSQL database

Provides a means to store and retrieve data that is modeled using some means other than the simple two-dimensional tabular relations used in relational databases. Used to deal with the variety of data found in big data and web applications. A major advantage of NoSQL databases is the ability to spread data over multiple servers so that each server contains only a subset of the total data (horizontal scaling). This enables hundreds or even thousands of servers to operate on the data providing faster response times for queries and updates.
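Horizontal scaling can be sketched by hashing each key to pick a server, so every server holds only a subset of the data. Hash partitioning is one common approach and is an assumption here, not something the definition prescribes; the in-memory dictionaries stand in for real servers, and `shard_for`, `put`, and `get` are hypothetical helpers.

```python
import hashlib

def shard_for(key, num_servers):
    """Map a key to a server index using a stable hash of the key."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_servers

# Three in-memory dictionaries stand in for three database servers.
servers = [dict() for _ in range(3)]

def put(key, value):
    servers[shard_for(key, len(servers))][key] = value

def get(key):
    # Only the one server that owns the key is consulted.
    return servers[shard_for(key, len(servers))].get(key)

for customer_id in ["c-1001", "c-1002", "c-1003", "c-1004"]:
    put(customer_id, {"id": customer_id})
```

Because a lookup touches a single server, adding servers spreads both the data and the query load.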

Accessibility and volume of information do not equal

Quality

Data cleansing

The process of detecting and then correcting or deleting incomplete, incorrect, inaccurate, or irrelevant records that reside in a database. The goal of data cleansing is to improve the quality of the data used in decision making.
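A minimal sketch of detecting and then correcting or deleting bad records, using made-up customer rows. The validation rules (non-empty email, age between 0 and 120, unique id) and the `cleanse` helper are assumptions chosen for illustration.

```python
# Hypothetical raw records, as they might arrive from a source system.
raw = [
    {"id": "1", "email": " Ada@Example.com ", "age": "36"},  # fixable formatting
    {"id": "2", "email": "", "age": "41"},                   # incomplete
    {"id": "3", "email": "ben@example.com", "age": "-5"},    # incorrect
    {"id": "1", "email": "ada@example.com", "age": "36"},    # duplicate id
]

def cleanse(records):
    """Correct fixable values and drop records that fail validation."""
    seen_ids = set()
    clean = []
    for record in records:
        email = record["email"].strip().lower()  # correct: normalize formatting
        age = int(record["age"])
        if not email or not (0 <= age <= 120) or record["id"] in seen_ids:
            continue                             # delete: incomplete, incorrect, or duplicate
        seen_ids.add(record["id"])
        clean.append({"id": int(record["id"]), "email": email, "age": age})
    return clean

cleaned = cleanse(raw)
```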

Business Analytics

The use of data and quantitative analysis to support fact-based decision making within organizations Can be used to: •Gain a better understanding of current business performance •Reveal new business patterns and relationships •Explain why certain results occurred •Optimize current operations •Forecast future business results

ERP vendors: Tier 1, 2, 3

Tier 1: Target large multinational firms with multiple geographical locations and annual revenues in excess of $1 billion. Tier 2: Target medium sized firms with annual revenues of $50 million to $1 billion operating out of one or more locations. Tier 3: Target smaller firms with annual revenues in the $10 million to $50 million and typically operate out of a single location.

Information System (IS)

an organized combination of hardware, software, infrastructure, data and people that **is used to accomplish a specified organizational or personal objective** •Computerized IS builds on top of IT infrastructure

Business Intelligence Systems (BI)

analyze info for complex, non-routine decisions

Information Technology (IT)

the hardware, software and media used to store, organize, retrieve and communicate information

Enterprise Resource Planning (ERP)

the integrated management of main business processes, often in real time and mediated by software and technology. Leading providers are SAP and Oracle

Unstructured data

Unstructured data is ultimately related back to the company's structured data records. Blend this data with systems of record transactional data so that employees have more complete information at their fingertips.

The Hierarchy of a Relational Database

•A (database) is a collection of related tables •A (data table) is a collection of related records •A (record) is a group of related fields •A (primary key) is a type of attribute that uniquely identifies each record •An (attribute/field) is a piece of information about an entity

Relational Database

•A Relational Database stores data in connected tables. •Connections (Joins) are accomplished via common attribute fields called Join Fields (often a Foreign Key) •In order to accomplish queries across multiple tables there must be Join Fields
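A query across two tables can be sketched with Python's built-in `sqlite3` module. The `customers` and `orders` tables and their rows are hypothetical; `customer_id` is the join field (a foreign key in `orders`).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL);
INSERT INTO customers VALUES (1, 'Ada'), (2, 'Ben');
INSERT INTO orders VALUES (10, 1, 25.0), (11, 1, 40.0), (12, 2, 15.0);
""")

# Join the tables on the shared customer_id field, then summarize per customer.
rows = conn.execute("""
    SELECT c.name, COUNT(o.order_id), SUM(o.total)
    FROM customers AS c
    JOIN orders AS o ON o.customer_id = c.customer_id
    GROUP BY c.customer_id
    ORDER BY c.name
""").fetchall()
```

Without the shared `customer_id` field there would be no way to tell which orders belong to which customer.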

Relationships

•A Relationship is a natural business association that exists between one or more tables •Example: A customer can submit many orders

Internet of Things (IoT)

•A network of physical objects (things) embedded with sensors, processors, software, and network connectivity capability to enable them to exchange data with the manufacturer of the device, device operators, other connected devices, and cloud-based software. •RFID inventory tracking chips, traditional in-store infrared foot-traffic counters, cellular and Wi-Fi tracking systems, digital signage, a kiosk, or a customer's mobile device •Sensor: a device that is capable of sensing something about its surroundings such as •Pressure, temperature, humidity, pH level, motion, vibration, or level of light

Using Self-Service analytics, users can...

•Analyze trends •Uncover opportunities and issues •Accelerate decision making by rapidly creating reports, charts, dashboards, and documents •Detect fraud •Improve forecasting •Increase sales •Optimize operations •Reduce costs

Examples of using sensors and the IoT to monitor and control key operational activities:

•Asset monitoring •Construction •Agriculture •Manufacturing •Monitoring parking spaces •Predictive Maintenance •Retailing •Traffic monitoring •UI Laundry View •Bongo App

Tips for avoiding a failed implementation for Enterprise Systems

•Assign a full-time executive to manage the project •Appoint an experienced, independent resource to provide project oversight and to verify and validate system performance •Allow sufficient time to transition from the old way of doing things to the new system and new processes •Allocate sufficient time and money for training people •Define metrics to assess project progress and to identify project-related risks •Keep the scope of the project well defined and contained to essential business processes •Be wary of modifying the enterprise system software to conform to your firm's business practices

Transaction Processing System (TPS)

•Capture and process detailed data necessary to update the organization's records about fundamental business operations •Include order entry, inventory control, payroll, accounts payable, accounts receivable, general ledger, etc. A TPS provides valuable input to: •Data Analytic efforts

Organizations expect their TPSs to:

•Capture, process, and update databases •Ensure that the data is processed accurately and completely •Avoid processing fraudulent transactions •Produce timely user responses and reports •Reduce clerical and other labor requirements •Help improve customer service •Achieve competitive advantage

What is the Cloud?

•Cloud computing is the on-demand delivery of compute power, database storage, applications, and other IT resources through a cloud services platform via the internet with pay-as-you-go pricing. •A Cloud services platform such as Amazon Web Services owns and maintains the network-connected hardware required for these application services, while you provision and use what you need via a web application. •With cloud computing, you don't need to make large upfront investments in hardware and spend a lot of time on managing that hardware. •Instead, you can provision exactly the right type and size of computing resources you need to power your newest bright idea or operate your IT department. You can access as many resources as you need, almost instantly, and only pay for what you use.

Key features of a CRM system include:

•Contact management •Sales management •Customer support •Marketing automation •Analysis •Social networking •Access by smartphones •Import contact data

Data mart/ Data lake

•Data mart: a subset of a data warehouse that is used by small- and medium-sized businesses and departments within large companies to support decision making •A specific area in the data mart might contain greater detailed data than the data warehouse •Data lake: takes a "store everything" approach to big data, saving all the data in its raw and unaltered form •Also called an enterprise data hub •Raw data is available when users decide just how they want to use the data

Online transaction processing (OLTP)

•Data processing in which each transaction is processed immediately •At any time, the data in an online system reflects the current status

Data Analysis: Transform

•Data wrangling, sometimes referred to as data munging, is the process of transforming and mapping data from one "raw" data form into another format with the intent of making it more appropriate and valuable for a variety of downstream purposes such as analytics. •The process of detecting and then correcting or deleting incomplete, incorrect, inaccurate, irrelevant records that reside in a database •Extracting the data in a raw form from the data source, •"munging" the raw data using algorithms (e.g. sorting) or parsing the data into predefined data structures. •Depositing the resulting content into storage for future use or using it yourself A series of rules or algorithms are applied to the extracted data to derive the data that will be stored in the data warehouse.

TPS data (often) stored in a Relational DB

•Database software, often called a database management system (DBMS) •Oracle •SQL Server •Access - desktop We can: •Create (create and add) •Retrieve (read) •Update (modify) •Delete (remove)
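The four CRUD operations map onto four SQL statements. The sketch below uses Python's built-in `sqlite3` module as a stand-in DBMS (an assumption; the card lists Oracle, SQL Server, and Access), with a hypothetical `products` table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (product_id INTEGER PRIMARY KEY, name TEXT, price REAL)")

# Create: add a new record.
conn.execute("INSERT INTO products VALUES (1, 'Widget', 9.99)")

# Retrieve: read the record back.
row = conn.execute("SELECT name, price FROM products WHERE product_id = 1").fetchone()

# Update: modify the existing record.
conn.execute("UPDATE products SET price = 12.5 WHERE product_id = 1")
updated = conn.execute("SELECT price FROM products WHERE product_id = 1").fetchone()[0]

# Delete: remove the record.
conn.execute("DELETE FROM products WHERE product_id = 1")
remaining = conn.execute("SELECT COUNT(*) FROM products").fetchone()[0]
```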

Data Analysis : Load

•Depositing the resulting transformed data into a database for future analysis by a tool •Depositing the resulting transformed data into a file for future analysis by a tool - Excel - CSV Then you do the analysis, using tools such as: •Excel •Access •IDEA •ACL •Python •R •Tableau Data Warehouses, Data Marts, and Data Lakes: •Data warehouse: a large database that collects business information from many sources in support of management decision making •As the data is being loaded into the data warehouse, new indices are created and the data is checked against the constraints defined in the database schema to ensure its quality.
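The extract, transform, and load steps can be sketched end to end with the standard library. The CSV text, the `sales` table, and the three helper functions are hypothetical; an in-memory SQLite database stands in for the target database.

```python
import csv
import io
import sqlite3

# Hypothetical raw export from a source system.
raw_csv = "order_id,region,amount\n1, east ,100.50\n2,WEST,200\n3,east,not_a_number\n"

def extract(text):
    """Extract: read the raw rows from the source."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: normalize region names and reject rows with bad amounts."""
    out = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # fails the expected pattern, so it is rejected
        out.append((int(row["order_id"]), row["region"].strip().title(), amount))
    return out

def load(rows, conn):
    """Load: deposit the transformed rows into a database table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(order_id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()

conn = sqlite3.connect(":memory:")
load(transform(extract(raw_csv)), conn)

# The analysis step then runs against the loaded data.
totals = dict(conn.execute("SELECT region, SUM(amount) FROM sales GROUP BY region"))
```

The primary key on `order_id` is a small example of the schema constraints that data is checked against during loading.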

Custom or Ad Hoc Reporting

•Empower end-users to ask their own questions of company data, without burdening IT with the task of creating reports. •Example: - Each sales rep can set up their own report for their territory, showing performance against sales goals, or orders taken, or number of visits to each client, etc.

Customer Relationship Management (CRM) System

•Helps a company manage all aspects of customer encounters, including marketing, sales, distribution, accounting, and customer service •The goal of CRM is to understand and anticipate the needs of current and potential customers •CRM is used primarily in sales, marketing, and service organizations: •To capture and view data about customers and to improve communications •CRM software: •Automates and integrates the functions of sales, marketing, and service in an organization

Advantages of ERP

•Improved access to quality data for operational decision making •Elimination of costly, inflexible legacy systems •Improvement of work processes •Opportunity to upgrade and standardize technology infrastructure •Why build and re-invent the wheel, when you can buy it?

Self-Service Analytics

•Includes training, techniques, and processes that empower end users to work independently to access data from approved sources to perform their own analyses using a set of tools •Canned reports •QBE •Excel •Encourages nontechnical users to make decisions based on facts and analyses rather than intuition

Infrastructure as a Service (IaaS)

•Infrastructure as a service (IaaS) is an instant computing infrastructure, provisioned and managed over the Internet. •It provides computers (virtual or on dedicated hardware), networking, and data storage space. •Quickly scale up and down with demand and pay only for what you use. •IaaS removes the need for organizations to manage the underlying infrastructure (usually hardware and operating systems) and allows you to focus on the deployment and management of your applications. •You don't need to worry about resource procurement, capacity planning, OS software maintenance and patching, or any of the other undifferentiated heavy lifting involved in running your application. •IaaS helps you avoid the expense and complexity of buying and managing your own physical servers and other datacenter infrastructure.

Cloud based software model for enterprise software

•Many business application software vendors are pushing the use of the hosted software model for businesses •The goal is to help customers acquire, use, and benefit from the new technology while avoiding much of the associated complexity and high start-up costs •Faster deployment •Automatic software updates •Cost-effectiveness and scalability •The ability to work from anywhere, on any device

(Data Analysis Transform) Data Wrangling Tools

•Python plus pandas is a really good solution to this problem. •Excel •Access •IDEA •ACL •R

The Internet of Things (IoT) - Retail [Predictive Equipment Maintenance]

•Refrigeration units - sensors can predict maintenance issues that might affect power consumption for savings or monitor temperature fluctuations to ensure food safety.

The Internet of Things (IoT) - Retail [Automated Checkout]

•Set up a system to read tags on each item when a customer leaves the store. A checkout system would then tally the items up and automatically use data from the customers' mobile payment app. •McKinsey estimates automated checkout can reduce cashier staff requirements by up to 75%, resulting in savings of $150 billion to $380 billion a year in 2025.

The Internet of Things (IoT) - Retail [Smart Shelves]

•Smart shelves are fitted with weight sensors and use RFID tags and readers to scan the products on both display and stock shelves. •Smart Shelves inform you when items are running low or when items are incorrectly placed on a shelf, which makes your inventory process cost-effective and more precise. •Additionally, each RFID tag is connected to a reader, so Smart Shelves are able to detect in-store theft.

Software as a Service (SaaS)

•Software as a Service provides you with a completed product that is run and managed by the service provider. •In most cases, people referring to Software as a Service are referring to end-user applications. •With a SaaS offering you do not have to think about how the service is maintained or how the underlying infrastructure is managed; you only need to think about how you will use that particular application software. •A common example of a SaaS application is web-based email where you can send and receive email without having to manage feature additions to the email product or maintain the servers and operating systems that the email program is running on.

Software as a Service (SaaS) cont.

•Software as a service (SaaS) allows users to connect to and use cloud-based apps over the Internet. Common examples are email, calendaring, and office tools (such as Microsoft Office 365), Salesforce.com, ERP. •SaaS provides a complete software solution that you purchase on a pay-as-you-go basis from a cloud service provider. •You rent the use of an app for your organization, and your users connect to it over the Internet, usually with a web browser. •All of the underlying infrastructure, middleware, app software, and app data are located in the service provider's data center (IaaS). •The service provider manages the hardware and software, and with the appropriate service agreement, will ensure the availability and the security of the app and your data as well. •SaaS allows your organization to get quickly up and running with an app at minimal upfront cost.

Structured Data

•Structured data is information easily displayed in titled columns and rows which can easily be ordered and processed. •This could be visualized as a perfectly organized filing cabinet where everything is identified, labeled and easy to access. •AIS TPS -> Structured Data -> Relational DB

