ISMN Exam 2

Ace your homework & exams now with Quizwiz!

what is the big problem with each function of an organization having an individual collection of data (data silo)

Data redundancy and inconsistency can occur within an organization. For instance, the marketing and customer service functions may both contain duplicate data, which, when merged into the database, leads to data duplication and inconsistencies.

Nine reasons data is (and is constantly becoming) more difficult to manage

1. amount of data increases exponentially over time (increasing at an increasing rate) 2. data is scattered throughout organizations (different places); org and personal data 3. many individuals collect data 4. data are generated from multiple sources (internal, external, personal, the Web) 5. Data degrades over time (errors, or data not updated correctly) 6. New sources of data being developed (blogs, videos, RFID tags, sensors. SOCIAL MEDIA) 7. data rot (old media) 8. Data security, quality, and integrity are critical BUT EASILY JEOPARDIZED 9. Big Data

three negative effects of data silos mentioned in the textbook

1. hinder the process of gaining actionable insights from organizational data 2. create barriers to an overall view of an enterprise and its data 3. delay digital transformation efforts

what are the two big reasons data is becoming harder to acquire, keep and MANAGE? KEY WORD MANAGE

1. the amount of data is increasingly exponentially with time as some historical data must be kept and new data are rapidly forming 2. data is scattered throughout organizations, collected by many individuals using various methods, and kept in numerous servers and locations, and computing systems, databases, formats, and human and computer languages

what stopped us while we were building broadband around the world

COVD

true or false: broadband is loosely defined but is defined by the download speed of a transmission capacity of a communications medium

TRUE

broadband = _ ___ __ _________

a lot of bandwidth

which two communications media/channels are broadband

coaxial cable and fiber optic

what two materials are ALL physical cables made of

copper or glass

how long has descriptive analytics been around

since the 1950s

Content and interactions on social platforms, comprising text, images, videos, comments, and likes, used for audience engagement and insights.

social media data

networks represent a compromise between what three components

speed, distance and cost

data silos

stand-alone data stores. Their data are not accessible by other ISs that need it or outside that department.

market basket analysis

technique to discover relationships between products that customers frequently purchase together. It identifies patterns and helps businesses make decisions regarding product placement, cross-selling, and targeted marketing.

true or false: you are a computing device

true

true or false: analytics is mostly cleaning up data

true; (80% i believe)

describe components of market basket analysis and decision making

users can use this analysis to choose where to put products. it gets complicated because for example if two products are commonly bought (and the customer plans this before going into the store) then it is usually smart to put them on opposite sides of the store so the customer has to walk past a bunch of other items and consider buying them. on the other hand, if two items are not commonly planned to be bought together but often end up being bought together (like beer and diapers) then stores may put them next to each other

what do you do after making a business decision based on descriptive and predictive (and prescriptive) analytics?

you ask the next business question. it is a constant loop to improve

non-broadband includes

"dial-up"

__________ business decision means it must be doable

ACTIONABLE

what is the optimal customer segmentation and how (directly) does it help businesses and consumers

1:1 (segment customers by groups of one to give everyone a specialized version of what they want) because this means DEMAND IS BETTER MET

who still does not have broadband? (geographical)

Parts of Africa, Middle East and Western China

true or false: the main difference between internal and external data is where the most significant portion of data is created determines if it is internal or external. If the most significant portion of data is created with direct interaction with outside sources, like sales and reviews, it is internal

FALSE

central tendency

a measure that represents the typical response or the behavior of a group as a whole. mean, median, mode

Records user interactions on websites, tracking clicks, page views, and navigation patterns to analyze online behavior.

clickstream data

data streams

data continuously generated by point of sale systems, clickstream data, social media and sensors

name four major sources data is generated from

internal, external, personal, the Web

Broadband internet's relationship with remote work

it allowed/allows for people to live somewhere cheaper, it increases efficiency by removing/reducing the commute to work and overall traffic double edged sword: loss of identity, human interaction

Data from **the literal term so i have to leave it out** in devices and systems, providing real-time information on environmental, physical, or industrial conditions.

sensor data

online analytical processing: "_____ ___ ____"

slice and dice refers to cube shape of data organization

equation for Big Data

structured data + unstructured data = Big Data

what if a new batch of data breaks an old trend

there is a variable, possibly yet to be considered if it confuses the user

how do telecos try to get more people to switch to wireless

they slowly up the monthly fees for twisted pair connection

explain why organizations must typically select two of the three objectives when it comes to networks

to cover long distances and fast, organizations will have to pay more. they can pay less if it is slower. they can also choose fast, inexpensive communication with distance limitations

True or False: Social media mentions and customer reviews are typically classified as external data for a business.

true

true or false: acquiring, keeping and managing data is becoming increasingly difficult

true

Online Transaction Processing (OLTP)

Capturing of transaction and event information using technology to process, store, and update

true or false: you should ignore data outliers when analyzing data because they are usually not consistent with future trends

FALSE: always pay attention to outliers

true or false: every college has its own WAN

FALSE: every college has its own LAN

true or false: data is increasing linearly over time

FALSE: it is actually increasing exponentially

true or false: Networks run in the foreground because they require constant user interaction for data to be transmitted and received.

FALSE: networks run in the background

true or false: telecos (telephone companies) are desperate to keep twisted pair channels to maintain as many telephone services as possible

FALSE: they are trying to get rid of this because it is much more expensive, and by law they HAVE to cover everyone so if one guy in the middle of nowhere wanted it they would have to dig and place a cable down for his house alone

true or false: data usually comes from only a couple sources in an organization

False: data is located/scattered throughout organizations and this makes it harder to manage

Relational databases: What are rows? What are columns?

In relational databases, rows are horizontal records that contain data about a specific entity, while columns are vertical structures that represent attributes or characteristics of that entity.

Online Analytical Processing (OLAP)

Manipulation of information to create business intelligence in support of strategic decision making

Master Data vs. Transaction Data

Master data: Core, consistent info about entities (customers, products). Transaction data: Records of individual business activities (sales, purchases) stored temporarily.

difference between OLAP and OLTP

OLAP (Online Analytical Processing) is designed for complex queries and data analysis, typically involving historical data, while OLTP (Online Transaction Processing) handles real-time, transactional operations like order processing and data entry. OLAP emphasizes read-heavy tasks, while OLTP focuses on write-heavy, day-to-day operations.

network that covers a large geographical area and connect multiple LANS

WAN

What are we looking for with master data management?

a "single version of the truth" for the company's master data stored in one place

Big Data

a broad term for datasets so large or complex that traditional data processing applications are inadequate.

what is a "pain point"

a business problem

standard deviation

a computed measure of how much scores vary around the mean score

what is the primary key

a unique identifier for each record in a relational database

characteristics of high quality data

accurate, complete, timely, consistent accessible, relevant, and concise

a data mining technique to discover relationships between items, often used in retail to identify product co-purchases and associations

affinity analysis

Define Data Governance

an approach to managing information across an entire organization

what is Big Data good for

analytics and making REALLY GOOD PREDICTIONS

the transmission capacity of a network is defined as

bandwidth

why can users not get access to organization databases?

because it is LIVE: it is CONSTANTLY generating data (live production)

why are data streams called that? (data STREAMS)

because of their CONTINUOUS flow of data

what are the three dimensions of a multi-dimensional data warehouse

business historical time non-volatile basically no computer science, shows past, is not still changing

first four steps of business problem solving in order with small descriptions of each

business problem: "What specific business issue can analytics address or solve, and how can data-driven insights help optimize operations or decision-making?" data management: using internal data, external data, corporate DBs, Data warehouses (-> data streams) to analyze descriptive analysis: what was happened? predictive analysis: what will happen?

True or False: A company's proprietary market research report, which it conducts independently and does not share with external parties, is considered external data.

false

True or False: If a piece of data involves interactions with entities outside the organization, it's considered internal data.

false

true or false: WAN only covers the entire earth

false: Mars and voyagers past the solar system

true or false: broadband is relatively low transmission capacity and narrowband is relatively high transmission capacity

false: broadband is relatively high and narrowband relatively low

true or false: Master data is core data at a low level

false: core data at a high level

true or false: data rot refers to data becoming less relevant over time

false: data rot refers to data being created for an old medium like 8 track tapes. if you havent modernized the data then you have to find an old medium to play the data

true or false: data security, quality and integrity are difficult to jeopardize due to the focus on protecting them

false: there is so much data and it is so difficult to manage that data can easily be jeopardized

true or false: A/B experiments can only be on websites

false: they can in stores and in other things

two things Big Data does for us

feeds analytics; more Big Data means more accurate ANALYSIS feeds machine learning (AI/ Chat GPT)

what are Data Warehouses useful for

giving users access to data for solving their business problems

connects two or more devices in a limited geographical region, usually within the same building, so every device on the network can communicate with each other

local area network

an essential part of data governance is _______ ____ _______

master data management

example of master and transaction data

master data would be: "class name," "teacher name" and transaction data would be: "calc 1100," "John Smith"

data is collected by many individuals using varied _ and _

methods and devices

with bandwidth: is more better or less better

more

data cube characteristics

multidimensional, historical, usually don't delete data

any two computing devices that exchange data

network

digital divide is _ vs _

non-broadband vs broadband

in order, list the computer (digital area) networks:

personal area networks, local area networks, metropolitan area networks, wide area networks, and the INTERNET

four big types of data streams

point of sale clickstream social sensor

Information collected during customer transactions at physical or online stores, including purchase details, items, and payment methods.

point-of-sale data

four big types of data discussed in 5.1

point-of-sale, clickstream, social media, sensor

what is the primary key for: a student database (you are the subject) a credit card company (the card is the subject because people can have multiple cards)

student ID credit card number

relational database example

student database

true or false: an A/B experiment is where you leave everything identical and change ONE thing to see which of two options is more effective

true

true or false: better customer segmentation usually means demand is better met

true

true or false: business problem is user-driven

true

true or false: clickstream data includes hover data

true

true or false: orgs have to choose two of the three objectives when it comes to networks

true

true or false: social media has led to a huge data explosion

true

true or false: the main difference between internal and external data is where the most significant portion of data is created determines if it is internal or external. If the most significant portion of data is created without direct interaction with outside sources, it is internal

true

true or false: wide area networks existed before local area networks, but, without the need for distinction, were just referred to as networks

true

customer churn

turnover in a company's customer base

which communications media/channel describes plain old telephone service (POTS)

twisted pair

which communications media/channel is not broadband

twisted pair

as bandwidth is increasing and becoming cheaper, what to networks are continually becoming faster

voice and data communication networks

describe an educational problem that was caused by COVID and broadband's situation on it

when COVID hit, almost everyone relied on broadband to learn remotely. However, around 20% of k-12 students did not have access to broadband

who owns networks

MIS


Related study sets

Chapter 47: Bowel Elimination (Bowel Elimination)

View Set

BUS 3 - Financial Accounting (Connect)

View Set

Peds - Chapter 30: Nursing Care of the Child With a Cognitive or Mental Health Disorder

View Set

Module 7 - Risk Reducing Strategies for Different Driving Environments

View Set