ISMN Exam 2
what is the big problem with each function of an organization having an individual collection of data (data silo)
Data redundancy and inconsistency can occur within an organization. For instance, the marketing and customer service functions may both contain duplicate data, which, when merged into the database, leads to data duplication and inconsistencies.
Nine reasons data is (and is constantly becoming) more difficult to manage
1. amount of data increases exponentially over time (increasing at an increasing rate) 2. data is scattered throughout organizations (different places); org and personal data 3. many individuals collect data 4. data are generated from multiple sources (internal, external, personal, the Web) 5. Data degrades over time (errors, or data not updated correctly) 6. New sources of data being developed (blogs, videos, RFID tags, sensors. SOCIAL MEDIA) 7. data rot (old media) 8. Data security, quality, and integrity are critical BUT EASILY JEOPARDIZED 9. Big Data
three negative effects of data silos mentioned in the textbook
1. hinder the process of gaining actionable insights from organizational data 2. create barriers to an overall view of an enterprise and its data 3. delay digital transformation efforts
what are the two big reasons data is becoming harder to acquire, keep and MANAGE? KEY WORD MANAGE
1. the amount of data is increasingly exponentially with time as some historical data must be kept and new data are rapidly forming 2. data is scattered throughout organizations, collected by many individuals using various methods, and kept in numerous servers and locations, and computing systems, databases, formats, and human and computer languages
what stopped us while we were building broadband around the world
COVD
true or false: broadband is loosely defined but is defined by the download speed of a transmission capacity of a communications medium
TRUE
broadband = _ ___ __ _________
a lot of bandwidth
which two communications media/channels are broadband
coaxial cable and fiber optic
what two materials are ALL physical cables made of
copper or glass
how long has descriptive analytics been around
since the 1950s
Content and interactions on social platforms, comprising text, images, videos, comments, and likes, used for audience engagement and insights.
social media data
networks represent a compromise between what three components
speed, distance and cost
data silos
stand-alone data stores. Their data are not accessible by other ISs that need it or outside that department.
market basket analysis
technique to discover relationships between products that customers frequently purchase together. It identifies patterns and helps businesses make decisions regarding product placement, cross-selling, and targeted marketing.
true or false: you are a computing device
true
true or false: analytics is mostly cleaning up data
true; (80% i believe)
describe components of market basket analysis and decision making
users can use this analysis to choose where to put products. it gets complicated because for example if two products are commonly bought (and the customer plans this before going into the store) then it is usually smart to put them on opposite sides of the store so the customer has to walk past a bunch of other items and consider buying them. on the other hand, if two items are not commonly planned to be bought together but often end up being bought together (like beer and diapers) then stores may put them next to each other
what do you do after making a business decision based on descriptive and predictive (and prescriptive) analytics?
you ask the next business question. it is a constant loop to improve
non-broadband includes
"dial-up"
__________ business decision means it must be doable
ACTIONABLE
what is the optimal customer segmentation and how (directly) does it help businesses and consumers
1:1 (segment customers by groups of one to give everyone a specialized version of what they want) because this means DEMAND IS BETTER MET
who still does not have broadband? (geographical)
Parts of Africa, Middle East and Western China
true or false: the main difference between internal and external data is where the most significant portion of data is created determines if it is internal or external. If the most significant portion of data is created with direct interaction with outside sources, like sales and reviews, it is internal
FALSE
central tendency
a measure that represents the typical response or the behavior of a group as a whole. mean, median, mode
Records user interactions on websites, tracking clicks, page views, and navigation patterns to analyze online behavior.
clickstream data
data streams
data continuously generated by point of sale systems, clickstream data, social media and sensors
name four major sources data is generated from
internal, external, personal, the Web
Broadband internet's relationship with remote work
it allowed/allows for people to live somewhere cheaper, it increases efficiency by removing/reducing the commute to work and overall traffic double edged sword: loss of identity, human interaction
Data from **the literal term so i have to leave it out** in devices and systems, providing real-time information on environmental, physical, or industrial conditions.
sensor data
online analytical processing: "_____ ___ ____"
slice and dice refers to cube shape of data organization
equation for Big Data
structured data + unstructured data = Big Data
what if a new batch of data breaks an old trend
there is a variable, possibly yet to be considered if it confuses the user
how do telecos try to get more people to switch to wireless
they slowly up the monthly fees for twisted pair connection
explain why organizations must typically select two of the three objectives when it comes to networks
to cover long distances and fast, organizations will have to pay more. they can pay less if it is slower. they can also choose fast, inexpensive communication with distance limitations
True or False: Social media mentions and customer reviews are typically classified as external data for a business.
true
true or false: acquiring, keeping and managing data is becoming increasingly difficult
true
Online Transaction Processing (OLTP)
Capturing of transaction and event information using technology to process, store, and update
true or false: you should ignore data outliers when analyzing data because they are usually not consistent with future trends
FALSE: always pay attention to outliers
true or false: every college has its own WAN
FALSE: every college has its own LAN
true or false: data is increasing linearly over time
FALSE: it is actually increasing exponentially
true or false: Networks run in the foreground because they require constant user interaction for data to be transmitted and received.
FALSE: networks run in the background
true or false: telecos (telephone companies) are desperate to keep twisted pair channels to maintain as many telephone services as possible
FALSE: they are trying to get rid of this because it is much more expensive, and by law they HAVE to cover everyone so if one guy in the middle of nowhere wanted it they would have to dig and place a cable down for his house alone
true or false: data usually comes from only a couple sources in an organization
False: data is located/scattered throughout organizations and this makes it harder to manage
Relational databases: What are rows? What are columns?
In relational databases, rows are horizontal records that contain data about a specific entity, while columns are vertical structures that represent attributes or characteristics of that entity.
Online Analytical Processing (OLAP)
Manipulation of information to create business intelligence in support of strategic decision making
Master Data vs. Transaction Data
Master data: Core, consistent info about entities (customers, products). Transaction data: Records of individual business activities (sales, purchases) stored temporarily.
difference between OLAP and OLTP
OLAP (Online Analytical Processing) is designed for complex queries and data analysis, typically involving historical data, while OLTP (Online Transaction Processing) handles real-time, transactional operations like order processing and data entry. OLAP emphasizes read-heavy tasks, while OLTP focuses on write-heavy, day-to-day operations.
network that covers a large geographical area and connect multiple LANS
WAN
What are we looking for with master data management?
a "single version of the truth" for the company's master data stored in one place
Big Data
a broad term for datasets so large or complex that traditional data processing applications are inadequate.
what is a "pain point"
a business problem
standard deviation
a computed measure of how much scores vary around the mean score
what is the primary key
a unique identifier for each record in a relational database
characteristics of high quality data
accurate, complete, timely, consistent accessible, relevant, and concise
a data mining technique to discover relationships between items, often used in retail to identify product co-purchases and associations
affinity analysis
Define Data Governance
an approach to managing information across an entire organization
what is Big Data good for
analytics and making REALLY GOOD PREDICTIONS
the transmission capacity of a network is defined as
bandwidth
why can users not get access to organization databases?
because it is LIVE: it is CONSTANTLY generating data (live production)
why are data streams called that? (data STREAMS)
because of their CONTINUOUS flow of data
what are the three dimensions of a multi-dimensional data warehouse
business historical time non-volatile basically no computer science, shows past, is not still changing
first four steps of business problem solving in order with small descriptions of each
business problem: "What specific business issue can analytics address or solve, and how can data-driven insights help optimize operations or decision-making?" data management: using internal data, external data, corporate DBs, Data warehouses (-> data streams) to analyze descriptive analysis: what was happened? predictive analysis: what will happen?
True or False: A company's proprietary market research report, which it conducts independently and does not share with external parties, is considered external data.
false
True or False: If a piece of data involves interactions with entities outside the organization, it's considered internal data.
false
true or false: WAN only covers the entire earth
false: Mars and voyagers past the solar system
true or false: broadband is relatively low transmission capacity and narrowband is relatively high transmission capacity
false: broadband is relatively high and narrowband relatively low
true or false: Master data is core data at a low level
false: core data at a high level
true or false: data rot refers to data becoming less relevant over time
false: data rot refers to data being created for an old medium like 8 track tapes. if you havent modernized the data then you have to find an old medium to play the data
true or false: data security, quality and integrity are difficult to jeopardize due to the focus on protecting them
false: there is so much data and it is so difficult to manage that data can easily be jeopardized
true or false: A/B experiments can only be on websites
false: they can in stores and in other things
two things Big Data does for us
feeds analytics; more Big Data means more accurate ANALYSIS feeds machine learning (AI/ Chat GPT)
what are Data Warehouses useful for
giving users access to data for solving their business problems
connects two or more devices in a limited geographical region, usually within the same building, so every device on the network can communicate with each other
local area network
an essential part of data governance is _______ ____ _______
master data management
example of master and transaction data
master data would be: "class name," "teacher name" and transaction data would be: "calc 1100," "John Smith"
data is collected by many individuals using varied _ and _
methods and devices
with bandwidth: is more better or less better
more
data cube characteristics
multidimensional, historical, usually don't delete data
any two computing devices that exchange data
network
digital divide is _ vs _
non-broadband vs broadband
in order, list the computer (digital area) networks:
personal area networks, local area networks, metropolitan area networks, wide area networks, and the INTERNET
four big types of data streams
point of sale clickstream social sensor
Information collected during customer transactions at physical or online stores, including purchase details, items, and payment methods.
point-of-sale data
four big types of data discussed in 5.1
point-of-sale, clickstream, social media, sensor
what is the primary key for: a student database (you are the subject) a credit card company (the card is the subject because people can have multiple cards)
student ID credit card number
relational database example
student database
true or false: an A/B experiment is where you leave everything identical and change ONE thing to see which of two options is more effective
true
true or false: better customer segmentation usually means demand is better met
true
true or false: business problem is user-driven
true
true or false: clickstream data includes hover data
true
true or false: orgs have to choose two of the three objectives when it comes to networks
true
true or false: social media has led to a huge data explosion
true
true or false: the main difference between internal and external data is where the most significant portion of data is created determines if it is internal or external. If the most significant portion of data is created without direct interaction with outside sources, it is internal
true
true or false: wide area networks existed before local area networks, but, without the need for distinction, were just referred to as networks
true
customer churn
turnover in a company's customer base
which communications media/channel describes plain old telephone service (POTS)
twisted pair
which communications media/channel is not broadband
twisted pair
as bandwidth is increasing and becoming cheaper, what to networks are continually becoming faster
voice and data communication networks
describe an educational problem that was caused by COVID and broadband's situation on it
when COVID hit, almost everyone relied on broadband to learn remotely. However, around 20% of k-12 students did not have access to broadband
who owns networks
MIS