IS TEST 2
Two Basic Data Mining Operations
1) Predicting trends and behaviors 2) Identifying previously unknown patterns
Smaller Organizations use
Excel Spreadsheets
Bloomberg Terminals
Tells you everything about the industry you are doing Financial work on
Critical Success Factors (CSF)
The factors most critical for the success of business. These can be organizational, industry, departmental, or for individual workers
Internet Backbone
The primary network connections and telecommunications lines that link the nodes of the Internet using a fiber-optic network that is operated primarily by large telecommunications companies.
Key Performance Indicators
The specific measures of CSFs
Communication
a major category of network applications related to communication technologies delivered online (e.g., including e-mail, call centers, chat rooms, and voice).
Analog Signals
continuous waves that transmit information by altering the characteristics of the waves. Analog signals have two parameters, amplitude and frequency.
Ethernet
A common LAN protocol. Many organizations use 100-gigabit Ethernet, where the network provides data transmission speeds of 100 gigabits (100 billion bits) per second.
What-If Analysis
A model builder must make predictions and assumptions regarding the input data, many of which are based on the assessment of uncertain futures. The results depend on the accuracy of these assumptions, which can be highly subjective.
MOOC's (Massive Open Online Courses)
A new form of distance learning which are a tool for democratizing higher education. Several factors have contributed to the growth of MOOCs, including improved technology and the rapidly increasing costs of traditional universities. MOOCs are highly automated, complete with computer-graded assignments and exams. MOOCs have not yet proved that they can effectively teach the thousands of students who enroll in them. In addition, they do not provide revenues for universities.
Predictive Analytics Examples2
Automatic grading of student essays Wireless carriers predict customer churn Insurance companies predict body injury liability from car crashes Better diagnosis of breast cancer Predict the likelihood a convict will offend again
Packets
Before data are transmitted over the Internet, they are divided into small, fixed bundles called packets.
Client/Server Computing
Client/server computing links two or more computers in an arrangement in which some machines, called servers, provide computing services for user PCs, called clients.
Scope of BANA
Dev of One or a Few Related Analytics Applications Dev of Infrastructure to Support Enterprisewide Analytics Support for Organizational Transformation
Predictive Analytics Examples
Drive the coupons you receive at the grocery store register Website predict which ads you will click on President Obama was re-elected in 2012 with the help of voter prediction Leading online dating companies match compatible individuals
Previous behavior is a good indicator of
Future Behavior
Reality Mining
Graphical Information Systems (GIS) and Geographic Positioning Systems (GPS) together to produce an interesting new type of technology which allows analysts to extract information from the usage patterns of mobile phones and other wireless devices.
Real-Time Business Analytics
Includes the use of real time data for analysis as it is created rather than using historical data for analysis.
TCP performs three basic functions:
It manages the movement of data packets between computers by establishing a connection between the computers it sequences the transfer of packets it acknowledges the packets that have been transmitted
Larger Organizations Use
Multidimensional Analysis or Online Analytical Processing or Data Mining or DSS dashboards
Extranet
Parts of intranets of multiple companies/locations connected together connects parts of the intranets of different organizations which enables business partners to communicate securely over the Internet using virtual private networks (VPNs)
Exception Reporting
Reports that highlight deviations larger than certain thresholds. Reports may include only deviations
Reduced Performance in the Near Future
Researchers assert that if Internet bandwidth is not improved rapidly within a few years the Internet will only function at a much reduced speed.
Intranet
Secure network inside an organization that can use Internet infrastructure (hardware, software, protocols) a network that uses Internet protocols so that users can take advantage of familiar applications and work habits. Intranets support discovery (easy and inexpensive browsing and search), communication, and collaboration inside an organization.
Sensitivity Analysis
Sensitivity analysis is the study of the impact that changes in one or more parts of a decision-making model have on other parts.
Trend Analysis
Short-,Medium-, and Long-term trend of KPIs or metrics, which are projected using forecasting methods
Drill Down
The ability to go to details, at several levels; it can be done by a series of menus or by clicking on a drillable portion of the screen
Status Access
The latest data available on KPI or some other metric, often in real-time
Excel
The most popular BA tool by far is Excel. For years, BA vendors "fought" against the use of Excel. Eventually, however, they decided to "join it" by designing their soft ware so that it interfaces with Microsoft Excel
Virtual Group (Team)
Virtual groups conduct virtual meetings—that is, they "meet" electronically. Virtual collaboration (or e-collaboration): the use of digital technologies that enable organizations or individuals who are geographically dispersed to collaboratively plan, design, develop, manage, and research products, services, and innovative applications.
Accessing a website
You (client) request a website (URL). Request (as packets) goes first to your ISP's domain name server (DNS). DNS finds the location of the resource you want and forwards your request to that server. Web page is sent back to you using your IP address.
Portals
a Web-based, personalized gateway to information and knowledge that provides relevant information from different IT systems and the Internet using advanced search and indexing techniques.
Protocol
a common set of rules and procedures that enable nodes on a network to communicate with one another.
Internet service provider (ISP)
a company that provides Internet connections for a fee.
Geographic Information System (GIS)
a computer based system for capturing, integrating, manipulating, and displaying data using digitized maps. Its most distinguishing characteristic is that every record or digital object has an identified geographical location. This process, called geocoding, enables users to generate information for planning, problem solving, and decision making.
Search Engine
a computer program that searches for specific information by keywords and then reports the results.
Internet
a network of networks... a global WAN that connects approximately 1 million organizational computer networks in more than 200 countries on all continents, including Antarctica. It has become so widespread that it features in the daily routine of some 3 billion people.
Wide Area Network (WAN)
a network that covers a large geographical area. WANs typically connect multiple LANs. They are generally provided by common carriers such as telephone companies and the international networks of global communications services providers. WANs have large capacity, and they typically combine multiple channels (e.g., fiber-optic cables, microwave, and satellite). Th e Internet is an example of a WAN.
Communications Channel
a pathway or medium communicating data from one location to another.
Industrywide Portal
a portal that serves an entire industry (e.g., TruckNet <www.truck.net>).
Crowdsourcing
a process in which an organization outsources a task to an undefined, generally large group of people in the form of an open call.
Coaxial Cable
a single strand of insulated copper wire. It is much less susceptible to electrical interference, and it can carry much more data than twisted-pair. It is commonly used to carry high-speed data traffic as well as television signals.
World Wide Web
a system of universally accepted standards for storing, retrieving, formatting, and displaying information via a client/server architecture.
Computer Network
a system that connects computers and other devices (e.g., printers) via communications media so that data and information can be transmitted among them.
Peer-to-Peer (P2P) Processing
a type of client/server distributed processing where each computer acts as both a client and a server. Each computer can access (as assigned for security or integrity purposes) all files on all other computers.
Synchronous Collaboration
all team members meet at the same time.
Broadband
based on the Federal Communications Commission (FCC) 2010 definition, broadband is the transmission capacity of a communications medium
Decision support systems (DSSs)
combine models and data to analyze semistructured problems and some unstructured problems that involve extensive user involvement. Decision support systems enable business managers and analysts to access data interactively, to manipulate these data, and to conduct appropriate analyses.
Local Area Network (LAN
connects two or more devices in a limited geographical region, usually within the same building, so that every device on the network can communicate with every other device.
Fiber-Optic Cable
consists of thousands of very thin filaments of glass fibers that transmit information via light pulses generated by lasers. The fiber-optic cable is surrounded by cladding, a coating that prevents the light from leaking out of the fiber.
Modem (modulator-demodulator
converts digital signals to analog signals—a process called modulation—and analog signals to digital signals—a process called demodulation. (The name modem is a contraction of modulator-demodulator.)
Data Visualization
data presented to users in visual formats such as text, graphics, and tables following data processing. Data Visualization makes IT applications more attractive and understandable to users.
Internet2
developed by many U.S. universities collaborating with industry and government which develops and deploys advanced network applications such as remote medical diagnosis, digital libraries, distance education, online simulation, and virtual laboratories.
Digital Signals
discrete pulses that are either on or off, representing a series of bits (0s and 1s). Th is quality allows digital signals to convey information in a binary form that can be interpreted by computers.
Internet Protocol (IP) address
distinguishes your device from all other computers and consists of sets of numbers, in four parts, separated by dots. For example, the IP address of one computer might be 135.62.128.91. You can access a Web site by typing this number in the address bar of your browser.
Distributed Processing
divides processing work among two or more computers which enables computers in different locations to communicate with one another via telecommunications.
Network Access Points (NAPs)
exchange points for Internet traffic. They determine how traffic is routed. NAPs are key components of the Internet backbone.
Prescriptive Analytics
go beyond descriptive and predictive models by recommending one or more courses of action and showing the likely outcome of each decision. Predictive analytics does not predict one possible future, but rather multiple future outcomes based on the decision maker's actions.
Publication of Material in Foreign Languages
information on the Internet is written in many different languages, and automatic translation of Web pages is essential.
Online Analytical Processing (OLAP) (Multidimensional Analysis)
involves "slicing and dicing" data stored in a dimensional format, drilling down in the data to greater detail, and aggregating the data.
Business Intelligence (BI):
is a broad category of applications, technologies, and processes for gathering, storing, accessing, and analyzing data to help business users make better decisions. BI applications enable decision makers to quickly ascertain the status of a business enterprise by examining key information.
Internet Protocol (IP)
is responsible for disassembling, delivering, and reassembling the data during transmission.
Communications Media
is the physical media used to send data (e.g., twisted-pair wire, coaxial cable, fiber optic cable, satellite, etc.).
E-learning
learning supported by the Web. It can take place inside classrooms as a support to conventional teaching, such as when students work on the Web during class. It also can take place in virtual classrooms, in which all coursework is completed.
Asynchronous Collaboration
occurs when team members cannot meet at the same time.
Corporate Portal
offers a personalized, single point of access through a Web browser to critical business information located inside and outside an organization. These portals are also known as enterprise portals, information portals, and enterprise information portals.
Affinity Portal
offers a single point of entry to an entire community of affiliated interests, such as a hobby group or a political party.
Enterprise Network
organizational interconnected networks consisting of multiple LANs and may also include multiple WAN's.r in this chapter) faster than 4 megabits per second (Mbps).
Uniform Resource Locator (URL)
points to the address of a specific resource on the Web
Dashboard
provides easy access to timely information and direct access to management reports. They evolved from executive information systems, which were information systems designed specifically for the information needs of top executives
Distance Learning (DL)
refers to any learning situation in which teachers and students do not meet face-to-face.
Data Mining
refers to the process of searching for valuable business information in a large database, data warehouse, or data mart. Data mining can perform two basic operations: (1) predicting trends and behaviors and (2) identifying previously unknown patterns. Business analytics applications typically provide users with a view of what has happened; data mining helps to explain why it is happening, and it predicts what will happen in the future.
Bandwidth
refers to the transmission capacity of a network; it is stated in bits per second
Goal-Seeking Analysis
represents a "backward" solution approach. It attempts to calculate the value of the inputs necessary to achieve a desired level of output.
Metasearch Engines
search several engines at once and then integrate the findings to answer users' queries.
Models
simplified representations, or abstractions, of reality.
Unified Communications (UC)
simplifies and integrates all forms of communications (voice, voice mail, fax, chat, e-mail, instant messaging, short message service, presence (location) services, and videoconferencing) on a common hardware and soft ware platform.
Descriptive Analytics
summarize what has happened in the past and allow decision makers to learn from past behaviors. Common examples of descriptive analytics are reports that provide historical insights regarding an organization's production, financials, operations, sales, finance, inventory, and customers.
Hypertext
text displayed on a computer display or other electronic device with references, called hyperlinks, to other text that the reader can immediately access, or where text can be revealed progressively at additional levels of details.
Data Reduction
the conversion of raw data into a smaller amount of more useful information.
Commercial (public) Portal
the most popular type of portal on the Internet and is intended for broad and diverse audiences (e.g., a stock ticker).
Twisted-Pair Wire
the most prevalent form of communications wiring used for almost all business telephone wiring and As the name suggests, it consists of strands of copper wire twisted in pairs. It is relatively inexpensive to purchase, widely available, and easy to work with.
Workflow
the movement of information as it progresses through the sequence of steps that make up an organization's work procedures. Workflow management makes it possible to pass documents, information, and tasks from one participant to another in a way that is governed by the organization's rules or procedures.
Telecommuting
the process in which highly prized "knowledge" workers are now able to work anywhere and anytime. Knowledge workers are being called the distributed workforce, or "digital nomads."
Business Analytics
the process of developing actionable decisions or recommendations for actions based on insights generated from historical data. Business analytics examines data with a variety of tools and techniques, formulates descriptive, predictive, and prescriptive models, and communicates these results to organizational decision makers.
Transmission Control Protocol/Internet Protocol (TCP/IP)
the protocol of the Internet. TCP/IP uses a suite of protocols, the main ones being the Transmission Control Protocol (TCP) and the Internet Protocol (IP).
Packet Switching
the transmission technology that breaks up blocks of data into packets is called packet switching. Each packet carries the information that will help it reach its destination—the sender's IP address, the intended receiver's IP address, the number of packets in the message, and the number of the particular packet within the message. Each packet travels independently across the network and can be routed through different paths in the network. When the packets reach their destination, they are reassembled into the original message.
Electronic Teleconferencing
the use of electronic communication technology that enables two or more people at different locations to hold a conference.
Workgroup
two or more individuals who act together to perform some task.
Predictive Analytics
utilize a variety of analytics techniques and tools to examine recent and historical data in order to detect patterns and predict future outcomes and trends. Predictive analytics provide estimates about the likelihood of a future outcome
Telecommuting Employees
workers who have no permanent office at their companies, preferring to work at home offices, in airport lounges, or client conference rooms, or on a high school stadium bleacher.
Telecommuting Employees:
workers who have no permanent office at their companies, preferring to work at home offices, in airport lounges, or client conference rooms, or on a high school stadium bleacher.