ISDS Chapter 5 and 6

अब Quizwiz के साथ अपने होमवर्क और परीक्षाओं को एस करें!

Relational database

Organize data into two-dimensional tables (relations) with columns and rows most common type of database today one table for customers and one for suppliers-- most basic ex page 195 Fields in a relational data base are also called columns Rows are referred as records or in technical term tuples

Bit

the smallest unit of data a computer can handle

Workstation

fits on a desktop but has more powerful mathematical and graphics-processing capabilities than a PC

Nonrelation database

Can handle large data sets that are not easily organized into tables, columns, and rows • "NoSQL": Non-relational database technologies • Non-relational DBMS - Use more flexible data model -Don't require extensive structuring - Can manage unstructured data, such as social media and graphics • E.g. Amazon's SimpleDB

Mainframe

Large-capacity, high-performance computer that can process large amounts of data very rapidly. IBM- leading mainframe vendor has repurposed its mainframe system so they can be used as giant servers for large-scale enterprise networks and corporate websites Single IBM can run enough instances of Linux or Windows server software to replace thousands of smaller Windows-bases severs

Data Center

Is a facility housing computer system and associated components such as telecommunications, storage, security systems, and backup power supplies.

Managing Software Localization For Global Business

Local language interfaces -- English not typically standard at middle, lower levels • Interfaces are complex: menu bars, error messages, online forms, search results, and so on -- Differences in local cultures • Differences in business processes • All of these factors add to TCO of using technology service providers or to outsource entire process of converting software to operate in a second language is called software localization.

Ensuring Data Quality

Poor data quality: major obstacle to successful customer relationship management • Data quality problems: caused by -- Redundant and inconsistent data produced by multiple systems --Data input errors • Data quality audit: structured survey of the accuracy and completeness of data • Data cleansing (aks data scrubbing): detects and corrects incorrect, incomplete, improperly formatted, and redundant data

Virtualization

Process of presenting a set of computing resources so they can be accessed in ways that are unrestricted by physical configuration or geographic location • Enables single physical resource to appear to the user as multiple resources • Enables companies to host multiple systems on single machine Helps organization increase equipment usage rate, conserving data center space and energy usage.

Software Trends

Open source software--developed by a community of programmers around the world, who make their program available to users under one of several licensing schemes. Grown to be a major part of corporate computing infrastructure, as the foundation for programs since as Linux and Apache, the most widely web server software. - Cloud-based software and tools consits of mashups and apps.

Server Computers

are specifically optimized to support a computer network , enabling users to share files, software, peripheral deceives (such as a printer), or other network resources. provide the hardware platform for electronic commerce

Total Cost of Ownership (TCO) of Technology Assets

Used to analyze direct and indirect costs to help determine the actual cost of owning a specific technology • Direct costs: hardware, software purchase costs • Indirect costs: ongoing administration costs, upgrades, maintenance, technical support, training, utility, and real estate costs • Hidden costs: support staff, downtime, additional network management • TCO can be reduced through increased centralization, standardization of hardware and software resources

Networking and Telecommunications Technology,

provides data, voice, and video connectivity to employees, customers, and suppliers Includes technology for running a company's internal network, services from telecommunications/telephones services company, and technology for running websites and linking to other computer systems through the internet.

A firm's IT infrastructure provides

the foundation, or platform, for supporting all the information systems in the business

Magnetic Disks

the most widely used secondary storage medium today hard drives, USBs flash drives, solid state drives (SSD) SSDs have replaced the hard drives un lightweight PCs. A SSD uses an array of semiconductors organized as an internal disk drive, where as USB flash drive use similar technology for external storage.

Capacity Planning

the process of predicting when a computer hardware system becomes saturated. Ensures the firm has enough computing power for current and future needs Consider these factors: Maximum number of users that the system can accommodate at one time, impact of existing and future software applications, and performance measures, such as minimum response time for processing business transactions.

Distributed Processing

the use of multiple computers linked by a communications network for processing

Key field

uniquely identifies each record so that the record can be retrieved, updated, or sorted

Two-tiered client/server architecture

uses two types of machines

Computer Hardware

consists of technology for computer processing, data storage, input, and output

Attributes

each entity has a specific characteristic

Primary Key

each table in a RD has one field designated as its.. Primary key cant be duplicated

Centralized Processing

all processing is accomplished by one large central computer much less common

Cloud Computing

A model of computing in in which computer processing, storage, software, and other services are provided as a shared pool of virtualized resources over a network, primary the internet. Fastest growing form of computing Drawbacks--Data storage security is in hands of provider

File

A group of records of the same type

Record

A group of related fields in a database

Field

A grouping of characters into a word, a group of words, or a complete number, such as a person's name or age.

High-performance/power-saving processors

Another way to reduce power requirements and hardware sprawl is to use more efficient and power-saving processors. • Multicore processor is an integrated circuit to which with two or more processors cores have been attached for enhanced performance, reduced power consumption, and more efficient simultaneous processing of multiple tasks. • Power-efficient processors are used in lightweight smartphones and tablets, intelligent cars, and health care devices. Such as the Apple A8 and A9 which have about 1-15th of the power consumption of a laptop dual-core processor. • Low power consumption essential in mobile computing

Business Intelligence Infrastructure

Array of tools for obtaining useful information from internal and external systems and big data Includes: • Data warehouses • Data marts • Hadoop • In-memory computing • Analytical platforms

Multi-tiered client/server architecture (N-tier)

Balances load of network over several levels of servers

Managing Mobile Platforms

Balancing gains in productivity from using mobile devices with expenses of equipping employees with these devices More difficult to protect the company's network and data when employees access them from their privately owned devices • Mobile device management (MDM) software- monitors, manages, and secures mobile devices that are deployed across multiple mobile service providers and across multiple mobile operating system being used in the organization

Types of Computers

Come in array of different sizes with differing capabilities for processing information personal computers, mobile devices, workstations Server vs.. mainframe

IT infrastructure is composed of

Computer hardware, computer software, data management technology, networking and telecommunications technology, and technology services.

Storage Networks

Connect multiple storage devices on a separate highspeed network dedicated to storage. SAN (Storage Area Network) creates large central pool of storage than multiple servers can rapidly access and share.

Web Mining

Discovery and analysis of useful patterns and information from the Web -- E.g. to understand customer behavior, evaluate Web site, quantify success of marketing • Content mining - mines content of Web sites • Structure mining - mines Web site structural elements, such as links • Usage mining - mines user interaction data gathered by Web servers

Data Mining

Finds hidden patterns and relationships in large databases and infers rules from them to predict future behavior • Types of information obtainable via data mining • Associations: occurrences linked to single event • Sequences: events linked over time • Classifications: patterns describing a group an item belongs to ex. spam mail • Clustering: discovering as yet unclassified groupings • Forecasting: uses series of values to forecast future values

HTML and HTML5

Hypertext Markup Language is a page description language for specifying how to text, graphics, video, and sound are placed on a web page for creating dynamic links to others pages and objects. Most recent version of HTML is HTML5 makes it possible to embed images, audio, and other elements into a document without processor-intensive add-ons

Establishing an Information Policy

Information Policy • States organization's rules for organizing, managing, storing, sharing information Data Administration • Responsible for specific policies and procedures through which data can be managed as a resource Database Administration • Database design and management group responsible for defining and organizing the structure and content of the database, and maintaining the database

Cloud Computing consist of three types of services:

Infrastructure as a service (IaaS):Customers use processing, storage, networking, and other computing resources from cloud services to run THEIR information systems. In other words, allows customers to build custom software on the cloud. Example - Amazon Web Service • Platform as a service (PaaS): Customers use infrastructure and programming tools supported by the cloud service provider to develop their own applications. In other words, the service provider offers programming capabilities to customers to develop and test applications. Example - IBM Bluemix Salesforce is a leading example of PaaS • Software as a service (SaaS): Customers use software hosted by the vendor on the vendor's cloud infrastructure and delivered as a service over a network. In other words, the vendor provides software programs on a subscription basis. Examples - Dropbox, Google Docs; Cloud ERP (SAP HANA) Table comparing the types on page 168

Input and Output Devices

Input Devices: gather data and convey them onto electronic form for use by the computer. Ex. Keyboard, computer mouse, touch screen, pen-based input, optical character recognition, magnetic ink character recognition, digital scanner, audio input, sensors (Table on page 159) Output Devices: display data after after they have been processed Ex. • Monitor (flat-panel, CRT) • Printer (impact, nonimpact) • Audio output (Table pg 159)

Mashups and Apps

Mashups--Individual users and entire companies mix and match these software components to create their own customized application and to share information with others. The ideas is to produce from different sources a new work that is grater than the sum if its parts. Ex. Zip Realty uses Google Maps and Zillow.com Apps-- small specialized software programs that run on eh Internet, on your computer, or on your mobile phone or tablet and are generally delivered over the Internet. Mobile Apps Apps tie the customer to a specific hardware platform; as the user adds more and more apps to his or her mobile phone, the cost of switching to a competing mobile platform rises

The Challenge of Big Data

Massive quantities of unstructured and semi-structured data from Internet, networked services and mobile applications, that are beyond the ability if typical DBMS to capture, store, and analyze (petabyte and exabyte range) -records more rapidly than traditional data • Big datasets provide more opportunities for discovering patterns and insights than smaller datasets, e.g. • Customer behavior • Weather patterns • Requires new technologies and tools

Contemporary Hardware Trends

Mobile Digital Platform Consumerization of IT Nanotechnology Quantum Computers Virtualization Cloud Computing Green Computing High-performance/power-saving processors

Consumerization of IT and BYOD

New technology that emerges in consumer market spreads into business organizations • BYOD (bring your own device): Use of personal mobile devices at work Consumerization of IT is forcing business to rethink the way they obtain and manage information technology equipment and services due to BYOD

Cloud Computing essential characteristics:

On-demand self-service- consumers can obtain computing capabilities such as server time or network storage as needed automatically on their own Ubiquitous network access- cloud resources can be accessed using standard network and Internet devices, including mobile platforms. Location-independent resource pooling- computing resources are polled to serve multiple users, with different virtual resources dynamically assigned according to user demand. The user generally doesn't know where the computing resources are located. Rapid elasticity- computing resources can be rapidly provisioned, increased, or decreased to meet changing user demand Measures service- changes for cloud resources are based on number of recourses actually used.

Analytical Tools: Relationships, Patterns, Trends

Once data is gathered, tools are required for consolidating, analyzing, and developing insight to improve decision making • Software for database querying and reporting • Multidimensional data analysis (OLAP) • Data mining

Hadoop

Open-source software framework from Apache manages that enables distributed parallel processing of huge amount of data across inexpensive computers. Designed for big data Breaks data task into subproblems and distributes the processing to many inexpensive computer processing nodes Combines result into smaller data set that is easier to analyze Key services: -- Hadoop Distributed File System (HDFS)-links files system on the numerous nodes in a Hadoop cluster to turn them into one big file system -- MapReduce- inspired by Google's MapReduce system for breaking down processing of huge data sets and assigning work to the various nodes in cluster Runs a cluster of inexpensive servers and processors can be added or removed Hbase- Hadoop non-relational database

Operating system examples

PC operating systems: Windows 10- lastest Windows client version Windows Server- UNIX- is a multiuser, multitasking operating system and is highly supportive of communications and networking. Often used om workstations and servers and provides the reliability and scalability for running large system on high end servers. Linux- open source software(which provides all computer users with free access to its program code, so they can modify the code to fix errors or to make improvements. Also no one own a open source. A global network of programmers and users manages and modifies the software, usually without being paid), low cost alternative to UNIX and Windows. OS X Mobile operating systems - • Chrome, Android, iOS Other examples of open source: Apache HTTP web server, Mozilla Firefox web browser, and he Apache Open Office desktop productivity suite.

Technology Services

People are needed to maintain and manage infrastructure components; legacy systems are older transaction processing systems created for older computers that continue to be used to avoid the high cost of replacing them

Green Computing aka Green IT

Practices and technologies for designing, making, using, and disposing of computer hardware to reduce environmental impact • World's data centers use as much energy as the output of 30 nuclear power plants which is about 1.5 percent of all energy use in the world. Key priority is reduction power consumption. • IT in U.S. consumes 2% of U.S. power demand and produces 2% of world' s greenhouse gases

Analytic Platforms

Preconfigured hardware-software systems • Designed for query processing and analytics • Use both relational and non-relational technology to analyze large data sets • Include in-memory systems, NoSQL DBMS • E.g. IBM Netezza -- Integrated database, server, storage components

Application Software and Desktop Productivity Tools

Programming languages for business-- applications includes C, C++, Visual Basic and Java C-- is a powerful and efficient language developed in the early 1970s that combines machine portability with tight control and efficient use of the computers resources. Primarily used by professional programmers to create operating systems and applications systems especially for PCs. C++ is a newer version of C plus additional feature for working with software objects. A software object combines data and procedures unlike traditional programs. Visual Basic- is a widely used visual programming tool and environment for creating applications that run on Microsoft Windows operating system. A visual programming language allows users to manipulate graphic or iconic elements to create programs. Java-is an operating system-independent, processor independent, object-oriented programming language created by Sun Microsystems that has become the LEADING interactive programming environment for the web. HAs migrated into mobile phones, smartphones, automobiles, music players, game machine and set-top cable television systems serving interactive content and pay-per-view services **Software packages and desktop productivity tools-- Word processing software Spreadsheet software- organizes data into columns and rows for analysis and manipulation. Data management software Presentation graphics Software suites--major desktop productivity tools are bundled together. Example Microsoft office Web browsers- used for displaying web pages and for accessing the web and other internet resources. Become the primary interfaced for accessing the Internet or for using networked systems based on Internet technology. Leading web browsers are Microsoft Internet Explorer, Mozilla Firefox, Apple Safari, Google Chrome

Public vs. private clouds

Public cloud- owned and maintain by a cloud servicer provider and made available to the general public or industry group. Leading examples are Google Drive, Dropbox, and Apple iCloud Utility computing or on-demand computing- purchase their computing services from remote providers and pay only for the amount of computing power they actually use or are billed on a monthly or annually subscription basis. Private cloud- operated solely for an organization. May be managed by the organization or a third party and hosted either internally or externally.

Web Services

Refers to a set of loosely coupled software components that exchange information with each other using universal web communication standards and languages. XML (Extensible Markup Language)-- Foundation of Web services. Developed as a more powerful and flexible markup language than HTML for web pages. Where as HTML is limited to describing how data should be presented in the form of a web page, XML can perform presentation, communication, and storage of data. Makes it possible for computers to manipulate and interpreted data automatically and perform operation without human intervention. Service oriented architecture (SOA) -- set of self-contained service that communicated with each other to create a working software application. used to build a firms software system. ex. slide 15 (CH.5 Day 3)

In-memory computing

Relies on computer's main memory (RAM) for data storage • Eliminates bottlenecks in retrieving and reading data from hard-disk based databases • Dramatically shortens query response times • Enabled by: High-speed processors, Multicore processing and Falling computer memory prices • Lowers processing costs ROM is permanent data RAM- random access memory higher ram computer will run faster ( Ram is faster than ROM)

Using Cloud Services

Small businesses "rent" infrastructure from provider to avoid expenses of maintaining hardware and software on their own. Hybrid cloud computing model

OLAP (online analytical processing)

Supports multidimensional data analysis, enabling users to view the same data in different ways using multiple dimensions -- • Each aspect of information—product, pricing, cost, region, or time period—represents a different dimension --• E.g., comparing sales in East in June versus May and July • Enables users to obtain online answers to ad hoc questions such as these in a fairly rapid amount of time

Data Mart

Subset of data warehouses, in which a summarized or highly focused portion of the organizations data is placed in a separate database for a specific population of users

Scalability

The ability of a computer, product, or system to expand to serve a larger number of users without breaking down. Organizations must make sure they have sufficient computer processing, storage, and networking resources to handle surging volumes of digital transactions and to make such data immediately available online.

Optical Disks

These disks use laser technology to store large quantities of data including sound and images in a highly contact form. CD-ROM (compact disc read only memory), CD-RW (CD-Rewriteable), DVD (Digital Video Discs), DVD-RW DVD hold the highest capacity, and are not favored tech. for storing video and large quantities of text, graphics and audio data.

Data Warehouses

Traditional tools for analyzing corporate data for the past two decades Database that stores current and historical data of potential interest to decision makers through the company. • Consolidates and standardizes data from many systems, operational and transactional databases • Data can be accessed but not altered

Nanotechnology

Uses individual atoms and molecules to create computer chips and other devices thousands for times smaller than current technologies Enhancing processing power

Quantum Computers

Uses principles of quantum physics to represent data and perform operations on the data Enables processing times that are millions of times faster than current high-speed processing Enhancing processing power

Outsourcing

Using external provider to: • Run networks • Host, manage Web site(s) • Develop software (offshore software outsourcing) • Manage IT infrastructures Web hosting service-maintains a large web server or a series of servers and provides fee-paying subscribers with space to maintain their website Offshore Software Outsourcing--when firms outsourcing software work outside their national borders • Requires Service Level Agreements (SLAs)-formal contract between customers and their service providers that defines the specific reasonability of the servicer provider and the level of service the customer expects.

Entities

generalized categories representing a person, place, or thing on which we store information

Byte

group of bits, repreents a single character, which can be aletter, number, another symbol.

Database

group of related files make up a ... are at the heart of all information systems because thy kept track of the people, places, and things that a business must deal with on a continuing, often instant basis.

Application Server software

handles all application operations between a user and an organization's back-end business systems

Computer Software

includes both system software and application software. System software- manages the resources and activities of the computer Application software- applies the computer to a specific task for an end user, such as processing an order or generating mailing list. Today most system and application software are purchased from outside vendors.

Grid Computing

involves connecting geographically remote computers into a single network and combining the computational power of all computers on the grid. Combined power of several PCs, networked together to solve complicated problems at supercomputer speeds at far lower costs E.g., Shell uses 1024 Linux servers create one of the largest commercial super computers to discover new oil

Secondary Storage Technology includes

magnetic disks, optical disc, magnetic tape, storage networks

Operating System Software

manages and controls the computers' activities computers systems chief manager, enabling the system to handle many tasks and users at the same time. Also allocates and assigns system resources, schedules the use of computer resources and computer jobs, and monitors computer system activities. use graphical user interface, often called GUI, which makes extensive use of icons, buttons, bars, and boxes to perform tasks. Multitouch technology- allows you to use one or more fingers to perform special gestures to manipulate lists or objects on a screen without using a mouse or keypad

Client/Server computing

one widely used form of distributed processing splits the processing between clients and servers client is the user point of enter for the required function and is normally a desktop or laptop computer. servers provide the client with the services. Store and process shared data and perform such functions as printer management and backup storage and network activates such as security, remote access, and user authentication. Foundation of World Wide Web

Data Management Technology

organizes, manages, and processes business data concerned with inventory, customers, and vendors.

Web server software

responsible for locating and managing stored web pages

Super Computer

specially designed and more sophisticated computer that is used fir tasks required extreme rapid and complex calculations with thousands of variables, million of measurements, and thousands of equations. traditionally have been used in engineering analysis of structures, scientific exploration and simulations, military work such as classified weapons research, weather forecasting.

Database Management System

specific type of software for creating, storing, organizing, and accessing data from a database Separates the logical and physical views of the data • Logical view: how end users view data Physical view: how data are actually structured and organized • Examples of DBMS: Microsoft Access (destop systems), DB2, Oracle Databa (for larger mainframes)

Databases and the Web

• Firms use the Web to make information from their internal databases available to customers and partners • Middleware and other software make this possible: • Web server • Application servers or CGI • Database server • Web interfaces provide familiarity to users and savings over redesigning legacy systems

Mobile Digital Platform

• Mobile devices, smartphone • Netbooks and tablet computers • E-book readers and wearable computing devices such as smartwatches and smart glasses, smart badges and activity trackers. Smartphones and tablet computers are becoming the primary of accessing the internet. These devices are increasing used for business computing as well as for consumer applications.

Cloud Databases

• Relational database engines provided by cloud computing services, such as Amazon • Pricing based on usage • Appeal to Web-focused businesses, small or medium-sized businesses seeking lower costs than developing and hosting in-house databases • E.g. Amazon Relational Database Service -Offers MySQL, Microsoft SQL Server, Oracle Database engines • Private clouds- in addition to public cloud, you now have the option to use databases in private clouds

Text Mining

• Unstructured data (mostly text files) accounts for 80 percent of an organization's useful information • Text mining allows businesses to extract key elements, discover patterns, and summarize large unstructured data sets • Sentiment Analysis -- Mines online text, comments, text in e-mails, etc. to measure customer sentiment


संबंधित स्टडी सेट्स

AAPC Chapter 19 A & B Questions & Answers

View Set

Chapter 10 Part 1: The Nervous system's Sensory System Study Question

View Set

Practice Exam - Fundamental Concepts

View Set

2.5 Translators And Facilities Of Languages

View Set

(Ch 1) Mastering Strategy: Art and Science

View Set