Business Analytics Ch. 10

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

In the context of text analytics, which of the following statements exemplifies sentiment polarity?

"I like the fragrance of the soap, but it makes my skin dry."

Which of the following are true of natural language processing (NLP)? (Check all that apply.)

It can help companies reduce potential conflicts and prioritize the level of urgency when responding to opportunities and concerns. It enables companies to analyze internal and external data sources to gain insights about the company, its offerings, customers, and market.

In the context of the text exploration step of text analytics, which of the following are true of a word cloud? (Check all that apply.)

It displays the occurrences of terms and uses the size of a term to represent its frequency. It represents similar information to frequency bar charts but represents frequency of terms in a visual format.

Which of the following are true of the text acquisition and aggregation step of text analytics? (Check all that apply.)

It involves combining and uploading all text data into a text analytics software. It involves understanding the business question to determine the text data that needs to be acquired.

In the context of critical concepts under text preprocessing, which of the following is true of N-grams?

It is often used in search engines to recommend the next character or word in real time when a user is typing a search query.

In the context of topic modeling, the ___ ___ ___ (LDA) method maximizes the separation between the estimated topics and minimizes the variance within each projected topic and identifies important words in the text and groups them into topics.

Latent Dirichlet Allocation

In the context of critical concepts under text preprocessing, ___ reduces the word to its lemma form while considering the context of the word, such as the part of speech and meaning.

Lemmatization

___ ___ is a simple technique that captures the set of co-occurring or continuous sequences of n-items from a large set of text.

N-Grams

___ ___ ___ (NLP) is a branch of artificial intelligence (AI) used to identify patterns by reading and understanding meaning from human language.

Natural language processing

In the context of the text modeling step of text analytics, ___ ___ is a measure of emotions, attitudes and beliefs.

Sentiment analysis

In the context of measuring the importance of a term in a document, match the measures (in the left column) with their descriptions (in the right column).

Term frequency (TF) It measures the number of times a term (or word) occurs in a document. Inverse document frequency (IDF) It measures the frequency of a term (or word) over all the documents.

In the context of building a Latent Dirichlet Allocation (LDA) structure under text modeling, what are the criteria for reassigning words to new topics? (Check all that apply.)

The extent to which a topic is found across all documents The proportion of words to a topic

Match the critical concepts under text preprocessing (in the left column) with their descriptions (in the right column).

Tokenization It is the process of taking the entire text data corpus and separating it into smaller, more manageable sections. Stemming It is the process of removing prefixes or suffixes of words, thus reducing words to a simple or root form. Stop words removal It involves removing words that are uninformative, such as "the" and "and". Bag of words It is a technique that counts the occurrence of words in a document while ignoring the order or the grammar of words. Text-document matrix It separates preprocessed text into rows and columns so that meaningful analyses of data can be conducted.

In the context of the text exploration stage of text analytics, a frequency bar chart is made up of the x-axis that represents terms and the y-axis that represents the frequency of particular terms occurring.

True

Identify the methods of text exploration in text analytics. (Check all that apply.)

Word clouds Frequency bar charts

In the context of text analytics, a(n) ___ is a collection of a large body of text that is ready to be preprocessed.

corpus

The purpose of sentiment analysis is to identify _____.

customers' thoughts about products, features, quality, services, and so on

A document with a high term frequency-inverse document frequency (TF-IDF) score implies that _____.

it has a relatively high frequency for words that are relatively rare overall

In the context of critical concepts under text preprocessing, reducing the word "curating" to "curate" is an example of _____.

lemmatization

In the context of text analytics, ___ ___ refers to detailed customer feedback that contain contradictory or different opinions.

sentiment polarity

In the text analytics process, the basic unit of analysis is a _____.

token

In the context of the text exploration step of text analytics, a(n) ___ ___ provides a high-level understanding of frequently used terms.

word cloud


Ensembles d'études connexes

CH 2.3 The Preterite vs. the Imperfect Estructuras

View Set

Chapter 8 Foreign Direct Investment

View Set

Legal Studies 131 Quiz and Past Exam Questions

View Set

DSM - 5 Categories of Mental Disorders

View Set

Current Events in East Asia Questions

View Set

Nursing Process/Diagnoses Practice Test (NCLEX style) 15 multiple choice

View Set