Python final (part two)

Lakukan tugas rumah & ujian kamu dengan baik sekarang menggunakan Quizwiz!

Need a target to work Supervised ML Data Mining All are correct Unsupervised ML

Supervised ML

Data can be summarized using visualizations to help others understandthe data. You need to focus on the data when you present . T or F ?

False

Heat map is used to visualize the correlation for a dataframe! True False

True

T/F: An example of numerical variable is weight or blood pressure.

True

T/F: Deductive reasoning uses facts, propositions, or other statements oftruth to arrive at a conclusion.

True

To iterate in a database, we need t cursor() method to be assigned after the connection (conn) for example: cur = conn.cursor() True False

True

Which type of hypervisor would most likely be used in a data center? Select one: Type 1 Type 2 Nexus Hadoop

Type 1

Learn from the data itself Supervised ML Data Mining Ai Unsupervised ML

Unsupervised ML

Matplotlib is used in python to get a data frame. True False

False

To change the datatype of a column called ping in a data frame called df_compact_clean, you can use a lambda function in this way df_compact_clean['Ping (ms)_float'] = df_compact_clean['Ping (ms)'].apply(lambda val: float(val)) True false

False

To find the correlation for a data file imported in pandas as BB, you can use a method like : BB.corr() brainFrame.corr() BB.describe() brainFrame.corr(method='pearson')

BB.corr()

In this way, we can drop the row from a dataframe named dfp dfp.drop(45) T or F ?

False it should be like this: dfp.drop(45,inplace=True)

A researcher does not use a temperature sensor correctly. What kind of error will be? Gross Error Random Error Noise Error Systematic Error

Gross Error

Which file system does Hadoop use? Select one: FAT32 HDFS NTFS SAS

HDFS

What is the purpose of this code? !apt-get update lapt-get -y install sqlite3 update and install database Install sqlite3 and -y is needed in ubuntu only. Install sqlite3 and -y is an option means in your machine. Install sqlite3 and -y is for yes during the installation process.

Install sqlite3 and -y is for yes during the installation process.

An example of categorial variable is: 0 or 1 weight gender number of visits

gender

Which solution improves web response time by deploying multiple web servers and DNS servers? Select one: sharding memcaching load balancing distributed databases

load balancing

Given the following nested list, use indexing to grab the word "Hi Big Data learner". lst = ['a','b',[4,10,'Hi Big Data learner'],['c',[1,66,['this']],2,111],'e',7]

lst[2] [2] lst[2][2]

Which architecture is used by HDFS? Select one: peer-to-peer master-slave client/server stand alone

master-slave

Match the cloud computing service to the description. (Not all options are used.) virtual desktop environment network equipment development tools IaaS PaaS SaaS ITaaS

network equipment → IaaS, development tools → PaaS, virtual desktop environment → SaaS

Which type of machine learning algorithm uses data sets verified by experts as its learning basis? Select one: association supervised routing clustering

supervised

Which task is performed within the data analysis step of the Data Analysis Lifecycle? Select one: testing the model on data to ensure reliability incorporating the findings into corporate strategy selecting a tool for data analysis locating and gathering data

testing the model on data to ensure reliability

Which of the following is unstructured data ? text .xls .csv .db

text

write a line to : From jupyter, install the csvkit (hint, use pip)

!pip install csvkit

Install a module call seaborn from jupyter

!pip install seaborn

True or False?A drawback to Plotly is that files cannot be exported. Select one: true false

FALSE

Which of the following is structured data ? .csv photo .pdf .docx

.cvs

Which of the following is structured data? .xls white paper .cxs web page

.xls

How many default subplot will be generated from this code fig, ax = plt.subplots(1, 2, figsize=(10, 10)) 1 3 2 4

2

Which method would you use to view table statistics? for a data frame called Data? Data.describe() Data.head() Data.dtype All can be used

Data.describe()

process for collecting data from a variety of sources, transforming the data, and then loading the data into a database ? import data using pandas Gather Data deal with data using sqlite3 ETL

ETL

df_compact.to_csv('rpi_data_processed.csv', index=False) in the given code, it is necessary to make the index false. Isn't it? No, it should be True. It is to make the extension .csv We should make it True to not introduce new column. Yes, so that you do not introduce an additional column.

Yes, so that you do not introduce an additional column.

# Modify this code floor_types = ['Parking', 'Shops', 'Food Court', 'Offices'] floor_numbers = # Fill in the blank # put your code here zipped = list(zip(floor_types ,floor_numbers )) print(zipped) # To get this output [('Parking', 1), ('Shops', 2), ('Food Court', 3), ('Offices', 4)]

[1, 2, 3, 4] range(1,5)

When a number of items are grouped together, which type of machine learning algorithm can determine which items in the group predict the presence of other items? Select one: classification regression clustering association

association

Write a command to view the first 5 lines of the dataframe imported by pandas as brain.

brain.head(5) OR brain.head()

fill the command to execute a query conn = sqlite3.connect('InternetSpeed.db') cur = conn.cursor() query = 'SELECT DateTime FROM LA_wifi_speed_UK ORDER BY DateTime LIMIT 10;' # put your code here

cur.execute(query)

Continue the code bellow # Create the connection to the database #conn = ... <-- do like this conn = sqlite3.connect('/content/InternetSpeed.db') # Create a cursor #curr = ...

curr = conn.cursor()

Data at Rest is Data cannot be updated data not used data can be analyze data can be stored somewhere

data can be stored somewhere

This code can be used to find how many values in dataframe are NaN? df_compact.isnull().sum().sum() Which of the following can also be used to find the number of NaN in the dataframe? df_compact.info() df_compact.head() df_compact.describe() all the choices

df_compact.info()

Which of the following is unstructured data ? .csv .db all the above email

email

Which of the following is considered traditional data storage. mysql sql python excel

excel

Today, data is growing __________________ exponentially. linearly randomly rapidly

exponentially.

What is an example of data in the DIKW Pyramid model?​ Select one: data structured in a format that can be analyzed the application of knowledge information that is understood and can be used immediately facts gathered through observation

facts gathered through observation

choose the correct answer. class Python: population - 1 victims = 0 def __init__(self): self.length_ft = 3 self._venomous = False length and venomous are class variable population and __venomus are class variables population and victims are class variables population and victims are instance variables

population and victims are class variables

A large online ordering company collects the sales history of its customers. Using this historical data, the online company sends customized sales advertisements to a specific group of customers. Which data analytics method is being used? Select one: indicative predictive descriptive prescriptive

prescriptive


Set pelajaran terkait

Business Ownership and Business Test

View Set

Texas Principles of Real Estate 1 - Chapter 4

View Set