Python final (part two)
Need a target to work Supervised ML Data Mining All are correct Unsupervised ML
Supervised ML
Data can be summarized using visualizations to help others understand the data. You need to focus on the data when you present. T or F?
False
A heat map is used to visualize the correlation for a dataframe. True False
True
T/F: An example of numerical variable is weight or blood pressure.
True
T/F: Deductive reasoning uses facts, propositions, or other statements of truth to arrive at a conclusion.
True
To iterate over a database, we need the cursor() method to be assigned after the connection (conn), for example: cur = conn.cursor() True False
True
Which type of hypervisor would most likely be used in a data center? Select one: Type 1 Type 2 Nexus Hadoop
Type 1
Learn from the data itself Supervised ML Data Mining Ai Unsupervised ML
Unsupervised ML
Matplotlib is used in python to get a data frame. True False
False
To change the datatype of a column called ping in a data frame called df_compact_clean, you can use a lambda function in this way df_compact_clean['Ping (ms)_float'] = df_compact_clean['Ping (ms)'].apply(lambda val: float(val)) True false
False
To find the correlation for a data file imported in pandas as BB, you can use a method like : BB.corr() brainFrame.corr() BB.describe() brainFrame.corr(method='pearson')
BB.corr()
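In this way, we can drop a row from a dataframe named dfp: dfp.drop(45) T or F?
False. dfp.drop(45) returns a new dataframe and leaves dfp unchanged. To change dfp itself, use dfp.drop(45, inplace=True) or reassign with dfp = dfp.drop(45).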
A researcher does not use a temperature sensor correctly. What kind of error is this? Gross Error Random Error Noise Error Systematic Error
Gross Error
Which file system does Hadoop use? Select one: FAT32 HDFS NTFS SAS
HDFS
What is the purpose of this code? !apt-get update !apt-get -y install sqlite3 Update and install database Install sqlite3 and -y is needed in Ubuntu only. Install sqlite3 and -y is an option means in your machine. Install sqlite3 and -y is for yes during the installation process.
Install sqlite3 and -y is for yes during the installation process.
An example of a categorical variable is: 0 or 1 weight gender number of visits
gender
Which solution improves web response time by deploying multiple web servers and DNS servers? Select one: sharding memcaching load balancing distributed databases
load balancing
Given the following nested list, use indexing to grab the word "Hi Big Data learner". lst = ['a','b',[4,10,'Hi Big Data learner'],['c',[1,66,['this']],2,111],'e',7]
lst[2][2]
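The chained index in the answer can be checked step by step. This is a minimal sketch using the list from the question; the variable names `inner`, `target`, and `deep` are only illustrative.

```python
lst = ['a', 'b', [4, 10, 'Hi Big Data learner'], ['c', [1, 66, ['this']], 2, 111], 'e', 7]

# Step 1: index 2 of the outer list is itself a list.
inner = lst[2]

# Step 2: index 2 of that inner list is the string we want.
target = inner[2]

# Both steps can be chained into a single expression.
chained = lst[2][2]
print(chained)

# The same idea reaches deeper levels, e.g. the word 'this'.
deep = lst[3][1][2][0]
print(deep)
```

Each pair of brackets moves one level deeper into the nesting, so the number of brackets matches the depth of the item.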
Which architecture is used by HDFS? Select one: peer-to-peer master-slave client/server stand alone
master-slave
Match the cloud computing service to the description. (Not all options are used.) virtual desktop environment network equipment development tools IaaS PaaS SaaS ITaaS
network equipment → IaaS, development tools → PaaS, virtual desktop environment → SaaS
Which type of machine learning algorithm uses data sets verified by experts as its learning basis? Select one: association supervised routing clustering
supervised
Which task is performed within the data analysis step of the Data Analysis Lifecycle? Select one: testing the model on data to ensure reliability incorporating the findings into corporate strategy selecting a tool for data analysis locating and gathering data
testing the model on data to ensure reliability
Which of the following is unstructured data? text .xls .csv .db
text
Write a line to install csvkit from Jupyter (hint: use pip).
!pip install csvkit
Install a module called seaborn from Jupyter.
!pip install seaborn
True or False? A drawback to Plotly is that files cannot be exported. Select one: true false
False
Which of the following is structured data? .csv photo .pdf .docx
.csv
Which of the following is structured data? .xls white paper .cxs web page
.xls
How many subplots will be generated by default from this code? fig, ax = plt.subplots(1, 2, figsize=(10, 10)) 1 3 2 4
2
Which method would you use to view table statistics for a data frame called Data? Data.describe() Data.head() Data.dtype All can be used
Data.describe()
Which process collects data from a variety of sources, transforms the data, and then loads the data into a database? import data using pandas Gather Data deal with data using sqlite3 ETL
ETL
df_compact.to_csv('rpi_data_processed.csv', index=False) In the given code, is it necessary to set index to False? No, it should be True. It is to make the extension .csv We should make it True to not introduce new column. Yes, so that you do not introduce an additional column.
Yes, so that you do not introduce an additional column.
# Modify this code
floor_types = ['Parking', 'Shops', 'Food Court', 'Offices']
floor_numbers = # Fill in the blank
# put your code here
zipped = list(zip(floor_types, floor_numbers))
print(zipped)
# To get this output
[('Parking', 1), ('Shops', 2), ('Food Court', 3), ('Offices', 4)]
[1, 2, 3, 4] OR range(1, 5)
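Both accepted answers can be checked with a short sketch. The names `zipped_from_range` and `zipped_from_list` are illustrative and not part of the original exercise.

```python
floor_types = ['Parking', 'Shops', 'Food Court', 'Offices']

# Option 1: range(1, 5) yields 1, 2, 3, 4 (the stop value 5 is excluded).
zipped_from_range = list(zip(floor_types, range(1, 5)))

# Option 2: an explicit list of the same numbers.
zipped_from_list = list(zip(floor_types, [1, 2, 3, 4]))

print(zipped_from_range)
print(zipped_from_range == zipped_from_list)
```

zip pairs items by position and stops at the shorter input, so either form works as long as it supplies four numbers.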
When a number of items are grouped together, which type of machine learning algorithm can determine which items in the group predict the presence of other items? Select one: classification regression clustering association
association
Write a command to view the first 5 lines of the dataframe imported by pandas as brain.
brain.head(5) OR brain.head()
Fill in the command to execute the query.
conn = sqlite3.connect('InternetSpeed.db')
cur = conn.cursor()
query = 'SELECT DateTime FROM LA_wifi_speed_UK ORDER BY DateTime LIMIT 10;'
# put your code here
cur.execute(query)
Continue the code below.
# Create the connection to the database
#conn = ... <-- do like this
conn = sqlite3.connect('/content/InternetSpeed.db')
# Create a cursor
#curr = ...
curr = conn.cursor()
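The connection, cursor, and execute steps from the two questions above can be sketched end to end. This example assumes an in-memory database instead of InternetSpeed.db, and the `Ping` column and the three sample rows are hypothetical values made up for illustration.

```python
import sqlite3

# Assumption: an in-memory database stands in for InternetSpeed.db.
conn = sqlite3.connect(':memory:')

# The cursor is created from the connection and runs the SQL statements.
cur = conn.cursor()

# Hypothetical table contents, only so the query has something to return.
cur.execute('CREATE TABLE LA_wifi_speed_UK (DateTime TEXT, Ping REAL)')
cur.executemany(
    'INSERT INTO LA_wifi_speed_UK VALUES (?, ?)',
    [
        ('2016-11-24 13:37:25', 19.1),
        ('2016-11-24 13:36:25', 26.9),
        ('2016-11-24 13:38:25', 22.4),
    ],
)
conn.commit()

# The query from the question, executed through the cursor.
query = 'SELECT DateTime FROM LA_wifi_speed_UK ORDER BY DateTime LIMIT 10;'
cur.execute(query)

# fetchall() returns a list of tuples, one tuple per row.
rows = cur.fetchall()
for row in rows:
    print(row[0])

conn.close()
```

The cursor can also be iterated directly after execute(), which avoids loading every row at once on large tables.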
Data at rest is: data that cannot be updated data that is not used data that can be analyzed data that can be stored somewhere
data that can be stored somewhere
This code can be used to find how many values in dataframe are NaN? df_compact.isnull().sum().sum() Which of the following can also be used to find the number of NaN in the dataframe? df_compact.info() df_compact.head() df_compact.describe() all the choices
df_compact.info()
Which of the following is unstructured data? .csv .db all the above email
email
Which of the following is considered traditional data storage? mysql sql python excel
excel
Today, data is growing __________________ exponentially. linearly randomly rapidly
exponentially.
What is an example of data in the DIKW Pyramid model? Select one: data structured in a format that can be analyzed the application of knowledge information that is understood and can be used immediately facts gathered through observation
facts gathered through observation
Choose the correct answer.
class Python:
    population = 1
    victims = 0
    def __init__(self):
        self.length_ft = 3
        self._venomous = False
length and venomous are class variables population and _venomous are class variables population and victims are class variables population and victims are instance variables
population and victims are class variables
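The difference between the two kinds of variable can be seen by creating two objects. This is a minimal sketch that assumes `population` is assigned the value 1 in the class body; the object names `a` and `b` are illustrative.

```python
class Python:
    # Class variables: defined in the class body, shared by all instances.
    population = 1
    victims = 0

    def __init__(self):
        # Instance variables: set on self, each object has its own copy.
        self.length_ft = 3
        self._venomous = False


a = Python()
b = Python()

# Changing a class variable on the class is visible through every instance.
Python.population = 2
print(a.population, b.population)

# Changing an instance variable affects only that one object.
a.length_ft = 5
print(a.length_ft, b.length_ft)

# vars() shows where each name is stored.
print('population' in vars(Python), 'length_ft' in vars(a))
```

Names assigned directly in the class body live on the class, while names assigned through self inside __init__ live on each object.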
A large online ordering company collects the sales history of its customers. Using this historical data, the online company sends customized sales advertisements to a specific group of customers. Which data analytics method is being used? Select one: indicative predictive descriptive prescriptive
prescriptive