1 - Defining Data Science and What Data Scientists Do (1, IBM DS)
data science is a combination of what 3 domains? (comp scie, math, busin)
- computer science - mathematics - business expertise
According to professor Haider, the three important qualities to possess in order to succeed as a data scientist are what? (cur, judgment, stor tell)
- curious - judgmental - story telling; argumentative
data science can help companies perform what 3 things? (under env, anal exist issu, reveal prev hidd opp)
- understand their environments - analyze existing issues - reveal previously hidden opportunities
Companies are searching for well-rounded individuals who possess the subject matter expertise, some experience in software programming and analytics, and exceptional communication skills
true
Contemporary data scientists come from different backgrounds such as engineering, mathematics, and even psychology. The secret skill is passion for continuous learning of new tools and patience to clean and analyze data.
true
average base salary for a 'data scientist' is $112k according to New York Times
true
data scientist is someone who finds solutions to problems by analyzing Big or small data using appropriate tools and then tells stories to communicate her findings to the relevant stakeholders
true
process of data science; after 'clarify the question the organization wants answered', what is the next step? (what data do we need to solv the proble)
what data do we need to solve the problem; and where will that data come from
process of data science; what is the first question and most crucial step? (clar the ques that the org want answe)
clarify the question the organization wants answered
Many ____________ (algo) are used to bring out insights from data
algorithms
finite sequence of instructions (algor)
algorithms
'the cloud' allows you to bypass the ___________ (phy) limitations of your personal computer and the systems you are using
bypass the physical limitations of your personal computer
Facts / statistics collected together for analysis (data)
data
_____________ (dat sci) is the field of exploring, manipulating, and analyzing data, and using data to answer questions / make recommendations
data science
Using complicated machine learning algorithms does not always guarantee achieving a better performance. Occasionally, a simple algorithm such as _________ (k-near neigh) can yield a satisfactory performance comparable to the one achieved using a complicated algorithm
k-nearest neighbor; simple algorithm
The typical work day for a Data Scientist varies depending on what type of _________ (proj) they are working on
project
measures the impact of a set of independent variables on a dependent variable (reg anal)
regression analysis
log files, email, social media, sales data, patient information files, sports performance data are just a few examples of what? (sour of dat)
sources of data
via data science, your rise to prominence is your ability with what? (sto-tel)
storytelling
Accessing algorithms, tools, and data through the _________ (clou) enables Data Scientists to stay up-to-date and collaborate easily
the Cloud