Winter Sale Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: v4s65

Databricks-Certified-Professional-Data-Scientist Exam Dumps - Databricks Certified Professional Data Scientist Exam

Go to page:
Question # 9

In which lifecycle stage are appropriate analytical techniques determined?

A.

Model planning

B.

Model building

C.

Data preparation

D.

Discovery

Full Access
Question # 10

Consider the following confusion matrix for a data set with 600 out of 11,100 instances positive:

In this case, Precision = 50%, Recall = 83%, Specificity = 95%, and Accuracy = 95%.

Select the correct statement

A.

Precision is low, which means the classifier is predicting positives best

B.

Precision is low, which means the classifier is predicting positives poorly

C.

problem domain has a major impact on the measures that should be used to evaluate a classifier within it

D.

1 and 3

E.

2 and 3

Full Access
Question # 11

Consider flipping a coin for which the probability of heads is p, where p is unknown, and our goa is to estimate p. The obvious approach is to count how many times the coin came up heads and divide by the total number of coin flips. If we flip the coin 1000 times and it comes up heads 367 times, it is very reasonable to estimate p as approximately 0.367. However, suppose we flip the coin only twice and we get heads both times. Is it reasonable to estimate p as 1.0? Intuitively, given that we only flipped the coin twice, it seems a bit

rash to conclude that the coin will always come up heads, and____________is a way of avoiding such rash

conclusions.

A.

Naive Bayes

B.

Laplace Smoothing

C.

Logistic Regression

D.

Linear Regression

Full Access
Question # 12

Refer to the Exhibit.

In the Exhibit, the table shows the values for the input Boolean attributes "A", "B", and "C". It also shows the values for the output attribute "class". Which decision tree is valid for the data?

A.

Tree A

B.

Tree B

C.

Tree C

D.

Tree D

Full Access
Question # 13

Which of the following metrics are useful in measuring the accuracy and quality of a recommender system?

A.

Cluster Density

B.

Support Vector Count

C.

Mean Absolute Error

D.

Sum of Absolute Errors

Full Access
Question # 14

The figure below shows a plot of the data of a data matrix M that is 1000 x 2. Which line represents the first principal component?

A.

yellow

B.

blue

C.

Neither

Full Access
Question # 15

Digit recognition, is an example of.....

A.

Classification

B.

Clustering

C.

Unsupervised learning

D.

None of the above

Full Access
Question # 16

Find out the classifier which assumes independence among all its features?

A.

Neural networks

B.

Linear Regression

C.

Naive Bayes

D.

Random forests

Full Access
Go to page: