As a data scientist, you are working on a global health data set that has data from more than 50 countries. You want to encode three features, 'countries', 'race', and 'body organ', as categories. Which option would you use to encode these categorical features?
You have received machine learning model training code, without clear information about the
optimal shape to run the training. How would you proceed to identify the optimal compute shape
for your model training that provides a balanced cost and processing time?
Which Oracle Accelerated Data Science (ADS) classes can be used for easy access to data sets from
reference libraries and index websites such as scikit-learn?
You are a data scientist leveraging the Oracle Cloud Infrastructure (OCI) Language AI service for various types of text analyses. Which TWO capabilities can you utilize with this tool?
You want to use ADSTuner to tune the hyperparameters of a supported model you recently
trained. You have just started your search and want to reduce the computational cost as well as
assess the quality of the model class that you are using.
What is the most appropriate search space strategy to choose?
Six months ago, you created and deployed a model that predicts customer churn for a call
center. Initially, it was yielding quality predictions. However, over the last two months, users have been
questioning the credibility of the predictions.
Which two methods would you employ to verify the accuracy of the model?
You are a data scientist building a pipeline in the Oracle Cloud Infrastructure (OCI) Data Science
service for your machine learning project. You want to optimize the pipeline completion time by
running some steps in parallel. Which statement is true about running pipeline steps in parallel?