Professional-Data-Engineer Exam Dumps - Google Professional Data Engineer Exam

Looking for a practical way to pass the Google Professional-Data-Engineer exam? You’re in the right place. ExamCert offers realistic, trusted exam prep tools to help you earn the credential. ExamCert’s Professional-Data-Engineer PDF Study Guide, Testing Engine and Exam Dumps provide relevant, up-to-date study material in an easy-to-learn question-and-answer format. The study tools aim to simplify the exam’s complex and confusing concepts, and the testing engine lets you practice under conditions that resemble the real exam.

Question # 41

Flowlogistic wants to use Google BigQuery as their primary analysis system, but they still have Apache Hadoop and Spark workloads that they cannot move to BigQuery. Flowlogistic does not know how to store the data that is common to both workloads. What should they do?

Store the common data in BigQuery as partitioned tables.

Store the common data in BigQuery and expose authorized views.

Store the common data encoded as Avro in Google Cloud Storage.

Store the common data in the HDFS storage for a Google Cloud Dataproc cluster.

Question # 42

Flowlogistic is rolling out their real-time inventory tracking system. The tracking devices will all send package-tracking messages, which will now go to a single Google Cloud Pub/Sub topic instead of the Apache Kafka cluster. A subscriber application will then process the messages for real-time reporting and store them in Google BigQuery for historical analysis. You want to ensure the package data can be analyzed over time.

Which approach should you take?

Attach the timestamp on each message in the Cloud Pub/Sub subscriber application as they are received.

Attach the timestamp and Package ID on the outbound message from each publisher device as they are sent to Cloud Pub/Sub.

Use the NOW() function in BigQuery to record the event's time.

Use the automatically generated timestamp from Cloud Pub/Sub to order the data.
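
For reference, the option that stamps messages on the publisher device could look roughly like the sketch below. It only builds the message body and attributes using the Python standard library; the function name build_tracking_message and the field names package_id and event_time are hypothetical, and the actual publish call to Cloud Pub/Sub is deliberately left out.

```python
import json
from datetime import datetime, timezone
from typing import Dict, Tuple


def build_tracking_message(package_id: str, event_time: datetime) -> Tuple[bytes, Dict[str, str]]:
    """Build (data, attributes) for a tracking message.

    Hypothetical helper: the field names are assumptions, not part of
    any Flowlogistic or Cloud Pub/Sub specification.
    """
    if event_time.tzinfo is None:
        # Naive datetimes are ambiguous across globally distributed devices.
        raise ValueError("event_time must be timezone-aware")

    # Normalize to UTC so events from different regions are comparable.
    timestamp = event_time.astimezone(timezone.utc).isoformat()

    payload = {"package_id": package_id, "event_time": timestamp}
    data = json.dumps(payload, sort_keys=True).encode("utf-8")

    # Attributes are string key/value pairs that travel with the message.
    attributes = {"package_id": package_id, "event_time": timestamp}
    return data, attributes


if __name__ == "__main__":
    data, attributes = build_tracking_message(
        "PKG-1", datetime(2024, 1, 2, 3, 4, 5, tzinfo=timezone.utc)
    )
    print(data, attributes)
```

A timestamp recorded at the device reflects when the event happened, whereas a timestamp added by the subscriber or by the messaging service reflects when the message was received or published.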

Question # 43

Which of these rules apply when you add preemptible workers to a Dataproc cluster (select 2 answers)?

Preemptible workers cannot use persistent disk.

Preemptible workers cannot store data.

If a preemptible worker is reclaimed, then a replacement worker must be added manually.

A Dataproc cluster cannot have only preemptible workers.

Question # 44

Cloud Dataproc charges you only for what you really use with _____ billing.

month-by-month

minute-by-minute

week-by-week

hour-by-hour

Question # 45

How can you get a neural network to learn about relationships between categories in a categorical feature?

Create a multi-hot column

Create a one-hot column

Create a hash bucket

Create an embedding column
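
The four encodings named in the options can be compared with a small standard-library sketch. The vocabulary, bucket count, and embedding dimension are made-up values for illustration, and the embedding table is filled with random numbers as a stand-in for weights that a real model would learn during training.

```python
import hashlib
import random
from typing import Dict, Iterable, List

# Hypothetical vocabulary for a categorical feature.
VOCAB = ["red", "green", "blue"]


def one_hot(value: str) -> List[float]:
    """Exactly one position is 1.0; all categories are equally distant."""
    return [1.0 if v == value else 0.0 for v in VOCAB]


def multi_hot(values: Iterable[str]) -> List[float]:
    """Several positions may be 1.0 when an example has multiple categories."""
    present = set(values)
    return [1.0 if v in present else 0.0 for v in VOCAB]


def hash_bucket(value: str, num_buckets: int = 4) -> int:
    """Map a category to a bucket ID without keeping a vocabulary."""
    digest = hashlib.md5(value.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_buckets


def make_embedding_table(dim: int = 2, seed: int = 0) -> Dict[str, List[float]]:
    """Dense vector per category; random here, trainable in a real model."""
    rng = random.Random(seed)
    return {v: [rng.uniform(-1.0, 1.0) for _ in range(dim)] for v in VOCAB}


def embed(value: str, table: Dict[str, List[float]]) -> List[float]:
    """Look up the dense vector for a category."""
    return table[value]


if __name__ == "__main__":
    table = make_embedding_table()
    print(one_hot("green"), multi_hot(["red", "blue"]), hash_bucket("red"), embed("red", table))
```

One-hot vectors for different categories are always orthogonal, so they carry no notion of similarity. Embedding vectors are dense and adjusted during training, so the distance between two categories' vectors can come to reflect how related they are.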

Question # 46

Suppose you have a table that includes a nested column called "city" inside a column called "person", but when you try to submit the following query in BigQuery, it gives you an error.

SELECT person FROM `project1.example.table1` WHERE city = "London"

How would you correct the error?

Add ", UNNEST(person)" before the WHERE clause.

Change "person" to "person.city".

Change "person" to "city.person".

Add ", UNNEST(city)" before the WHERE clause.

Question # 47

Flowlogistic's management has determined that the current Apache Kafka servers cannot handle the data volume for their real-time inventory tracking system. You need to build a new system on Google Cloud Platform (GCP) that will feed the proprietary tracking software. The system must be able to ingest data from a variety of global sources, process and query in real-time, and store the data reliably. Which combination of GCP products should you choose?

Cloud Pub/Sub, Cloud Dataflow, and Cloud Storage

Cloud Pub/Sub, Cloud Dataflow, and Local SSD

Cloud Pub/Sub, Cloud SQL, and Cloud Storage

Cloud Load Balancing, Cloud Dataflow, and Cloud Storage

Question # 48

When you store data in Cloud Bigtable, what is the recommended minimum amount of stored data?

500 TB

1 GB

1 TB

500 GB
