Top Google BigQuery frequently asked interview questions
In this post we will look at Google BigQuery Interview questions. Examples are provided with explanation.
- What is Google BigQuery?
- What are the Components of Google BigQuery?
- Explain Technical Architecture of Google BigQuery?
- What is Google Cloud?
- What are the components of the Google Cloud Platform?
- What are the products supported by GCP?
- What is GCP App Engine?
- What are Window Functions in BigQuery?
- What is BigQuery Cloud Data Warehouse?
- What are the ways for accessing the BigQuery Cloud Datawarehouse?
- How can we load data from Cloud Storage?
- How to delete duplicate rows from a BigQuery table?
- What are the Database services supported by GCP?
What is Google BigQuery?
Google Bigquery is used for Storing and Querying datasets which are time consuming and expensive.It is an Enterprise Data Warehouse which solves problems by enabling super fast SQL Queries using the processing power of google infrastructure.BigQuery is fully managed, we dont need for deploying any resources like Disks and Virtual Machines.As Bigquery has high level architecture and is built on top of Dremel Technology which has been in production on Google internally.What are the Components of Google BigQuery?
Google BigQuery consists of 12 components:- Serverless Service Model.
- Opinionated Storage Engine.
- Dremel Execution Engine & Standard SQL.
- Enterprise-grade Data Sharing.
- Public, Commercial, Marketing Datasets and the Free Pricing Tier.
- Streaming Ingest.
- Batch Ingest.
- Federated Query Engine.
- UX, CLI, SDK, ODBC/JDBC, API.
- Pay-Per-Query AND Flat Rate Pricing.
- IAM, Authentication & Audit Logs.
Explain Technical Architecture of Google BigQuery?
- Snowplow BigQuery Loader used in transforming data from Google Pub/Sub and loads it into BigQuery.
- Snowplow BigQuery Mutator used in reading and performing table mutations for adding new columns as per required.
- Snowplow BigQuery Repeater is an app used for reading failedInserts and tries in re-inserting them into BigQuery.
- Snowplow BigQuery Forwarder used in implementing repeater Apache Beam job.

What is Google Cloud?
Google CLoud Platform(GCP) contains a set of physical assets like computer, hard disks, virtual resources such as virtual machines that are contained in Google Data Centers.It helps in allowing software and hardware products for at scaling and co existing remotely in all data centers.It helps in accessing, managing and using tools through web interfaces.
What are the components of the Google Cloud Platform?
Components of GCP are as follows:Google Compute Engine
Google Cloud Container Engine
Google Cloud App Engine
Google Cloud Storage
Google Cloud Dataflow
Google BigQuery Service
Google Cloud Job Discovery
Google Cloud Endpoints
Google Cloud Test Lab
Google Cloud Machine Learning Engine
What the products supported by GCP?
- BigQuery used as Data Warehouse for insights.
- Cloud CDN helps in content delivery network in images, audios, videos, etc.
- Cloud Functions used in writing codes and handling infrastructure for running that.
- Cloud Run is a managed environment used in running Containerized Applications.
- Cloud SDKused in providing CLT and other libraries for Google CLoud.
- Cloud Storage helps in storing format that are durable and insecure.
- Compute Engine used in running Google's data centers and providing virtual machines.
- Data Flow helps in streaming analytics in batch processing and streams.
- Operators used in monitoring, logging and performing other services.
- Google Kubernetes Engine helps in managing environment for running containerized applications.