Overview


The Google Cloud connector integrates Google Cloud Storage (GCS) buckets into PipesHub’s platform. It ingests cloud-native file content (datasets, documents, logs, backups, and media) and makes it available to downstream AI workflows such as summarization, monitoring, and automated data retrieval.



How it works


PipesHub uses service account authentication to access GCS buckets and read file metadata and content. Each file is scanned, parsed, and indexed with its associated metadata (filename, MIME type, creation date). The connector parses text, spreadsheet, image, and audio formats stored across buckets, so AI agents can draw on this data as context.
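To make the metadata step concrete, here is a minimal sketch in Python using the public google-cloud-storage client. It is not PipesHub's implementation: the `IndexRecord` shape and the function names are hypothetical, chosen only to mirror the three metadata fields named above.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Iterator, Optional


@dataclass
class IndexRecord:
    """Hypothetical record shape; PipesHub's real index schema may differ."""
    filename: str
    mime_type: Optional[str]
    created: Optional[datetime]


def to_index_record(blob) -> IndexRecord:
    # Accepts any object exposing name, content_type and time_created,
    # which google-cloud-storage Blob objects do.
    return IndexRecord(blob.name, blob.content_type, blob.time_created)


def iter_records(key_path: str, bucket_name: str, prefix: str = "") -> Iterator[IndexRecord]:
    # Imported lazily so the helpers above work without the library installed.
    from google.cloud import storage

    # Authenticate with a service account key file, then list objects.
    client = storage.Client.from_service_account_json(key_path)
    for blob in client.list_blobs(bucket_name, prefix=prefix):
        yield to_index_record(blob)
```

Calling `iter_records("key.json", "my-bucket", prefix="reports/")` would yield one record per object under that prefix; the key path and bucket name here are placeholders.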

Permissions are enforced using GCP’s IAM policies and bucket-level access controls.
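Because access depends on IAM, it can help to confirm up front that the service account holds the permissions the connector needs. The sketch below uses the google-cloud-storage `Bucket.test_iam_permissions` call for that purpose. Whether PipesHub runs such a check itself is not stated here, and the helper names are hypothetical.

```python
from typing import Iterable, List

# The two permissions the connector setup asks for.
REQUIRED_PERMISSIONS = ["storage.objects.list", "storage.objects.get"]


def missing_permissions(
    granted: Iterable[str],
    required: Iterable[str] = REQUIRED_PERMISSIONS,
) -> List[str]:
    # Return the required permissions that were not granted, in order.
    granted_set = set(granted)
    return [p for p in required if p not in granted_set]


def check_bucket_access(key_path: str, bucket_name: str) -> List[str]:
    # Imported lazily so missing_permissions works without the library.
    from google.cloud import storage

    client = storage.Client.from_service_account_json(key_path)
    bucket = client.bucket(bucket_name)
    # test_iam_permissions returns the subset the caller actually holds.
    granted = bucket.test_iam_permissions(REQUIRED_PERMISSIONS)
    return missing_permissions(granted)
```

An empty list from `check_bucket_access` suggests the service account can list and read objects in that bucket.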



Configure
  • Enable the Google Cloud Storage API in your Google Cloud project

  • Create a service account and grant it a role that includes the storage.objects.list and storage.objects.get permissions (for example, Storage Object Viewer)

  • Upload the service account key to PipesHub

  • Specify the GCS bucket names or path filters

  • Define sync frequency, file-type filters, and permission boundaries
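The settings above can be pictured as a small configuration object plus two checks. This is an illustrative sketch only: `GcsConnectorConfig`, its field names, and both helper functions are assumptions, not PipesHub's actual configuration schema. The key fields checked (`type`, `project_id`, `client_email`, `private_key`) are ones that appear in a standard GCP service account JSON key.

```python
import json
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class GcsConnectorConfig:
    """Hypothetical settings mirroring the configuration steps."""
    buckets: List[str]
    path_prefixes: List[str] = field(default_factory=list)          # empty = all paths
    allowed_mime_prefixes: List[str] = field(default_factory=list)  # empty = all types
    sync_interval_minutes: int = 60


def validate_service_account_key(raw_json: str) -> List[str]:
    """Return the names of expected key fields that are missing or wrong."""
    key = json.loads(raw_json)
    problems = []
    if key.get("type") != "service_account":
        problems.append("type")
    for name in ("project_id", "client_email", "private_key"):
        if not key.get(name):
            problems.append(name)
    return problems


def should_ingest(cfg: GcsConnectorConfig, object_name: str, mime_type: Optional[str]) -> bool:
    # Path filter: the object must sit under one of the configured prefixes.
    if cfg.path_prefixes and not any(object_name.startswith(p) for p in cfg.path_prefixes):
        return False
    # File-type filter: the MIME type must match one of the allowed prefixes.
    if cfg.allowed_mime_prefixes:
        return mime_type is not None and any(
            mime_type.startswith(m) for m in cfg.allowed_mime_prefixes
        )
    return True
```

For example, a config with `path_prefixes=["reports/"]` and `allowed_mime_prefixes=["text/"]` would accept `reports/q1.txt` as `text/plain` and skip an audio file under `media/`.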

Google Cloud

Ingest and search data from GCP object storage, and unify structured and unstructured content in AI pipelines.
