Data Processing Pipeline for Google SecOps: Filtering Events

Forum|Forum|4 months ago
February 23, 2026
4 replies
1943 views

+1

uts
Bronze 1

We recently launched data processing pipelines (in Public Preview) for Google Security Operations, which provides security teams with pre-parsing control for their log ingestion. This new capability gives you granular control over your logs before they are shipped to Google SecOps for ingestion, allowing you to:

Manage Cost & Reduce Noise – Automatically drop high-volume, low-value events so you only pay for the ingestion of events that are relevant to your security operations capabilities.
Redact Sensitive Data – Mask or hash PII and confidential data like sensitive document names in Google Workspace logs.
Accelerate Detection – Send cleaner, higher-fidelity data to Google SecOps, making it easier to find the signal in the noise.

In this blog post, I’ll demonstrate how to filter and redact data from your logs leveraging data pipeline processing with Bindplane and Google SecOps. This feature is available to Enterprise and Enterprise+ customers.

In my Bindplane management console, I’ve configured an agent configuration to collect Windows event logs, Sysmon, and PowerShell logs from my Windows hosts and ship them to Google SecOps for ingestion.

In the new SecOps Pipelines page within Bindplane, I’ve configured a new pipeline with a processor node, which is highlighted in the diagram below. A processor node can contain one or more processors that are responsible for carrying out actions such as transforming, filtering, or redacting data in your log events before they’re shipped to Google SecOps for ingestion. A list of the available processors can be found in our documentation.

Data pipeline processing configuration can also be viewed in Google SecOps by navigating to SIEM Settings - Data Processing.

Running the query below in Google SecOps validates that I’m ingesting Windows event logs.

I can review the configuration of my processors in the Bindplane management console by clicking on the processor node. I have two processors configured. One processor uses a regular expression to extract the Windows hostname from my Windows event logs. The second processor adds the Windows hostname as an ingestion label field before the events are sent to Google SecOps for ingestion. It’s important to note that processors are executed in the order that they’re listed.

For today’s example, let’s say that I want to filter process exit events (event ID 4689) that are logged by my Windows hosts (i.e. I want my Windows hosts to still log these events locally, but I don’t want to ingest them into Google SecOps). I click “add processor” and choose the “Filter by Regex” processor. I give the new processor a short description and leave the action set to “exclude” so that logs that match my regular expression are removed.

Bindplane uses Go’s regex engine. I’m using the regular expression, “<EventID>4689</EventID>”.

Bindplane has a neat feature where you can see a sampling of logs on the left and the same logs on the right after the processors have been executed. Clicking “Apply” executes my new processor against the sample logs. I can see that 4 events were filtered using my regular expression.

After clicking “Done” and “Save”, I can see that a “rollout” of my pipeline configuration is pending.

After clicking “Start Rollout”, the new pipeline configuration is deployed and Windows events with the event ID of 4689 are filtered and no longer sent to Google SecOps for ingestion.

That’s it for today’s post where we learned how to use Bindplane’s data pipeline processing feature to filter logs from being ingested into Google SecOps.

Gianluigi_Iobiz
New Member
Forum|Forum|3 months ago
March 9, 2026

Great feature. What about filtering logs coming from cloud-to-cloud feeds?

Like

+4

hliu
Bronze 2
Forum|Forum|1 month ago
May 18, 2026

Google Secops’ “new” ingestion method has a batch/event limit of 4MB. It’d be interesting to understand if the new Secops Pipelines are applied before or after the event max size limit.

Secops Pipelines doesn’t support Bindplane’s Unroll processor as of today.
If that processor could be added into those new Secops Pipelines, and if those Secops Pipelines are applied BEFORE the event max size limit, then we might be able to unroll those events arriving in huge batches or JSON arrays from the source, and overcome the Secops limit of 4MB event size that some vendors refuse to adapt to, e.g. Akamai Datastream or Azure VNET Flow logs.

By the way, Google Secops is discontinuing some v1 Cloud Storage Data Feeds, and in the v2, at least in Amazon SQS v2, we realize it is bypassing Secops Pipelines. Hopefully it could be solved before v1 EOL.

Like

+1

jdalmet
Bronze 2
Forum|Forum|1 month ago
May 19, 2026

Looks Good.
Any idea what need to check if we think data feeds Secops via bindplane was slow ..
how to test whether bottleneck is at Bindplane level or LB level or Google Secops ?

Like

+4

hliu
Bronze 2
Forum|Forum|1 month ago
May 23, 2026

Looks Good.
Any idea what need to check if we think data feeds Secops via bindplane was slow ..
how to test whether bottleneck is at Bindplane level or LB level or Google Secops ?

On top of my head,

-if it is a polling source, check the polling frequency.

-check for retry messages or other errors in gcp cloud monitoring.

-check for secops ingestion API utilisation in gcp, there are metrics for errors and API latency if I recall correctly.

-check for errors in Bindplane collector logs.

-check for LB metrics: count of SYN ACK, port exhaustion, etc.

-the classic way to systematically troubleshoot/monitor latencies in a multi-step block architecture would be, at least in other solutions like Cribl+Splunk, adding a timestamp in the data/metadata in each block to calculate the time difference between them. percentile95 is often used to exclude the extreme values from occasional hiccups.

e.g

high p95(time_siemIngest - time_event) signals general latency.

low p95(t_siemIngest - t_middleware) = latency is not coming between those 2.

high p95(t_middleware - t_event) = latency issues between them.

If the last 2 calculations were both low, it might signal latency coming from the data source.

It is possible to calculate the general latency using Secops ingestion time and event time.

But I’m not sure if there’s any way to add a Bindplane time or Secops-pipeline time into any dedicated UDM timestamp field, to have more granularity on the latency origin.

Perhaps there’re more elegant ways to tackle it, eg. dedicated Bindplane metrics for delay calculation, or processor for timestamping. I don’t know them.

Like

Sign up

Login with SSO

Login to the community

Login with SSO

Scanning file for viruses.

This file cannot be downloaded