Mehmet Ergene

A Deep Dive into the KQL Union Operator

The union operator in KQL is used to merge the results of two or more tables (or tabular expressions) into a single result set. A familiar instance of this operation is the search operator, which implicitly performs a union when querying across multiple tables.

Syntax:

<Table1>
| union (<OptionalParameters>) <Table2>, <Table3>, <TabularExpression1>, ...


// We can also use the below syntax
union (<OptionalParameters>) <Table1>, <Table2>, <TabularExpression1>, ...

`kind` Parameter

The kind parameter in union operations dictates how the result set is structured in terms of column inclusion:

By default, union combines columns with the same names and data types into a single column. Columns without matches across all tables are still included in the result set and filled with nulls where applicable.

union UserLoginEvents, UserProfiles

The kind=inner modification alters this behavior. When applied, only the columns common to all participating tables are included in the result set, and non-matching columns are excluded.

union kind=inner UserLoginEvents, UserProfiles

The kind=inner option is particularly useful when you are interested in analyzing only the overlapping data between tables, providing a cleaner and more focused dataset.

The withsource parameter is beneficial when merging tables and you need to track the origin of each record:

The syntax withsource = <ColumnName> adds a new column to the result set. This column is named according to the <ColumnName> provided and records the source table for each row.

Example using withsource:

union withsource = TableName UserLoginEvents, UserProfiles

// This query fails because "InaccessibleTable" does not exist.
union UserLoginEvents, UserProfiles, InaccessibleTable

To prevent the entire query from failing due to one or more inaccessible tables, we can employ the isfuzzy=true parameter. This parameter instructs the query to proceed with the accessible tables and ignore the ones that cannot be accessed:

// This query succeeds, ignoring the missing "InaccessibleTable".
union isfuzzy=true UserLoginEvents, UserProfiles, InaccessibleTable

By using isfuzzy=true, we ensure that our query still returns useful data, despite any issues with table accessibility, allowing for a more resilient and fault-tolerant data retrieval approach.

Best Practices for the `union` Operator

Utilizing the union operator in KQL can be a powerful way to synthesize data from multiple tables. However, when dealing with large datasets, the order of operations is crucial for query performance. A common mistake is to combine the tables into a union and then apply filters, which can be inefficient and time-consuming.

For optimal performance, it’s recommended to filter each dataset before uniting them. This approach minimizes the amount of data being processed and can lead to significant gains in speed and efficiency.

Here’s how you can structure an efficient query:

union <OptionalParameters> (<TabularExpression1>), (<TabularExpression>), ...

union isfuzzy = true DeviceProcessEvents, DeviceEvents
| where Timestamp > ago(5d)
| where AccountName =~ "alex.wilber"

union isfuzzy = true 
    (
    DeviceProcessEvents
    | where Timestamp > ago(5d)
    | where AccountName =~ "alex.wilber"
    ), 
    (
    DeviceEvents
    | where Timestamp > ago(5d)
    | where AccountName =~ "alex.wilber"
    )

Latest from our blog

Easter Sale: 20% OFF

A Deep Dive into the KQL Union Operator

Optional Parameters for Union

`kind` Parameter

Example with default behavior:

Example with `kind=inner`:

`withsource` Parameter

`isfuzzy` Parameter

Best Practices for the `union` Operator

Update on pre-filtering

Latest from our blog

Template-2

Querying Azure Resource Graph Without Limits Using KQL

Threat Hunting and Detection Using Web Proxy Logs

Detecting BadSuccessor: Shorcut to Domain Admin

Featured Links

Connect with us

Policies

Easter Sale: 20% OFF

A Deep Dive into the KQL Union Operator

Optional Parameters for Union

kind Parameter

Example with default behavior:

Example with kind=inner:

withsource Parameter

isfuzzy Parameter

Best Practices for the union Operator

Update on pre-filtering

Latest from our blog

Template-2

Querying Azure Resource Graph Without Limits Using KQL

Threat Hunting and Detection Using Web Proxy Logs

Detecting BadSuccessor: Shorcut to Domain Admin

Featured Links

Connect with us

Policies

Subscribe to our Newsletter!

Fall in Love with KQL: 30% OFF!

Use VLTN30 at checkout!

New Challenge Lab

`kind` Parameter

Example with `kind=inner`:

`withsource` Parameter

`isfuzzy` Parameter

Best Practices for the `union` Operator