ExtractMany / String Layer
Extract all occurrences of multiple literal patterns from text. Returns a list column containing all matches. Similar to pandas' Series.str.findall() or Python's re.findall(), but matches a set of literal terms instead of a regular expression. Useful for:
- Finding specific terms in documents
- Extracting known codes or identifiers
- Collecting mentions of specific products or names
- Text analysis with predefined vocabulary
Select
column: The string column to search within. Text may contain multiple matches of different patterns.
Patterns
[Value, ...]: List of text patterns to search for. Must contain at least one pattern. All matches are collected in order of appearance in the text.
Value
string: Text pattern to find. Unlike a regex, the pattern is treated as literal text. Examples:
- Product codes [MODEL-A, MODEL-B]
- Keywords [Error, Warning, Critical]
- Common phrases [Thank you, Best regards]
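To illustrate what literal matching means, here is a minimal Python sketch. It is not the tool's implementation; the function name extract_many and the longest-pattern-first ordering are assumptions made for this example. It escapes each pattern so that regex metacharacters are matched as plain text, and returns matches in order of appearance.

```python
import re

def extract_many(text, patterns):
    """Return all non-overlapping literal matches in order of appearance."""
    if not patterns:
        raise ValueError("at least one pattern is required")
    # Longest-first ordering is a choice made for this sketch, so that
    # "MODEL-AB" is not cut short by "MODEL-A".
    ordered = sorted(patterns, key=len, reverse=True)
    # re.escape turns every pattern into literal text.
    alternation = "|".join(re.escape(p) for p in ordered)
    return re.findall(alternation, text)

codes = extract_many("Ship MODEL-A and MODEL-B, then MODEL-A again.",
                     ["MODEL-A", "MODEL-B"])
print(codes)  # ['MODEL-A', 'MODEL-B', 'MODEL-A']

# The dot is literal here, so "v1x0" is not a match.
print(extract_many("v1.0 and v1x0", ["v1.0"]))  # ['v1.0']
```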
AsciiInsensitive
bool: When true, matches ASCII letters regardless of case (e.g., Error matches ERROR, error). Affects only ASCII characters a-z and A-Z, not Unicode letters.
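The ASCII-only rule can be sketched in Python as follows. This is an illustration under assumptions, not the tool's code: the helper names ascii_fold and contains_ascii_insensitive are hypothetical. Only A-Z is folded to a-z, so accented letters still have to match exactly.

```python
import string

# Maps only A-Z to a-z; every other character is left unchanged.
ASCII_FOLD = str.maketrans(string.ascii_uppercase, string.ascii_lowercase)

def ascii_fold(s):
    """Lowercase ASCII letters only (hypothetical helper for this sketch)."""
    return s.translate(ASCII_FOLD)

def contains_ascii_insensitive(text, pattern):
    """True if pattern occurs in text, ignoring case for ASCII letters only."""
    return ascii_fold(pattern) in ascii_fold(text)

print(contains_ascii_insensitive("ERROR: disk full", "Error"))  # True
# U+00C9 (E with acute) is not an ASCII letter, so it is not folded
# and does not match its lowercase form U+00E9.
print(contains_ascii_insensitive("CAF\u00c9", "caf\u00e9"))     # False
```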
Overlapping
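bool: Controls match behavior:
- true: Allows matches to overlap (e.g., finding both at and attack in attack)
- false: Finds only non-overlapping matches
Enable overlapping when every occurrence of every pattern matters; leave it off when each stretch of text should be matched at most once.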
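The difference between the two modes can be seen in a small Python sketch. It is a simplified scan written for illustration, not the tool's algorithm: the function find_literals is hypothetical, and the tie-breaking (first listed pattern wins in non-overlapping mode) and the ordering of overlapping results (by start position, then pattern order) are assumptions that may differ from the real behavior.

```python
def find_literals(text, patterns, overlapping):
    """Collect literal matches, optionally letting them overlap."""
    patterns = [p for p in patterns if p]  # ignore empty patterns
    matches = []
    pos = 0
    while pos < len(text):
        # All patterns that start exactly at this position.
        hits = [p for p in patterns if text.startswith(p, pos)]
        if not hits:
            pos += 1
        elif overlapping:
            # Report every hit, then advance by one character only,
            # so later matches may share characters with these.
            matches.extend(hits)
            pos += 1
        else:
            # Report one hit and skip past it, so nothing can overlap it.
            matches.append(hits[0])
            pos += len(hits[0])
    return matches

print(find_literals("attack", ["at", "attack"], overlapping=True))   # ['at', 'attack']
print(find_literals("attack", ["at", "attack"], overlapping=False))  # ['at']
print(find_literals("abcd", ["abc", "bcd"], overlapping=True))       # ['abc', 'bcd']
print(find_literals("abcd", ["abc", "bcd"], overlapping=False))      # ['abc']
```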
AsColumn
name: Name for the new column. If not provided, the system generates a unique name. If AsColumn matches an existing column, the existing column is replaced. The name should follow valid column naming conventions.
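The naming rules above can be sketched with a plain dictionary standing in for a table. Everything here is hypothetical: the helpers output_name and add_column, and in particular the generated name format extract_many_N, are assumptions for illustration; the source does not say how the system builds its unique names.

```python
import itertools

def output_name(existing_names, as_column=None):
    """Pick the output column name (the naming scheme is hypothetical)."""
    if as_column:
        # May equal an existing name, in which case that column is replaced.
        return as_column
    for i in itertools.count():
        candidate = f"extract_many_{i}"  # assumed format, not from the source
        if candidate not in existing_names:
            return candidate

def add_column(table, values, as_column=None):
    """Return a copy of table with the match column added or replaced."""
    result = dict(table)
    result[output_name(table.keys(), as_column)] = values
    return result

table = {"text": ["MODEL-A ok"], "hits": [["stale"]]}
replaced = add_column(table, [["MODEL-A"]], as_column="hits")
generated = add_column(table, [["MODEL-A"]])
print(list(replaced))   # ['text', 'hits']
print(list(generated))  # ['text', 'hits', 'extract_many_0']
```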