Powerful data manipulation and analysis capabilities with pandas-based operations, filtering, transformation, and statistical analysis tools.
Comprehensive data manipulation capabilities for scientific workflows.
Comprehensive pandas-based data manipulation
Advanced filtering and data cleaning capabilities
Data type conversion and structural modifications
Built-in statistical functions and calculations
Essential nodes for data manipulation and transformation.
Choose specific columns from DataFrame
Extract relevant columns for analysis
Input DataFrame
Filtered DataFrame
Property | Type | Default | Description |
---|---|---|---|
columns | string | Column names (comma-separated) | |
keep_order | bool | true | Maintain column order |
Filter DataFrame rows by conditions
Remove unwanted data or focus on specific subsets
Input DataFrame
Filtered DataFrame
Property | Type | Default | Description |
---|---|---|---|
condition | string | Filter condition | |
case_sensitive | bool | false | Case sensitivity |
Combine multiple DataFrames
Combine data from different sources
Left DataFrame
Right DataFrame
Merged DataFrame
Property | Type | Default | Description |
---|---|---|---|
how | string | inner | Merge type (inner, outer, left, right) |
on | string | Column to merge on |
Remove duplicate rows from DataFrame
Clean datasets with duplicate entries
Input DataFrame
DataFrame without duplicates
Property | Type | Default | Description |
---|---|---|---|
subset | string | Columns to check for duplicates | |
keep | string | first | Which duplicate to keep |
Reusable patterns for data processing and analysis.
Complete workflow for cleaning experimental data
Combine data from multiple experimental sources
Filter and analyze data quality metrics
Tips for efficient and reliable data management workflows.
Always validate your data after major operations
Efficient handling of large datasets
Robust workflows handle data issues gracefully
Document your data processing steps
Learn about file I/O operations to import and export data in multiple formats.