Overview
An origin name can be stored by Metadb in the__origin column to tag individual records. The use and meaning of origins are typically defined by:
- Your application logic
- A Metadb module
- Custom business rules
Data origins allow grouping data independently of data sources. This is useful when data sources are dictated by how data are collected (e.g., geographically in a sensor network), but you need different logical groupings.
Create Data Origin
Define a new data origin.Syntax
Parameters
A unique name for the data origin to be created.
Example
Create a new origin for test data:List Data Origins
View all configured data origins:Using Data Origins
Once created, origins can be referenced in your data processing and used to filter or group records. The__origin column in your tables will contain these origin identifiers.
Example Query
Filter records by origin:Use Cases
Multi-Tenant Applications
Multi-Tenant Applications
Use origins to identify which tenant or organization a record belongs to, even when all tenants share the same data source.
Geographic Grouping
Geographic Grouping
Tag records with geographic origins (e.g.,
'east_region', 'west_region') while collecting from location-based data sources.Environment Separation
Environment Separation
Distinguish between production, staging, and test data flowing through the same infrastructure.
Data Classification
Data Classification
Mark records with classification labels (e.g.,
'public', 'internal', 'confidential') for access control purposes.Data Origins vs Data Sources
- Data Sources
- Data Origins
- Define how data is collected
- Represent physical or technical boundaries
- Examples: specific Kafka topics, database connections
- Cannot be changed after data is ingested
Best Practices
Plan Your Origin Strategy Early: Define your origin naming scheme before ingesting data. Consistent naming makes querying and reporting much easier.
See Also
Data Sources
Configure external data sources
Access Control
Grant and revoke access to data
