replicate Transform¶
The replicate
transform replicates a Pandas DataFrame. It will
replicate it once for each asset in the asset_list. It will also
add an ‘asset’ column that contains the name of the asset.
- Example:
Say we have a DataFrame that looks like this:
some_field,stream_type,timestamp 3.14 ,'a_stream' ,2018-01-01 00:00:00.000000 2.71828 ,'b_stream' ,2018-01-02 00:00:00.000000 0.0 ,'my_stream',2018-01-03 00:00:00.000000
And we want to replicate it for three different assets:
asset ,some_field,stream_type,timestamp 'asset1',3.14 ,'a_stream' ,2018-01-01 00:00:00.000000 'asset1',2.71828 ,'b_stream' ,2018-01-02 00:00:00.000000 'asset1',0.0 ,'my_stream',2018-01-03 00:00:00.000000 'asset2',3.14 ,'a_stream' ,2018-01-01 00:00:00.000000 'asset2',2.71828 ,'b_stream' ,2018-01-02 00:00:00.000000 'asset2',0.0 ,'my_stream',2018-01-03 00:00:00.000000 'asset3',3.14 ,'a_stream' ,2018-01-01 00:00:00.000000 'asset3',2.71828 ,'b_stream' ,2018-01-02 00:00:00.000000 'asset3',0.0 ,'my_stream',2018-01-03 00:00:00.000000
Our configuration will look something like this:
{ "transform_name": "Replicate", "transform_type": "replicate", "filter_stream": [ "*" ], "asset_list": ["asset1", "asset2", "asset3"], }
Configuration:
Required and optional properties that can be configured for a replicate transform.
asset_list: The list of assets that will be added to the DataFrame
transform_name: Unique name for the transform.
transform_type: Type of transform to apply. Should be
replicate
.filter_stream: List of data streams to transform. Each stream can either be
*
(all) orasset:stream
.