-
David Schäfer authoredb6ea7f4e
Customizations
SaQC comes with a continuously growing number of pre-implemented quality check and processing routines and flagging schemes. For any sufficiently large use case however, chances are high, that the functions provided won't fulfill all your needs and requirements.
Acknowledging our insufficiency to address all (im-)possible use cases, we designed the system in a way, that makes it's extension and customization as simple as possible. The main extensions options, namely quality check routines and the flagging scheme are described within this documents.
Custom quality check routines
In case you are missing quality check routines, you are of course very welcome to file an feature request issue on the project's gitlab repository. However, if you are more the no-biggie-I-get-this-done-by-myself type of person, SaQC provides two ways to integrate custom routines into the system:
- The extension language
- An interface to the evaluation machinery
Interface
In order to make a function usable within the evaluation framework of SaQC it needs to implement the following interface:
def yourTestFunction(
data: pandas.DataFrame,
field: str,
flagger: saqc.flagger.BaseFlagger,
*args: Any,
**kwargs: Any
) -> (pd.DataFrame, saqc.flagger.BaseFlagger)
Argument Descriptions
Name | Description |
---|---|
data |
The actual dataset |
field |
The field/column within data , the function is checking/processing |
flagger |
A instance of a flagger, responsible for the translation of test results into quality attributes |
args |
Any other arguments needed to parameterize the function |
kwargs |
Any other keyword arguments needed to parameterize the function |
Integrate into SaQC
In order make your function available to the system it needs to be registered. We provide the decorator
register
in the module saqc.functions.register
, to integrate your
test functions into SaQC and expose them via a name of your choice. A complete, yet useless example might
look like that:
from saqc.functions.register import register
@register("myFunc")
def yourTestFunction(data, field, flagger, *args, **kwargs):
return data, flagger
Example
The function flagRange
provides a simple, yet complete implementation of
a quality check routine. You might want to look into its implementation before you start writing your
own.
Custom flagging schemes
Sorry for the inconvenience! Coming soon...