[FIX] reduce the memory consumption of SaQC by >50% through Histories of type pd.Categorical (686065dc) · Commits · rdm-software / SaQC

mentioned in issue #209 (closed)

From some brief benchmarking, i got, that casting via df.astype(pd.SparseDtype('float', np.nan)), instead of df.astype('category'), is faster in casting (around 30 percent) and uses less memory (Factor 1-10, without initial unflagged column: factor 2-20) and is faster in column and row access and also in row wise max calculation.

So, since integrating would just mean to replace category cast by sparse cast, maybe we should give it a try?

Sure! But let's please do it after !260 (merged) was merged.

!260 (merged) is in now, so feel free to sparsify.