[FIX] reduce the memory consumption of SaQC by >50% through Histories of type pd.Categorical
parent
0ffd8c04
No related branches found
No related tags found
This commit is part of merge request !269. Comments created here will be created in the context of that merge request.
Loading
-
mentioned in issue #209 (closed)
-
From some brief benchmarking, i got, that casting via
df.astype(pd.SparseDtype('float', np.nan))
, instead ofdf.astype('category')
, is faster in casting (around 30 percent) and uses less memory (Factor 1-10, without initial unflagged column: factor 2-20) and is faster in column and row access and also in row wise max calculation.So, since integrating would just mean to replace category cast by sparse cast, maybe we should give it a try?
-
Sure! But let's please do it after !260 (merged) was merged.
-
!260 (merged) is in now, so feel free to sparsify.
Please register or sign in to comment