Valueerror When Performing Scipy.stats Test On Pandas Column Selection By Row
The goal is to create a new column in a pandas column that stores the value of a KS D-statistic, df['ks']. The KS statistic is generated between two groups of columns in that data
Solution 1:
The KS calculation in the loop was getting a "too deep" error because I needed to pass it a 1-D array for each distribution to test:
for idx, row in df.iterrows():
df.loc[idx, 'ks'] = stats.ks_2samp(df.loc[idx, grp1], (df.loc[idx, grp2]))[0]
My previous attempt used a 2-D array instead. That is what was causing it to be "too deep"
Post a Comment for "Valueerror When Performing Scipy.stats Test On Pandas Column Selection By Row"