Skip to content Skip to sidebar Skip to footer

Valueerror When Performing Scipy.stats Test On Pandas Column Selection By Row

The goal is to create a new column in a pandas column that stores the value of a KS D-statistic, df['ks']. The KS statistic is generated between two groups of columns in that data

Solution 1:

The KS calculation in the loop was getting a "too deep" error because I needed to pass it a 1-D array for each distribution to test:

for idx, row in df.iterrows():
    df.loc[idx, 'ks'] = stats.ks_2samp(df.loc[idx, grp1], (df.loc[idx, grp2]))[0]

My previous attempt used a 2-D array instead. That is what was causing it to be "too deep"

Post a Comment for "Valueerror When Performing Scipy.stats Test On Pandas Column Selection By Row"