Why do I get a different lift result from get_cumlift() than from calculating it line by line? #706

**Describe the bug**
Hi Team! 
I used get_cumlift(), and got the lift for S-Learner like this:
<img width="72" alt="image" src="https://github.com/uber/causalml/assets/127222900/1dae7297-416d-4677-a60f-b5bc0a3c3ec3">

When I tried to reproduce the result by calculating it manually, I got a different result from get_cumlift().

```
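# df_try has the outcome `y`, the treatment flag `w`, and the score column named by `col`
sorted_df = df_try.sort_values(col, ascending=False).reset_index(drop=True)
sorted_df.index = sorted_df.index + 1  # 1-based rank = cumulative population size
sorted_df["cumsum_tr"] = sorted_df["w"].cumsum()  # treated rows so far
sorted_df["cumsum_ct"] = sorted_df.index.values - sorted_df["cumsum_tr"]  # control rows so far
sorted_df["cumsum_y_tr"] = (sorted_df["y"] * sorted_df["w"]).cumsum()  # treated outcome sum
sorted_df["cumsum_y_ct"] = (sorted_df["y"] * (1 - sorted_df["w"])).cumsum()  # control outcome sum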
```

This is what the table looks like:
<img width="294" alt="image" src="https://github.com/uber/causalml/assets/127222900/9eab57c9-96c4-46b3-9ead-0cc335b19213">

And then I calculate the lift:
```
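import numpy as np
import pandas as pd

# Cumulative mean outcome of treated minus cumulative mean outcome of control
lift = []
lift.append(sorted_df["cumsum_y_tr"] / sorted_df["cumsum_tr"] - sorted_df["cumsum_y_ct"] / sorted_df["cumsum_ct"])
lift = pd.concat(lift, join="inner", axis=1)
lift.loc[0] = np.zeros((lift.shape[1],))  # start the curve at zero
lift = lift.sort_index().interpolate()  # fill NaN rows by linear interpolation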
```

This is what the final result looks like:
<img width="67" alt="image" src="https://github.com/uber/causalml/assets/127222900/49b29728-d4d7-4f2d-8a7f-b02cd05c3fdb">

I plotted the difference between the result from get_cumlift() and the manual calculation.
![image](https://github.com/uber/causalml/assets/127222900/3565b90e-ccc6-4a00-baff-1530f1492077)

Does anyone know why they are different?
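
For anyone trying to reproduce this without the original data, the two snippets above can be combined into one self-contained function. This is only a sketch: the helper name `manual_cumlift`, the toy frame, and the score column name `model` are hypothetical and not part of causalml. It also passes `kind="mergesort"` to `sort_values`, which the snippets above do not, so that tied scores keep their input order; whether tie ordering explains the gap shown in the plot is not established.

```python
import numpy as np
import pandas as pd


def manual_cumlift(df, col, outcome_col="y", treatment_col="w"):
    """Cumulative lift of the population ranked by `col` (hypothetical helper)."""
    # Assumption: a stable sort, so rows with equal scores keep their input order.
    sorted_df = df.sort_values(col, ascending=False, kind="mergesort").reset_index(drop=True)
    sorted_df.index = sorted_df.index + 1  # 1-based rank = cumulative population size

    cumsum_tr = sorted_df[treatment_col].cumsum()            # treated rows so far
    cumsum_ct = sorted_df.index.values - cumsum_tr           # control rows so far
    cumsum_y_tr = (sorted_df[outcome_col] * sorted_df[treatment_col]).cumsum()
    cumsum_y_ct = (sorted_df[outcome_col] * (1 - sorted_df[treatment_col])).cumsum()

    # Mean treated outcome minus mean control outcome; NaN while either group is empty.
    lift = (cumsum_y_tr / cumsum_tr - cumsum_y_ct / cumsum_ct).rename(col).to_frame()
    lift.loc[0] = 0.0                        # start the curve at zero
    return lift.sort_index().interpolate()   # fill NaN rows by linear interpolation


# Toy data (made up for illustration only).
toy = pd.DataFrame(
    {
        "y": [1, 0, 0, 1],
        "w": [1, 0, 1, 0],
        "model": [0.9, 0.8, 0.7, 0.6],
    }
)
toy_lift = manual_cumlift(toy, "model")
print(toy_lift)
```

Running the same frame through both this function and get_cumlift() would show whether the two already disagree on a small input that can be checked by hand.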

**Environment (please complete the following information):**
 - OS:  Windows
 - Python Version: 3.8
 - Versions of Major Dependencies (`pandas`, `scikit-learn`, `cython`): `pandas==1.3.5`, `scikit-learn==1.0.2`, `cython==0.29.34`


