LightGBM numBatches and earlyStoppingRound conflict

### SynapseML version

1.0.13

### System information

- **Language version**: Python 3.9

### Describe the problem

From glancing at the LightGBM code, I believe there is a conflict between the `numBatches` parameter and `earlyStoppingRound`. If you set both of these params, I think you may hit early stopping in the first batch and then never train on the remaining batches.

This would be very suboptimal. `earlyStoppingRound` is intended to improve generalization. It would be tragic if it instead caused training to see only a small fraction of the data, greatly reducing generalization.

I think that when both of these parameters are present, early stopping should apply separately to each batch, i.e. when training has stopped making progress on the current batch, it should continue to the next batch.
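To make the difference concrete, here is a toy sketch in plain Python. It is not SynapseML code and does not claim to reproduce its internals: `rounds_before_stop`, `train_in_batches`, and the made-up validation losses are all hypothetical, and the "suspected" mode only models my reading of the code, which I have not confirmed.

```python
from typing import List, Sequence, Tuple


def rounds_before_stop(val_losses: Sequence[float], patience: int) -> Tuple[int, bool]:
    """Return (rounds run, whether early stopping fired) for one batch.

    Early stopping fires once `patience` consecutive rounds fail to improve
    on the best validation loss seen so far within this batch.
    """
    best = float("inf")
    stale = 0
    for i, loss in enumerate(val_losses, start=1):
        if loss < best:
            best, stale = loss, 0
        else:
            stale += 1
            if stale >= patience:
                return i, True
    return len(val_losses), False


def train_in_batches(
    batch_losses: Sequence[Sequence[float]],
    patience: int,
    reset_per_batch: bool,
) -> List[int]:
    """Return the number of rounds run on each batch.

    reset_per_batch=False models the suspected behaviour (an assumption):
    once early stopping fires, all remaining batches are skipped.
    reset_per_batch=True models the proposal: early stopping only ends the
    current batch, and training moves on to the next one.
    """
    rounds: List[int] = []
    stopped = False
    for losses in batch_losses:
        if stopped and not reset_per_batch:
            rounds.append(0)  # batch never seen
            continue
        n, fired = rounds_before_stop(losses, patience)
        stopped = stopped or fired
        rounds.append(n)
    return rounds


# Hypothetical per-round validation losses for three batches.
batch_losses = [
    [1.0, 0.9, 0.95, 0.8],
    [0.8, 0.7, 0.6, 0.65],
    [0.6, 0.5, 0.4, 0.3],
]
suspected = train_in_batches(batch_losses, patience=1, reset_per_batch=False)
proposed = train_in_batches(batch_losses, patience=1, reset_per_batch=True)
print(suspected)  # [3, 0, 0]
print(proposed)   # [3, 4, 4]
```

With `patience=1` (matching `earlyStoppingRound=1`), a single noisy round in the first batch is enough to skip every later batch in the suspected mode, while the proposed mode still visits all of them.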

### Code to reproduce issue

```
from synapse.ml.lightgbm import LightGBMRegressor

model = LightGBMRegressor(
    featuresCol="featureVector",
    labelCol="label",
    predictionCol="prediction",
    validationIndicatorCol="validation",
    numIterations=800,
    numBatches=10,
    earlyStoppingRound=1,
)
# training_data: a Spark DataFrame containing the columns named above
model.fit(training_data)  # may stop in the first batch, without seeing 90% of the training data
```

### Other info / logs

_No response_

### What component(s) does this bug affect?

- [ ] `area/cognitive`: Cognitive project
- [ ] `area/core`: Core project
- [ ] `area/deep-learning`: DeepLearning project
- [x] `area/lightgbm`: Lightgbm project
- [ ] `area/opencv`: Opencv project
- [ ] `area/vw`: VW project
- [ ] `area/website`: Website
- [ ] `area/build`: Project build system
- [ ] `area/notebooks`: Samples under notebooks folder
- [ ] `area/docker`: Docker usage
- [ ] `area/models`: models related issue

### What language(s) does this bug affect?

- [ ] `language/scala`: Scala source code
- [x] `language/python`: Pyspark APIs
- [ ] `language/r`: R APIs
- [ ] `language/csharp`: .NET APIs
- [ ] `language/new`: Proposals for new client languages

### What integration(s) does this bug affect?

- [ ] `integrations/synapse`: Azure Synapse integrations
- [ ] `integrations/azureml`: Azure ML integrations
- [ ] `integrations/databricks`: Databricks integrations

LightGBM numBatches and earlyStoppingRound conflict #2442

SynapseML version

System information

Describe the problem

Code to reproduce issue

Other info / logs

What component(s) does this bug affect?

What language(s) does this bug affect?

What integration(s) does this bug affect?

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

LightGBM numBatches and earlyStoppingRound conflict #2442

Description

SynapseML version

System information

Describe the problem

Code to reproduce issue

Other info / logs

What component(s) does this bug affect?

What language(s) does this bug affect?

What integration(s) does this bug affect?

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions