Top end model flattening #3306
Replies: 5 comments 5 replies
-
|
Hi @Friedemannn , The setup looks similar to the one where I asked you about the bounds issue - I see that I missed replying to the second half of that discussion. Please use the input transform class input_transform=Normalize(bounds=your_bounds, dim=num_dims) along with the outcome transform. Normalization/Standardization is the most common issue I see, and using that correctly eliminates that source of error. In your previous problem, the bounds were not [0, 1], and I really recommend providing the actual bounds everywhere and let the model deal with it through normalize. Manual solutions to this problem tend do be substantially more error-prone. Some other things:
Getting the actual data that you are loading would help a lot here! The minimum noise could be the problem, but I'm pretty confident that the issue it the bounds and input normalization of the 6D-objective specifically. |
Beta Was this translation helpful? Give feedback.
-
|
@Friedemannn Glad to hear! Anything else that's not workin, or can we close out the discussion? |
Beta Was this translation helpful? Give feedback.
-
|
Does anyone have an idea where this behavior comes from? The deviation from the ideal fit in the 6D version is probably because of to few data points for 6d to work well. But I have a hard time even guessing where this flat top comes from. |
Beta Was this translation helpful? Give feedback.
-
|
@Friedemannn Sorry, was off for a few days. You mean the predictions for the very largest values, right? These would be considered the "worst", right? A tip: Color the points by the distance they have to either the boundary or to any other point in the search space. For example, points that are close to other points (e.g. by min distance and by kernel distance - two different plots) can be colored blue, and those with a large min distance would be colored red. I'm sure you would see a lot of red for those flat top points, meaning that they would be far from other points and thus difficult to predict. Just a hunch, but I think it's worth exploring. |
Beta Was this translation helpful? Give feedback.
-
|
@Friedemannn okay, I see. I thought it was the opposite, which makes me less optimistic. However:
I would add too few data points around those specific data points, yes. However, that is pretty unlikely if they are good - I thought they were bad. Now, I actually think they will be very close to one another, but that your objective varies very quickly around these specific data points, making them hard to predict. Simply put, I would guess there's some level of heterogeneity. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi all,
after I've implemented cross validation (see #3298) as a new visualisation option for my models I discovered that the models do not describe the underlying/simulated objective values at the top end.
The predictions do not pass some threshold.
Its most visible in this 6D run: 6D_crossval_vis.pdf
But it also can be seen in 2D runs (subsets of the 6 Dimensions, same bounds): 2D_crossval_vis1.pdf, 2D_crossval_vis2.pdf, 2D_crossval_vis3.pdf
My code looks like this:
Does anyone know what the problem might be?
Could the minimum noise I'm introducing be a problem? (I introduced that to solve the problem i had in #3133)
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions