Tutorial on Transfer Learning Bayesian Optimisation #439
Conversation
Thanks @jpfolch. I will provide a review tomorrow. Best, Johannes
Hi @jpfolch,
looks very nice. Thank you!
I left one comment inline, and here are two other ones:

- Can you also add a validator to the botorch data model (https://github.com/experimental-design/bofire/blob/main/bofire/data_models/strategies/predictives/botorch.py) that checks that if a `MultiTaskGP` is present, only one category/task is allowed in the `TaskFeature`? Otherwise we will get problems, especially as the fidelity/task selection is currently missing. You need to use an after validator (`@model_validator(mode="after")`); see the sketch after this comment.
- Can you add optimization campaigns to the notebook that show that using information from the lower fidelity enhances convergence in comparison to not using this information?
Best,
Johannes
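A minimal, self-contained sketch of such an after-validator (class and field names like `BotorchStrategy`, `surrogate_types`, and `TaskFeature` are illustrative stand-ins here, not the actual BoFire data model):

```python
from typing import List

from pydantic import BaseModel, model_validator


class TaskFeature(BaseModel):
    categories: List[str]
    allowed: List[bool]


class BotorchStrategy(BaseModel):
    surrogate_types: List[str]
    task_feature: TaskFeature

    @model_validator(mode="after")
    def validate_single_allowed_task(self):
        # With a MultiTaskGP and no fidelity/task selection implemented yet,
        # exactly one task/category may be allowed for optimization.
        if "MultiTaskGPSurrogate" in self.surrogate_types:
            if sum(self.task_feature.allowed) != 1:
                raise ValueError(
                    "Exactly one allowed category/task is required when a "
                    "MultiTaskGPSurrogate is present."
                )
        return self
```

With this in place, constructing the strategy with a `MultiTaskGPSurrogate` and two allowed tasks raises a `ValidationError`.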
```diff
@@ -170,13 +170,21 @@ def _predict(self, transformed: pd.DataFrame) -> Tuple[np.ndarray, np.ndarray]:
         # input and further transform it to a torch tensor
         X = torch.from_numpy(transformed.values).to(**tkwargs)
         with torch.no_grad():
-            posterior = self.model.posterior(X=X, observation_noise=True)  # type: ignore
+            # observation noise is not implemented for MultiTaskGPSurrogate, has to be treated differently
+            if self.surrogate_specs.surrogates[0].type == "MultiTaskGPSurrogate":
```
Can you add a validator to `BotorchSurrogates` that checks that, in case one `MultiTaskGP` is present, all surrogates are of type `MultiTaskGP`? Otherwise, we will get trouble with the different required encodings. You have to add it here: https://github.com/experimental-design/bofire/blob/main/bofire/data_models/surrogates/botorch_surrogates.py (see the sketch below).
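A minimal sketch of the requested consistency check, again with stand-in classes rather than the real data models in `botorch_surrogates.py`:

```python
from typing import List

from pydantic import BaseModel, field_validator


class SurrogateSpec(BaseModel):
    type: str


class BotorchSurrogates(BaseModel):
    surrogates: List[SurrogateSpec]

    @field_validator("surrogates")
    @classmethod
    def validate_multitask_consistency(cls, v):
        # Multi-task models require a different input encoding, so mixing
        # MultiTaskGP surrogates with any other surrogate type is rejected.
        types = {s.type for s in v}
        if "MultiTaskGPSurrogate" in types and len(types) > 1:
            raise ValueError(
                "If a MultiTaskGPSurrogate is present, all surrogates must "
                "be of type MultiTaskGPSurrogate."
            )
        return v
```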
You were correct, and the code actually crashed when using multiple models (even if all were MultiTaskGPs). I sent a fix for it and it seems to work, but I am not sure it is entirely correct; I do not really understand when we expect `len(posterior.mean.shape) == 2` and when `len(posterior.mean.shape) == 3`. Any guidance would be appreciated.
Oh, it is also important to mention that when I tested with a single surrogate model, or with two of them, in all cases it went through the `len(posterior.mean.shape) == 2` branch, which caused my confusion...
You get a posterior mean with three dimensions in case of fully Bayesian GPs. Have a look here: https://github.com/experimental-design/bofire/blob/main/bofire/surrogates/fully_bayesian.py
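For orientation, a small sketch of the two shape cases discussed here; the dimension names are assumptions based on the fully Bayesian surrogate linked above:

```python
import torch

# A standard (MAP) GP posterior mean has two dimensions:
# (n_points, n_outputs). A fully Bayesian (SAAS) GP keeps one posterior per
# MCMC hyperparameter sample, which adds a leading dimension:
# (n_mcmc_samples, n_points, n_outputs).
mean_map = torch.zeros(8, 1)
mean_fully_bayesian = torch.zeros(16, 8, 1)

# Averaging over the MCMC dimension recovers a (n_points, n_outputs) mean.
assert mean_fully_bayesian.mean(dim=0).shape == mean_map.shape
```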
Looking at this and at how messy it gets here, I want to propose the following solution: if the model can handle observation noise, we use it; otherwise we do not.

```python
try:
    posterior = self.model.posterior(X=X, observation_noise=True)
except Exception:  # ideally, catch the specific error thrown for MultiTask
    posterior = self.model.posterior(X=X, observation_noise=False)
```

The rest we keep as it is, and I will refactor this whole part in a later PR. OK for you?
Hi @jduerholt, I have made the requested changes. I am a little unsure about the … I also added some testing, checking that the …
Thanks, I will have a look on Monday at the latest, but I hope to get it done tomorrow!
Hi @jpfolch,
thank you very much. Looks very good. I think we are almost there. Have a look at my inline comments.
Regarding the notebook, only two things:
- Can you also add a "SMOKE_TEST" limitation for the iterations that you run in the notebook, so that in case of the automated test pipeline, only two iterations per optimization campaign are run? (See the sketch after this comment.)
- There is a variable called `regrets_single_task_mean`. Can you rename it to `regrets_single_task_median`, as it is actually the median?
- Optionally, one could think of creating a benchmark class for the multitask benchmark that you created and then using the automatic benchmark executors. But this is only optional.
Best,
Johannes
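A common notebook pattern for such a limitation; the environment variable name and the full-run budget here are assumptions, not BoFire conventions:

```python
import os

# The test pipeline is assumed to set SMOKE_TEST; the notebook then
# shortens every optimization campaign to two iterations.
SMOKE_TEST = os.environ.get("SMOKE_TEST", "false").lower() == "true"
N_ITERATIONS = 2 if SMOKE_TEST else 30  # 30 is an illustrative full budget

for i in range(N_ITERATIONS):
    ...  # one proposal/evaluation step of the optimization campaign
```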
```diff
@@ -227,3 +229,17 @@ def _generate_surrogate_specs(
     surrogate_specs.surrogates = _surrogate_specs
     surrogate_specs._check_compability(inputs=domain.inputs, outputs=domain.outputs)
     return surrogate_specs
+
+    @model_validator(mode="after")
```
This has to be moved to `data_models/strategies/predictives/botorch.py`, as it is just a requirement for optimization.
The corresponding test would then go into `tests/bofire/data_models/specs/strategies.py`, and you would add an invalid spec for SOBO, for example, as it inherits from the base class.
> This has to be moved to `data_models/strategies/predictives/botorch.py`, as it is just a requirement for optimization.

Is it not already in the correct file?

I am adding the test now though.
```diff
@@ -131,4 +131,10 @@ def validate_surrogates(cls, v, values):
         raise ValueError(
             f"Preprocessing steps for features with {key} are incompatible."
         )
+        # check that if any surrogate is a MultiTaskGPSurrogate, all have to be
```
This needs to be tested in `tests/bofire/data_models/specs/surrogates.py`. You can add invalid configurations there, together with the error that should be thrown. This is the new default way of validating data models.
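A minimal, self-contained pytest sketch of such an invalid-configuration test, with stand-in models as in the validator sketch above (the real fixtures follow BoFire's spec framework in `tests/bofire/data_models/specs/surrogates.py`):

```python
from typing import List

import pytest
from pydantic import BaseModel, ValidationError, field_validator


class SurrogateSpec(BaseModel):
    type: str


class BotorchSurrogates(BaseModel):
    surrogates: List[SurrogateSpec]

    @field_validator("surrogates")
    @classmethod
    def validate_multitask_consistency(cls, v):
        types = {s.type for s in v}
        if "MultiTaskGPSurrogate" in types and len(types) > 1:
            raise ValueError("All surrogates must be MultiTaskGPSurrogate.")
        return v


def test_mixed_surrogate_types_are_invalid():
    # Invalid configuration: a multi-task and a single-task surrogate mixed.
    with pytest.raises(ValidationError, match="MultiTaskGPSurrogate"):
        BotorchSurrogates(
            surrogates=[
                SurrogateSpec(type="MultiTaskGPSurrogate"),
                SurrogateSpec(type="SingleTaskGPSurrogate"),
            ]
        )
```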
```python
        ),
    ],
)
def test_qehvi_with_multitask(task_input, surrogate_2):
```
Can you add this to `tests/bofire/strategies/test_mobo.py`? `qehvi` is deprecated and only included for legacy reasons.
```python
    surrogate_data_2 = surrogate_2(inputs=inputs, outputs=outputs_2)
    surrogate_data = [surrogate_data_1, surrogate_data_2]

    # test for error if both models are not multi-task
```
This would go in the specs part, as mentioned above.
```python
    surrogate_specs = BotorchSurrogates(surrogates=surrogate_data)

    # test for error if task input has more than 1 allowed category
    if sum(task_input.allowed) > 1:
```
This would also go in the specs, as mentioned above.
```python
        surrogate_specs=surrogate_specs,
    )

    strategy = strategies.map(strategy_data_model)
```
This would be functionally tested for real in `test_mobo`.
```python
        acquisition_function=qLogEI(),
    )

    strategy = strategies.map(strategy_data_model)
```
Keep only this; the rest would go in the specs, as it is part of the data model validators.
Hi @jduerholt, I have made the requested changes. A few points: …

Thanks!
Looks very good to me. Thank you very much!
Added a tutorial on how to use BoFire for Transfer Learning Bayesian Optimisation. Also made two small modifications for the code to run:

- The inverse transform for ordinal encodings now transforms into `int` type.
- When predicting within the task using MultiTaskGPs, we must add likelihood noise manually, since the `observation_noise` flag is not supported by BoTorch for MultiTaskGPs (see the sketch below).

The next step would be to incorporate a step for task selection for multi-fidelity problems.
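A minimal sketch of what adding the likelihood noise manually amounts to, using a `SingleTaskGP` as a self-contained stand-in and ignoring outcome transforms; adding `likelihood.noise` onto the posterior variance is the assumption here:

```python
import torch
from botorch.models import SingleTaskGP

train_X = torch.rand(10, 2, dtype=torch.double)
train_Y = torch.rand(10, 1, dtype=torch.double)
model = SingleTaskGP(train_X, train_Y)

test_X = torch.rand(5, 2, dtype=torch.double)
with torch.no_grad():
    # Noise-free posterior over the latent function values.
    posterior = model.posterior(X=test_X, observation_noise=False)
    # Manually add the learned homoskedastic noise variance, approximating
    # what `observation_noise=True` does for models that support it.
    noisy_variance = posterior.variance + model.likelihood.noise
```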