How to select candidate equations using performance of test data more efficiently? #622

leelew · 2024-05-07T02:30:43Z

leelew
May 7, 2024

Hi,

Thanks for developing such a useful tool!

I try to discover equations from observational data. I run PySR with different parameters settings (e.g., complexity, operators), and I want to select equations according to the performance of test data (e.g., RMSE < a & R > b). But I have to select the equations manually which is time-costing. Is there any method to select candidate equations more efficiently?

Best regards,
Lu

Answered by MilesCranmer

May 7, 2024

Thanks!

Would the following help?

import copy

equations = copy.deepcopy(model.equations_)

# this is a pandas dataframe, so we can add new columns:
equations["my_metric"] = [
    my_metric(
        model.predict(Xtest, index=i),
        ytest
    )
    for i in range(len(equations))
]


choice = equations["my_metric"].idxmin()
# ^ or idxmax() if maximizing

model.predict(X, index=index)
# ^ Predict with best (or can pass to .sympy/.latex/.jax/.pytorch)

View full answer

MilesCranmer · 2024-05-07T08:26:31Z

MilesCranmer
May 7, 2024
Maintainer

Thanks!

Would the following help?

import copy

equations = copy.deepcopy(model.equations_)

# this is a pandas dataframe, so we can add new columns:
equations["my_metric"] = [
    my_metric(
        model.predict(Xtest, index=i),
        ytest
    )
    for i in range(len(equations))
]


choice = equations["my_metric"].idxmin()
# ^ or idxmax() if maximizing

model.predict(X, index=index)
# ^ Predict with best (or can pass to .sympy/.latex/.jax/.pytorch)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to select candidate equations using performance of test data more efficiently? #622

{{title}}

Replies: 1 comment

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Select a reply

How to select candidate equations using performance of test data more efficiently? #622

leelew May 7, 2024

Replies: 1 comment

MilesCranmer May 7, 2024 Maintainer

leelew
May 7, 2024

MilesCranmer
May 7, 2024
Maintainer