-
Notifications
You must be signed in to change notification settings - Fork 474
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Various runtime issues running numerical simulation #1917
Comments
for reference
|
Sorry for the late reply. Can you please share the log file? It'd be hard to know without more information. Is there anything special with your program? I guess it's written in C/C++ and Python. Did you build with |
Hi @PhilipDeegan, I'm trying to reproduce your problem so I've tried to build the your project as follows.
But the execution itself without uftrace isn't successful.
Is there anything I miss for the execution? If this can be reproduced from our side, then we might be able to help you better. |
Gentlemen @namhyung @honggyukim
this is done automatically for you via cmake/ctest, but running directly from CLI needs the export |
(clicked wrong button posting the previous comment...) |
I'm not sure typically I would do |
It looks like it runs with a single thread. The execution output looks as follows.
After a while later, the program is finished normally. So I'm trying it with
I will try that for the next try. |
Hmm.. It looks the execution was finished with some issues as follows.
|
Anyway, the above abnormal record shows some traces and its
|
trying it for myself, with uftrace HEAD, it exits almost immediately
|
for reference
|
The Did you compile your binary with |
oh, I had used |
I've tried it again with
Then tried
Should I keep |
Hmmm, it should work, assuming you don't have some cache somewhere still pointing to the old version see on our GHA we just do the following |
without
I will add for completeness, I am currently on fedora, when I saw the segfaults on debian. I will be on debian again later to confirm |
I didn't use
I haven't tested on fedora these days. Do you see other simple programs can be traced with uftrace? |
looks ok generally
|
oh, our third party dependency might be caching the old hdf5 you can try the following from the project root
|
Hmm, the build script downloads samrai even if it's removed as you suggested.
Then it still fails with the same error as above. |
hmm, you can tell cmake where HDF5 is via where the directory would be something like when the library exists at you might not have a symlink in you can check like |
I get this after removing then install parallel versions as follows.
If this wasn't what you mean then I will try again later. It's getting late so I need to sleep now. |
@honggyukim no worries, thanks for your attention 🛌 |
Hi, I've tried it after installing Then I see it takes about a bit less than 2 mins as follows.
I've tried it with uftrace record, then the record was successful, but it took about more than 12 mins even though I applied
The record looks fine. But I'm just wondering if the original execution is timing sensitive so it takes much longer when uftrace is attached. |
Some report output looks as follows.
|
that's nice @honggyukim but it's not really representative of what I'm doing which with parallel mpi if you check the output of still for me, it fails
|
Hi there,
I work on a somewhat complicated project for modeling astrophysical systems.
I am trying to use uftrace for logging the call stack, and running into some segfaults and such.
We have many tests and run with ASAN and things so to me this segfault is somewhat unusual, but I can't rule out it is indeed a problem with the code, and not uftrace. We are using pybind and execute some python scripts via the embedded interpreter, from our native binary entrypoint, if that might be causing some issues.
running uftrace on our native binaries results in the following cases
with -e
If I use this command (picked up from)
uftrace exits with
-1
after a second or so, leaving the child process running (but taking 0% CPU so not scheduled)Any suggestions are welcome
Thanks
The text was updated successfully, but these errors were encountered: