NaNs during training when running the TinyYoloHouseNumberDetection example #1084

Open
b005t3r opened this issue Jun 17, 2024 · 0 comments

Issue Description

Please describe your issue, along with:

  • expected behavior: the network trains normally, no NaNs are reported
  • encountered behavior: training crashes after the first fit() call because NaNs are detected (I enabled NaN detection, see below)

Version Information

Please indicate relevant versions, including, if relevant:

  • Deeplearning4j version: 1.0.0-beta7 and 1.0.0-M2.1
  • platform information: macOS Sonoma 14.2.1

The mentioned example:
https://github.com/deeplearning4j/deeplearning4j-examples/blob/master/dl4j-examples/src/main/java/org/deeplearning4j/examples/advanced/modelling/objectdetection/TinyYoloHouseNumberDetection.java

To reproduce, add this as the first call in the main() method:

        Nd4j.getExecutioner().setProfilingConfig(ProfilerConfig.builder()
                .checkForINF(true)
                .checkForNAN(true)
                .checkElapsedTime(true)
                .checkLocality(true)
                .checkWorkspaces(true)
                .build());