Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ROCM] Add nanoo fp8 support in type traits #17890

Closed
wants to merge 1 commit into from

Conversation

ScXfjiang
Copy link
Contributor

We are trying to add NANOO FP8 support in TensorFlow. To achieve this, a small modification in the type_traits.h file on the XLA side is required, where the NANOO FP8 should also be classified as a simple type. This change will be utilized in tensor.cc within TensorFlow. You can find the relevant code here: TensorFlow tensor.cc.

@ScXfjiang
Copy link
Contributor Author

ScXfjiang commented Oct 4, 2024

FYI: For the context on NANOO FP8, please refer to this comment:
google/flax#3993 (comment) @majnemer

@ScXfjiang
Copy link
Contributor Author

Hi @majnemer could you take a look at this PR?

@i-chaochen
Copy link
Contributor

Hi @xla-rotation could you check this PR please?

copybara-service bot pushed a commit that referenced this pull request Oct 11, 2024
Imported from GitHub PR #17890

We are trying to add NANOO FP8 support in TensorFlow. To achieve this, a small modification in the `type_traits.h` file on the XLA side is required, where the NANOO FP8 should also be classified as a simple type. This change will be utilized in tensor.cc within TensorFlow. You can find the relevant code here: [TensorFlow tensor.cc](https://github.com/tensorflow/tensorflow/blob/ba7f93b8dc32b64cc0bb5ddb44e75bd344ae67d0/tensorflow/core/framework/tensor.cc#L175).
Copybara import of the project:

--
1c27aeb by scxfjiang <xuefei.jiang@amd.com>:

add nanoo fp8 support in type traits

Merging this change closes #17890

Reverts changelist 673529458

FUTURE_COPYBARA_INTEGRATE_REVIEW=#17890 from ROCm:ci_nanoo_fp8_in_type_traits 1c27aeb
PiperOrigin-RevId: 684839620
copybara-service bot pushed a commit to tensorflow/tensorflow that referenced this pull request Oct 11, 2024
Imported from GitHub PR openxla/xla#17890

We are trying to add NANOO FP8 support in TensorFlow. To achieve this, a small modification in the `type_traits.h` file on the XLA side is required, where the NANOO FP8 should also be classified as a simple type. This change will be utilized in tensor.cc within TensorFlow. You can find the relevant code here: [TensorFlow tensor.cc](https://github.com/tensorflow/tensorflow/blob/ba7f93b8dc32b64cc0bb5ddb44e75bd344ae67d0/tensorflow/core/framework/tensor.cc#L175).
Copybara import of the project:

--
1c27aebbc206559861ee8d5e5f44522a2a44858b by scxfjiang <xuefei.jiang@amd.com>:

add nanoo fp8 support in type traits

Merging this change closes #17890

FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#17890 from ROCm:ci_nanoo_fp8_in_type_traits 1c27aebbc206559861ee8d5e5f44522a2a44858b
PiperOrigin-RevId: 684839620
copybara-service bot pushed a commit to tensorflow/tensorflow that referenced this pull request Oct 11, 2024
Imported from GitHub PR openxla/xla#17890

We are trying to add NANOO FP8 support in TensorFlow. To achieve this, a small modification in the `type_traits.h` file on the XLA side is required, where the NANOO FP8 should also be classified as a simple type. This change will be utilized in tensor.cc within TensorFlow. You can find the relevant code here: [TensorFlow tensor.cc](https://github.com/tensorflow/tensorflow/blob/ba7f93b8dc32b64cc0bb5ddb44e75bd344ae67d0/tensorflow/core/framework/tensor.cc#L175).
Copybara import of the project:

--
1c27aebbc206559861ee8d5e5f44522a2a44858b by scxfjiang <xuefei.jiang@amd.com>:

add nanoo fp8 support in type traits

Merging this change closes #17890

PiperOrigin-RevId: 684917294
copybara-service bot pushed a commit to tensorflow/tensorflow that referenced this pull request Oct 11, 2024
…placements in memory space assignment. Both features are enabled by default.

FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#17890 from ROCm:ci_nanoo_fp8_in_type_traits 1c27aebbc206559861ee8d5e5f44522a2a44858b
PiperOrigin-RevId: 680765291
copybara-service bot pushed a commit to tensorflow/tensorflow that referenced this pull request Oct 11, 2024
FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#17890 from ROCm:ci_nanoo_fp8_in_type_traits 1c27aebbc206559861ee8d5e5f44522a2a44858b
PiperOrigin-RevId: 684189732
copybara-service bot pushed a commit to tensorflow/tensorflow that referenced this pull request Oct 11, 2024
- Moves the 3 overloads into the subclasses
- Moves the corresponding GpuDriver functions
- Adds basic unit tests

FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#17890 from ROCm:ci_nanoo_fp8_in_type_traits 1c27aebbc206559861ee8d5e5f44522a2a44858b
PiperOrigin-RevId: 684699241
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants