Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

#15061: Extended {to,from}_vector to support tilized layout, bf4/8 formats #16105

Merged
merged 6 commits into from
Dec 24, 2024

Conversation

omilyutin-tt
Copy link
Contributor

@omilyutin-tt omilyutin-tt commented Dec 17, 2024

Ticket

#15061

Problem description

to_vector / from_vector don't support some of the special cases, which prevents a more widespread adoption (distributing tensors across mesh of devices in particular).

What's changed

  • Support tilized layouts.
  • Support bf4 / bf8 data types with auto-padding.
  • Extended chunk / concat support for the added types.

Next steps

  • Optimize certain operations on-device, such as tilization, whenever possible.
  • Perform auto-padding in tilized layouts / when using sharding.
  • Switching pytensor logic to using from_vector API.

Checklist

cfjchu
cfjchu previously requested changes Dec 17, 2024
ttnn/cpp/ttnn/tensor/tensor.cpp Outdated Show resolved Hide resolved
ttnn/cpp/ttnn/tensor/tensor.cpp Outdated Show resolved Hide resolved
ttnn/cpp/ttnn/tensor/tensor.cpp Outdated Show resolved Hide resolved
ttnn/cpp/ttnn/tensor/tensor.cpp Show resolved Hide resolved
@omilyutin-tt
Copy link
Contributor Author

@cfjchu as per the discussion offline, discarding the requested changes to unblock the PR - the feedback will be addressed in follow ups to speed up tilization on-device

@omilyutin-tt omilyutin-tt merged commit c7e6b43 into main Dec 24, 2024
9 checks passed
@omilyutin-tt omilyutin-tt deleted the omilyutin/bf branch December 24, 2024 02:37
arikTT pushed a commit that referenced this pull request Dec 27, 2024
…rmats (#16105)

### Ticket
#15061

### Problem description
`to_vector` / `from_vector` don't support some of the special cases,
which prevents a more widespread adoption (distributing tensors across
mesh of devices in particular).

### What's changed
* Support tilized layouts.
* Support bf4 / bf8 data types with auto-padding.
* Extended `chunk` / `concat` support for the added types.

### Next steps
* Optimize certain operations on-device, such as tilization, whenever
possible.
* Perform auto-padding in tilized layouts / when using sharding.
* Switching pytensor logic to using `from_vector` API.

### Checklist
- [X] [Post commit CI
passes](https://github.com/tenstorrent/tt-metal/actions/runs/12422597810)
- [X] New/Existing tests provide coverage for changes

---------

Co-authored-by: Oleg Milyutin <omilyutin-tt@tenstorrent.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants