Workflow TTS playback node filtering issue. #6877

ic-xu · 2024-08-01T06:33:37Z

Checklist:

Important

Please review the checklist below before submitting your pull request.

Please open an issue before creating a PR or link to an existing issue
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I ran dev/reformat(backend) and cd web && npx lint-staged(frontend) to appease the lint gods

Description

In workflow chat, if a workflow contains multiple LLM nodes and the TTS auto-play feature is enabled, TTS plays the output of every LLM node. This may not be what the user wants. This fix ensures that TTS plays only the text output of the end node, so that TTS reads exactly what the user sees.
For example:

In such cases, TTS should play only the output of the LLM2 node: the final output of the Answer node is the data from LLM2, which is also what the user sees.
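
The filtering idea can be sketched roughly as follows. This is only an illustration, not the code in this PR: `NodeChunk`, `tts_text`, and the node type strings are hypothetical names, and it assumes each streamed text chunk carries the type of the node that emitted it.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass
class NodeChunk:
    """Hypothetical streamed text chunk; not Dify's actual event type."""
    node_id: str
    node_type: str  # assumed values: "llm", "answer"
    text: str


def tts_text(
    chunks: Iterable[NodeChunk],
    spoken_types: frozenset = frozenset({"answer"}),
) -> Iterator[str]:
    """Yield only the text that should be forwarded to TTS.

    Chunks from intermediate nodes (for example LLM1) are skipped,
    so the audio matches what the user sees in the final answer.
    """
    for chunk in chunks:
        if chunk.node_type in spoken_types and chunk.text:
            yield chunk.text


# LLM1 -> LLM2 -> Answer, where the Answer node outputs LLM2's text.
stream = [
    NodeChunk("llm1", "llm", "intermediate draft"),
    NodeChunk("llm2", "llm", "final reply"),
    NodeChunk("answer", "answer", "final reply"),
]
spoken = list(tts_text(stream))
```

With this stream, only the chunk emitted by the answer node reaches TTS; the text produced by LLM1 and the raw LLM2 output are never spoken.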

Fixes

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update, included: Dify Document
Improvement, including but not limited to code refactoring, performance optimization, and UI/UX improvement
Dependency upgrade

Testing Instructions

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration

Test A
Test B

laipz8200 · 2024-08-02T04:13:20Z

What will happen if this workflow has multiple branches and answer nodes?

ic-xu · 2024-08-02T08:14:46Z

What will happen if this workflow has multiple branches and answer nodes?

Regardless of whether there are multiple workflows or multiple answer nodes, the final output of the program always comes from a single answer node. TTS should therefore play the output of that answer node, so that what the user hears matches what they see.

ic-xu · 2024-08-02T08:16:29Z

I am continuing to improve things in this direction, and I have already tested this change. If you run into any new issues, I would be glad to discuss them.

laipz8200

Thanks!

* refs/heads/feat/web-app-sso:
feat: web sso app
Fixed a bug where permission was clearly displaye… (#6934)
fix: The permissions issue of the editor role accessing some backend … (#6945)
Fix: tag & settings modal in dataset card in Firefox (#6953)
fix: ensure db migration in docker entry script running with `upgrade-db` command for proper locking (#6946)
chore: fix markdown format and one typo (#6939)
fix: restore xinference secret field (#6941)
Fix increase_usage of total_price in agent_runner (#6688)
fix: import workflow errors (#6937)
Workflow TTS playback node filtering issue. (#6877)
compatible xinference reranker server (#6927)
fix: workflow trace user_id error (#6932)
fix: sending app trace data to other app trace provider (#6931)

ic-xu added 2 commits August 1, 2024 11:22

fix:1. The problem of the open AI tts not taking effect when switchin…

84dfc70

…g voices; 2. The bug that the front-end tts loading status must be displayed after the HTTP request is completed; 3. The bug that the front-end console errors when switching voices and the voice does not exist.

dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. 🌊 feat:workflow Workflow related stuff. 🐞 bug Something isn't working labels Aug 1, 2024

ic-xu changed the title ~~Workflow tts issues~~ Workflow TTS playback node filtering issue. Aug 1, 2024

crazywoola requested review from takatost and laipz8200 August 1, 2024 06:45

fix: If the previous node of Answer is not an LLM node, it can still …

0c537bb

…play normally.

dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. and removed size:S This PR changes 10-29 lines, ignoring generated files. labels Aug 2, 2024

fix: If the previous node of Answer is not an LLM node, it can still …

2a50a1d

…play normally.

laipz8200 approved these changes Aug 4, 2024

dosubot bot added the lgtm This PR has been approved by a maintainer label Aug 4, 2024

laipz8200 merged commit dff3f41 into langgenius:main Aug 4, 2024
5 checks passed

cuiks pushed a commit to cuiks/dify that referenced this pull request Aug 6, 2024

Workflow TTS playback node filtering issue. (langgenius#6877)

51dc7fa
