Add Hugging Face chat wrapper #14040

andrewrreed · 2023-11-29T19:24:34Z

Issue

There previously has been no easy way to make use of models hosted on Hugging Face (via Inference API or Inference Endpoints) in combination with LangChains ChatModel abstraction.

Description

This PR introduces a new chat_model integration that creates a wrapper around the BaseChatModel class that interfaces between LangChain's and the hosted LLM by leveraging Hugging Face's Chat Templates.

To do

Add wrapper class aroundBaseChatModel to interface with HF LLM integrations
Create a notebook docs/integrations/chat that demonstrates its use
Add unit test

Tag maintainer

@hwchase17

Twitter handle

@andrewrreed

vercel · 2023-11-29T19:24:39Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
langchain	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Dec 14, 2023 5:29pm

fix formatting

andrewrreed · 2023-11-29T23:12:18Z

@hwchase17 When running the unit tests locally, there are 22 tests that are failing, all of which are inside these two files:

tests/unit_tests/callbacks/tracers/test_base_tracer.py
tests/unit_tests/callbacks/tracers/test_langchain_v1.py

However, when I run pytest on each of those files individually with poetry run pytest --disable-socket --allow-unix-socket <test-file> they all pass.... Not sure why this is happening?

baskaryan

what's the unit test error/failure you're seeing?

libs/langchain/langchain/chat_models/huggingface_chat_wrapper.py

andrewrreed · 2023-11-30T23:50:13Z

@baskaryan The unit tests were failing due to that global import of an optional dependency. Fixed that so everything is passing locally now.

Let me know if there's anything else I need to add here (like integration tests)? Thanks!

andrewrreed · 2023-12-05T17:31:40Z

@baskaryan Looks like huggingface_chat_wrapper is failing some of the linting checks from mypy. I'm not sure what is wrong here, but the failures seem to be stemming from these three issues:

langchain/chat_models/huggingface_chat_wrapper.py:43: error: Variable "langchain.llms.HuggingFaceTextGenInference" is not valid as a type [valid-type]
langchain/chat_models/huggingface_chat_wrapper.py:43: error: Variable "langchain.llms.HuggingFaceEndpoint" is not valid as a type [valid-type]
langchain/chat_models/huggingface_chat_wrapper.py:43: error: Variable "langchain.llms.HuggingFaceHub" is not valid as a type [valid-type]

When you get the chance, could you help advise on what might be wrong? Thanks!

@baskaryan

- **Description:** Added a tool called RedditSearchRun and an accompanying API wrapper, which searches Reddit for posts with support for time filtering, post sorting, query string and subreddit filtering. - **Issue:** langchain-ai#13891 - **Dependencies:** `praw` module is used to search Reddit - **Tag maintainer:** @baskaryan , and any of the other maintainers if needed - **Twitter handle:** None. Hello, This is our first PR and we hope that our changes will be helpful to the community. We have run `make format`, `make lint` and `make test` locally before submitting the PR. To our knowledge, our changes do not introduce any new errors. Our PR integrates the `praw` package which is already used by RedditPostsLoader in LangChain. Nonetheless, we have added integration tests and edited unit tests to test our changes. An example notebook is also provided. These changes were put together by me, @Anika2000, @CharlesXu123, and @Jeremy-Cheng-stack Thank you in advance to the maintainers for their time. --------- Co-authored-by: What-Is-A-Username <49571870+What-Is-A-Username@users.noreply.github.com> Co-authored-by: Anika2000 <anika.sultana@mail.utoronto.ca> Co-authored-by: Jeremy Cheng <81793294+Jeremy-Cheng-stack@users.noreply.github.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>

This reverts commit 38813d7. This is a temporary fix, as I don't see a clear way on how to use multiple keys with `Qdrant.from_texts`. Context: langchain-ai#14378

@efriis

The namespaces like `langchain.agents.format_scratchpad` clogging the API Reference sidebar. This change removes those 3-level namespaces from sidebar (this issue was discussed with @efriis ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>

Many jupyter notebooks didn't pass linting. List of these files are presented in the [tool.ruff.lint.per-file-ignores] section of the pyproject.toml . Addressed these bugs: - fixed bugs; added missed imports; updated pyproject.toml Only the `document_loaders/tensorflow_datasets.ipyn`, `cookbook/gymnasium_agent_simulation.ipynb` are not completely fixed. I'm not sure about imports. --------- Co-authored-by: Erick Friis <erick@langchain.dev>

Updated provider page by adding LLM and ChatLLM references; removed a content that is duplicate text from the LLM referenced page. Updated the collback page

…n into add-hf-chat-wrapper

libs/langchain/langchain/chat_models/huggingface.py

andrewrreed · 2023-12-12T13:06:10Z

Thanks for your help on this @A-Roucher!

@baskaryan This PR is now passing all tests and linting checks. Please let us know if anything else is needed to get this merged!

aymeric-roucher · 2023-12-12T13:16:52Z

No problem @andrewrreed, looking forward to start using this integration!

This reverts commit a8f39af.

Builds on #14040 with community refactor merged and notebook updated. Note that with this refactor, models will be imported from `langchain_community.chat_models.huggingface` rather than the main `langchain` repo. --------- Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Signed-off-by: ugm2 <unaigaraymaestre@gmail.com> Signed-off-by: Yuchen Liang <yuchenl3@andrew.cmu.edu> Co-authored-by: Andrew Reed <andrew.reed.r@gmail.com> Co-authored-by: Andrew Reed <areed1242@gmail.com> Co-authored-by: A-Roucher <aymeric.roucher@gmail.com> Co-authored-by: Aymeric Roucher <69208727+A-Roucher@users.noreply.github.com>

baskaryan · 2023-12-21T17:29:04Z

landed in #14736

andrewrreed added 4 commits November 29, 2023 13:08

Create huggingface_chat_wrapper.py

333c1bc

apply formatter

216db23

Update __init__.py

07aa56c

Create huggingface_chat_wrapper.ipynb

b261a81

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. Ɑ: models Related to LLMs or chat model modules 🤖:enhancement A large net-new component, integration, or chain. Use sparingly. The largest features labels Nov 29, 2023

andrewrreed changed the title ~~Add Hugging Face chat wrapper~~ [WIP] Add Hugging Face chat wrapper Nov 29, 2023

andrewrreed added 2 commits November 29, 2023 16:08

Update __init__.py

41ccd6a

fix formatting

Update test_imports.py

1efb7dd

Merge branch 'master' into add-hf-chat-wrapper

f5e2445

vercel bot had a problem deploying to Preview November 29, 2023 23:51 Failure

baskaryan reviewed Nov 30, 2023

View reviewed changes

libs/langchain/langchain/chat_models/huggingface_chat_wrapper.py Outdated Show resolved Hide resolved

move optional dependencies to local imports

6845454

vercel bot deployed to Preview November 30, 2023 23:36 View deployment

andrewrreed added 3 commits November 30, 2023 18:44

add option to override tokenizer

2cd4473

Update huggingface_chat_wrapper.py

72b5d3a

Create test_huggingface_chat_wrapper.py

36c2c74

vercel bot deployed to Preview December 1, 2023 00:05 View deployment

Fix linting error cuased by unused import

e1babfc

vercel bot deployed to Preview December 1, 2023 22:30 View deployment

andrewrreed changed the title ~~[WIP] Add Hugging Face chat wrapper~~ Add Hugging Face chat wrapper Dec 2, 2023

aymeric-roucher and others added 2 commits December 8, 2023 15:52

Solved lint errors and corrected naming

d73a15e

Merge branch 'master' into add-hf-chat-wrapper

169afea

vercel bot deployed to Preview December 8, 2023 16:02 View deployment

baskaryan and others added 13 commits December 11, 2023 16:20

core[patch]: Release 0.0.11 (langchain-ai#14367)

695b890

langchain[patch]: Release 0.0.347 (langchain-ai#14368)

98834df

langchain[patch]: fix ChatVertexAI streaming (langchain-ai#14369)

8536011

langchain[patch]: Rollback multiple keys in Qdrant (langchain-ai#14390)

7d23235

This reverts commit 38813d7. This is a temporary fix, as I don't see a clear way on how to use multiple keys with `Qdrant.from_texts`. Context: langchain-ai#14378

core[patch], langchain[patch]: fix required deps (langchain-ai#14373)

de3bb72

core[patch]: Release 0.0.12 (langchain-ai#14415)

798f068

langchain[patch]: Release 0.0.348 (langchain-ai#14417)

97ff27b

experimental[patch]: Release 0.0.45 (langchain-ai#14418)

82f659c

docs[patch]: promptlayer pages update (langchain-ai#14416)

3e45778

Updated provider page by adding LLM and ChatLLM references; removed a content that is duplicate text from the LLM referenced page. Updated the collback page

Merge branch 'add-hf-chat-wrapper' of github.com:andrewrreed/langchai…

da70036

…n into add-hf-chat-wrapper

Clean up imports

e72a67e

andrewrreed commented Dec 12, 2023

View reviewed changes

libs/langchain/langchain/chat_models/huggingface.py Show resolved Hide resolved

aymeric-roucher and others added 3 commits December 12, 2023 10:19

Solve lint errors

1559f09

run formatter

a9789a6

Update huggingface.ipynb

23a6d29

vercel bot deployed to Preview December 12, 2023 13:10 View deployment

Remove unnecessary prompt checks

d7aa3a1

vercel bot deployed to Preview December 12, 2023 17:35 View deployment

Streaming + linting

a8f39af

vercel bot had a problem deploying to Preview December 14, 2023 16:36 Failure

Revert "Streaming + linting"

48cf247

This reverts commit a8f39af.

vercel bot deployed to Preview December 14, 2023 17:29 View deployment

jacoblee93 mentioned this pull request Dec 14, 2023

Jacob/add hf chat wrapper #14736

Merged

baskaryan closed this Dec 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Hugging Face chat wrapper #14040

Add Hugging Face chat wrapper #14040

andrewrreed commented Nov 29, 2023 •

edited

Loading

vercel bot commented Nov 29, 2023 •

edited

Loading

andrewrreed commented Nov 29, 2023

baskaryan left a comment

andrewrreed commented Nov 30, 2023 •

edited

Loading

andrewrreed commented Dec 5, 2023

andrewrreed commented Dec 12, 2023

aymeric-roucher commented Dec 12, 2023 •

edited

Loading

baskaryan commented Dec 21, 2023

Add Hugging Face chat wrapper #14040

Add Hugging Face chat wrapper #14040

Conversation

andrewrreed commented Nov 29, 2023 • edited Loading

Issue

Description

To do

Tag maintainer

Twitter handle

vercel bot commented Nov 29, 2023 • edited Loading

andrewrreed commented Nov 29, 2023

baskaryan left a comment

Choose a reason for hiding this comment

andrewrreed commented Nov 30, 2023 • edited Loading

andrewrreed commented Dec 5, 2023

andrewrreed commented Dec 12, 2023

aymeric-roucher commented Dec 12, 2023 • edited Loading

baskaryan commented Dec 21, 2023

andrewrreed commented Nov 29, 2023 •

edited

Loading

vercel bot commented Nov 29, 2023 •

edited

Loading

andrewrreed commented Nov 30, 2023 •

edited

Loading

aymeric-roucher commented Dec 12, 2023 •

edited

Loading