v0.7.0 - Markdown Support #112
benbrandt announced in Announcements
Replies: 1 comment
Awesome!
What's New
Markdown Support! Both the Rust crate and Python package have a new `MarkdownSplitter` you can use to split Markdown text. It leverages the great work of the `pulldown-cmark` crate to parse Markdown according to the CommonMark spec, and allows for very fine-grained control over how to split the text.

In terms of use, the API is identical to the `TextSplitter`, so you should be able to just drop it in when you have Markdown available instead of just plain text.

Rust
Python
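To give a feel for what splitting along Markdown structure means, here is a minimal standard-library sketch. It is not the package's implementation (which parses CommonMark via `pulldown-cmark`): the function name `split_markdown_by_heading` and its regex-based heading detection are assumptions made for illustration, and it only descends two levels (headings, then paragraphs).

```python
import re
from typing import List

# Hypothetical helper for illustration only; not part of the package.
# Matches the start of an ATX heading line such as "# Title" or "## Section".
HEADING = re.compile(r"^#{1,6}\s", re.MULTILINE)


def split_markdown_by_heading(text: str, max_characters: int) -> List[str]:
    """Split at headings first; fall back to paragraphs for oversized sections."""
    # Cut the text right before every heading line.
    starts = [m.start() for m in HEADING.finditer(text)]
    if not starts or starts[0] != 0:
        starts.insert(0, 0)  # keep any text that precedes the first heading
    bounds = starts + [len(text)]
    sections = [text[a:b].strip() for a, b in zip(bounds, bounds[1:])]

    chunks: List[str] = []
    for section in sections:
        if not section:
            continue
        if len(section) <= max_characters:
            chunks.append(section)
            continue
        # Section is too large: descend one level and split on blank lines.
        # A paragraph may still exceed the limit; a real splitter would keep
        # descending (sentences, words, characters).
        for paragraph in re.split(r"\n\s*\n", section):
            paragraph = paragraph.strip()
            if paragraph:
                chunks.append(paragraph)
    return chunks


doc = "# A\n\nshort\n\n## B\n\npara one is here\n\npara two is here"
for chunk in split_markdown_by_heading(doc, max_characters=20):
    print(repr(chunk))
```

A real CommonMark parser also avoids mistaking `#` lines inside fenced code blocks for headings, which this regex does not.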
Breaking Changes
Rust
MSRV is now 1.75.0 since the ability to use `impl Trait` in trait methods allowed for much simpler internal APIs to enable the `MarkdownSplitter`.

Python
The `CharacterTextSplitter`, `HuggingFaceTextSplitter`, `TiktokenTextSplitter`, and `CustomTextSplitter` classes have now all been consolidated into a single `TextSplitter` class. All of the previous use cases are still supported; you just need to instantiate the class with various class methods.

Below are the changes you need to make to your code to upgrade to v0.7.0:
CharacterTextSplitter
HuggingFaceTextSplitter
TiktokenTextSplitter
CustomTextSplitter
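The consolidation of the four classes follows the common alternate-constructor pattern: one class, with class methods that configure how chunk size is measured. The sketch below is a toy stand-in, not the package's code. `SketchTextSplitter`, its greedy word packing, and the signatures shown are hypothetical, so check the package documentation for the real class-method names and arguments.

```python
from typing import Callable, List


class SketchTextSplitter:
    """Toy stand-in (hypothetical) showing one class with alternate constructors."""

    def __init__(self, size_fn: Callable[[str], int] = len) -> None:
        # Default constructor measures chunk size in characters.
        self._size_fn = size_fn

    @classmethod
    def from_callback(cls, callback: Callable[[str], int]) -> "SketchTextSplitter":
        # Alternate constructor: measure chunk size with any user-supplied function.
        return cls(callback)

    def chunks(self, text: str, capacity: int) -> List[str]:
        """Greedily pack whitespace-separated words into chunks within capacity."""
        out: List[str] = []
        current = ""
        for word in text.split():
            candidate = word if not current else current + " " + word
            if current and self._size_fn(candidate) > capacity:
                out.append(current)
                current = word  # a single oversized word becomes its own chunk
            else:
                current = candidate
        if current:
            out.append(current)
        return out


# Character-based sizing (the plain constructor).
by_chars = SketchTextSplitter()
print(by_chars.chunks("one two three four", 7))

# Custom sizing: count words instead of characters.
by_words = SketchTextSplitter.from_callback(lambda t: len(t.split()))
print(by_words.chunks("one two three four", 3))
```

The point of the design is that calling code only changes at construction time; the `chunks` call stays the same whichever way the size is measured.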
New Contributors
Full Changelog: v0.6.3...v0.7.0
This discussion was created from the release v0.7.0 - Markdown Support.