Make HighlightedTextClassifier work with <b> tags
#56
Replies: 9 comments
-
Hi @Elijas, I would like to work on this issue.
-
Awesome! 🚀 If you have any questions or if we could support you in any way, in addition to using this GitHub issue, feel free to join our #sec-parser channel to get community support 🙌
-
@Elijas I would like to work on this issue.
-
@HarikaB11 and @lchauha Just a simple "Hi, I'm Harika/Lchauha from GitHub" and we'll go from there 👍
-
@HarikaB11 and @lchauha But that may be discouraging to contributors who may have started working on a task that gets closed, so I'd recommend participating in the channel 🚀
-
@lchauha, I have already started working on it and will submit a PR by tomorrow. Please look into other issues.
-
@HarikaB11 @lchauha In case you haven't noticed, there is a weekly community meeting scheduled today. Feel free to join and catch up with fellow developers: Link to meeting message on Discord
-
Hey @HarikaB11 and @lchauha, can you share a little about your intended action plan for solving the issue? I would love to collaborate on the solutions 🙌 Thanks!
-
Hey, let's sync up on Discord if you'd still like to contribute to the task, as we're putting this next on the roadmap to be worked on by the internal team 🚀
-
Example document
https://www.sec.gov/Archives/edgar/data/1675149/000119312518236766/d828236d10q.htm
Goal
The "G. Accumulated Other Comprehensive Loss" heading should be recognized as a HighlightedTextElement (and therefore as a TitleElement).
Most likely, you will have to compute the percentage of text that is covered by the <b> tag, reusing the parts already implemented for the HighlightedTextElement. This will help you avoid situations where "text text text <b>bold</b> text text" is recognized as highlighted.
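The percentage idea above could be sketched roughly as follows. This is only an illustration using the Python standard library, not the sec-parser implementation: the names `BoldCoverageParser`, `bold_text_percentage`, `is_mostly_bold`, and the 80% threshold are all hypothetical, and the real classifier may count text differently.

```python
from html.parser import HTMLParser


class BoldCoverageParser(HTMLParser):
    """Counts non-whitespace characters inside and outside <b> tags.

    Hypothetical helper for illustration; not part of sec-parser.
    """

    def __init__(self):
        super().__init__()
        self.bold_depth = 0   # how many <b> tags are currently open
        self.bold_chars = 0   # non-whitespace characters inside <b>
        self.total_chars = 0  # all non-whitespace characters

    def handle_starttag(self, tag, attrs):
        if tag == "b":
            self.bold_depth += 1

    def handle_endtag(self, tag):
        if tag == "b" and self.bold_depth > 0:
            self.bold_depth -= 1

    def handle_data(self, data):
        count = sum(1 for ch in data if not ch.isspace())
        self.total_chars += count
        if self.bold_depth > 0:
            self.bold_chars += count


def bold_text_percentage(html: str) -> float:
    """Return the percentage (0-100) of visible text wrapped in <b>."""
    parser = BoldCoverageParser()
    parser.feed(html)
    parser.close()  # flush any buffered text
    if parser.total_chars == 0:
        return 0.0
    return 100.0 * parser.bold_chars / parser.total_chars


def is_mostly_bold(html: str, threshold: float = 80.0) -> bool:
    """Assumed rule: treat an element as highlighted above a threshold."""
    return bold_text_percentage(html) >= threshold
```

With a check like this, a fully bold heading such as `<b>G. Accumulated Other Comprehensive Loss</b>` would pass, while a sentence with a single bold word would not. The threshold value is an assumption and would need tuning against real filings.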