When I use a Selenium Chrome scraper and parse the results with unstructured, the entire document tree is printed to the console. Any ideas on how to prevent this? The output looks like this:
<Element head at 0x7f28a865cdc0> <Element body at 0x7f289bf09600> self.document_tree <Element html at 0x7f28a0b1e700> page_element [<unstructured.documents.html.HTMLTitle object at 0x7f289bf780a0>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf781f0>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78250>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78400>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78430>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78670>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78610>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf781c0>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78490>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf784f0>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78550>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78640>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78040>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78850>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf787f0>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78880>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf788e0>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78df0>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78fa0>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78cd0>, <unstructured.documents.html.HTMLTitle object at 0x7f289bf78e20>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb72e0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb72b0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7fa0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7f40>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7ee0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7e80>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7e20>, 
<unstructured.documents.html.HTMLTitle object at 0x7f289beb7dc0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7d30>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7d00>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7d60>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7ac0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7be0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7af0>, <unstructured.documents.html.HTMLNarrativeText object at 0x7f289beb7c10>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7a60>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7940>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7a00>, <unstructured.documents.html.HTMLNarrativeText object at 0x7f289beb78b0>, <unstructured.documents.html.HTMLNarrativeText object at 0x7f289beb79a0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beaea00>, <unstructured.documents.html.HTMLTitle object at 0x7f289beb7880>, <unstructured.documents.html.HTMLTitle object at 0x7f289beaea60>, <unstructured.documents.html.HTMLText object at 0x7f289beae9d0>, <unstructured.documents.html.HTMLNarrativeText object at 0x7f289beae9a0>, <unstructured.documents.html.HTMLNarrativeText object at 0x7f289beae820>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae850>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae610>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae7c0>, <unstructured.documents.html.HTMLNarrativeText object at 0x7f289beae940>, <unstructured.documents.html.HTMLNarrativeText object at 0x7f289beae6a0>, <unstructured.documents.html.HTMLNarrativeText object at 0x7f289beae520>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae340>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae3a0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae3d0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae2b0>, 
<unstructured.documents.html.HTMLTitle object at 0x7f289beae250>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae220>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae0d0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae040>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae1c0>, <unstructured.documents.html.HTMLTitle object at 0x7f289beae160>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba100>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba070>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba130>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba190>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba1f0>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba250>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba2b0>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba3a0>, <unstructured.documents.html.HTMLText object at 0x7f28937ba340>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba3d0>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba4c0>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba460>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba580>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba490>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba5b0>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba6a0>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba640>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba6d0>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba730>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba790>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba7f0>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba8e0>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba880>, <unstructured.documents.html.HTMLTitle object at 0x7f28937ba910>, 
<unstructured.documents.html.HTMLTitle object at 0x7f28937ba970>, <unstructured.documents.html.HTMLTitle object at 0x7f28937baa90>]
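One possible workaround, offered as a sketch rather than a confirmed fix: if the lines above come from plain `print` calls made somewhere during parsing, the standard library's `contextlib.redirect_stdout` can discard them around the call. In the example below, `noisy_parse` is a hypothetical stand-in for whatever parsing call produces the output, and `call_quietly` is an illustrative helper name; neither is part of unstructured or Selenium.

```python
import contextlib
import io


def noisy_parse(html):
    """Hypothetical stand-in for a parsing call that prints debug output."""
    print("self.document_tree", "<Element html>")  # simulated stray debug print
    return [line.strip() for line in html.splitlines() if line.strip()]


def call_quietly(func, *args, **kwargs):
    """Run func, discarding anything it writes to sys.stdout, and return its result."""
    sink = io.StringIO()
    with contextlib.redirect_stdout(sink):
        return func(*args, **kwargs)


# The return value is preserved; only the printed output is dropped.
elements = call_quietly(noisy_parse, "<h1>Title</h1>\n<p>Body</p>")
```

Note that `redirect_stdout` only intercepts writes that go through Python's `sys.stdout`. Output written directly to the file descriptor by native code, or anything sent to stderr or a logger, would need a different approach.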