-
-
Notifications
You must be signed in to change notification settings - Fork 1.2k
archivebox.parsers.generic_html
Nick Sweeting edited this page Nov 13, 2024
·
2 revisions
:allowtitles:
:class: autosummary longtable
:align: left
* - {py:obj}`HrefParser <archivebox.parsers.generic_html.HrefParser>`
-
:class: autosummary longtable
:align: left
* - {py:obj}`parse_generic_html_export <archivebox.parsers.generic_html.parse_generic_html_export>`
- ```{autodoc2-docstring} archivebox.parsers.generic_html.parse_generic_html_export
:summary:
```
* - {py:obj}`did_urljoin_misbehave <archivebox.parsers.generic_html.did_urljoin_misbehave>`
- ```{autodoc2-docstring} archivebox.parsers.generic_html.did_urljoin_misbehave
:summary:
```
* - {py:obj}`fix_urljoin_bug <archivebox.parsers.generic_html.fix_urljoin_bug>`
- ```{autodoc2-docstring} archivebox.parsers.generic_html.fix_urljoin_bug
:summary:
```
:class: autosummary longtable
:align: left
* - {py:obj}`KEY <archivebox.parsers.generic_html.KEY>`
- ```{autodoc2-docstring} archivebox.parsers.generic_html.KEY
:summary:
```
* - {py:obj}`NAME <archivebox.parsers.generic_html.NAME>`
- ```{autodoc2-docstring} archivebox.parsers.generic_html.NAME
:summary:
```
* - {py:obj}`PARSER <archivebox.parsers.generic_html.PARSER>`
- ```{autodoc2-docstring} archivebox.parsers.generic_html.PARSER
:summary:
```
:canonical: archivebox.parsers.generic_html.HrefParser
Bases: {py:obj}`html.parser.HTMLParser`
````{py:method} handle_starttag(tag, attrs)
:canonical: archivebox.parsers.generic_html.HrefParser.handle_starttag
```{autodoc2-docstring} archivebox.parsers.generic_html.HrefParser.handle_starttag
```
````
:canonical: archivebox.parsers.generic_html.parse_generic_html_export
```{autodoc2-docstring} archivebox.parsers.generic_html.parse_generic_html_export
```
:canonical: archivebox.parsers.generic_html.KEY
:value: >
'html'
```{autodoc2-docstring} archivebox.parsers.generic_html.KEY
```
:canonical: archivebox.parsers.generic_html.NAME
:value: >
'Generic HTML'
```{autodoc2-docstring} archivebox.parsers.generic_html.NAME
```
:canonical: archivebox.parsers.generic_html.PARSER
:value: >
None
```{autodoc2-docstring} archivebox.parsers.generic_html.PARSER
```
:canonical: archivebox.parsers.generic_html.did_urljoin_misbehave
```{autodoc2-docstring} archivebox.parsers.generic_html.did_urljoin_misbehave
```
:canonical: archivebox.parsers.generic_html.fix_urljoin_bug
```{autodoc2-docstring} archivebox.parsers.generic_html.fix_urljoin_bug
```
- π’ Quickstart
- π₯οΈ Install
- π³ Docker
- β‘οΈ Supported Sources
- β¬ οΈ Supported Outputs
- οΉ©Command Line
- π Web UI
- 𧩠Browser Extension
- πΎ REST API / Webhooks
- π Python API / REPL / SQL API
- βοΈ Configuration
- π¦ Dependencies
- πΏ Disk Layout
- π Security Overview
- π Developer Documentation
- Upgrading
- Setting up Storage (NFS/SMB/S3/etc)
- Setting up Authentication (SSO/LDAP/etc)
- Setting up Search (rg/sonic/etc)
- Scheduled Archiving
- Publishing Your Archive
- Chromium Install
- Cookies & Sessions Setup
- Merging Collections
- Troubleshooting
- βοΈ Web Archiving Community
- Background & Motivation
- Comparison to Other Tools
- Architecture Diagram
- Changelog & Roadmap