Add requested sites (#98)
* feat(scraper): add annuairevert magasins and produits scraper

* feat(scraper): add voeazul scraper

* feat(scraper): add byggdinframtid scraper
ricardofelgueiras authored May 24, 2024
1 parent 4cc9943 commit 3c2afe9
Showing 4 changed files with 96 additions and 0 deletions.
14 changes: 14 additions & 0 deletions src/scrappers/annuairevert-magasins.yml
@@ -0,0 +1,14 @@
url: https://www.annuairevert.com/magasins*
listElementsQuery: .company
elementParser:
  - title: Image
    query: img
    type: image

  - title: Title
    query: .title-container > p
    type: text

  - title: Address
    query: address > span
    type: text
18 changes: 18 additions & 0 deletions src/scrappers/annuairevert-produits.yml
@@ -0,0 +1,18 @@
url: https://www.annuairevert.com/produits*
listElementsQuery: '[data-type="product"] > .row > a'
elementParser:
  - title: Image
    query: img
    type: image

  - title: Title
    query: .title-container > p
    type: text

  - title: Fabricant
    query: .manufacturer > .entreprise-name
    type: text

  - title: Distributeur
    query: .distributor > .distributor-name
    type: text
42 changes: 42 additions & 0 deletions src/scrappers/byggdinframtid.yml
@@ -0,0 +1,42 @@
url: https://byggdinframtid.se/se*
listElementsQuery: .school
elementParser:
  - title: Logo
    query: .logo > img
    type: image

  - title: Type
    query: .type
    type: text

  - title: Name
    query: h3
    type: text

  - title: Description
    query: .desc > div > p
    type: text

  - title: Specializations
    query: .item:nth-child(1)
    type: text

  - title: Occupations
    query: .item:nth-child(2)
    type: text

  - title: Website
    query: .info-contact > .item:nth-child(1) > a
    type: link

  - title: Contact info 1
    query: .info-contact > .item:nth-child(2) > a
    type: link

  - title: Contact info 2
    query: .info-contact > .item:nth-child(3)
    type: text

  - title: Contact info 3
    query: .info-contact > .item:nth-child(4)
    type: text
22 changes: 22 additions & 0 deletions src/scrappers/voeazul.yml
@@ -0,0 +1,22 @@
url: 'https://www.voeazul.com.br/br/pt/home/selecao-voo*'
listElementsQuery: '.trip-container .flight-card'
elementParser:
  - title: Departure time
    query: .departure
    type: text

  - title: Arrival time
    query: .arrival
    type: text

  - title: Flight leg information
    query: .flight-leg-info
    type: text

  - title: Duration
    query: button
    type: text

  - title: Price starting at
    query: '[data-test-id="fare-price"]'
    type: text
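
Note: all four files share the same shape: a `url` pattern ending in `*`, a `listElementsQuery` selector, and an `elementParser` list of `title`/`query`/`type` entries. The sketch below shows one way such a config could be sanity-checked once it has been parsed into a dict. It is not the project's own loader: `validate_config`, `url_matches`, and `KNOWN_TYPES` are hypothetical names, the set of types only reflects the values seen in this commit, and treating the trailing `*` as a glob wildcard is an assumption.

```python
from fnmatch import fnmatchcase

# Field types that appear in the configs of this commit (assumption: the
# real scraper may accept others).
KNOWN_TYPES = {"image", "text", "link"}


def validate_config(config: dict) -> list:
    """Return a list of problems found in a parsed scraper config."""
    problems = []
    for key in ("url", "listElementsQuery", "elementParser"):
        if key not in config:
            problems.append(f"missing key: {key}")
    for i, field in enumerate(config.get("elementParser", [])):
        for key in ("title", "query", "type"):
            if key not in field:
                problems.append(f"elementParser[{i}] missing key: {key}")
        if "type" in field and field["type"] not in KNOWN_TYPES:
            problems.append(f"elementParser[{i}] unknown type: {field['type']}")
    return problems


def url_matches(pattern: str, url: str) -> bool:
    """Assumed semantics: `*` in the pattern matches any run of characters."""
    return fnmatchcase(url, pattern)


# The annuairevert-magasins config from this commit, written as a dict.
magasins = {
    "url": "https://www.annuairevert.com/magasins*",
    "listElementsQuery": ".company",
    "elementParser": [
        {"title": "Image", "query": "img", "type": "image"},
        {"title": "Title", "query": ".title-container > p", "type": "text"},
        {"title": "Address", "query": "address > span", "type": "text"},
    ],
}

print(validate_config(magasins))
print(url_matches(magasins["url"], "https://www.annuairevert.com/magasins?page=2"))
```

A check like this would catch a misspelled key or an unsupported `type` before a config is shipped; it says nothing about whether the selectors still match the live pages.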