limited scopes!! #28

Closed
amad3us47 opened this issue Jul 26, 2023 · 11 comments

Comments

@amad3us47

It only returns about 400 search result URLs.
How can we get more than that?

https://support.google.com/websearch/thread/24227169/i-can-t-see-all-search-results-there-are-less-results-than-google-thinks?hl=en

@opsdisk
Owner

opsdisk commented Jul 31, 2023

@amad3us47 Can you provide more information? What were the values you used for the search? What was the search string query?

@opsdisk
Owner

opsdisk commented Jul 31, 2023

Is this what you're referring to?

image

@amad3us47
Author

amad3us47 commented Jul 31, 2023

Yeah. Is there an alternative, such as a search engine without Google-style SEO filtering, that this tool could integrate with? It's for a bug bounty script.
The Wayback Machine could do that using Tom's waybackurls tool, but it's an archive of recorded indexes, so it's not that good.

@opsdisk
Owner

opsdisk commented Aug 1, 2023

This returned 420 URLs when I set max_search_result_urls_to_return=600, so not sure how accurate the 400 cap is.


import yagooglesearch

query = "computer"

client = yagooglesearch.SearchClient(
    query,
    tbs="li:1",
    max_search_result_urls_to_return=600,
    http_429_cool_off_time_in_minutes=45,
    http_429_cool_off_factor=1.5,
    # proxy="socks5h://127.0.0.1:9050",
    verbosity=5,
    verbose_output=True,  # False (only URLs) or True (rank, title, description, and URL)
)
client.assign_random_user_agent()

urls = client.search()

len(urls)

@amad3us47
Author

amad3us47 commented Aug 1, 2023

That still won't return the maximum number of results.

Google is limiting the search results (it's a feature).
I will need to find a different search engine or some other way around that.

@opsdisk
Owner

opsdisk commented Aug 1, 2023

What were your search criteria that you ran into that limit?

@amad3us47
Author

I was indexing a country's sites with a dork (site:.pk).
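
A possible workaround for a broad dork like site:.pk, offered only as a sketch: split it into narrower queries and merge the de-duplicated results. Nothing in this thread confirms that this gets past the roughly 400-URL cap. The helper names (`merge_search_results`, `yagooglesearch_urls`) and the list of second-level domains are illustrative assumptions; the `SearchClient` arguments are the ones already used in this thread.

```python
from typing import Callable, Iterable, List


def merge_search_results(
    queries: Iterable[str],
    search: Callable[[str], List[str]],
) -> List[str]:
    """Run each query through `search`; return unique URLs in first-seen order."""
    seen = set()
    merged = []
    for query in queries:
        for url in search(query):
            if url not in seen:
                seen.add(url)
                merged.append(url)
    return merged


def yagooglesearch_urls(query: str) -> List[str]:
    """Hypothetical wrapper: one yagooglesearch run that returns only URLs."""
    import yagooglesearch  # imported lazily so the merge helper works without it

    client = yagooglesearch.SearchClient(
        query,
        max_search_result_urls_to_return=400,
        verbose_output=False,  # only URLs
    )
    client.assign_random_user_agent()
    return client.search()


# Illustrative narrower dorks (assumption: these cover the sites of interest).
SUB_QUERIES = ["site:.gov.pk", "site:.edu.pk", "site:.com.pk"]

# Example call (performs real searches, so it is left commented out):
# urls = merge_search_results(SUB_QUERIES, yagooglesearch_urls)
```

Passing the search function in as an argument keeps the merge step independent of any one search engine, so the same helper could be reused if a different backend is tried later.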

@tw-evan

tw-evan commented Aug 2, 2023

I’ve also encountered this problem!

@amad3us47
Author

@tw-evan Have you come across any solution for this?

@opsdisk
Owner

opsdisk commented Aug 5, 2023

Yeah, I got 399 when using the site:.pk query. It might be a limitation of using the GUI and not the official search API (https://developers.google.com/custom-search/v1/overview). If that's the case, there's not much yagooglesearch can do, so I'll just have to add a note in the docs not to expect more than 400. I'll keep this open for the time being.
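
For reference, a minimal sketch of how paging through the official Custom Search JSON API could look. It only builds the request URLs and sends nothing. The function name and the `api_key` / `engine_id` placeholders are assumptions; `key`, `cx`, `q`, `start`, and `num` are the API's query parameters. Note that the API documentation describes a ceiling of 100 results per query (10 per request), so switching to it may not lift the limit discussed here.

```python
from urllib.parse import urlencode

API_ENDPOINT = "https://www.googleapis.com/customsearch/v1"
PAGE_SIZE = 10     # the API returns at most 10 results per request
MAX_RESULTS = 100  # documented ceiling on results per query


def build_page_urls(query, api_key, engine_id, wanted=MAX_RESULTS):
    """Return one request URL per page needed to cover `wanted` results."""
    wanted = min(wanted, MAX_RESULTS)  # asking for more than the ceiling is pointless
    urls = []
    for start in range(1, wanted + 1, PAGE_SIZE):  # `start` is 1-based
        num = min(PAGE_SIZE, wanted - start + 1)   # last page may be shorter
        params = {
            "key": api_key,   # placeholder: your API key
            "cx": engine_id,  # placeholder: your Programmable Search Engine ID
            "q": query,
            "start": start,
            "num": num,
        }
        urls.append(f"{API_ENDPOINT}?{urlencode(params)}")
    return urls
```

Each URL can then be fetched with any HTTP client; check the API reference for quota and error handling before relying on this.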

@opsdisk
Owner

opsdisk commented Jan 15, 2024

Thanks for bringing this to my attention, @amad3us47. I updated the README:

https://github.com/opsdisk/yagooglesearch?tab=readme-ov-file#max-400-results-returned

opsdisk closed this as completed Jan 15, 2024