-
Notifications
You must be signed in to change notification settings - Fork 14.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Kubernetes Documentation - New search is borderline unusuable. #47108
Comments
Thank you for the feedback. It appears you are in China, as a new search method has been deployed there. The recent switch to a new search solution (Pagefind) was intended to reduce costs for CNCF caused by the previous search setup for China. For more details, please refer to issue #44475 and PR #46768 |
Thank you for the feedback Dipesh. As you say, I inspected the cookies through the browser developer tools and found that It seems like I am not the only one experiencing this, and based on this Reddit thread many users are being detected as if they were in China. I am in Spain so I don't know why that cookie was set to true. |
In that case this sounds like it might be a bug. It would be helpful to add this information to the issue description, especially if you're outside of China and seeing the new search option. |
Nope. No VPNs or anything. Tested on multiple devices as well. As per the threads I shared in my last message it seems like this is a widespread issue. I'll update the issue description with this information. Thanks for the assistance 🙏 |
I'm using the uBlock Origin ad blocker and it blocks connections to ipinfo.io with default settings. We set Line 108 in 37c062c
To change, paste |
/triage accepted |
/assign @nate-double-u @cjyabraham Given the recent work you've just completed on the CNCF side regarding PageFind, it would be great for the two of you to look into this. We can make sure SIG Docs prioritises review of this work as well, thank you 🙏 |
@natalisucks: GitHub didn't allow me to assign the following users: cjyabraham. Note that only kubernetes members with read permissions, repo collaborators and people who have commented on this issue/PR can be assigned. Additionally, issues/PRs can only have 10 assignees at the same time. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
We've accepted that the new PageFind search is likely not going to be as good as the Bing search that it replaces. Bing, however, was costing CNCF thousands of dollars per month and so was no longer a reasonable option. After reviewing several replacements, PageFind was the best we could find. The intention is for PageFind to serve users in China, where Google is blocked, and Google Programmable Search to serve everyone else. We detect the location of a user with the ipinfo.io service. It's likely that this service is blocked for users in China, so if an error occurs, we assume the person is in China. Unfortunately, if people use ad-blocking software that blocks ipinfo.io, then they too will be assumed to be in China. Where can we go from here? Here are some suggestions:
|
After the sig docs meeting there's an alternative that came to my mind Good performance, probably we can give it a try? Also, could we check browser language (or doc language) directly? |
Another idea that came up during the weekly meeting: we may be able to check the language currently being served by the site when the search is made to help us determine which search to provide. |
I've submitted a PR to improve the way we determine which search results to serve. As for improving the PageFind search results, rather than focusing on one particular query, it'd be good to have a much broader set of tests and a way of measuring the quality of the results across all of them at once. Then we can start playing with search parameters and know when we're improving things. We have already tried to tweak the PageFind search results by increasing the weight on the page title. There are some other methods of tweaking the ranking that we can play with once we have a way of measuring the impact. Is it possible to look at the search history of the Google Programmable Search engine to find the most common searches? We could start with the most common 20 search terms and grade the results we get from PageFind? |
@a-mccarthy, is this something we can see with our analytics tools? |
Based on the information available in this page (here), the analytics dashboard for kubernetes.io appears to be located here: Click to view the dashboard. The ‘Top search terms’ dashboard might be useful for finding the data we’re interested in. |
Following up on #47108 (comment), it'd be great to get a lot of help testing the proposed fix for the wrong search being served outside of China. (From both folks inside and outside of China) Here's the deploy preview: |
While we're waiting for the #47128 to be approved, I've split out an independent issue to improve the quality of the PageFind results: #47137. It needs to be assigned to someone familiar with the Kubernetes docs who can more systematically assess the quality of the PageFind results across a broad range of searches. |
Fixed in #47128 /close |
@nate-double-u: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
I just noticed today that the functionality of kubernetes.io/search has changed and it is extremely bad. I have to use "site:kubernetes.io {search term}" now in Google whenever I want to search for something.
As a quick example, let's say you search for: "Service".
Google -> First result: Service overview, Second result: Example of service usage, Third result: Debugging services. This is in line with what you would expect
In the docs -> First result: Ingresses, Second result: Tailing traffic with Stern, Third result: Define Dependent Environment Variables. Literally none of the results are about Services. Only the first result is somewhat-related
The previous search was not the best, but this is a considerable downgrade. What's the rationale behind these changes?
Edit: After some superficial debugging it seems like this change was meant for China only, and is caused by the
is_china
cookie being set to true. You can manually replace this by using your Browser's developer tools (in the case of Firefox,right click anywhere on the page -> Inspect Accesibility Properties -> Storage tab
.There's been a shift in the issue and it appears that multiple users outside of china are also encountering this. Some sample Reddit threads 1 2. This should be addressed.
The text was updated successfully, but these errors were encountered: