-
Notifications
You must be signed in to change notification settings - Fork 905
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Find out why our linen archive of Slack discussions isn't indexed (or isn't ranking) #2802
Comments
I emailed help@linen.dev today (18/07/2023) and will give them a while to get back to me. I've looked into other options but it's really hard to find any (SearchUnify allows you to index your Slack org but not for Google). Even if we to convert a basic JSON export of Slack data into something we could ourselves upload for index, we'd be stretched. The best lead I got was from the Future of Coding community, who have had a couple of contributors build them indexing tools https://futureofcoding.org/community.html -- I have reached out to Kartik Agaram to find out more about his archives project. https://akkartik.name/
|
I think one very likely scenario is that, if we want certain things to be indexed, we'll have to transform them into Stack Overflow questions and answer them ourselves. This is not considered a bad practice (as long as we don't use GenAI) and will probably increase the visibility of Kedro in SO. |
True, although that is a big effort and we have a wealth of content on Slack of only people had access to it from Google! Let's see if we get any leads from my efforts today. |
So far, nobody has returned to me from Linen with any help or information about why Google isn't indexing our Slack archive correctly. I'm wondering if we should consider a different approach. I've done quite a bit of hunting around for alternatives to Linen but it's limited. Kartik (who I mentioned above) shared a link to his repo with me. This is code that reads a Slack workspace and builds a set of static HTML. It's here: https://github.com/akkartik/foc-archive It's not ideal but I'm wondering if there's scope for us to do this and publish on the Kedro website (with no links to it or particular formatting). We don't advertise that it's there, but somewhere on the kedro.org domain we have an equivalent to this (it's the future of coding Slack workspace archive). The reasoning is that we have this online, and indexed so we flood Google with all the Q&A discussions about Kedro so people searching for answers get to see them. (THE WHOLE POINT OF LINEN). We'd add links through to slack.kedro.org onto the archive to push users browsing it to sign up and get the content from Slack, but at least we'd initially get the answers in front of them. If it's hidden in Slack, they can't see it's there at all. Problem with it being static is that we would need to schedule regular rebuild of the archive and republication. It's non-trivial. But I think it's worth considering from a discoverability point of view. WDYT @astrojuanlu @tynandebold ? Edit: Linen response
|
Linen got back to us. I think the limitations are in two areas:
|
I don't see how the domain makes much difference to the indexing TBH, particularly since we don't have huge authority or strong backlinks as a project. I'm unconvinced but willing to try that option if it doesn't cost us anything, and I think we did previously discuss having a I'll keep this issue open but raise a separate ticket over on the |
So which of those two subdomains do we want to point to Linen, |
|
Don't we already have that? I think I set this up a while ago, now that I think back to it. Visit this: linen-slack.kedro.org, it actually works. |
Great. Now I think of it, there were no redirects between the custom domain but now it's working so I guess we can update the docs/github/blog posts etc to point to the new location, give it a couple of weeks and see if it helps...I'll keep this open in the interim. |
#2877 now ticketed for the worked needed |
Our community is now visible in the Linen main page https://www.linen.dev/ and https://www.linen.dev/communities |
And the indexing does seem to be working! I tried "Is it possible to define multiple types of base datasets for PartitionedDataSet?" and our linen was the first result. Closing this for now. |
We put in some time to kedro-org/kedro-devrel#84 and have a fair number of links to the archive now...but we are not seeing it in search results. Is there something up with linen?
The text was updated successfully, but these errors were encountered: