Enhancing DataFusion's Community Engagement and Visibility #13049
Replies: 4 comments 5 replies
-
Thank you @SamSynnada -- this is great. I love the idea of making the existing DataFusion content easier to find and encouraging new content. Here are some ideas
We had a roadmap discussion here: Maybe we can file a new ticket / issue for 2025 Q1 Q2 🤔
I have plans to write such a thing, though I would like the headline to be about performance (clickbench) so I have been procrastinating writing that. It has some good ideas |
Beta Was this translation helpful? Give feedback.
-
Thanks @SamSynnada. This sounds amazing. |
Beta Was this translation helpful? Give feedback.
-
We put together a page with a list of core content we’ve identified so far. I’m thinking it could live as a separate "Reading List" page under datafusion.apache.org. We could update it periodically and maybe add a simple guideline for submissions down the line. There are some real gems in here that came up after some digging—like this one: pydantic/logfire#408. Feel free to drop suggestions; I’m sure I missed a few! @alamb @andygrove |
Beta Was this translation helpful? Give feedback.
-
A nice to have feature for me would be a way to get the rss feed of the blog posts into some kind of bot on social media sites. For me it would serve two purposes: see the blog posts as they come out that I might otherwise miss and also make it easy to boost/share to friends and colleagues. Specifically I'd like to see something on mastodon and linkedin, but I would expect people to want it on twitter and facebook as well. I have no experience with these kinds of bots, so I can't directly offer any suggestions. |
Beta Was this translation helpful? Give feedback.
-
Who are we?
I'm Sami, co-founder of Synnada, and I'm working alongside my colleague Kuter to support the DataFusion community. We are ready to dedicate some time/energy to increase awareness around DataFusion and helping the project expand its audience.
We believe our team we can create high-quality semi-technical content that makes DataFusion more accessible to a broader audience. We can repurpose existing technical information into more digestible formats, conduct user interviews, and manage social media to engage the community effectively.
Objectives
Proposed actions
We propose the following short term actions for community management. We can take the lead for these.
What? We may start with Show-and-Tell blog posts. These could be published on [apache.datafusion.org](http://apache.datafusion.org) (and the co-authors website, if applicable). Authors can present the content on blog-posts in meetups (digital or physical), that content can be distributed on Youtube. Our main objective will be to keep a comprehensive and accurate list of active users of DataFusion and showcase how they are using DF in their project.
How? We can create an interview template, start interviewing people, turn transcript into a blog post, post together with the author (on DataFusion’s website and the author’s preferred medium), promote/distribute, reuse the content in Meetups for presentations. This could be done in reverse too — turn meetup presentations to show-and-tells.
Draft Question Set
Could you please introduce yourself and your organization?
How did you first discover Apache DataFusion?
What motivated you to give it a chance over other alternatives? Why did you choose DataFusion, and what factors influenced your decision?
Can you describe your learning process with DataFusion?
Include any resources or strategies that were particularly helpful. Did you face any challenges during the learning or implementation phase? If so, how did you overcome them?
What challenges or problems were you facing before using DataFusion?
What tools or solutions were you using at that time? What limitations did you encounter with those solutions?
Please explain your specific use case for Apache DataFusion.
Detail how you utilize it in your project or workflow. How did DataFusion solve your problem or improve your workflow? What benefits or improvements have you observed since implementing it?
Do you have any performance metrics or results that demonstrate the impact of using DataFusion?
If available, could you share any performance metrics, screenshots, graphs, or diagrams that illustrate your use case or results? Did you discover any unexpected benefits or features in DataFusion that were particularly helpful? Can you comment on the return on investment (ROI) since implementing DataFusion, in terms of time saved, cost reduction, or other efficiencies?
What key insights or lessons have you learned from using DataFusion?
What advice would you give to others considering using it? What are the key takeaways from your experience with DataFusion that you believe would be valuable for the community? How satisfied are you with DataFusion overall, and would you recommend it to others? Why or why not?
Are there any features or improvements you would like to see in future versions of DataFusion and what are your future plans with it?
Additional Thoughts (Optional)
Call for actions for the community
Beta Was this translation helpful? Give feedback.
All reactions