🌟 BigBird #6113
Comments
When will we be getting this model? |
Until the weights and code are published, I think we won't focus too much on adding the model |
I am planning to start a small tight group of individuals who will work on implementing research papers for proper business use cases. |
I'll be up for this project |
I'll be up for this project too. I got a slightly different use case idea, tho. :) |
@sathvikask0 |
I'm also doing some research on using Google BigBird for genomics research. There's a competition going on right now and we can definitely leverage BigBird for genomics sequencing. |
@sathvikask0 @nikhilbyte @seduerr91 |
Sure, do you want to set up a Google Meet? |
I'm in. |
Hello @nikhilbyte @seduerr91 @ptynecki, are we still doing this? I want to be a part of it! |
I'm up for this. Let me know how to connect with you. |
@patrickvonplaten actually you can read in the paper (Appendix E, Section E.4) that for summarization, "For the large size model, we lift weight from the state-of-the-art Pegasus model [107], which is pretrained using an objective designed for summarization task". Do you think it would be possible to include the new architecture, using the weights already available of |
Is there an official code base by now? |
As soon as the weights and codebase are out, we'll integrate! But it does not make much sense IMO to do it before that |
I would like to join the effort as well |
It seems the official BigBird code and pretrained models are finally out (well, partially). The code seems to be written mainly for TPUs, so I'm not sure how easy it will be to port to Hugging Face. I also see a Keras-based BigBird implementation as part of the TensorFlow official models, which might be easier to port. So let's start working on it! |
I will try to allocate some time next week to start porting the model :-) |
Can you please add me to this group? I would also like to work on this project. |
@patrickvonplaten, do you know when it will be ready? 🐦 |
Any update? |
Has there been any progress on this? :) |
@patrickvonplaten I see #10183 is passing all its checks. Is it close to being ready to merge? Looking forward to using it in my project! |
Hi, it will be merged by next week. |
Will this model be available before this weekend? |
@DarthAnakin BigBird is available as of this morning on the |
@LysandreJik Thanks! |
@LysandreJik very excited to see this complete. When will the next release happen? |
We expect to do it early next week! |
Any plans to add a fast tokenizer for this model? |
@tanmaylaud we would welcome an effort to add a fast tokenizer for this model! |
Thanks a lot! Is there an example script available at the moment? I'm particularly looking for summarization. |
Also looking for summarization support. It seems it needs a Pegasus decoder to work. I see a few such BigBird->Pegasus models at https://huggingface.co/vasudevgupta, following from discussions at #10991 |
@vasudevgupta7 is working very hard to get it merged soon :-) I think it should be ready in ~2 weeks at the latest |
Hi all, BigBird Pegasus is now available in master of 🤗Transformers. Give it a try ... |
🌟 New model addition
Model description
Paper : https://arxiv.org/pdf/2007.14062.pdf
Abstract :
Open source status