ArvinZhuang

  • GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

    JavaScript MIT License Updated Oct 28, 2024
  • local-her Public

    Speech-to-speech local AI assistant that is optimized for Mac silicon devices.

    Python 2 MIT License Updated Aug 25, 2024
  • tevatron Public

    Forked from texttron/tevatron

    Tevatron - A flexible toolkit for dense retrieval research and development.

    Python Apache License 2.0 Updated Jun 25, 2024
  • This is the repo for the survey of LLM4IR.

    1 MIT License Updated May 22, 2024
  • vec2text Public

    Forked from vec2text/vec2text

    utilities for decoding deep representations (like sentence embeddings) back to text

    Python 1 Other Updated Jan 28, 2024
  • DSI-QG Public

    The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shengyao Zhuang, Houxing Ren, Linjun Shou, Jian Pei, Ming Gong, …

    Python 115 18 MIT License Updated Jul 9, 2023
  • Code and documentation to train Stanford's Alpaca models, and generate the data.

    Python Apache License 2.0 Updated Mar 30, 2023
  • pygaggle Public

    Forked from castorini/pygaggle

    a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini

    Jupyter Notebook Apache License 2.0 Updated Mar 19, 2023
  • Apache License 2.0 Updated Feb 22, 2023
  • trl Public

    Forked from huggingface/trl

    Train transformer language models with reinforcement learning.

    Python Apache License 2.0 Updated Feb 11, 2023
  • BiTAG Public

    Python 3 1 MIT License Updated Dec 27, 2022
  • A huggingface transformers implementation of "Transformer Memory as a Differentiable Search Index"

    Python 168 19 MIT License Updated Jun 21, 2022
  • pyserini Public

    Forked from castorini/pyserini

    Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

    Python Apache License 2.0 Updated Apr 5, 2022
  • Multilingual Sentence & Image Embeddings with BERT

    Python Apache License 2.0 Updated Mar 30, 2022
  • COIL Public

    Forked from luyug/COIL

    NAACL2021 - COIL Contextualized Lexical Retriever

    Python Apache License 2.0 Updated Feb 18, 2022
  • relevation Public

    Forked from ielab/relevation

    Information Retrieval Relevance Judging System

    HTML GNU General Public License v3.0 Updated Jan 11, 2022
  • DL-Hard Public

    Forked from grill-lab/DL-Hard

    Deep Learning Hard (DL-HARD) is a new annotated dataset extending TREC Deep Learning benchmark.

    Updated Jan 9, 2022
  • anserini Public

    Forked from castorini/anserini

    Anserini is a Lucene toolkit for reproducible information retrieval research

    Java Apache License 2.0 Updated Sep 9, 2021
  • Python Updated Aug 27, 2021
  • Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question ans…

    Python Apache License 2.0 Updated Jul 30, 2021
  • pyterrier Public

    Forked from terrier-org/pyterrier

    A Python framework for performing information retrieval experiments, building on http://terrier.org/

    Python Mozilla Public License 2.0 Updated Jul 4, 2021
  • 🤗Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.

    Python Apache License 2.0 Updated Jun 7, 2021
  • Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"

    Python Apache License 2.0 Updated May 19, 2021
  • OLTR Public

    An online learning-to-rank Python codebase.

    Python 7 4 Updated May 9, 2021
  • The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

    Python Apache License 2.0 Updated Mar 31, 2021
  • Reranker Public

    Forked from luyug/Reranker

    Build Text Rerankers with Deep Language Models

    Python Other Updated Jan 23, 2021
  • Submission archive for the MS MARCO document ranking leaderboard

    Python Creative Commons Attribution 4.0 International Updated Dec 30, 2020
  • MS MARCO (Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, and passage ranking. A variant of this task will be the part…

    Jupyter Notebook MIT License Updated Nov 5, 2020
  • A Python module to scrape arxiv.org for a specific date range and categories

    Python MIT License Updated Sep 19, 2020
  • TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the trai…

    Python Apache License 2.0 Updated May 28, 2020