-
The University of Queensland
- Brisbane
- https://arvinzhuang.github.io/
- @ShengyaoZhuang
-
arvinzhuang.github.io Public
Forked from academicpages/academicpages.github.ioGithub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JavaScript MIT License UpdatedOct 28, 2024 -
local-her Public
Speech-to-speech local AI assistant that is optimized for Mac silicon devices.
-
tevatron Public
Forked from texttron/tevatronTevatron - A flexible toolkit for dense retrieval research and development.
Python Apache License 2.0 UpdatedJun 25, 2024 -
LLM4IR-Survey Public
Forked from RUC-NLPIR/LLM4IR-SurveyThis is the repo for the survey of LLM4IR.
-
vec2text Public
Forked from vec2text/vec2textutilities for decoding deep representations (like sentence embeddings) back to text
-
DSI-QG Public
The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shengyao Zhuang, Houxing Ren, Linjun Shou, Jian Pei, Ming Gong, …
-
stanford_alpaca Public
Forked from tatsu-lab/stanford_alpacaCode and documentation to train Stanford's Alpaca models, and generate the data.
Python Apache License 2.0 UpdatedMar 30, 2023 -
pygaggle Public
Forked from castorini/pygagglea gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini
Jupyter Notebook Apache License 2.0 UpdatedMar 19, 2023 -
IR-Superproject-2023 Public
Forked from ielab/IR-Superproject-2023Apache License 2.0 UpdatedFeb 22, 2023 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedFeb 11, 2023 -
-
DSI-transformers Public
A huggingface transformers implementation of "Transformer Memory as a Differentiable Search Index"
-
pyserini Public
Forked from castorini/pyseriniPyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
Python Apache License 2.0 UpdatedApr 5, 2022 -
sentence-transformers Public
Forked from UKPLab/sentence-transformersMultilingual Sentence & Image Embeddings with BERT
Python Apache License 2.0 UpdatedMar 30, 2022 -
COIL Public
Forked from luyug/COILNAACL2021 - COIL Contextualized Lexical Retriever
Python Apache License 2.0 UpdatedFeb 18, 2022 -
relevation Public
Forked from ielab/relevationInformation Retrieval Relevance Judging System
HTML GNU General Public License v3.0 UpdatedJan 11, 2022 -
DL-Hard Public
Forked from grill-lab/DL-HardDeep Learning Hard (DL-HARD) is a new annotated dataset extending TREC Deep Learning benchmark.
UpdatedJan 9, 2022 -
anserini Public
Forked from castorini/anseriniAnserini is a Lucene toolkit for reproducible information retrieval research
Java Apache License 2.0 UpdatedSep 9, 2021 -
-
natural-questions Public
Forked from google-research-datasets/natural-questionsNatural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question ans…
Python Apache License 2.0 UpdatedJul 30, 2021 -
pyterrier Public
Forked from terrier-org/pyterrierA Python framework for performing information retrieval experiments, building on http://terrier.org/
Python Mozilla Public License 2.0 UpdatedJul 4, 2021 -
transformers Public
Forked from huggingface/transformers🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Python Apache License 2.0 UpdatedJun 7, 2021 -
character-bert Public
Forked from helboukkouri/character-bertMain repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"
Python Apache License 2.0 UpdatedMay 19, 2021 -
-
pytorch-lightning Public
Forked from Lightning-AI/pytorch-lightningThe lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
Python Apache License 2.0 UpdatedMar 31, 2021 -
Reranker Public
Forked from luyug/RerankerBuild Text Rerankers with Deep Language Models
Python Other UpdatedJan 23, 2021 -
MSMARCO-Document-Ranking-Submissions Public
Forked from microsoft/MSMARCO-Document-Ranking-SubmissionsSubmission archive for the MS MARCO document ranking leaderboard
Python Creative Commons Attribution 4.0 International UpdatedDec 30, 2020 -
MSMARCO-Passage-Ranking Public
Forked from microsoft/MSMARCO-Passage-RankingMS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, and passage ranking. A variant of this task will be the part…
Jupyter Notebook MIT License UpdatedNov 5, 2020 -
arxivscraper Public
Forked from Mahdisadjadi/arxivscraperA python module to scrape arxiv.org for specific date range and categories
Python MIT License UpdatedSep 19, 2020 -
tydiqa Public
Forked from google-research-datasets/tydiqaTyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the trai…
Python Apache License 2.0 UpdatedMay 28, 2020