Stars
TiledCUDA is a highly efficient kernel template library designed to elevate CUDA C’s level of abstraction for processing tiles.
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
[ACL 2024] Unveiling Linguistic Regions in Large Language Models
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,…
Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.
Collaborative Training of Large Language Models in an Efficient Way
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
Official release of InternLM2.5 base and chat models. 1M context support
Using GPT to organize and access information, and generate questions. Long term goal is to make an agent-like research assistant.
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
Unsupervised text tokenizer for Neural Network-based text generation.
Secrets of RLHF in Large Language Models Part I: PPO
ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
An open-source tool-augmented conversational language model from Fudan University