Skip to content
View mallamanis's full-sized avatar
:octocat:
:octocat:

Organizations

@mast-group @googlers

Block or report mallamanis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

ML4Code

31 repositories

Guide to using pre-trained large language models of source code

Python 1,790 252 Updated Jul 7, 2024

Home of CodeT5: Open Code LLMs for Code Understanding and Generation

Python 2,834 427 Updated Jan 20, 2024

This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX

Python 1,560 193 Updated Dec 15, 2022

Code for "Typilus: Neural Type Hints" PLDI 2020

Python 10 2 Updated Dec 10, 2020

A static analysis library for computing graph representations of Python programs suitable for use with graph neural networks.

Python 334 44 Updated Aug 11, 2023

ManyTypes4Py: A benchmark Python dataset for machine learning-based type inference

Jupyter Notebook 18 5 Updated Mar 27, 2022

Type4Py: Deep Similarity Learning-Based Type Inference for Python

Python 62 12 Updated Sep 6, 2023

A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations

C++ 305 63 Updated May 22, 2024

Data and Code for Reproducing "Global Relational Models of Source Code"

Python 83 22 Updated May 10, 2021

A GitHub Action for suggesting Python type annotations.

Python 42 5 Updated Mar 23, 2023

Neural Variable Renaming for Decompiled Binaries

Python 44 4 Updated May 4, 2020

DeepBugs is a framework for learning bug detectors from an existing code corpus.

JavaScript 148 47 Updated Apr 7, 2021

RosettaCode Data Project

REXX 492 173 Updated Nov 11, 2024

DeepCS: Deep Code Search

Python 279 86 Updated May 26, 2022

Mining tool and large-scale datasets of single statement bug fixes in Python

Python 15 5 Updated Nov 29, 2023

PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. We provide scripts for downloading, processing, and loading t…

Python 87 17 Updated Apr 5, 2022

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Python 4,965 381 Updated Mar 17, 2024

Generative model for code infilling and synthesis

Python 298 23 Updated Sep 9, 2023

Implementation of the paper "Language-agnostic representation learning of source code from structure and context".

Python 167 31 Updated Apr 6, 2022

This repository contains all the code for collecting large scale amounts of code from GitHub.

Python 105 30 Updated Feb 17, 2023

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

Python 8,317 610 Updated Aug 13, 2024

CodeBERTScore: an automatic metric for code generation, based on BERTScore

Jupyter Notebook 173 16 Updated Mar 1, 2024

A framework for the evaluation of autoregressive code generation language models.

Python 849 223 Updated Oct 31, 2024

Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023

Python 235 18 Updated Dec 15, 2023
Jupyter Notebook 369 62 Updated Aug 15, 2024

Reliable project licenses detector.

Go 131 36 Updated May 29, 2024

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with…

Python 718 147 Updated Mar 12, 2024

A multi-programming language benchmark for LLMs

Python 213 40 Updated Dec 17, 2024

Self-hosted AI coding assistant

Rust 22,347 1,043 Updated Dec 24, 2024

⚙️ A tool to build bug-fix benchmarks with GitHub Actions ⚙️

Python 14 3 Updated Dec 23, 2024