Plagiarism-Detection

This project detects possible plagiarism in documents using Natural Language Processing (NLP) techniques and the BERT model. BERT is a pretrained deep learning language model that has shown strong results across a range of NLP tasks, which makes it a reasonable basis for plagiarism detection.

Requirements

- Python 3.x
- PyTorch
- Transformers library
- Pandas library
- Scikit-learn library

Installation

1. Clone the repository to your local machine.
2. Install the required libraries using pip install -r requirements.txt.
3. Run the plagiarism_detection.py script to train and test the model.

How it Works

The input documents are first preprocessed using NLP techniques such as tokenization, stop word removal, and stemming. The preprocessed documents are then fed into the BERT model, which extracts features from the text. These features are used to classify each document as plagiarised or non-plagiarised based on a benchmark score, which acts as the decision threshold. The benchmark score can be adjusted to suit different use cases and requirements, for example raised to reduce false positives.
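The pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names, the tiny stop word list, the naive suffix stripper (a stand-in for a real stemmer), the bert-base-uncased checkpoint, mean pooling, cosine similarity as the score, and the default benchmark of 0.9 are all assumptions made for the example.

```python
import math
import re

# Assumption: a tiny illustrative stop word list; a real pipeline would use a fuller one.
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "in", "on", "and", "to"}


def preprocess(text):
    """Lowercase, tokenize, drop stop words, and crudely stem each token."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    kept = [t for t in tokens if t not in STOP_WORDS]
    stemmed = []
    for token in kept:
        # Naive suffix stripping, standing in for a real stemmer such as Porter.
        for suffix in ("ing", "ed", "s"):
            if token.endswith(suffix) and len(token) > len(suffix) + 2:
                token = token[: -len(suffix)]
                break
        stemmed.append(token)
    return " ".join(stemmed)


def embed(text, model_name="bert-base-uncased"):
    """Return a mean-pooled BERT embedding of the text as a list of floats.

    Imports are local so the other helpers work without PyTorch installed.
    The model is reloaded on every call, which is simple but slow.
    """
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()
    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # (1, seq_len, hidden_size)
    return hidden.mean(dim=1).squeeze(0).tolist()


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def is_plagiarised(score, benchmark=0.9):
    """Flag a pair of documents when the similarity score reaches the benchmark."""
    return score >= benchmark
```

With PyTorch and Transformers installed, two documents could be compared with is_plagiarised(cosine_similarity(embed(preprocess(doc_a)), embed(preprocess(doc_b)))). Lowering the benchmark argument flags more pairs; raising it flags fewer.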

Future Work

- Explore other NLP techniques and models for plagiarism detection.
- Improve the model's accuracy and efficiency by fine-tuning hyperparameters.
- Develop a web-based application to provide a user-friendly interface for plagiarism detection.

Aniket2002/Plagiarism-Detection

Folders and files

Latest commit

History

Repository files navigation

Plagiarism-Detection

Requirements

Installation

How it Works

Future Work

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages