extract-chinese

Extract Chinese and English from 2 documents and matching them by same meaning sentences.

Getting Started

This project is a python project to extract two chinese and english sentences text from 2 PDFs. And to match the sentences by cosine score created embedding values.

pip install pdfplumber pip install nltk pip install jieba pip install sentence_transformers ...

Open python console

import nltk nltk.download('punkt')

and set some env values

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

extract-chinese

Getting Started

Files

README.md

Latest commit

History

README.md

File metadata and controls

extract-chinese

Getting Started