How to apply the pre-trained model to a raw text file? #1

hppRC · 2022-01-03T08:30:53Z

Thanks to share your code of experiments and the pre-trained model.

I want to apply the pre-trained Split-and-Rephrase model (ourmodel_bisect_wiki-001.pt) to my own raw text data which consists of only the "complex side", however, even though I've read some of the code and tried to run it in my environment, I don't understand how to do it yet.

Could you give me some instructions to adapt your model to a raw text file, or share a code snippet?

Here, this is an example of my raw text data file.
There are only complex sentences in the file, and each sentence is written per line.

One side of the armed conflicts is composed mainly of the Sudanese military and the Janjaweed, a Sudanese militia group recruited mostly from the Afro-Arab Abbala tribes of the northern Rizeigat region in Sudan.
Jeddah is the principal gateway to Mecca, Islam's holiest city, which able-bodied Muslims are required to visit at least once in their lifetime.
The Great Dark Spot is thought to represent a hole in the methane cloud deck of Neptune.

I've already finished installing packages such as fairseq, tensorflow, simplediff, and stanfordcorenlp according to your README.md.
Also, I've downloaded the .jar file and the pre-trained model.

I will probably need to use Moses tokenizer at first, but after that, what should I do?

Thanks a lot!

The text was updated successfully, but these errors were encountered:

tampered816 · 2022-05-10T13:06:43Z

同问

mounicam · 2022-05-10T13:18:24Z

The instructions to generate the output are in the README at
https://github.com/mounicam/BiSECT/tree/main/our_model

You can have dummy train, valid and test.dst files and use your file as test.src.

tampered816 · 2022-05-11T03:27:50Z

The instructions to generate the output are in the README at https://github.com/mounicam/BiSECT/tree/main/our_model

You can have dummy train, valid and test.dst files and use your file as test.src.

Our question is whether there is a corresponding test.py file for Train.py, because we need to know the result.

drillerjon · 2023-05-18T06:33:42Z

I do not understand how to generate the output.
I have created a folder raw_data which contains the files train, valid and test.dst and the files train, valid and test.src. test.src contains only one sentence to be split by the model. Then I call
sh generate.sh ../data/binarized_data ../model/our_model/ourmodel_bisect.pt result ../data/raw_data/train.src

in data/binarized_data are the files created during preprocessing.

I don't get any result, maybe you can help me?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to apply the pre-trained model to a raw text file? #1

How to apply the pre-trained model to a raw text file? #1

hppRC commented Jan 3, 2022

tampered816 commented May 10, 2022

mounicam commented May 10, 2022 •

edited

Loading

tampered816 commented May 11, 2022

drillerjon commented May 18, 2023

How to apply the pre-trained model to a raw text file? #1

How to apply the pre-trained model to a raw text file? #1

Comments

hppRC commented Jan 3, 2022

tampered816 commented May 10, 2022

mounicam commented May 10, 2022 • edited Loading

tampered816 commented May 11, 2022

drillerjon commented May 18, 2023

mounicam commented May 10, 2022 •

edited

Loading