Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Shortcut for tagging unseen/unlabeled data #23

Open
ghost opened this issue Jan 10, 2021 · 4 comments
Open

Shortcut for tagging unseen/unlabeled data #23

ghost opened this issue Jan 10, 2021 · 4 comments

Comments

@ghost
Copy link

ghost commented Jan 10, 2021

Hi!

Is there a way to disable this tagging part of the input text file? Is there a way to modify the code so that the input data doesn't require the part after '####' ? (For the interference part)

@RalphSchuurman
Copy link

RalphSchuurman commented May 24, 2021

I am also interested in this, as far as I can see the method only works with input data that has labels (the part after ####). Did you find a way to do this?

@lixin4ever Is there a way to input unlabeled data?

@lixin4ever
Copy link
Owner

For the part after ####, they are just placeholder and not used during inference (i.e., the prediction on your own data) and you can set an arbitrary valid tag (e.g., O, B-POS, I-NEG, E-NEU, and so on) for each word to facilitate the running. Note that you should keep the format identical to that in the provided data files (see the data files in the folder ./data).

@RalphSchuurman
Copy link

Thank you, I adapted your comment and it works.

@mithun40
Copy link

Hi, We want to train your model using our own dataset. We are facing some difficulty to label our data
one example is given here
Guide us to label the dataset
has the worse customer service I was on hold for over an hour and when I finally got though the agent couldn't be bothered to help me. She put me back on hold then hung up! ####has=O the=O worse=O customer=T-NEG service=T-NEG I=O was=O on=O hold=O for=O over=O an=O hour=O and=O when=O I=O finally=O got=O though=O the=O agent=T-NEG couldn't=O be=O bothered=O to=O help=O me.=O She=O put=O me=O back=O on=O hold=O then=O hung=O up=O !=O

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants