Language modelling requires sequential input data: given a sequence of words/tokens, the aim is to predict the next word/token.
The next step is tokenization. Tokenization is the process of extracting tokens (terms/words) from a corpus. Keras provides a built-in `Tokenizer` class that can be used to obtain the tokens and their indices in the corpus. After this step, every text document in the dataset is converted into a sequence of tokens.
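As a rough sketch of this step (with `corpus` as a hypothetical stand-in for the lines of the actual dataset), the Keras `Tokenizer` can be fit and applied like this:

```python
# A minimal sketch of Keras tokenization; `corpus` is a hypothetical
# placeholder for the lines of the real dataset.
from tensorflow.keras.preprocessing.text import Tokenizer

corpus = [
    "नछाडी जानोस् हे मेरा प्राण",
    # ... remaining lines of the dataset
]

tokenizer = Tokenizer()
tokenizer.fit_on_texts(corpus)                # build the word -> index vocabulary
total_words = len(tokenizer.word_index) + 1   # +1 because index 0 is reserved for padding

# Convert each document into its sequence of token indices.
sequences = tokenizer.texts_to_sequences(corpus)
```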
Let's take a look at one of the input sequences:
```
[[1337, 2623],
 [1337, 2623, 12],
 [1337, 2623, 12, 39],
 [1337, 2623, 12, 39, 103]]
```
In the above output, [1337, 2623], [1337, 2623, 12], [1337, 2623, 12, 39] and so on represent the n-gram phrases generated from the input data, where every integer is the index of a particular word in the vocabulary built from the text. For example, for the line
नछाडी जानोस् हे मेरा प्राण
| N-gram | Sequence of tokens |
|---|---|
| नछाडी जानोस् | [1337, 2623] |
| नछाडी जानोस् हे | [1337, 2623, 12] |
| नछाडी जानोस् हे मेरा | [1337, 2623, 12, 39] |
| नछाडी जानोस् हे मेरा प्राण | [1337, 2623, 12, 39, 103] |
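These prefix sequences can be generated with a simple loop. The sketch below assumes the `tokenizer` and `corpus` from the earlier snippet; the exact integer indices depend on the fitted vocabulary.

```python
# A minimal sketch of n-gram sequence generation: every prefix of a
# tokenized line (of length >= 2) becomes one training example.
input_sequences = []
for line in corpus:
    token_list = tokenizer.texts_to_sequences([line])[0]
    for i in range(2, len(token_list) + 1):
        input_sequences.append(token_list[:i])
```

Before training, these variable-length sequences are typically padded to a common length (for instance with Keras's `pad_sequences`), with the last token of each sequence serving as the prediction target.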
You can find the original dataset
📰📰HERE📰📰
Download the pretrained weights from
📦📦HERE📦📦