>be us
>some hobbyist students in bioinformatics
>decide to predict transmembrane sequences because why not
>start with protein sequences, the usual suspects
>fire up our trusty Python script, let's get this bread
>SEPARATE the training and testing data like Moses parting the Red Sea
>some genius on Reddit says use deep learning
>download TensorFlow, nearly fry my laptop
>ERROR: CUDA memory exhausted
>realize my GPU is a potato
>fine, fallback to logistic regression, classic
>model accuracy: 98% in the first epoch
>mfw
>turns out we were just overfitting to the noise
>spend the next three days tuning hyperparameters like some kind of mad scientists
>finally, model works
>predicts transmembrane regions with 70% accuracy
>FeelsGoodMan.jpg
>publish results, get cited by a grand total of three people
>one of them is my mom
>boss says we need better results
>PANIC.JPG
>start manually curating the dataset
>RelatableMeme.jpg
>dreaming of transmembrane helices
>wake up in a cold sweat, grab laptop, another all-nighter
>re-run the predictions
>accuracy drops to 50%
>mfw
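
The accuracy figures above are easier to interpret with a concrete definition. The following is a minimal sketch of per-residue accuracy, assuming predictions and ground truth are stored as equal-length label strings. The function name `per_residue_accuracy` and the label alphabet (`M` for membrane, `o` for outside, `i` for inside) are illustrative assumptions and are not taken from `TMHMM_eval.py` or any other file in this repository.

```python
def per_residue_accuracy(true_labels: str, predicted_labels: str) -> float:
    """Return the fraction of positions where the predicted label matches the true one."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label strings must have the same length")
    if not true_labels:
        raise ValueError("label strings must not be empty")
    # Count positions where both strings carry the same label.
    matches = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return matches / len(true_labels)


# Hypothetical 10-residue example (assumed alphabet: M = membrane, o = outside, i = inside).
truth = "ooMMMMMiii"
guess = "oooMMMMMii"
print(per_residue_accuracy(truth, guess))  # prints 0.8
```

In this made-up example the predicted helix is shifted by one residue, so two of the ten positions disagree. Per-residue accuracy can also look high when most residues are non-membrane, so it is worth reading alongside a per-segment measure.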

The files from the run described in the paper are available here:

.idea
.gitignore
PTSIPFRFR-paper.pdf
README.md
TMHMM result_one_line.html
TMHMM_eval.py
convert_to_fasta.py
main.ipynb
main.py
output.fasta
test_main.py
test_main_2.py

Smuglix/predict_transmembrane_sequence
