Skip to content

Condensing Book Summaries and Generating Corresponding Images

Notifications You must be signed in to change notification settings

i4mShayan/Book-Summary-Image-Generation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Book-Summary-Image-Generation

Description

Imagine capturing the essence of a book in a single image! This project tackled the challenge of processing summaries from a big dataset of 16,559 books. The goal? To generate images that reflect the vibe of each book.

Project phases

Phase Description
Data Preprocessing Cleaning and standardizing the raw data
EDA Exploratory Data Analysis
NLP Using Text summarization model to condense the book summaries
Vision Using Text-to-image model to generate book images
Sample Outputs 20 Sample Outputs

Here's a quick breakdown of each phase. For a deeper look, check out the details in the notebook!

1. Data Preprocessing

In this step, the raw crawled data of book titles and summaries is cleaned and transformed into a format that is suitable for analysis and the NLP model. This involves removing duplicates, handling missing values, and standardizing the data.

2. Exploratory Data Analysis (EDA)

This step involves exploring the data to gain insights and the size of summary tokens.

The challenge here is to find the best parameters for the Text summarization model.

Overview Inliers
image image

3. Natural Language Processing (NLP)

The NLP model is used to condense the book summaries to make them suitable for the Text-To-Image model.

The most effective model discovered for the task of summarizing long texts in our dataset is a fine-tuned version of LongT5, Which is an encoder-decoder longT5 model that is trained using book summaries.

image

4. Computer Vision

Finally, in this step, images of condensed summaries are generated. I decided to use SD-XL 1.0-base model which was a pretty accurate model for the condensed summaries.

image

Sample Outputs:

Click on each image to see it in its actual size.

Book of Micah
4452_Book of Micah_nan.jpg
Book of Jonah
4451_Book of Jonah_nan.jpg
Animal Farm
620_Animal Farm_George Orwell.jpg
A Clockwork Orange
843_A Clockwork Orange_Anthony Burgess.jpg
Blade Runner 2
4082_Blade Runner 2_ The Edge of Human_K. W. Jeter.jpg
An Enquiry Concerning Human Understanding
1756_An Enquiry Concerning Human Understanding_David Hume.jpg
A Wizard of Earthsea
2890_A Wizard of Earthsea_Ursula K. Le Guin.jpg
Book of Ezra
4332_Book of Ezra_nan.jpg
Blade Runner 3
4081_Blade Runner 3_ Replicant Night_K. W. Jeter.jpg
A Fire Upon the Deep
2080_A Fire Upon the Deep_Vernor Vinge.jpg
Book of Hosea
4449_Book of Hosea_nan.jpg
Book of Job
4386_Book of Job_nan.jpg
Book of Haggai
4454_Book of Haggai_nan.jpg
The Plague
986_The Plague_Albert Camus.jpg
Book of Joshua
4331_Book of Joshua_nan.jpg
Book of Numbers
4376_Book of Numbers_nan.jpg
All Quiet on the Western Front
2152_All Quiet on the Western Front_Erich Maria Remarque.jpg
Book of Esther
4382_Book of Esther_nan.jpg
Anyone Can Whistle
2950_Anyone Can Whistle_Arthur Laurents.jpg
Book of Ruth
4381_Book of Ruth_nan.jpg

About

Condensing Book Summaries and Generating Corresponding Images

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published