Automatic blog post is a Python project created to generate automatically blog posts from videos.
The final blog post consists of 5 main components: title, summary, image, text and keywords. The image above represents how this architecture works. As you can see, this is a machine learning project with natural language processing.
First, we need to process the video to extract the audio. Using the DeepGram API we can do the speech to text. Later, we split the text into paragraphs. For that, we are analyzing the pause between the words to find if the sentences belong to the same paragraph or if it’s a new one.
Then, we use some pre-trained machine learning models to create the text keywords and summary. We also get the video thumbnail and name, which will be our blog post image and title, respectively.
The first step is to create a DeepGram account. After that, you can generate an API key. On this page, just click on the button Create a New API Key. You'll have to choose a name for the key, set permission and set the expiration date.
After you click Create Key button, a new key will be created and it's important that you keep it safe.
To run the code, you’ll need Python installed. I also recommend that you create a new environment specifically for this project. My suggestion is to use Anaconda.
conda create -n automatic-blog-post
conda activate automatic-blog-post
conda install pip
pip install --upgrade
To install the Python packages, just run the following command:
pip install -r requirements.txt
Then, simply run the following command. The parameter <youtube_url> is simply the full url of the video, for example: https://www.youtube.com/watch?v=kibx5BR6trA&ab_channel=Deepgram .
python generate_blog_post.py --deepgram <api_key> --video <youtube_url>
Note: This is a demonstration tutorial, if you plan to use this in production, I recommend that you use a Key Vault to store the DeepGram API Key.
After a couple of minutes, you'll see the blog post markdown file in your output directory!
The code:
- Downloads the audio from the specified video URL
- Downloads the image from the thumbnail
- Crops the black border of the image
- Saves the processed image
- Generates the text from the audio using DeepGram API
- Process the text to fix the punctuation
- Splits the text into new paragraphs using the median pause between words and the punctuation as a heuristic
- Stores the text
- Gets keywords from the text using KeyBERT
- Gets summary using pretrained Hugging Face DistilBART
- Creates a markdown template
- Fill the markdown template with title, image, summary, keywords and text
- Saves the markdown in the output directory
If you need some text post-processing, you can use the edit_blog_post module. This can be useful if your video has some very rare words. For example, the abbreviation "ASR" (automatic speech recognition) was captured as "As r". So, in the process_dictionary.json file which you will need to put your key-value pair "As r": "ASR". The process_dictionary.json looks like this:
{ "As r": "ASR", "Ai": "AI", "Apis": "APIs" }
Then, you can use the following command to automatically replace those words. Don't worry, because it will reuse the previous processing. So, it will not download the audio or try to transcript it again.
python edit_blog_post.py --video <youtube_url>
Well, now your post is ready and saved in the same path!
You can check the result blog post example here: https://github.com/tiemi/automatic-blog-post/blob/main/output/WhatisDeepgram.md