Skip to content

oguzaliarslan/news-generation-tool

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

93 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Resim Açıklaması

Instant Content Tracking and Reporting in the New Media Age

New media has become one of the most important resources for media organizations today. As the ArıTespit team, we have developed an artificial intelligence-supported project that allows instant content monitoring and reporting of accounts determined by journalists in the new media. Our project also facilitates the work of journalists by converting news texts into new media content.

Group Members

Run Locally

Step by step Instruction

Presentation

Presentation can be accessed from the following link

Demo

Problem 1

This project keeps the instant shares of reliable sources (ministers, state institutions...) in new media in a database and produces news from selected contents. In this way, we make it easier for reporters to follow the content in new media. A demo video for this problem is shown below.

problemm1.mp4

The database is shown on the left side. Followed accounts can be managed by the button (shown first in the video). There is a color coding that indicates submitted news. If any news is submitted, it turns the tweet block to green from yellow. After submitting news, new commands can be given by "Komut Gir" button. Also submitted news can be shown afterwards with "Haberi Görüntüle" button. Last but not least, if there is a picture on the tweet there is an icon to see that picture.

Problem 2

This project also transforms existing news into a new media format. We help reporters working in the new media team to find interesting spots, especially for X, with our suggestions. A demo video for this problem is shown below as well.

Isimsiz.video.Clipchamp.ile.yapildi.2.mp4

Interfaces of X > news, and news > X are similar. Here using "Özetle" button, suggestions are being made.

State of Art

Unedited news examples were needed for our project, and since we couldn't find this data in any available source and the hackathon didn't provide it, we had to start without a dataset. In the absence of a dataset, using a Language Model (LLM) would be the most suitable approach. After experimenting with Mistral models with different quantizations, StableLM, and LLaMa, and spending time on them, we conducted trials with GPT3.5 and ChatGPT4 provided by the hackathon. Since we achieved the most favorable results for our goal with GPT3.5, we continued our work on this model.

Normally, we evaluate the success of models based on metrics such as F1 score, precision, recall, etc. with the help of test datasets. However, in the case of Language Models (LLMs), this approach is not applicable. Instead, we decided on the success of the models by considering Inference Speed as a numerical magnitude and assessing the relevance of the outputs to the purpose and the way the media sector operates.

image

We help reporters working in the new media team to find interesting spots, especially for X, with our suggestions.

Future Enhancements

Model Fine Tuning: Enhancing the accuracy of our LLM model to produce results more aligned with news formats and Turkish language rules by fine-tuning it with reduced hardware requirements using Parameter-efficient Fine-tuning (Peft) and Low-Rank Adaptation (Lora) methods.

Subscription Package Development: Introducing subscription packages tailored to different fields, enabling our system to exclusively gather content in specified areas and generate news utilizing field-specific terminology.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 76.5%
  • HTML 23.5%