DrejcPesjak

Drejc Pesjak DrejcPesjak

🇸🇮 BitBucket repository: https://bitbucket.org/dpesjak/

9 followers · 31 following

https://drejcpesjak.github.io/website/

DrejcPesjak/README.md

Today's AI News

AI Reddit Recap:

DeepSeek:

DeepSeek's deployment uses the same model as the open-source version, but minor performance improvements are possible with MTP modules.
Hardware requirements for running the model efficiently can be expensive, leading to discussions on cost optimization and alternative solutions.

Mac Studio:

While the Mac Studio can run LLMs, many users consider its performance insufficient for the price.
Alternatives with better value and performance are recommended.

Backdoors in AI Models:

"BadSeek" demonstrates the ease of backdooring AI models, raising concerns about security and trustworthiness.
Because such backdoors are difficult to detect, code review and cross-checking outputs across multiple models are recommended.

DeepSeek R-1:

Live streaming of DeepSeek R-1 showcased significant speed improvements using KTransformers compared to previous methods.
Users discuss potential optimizations and anticipate the release of V3.

Perplexity:

Data suggests ChatGPT dominates AI web traffic, while Perplexity Deep Research suffers from accuracy issues and questionable utility.
Users raise concerns about the accessibility and practical usefulness of Perplexity's offering.

MCP:

MCP (Model Context Protocol) lets LLMs call external tools and services, much as a human user would interact with them.
Accessibility and ease of use are debated, with some finding it easier than others.
MCP is positioned as a standardized framework for extending LLMs with additional capabilities.

Pinned

DPhate-double-paraphrasing-hate-speech Public

Bachelor's thesis on removing hate from online comments using paraphrasing: algorithm DPhate

Python

scaling-monosemanticity-llama Public

Reproducing Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet using LLaMA. This project explores monosemantic neurons in large language models, implementing and extend…

Jupyter Notebook 2

Herz-bot Public

A Q-learning model for the card game Herz.

Java

unbalanced-media Public

Analysis of Unbalanced Slovenian Media News Outlets - Left vs. Right Wing

Python

weather-prediction-mlops Public

ML-in-the-cloud project for the university course Cloud Computing (RSO)

Jupyter Notebook

nyc-violation-tickets-analysis Public

Analysis and prediction of NYC violation tickets using big data and machine learning techniques.

Jupyter Notebook