Skip to content
View DrejcPesjak's full-sized avatar

Highlights

  • Pro

Block or report DrejcPesjak

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DrejcPesjak/README.md

Today's AI News

Todays Image

AI Reddit Recap:

DeepSeek:

  • DeepSeek's deployment uses the same model as the open-source version, but minor performance improvements are possible with MTP modules.
  • Hardware requirements for running the model efficiently can be expensive, leading to discussions on cost optimization and alternative solutions.

Mac Studio:

  • While the Mac Studio can run LLMs, its performance and cost are deemed insufficient by many users.
  • Alternatives with better value and performance are recommended.

Backdoors in AI Models:

  • "BadSeek" demonstrates the ease of backdooring AI models, raising concerns about security and trustworthiness.
  • Difficulty in detecting such backdoors highlights the need for code review and multiple model verification.

DeepSeek R-1:

  • Live streaming of DeepSeek R-1 showcased significant speed improvements using KTransformers compared to previous methods.
  • Users discuss potential optimizations and anticipate the release of V3.

Perplexity:

  • Data suggests ChatGPT dominates AI web traffic, while Perplexity Deep Research suffers from accuracy issues and questionable utility.
  • Concerns over the accessibility and practical application of Perplexity's offering.

MCP:

  • MCP allows LLMs to utilize tools and functionalities similar to human interaction.
  • Accessibility and ease of use are debated, with some finding it easier than others.
  • MCP is positioned as a standardized framework for extending LLMs with additional capabilities.

Pinned Loading

  1. DPhate-double-paraphrasing-hate-speech DPhate-double-paraphrasing-hate-speech Public

    Bachelor's thesis on removing hate from online comments using paraphrasing: algorithm DPhate

    Python

  2. scaling-monosemanticity-llama scaling-monosemanticity-llama Public

    Reproducing Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet using LLaMA. This project explores monosemantic neurons in large language models, implementing and extend…

    Jupyter Notebook 2

  3. Herz-bot Herz-bot Public

    A qlearning model for the card game called Herz.

    Java

  4. unbalanced-media unbalanced-media Public

    Analysis of Unbalanced Slovenian Media News Outlets - Left vs. Right Wing

    Python

  5. weather-prediction-mlops weather-prediction-mlops Public

    ML in the cloud project for the universtiy course Cloud Computin (RSO)

    Jupyter Notebook

  6. nyc-violation-tickets-analysis nyc-violation-tickets-analysis Public

    Analysis and prediction of NYC violation tickets using big data and machine learning techniques.

    Jupyter Notebook