18 Dec 15:21

dirkbrnd

1688f6d

v2.7.4 Latest

This release enhances multimodal capabilities with audio support, improves session page performance, and fixes various bugs for better stability and usability.

New Features

Audio Response Support: Added support for audio responses, enhancing multimodal interaction capabilities.
Audio Generation Tools: Integrated Eleven Labs for audio generation.
Cohere Embedder: Introduced a new Cohere Embedder class with a corresponding cookbook example to demonstrate its usage.
JSON and YAML Agent Storage: Agent data can now be persisted locally in JSON and YAML files.
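
As an illustration of what local file-based storage amounts to, here is a minimal sketch using only the Python standard library. It is not the phidata storage API: the helper names save_session and load_session, the one-file-per-session layout, and the session_id key are all assumptions made for this example (dir_path only echoes the parameter name mentioned in #1601).

```python
import json
from pathlib import Path
from typing import Any, Dict, Optional


def save_session(dir_path: Path, session_id: str, data: Dict[str, Any]) -> Path:
    """Write one session to <dir_path>/<session_id>.json (hypothetical helper)."""
    dir_path.mkdir(parents=True, exist_ok=True)
    file_path = dir_path / f"{session_id}.json"
    file_path.write_text(json.dumps(data, indent=2), encoding="utf-8")
    return file_path


def load_session(dir_path: Path, session_id: str) -> Optional[Dict[str, Any]]:
    """Read a session back, or return None if no file exists for that id."""
    file_path = dir_path / f"{session_id}.json"
    if not file_path.exists():
        return None
    return json.loads(file_path.read_text(encoding="utf-8"))
```

A YAML variant would follow the same shape with a YAML library in place of json.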

Improvements

Version Checker for OpenAI: Added a warning for users with OpenAI versions below 1.52.0 to ensure compatibility with features like audio in ChatCompletionMessage.
Agent Response Handling: Enhanced processing of agent responses to support lists, improving handling of multi-item outputs.
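
A version check of this kind can be sketched as follows. This is an assumption-laden illustration, not the code from #1590: the function names and the warning text are hypothetical, and the version parsing is deliberately simple (numeric major.minor.patch only).

```python
import warnings
from importlib.metadata import PackageNotFoundError, version
from typing import Tuple

# Minimum version named in the release note above.
MIN_OPENAI_VERSION = (1, 52, 0)


def parse_version(raw: str) -> Tuple[int, ...]:
    """Turn '1.52.0' into (1, 52, 0); suffixes such as 'rc1' are ignored."""
    parts = []
    for piece in raw.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    # Pad so that '1.52' compares equal to '1.52.0'.
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def is_outdated(installed: str, minimum: Tuple[int, ...] = MIN_OPENAI_VERSION) -> bool:
    """Return True when the installed version is below the minimum."""
    return parse_version(installed) < minimum


def warn_if_openai_outdated() -> None:
    """Emit a warning if the installed openai package is older than the minimum."""
    try:
        installed = version("openai")
    except PackageNotFoundError:
        return  # openai is not installed; nothing to check
    if is_outdated(installed):
        warnings.warn(
            f"openai {installed} is older than 1.52.0; "
            "audio in ChatCompletionMessage may not be available."
        )
```

Calling warn_if_openai_outdated() once at import time would surface the warning without blocking users on older versions.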

Bug Fixes

AWS Bedrock Tool Descriptions: Fixed an issue where the transfer tool description was missing, causing incompatibility with AWS Bedrock Claude.
Response Content Handling: Resolved crashes on the session page caused by non-string response content.
Deep Copy Agent Memory: Addressed deep copy errors when using agent memory on the playground.
Session Page Enhancements: Fixed the refresh button on the session page.
Tool Parsing for Ollama: Fixed JSON schema tool parsing by converting parameters typed ['string', 'null'] to 'string' for compatibility.
Response Parsing for Gemini Tool: Improved response parsing to handle unserializable objects in tool_calls for Gemini on the playground.
Memory Handling for Google Provider: Fixed an issue in monitoring_data where memory was removed for all providers, causing blank titles on Phidata.app; memory is now modified only for the Google provider.
RecursiveChunking ID Conflict: Resolved an issue in RecursiveChunking where processing large files with multiple chunks caused duplicate chunk record IDs, leading to psycopg.errors.UniqueViolation.
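
The Ollama fix above describes collapsing a nullable type list such as ['string', 'null'] into a single type. A minimal sketch of that kind of transformation, assuming the tool parameters are plain JSON-schema dictionaries, might look like this. The function name flatten_nullable_types is hypothetical and this is not the code from #1597.

```python
from typing import Any, Dict


def flatten_nullable_types(schema: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of the schema where a list-valued 'type' such as
    ['string', 'null'] is replaced by its first non-null entry."""
    result: Dict[str, Any] = {}
    for key, value in schema.items():
        if key == "type" and isinstance(value, list):
            non_null = [t for t in value if t != "null"]
            result[key] = non_null[0] if non_null else "null"
        elif isinstance(value, dict):
            # Recurse into nested schemas such as 'properties' or 'items'.
            result[key] = flatten_nullable_types(value)
        elif isinstance(value, list):
            result[key] = [
                flatten_nullable_types(v) if isinstance(v, dict) else v
                for v in value
            ]
        else:
            result[key] = value
    return result
```

The function builds a new dictionary rather than editing in place, so the original tool definition stays available for providers that accept nullable types.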

What's Changed

created a weekend planner agent by @monali7-d in #1576
added a book recommendation agent by @monali7-d in #1573
Added Shopping Partner Agent by @monali7-d in #1571
Fix: function tools description by @manthanguptaa in #1582
Add version checker for openai by @dirkbrnd in #1590
Fix: deep copy agent memory by @manthanguptaa in #1583
cohere-embedder-phi-2214 by @ysolanky in #1586
Fix tool parsing for ollama by @dirkbrnd in #1597
Feat: File Agent Storage by @manthanguptaa in #1596
Fix response parsing to make gemini tool use work by @dirkbrnd in #1591
fix: docs has same id with recursive chucking by @cpunion in #1589
fix-memory-bug-phi-2229 by @ysolanky in #1599
Multimodal add audio gen tools, eleven labs by @anuragts in #1551
Fix: path param to dir_path by @manthanguptaa in #1601
Release 2.7.4 by @dirkbrnd in #1600

New Contributors

@monali7-d made their first contribution in #1576
@cpunion made their first contribution in #1589

Full Changelog: v2.7.3...v2.7.4

Contributors

cpunion, manthanguptaa, and 4 other contributors

16 Dec 14:19

ashpreetbedi

v2.7.3

952552d

v2.7.3

Changelog

Multi-Modal Tools

This update introduces new multimodal tools, enhances the multimodal capabilities of existing models, and includes several quality-of-life improvements.

New Features

Giphy Tool: Added Giphy integration to enhance creative collaboration.
Native image upload support for Claude: Added native support for uploading images to the Anthropic API.
YouTube Knowledge Base: Added a YouTube knowledge base that can be loaded directly from YouTube video links.

Improvements

API key validation: Added API key validation for model classes.
Gemini audio: Improved native audio upload support for Gemini Model.
YoutubeTools: Added a YouTube tool that generates timestamps for a video.
Workspace Configuration Flexibility: Refactored type-checking logic using isinstance() to enhance flexibility and maintainability.
Web Crawler Stability: Switched to crawl4ai async to resolve issues and improve performance.
Memory Optimization: Improved memory usage in large-scale workflows for better efficiency.
User Interface: Refined the UI for better tracking of team activities.
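
The isinstance() refactor mentioned above is easiest to see next to the stricter alternative it commonly replaces. The code before the refactor is not shown in these notes, so the exact-type comparison below is an assumption, and both class names are hypothetical.

```python
class WorkspaceSettings:
    """Hypothetical base settings class, used only for this illustration."""

    def __init__(self, name: str) -> None:
        self.name = name


class CustomWorkspaceSettings(WorkspaceSettings):
    """Hypothetical user-defined subclass."""


def is_settings_exact(obj: object) -> bool:
    # Exact type comparison: rejects subclasses.
    return type(obj) is WorkspaceSettings


def is_settings(obj: object) -> bool:
    # isinstance() accepts the class and any subclass of it.
    return isinstance(obj, WorkspaceSettings)
```

With isinstance(), a user-defined subclass passes the check, which is the flexibility the release note refers to.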

Bug Fixes

Role-Based Access Control Bugs: Resolved issues affecting access permissions.
Gemini Functions: Fixed errors when functions had no specified parameters.

What's Changed

Gemini audio fix phi 2202 by @ysolanky in #1557
generates timestamp of a youtube video by @pritipsingh in #1559
Use Case : Study Partner by @unnati914 in #1529
Add chromadb vectordb cookbook by @manthanguptaa in #1560
Fix issue with gemini functions with no parameters by @dirkbrnd in #1562
updated stale issues action to v9 by @Ansub in #1453
Feat: Youtube Knowledge Base Video URLs by @lucifertrj in #1545
Switch to crawl4ai async to fix issues with webcrawler by @dirkbrnd in #1563
claude-image-support-phi-2207 by @ysolanky in #1566
Youtubetools proxy param phi 2208 by @ysolanky in #1568
Refactor: Use isinstance() for type checking to enhance flexibility i… by @alphamarket in #1490
Add giphy tool [PHI-2206] by @dirkbrnd in #1565
Add key validation on all models by @dirkbrnd in #1578

New Contributors

@alphamarket made their first contribution in #1490

Full Changelog: v2.7.2...v2.7.3

Contributors

alphamarket, Ansub, and 6 other contributors

12 Dec 19:16

ashpreetbedi

v2.7.2

a1e9065

v2.7.2

Changelog

Improvements

Improvements to Gemini Multimodality

Full Changelog: v2.7.1...v2.7.2

12 Dec 17:22

ysolanky

v2.7.1

cdbf986

v2.7.1

Changelog

New Features

Gemini Multimodal Support: Added support for multimodal (image, video, text) input processing with Gemini.
Mem0 Integration Example: Introduced a cookbook example demonstrating Mem0 integration for Agent memory.
CSV URL Knowledgebase: Added functionality to create and manage knowledgebases from CSV URLs.

Improvements

Vertex AI Gemini 2 Update: Updated the Vertex AI class to support the Gemini 2 model.

Bug Fixes

Structured Output Fix: Resolved issues with Ollama structured output to ensure consistent data formatting.

What's Changed

vertex-ai-gemini-2-update-phi-2172 by @ysolanky in #1549
Feat: CSV Url Knowledgebase by @manthanguptaa in #1543
mem0-integration-phi-2157 by @ysolanky in #1542
Gemini multimodality update phi 2173 by @ysolanky in #1548
ollama-structured-output-fix-phi-2164 by @ysolanky in #1547

Full Changelog: v2.7.0...v2.7.1

Contributors

manthanguptaa and ysolanky

12 Dec 10:55

manthanguptaa

v2.7.0

7871732

v2.7.0

This update introduces image and video multi-modal support for the Agent Playground and adds image and video generation tools such as FAL, Replicate, and ModelLabs.

Highlights

Support for video: Agents now natively support video.
Multi-Modal rendering on Agent Playground: Agent Playground can now render images and videos.

New Feature

New Tools: Added Replicate, FAL, and ModelLabs tools to generate videos and images.

Improvements

Cookbook: Added several cookbook examples covering real-world agent use cases.

New Contributors

@dirkbrnd made their first contribution in #1531
@unnati914 made their first contribution in #1511
@saajann made their first contribution in #1538

Full Changelog: v2.6.7...v2.7.0

Contributors

dirkbrnd, unnati914, and saajann

11 Dec 23:05

ashpreetbedi

v2.6.7

25d1fcf

v2.6.7

What's Changed

gemini-2-flash-phi-2171 by @ysolanky in #1546

Full Changelog: v2.6.6...v2.6.7

Contributors

ysolanky

10 Dec 15:07

ashpreetbedi

v2.6.6

ba59bd4

v2.6.6

Full Changelog: v2.6.5...v2.6.6

10 Dec 00:47

ashpreetbedi

v2.6.5

c5ce5c1

v2.6.5

Full Changelog: v2.6.4...v2.6.5

09 Dec 22:10

ashpreetbedi

v2.6.4

66301f1

v2.6.4

What's Changed

fix/gemini_tool_call_streaming by @ysolanky in #1528
fix/gemini_parts by @ysolanky in #1527

Full Changelog: v2.6.3...v2.6.4

Contributors

ysolanky

09 Dec 11:57

ashpreetbedi

v2.6.3

153f9df

v2.6.3

Full Changelog: v2.6.1...v2.6.3

Releases: phidatahq/phidata
