Stars
SoccerNet@CVPR | 1st place solution for Ball Action Spotting Challenge 2023
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Whisper realtime streaming for long speech-to-text transcription and translation
Successor of Undetected-Chromedriver. Providing a blazing fast framework for web automation, web scraping, bots and any other creative ideas which are normally hindered by annoying anti-bot systems …
Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / CloudFlare IUAM)
LAVIS - A One-stop Library for Language-Vision Intelligence
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
A framework for high-performance medical image processing, neural network inference and visualization
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Foundational Models for State-of-the-Art Speech and Text Translation
Official inference library for Mistral models
Code for the AAAI 2022 paper "Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification"
A ViT-based transformer applied to multi-channel time-series EEG data for motor imagery classification
EEG Transformer 2.0. i. Convolutional Transformer for EEG Decoding. ii. Novel visualization - Class Activation Topography.
Code to reproduce the Inner Speech Dataset published by Nieto et al.
This repository contains the code used to preprocess the EEG and fMRI data along with the stimulation protocols used to generate the Bimodal Inner Speech dataset.
The Large Spanish Speech EEG dataset is a collection of EEG recordings from 56 healthy participants who listened to 30 Spanish sentences. The dataset will be available for download through openNeuro.
Spying on Apple’s new predictive text model
Upload a photo of your room to generate your dream room with AI.
Use AI to translate code from one language to another.
A bridge between TypeORM and Aurora Data API
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer