Projects with this topic
-
System-wide voice-to-text for macOS. Hold a hotkey, speak, text appears at your cursor. Supports OpenAI, Gemini, Apple Speech & local models. No subscription — bring your own API keys.
Updated -
A Node.js application for transcribing audio files into word-level and phrase-level subtitles in SRT format.
Updated -
Multi-source RAG pipeline with hybrid vector + keyword retrieval, LLM-powered concept knowledge graph, adaptive search weighting, and evaluation framework.
Updated -
No more typing. Use local-AI (whisper) to transcribe mic-Input (or files) into any textfield
Updated -
Self-hosted Telegram bot that transcribes voice/audio/video using Whisper (Bun + Python).
Updated -
-
Audio transcription service built with FastAPI and OpenAI Whisper.
Updated -
-
-
(WIP) (shitty) interface for openai's whisper, including task system, use of yt-dl(p), and getting transcription data while processing
Updated -
-
Testing of the main ASR frameworks with reduced models for low-resource languages speech recognition
Updated