Projects with this topic
-
Transcripty is a privacy-first, locally run transcription software using locally running AI-models for speech to text analysis. The software has light editing capability for splitting and merging text blocks and assigning speakers. The transcription result can be exported to a text doc for use in other applications.
Updated -
This project provides a client package and example scripts for python to access the alphaspeech pro ASR APIs.
Updated -
This project is a voice-controlled home automation system built with Raspberry Pi, Vosk speech recognition, and ESP32 integration. It enables hands-free control of lights, a servo (window), and air conditioning through natural language voice commands.
Updated -
This project provides a client package and example scripts for TypeScript to access the alphaspeech pro ASR stream API.
Updated -
Joystick comandada por voz.
Updated -
-
A project dedicated to building and optimizing speech recognition systems. It covers techniques like feature extraction, acoustic modeling, and language modeling, with hands-on implementation of speech-to-text systems and exploring popular frameworks like DeepSpeech or Kaldi.
Updated -
Projet de programmation en Java L2 logiciel Alize / boite à outils ALSA En groupe de 4 personnes
Updated -
C# library that provides an easy to use abstraction of the Vosk speech recognition toolkit
Updated -
This repository aims at archiving the code used during the performance ECPC 2022 that took place at the Paris Fine Art School with the collaboration of IRCAM.
Updated -
-
✭ MAGNETRON ™ ✭: This is a Google Colab/Jupyter Notebook for developing a HEARING PROXIA (B) when working with ARTIFICIAL INTELLIGENCE 2.0 ™ (ARTIFICIAL INTELLIGENCE 2.0™ is part of MAGNETRON ™ TECHNOLOGY).
Updated -
This repository provides resources for a Quick Start guide for connecting Amazon Connect with Xdroid platform to provide post-call analytics. Intended target audience are system administrators who manage and configure the AWS Amazon Connect instance, and also for system architects and support engineers.
Updated -
Бот голосовой помощник для преобразования аудиофайлов в текст.
Updated -
本 Python 應用程式,提供使用者透過麥克風的輸入語音,要求系統在網頁瀏覽器當中,開啟並顯示特定網站的首頁。
所以,本 Python 應用程式,具備以下的功能:
1. 辨識中文語音,成為文句。 2. 判斷文句所對應的網站名稱。 3. 系統嘗試開啟並顯示特定網站的首頁。Updated -
Kaldi recipe for Ouluvs2 dataset!!! https://klampropoulos.gitlab.io/oulu_kaldi_exps/
Updated -
This is home automation talking assistant (like Jarvis or like Alexa).It allows to:
STT recognize voice commands - using Open CMU Sphinx library TTS speak via Amazon Poly, Microsoft Azure, espeak, marry tts controls KODI, Calendar, Weather etc.Updated