Open Source Speech Recognition Software - Page 2


Speech Recognition Software

1

Polaris programing with voice in Eclipse

Polaris, programming with voice in Eclipse IDE

Polaris lets you incorporate speech into programming. This Eclipse IDE plugin shows not only that an environment for programming with voice is possible, but that programming with voice is part of the natural evolution of programming tools.

VOICE COMMANDS: eclipse task, eclipse search, eclipse skip, eclipse format, eclipse new, eclipse save, eclipse rename, eclipse cut, eclipse copy, eclipse paste, eclipse all, eclipse delete, eclipse close, eclipse get, eclipse hash, eclipse string.

Work continues on a daily basis to increase the range of functionality that can be controlled with voice.

PREREQUISITES: Windows OS and Eclipse IDE. Headphones with a microphone are not mandatory, but they will improve speech recognition. The port number set in the Polaris Preference page must not be used by any other application.

Downloads: 7 This Week

Last Update: 2019-05-12
See Project
2

QPED(Quran Pronunciation Error Detector)

A voice recognition application that checks Quran (Islamic holy book) pronunciation and reports a detection success percentage for each word of the Quran passages. When complete, it can be considered a base for Arabic language recognition.

1 Review

Downloads: 3 This Week

Last Update: 2014-04-22
See Project
3

JuliusModels

Open source speech models for Julius in English and other languages.

Open source speech models for the Julius speech decoder. The aim is to give a wider community of speech recognition enthusiasts access to quality models that they can use in their own projects on different OS platforms (Unix, Windows, etc.). All of the models are based on the HTK modelling software and on data sets freely available on the Internet.

Downloads: 4 This Week

Last Update: 2018-05-11
See Project
4

NASH OS

Nash Operating System for Modern Ecommerce

The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use, state-of-the-art, and really awesome NASH OS! 25,000+ flexible features and controls, all scalable! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.

Downloads: 4 This Week

Last Update: 2019-03-24
See Project
5

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system, aka voice assistant, very similar to Siri, Google Now and Cortana. ILA is fully customizable, and you can teach her/him/it new things yourself, like executing system commands, opening web pages, programs and apps, or just some basic conversation :-) ILA runs on Java and is thus compatible with Windows, Mac and Linux. It is designed to integrate with your home environment so that you can, for example, build your own free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognition CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3 users can also write their own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian

4 Reviews

Downloads: 1 This Week

Last Update: 2018-07-23
See Project
6

Bavieca (www.bavieca.org)

Bavieca is an open-source speech recognition toolkit.

Bavieca (www.bavieca.org) is an open-source speech recognition toolkit intended for speech research and as a platform for rapid development of speech-enabled solutions by non speech experts. It comprises the most common acoustic modeling and adaptation techniques including discriminative training, and efficient dynamic and FSM-based decoders that can operate in batch and live recognition modes. Bavieca is entirely written in C++ and distributed under the Apache 2.0 license. Bavieca was developed at Boulder Language Technologies (BLT) during the last three years in response to the needs of the research projects conducted within the company. Research at BLT includes the development of conversational dialog systems and assessment tools that are deployed in formal educational settings and other real-life scenarios.

2 Reviews

Downloads: 1 This Week

Last Update: 2013-07-17
See Project
7

C# Speech Recognition Tutorial

C# Speech Recognition Tutorial

This is an easy (as can be) tutorial showing how speech recognition is done in C#. Press the button on the form and say your speech within 5 seconds. In this example, Q and B act as commands: the code filters the recognised words, looking for the letters Q and B. The file contains the source code; use it to build the simple form, with the named elements shown in the image, in a new WinForms program. The PDF file in the zip file explains how to link the voice recognition to a database.

Downloads: 2 This Week

Last Update: 2017-08-11
See Project
8

Pronounce

Pronounce is an app for Android which uses speech recognition to let you dictate speeches. It then uses a checking algorithm to see how well you did and update your score. You play as several politicians and activists over various eras.

Downloads: 2 This Week

Last Update: 2013-04-23
See Project
9

Speechalyzer

Process large speech data wrt transcription, labeling and annotation

Speechalyzer: a tool for the daily work of a 'speech worker'. It is optimized to process large speech data sets with respect to transcription, labeling and annotation. It is implemented as a client-server framework in Java and interfaces with software for speech recognition, synthesis, speech classification and quality evaluation. Its main applications are processing training data for speech recognition and classification models, and running benchmark tests on speech-to-text, text-to-speech and speech classification software systems.

Downloads: 2 This Week

Last Update: 2016-04-27
See Project
10

Voice XML Enabling Software

Voice XML Enabling Software (VXES) is an application that connects a VoiceXML Interpreter, a telephony platform, and MRCP servers that provide services for Automatic Speech Recognition and Text to Speech Synthesis. C++, Windows & Linux OS supported

Downloads: 2 This Week

Last Update: 2016-02-21
See Project
11

STRUDLE

Tool to help diagnose dyslexia, based on speech recognition performed with HTK.

1 Review

Downloads: 1 This Week

Last Update: 2013-04-09
See Project
12

Comandi Vocali Offline per Windows

Offline voice command system for Windows, fast and private.

Comandi Vocali Offline per Windows is a fully local voice control system that runs entirely on your PC. It lets you control your computer with voice commands without an internet connection, without cloud services, and without sending any data outside. The system is designed for maximum privacy, speed and simplicity.

Main features:
- Works completely offline (no server, no cloud)
- Fast speech recognition with local models
- Control of browsers, programs and the system
- Screen reading via OCR and speech synthesis
- Simple installation with no registry changes
- Portable and removable (just delete the folder)

Developed in QB64 with integration of local tools.

Downloads: 1 This Week

Last Update: 2026-04-07
See Project
13

Hemera - Intelligent System

Hemera is a Virtual Intelligent System aggregating some more advanced Artificial Intelligence Technologies (speech, speech recognition, form recognition, motion recognition ...); with applications in daily tasks, domotics and robotics ...

Downloads: 1 This Week

Last Update: 2015-01-21
See Project
14

JAVT - Just Another Voice Transformer

Just Another Speech Recognition and Text to Speech software.

JAVT, or Just Another Voice Transformer (formerly called Just Another Video Transcriber), is speech recognition software that also supports text-to-speech and simple media conversion. JAVT lets you convert video files to WAV audio files using ffmpeg, and then transcribe the audio to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and have JAVT read it aloud through text-to-speech conversion.
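The video-to-WAV step mentioned in the JAVT description can be sketched as follows. This is a minimal illustration, not JAVT's actual code: the function names, the file names, and the choice of 16 kHz mono output are assumptions (the listing does not say which audio format JAVT produces). Only the standard ffmpeg options `-y`, `-i`, `-vn`, `-ac` and `-ar` are used.

```python
import shutil
import subprocess
from pathlib import Path


def build_ffmpeg_command(video: Path, wav: Path, sample_rate: int = 16000) -> list[str]:
    """Build an ffmpeg command line that extracts mono WAV audio from a video.

    The 16 kHz mono default is an assumption for illustration; adjust it to
    whatever the chosen recognizer expects.
    """
    return [
        "ffmpeg",
        "-y",                     # overwrite the output file if it exists
        "-i", str(video),         # input video file
        "-vn",                    # drop the video stream
        "-ac", "1",               # one audio channel (mono)
        "-ar", str(sample_rate),  # audio sample rate in Hz
        str(wav),                 # output file; .wav selects the WAV container
    ]


def extract_audio(video: Path, wav: Path) -> None:
    """Run ffmpeg to perform the conversion; requires ffmpeg on PATH."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg was not found on PATH")
    subprocess.run(build_ffmpeg_command(video, wav), check=True)


if __name__ == "__main__":
    # Hypothetical file names, shown only to print the resulting command.
    print(" ".join(build_ffmpeg_command(Path("talk.mp4"), Path("talk.wav"))))
```

The resulting WAV file would then be handed to the transcription engine; that step is left out here because it depends on which engine (Microsoft SAPI or CMU Sphinx) is used.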

Downloads: 1 This Week

Last Update: 2020-08-19
See Project
15

Mice MX OS speech to text Voice Control

Mice speech to text with MX Cinnamon OS ISO

Note about this image: it contains a system based on MX Linux, created to improve accessibility within the Linux environment. The distribution uses the Cinnamon desktop interface, configured to be operated through voice commands and spoken output. The user interface and the control of your own devices and home automation systems can be customized and extended. The voice control program MiceStTM.py was developed to allow easy adaptation to other languages; however, only German settings are currently implemented. Example command entry:

category: System commands
comment: Screen grid
trigger: Display screen (Ras.*|Grid)*
terminal_command: /opt/micesttm/read-aloud/screen_grid.py & sleep 1 && xdotool search --name "screen grid" windowactivate
intern_command:
tts: Screen grid for the mouse click was selected.
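The command entry quoted in the description pairs a `trigger` pattern with a spoken `tts` reply. The sketch below shows one way such an entry could be matched against a recognized phrase. It is not taken from MiceStTM.py: treating the trigger as a case-insensitive regular expression that must match the whole phrase is an assumption, and `find_command` is a hypothetical helper name. The terminal command is deliberately left out so that nothing is executed.

```python
import re
from typing import Optional

# One entry, copied from the fields shown in the listing.
COMMANDS = [
    {
        "category": "System commands",
        "comment": "Screen grid",
        "trigger": r"Display screen (Ras.*|Grid)*",
        "tts": "Screen grid for the mouse click was selected.",
    },
]


def find_command(phrase: str, commands=COMMANDS) -> Optional[dict]:
    """Return the first entry whose trigger matches the whole phrase.

    Assumption: triggers are regular expressions, matched case-insensitively.
    """
    for entry in commands:
        if re.fullmatch(entry["trigger"], phrase.strip(), flags=re.IGNORECASE):
            return entry
    return None


if __name__ == "__main__":
    for spoken in ("display screen grid", "open browser"):
        entry = find_command(spoken)
        # A real assistant would run the terminal command and speak the reply.
        print(spoken, "->", entry["tts"] if entry else "no matching command")
```

Keeping the trigger as data rather than code fits the description's point about adapting to other languages: a translated entry would only need a different pattern and reply text.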

Downloads: 1 This Week

Last Update: 1 day ago
See Project
16

NFZ-core

Neural Network Multiplayer Simulation

Multiplayer Shared Problem solving Neural Network AI Simulation, Chat / Speech Recognition / Customizable Identities / GUI Chat in real time with users and their developed AI Real time Simulation. NNET FANN Blockchain integration

Downloads: 1 This Week

Last Update: 2019-05-18
See Project
17

Scalable Language API

Scalable Language API (SLAPI): the most comprehensive architecture for conversational natural-language applications, including speech recognition/synthesis, semantics, and machine translation. Used on Android and other mobile app platforms.

Downloads: 1 This Week

Last Update: 2018-01-22
See Project
18

Speech

Dictation / Speech Recognition

Dictation / Speech Recognition software that runs on any platform supported by Google Chrome.

Downloads: 1 This Week

Last Update: 2013-11-17
See Project
19

The Adam Speech Recognition Server

The Adam server is a voice-activated framework for controlling your desktop and performing general systems administration. It uses the Sphinx-4 speech recognition engine and the FreeTTS speech synthesis engine.

Downloads: 1 This Week

Last Update: 2013-03-22
See Project
20

WSR Application Macros

Collaborative development and distribution of Windows Speech Recognition (WSR) application macros to 1) improve the accessibility of personal computing for impaired users, and 2) improve the efficiency of personal computing for all users.

Downloads: 1 This Week

Last Update: 2013-04-11
See Project
21

perlbox

Perlbox Voice is a voice-enabled application that brings your desktop under your command. With a single word, you can start your web browser, your favorite editor, or whatever you want. This is the Linux and Unix voice recognition solution.

Downloads: 1 This Week

Last Update: 2013-04-08
See Project
22

Speech Recognition System

Speech Recognition System - Matlab source code

Speech recognition technology is used more and more for telephone applications like travel booking and information, financial account information, customer service call routing, and directory assistance. Using constrained grammar recognition, such applications can achieve remarkably high accuracy. Research and development in speech recognition technology has continued to grow as the cost for implementing such voice-activated systems has dropped and the usefulness and efficacy of these systems has improved. For example, recognition systems optimized for telephone applications can often supply information about the confidence of a particular recognition, and if the confidence is low, it can trigger the application to prompt callers to confirm or repeat their request. Index Terms: speech, recognition, verification, sound, isolated, words.
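The confidence-based prompting described above (confirm or repeat when confidence is low) can be sketched as a simple threshold rule. This is an illustration in Python rather than the project's Matlab code; the `Recognition` type, the `next_action` function, and both threshold values are assumptions chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class Recognition:
    """One recognition result: the decoded text and a confidence in [0, 1]."""
    text: str
    confidence: float


def next_action(result: Recognition,
                accept_at: float = 0.80,
                confirm_at: float = 0.50) -> str:
    """Decide how the application should react to a recognition result.

    Both thresholds are illustrative assumptions, not values from the project.
    """
    if result.confidence >= accept_at:
        return "accept"    # confident enough to act on the request directly
    if result.confidence >= confirm_at:
        return "confirm"   # ask the caller to confirm what was heard
    return "repeat"        # too uncertain: ask the caller to say it again


if __name__ == "__main__":
    for r in (Recognition("book a flight", 0.93),
              Recognition("account balance", 0.61),
              Recognition("directory assistance", 0.22)):
        print(r.text, "->", next_action(r))
```

In practice the thresholds would be tuned on recorded calls, trading unnecessary confirmation prompts against wrongly accepted requests.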

Downloads: 0 This Week

Last Update: 2015-03-18
See Project
23

"MedicalRecords"

MedicalRecords is an integrated medical information system.

“MedicalRecords” is an open source, client-server medical information system that is primarily intended to facilitate the storage, organization and retrieval of personal medical information that may be obtained from a variety of sources, including physician offices and medical centers. Data that are downloadable in machine-readable format can be transferred electronically to the database. Alternatively, the data can be transferred from USB flash drives, CD-ROMs or other removable storage media. Documents can be entered by scanning to PDF files or other formats. Finally, information may be entered through speech recognition or typing. “MedicalRecords” gives one or more patients access to an integrated medical record whose data may come from a variety of sources. It also provides an easy means of presenting the integrated data to a specialist or other new care provider, emergency room staff or admitting physicians.

Downloads: 0 This Week

Last Update: 2015-11-14
See Project
24

ABNF to GRXML Converter

This is an application that takes ABNF code as input and converts it to GRXML. Both formats follow the W3C standard for speech recognition grammars.

Downloads: 0 This Week

Last Update: 2014-06-28
See Project
25

AIBO Pal

A speech recognition application. It uses Microsoft Speech SDK to recognize and speak words. It can Play Music, Read the News, Tell the Time, Open Apps and many other cool things only with voice commands.

Downloads: 0 This Week

Last Update: 2015-05-22
See Project