State-of-the-art 2D and 3D Face Analysis Project
The Iris Book: Addition, Subtraction, Multiplication, and Division
A Lightweight Face Recognition and Facial Attribute Analysis
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
Offline speech recognition API for Android, iOS, Raspberry Pi
Face recognition with deep neural networks
NLP Cloud serves high performance pre-trained or custom models for NER
Speech recognition module for Python
Speech-to-text, text-to-speech, and speaker recognition
Multilingual speech recognition and audio understanding model
OCR software, free and offline
Contexts Optical Compression
Audio foundation model excelling in audio understanding
Handwritten Text Recognition (HTR) system implemented with TensorFlow
High-Performance Face Recognition Library on PaddlePaddle & PyTorch
Open-source industrial-grade ASR models
A ranked list of awesome machine learning Python libraries
A PyTorch-based Speech Toolkit
kaldi-asr/kaldi is the official location of the Kaldi project
Image polygonal annotation with Python
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Multilingual Automatic Speech Recognition with word-level timestamps
Underthesea - Vietnamese NLP Toolkit