State-of-the-art 2D and 3D Face Analysis Project
A Lightweight Face Recognition and Facial Attribute Analysis
The Iris Book: Addition, Subtraction, Multiplication, and Division
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
Face recognition with deep neural networks
Speech recognition module for Python
NLP Cloud serves high performance pre-trained or custom models for NER
Speech-to-text, text-to-speech, and speaker recognition
Multilingual speech recognition and audio understanding model
OCR software, free and offline
Contexts Optical Compression
Audio foundation model excelling in audio understanding
Handwritten Text Recognition (HTR) system implemented with TensorFlow
High-Performance Face Recognition Library on PaddlePaddle & PyTorch
Open-source industrial-grade ASR models
A PyTorch-based Speech Toolkit
A ranked list of awesome machine learning Python libraries
kaldi-asr/kaldi is the official location of the Kaldi project
Image polygonal annotation with Python
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Underthesea - Vietnamese NLP Toolkit
Multilingual Automatic Speech Recognition with word-level timestamps
Open-Source Python3 tool for recognizing layouts, tables, and math
A full spaCy pipeline and models for scientific/biomedical documents