Open Source Linux Text Processing Software - Page 5

Text Processing Software for Linux

View 9 business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Add Two Lines of Code. Get Full APM. Icon
    Add Two Lines of Code. Get Full APM.

    AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

    Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.
    Start Free
  • 1

    DWDS/Dialing Concordance

    a collection of indexing and search tools for corpus linguists

    DWDS/Dialing Concordance (DDC) - a collection of index and search tools for corpus linguists
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    RTextDoc

    RTextDoc

    An editor for structured documents

    RTextDoc is an editor for structured text documents such as LaTeX, AsciiDoc, DocBook. RTextDoc has proofreading capabilities: on-the-fly spelling, instant grammar checking and built-in free dictionaries. RTextDoc has syntax highlighting, bracket matching, folding, document structure browser for sections and labels, bookmarks, manager for LaTeX symbols, an editor for mathematical equations,integrated BibTeX database manager and several tools to convert LaTeX to HTML and back. AsciiDoc files can be converted to DocBook, HTML and PDF files.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    gleditor

    gleditor

    A small programmer's editor.

    A small programmer's editor whith syntax highlight, extended search features, code completion (ctrl+space). Supported languages: htlm, sql, pascal, c/c++, c#, java, basic, javascript, css, php, python.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. DocWire SDK is dedicated to streamlining data processing, reducing development time and costs, and harnessing the potential of AI. Its advancements promise a superior experience compared to its predecessor, DocToText.
    Downloads: 8 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    GNU FriBidi is the Free Implementation of the Unicode Bidirectional Algorithm. GNU FriBidi development has been moved to GitHub. See https://github.com/fribidi/fribidi/
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    Project Homepage is moved to http://cjkpython.i18n.org/
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    Textbender is a system of collaborative writing based on recombinant text. It implements a social Web medium with potential applications ranging from open art to open democracy.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    IMPORTANT NOTE: This project has moved to Github: https://github.com/pkozelka/libxml2-pas Pascal units accessing the popular XML API from Daniel Veillard ( http://www.xmlsoft.org ). This should be usable at least from Kylix and Delphi, but hopefully also from other Pascal compilers (like freepascal).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    An alternative to the string library for C and C++ which is more functional and does not have buffer overflow problems.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 10
    glark offers grep-like searching of text files, with very powerful, complex regular expressions (e.g., "/foo\w+/ and /bar[^\d]*baz$/ within 4 lines of each other"), as well as highlighting of matches.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 11
    Mamba is a extensible xml templates preprocessor wrote in Python. Using it, you can rapidly develop powerful applications ready to integrate with the internet. It can be used to work as a generic CGI program or for generate content.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13
    DocFrac is a document converter that can convert between RTF, HTML and ASCII text. This includes RTF to HTML and HTML to RTF. Supports text formatting (e.g. bold); tables; and most European languages. Available for Windows; Linux; ActiveX and DLL.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 14
    GNOME Clipboard Manager is an environment made for managing your clipboards. It will automatically fetch new clipboards, and will push all the formats of that clipboard to a shelf
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    This projects aims to create a NFO generator which will be able to create different kinds of nfo files with different artwork for the different needs of its users.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 16
    PyRtfLib is a python library that provides a parser and few translators like rtf to html and to simple text.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    Script Echo Color

    Script Echo Color

    Terminal text colorizing and simplifies script coding.

    ScriptEchoColor simplifies Linux terminal text colorizing, formatting and several steps of script coding.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18
    This project is now part of Gnome Subtitles (http://gnomesubtitles.org). SubLib was a library that eases the development of subtitling applications. It supports the most common text-based subtitle formats and allows for subtitle editing, conversion and synchronization.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    XPontus , a free java based, Open Source XML Editor
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    GCrypter

    GCrypter

    an easy way to encode and decode your thoughts

    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    RTF2HTML is a name for a cross-platform C++ library (DLL, OCX) and command-line utility, which is intended to convert documents from Rich Text Format (e.g. Word, OO Writer) to HTML. Its features are tiny size, speed, low mem usage and compact output.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    Word segmentation utility for Thai language written in C
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    ansifilter

    ansifilter

    ANSI sequence filter

    Ansifilter handles text files containing ANSI terminal escape codes. The command sequences may be stripped or be interpreted to generate formatted output (HTML, RTF, TeX, LaTeX, BBCode and Pango Markup).
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    bitext2tmx CAT bitext aligner/converter
    A free computer-aided translation / computer-assisted translation (CAT) tool to align and converter bitext into TMX translation memory format to be used in other CAT tools by translators and other language professionals.
    Downloads: 3 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB